IDEAS home Printed from https://ideas.repec.org/a/eee/jmvana/v167y2018icp319-330.html
   My bibliography  Save this article

A model selection approach for multiple sequence segmentation and dimensionality reduction

Author

Listed:
  • Castro, Bruno M.
  • Lemes, Renan B.
  • Cesar, Jonatas
  • Hünemeier, Tábita
  • Leonardi, Florencia

Abstract

In this paper we consider the problem of segmenting n aligned random sequences of equal length m into a finite number of independent blocks. We propose a penalized maximum likelihood criterion to infer simultaneously the number of points of independence as well as the position of each point. We show how to compute exactly the estimator by means of a dynamic programming algorithm with time complexity O(m2n). We also propose another method, called hierarchical algorithm, that provides an approximation to the estimator when the sample size increases and runs in time O{mln(m)n}. Our main theoretical results are the strong consistency of both estimators when the sample size n grows to infinity. We illustrate the convergence of these algorithms through some simulation examples and we apply the method to identify recombination hotspots in real SNPs data.

Suggested Citation

  • Castro, Bruno M. & Lemes, Renan B. & Cesar, Jonatas & Hünemeier, Tábita & Leonardi, Florencia, 2018. "A model selection approach for multiple sequence segmentation and dimensionality reduction," Journal of Multivariate Analysis, Elsevier, vol. 167(C), pages 319-330.
  • Handle: RePEc:eee:jmvana:v:167:y:2018:i:c:p:319-330
    DOI: 10.1016/j.jmva.2018.05.006
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0047259X18302331
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.jmva.2018.05.006?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Richard J. Boys & Daniel A. Henderson, 2004. "A Bayesian Approach to DNA Sequence Segmentation," Biometrics, The International Biometric Society, vol. 60(3), pages 573-581, September.
    2. Clive J Hoggart & John C Whittaker & Maria De Iorio & David J Balding, 2008. "Simultaneous Analysis of All SNPs in Genome-Wide and Re-Sequencing Association Studies," PLOS Genetics, Public Library of Science, vol. 4(7), pages 1-8, July.
    3. Douglas M. Hawkins, 1976. "Point Estimation of the Parameters of Piecewise Regression Models," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 25(1), pages 51-57, March.
    4. Fridlyand, Jane & Snijders, Antoine M. & Pinkel, Dan & Albertson, Donna G. & Jain, A.N.Ajay N., 2004. "Hidden Markov models approach to the analysis of array CGH data," Journal of Multivariate Analysis, Elsevier, vol. 90(1), pages 132-153, July.
    5. Jushan Bai & Pierre Perron, 2003. "Computation and analysis of multiple structural change models," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 18(1), pages 1-22.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Florencia Leonardi & Matías Lopez‐Rosenfeld & Daniela Rodriguez & Magno T. F. Severino & Mariela Sued, 2021. "Independent block identification in multivariate time series," Journal of Time Series Analysis, Wiley Blackwell, vol. 42(1), pages 19-33, January.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Salvatore Fasola & Vito M. R. Muggeo & Helmut Küchenhoff, 2018. "A heuristic, iterative algorithm for change-point detection in abrupt change models," Computational Statistics, Springer, vol. 33(2), pages 997-1015, June.
    2. Alessandro Casini & Pierre Perron, 2018. "Structural Breaks in Time Series," Boston University - Department of Economics - Working Papers Series WP2019-02, Boston University - Department of Economics.
    3. Casini, Alessandro & Perron, Pierre, 2021. "Continuous record Laplace-based inference about the break date in structural change models," Journal of Econometrics, Elsevier, vol. 224(1), pages 3-21.
    4. Lee, Chien-Chiang & Chen, Mei-Ping & Chang, Chi-Hung, 2013. "Dynamic relationships between industry returns and stock market returns," The North American Journal of Economics and Finance, Elsevier, vol. 26(C), pages 119-144.
    5. Moonsoo Park & Yanhong Jin & Alan Love, 2011. "Dynamic and contemporaneous causality in a supply chain: an application of the US beef industry," Applied Economics, Taylor & Francis Journals, vol. 43(30), pages 4785-4801.
    6. Pierre Perron & Yohei Yamamoto, 2015. "Using OLS to Estimate and Test for Structural Changes in Models with Endogenous Regressors," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 30(1), pages 119-144, January.
    7. Pierre Perron & Yohei Yamamoto, 2008. "Estimating and Testing Multiple Structural Changes in Models with Endogenous Regressors," Boston University - Department of Economics - Working Papers Series wp2008-017, Boston University - Department of Economics.
    8. Bilal Mehmood & Syed Hassan Raza & Mahwish Rana & Huma Sohaib & Muhammad Azhar Khan, 2014. "Triangular Relationship between Energy Consumption, Price Index and National Income in Asian Countries: A Pooled Mean Group Approach in Presence of Structural Breaks," International Journal of Energy Economics and Policy, Econjournals, vol. 4(4), pages 610-620.
    9. Matteo Mogliani, 2010. "Residual-based tests for cointegration and multiple deterministic structural breaks: A Monte Carlo study," Working Papers halshs-00564897, HAL.
    10. Hilary S. Booth & Conrad J. Burden & John H. Maindonald & Lucia Santoso & Matthew J. Wakefield & Susan R. Wilson, 2005. "Discussion of “A Bayesian Approach to DNA Sequence Segmentation”," Biometrics, The International Biometric Society, vol. 61(2), pages 635-637, June.
    11. Bernard, Jean-Thomas & Idoudi, Nadhem & Khalaf, Lynda & Yelou, Clement, 2007. "Finite sample multivariate structural change tests with application to energy demand models," Journal of Econometrics, Elsevier, vol. 141(2), pages 1219-1244, December.
    12. Nuruddeen Usman & Kodili Nwanneka & Nduka, 2023. "Announcement Effect of COVID-19 on Cryptocurrencies," Asian Economics Letters, Asia-Pacific Applied Economics Association, vol. 3(3), pages 1-4.
    13. Mina Kim & Deokwoo Nam & Jian Wang & Jason J. Wu, 2013. "International trade price stickiness and exchange rate pass-through in micro data: a case study on U.S.–China trade," Globalization Institute Working Papers 135, Federal Reserve Bank of Dallas.
    14. Marfatia, Hardik A., 2015. "Monetary policy's time-varying impact on the US bond markets: Role of financial stress and risks," The North American Journal of Economics and Finance, Elsevier, vol. 34(C), pages 103-123.
    15. Vicente Esteve & Manuel Navarro-Ibáñez & María A. Prats, 2013. "The present value model of US stock prices revisited: long-run evidence with structural breaks, 1871-2010," Working Papers 04/13, Instituto Universitario de Análisis Económico y Social.
    16. Kumar, Nikeel Nishkar & Patel, Arvind, 2023. "Nonlinear effect of air travel tourism demand on economic growth in Fiji," Journal of Air Transport Management, Elsevier, vol. 109(C).
    17. Murach, Michael & Wagner, Helmut & Kim, Jungsuk & Park, Donghyun, 2022. "Trajectories to high income: Comparing the growth dynamics in China, South Korea, and Japan with cointegrated VAR models," Structural Change and Economic Dynamics, Elsevier, vol. 62(C), pages 492-511.
    18. Karakotsios, Achillefs & Katrakilidis, Constantinos & Kroupis, Nikolaos, 2021. "The dynamic linkages between food prices and oil prices. Does asymmetry matter?," The Journal of Economic Asymmetries, Elsevier, vol. 23(C).
    19. Salisu, Afees A. & Adekunle, Wasiu & Alimi, Wasiu A. & Emmanuel, Zachariah, 2019. "Predicting exchange rate with commodity prices: New evidence from Westerlund and Narayan (2015) estimator with structural breaks and asymmetries," Resources Policy, Elsevier, vol. 62(C), pages 33-56.
    20. Pierre Perron & Yohei Yamamoto, 2022. "Structural change tests under heteroskedasticity: Joint estimation versus two‐steps methods," Journal of Time Series Analysis, Wiley Blackwell, vol. 43(3), pages 389-411, May.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:jmvana:v:167:y:2018:i:c:p:319-330. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/wps/find/journaldescription.cws_home/622892/description#description .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.