IDEAS home Printed from https://ideas.repec.org/a/eee/phsmap/v221y1995i1p180-192.html
   My bibliography  Save this article

Statistical properties of DNA sequences

Author

Listed:
  • Peng, C.-K.
  • Buldyrev, S.V.
  • Goldberger, A.L.
  • Havlin, S.
  • Mantegna, R.N.
  • Simons, M.
  • Stanley, H.E.

Abstract

We review evidence supporting the idea that the DNA sequence in genese containing non-coding regions is correlated, and that the correlation is remarkably long range — indeed, nucleotides thousands of base pairs distant are correlated. We do not find such a long-range correlation in the coding regions of the gene. We resolve the problem of the “non-stationarity” feature of the sequence of base pairs by applying a new algorithm called detrended fluctuation analysis (DFA). We address the claim of Voss that there is no difference in the statistical properties of coding and non-coding regions of DNA by systematically applying the DFA algorithm, as well as standard FFT analysis, to every DNA sequence (33 301 coding and 29 453 non-coding) in the entire GenBank database. Finally, we describe briefly some recent work showing that the non-coding sequences have certain statistical features in common with natural and artificial languages. Specifically, we adapt to DNA the Zipf approach to analyzing linguistic texts. These statistical properties of non-coding sequences support the possibility that non-coding regions of DNA may carry biological information.

Suggested Citation

  • Peng, C.-K. & Buldyrev, S.V. & Goldberger, A.L. & Havlin, S. & Mantegna, R.N. & Simons, M. & Stanley, H.E., 1995. "Statistical properties of DNA sequences," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 221(1), pages 180-192.
  • Handle: RePEc:eee:phsmap:v:221:y:1995:i:1:p:180-192
    DOI: 10.1016/0378-4371(95)00247-5
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/0378437195002475
    Download Restriction: Full text for ScienceDirect subscribers only. Journal offers the option of making the article available online on Science direct for a fee of $3,000

    File URL: https://libkey.io/10.1016/0378-4371(95)00247-5?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Stanley, H.E. & Buldyrev, S.V. & Goldberger, A.L. & Goldberger, Z.D. & Havlin, S. & Mantegna, R.N. & Ossadnik, S.M. & Peng, C.-K. & Simons, M., 1994. "Statistical mechanics in biology: how ubiquitous are long-range correlations?," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 205(1), pages 214-253.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Xiong, Gang & Zhang, Shuning & Yang, Xiaoniu, 2012. "The fractal energy measurement and the singularity energy spectrum analysis," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 391(24), pages 6347-6361.
    2. Efthimios S. Skordas & Stavros-Richard G. Christopoulos & Nicholas V. Sarlis, 2020. "Detrended fluctuation analysis of seismicity and order parameter fluctuations before the M7.1 Ridgecrest earthquake," Natural Hazards: Journal of the International Society for the Prevention and Mitigation of Natural Hazards, Springer;International Society for the Prevention and Mitigation of Natural Hazards, vol. 100(2), pages 697-711, January.
    3. Şahin, Gökhan & Erentürk, Murat & Hacinliyan, Avadis, 2009. "Detrended fluctuation analysis in natural languages using non-corpus parametrization," Chaos, Solitons & Fractals, Elsevier, vol. 41(1), pages 198-205.
    4. Xiong, Gang & Zhang, Shuning & Liu, Qiang, 2012. "The time-singularity multifractal spectrum distribution," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 391(20), pages 4727-4739.
    5. Maria Pia Beccar Varela & Francis Biney & Ionut Florescu, 2015. "Long correlations and fractional difference analysis applied to the study of memory effects in high-frequency (tick) data," Quantitative Finance, Taylor & Francis Journals, vol. 15(8), pages 1365-1374, August.
    6. Kosmidis, Kosmas & Hütt, Marc-Thorsten, 2023. "DNA visibility graphs," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 626(C).
    7. Asif, Raheel & Frömmel, Michael, 2022. "Testing Long memory in exchange rates and its implications for the adaptive market hypothesis," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 593(C).
    8. Xu, Na & Shang, Pengjian & Kamae, Santi, 2009. "Minimizing the effect of exponential trends in detrended fluctuation analysis," Chaos, Solitons & Fractals, Elsevier, vol. 41(1), pages 311-316.
    9. Mariani, M.C. & Florescu, I. & Beccar Varela, M.P. & Ncheuguim, E., 2010. "Study of memory effects in international market indices," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 389(8), pages 1653-1664.
    10. Mariani, M.C. & Libbin, J.D. & Kumar Mani, V. & Beccar Varela, M.P. & Erickson, C.A. & Valles-Rosales, D.J., 2008. "Long correlations and Normalized Truncated Levy Models applied to the study of Indian Market Indices in comparison with other emerging markets," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 387(5), pages 1273-1282.
    11. Serrano, E. & Figliola, A., 2009. "Wavelet Leaders: A new method to estimate the multifractal singularity spectra," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 388(14), pages 2793-2805.
    12. Sidorov, S.P. & Faizliev, A.R. & Balash, V.A. & Korobov, E.A., 2016. "Long-range correlation analysis of economic news flow intensity," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 444(C), pages 205-212.
    13. Machado Filho, A. & da Silva, M.F. & Zebende, G.F., 2014. "Autocorrelation and cross-correlation in time series of homicide and attempted homicide," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 400(C), pages 12-19.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Alvarez-Ramirez, Jose & Espinosa-Paredes, Gilberto & Vazquez, Alejandro, 2005. "Detrended fluctuation analysis of the neutronic power from a nuclear reactor," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 351(2), pages 227-240.
    2. Chiarucci, Riccardo & Ruzzenenti, Franco & Loffredo, Maria I., 2014. "Detecting spatial homogeneity in the World Trade Web with Detrended Fluctuation Analysis," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 401(C), pages 1-7.
    3. Pavlos, G.P. & Karakatsanis, L.P. & Iliopoulos, A.C. & Pavlos, E.G. & Xenakis, M.N. & Clark, Peter & Duke, Jamie & Monos, D.S., 2015. "Measuring complexity, nonextensivity and chaos in the DNA sequence of the Major Histocompatibility Complex," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 438(C), pages 188-209.
    4. Urbanowicz, Krzysztof & Kantz, Holger & Holyst, Janusz A., 2005. "Anti-deterministic behaviour of discrete systems that are less predictable than noise," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 350(2), pages 189-198.
    5. Cizeau, Pierre & Liu, Yanhui & Meyer, Martin & Peng, C.-K. & Eugene Stanley, H., 1997. "Volatility distribution in the S&P500 stock index," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 245(3), pages 441-445.
    6. Podobnik, Boris & Ivanov, Plamen Ch. & Grosse, Ivo & Matia, Kaushik & Eugene Stanley, H., 2004. "ARCH–GARCH approaches to modeling high-frequency financial data," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 344(1), pages 216-220.
    7. Silva, R. & Silva, J.R.P. & Anselmo, D.H.A.L. & Alcaniz, J.S. & da Silva, W.J.C. & Costa, M.O., 2020. "An alternative description of power law correlations in DNA sequences," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 545(C).
    8. Frank Emmert-Streib, 2013. "Structural Properties and Complexity of a New Network Class: Collatz Step Graphs," PLOS ONE, Public Library of Science, vol. 8(2), pages 1-14, February.
    9. Liu, Yanhui & Cizeau, Pierre & Meyer, Martin & Peng, C.-K. & Eugene Stanley, H., 1997. "Correlations in economic time series," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 245(3), pages 437-440.
    10. Bertrand M. Roehner, 2010. "Fifteen years of econophysics: worries, hopes and prospects," Papers 1004.3229, arXiv.org.
    11. Oikonomou, Thomas & Kaloudis, Konstantinos & Bagci, G. Baris, 2021. "The q-exponentials do not maximize the Rényi entropy," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 578(C).
    12. Karakatsanis, L.P. & Pavlos, G.P. & Iliopoulos, A.C. & Pavlos, E.G. & Clark, P.M. & Duke, J.L. & Monos, D.S., 2018. "Assessing information content and interactive relationships of subgenomic DNA sequences of the MHC using complexity theory approaches based on the non-extensive statistical mechanics," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 505(C), pages 77-93.
    13. Riccardo Chiarucci & Franco Ruzzenenti & Maria I. Loffredo, 2013. "Detecting spatial homogeneity in the world trade web with Detrended Fluctuation Analysis," Papers 1308.0526, arXiv.org, revised Nov 2013.
    14. Buldyrev, S.V. & Dokholyan, N.V. & Goldberger, A.L. & Havlin, S. & Peng, C.-K. & Stanley, H.E. & Viswanathan, G.M., 1998. "Analysis of DNA sequences using methods of statistical physics," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 249(1), pages 430-438.
    15. Koscielny-Bunde, Eva & Bunde, Armin & Havlin, Shlomo & Goldreich, Yair, 1996. "Analysis of daily temperature fluctuations," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 231(4), pages 393-396.
    16. Zebende, G.F. & Pereira, M.G. & Nogueira Jr., E. & Moret, M.A., 2005. "Universal persistence in astrophysical sources," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 349(3), pages 452-458.
    17. Vitanov, Nikolay K. & Yankulova, Elka D., 2006. "Multifractal analysis of the long-range correlations in the cardiac dynamics of Drosophila melanogaster," Chaos, Solitons & Fractals, Elsevier, vol. 28(3), pages 768-775.
    18. Ortiz-Tánchez, Eduardo & Ebeling, Werner & Lanius, Karl, 2002. "MEI, SOI and mid-range correlations in the onset of El Niño–Southern Oscillation," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 310(3), pages 509-520.
    19. Vandewalle, N. & Ausloos, M., 1997. "Coherent and random sequences in financial fluctuations," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 246(3), pages 454-459.
    20. Staudacher, M. & Telser, S. & Amann, A. & Hinterhuber, H. & Ritsch-Marte, M., 2005. "A new method for change-point detection developed for on-line analysis of the heart beat variability during sleep," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 349(3), pages 582-596.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:phsmap:v:221:y:1995:i:1:p:180-192. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.journals.elsevier.com/physica-a-statistical-mechpplications/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.