IDEAS home Printed from https://ideas.repec.org/a/eee/phsmap/v388y2009i18p3969-3984.html
   My bibliography  Save this article

The minimum description length principle for probability density estimation by regular histograms

Author

Listed:
  • Chapeau-Blondeau, François
  • Rousseau, David

Abstract

The minimum description length principle is a general methodology for statistical modeling and inference that selects the best explanation for observed data as the one allowing the shortest description of them. Application of this principle to the important task of probability density estimation by histograms was previously proposed. We review this approach and provide additional illustrative examples and an application to real-world data, with a presentation emphasizing intuition and concrete arguments. We also consider alternative ways of measuring the description lengths, that can be found to be more suited in this context. We explicitly exhibit, analyze and compare, the complete forms of the description lengths with formulas involving the information entropy and redundancy of the data, and not given elsewhere. Histogram estimation as performed here naturally extends to multidimensional data, and offers for them flexible and optimal subquantization schemes. The framework can be very useful for modeling and reduction of complexity of observed data, based on a general principle from statistical information theory, and placed within a unifying informational perspective.

Suggested Citation

  • Chapeau-Blondeau, François & Rousseau, David, 2009. "The minimum description length principle for probability density estimation by regular histograms," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 388(18), pages 3969-3984.
  • Handle: RePEc:eee:phsmap:v:388:y:2009:i:18:p:3969-3984
    DOI: 10.1016/j.physa.2009.05.044
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0378437109004294
    Download Restriction: Full text for ScienceDirect subscribers only. Journal offers the option of making the article available online on Science direct for a fee of $3,000

    File URL: https://libkey.io/10.1016/j.physa.2009.05.044?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Lee, Jeong Won & Park, Joongwoo Brian & Jo, Hang-Hyun & Yang, Jae-Suk & Moon, Hie-Tae, 2009. "Minimum entropy density method for the time series analysis," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 388(2), pages 137-144.
    2. Diambra, L, 2000. "Maximum entropy approach to nonlinear modeling," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 278(1), pages 140-149.
    3. Martinez, S. & Plastino, A. & Soffer, B.H., 2005. "Information and thermodynamics’ first law," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 356(1), pages 167-171.
    4. Carbone, Anna & Stanley, H. Eugene, 2007. "Scaling properties and entropy of long-range correlated time series," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 384(1), pages 21-24.
    5. Lee, Thomas C. M., 2000. "Regression spline smoothing using the minimum description length principle," Statistics & Probability Letters, Elsevier, vol. 48(1), pages 71-82, May.
    6. Meir, Ronny & Fontanari, Jose F., 1993. "Data compression and prediction in neural networks," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 200(1), pages 644-654.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Kalimeri, M. & Papadimitriou, C. & Balasis, G. & Eftaxias, K., 2008. "Dynamical complexity detection in pre-seismic emissions using nonadditive Tsallis entropy," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 387(5), pages 1161-1172.
    2. Dumansky, Y.V. & Lyakh, Y.E. & Gorshkov, O.G. & Gurianov, V.G. & Prihodchenko, V.V., 2012. "Fractal dimensionality analysis of normal and cancerous mammary gland thermograms," Chaos, Solitons & Fractals, Elsevier, vol. 45(12), pages 1494-1500.
    3. Ponta, Linda & Murialdo, Pietro & Carbone, Anna, 2021. "Information measure for long-range correlated time series: Quantifying horizon dependence in financial markets," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 570(C).
    4. Gajda, Janusz & Bartnicki, Grzegorz & Burnecki, Krzysztof, 2018. "Modeling of water usage by means of ARFIMA–GARCH processes," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 512(C), pages 644-657.
    5. Juan Benjamín Duarte Duarte & Juan Manuel Mascareñas Pérez-Iñigo, 2014. "¿Han sido los mercados bursátiles eficientes informacionalmente?," Apuntes del Cenes, Universidad Pedagógica y Tecnológica de Colombia, June.
    6. Kagerer, Kathrin, 2013. "A short introduction to splines in least squares regression analysis," University of Regensburg Working Papers in Business, Economics and Management Information Systems 472, University of Regensburg, Department of Economics.
    7. Stanley, H. Eugene & Plerou, Vasiliki & Gabaix, Xavier, 2008. "A statistical physics view of financial fluctuations: Evidence for scaling and universality," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 387(15), pages 3967-3981.
    8. Juan Benjamín Duarte Duarte & Juan Manuel Mascare?nas Pérez-Iñigo, 2014. "Comprobación de la eficiencia débil en los principales mercados financieros latinoamericanos," Estudios Gerenciales, Universidad Icesi, November.
    9. Lavička, H. & Lin, L. & Novotný, J., 2010. "Employment, Production and Consumption model: Patterns of phase transitions," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 389(8), pages 1708-1720.
    10. Wesfreid, Eva & Billat, Véronique, 2012. "Randomness and changes of heart rate and respiratory frequency during high altitude mountain ascent without acclimatization," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 391(4), pages 1575-1590.
    11. V Dimitrova & M Fernández-Martínez & M A Sánchez-Granero & J E Trinidad Segovia, 2019. "Some comments on Bitcoin market (in)efficiency," PLOS ONE, Public Library of Science, vol. 14(7), pages 1-14, July.
    12. Pietro Murialdo & Linda Ponta & Anna Carbone, 2020. "Long-Range Dependence in Financial Markets: a Moving Average Cluster Entropy Approach," Papers 2004.14736, arXiv.org.
    13. Farzadian, O. & Niry, M.D., 2016. "Delocalization of mechanical waves in the ladder chain of DNA with correlated disorder," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 450(C), pages 95-103.
    14. Ponta, Linda & Carbone, Anna, 2018. "Information measure for financial time series: Quantifying short-term market heterogeneity," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 510(C), pages 132-144.
    15. Potirakis, S.M. & Minadakis, G. & Eftaxias, K., 2012. "Analysis of electromagnetic pre-seismic emissions using Fisher information and Tsallis entropy," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 391(1), pages 300-306.
    16. Zunino, Luciano & Tabak, Benjamin M. & Serinaldi, Francesco & Zanin, Massimiliano & Pérez, Darío G. & Rosso, Osvaldo A., 2011. "Commodity predictability analysis with a permutation information theory approach," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 390(5), pages 876-890.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:phsmap:v:388:y:2009:i:18:p:3969-3984. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.journals.elsevier.com/physica-a-statistical-mechpplications/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.