IDEAS home Printed from https://ideas.repec.org/a/eee/csdana/v54y2010i12p3313-3323.html
   My bibliography  Save this article

Combining regular and irregular histograms by penalized likelihood

Author

Listed:
  • Rozenholc, Yves
  • Mildenberger, Thoralf
  • Gather, Ursula

Abstract

A new fully automatic procedure for the construction of histograms is proposed. It consists of constructing both a regular and an irregular histogram and then choosing between the two. To choose the number of bins in the irregular histogram, two different penalties motivated by recent work in model selection are proposed. A description of the algorithm and a proper tuning of the penalties is given. Finally, different versions of the procedure are compared to other existing proposals for a wide range of densities and sample sizes. In the simulations, the squared Hellinger risk of the new procedure is always at most twice as large as the risk of the best of the other methods. The procedure is implemented in the R-Package histogram available from CRAN.1

Suggested Citation

  • Rozenholc, Yves & Mildenberger, Thoralf & Gather, Ursula, 2010. "Combining regular and irregular histograms by penalized likelihood," Computational Statistics & Data Analysis, Elsevier, vol. 54(12), pages 3313-3323, December.
  • Handle: RePEc:eee:csdana:v:54:y:2010:i:12:p:3313-3323
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0167-9473(10)00166-0
    Download Restriction: Full text for ScienceDirect subscribers only.
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. F. Comte & Y. Rozenholc, 2004. "A new algorithm for fixed design regression and denoising," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 56(3), pages 449-473, September.
    2. Chen, X. R. & Zhao, L. C., 1987. "Almost sure L1-norm convergence for data-based histogram density estimates," Journal of Multivariate Analysis, Elsevier, vol. 21(1), pages 179-188, February.
    3. Joachim Engel, 1997. "The multiresolution histogram," Metrika: International Journal for Theoretical and Applied Statistics, Springer, vol. 46(1), pages 41-57, January.
    4. Celisse, Alain & Robin, Stephane, 2008. "Nonparametric density estimation by exact leave-p-out cross-validation," Computational Statistics & Data Analysis, Elsevier, vol. 52(5), pages 2350-2368, January.
    5. Luc Devroye & Gábor Lugosi, 2004. "Bin width selection in multivariate histograms by the combinatorial method," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 13(1), pages 129-145, June.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Zelaya Mendizábal, Valentina & Boullé, Marc & Rossi, Fabrice, 2023. "Fast and fully-automated histograms for large-scale data sets," Computational Statistics & Data Analysis, Elsevier, vol. 180(C).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Rozenholc, Yves & Mildenberger, Thoralf & Gather, Ursula, 2009. "Constructing irregular histograms by penalized likelihood," Technical Reports 2009,04, Technische Universität Dortmund, Sonderforschungsbereich 475: Komplexitätsreduktion in multivariaten Datenstrukturen.
    2. Geffray, S. & Klutchnikoff, N. & Vimond, M., 2016. "Illumination problems in digital images. A statistical point of view," Journal of Multivariate Analysis, Elsevier, vol. 150(C), pages 191-213.
    3. Schmisser, Émeline, 2019. "Non parametric estimation of the diffusion coefficients of a diffusion with jumps," Stochastic Processes and their Applications, Elsevier, vol. 129(12), pages 5364-5405.
    4. Schmisser Emeline, 2011. "Non-parametric drift estimation for diffusions from noisy data," Statistics & Risk Modeling, De Gruyter, vol. 28(2), pages 119-150, May.
    5. Lacour, Claire, 2008. "Nonparametric estimation of the stationary density and the transition density of a Markov chain," Stochastic Processes and their Applications, Elsevier, vol. 118(2), pages 232-260, February.
    6. Emeline Schmisser, 2012. "Non-parametric estimation of the diffusion coefficient from noisy data," Statistical Inference for Stochastic Processes, Springer, vol. 15(3), pages 193-223, October.
    7. Comte, F. & Genon-Catalot, V. & Rozenholc, Y., 2009. "Nonparametric adaptive estimation for integrated diffusions," Stochastic Processes and their Applications, Elsevier, vol. 119(3), pages 811-834, March.
    8. Van Hanh Nguyen & Catherine Matias, 2014. "On Efficient Estimators of the Proportion of True Null Hypotheses in a Multiple Testing Setup," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 41(4), pages 1167-1194, December.
    9. Zelaya Mendizábal, Valentina & Boullé, Marc & Rossi, Fabrice, 2023. "Fast and fully-automated histograms for large-scale data sets," Computational Statistics & Data Analysis, Elsevier, vol. 180(C).
    10. Davies, P. Laurie & Gather, Ursula & Nordman, Daniel & Weinert, Henrike, 2007. "Constructing a regular histogram : a comparison of methods," Technical Reports 2007,14, Technische Universität Dortmund, Sonderforschungsbereich 475: Komplexitätsreduktion in multivariaten Datenstrukturen.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:csdana:v:54:y:2010:i:12:p:3313-3323. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/csda .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.