IDEAS home Printed from https://ideas.repec.org/a/bla/scjsta/v37y2010i2p321-337.html
   My bibliography  Save this article

Density Estimation by Total Variation Penalized Likelihood Driven by the Sparsity ℓ1 Information Criterion

Author

Listed:
  • SYLVAIN SARDY
  • PAUL TSENG

Abstract

. We propose a non‐linear density estimator, which is locally adaptive, like wavelet estimators, and positive everywhere, without a log‐ or root‐transform. This estimator is based on maximizing a non‐parametric log‐likelihood function regularized by a total variation penalty. The smoothness is driven by a single penalty parameter, and to avoid cross‐validation, we derive an information criterion based on the idea of universal penalty. The penalized log‐likelihood maximization is reformulated as an ℓ1‐penalized strictly convex programme whose unique solution is the density estimate. A Newton‐type method cannot be applied to calculate the estimate because the ℓ1‐penalty is non‐differentiable. Instead, we use a dual block coordinate relaxation method that exploits the problem structure. By comparing with kernel, spline and taut string estimators on a Monte Carlo simulation, and by investigating the sensitivity to ties on two real data sets, we observe that the new estimator achieves good L1 and L2 risk for densities with sharp features, and behaves well with ties.

Suggested Citation

  • Sylvain Sardy & Paul Tseng, 2010. "Density Estimation by Total Variation Penalized Likelihood Driven by the Sparsity ℓ1 Information Criterion," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 37(2), pages 321-337, June.
  • Handle: RePEc:bla:scjsta:v:37:y:2010:i:2:p:321-337
    DOI: 10.1111/j.1467-9469.2009.00672.x
    as

    Download full text from publisher

    File URL: https://doi.org/10.1111/j.1467-9469.2009.00672.x
    Download Restriction: no

    File URL: https://libkey.io/10.1111/j.1467-9469.2009.00672.x?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. P. Tseng, 2001. "Convergence of a Block Coordinate Descent Method for Nondifferentiable Minimization," Journal of Optimization Theory and Applications, Springer, vol. 109(3), pages 475-494, June.
    2. Sardy, Sylvain & Tseng, Paul, 2004. "On the Statistical Analysis of Smoothing by Maximizing Dirty Markov Random Field Posterior Distributions," Journal of the American Statistical Association, American Statistical Association, vol. 99, pages 191-204, January.
    3. Sylvain Sardy, 2009. "Adaptive Posterior Mode Estimation of a Sparse Sequence for Model Selection," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 36(4), pages 577-601, December.
    4. Pinheiro, Aluisio & Vidakovic, Brani, 1997. "Estimating the square root of a density via compactly supported wavelets," Computational Statistics & Data Analysis, Elsevier, vol. 25(4), pages 399-415, September.
    5. Kooperberg, Charles & Stone, Charles J., 1991. "A study of logspline density estimation," Computational Statistics & Data Analysis, Elsevier, vol. 12(3), pages 327-347, November.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Bak, Kwan-Young & Jhong, Jae-Hwan & Lee, JungJun & Shin, Jae-Kyung & Koo, Ja-Yong, 2021. "Penalized logspline density estimation using total variation penalty," Computational Statistics & Data Analysis, Elsevier, vol. 153(C).
    2. Qu, Leming & Yin, Wotao, 2012. "Copula density estimation by total variation penalized likelihood with linear equality constraints," Computational Statistics & Data Analysis, Elsevier, vol. 56(2), pages 384-398.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Federico Palacios-González & Rosa M. García-Fernández, 2020. "A faster algorithm to estimate multiresolution densities," Computational Statistics, Springer, vol. 35(3), pages 1207-1230, September.
    2. Bak, Kwan-Young & Jhong, Jae-Hwan & Lee, JungJun & Shin, Jae-Kyung & Koo, Ja-Yong, 2021. "Penalized logspline density estimation using total variation penalty," Computational Statistics & Data Analysis, Elsevier, vol. 153(C).
    3. Morettin Pedro A. & Toloi Clelia M.C. & Chiann Chang & de Miranda José C.S., 2011. "Wavelet Estimation of Copulas for Time Series," Journal of Time Series Econometrics, De Gruyter, vol. 3(3), pages 1-31, October.
    4. Jun Yan & Jian Huang, 2012. "Model Selection for Cox Models with Time-Varying Coefficients," Biometrics, The International Biometric Society, vol. 68(2), pages 419-428, June.
    5. Vincent, Martin & Hansen, Niels Richard, 2014. "Sparse group lasso and high dimensional multinomial classification," Computational Statistics & Data Analysis, Elsevier, vol. 71(C), pages 771-786.
    6. Shuang Zhang & Xingdong Feng, 2022. "Distributed identification of heterogeneous treatment effects," Computational Statistics, Springer, vol. 37(1), pages 57-89, March.
    7. Jung, Yoon Mo & Whang, Joyce Jiyoung & Yun, Sangwoon, 2020. "Sparse probabilistic K-means," Applied Mathematics and Computation, Elsevier, vol. 382(C).
    8. Seunghwan Lee & Sang Cheol Kim & Donghyeon Yu, 2023. "An efficient GPU-parallel coordinate descent algorithm for sparse precision matrix estimation via scaled lasso," Computational Statistics, Springer, vol. 38(1), pages 217-242, March.
    9. Le Thi Khanh Hien & Duy Nhat Phan & Nicolas Gillis, 2022. "Inertial alternating direction method of multipliers for non-convex non-smooth optimization," Computational Optimization and Applications, Springer, vol. 83(1), pages 247-285, September.
    10. Victor Chernozhukov & Whitney K. Newey & Victor Quintas-Martinez & Vasilis Syrgkanis, 2021. "Automatic Debiased Machine Learning via Riesz Regression," Papers 2104.14737, arXiv.org, revised Mar 2024.
    11. Jiahe Lin & George Michailidis, 2019. "Approximate Factor Models with Strongly Correlated Idiosyncratic Errors," Papers 1912.04123, arXiv.org.
    12. Emilie Chouzenoux & Jean-Christophe Pesquet & Audrey Repetti, 2016. "A block coordinate variable metric forward–backward algorithm," Journal of Global Optimization, Springer, vol. 66(3), pages 457-485, November.
    13. Kirkby, J. Lars & Leitao, Álvaro & Nguyen, Duy, 2021. "Nonparametric density estimation and bandwidth selection with B-spline bases: A novel Galerkin method," Computational Statistics & Data Analysis, Elsevier, vol. 159(C).
    14. Chang, Meng-Shiuh & Wu, Ximing, 2015. "Transformation-based nonparametric estimation of multivariate densities," Journal of Multivariate Analysis, Elsevier, vol. 135(C), pages 71-88.
    15. Chavez-Demoulin, V. & Embrechts, P. & Sardy, S., 2014. "Extreme-quantile tracking for financial time series," Journal of Econometrics, Elsevier, vol. 181(1), pages 44-52.
    16. Koenker, Roger & Portnoy, Stephen, 2000. "Some pathological regression asymptotics under stable conditions," Statistics & Probability Letters, Elsevier, vol. 50(3), pages 219-228, November.
    17. Chen, Kun & Huang, Rui & Chan, Ngai Hang & Yau, Chun Yip, 2019. "Subgroup analysis of zero-inflated Poisson regression model with applications to insurance data," Insurance: Mathematics and Economics, Elsevier, vol. 86(C), pages 8-18.
    18. Richard Spady & Sami Stouli, 2020. "Gaussian Transforms Modeling and the Estimation of Distributional Regression Functions," Papers 2011.06416, arXiv.org.
    19. Koo, Ja-Yong & Kooperberg, Charles, 2000. "Logspline density estimation for binned data," Statistics & Probability Letters, Elsevier, vol. 46(2), pages 133-147, January.
    20. Paul Tseng, 2004. "An Analysis of the EM Algorithm and Entropy-Like Proximal Point Methods," Mathematics of Operations Research, INFORMS, vol. 29(1), pages 27-44, February.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bla:scjsta:v:37:y:2010:i:2:p:321-337. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: http://www.blackwellpublishing.com/journal.asp?ref=0303-6898 .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.