IDEAS home Printed from https://ideas.repec.org/a/spr/aistmt/v74y2022i1d10.1007_s10463-021-00791-6.html
   My bibliography  Save this article

Asymptotic behavior of the number of distinct values in a sample from the geometric stick-breaking process

Author

Listed:
  • Pierpaolo De Blasi

    (University of Torino and Collegio Carlo Alberto)

  • Ramsés H. Mena

    (IIMAS Universidad Nacional Autónoma de México)

  • Igor Prünster

    (Bocconi University and BIDSA)

Abstract

Discrete random probability measures are a key ingredient of Bayesian nonparametric inference. A sample generates ties with positive probability and a fundamental object of both theoretical and applied interest is the corresponding number of distinct values. The growth rate can be determined from the rate of decay of the small frequencies implying that, when the decreasingly ordered frequencies admit a tractable form, the asymptotics of the number of distinct values can be conveniently assessed. We focus on the geometric stick-breaking process and we investigate the effect of the distribution for the success probability on the asymptotic behavior of the number of distinct values. A whole range of logarithmic behaviors are obtained by appropriately tuning the prior. A two-term expansion is also derived and illustrated in a comparison with a larger family of discrete random probability measures having an additional parameter given by the scale of the negative binomial distribution.

Suggested Citation

  • Pierpaolo De Blasi & Ramsés H. Mena & Igor Prünster, 2022. "Asymptotic behavior of the number of distinct values in a sample from the geometric stick-breaking process," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 74(1), pages 143-165, February.
  • Handle: RePEc:spr:aistmt:v:74:y:2022:i:1:d:10.1007_s10463-021-00791-6
    DOI: 10.1007/s10463-021-00791-6
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s10463-021-00791-6
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s10463-021-00791-6?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Hatjispyros, Spyridon J. & Merkatas, Christos & Nicoleris, Theodoros & Walker, Stephen G., 2018. "Dependent mixtures of geometric weights priors," Computational Statistics & Data Analysis, Elsevier, vol. 119(C), pages 1-18.
    2. De Blasi, Pierpaolo & Martínez, Asael Fabian & Mena, Ramsés H. & Prünster, Igor, 2020. "On the inferential implications of decreasing weight structures in mixture models," Computational Statistics & Data Analysis, Elsevier, vol. 147(C).
    3. François Caron & Emily B. Fox, 2017. "Sparse graphs using exchangeable random measures," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 79(5), pages 1295-1366, November.
    4. Raffaele Argiento & Andrea Cremaschi & Marina Vannucci, 2020. "Hierarchical Normalized Completely Random Measures to Cluster Grouped Data," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 115(529), pages 318-333, January.
    5. Antonio Lijoi & Ramsés H. Mena & Igor Prünster, 2007. "Controlling the reinforcement in Bayesian non‐parametric mixture models," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 69(4), pages 715-740, September.
    6. Antonio Lijoi & Ramsés H. Mena & Igor Prünster, 2007. "A Bayesian Nonparametric Method for Prediction in EST Analysis," ICER Working Papers - Applied Mathematics Series 16-2007, ICER - International Centre for Economic Research.
    7. Ishwaran H. & James L. F, 2001. "Gibbs Sampling Methods for Stick Breaking Priors," Journal of the American Statistical Association, American Statistical Association, vol. 96, pages 161-173, March.
    8. Gutiérrez, Luis & Gutiérrez-Peña, Eduardo & Mena, Ramsés H., 2014. "Bayesian nonparametric classification for spectroscopy data," Computational Statistics & Data Analysis, Elsevier, vol. 78(C), pages 56-68.
    9. David B. Dahl & Ryan Day & Jerry W. Tsai, 2017. "Random Partition Distribution Indexed by Pairwise Information," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 112(518), pages 721-732, April.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Hatjispyros, Spyridon J. & Merkatas, Christos & Walker, Stephen G., 2023. "Mixture models with decreasing weights," Computational Statistics & Data Analysis, Elsevier, vol. 179(C).
    2. Iksanov, Alexander & Kotelnikova, Valeriya, 2022. "Small counts in nested Karlin’s occupancy scheme generated by discrete Weibull-like distributions," Stochastic Processes and their Applications, Elsevier, vol. 153(C), pages 283-320.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Julyan Arbel & Stefano Favaro, 2021. "Approximating Predictive Probabilities of Gibbs-Type Priors," Sankhya A: The Indian Journal of Statistics, Springer;Indian Statistical Institute, vol. 83(1), pages 496-519, February.
    2. De Blasi, Pierpaolo & Martínez, Asael Fabian & Mena, Ramsés H. & Prünster, Igor, 2020. "On the inferential implications of decreasing weight structures in mixture models," Computational Statistics & Data Analysis, Elsevier, vol. 147(C).
    3. Stefano Favaro & Antonio Lijoi & Igor Prünster, 2012. "A New Estimator of the Discovery Probability," Biometrics, The International Biometric Society, vol. 68(4), pages 1188-1196, December.
    4. Pierpaolo De Blasi & Stefano Favaro & Antonio Lijoi & Ramsés H. Mena & Igor Prünster & Mattteo Ruggiero, 2013. "Are Gibbs-type priors the most natural generalization of the Dirichlet process?," DEM Working Papers Series 054, University of Pavia, Department of Economics and Management.
    5. Cesari, Oriana & Favaro, Stefano & Nipoti, Bernardo, 2014. "Posterior analysis of rare variants in Gibbs-type species sampling models," Journal of Multivariate Analysis, Elsevier, vol. 131(C), pages 79-98.
    6. Laura Liu & Hyungsik Roger Moon & Frank Schorfheide, 2023. "Forecasting with a panel Tobit model," Quantitative Economics, Econometric Society, vol. 14(1), pages 117-159, January.
    7. Kolossiatis, M. & Griffin, J.E. & Steel, M.F.J., 2011. "Modeling overdispersion with the normalized tempered stable distribution," Computational Statistics & Data Analysis, Elsevier, vol. 55(7), pages 2288-2301, July.
    8. José J. Quinlan & Fernando A. Quintana & Garritt L. Page, 2021. "On a class of repulsive mixture models," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 30(2), pages 445-461, June.
    9. Igari, Ryosuke & Hoshino, Takahiro, 2018. "A Bayesian data combination approach for repeated durations under unobserved missing indicators: Application to interpurchase-timing in marketing," Computational Statistics & Data Analysis, Elsevier, vol. 126(C), pages 150-166.
    10. Stefano Favaro & Antonio Lijoi & Igor Prünster, 2012. "On the stick–breaking representation of normalized inverse Gaussian priors," DEM Working Papers Series 008, University of Pavia, Department of Economics and Management.
    11. Abel Rodriguez & Enrique ter Horst, 2008. "Measuring expectations in options markets: An application to the SP500 index," Papers 0901.0033, arXiv.org.
    12. Ryo Kato & Takahiro Hoshino, 2020. "Semiparametric Bayesian multiple imputation for regression models with missing mixed continuous–discrete covariates," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 72(3), pages 803-825, June.
    13. Wang, Ketong & Porter, Michael D., 2018. "Optimal Bayesian clustering using non-negative matrix factorization," Computational Statistics & Data Analysis, Elsevier, vol. 128(C), pages 395-411.
    14. Bikramjit Das & Tiandong Wang & Gengling Dai, 2022. "Asymptotic Behavior of Common Connections in Sparse Random Networks," Methodology and Computing in Applied Probability, Springer, vol. 24(3), pages 2071-2092, September.
    15. Angelos Dassios & Junyi Zhang, 2023. "Exact Simulation of Poisson-Dirichlet Distribution and Generalised Gamma Process," Methodology and Computing in Applied Probability, Springer, vol. 25(2), pages 1-21, June.
    16. Zhang, Hongmei & Ghosh, Kaushik & Ghosh, Pulak, 2012. "Sampling designs via a multivariate hypergeometric-Dirichlet process model for a multi-species assemblage with unknown heterogeneity," Computational Statistics & Data Analysis, Elsevier, vol. 56(8), pages 2562-2573.
    17. Riva Palacio, Alan & Leisen, Fabrizio, 2018. "Integrability conditions for compound random measures," Statistics & Probability Letters, Elsevier, vol. 135(C), pages 32-37.
    18. Laura Liu, 2018. "Density Forecasts in Panel Data Models : A Semiparametric Bayesian Perspective," Finance and Economics Discussion Series 2018-036, Board of Governors of the Federal Reserve System (U.S.).
    19. Billio, Monica & Casarin, Roberto & Rossini, Luca, 2019. "Bayesian nonparametric sparse VAR models," Journal of Econometrics, Elsevier, vol. 212(1), pages 97-115.
    20. Yuan Fang & Dimitris Karlis & Sanjeena Subedi, 2022. "Infinite Mixtures of Multivariate Normal-Inverse Gaussian Distributions for Clustering of Skewed Data," Journal of Classification, Springer;The Classification Society, vol. 39(3), pages 510-552, November.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:aistmt:v:74:y:2022:i:1:d:10.1007_s10463-021-00791-6. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.