Printed from https://ideas.repec.org/a/taf/amstat/v71y2017i3p191-201.html

A Note on the Inverse Birthday Problem With Applications

Author

Listed:
  • Wen-Han Hwang
  • Richard Huggins
  • Lu-Fang Chen

Abstract

The classical birthday problem considers the probability that at least two people in a group of size N share the same birthday. The inverse birthday problem considers the estimation of the size N of a group given the number of different birthdays in the group. In practice, this problem is analogous to estimating the size of a population from occurrence data only. The inverse problem can be solved via two simple approaches, which we present in this study: the method of moments for a multinomial model and the maximum likelihood estimate of a Poisson model. We investigate properties of both methods and show that they can yield asymptotically equivalent Wald-type interval estimators. Moreover, we show that these methods estimate a lower bound for the population size when birth rates are nonhomogeneous or individuals in the population are aggregated. A simulation study was conducted to evaluate the performance of the point estimates arising from the two approaches and to compare the performance of seven interval estimators, including likelihood ratio and log-transformation methods. We illustrate the utility of these methods by estimating: (1) the abundance of tree species over a 50-hectare forest plot, (2) the number of Chlamydia infections when only the number of different birthdays of the patients is known, and (3) the number of rainy days when the number of rainy weeks is known. Supplementary materials for this article are available online.
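
As a rough illustration of the two approaches named in the abstract, the sketch below uses the textbook occupancy setup: N individuals fall independently and uniformly into k cells (k = 365 for birthdays) and only the number d of occupied cells is observed. The closed forms, the function names, and the delta-method interval are assumptions based on that standard model, not formulas taken from the paper, whose estimators and seven intervals may differ in detail.

```python
import math


def mom_multinomial(d, k=365):
    """Method-of-moments estimate under a uniform multinomial model.

    Assumed moment equation: E[D] = k * (1 - (1 - 1/k)**N), solved for N.
    """
    if not 0 <= d < k:
        raise ValueError("need 0 <= d < k; the estimate diverges when d == k")
    return math.log(1 - d / k) / math.log(1 - 1 / k)


def mle_poisson(d, k=365):
    """Maximum likelihood estimate under a Poisson model.

    Assumed model: each cell count is Poisson(N/k), so a cell is occupied
    with probability 1 - exp(-N/k) and D is Binomial(k, 1 - exp(-N/k)).
    """
    if not 0 <= d < k:
        raise ValueError("need 0 <= d < k; the estimate diverges when d == k")
    return -k * math.log(1 - d / k)


def wald_interval_poisson(d, k=365, z=1.96):
    """Wald-type interval for N from the Poisson-model MLE.

    The standard error sqrt(k * d / (k - d)) comes from a delta-method
    argument on the binomial proportion d / k (an assumption of this sketch).
    """
    n_hat = mle_poisson(d, k)
    se = math.sqrt(k * d / (k - d))
    return n_hat - z * se, n_hat + z * se


if __name__ == "__main__":
    d_observed = 90  # hypothetical count of distinct birthdays
    print("method of moments:", mom_multinomial(d_observed))
    print("Poisson MLE:      ", mle_poisson(d_observed))
    print("Wald interval:    ", wald_interval_poisson(d_observed))
```

Under these assumed forms the two point estimates differ only by the factor -1 / (k log(1 - 1/k)), which approaches 1 as k grows. This is consistent with the abstract's statement that the two methods can yield asymptotically equivalent interval estimators.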

Suggested Citation

  • Wen-Han Hwang & Richard Huggins & Lu-Fang Chen, 2017. "A Note on the Inverse Birthday Problem With Applications," The American Statistician, Taylor & Francis Journals, vol. 71(3), pages 191-201, July.
  • Handle: RePEc:taf:amstat:v:71:y:2017:i:3:p:191-201
    DOI: 10.1080/00031305.2016.1255657

    Download full text from publisher

    File URL: http://hdl.handle.net/10.1080/00031305.2016.1255657
    Download Restriction: Access to full text is restricted to subscribers.


    References listed on IDEAS

    1. Lucio Barabesi & Marzia Marcheselli, 2011. "Parameter estimation in the classical occupancy model," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 20(3), pages 305-327, August.
    2. Shirley Pledger, 2000. "Unified Maximum Likelihood Estimates for Closed Capture–Recapture Models Using Mixtures," Biometrics, The International Biometric Society, vol. 56(2), pages 434-442, June.
    3. Williamson, Patricia Pepple & Mays, Darcy P. & Abay Asmerom, Ghidewon & Yang, Yingying, 2009. "Revisiting the Classical Occupancy Problem," The American Statistician, American Statistical Association, vol. 63(4), pages 356-360.
    4. Müller, Christine H. & Huggins, Richard & Hwang, Wen-Han, 2011. "Consistent estimation of species abundance from a presence-absence map," Statistics & Probability Letters, Elsevier, vol. 81(9), pages 1449-1457, September.
    5. Dorazio, Robert M. & Royle, J. Andrew, 2005. "Estimating Size and Composition of Biological Communities by Modeling the Occurrence of Species," Journal of the American Statistical Association, American Statistical Association, vol. 100, pages 389-398, June.
    6. Gneiting, Tilmann & Raftery, Adrian E., 2007. "Strictly Proper Scoring Rules, Prediction, and Estimation," Journal of the American Statistical Association, American Statistical Association, vol. 102, pages 359-378, March.

    Citations



    Cited by:

    1. Wen‐Han Hwang & Richard Huggins & Jakub Stoklosa, 2022. "A model for analyzing clustered occurrence data," Biometrics, The International Biometric Society, vol. 78(2), pages 598-611, June.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Wen-Han Hwang & Richard Huggins, 2016. "Estimating Abundance from Presence–Absence Maps via a Paired Negative-Binomial Model," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 43(2), pages 573-586, June.
    2. J. Andrew Royle, 2006. "Site Occupancy Models with Heterogeneous Detection Probabilities," Biometrics, The International Biometric Society, vol. 62(1), pages 97-102, March.
    3. Azar, Pablo D. & Micali, Silvio, 2018. "Computational principal agent problems," Theoretical Economics, Econometric Society, vol. 13(2), May.
    4. Rubio, F.J. & Steel, M.F.J., 2011. "Inference for grouped data with a truncated skew-Laplace distribution," Computational Statistics & Data Analysis, Elsevier, vol. 55(12), pages 3218-3231, December.
    5. Paul S. F. Yip & Hua-Zhen Lin & Liqun Xi, 2005. "A Semiparametric Method for Estimating Population Size for Capture–Recapture Experiments with Random Covariates in Continuous Time," Biometrics, The International Biometric Society, vol. 61(4), pages 1085-1092, December.
    6. R de Fondeville & A C Davison, 2018. "High-dimensional peaks-over-threshold inference," Biometrika, Biometrika Trust, vol. 105(3), pages 575-592.
    7. Domenico Piccolo & Rosaria Simone, 2019. "The class of cub models: statistical foundations, inferential issues and empirical evidence," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 28(3), pages 389-435, September.
    8. Finn Lindgren, 2015. "Comments on: Comparing and selecting spatial predictors using local criteria," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 24(1), pages 35-44, March.
    9. Chuliá, Helena & Garrón, Ignacio & Uribe, Jorge M., 2024. "Daily growth at risk: Financial or real drivers? The answer is not always the same," International Journal of Forecasting, Elsevier, vol. 40(2), pages 762-776.
    10. Laura Liu & Hyungsik Roger Moon & Frank Schorfheide, 2023. "Forecasting with a panel Tobit model," Quantitative Economics, Econometric Society, vol. 14(1), pages 117-159, January.
    11. Armantier, Olivier & Treich, Nicolas, 2013. "Eliciting beliefs: Proper scoring rules, incentives, stakes and hedging," European Economic Review, Elsevier, vol. 62(C), pages 17-40.
    12. Peysakhovich, Alexander & Plagborg-Møller, Mikkel, 2012. "A note on proper scoring rules and risk aversion," Economics Letters, Elsevier, vol. 117(1), pages 357-361.
    13. Merkle, Edgar C. & Steyvers, Mark & Mellers, Barbara & Tetlock, Philip E., 2017. "A neglected dimension of good forecasting judgment: The questions we choose also matter," International Journal of Forecasting, Elsevier, vol. 33(4), pages 817-832.
    14. Remy Elbez & Jeff Folz & Alan McLean & Hernan Roca & Joseph M Labuz & Kenneth J Pienta & Shuichi Takayama & Raoul Kopelman, 2021. "Cell-morphodynamic phenotype classification with application to cancer metastasis using cell magnetorotation and machine-learning," PLOS ONE, Public Library of Science, vol. 16(11), pages 1-14, November.
    15. repec:bny:wpaper:0088 is not listed on IDEAS
    16. Lahiri, Kajal & Yang, Liu, 2013. "Forecasting Binary Outcomes," Handbook of Economic Forecasting, in: G. Elliott & C. Granger & A. Timmermann (ed.), Handbook of Economic Forecasting, edition 1, volume 2, chapter 0, pages 1025-1106, Elsevier.
    17. Ricardo Crisóstomo, 2021. "Estimating real‐world probabilities: A forward‐looking behavioral framework," Journal of Futures Markets, John Wiley & Sons, Ltd., vol. 41(11), pages 1797-1823, November.
    18. Blasques, Francisco & van Brummelen, Janneke & Gorgi, Paolo & Koopman, Siem Jan, 2024. "Maximum Likelihood Estimation for Non-Stationary Location Models with Mixture of Normal Distributions," Journal of Econometrics, Elsevier, vol. 238(1).
    19. Łukasz Lenart, 2017. "Examination of Seasonal Volatility in HICP for Baltic Region Countries: Non-Parametric Test versus Forecasting Experiment," Central European Journal of Economic Modelling and Econometrics, Central European Journal of Economic Modelling and Econometrics, vol. 9(1), pages 29-67, March.
    20. Magnus Reif, 2020. "Macroeconomics, Nonlinearities, and the Business Cycle," ifo Beiträge zur Wirtschaftsforschung, ifo Institute - Leibniz Institute for Economic Research at the University of Munich, number 87.
    21. Kiss, Tamás & Mazur, Stepan & Nguyen, Hoang, 2022. "Predicting returns and dividend growth — The role of non-Gaussian innovations," Finance Research Letters, Elsevier, vol. 46(PA).
