IDEAS home Printed from https://ideas.repec.org/a/spr/stabio/v16y2024i2d10.1007_s12561-023-09408-3.html
   My bibliography  Save this article

Statistical Methods for Modeling Exposure Variables Subject to Limit of Detection

Author

Listed:
  • Eunsil Seok

    (New York University Grossman School of Medicine)

  • Akhgar Ghassabian

    (New York University Grossman School of Medicine
    New York University Grossman School of Medicine)

  • Yuyan Wang

    (New York University Grossman School of Medicine)

  • Mengling Liu

    (New York University Grossman School of Medicine)

Abstract

Environmental health research aims to assess the impact of environmental exposures, making it crucial to understand their effects due to their broad impacts on the general population. However, a common issue with measuring exposures using bio-samples in laboratory is that values below the limit of detection (LOD) are either left unreported or inaccurately read by machines, which subsequently influences the analysis and assessment of exposure effects on health outcomes. We address the challenge of handling exposure variables subject to LOD when they are treated as either covariates or an outcome. We evaluate the performance of commonly-used methods including complete-case analysis and fill-in method, and advanced techniques such as multiple imputation, missing-indicator model, two-part model, Tobit model, and several others. We compare these methods through simulations and a dataset from NHANES 2013–2014. Our numerical studies show that the missing-indicator model generally yields reasonable estimates when considering exposure variables as covariates under various settings, while other methods tend to be sensitive to the LOD-missing proportions and/or distributional skewness of exposures. When modeling an exposure variable as the outcome, Tobit model performs well under Gaussian distribution and quantile regression generally provides robust estimates across various shapes of the outcome’s distribution. In the presence of missing data due to LOD, different statistical models should be considered for being aligned with scientific questions, model assumptions, requirements of data distributions, as well as their interpretations. Sensitivity analysis to handle LOD-missing exposures can improve the robustness of model conclusions.

Suggested Citation

  • Eunsil Seok & Akhgar Ghassabian & Yuyan Wang & Mengling Liu, 2024. "Statistical Methods for Modeling Exposure Variables Subject to Limit of Detection," Statistics in Biosciences, Springer;International Chinese Statistical Association, vol. 16(2), pages 435-458, July.
  • Handle: RePEc:spr:stabio:v:16:y:2024:i:2:d:10.1007_s12561-023-09408-3
    DOI: 10.1007/s12561-023-09408-3
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s12561-023-09408-3
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s12561-023-09408-3?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. van Buuren, Stef & Groothuis-Oudshoorn, Karin, 2011. "mice: Multivariate Imputation by Chained Equations in R," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 45(i03).
    2. Hua He & Xuenan Mi & Jerry Cornell & Wan Tang & Tanika Kelly & Hui Shen & Hongwen Deng & Yan Du, 2020. "Statistical Issues on Analysis of Censored Data Due to Detection Limit," International Journal of Statistics and Probability, Canadian Center of Science and Education, vol. 9(4), pages 1-49, July.
    3. Amemiya, Takeshi, 1984. "Tobit models: A survey," Journal of Econometrics, Elsevier, vol. 24(1-2), pages 3-61.
    4. Koenker, Roger W & Bassett, Gilbert, Jr, 1978. "Regression Quantiles," Econometrica, Econometric Society, vol. 46(1), pages 33-50, January.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Gerko Vink & Laurence E. Frank & Jeroen Pannekoek & Stef Buuren, 2014. "Predictive mean matching imputation of semicontinuous variables," Statistica Neerlandica, Netherlands Society for Statistics and Operations Research, vol. 68(1), pages 61-90, February.
    2. Marco Capasso & Tania Treibich & Bart Verspagen, 2015. "The medium-term effect of R&D on firm growth," Small Business Economics, Springer, vol. 45(1), pages 39-62, June.
    3. Bertanha, Marinho & McCallum, Andrew H. & Seegert, Nathan, 2023. "Better bunching, nicer notching," Journal of Econometrics, Elsevier, vol. 237(2).
    4. Marco Geraci & Alexander McLain, 2018. "Multiple Imputation for Bounded Variables," Psychometrika, Springer;The Psychometric Society, vol. 83(4), pages 919-940, December.
    5. Madhavan-Nambiar, Padmanand & Florkowski, Wojciech & Chinnan, Manjeet & Ressurrecion, Anna, 2014. "Factors Driving Fruit and Vegetable Expenditures and Consumption Frequency in Lesser Developed Country: an Analysis of Urban Households from the Republic of Uganda," 2014 Annual Meeting, February 1-4, 2014, Dallas, Texas 162414, Southern Agricultural Economics Association.
    6. Gustavsen, Geir Waehler & Jolliffe, Dean & Rickertsen, Kyrre, 2008. "Censored Quantile Regression and Purchases of Ice Cream," 2008 Annual Meeting, July 27-29, 2008, Orlando, Florida 6534, American Agricultural Economics Association (New Name 2008: Agricultural and Applied Economics Association).
    7. Danúbia R. Cunha & Jose Angelo Divino & Helton Saulo, 2022. "On a log-symmetric quantile tobit model applied to female labor supply data," Journal of Applied Statistics, Taylor & Francis Journals, vol. 49(16), pages 4225-4253, December.
    8. Gustavsen, Geir Waehler & Rickertsen, Kyrre, 2004. "For Whom Reduced Prices Count: A Censored Quantile Regression Analysis Of Vegetable Demand," 2004 Annual meeting, August 1-4, Denver, CO 20172, American Agricultural Economics Association (New Name 2008: Agricultural and Applied Economics Association).
    9. Kim, Mo Se & Lee, Byung Sung & Lee, Hye Seon & Lee, Seung Ho & Lee, Junseok & Kim, Wonse, 2020. "Robust estimation of outage costs in South Korea using a machine learning technique: Bayesian Tobit quantile regression," Applied Energy, Elsevier, vol. 278(C).
    10. Jayabrata Biswas & Pulak Ghosh & Kiranmoy Das, 2020. "A semi-parametric quantile regression approach to zero-inflated and incomplete longitudinal outcomes," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 104(2), pages 261-283, June.
    11. Akosah, Nana Kwame & Alagidede, Imhotep Paul & Schaling, Eric, 2020. "Testing for asymmetry in monetary policy rule for small-open developing economies: Multiscale Bayesian quantile evidence from Ghana," The Journal of Economic Asymmetries, Elsevier, vol. 22(C).
    12. Paul Hewson & Keming Yu, 2008. "Quantile regression for binary performance indicators," Applied Stochastic Models in Business and Industry, John Wiley & Sons, vol. 24(5), pages 401-418, September.
    13. Salimata Sissoko, 2011. "Working Paper 03-11 - Niveau de décentralisation de la négociation et structure des salaires," Working Papers 1103, Federal Planning Bureau, Belgium.
    14. Korom, Philipp, 2016. "Inherited advantage: The importance of inheritance for private wealth accumulation in Europe," MPIfG Discussion Paper 16/11, Max Planck Institute for the Study of Societies.
    15. Calcagno, R. & Renneboog, L.D.R., 2004. "Capital Structure and Managerial Compensation : The Effects of Renumeration Seniority," Discussion Paper 2004-120, Tilburg University, Center for Economic Research.
    16. Daniele, Vittorio, 2007. "Criminalità e investimenti esteri. Un’analisi per le province italiane [The effect of organized crime on Foreign Investments. An Empirical Analysis for the Italian Provinces]," MPRA Paper 6417, University Library of Munich, Germany.
    17. Ma, Lingjie & Koenker, Roger, 2006. "Quantile regression methods for recursive structural equation models," Journal of Econometrics, Elsevier, vol. 134(2), pages 471-506, October.
    18. Dutta, Anupam & Bouri, Elie & Rothovius, Timo & Uddin, Gazi Salah, 2023. "Climate risk and green investments: New evidence," Energy, Elsevier, vol. 265(C).
    19. Cowling, Marc & Ughetto, Elisa & Lee, Neil, 2018. "The innovation debt penalty: Cost of debt, loan default, and the effects of a public loan guarantee on high-tech firms," Technological Forecasting and Social Change, Elsevier, vol. 127(C), pages 166-176.
    20. Abhilash Bandam & Eedris Busari & Chloi Syranidou & Jochen Linssen & Detlef Stolten, 2022. "Classification of Building Types in Germany: A Data-Driven Modeling Approach," Data, MDPI, vol. 7(4), pages 1-23, April.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:stabio:v:16:y:2024:i:2:d:10.1007_s12561-023-09408-3. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.