IDEAS home Printed from https://ideas.repec.org/a/spr/sankhb/v83y2021i1d10.1007_s13571-021-00251-4.html
   My bibliography  Save this article

Imputation for Skewed Data: Multivariate Lomax Case

Author

Listed:
  • Zhixin Lun

    (Oakland University
    University of California)

  • Ravindra Khattree

    (Oakland University)

Abstract

Most multiple imputation methods for multivariate missing data have been developed for normally distributed data. However, methods may not be suitable for nonnegative and/or highly skewed data. We propose an approach by using Expectation-Maximization (EM) method based on the assumption of multivariate Lomax distribution on non-negative skewed data. Extensive simulations show that this proposed method outperforms the regular normality-based EM and k-nearest-neighbor (k NN) imputation methods under the missing completely at random (MCAR) mechanism. An application on a real-world biomedical data is then provided.

Suggested Citation

  • Zhixin Lun & Ravindra Khattree, 2021. "Imputation for Skewed Data: Multivariate Lomax Case," Sankhya B: The Indian Journal of Statistics, Springer;Indian Statistical Institute, vol. 83(1), pages 86-113, May.
  • Handle: RePEc:spr:sankhb:v:83:y:2021:i:1:d:10.1007_s13571-021-00251-4
    DOI: 10.1007/s13571-021-00251-4
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s13571-021-00251-4
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s13571-021-00251-4?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Yulei He & Trivellore E. Raghunathan, 2012. "Multiple imputation using multivariate gh transformations," Journal of Applied Statistics, Taylor & Francis Journals, vol. 39(10), pages 2177-2198, June.
    2. Kowarik, Alexander & Templ, Matthias, 2016. "Imputation with the R Package VIM," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 74(i07).
    3. Michael W. Robbins & Sujit K. Ghosh & Joshua D. Habiger, 2013. "Imputation in High-Dimensional Economic Data as Applied to the Agricultural Resource Management Survey," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 108(501), pages 81-95, March.
    4. Hakan Demirtas & Donald Hedeker, 2008. "Imputing continuous data under some non‐Gaussian distributions," Statistica Neerlandica, Netherlands Society for Statistics and Operations Research, vol. 62(2), pages 193-205, May.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Paul T. von Hippel, 2013. "Should a Normal Imputation Model be Modified to Impute Skewed Variables?," Sociological Methods & Research, , vol. 42(1), pages 105-138, February.
    2. Marco Geraci & Alexander McLain, 2018. "Multiple Imputation for Bounded Variables," Psychometrika, Springer;The Psychometric Society, vol. 83(4), pages 919-940, December.
    3. Juana Sanchez & Sydney Noelle Kahmann, 2017. "R&D, Attrition and Multiple Imputation in BRDIS," Working Papers 17-13, Center for Economic Studies, U.S. Census Bureau.
    4. Henry Webel & Lili Niu & Annelaura Bach Nielsen & Marie Locard-Paulet & Matthias Mann & Lars Juhl Jensen & Simon Rasmussen, 2024. "Imputation of label-free quantitative mass spectrometry-based proteomics data using self-supervised deep learning," Nature Communications, Nature, vol. 15(1), pages 1-15, December.
    5. Bram Janssens & Matthias Bogaert & Mathijs Maton, 2023. "Predicting the next Pogačar: a data analytical approach to detect young professional cycling talents," Annals of Operations Research, Springer, vol. 325(1), pages 557-588, June.
    6. Chhetri, Netra & Ghimire, Rajiv & Wagner, Melissa & Wang, Meng, 2020. "Global citizen deliberation: Case of world-wide views on climate and energy," Energy Policy, Elsevier, vol. 147(C).
    7. Ieva Burakauskaitė & Andrius Čiginas, 2023. "An Approach to Integrating a Non-Probability Sample in the Population Census," Mathematics, MDPI, vol. 11(8), pages 1-14, April.
    8. Carlos Miguel Lemos & Ross Joseph Gore & Ivan Puga-Gonzalez & F LeRon Shults, 2019. "Dimensionality and factorial invariance of religiosity among Christians and the religiously unaffiliated: A cross-cultural analysis based on the International Social Survey Programme," PLOS ONE, Public Library of Science, vol. 14(5), pages 1-36, May.
    9. Nicholas Tierney & Dianne Cook, 2018. "Expanding tidy data principles to facilitate missing data exploration, visualization and assessment of imputations," Monash Econometrics and Business Statistics Working Papers 14/18, Monash University, Department of Econometrics and Business Statistics.
    10. Burns, Christopher & Prager, Daniel & Ghosh, Sujit & Goodwin, Barry, 2015. "Imputing for Missing Data in the ARMS Household Section: A Multivariate Imputation Approach," 2015 AAEA & WAEA Joint Annual Meeting, July 26-28, San Francisco, California 205291, Agricultural and Applied Economics Association.
    11. Robbins, Michael W. & White, T. Kirk, 2014. "Direct Payments, Cash Rents, Land Values, and the Effects of Imputation in U.S. Farm-level Data," Agricultural and Resource Economics Review, Northeastern Agricultural and Resource Economics Association, vol. 43(3), pages 1-20, December.
    12. Sonja Herrmann & Christian Nagel, 2023. "Early Careers of Graduates from Private and Public Universities in Germany: A Comparison of Income Differences Regarding the First Employment," Research in Higher Education, Springer;Association for Institutional Research, vol. 64(1), pages 129-146, February.
    13. Oliver Hirsch & Charles Christian Adarkwah, 2018. "The Issue of Burnout and Work Satisfaction in Younger GPs—A Cluster Analysis Utilizing the HaMEdSi Study," IJERPH, MDPI, vol. 15(10), pages 1-10, October.
    14. Ahmad R. Alsaber & Jiazhu Pan & Adeeba Al-Hurban, 2021. "Handling Complex Missing Data Using Random Forest Approach for an Air Quality Monitoring Dataset: A Case Study of Kuwait Environmental Data (2012 to 2018)," IJERPH, MDPI, vol. 18(3), pages 1-25, February.
    15. Marc Kuhn & Viola Marquardt & Sarah Selinka, 2021. "“Is Sharing Really Caring?”: The Role of Environmental Concern and Trust Reflecting Usage Intention of “Station-Based” and “Free-Floating”—Carsharing Business Models," Sustainability, MDPI, vol. 13(13), pages 1-18, July.
    16. Elena Escolano-Pérez & Marta Bestué, 2021. "Academic Achievement in Spanish Secondary School Students: The Inter-Related Role of Executive Functions, Physical Activity and Gender," IJERPH, MDPI, vol. 18(4), pages 1-25, February.
    17. Xu, Ganggang & Genton, Marc G., 2015. "Efficient maximum approximated likelihood inference for Tukey’s g-and-h distribution," Computational Statistics & Data Analysis, Elsevier, vol. 91(C), pages 78-91.
    18. Nengsih Titin Agustin & Bertrand Frédéric & Maumy-Bertrand Myriam & Meyer Nicolas, 2019. "Determining the number of components in PLS regression on incomplete data set," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 18(6), pages 1-28, December.
    19. Florian M. Hollenbach & Iavor Bojinov & Shahryar Minhas & Nils W. Metternich & Michael D. Ward & Alexander Volfovsky, 2021. "Multiple Imputation Using Gaussian Copulas," Sociological Methods & Research, , vol. 50(3), pages 1259-1283, August.
    20. Morehart, Mitch & Milkove, Dan & Xu, Yang, 2014. "Multivariate Farm Debt Imputation in the Agricultural Resource Management Survey (ARMS)," 2014 Annual Meeting, July 27-29, 2014, Minneapolis, Minnesota 169401, Agricultural and Applied Economics Association.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:sankhb:v:83:y:2021:i:1:d:10.1007_s13571-021-00251-4. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.