IDEAS home Printed from https://ideas.repec.org/a/gam/jmathe/v11y2022i1p73-d1014308.html
   My bibliography  Save this article

A Noise-Aware Multiple Imputation Algorithm for Missing Data

Author

Listed:
  • Fangfang Li

    (School of Computer Science and Engineering, Northeastern University, Shenyang 110169, China)

  • Hui Sun

    (School of Computer Science and Engineering, Northeastern University, Shenyang 110169, China)

  • Yu Gu

    (School of Computer Science and Engineering, Northeastern University, Shenyang 110169, China)

  • Ge Yu

    (School of Computer Science and Engineering, Northeastern University, Shenyang 110169, China)

Abstract

Missing data is a common and inevitable phenomenon. In practical applications, the datasets usually contain noises for various reasons. Most of the existing missing data imputing algorithms are affected by noises which reduce the accuracy of the imputation. This paper proposes a noise-aware missing data multiple imputation algorithm NPMI in static data. Different multiple imputation models are proposed according to the missing mechanism of data. Secondly, the method to determine the imputation order of multivariablesmissing is given. A random sampling consistency algorithm is proposed to estimate the initial values of the parameters of the multiple imputation model to reduce the influence of noise data and improve the algorithm’s robustness. Experiments on two real datasets and two synthetic datasets verify the accuracy and efficiency of the proposed NPMI algorithm, and the results are analyzed.

Suggested Citation

  • Fangfang Li & Hui Sun & Yu Gu & Ge Yu, 2022. "A Noise-Aware Multiple Imputation Algorithm for Missing Data," Mathematics, MDPI, vol. 11(1), pages 1-16, December.
  • Handle: RePEc:gam:jmathe:v:11:y:2022:i:1:p:73-:d:1014308
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2227-7390/11/1/73/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2227-7390/11/1/73/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Sascha Lindig & Atse Louwen & David Moser & Marko Topic, 2020. "Outdoor PV System Monitoring—Input Data Quality, Data Imputation and Filtering Approaches," Energies, MDPI, vol. 13(19), pages 1-18, September.
    2. Zhiwei Fu & Bruce L. Golden & Shreevardhan Lele & S. Raghavan & Edward A. Wasil, 2003. "A Genetic Algorithm-Based Approach for Building Accurate Decision Trees," INFORMS Journal on Computing, INFORMS, vol. 15(1), pages 3-22, February.
    3. Hai‐Anh Dang & Dean Jolliffe & Calogero Carletto, 2019. "Data Gaps, Data Incomparability, And Data Imputation: A Review Of Poverty Measurement Methods For Data‐Scarce Environments," Journal of Economic Surveys, Wiley Blackwell, vol. 33(3), pages 757-797, July.
    4. Lai, Peng & Wang, Qihua, 2014. "Semiparametric efficient estimation for partially linear single-index models with responses missing at random," Journal of Multivariate Analysis, Elsevier, vol. 128(C), pages 33-50.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Dang, Hai-Anh H. & Serajuddin, Umar, 2020. "Tracking the sustainable development goals: Emerging measurement challenges and further reflections," World Development, Elsevier, vol. 127(C).
    2. Dang, Hai-Anh H. & Verme, Paolo, 2019. "Estimating Poverty for Refugee Populations: Can Cross-Survey Imputation Methods Substitute for Data Scarcity?," GLO Discussion Paper Series 429, Global Labor Organization (GLO).
    3. Dang,Hai-Anh H. & Kilic,Talip & Carletto,Calogero & Abanokova,Kseniya, 2021. "Poverty Imputation in Contexts without Consumption Data : A Revisit with Further Refinements," Policy Research Working Paper Series 9838, The World Bank.
    4. Romero-Fiances, Irene & Livera, Andreas & Theristis, Marios & Makrides, George & Stein, Joshua S. & Nofuentes, Gustavo & de la Casa, Juan & Georghiou, George E., 2022. "Impact of duration and missing data on the long-term photovoltaic degradation rate estimation," Renewable Energy, Elsevier, vol. 181(C), pages 738-748.
    5. Dang, Hai-Anh H & Carletto, Calogero, 2022. "Recall Bias Revisited: Measure Farm Labor Using Mixed-Mode Surveys and Multiple Imputation," IZA Discussion Papers 14997, Institute of Labor Economics (IZA).
    6. Dang,Hai-Anh H. & Lanjouw,Peter F., 2020. "Welfare Dynamics in India over a Quarter Century : Poverty, Vulnerability, and Mobility during 1987-2012," Policy Research Working Paper Series 9231, The World Bank.
    7. Zaira Najam & John Gibson, 2021. "Does within-country poverty convergence depend on spatial spillovers and the type of poverty measure? Evidence from Pakistan," Working Papers in Economics 21/07, University of Waikato.
    8. Hai-Anh H. Dang & Peter F. Lanjouw, 2023. "Regression-based imputation for poverty measurement in data-scarce settings," Chapters, in: Jacques Silber (ed.), Research Handbook on Measuring Poverty and Deprivation, chapter 13, pages 141-150, Edward Elgar Publishing.
    9. Zhiwei Fu & Bruce Golden & Shreevardhan Lele & S. Raghavan & Edward Wasil, 2003. "Genetically Engineered Decision Trees: Population Diversity Produces Smarter Trees," Operations Research, INFORMS, vol. 51(6), pages 894-907, December.
    10. Theresa Beltramo & Hai-Anh Dang & Ibrahima Sarr & Paolo Verme, 2024. "Estimating poverty among refugee populations: a cross-survey imputation exercise for Chad," Oxford Development Studies, Taylor & Francis Journals, vol. 52(1), pages 94-113, January.
    11. Dang, Hai-Anh H & Kilic, Talip & Hlasny, Vladimir & Abanokova, Kseniya & Carletto, Calogero, 2024. "Using Survey-to-Survey Imputation to Fill Poverty Data Gaps at a Low Cost: Evidence from a Randomized Survey Experiment," IZA Discussion Papers 16792, Institute of Labor Economics (IZA).
    12. Dang, Hai-Anh H & Hallegatte, Stephane & Trinh, Trong-Anh, 2023. "Does Global Warming Worsen Poverty and Inequality? An Updated Review," IZA Discussion Papers 16570, Institute of Labor Economics (IZA).
    13. François Bourguignon & A. Hector Moreno M., 2020. "On synthetic income panels," Working Papers halshs-01988068, HAL.
    14. Hai-Anh H. Dang & Talip Kilic & Ksenia Abanokova & Gero Carletto, 2024. "Imputing Poverty Indicators without Consumption Data : An Exploratory Analysis," Policy Research Working Paper Series 10867, The World Bank.
    15. Dang, Hai-Anh H. & Raju, Dhushyanth & Tanaka, Tomomi & Abanokova, Kseniya, 2024. "Tackling the Last Hurdles of Poverty Entrenchment: An Investigation of Poverty Dynamics for Ghana during 2005/06-2016/17," GLO Discussion Paper Series 1376, Global Labor Organization (GLO).
    16. Paolo Verme, 2023. "Predicting Poverty with Missing Incomes," Working Papers 642, ECINEQ, Society for the Study of Economic Inequality.
    17. Peter Lanjouw & Hai-Anh Dang, 2018. "Inequality trends and dynamics in India: The bird’s-eye and the granular perspectives," WIDER Working Paper Series 189, World Institute for Development Economic Research (UNU-WIDER).
    18. Luo, Wei & Cai, Xizhen, 2016. "A new estimator for efficient dimension reduction in regression," Journal of Multivariate Analysis, Elsevier, vol. 145(C), pages 236-249.
    19. Naseeha Islam & Pramesh Koju & Reetu Manandhar & Sudip Shrestha & Charlotte Smith, 2020. "Assessing the Impacts of Relative Wealth and Geospatial Factors on Water Access in Rural Nepal: A Community Case Study," IJERPH, MDPI, vol. 17(18), pages 1-13, September.
    20. Abanokova, Kseniya & Dang, Hai-Anh H & Lokshin, Michael, 2020. "The Important Role of Equivalence Scales: Household Size, Composition, and Poverty Dynamics in Russia," IZA Discussion Papers 13043, Institute of Labor Economics (IZA).

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jmathe:v:11:y:2022:i:1:p:73-:d:1014308. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.