IDEAS home Printed from https://ideas.repec.org/a/spr/mathme/v68y2008i3p407-428.html
   My bibliography  Save this article

A new approach for data editing and imputation

Author

Listed:
  • Sergio Delgado-Quintero
  • Juan-José Salazar-González

Abstract

The editing-and-imputation problem concerns the question of finding errors in a record which does not satisfy a set of consistency rules. Once some potential errors have been localizated, it is also necessary to impute new values to the associated fields. The output dataset should consist of valid records and preserve similar statistical properties as the input dataset. Most of this work is usually done manually by statistical agencies, thus consuming a great deal of human resources. This paper presents a mathematical programming model to optimally solve the problem on surveys with categorical values and particular edits. We also describe a heuristic approach to deal with the more complex surveys. The heuristic procedure follows a combination of the widely-accepted hot-deck donor scheme and the multivariate regression analysis. It has been implemented in a graphical user interface running on standard personal computers, and has been tested on real-world surveys. This paper demonstrates the satisfactory performance of our automatic procedure. Copyright Springer-Verlag 2008

Suggested Citation

  • Sergio Delgado-Quintero & Juan-José Salazar-González, 2008. "A new approach for data editing and imputation," Mathematical Methods of Operations Research, Springer;Gesellschaft für Operations Research (GOR);Nederlands Genootschap voor Besliskunde (NGB), vol. 68(3), pages 407-428, December.
  • Handle: RePEc:spr:mathme:v:68:y:2008:i:3:p:407-428
    DOI: 10.1007/s00186-008-0237-6
    as

    Download full text from publisher

    File URL: http://hdl.handle.net/10.1007/s00186-008-0237-6
    Download Restriction: Access to full text is restricted to subscribers.

    File URL: https://libkey.io/10.1007/s00186-008-0237-6?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. R. S. Garfinkel & A. S. Kunnathur & G. E. Liepins, 1986. "Optimal Imputation of Erroneous Data: Categorical Data, General Edits," Operations Research, INFORMS, vol. 34(5), pages 744-751, October.
    2. Olinsky, Alan & Chen, Shaw & Harlow, Lisa, 2003. "The comparative efficacy of imputation methods for missing data in structural equation modeling," European Journal of Operational Research, Elsevier, vol. 151(1), pages 53-79, November.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Kelly, Scott & Shipworth, Michelle & Shipworth, David & Gentry, Michael & Wright, Andrew & Pollitt, Michael & Crawford-Brown, Doug & Lomas, Kevin, 2013. "Predicting the diversity of internal temperatures from the English residential sector using panel methods," Applied Energy, Elsevier, vol. 102(C), pages 601-621.
    2. Nancy, Jane Y. & Khanna, Nehemiah H. & Arputharaj, Kannan, 2017. "Imputing missing values in unevenly spaced clinical time series data to build an effective temporal classification framework," Computational Statistics & Data Analysis, Elsevier, vol. 112(C), pages 63-79.
    3. Kaitlin A. Harding & Karly M. Murphy & Amy Mezulis, 2019. "Ruminating on the Positive: Paths from Trait Positive Emotionality to Event-Specific Gratitude," Journal of Happiness Studies, Springer, vol. 20(1), pages 101-117, January.
    4. Farag, Sendy & Lyons, Glenn, 2012. "To use or not to use? An empirical study of pre-trip public transport information for business and leisure trips and comparison with car travel," Transport Policy, Elsevier, vol. 20(C), pages 82-92.
    5. Joewono, Tri B. & Tarigan, Ari K.M. & Susilo, Yusak O., 2016. "Road-based public transportation in urban areas of Indonesia: What policies do users expect to improve the service quality?," Transport Policy, Elsevier, vol. 49(C), pages 114-124.
    6. Mahmoud Hassan & Walid Oueslati & Damien Rousselière, 2020. "Exploring the link between energy based taxes and economic growth," Environmental Economics and Policy Studies, Springer;Society for Environmental Economics and Policy Studies - SEEPS, vol. 22(1), pages 67-87, January.
    7. Howat, Gary & Assaker, Guy, 2016. "Outcome quality in participant sport and recreation service quality models: Empirical results from public aquatic centres in Australia," Sport Management Review, Elsevier, vol. 19(5), pages 520-535.
    8. Hyemi Lee, 2019. "Understanding Ethical Consumers Through Person/Thing Orientation Approach," Journal of Business Ethics, Springer, vol. 158(3), pages 637-658, September.
    9. Karanja, Erastus & Zaveri, Jigish & Ahmed, Ashraf, 2013. "How do MIS researchers handle missing data in survey-based research: A content analysis approach," International Journal of Information Management, Elsevier, vol. 33(5), pages 734-751.
    10. Huiwen Wang & Shan Lu & Yide Liu, 2022. "Missing data imputation in PLS-SEM," Quality & Quantity: International Journal of Methodology, Springer, vol. 56(6), pages 4777-4795, December.
    11. W. Hallam & C. Olsson & M. O’Connor & M. Hawkins & J. Toumbourou & G. Bowes & R. McGee & A. Sanson, 2014. "Association Between Adolescent Eudaimonic Behaviours and Emotional Competence in Young Adulthood," Journal of Happiness Studies, Springer, vol. 15(5), pages 1165-1177, October.
    12. Margaret Penning & Guiping Liu & Pak Chou, 2014. "Measuring Loneliness Among Middle-Aged and Older Adults: The UCLA and de Jong Gierveld Loneliness Scales," Social Indicators Research: An International and Interdisciplinary Journal for Quality-of-Life Measurement, Springer, vol. 118(3), pages 1147-1166, September.
    13. M Parzen & S Lipsitz & R Metters & G Fitzmaurice, 2010. "Correlation when data are missing," Journal of the Operational Research Society, Palgrave Macmillan;The OR Society, vol. 61(6), pages 1049-1056, June.
    14. Karkoulian, Silva & Assaker, Guy & Hallak, Rob, 2016. "An empirical study of 360-degree feedback, organizational justice, and firm sustainability," Journal of Business Research, Elsevier, vol. 69(5), pages 1862-1867.
    15. Yoon, Byungun & Jeong, Yujin & Lee, Keeeun & Lee, Sungjoo, 2020. "A systematic approach to prioritizing R&D projects based on customer-perceived value using opinion mining," Technovation, Elsevier, vol. 98(C).
    16. Courtney M. Cooper & Jeff B. Langman & Dilshani Sarathchandra & Chantal A. Vella & Chloe B. Wardropper, 2020. "Perceived Risk and Intentions to Practice Health Protective Behaviors in a Mining-Impacted Region," IJERPH, MDPI, vol. 17(21), pages 1-18, October.
    17. Lukoschek, Carmen Sabrina & Gerlach, Gisela & Stock, Ruth Maria & Xin, Katherine, 2018. "Leading to sustainable organizational unit performance: Antecedents and outcomes of executives' dual innovation leadership," Journal of Business Research, Elsevier, vol. 91(C), pages 266-276.
    18. Hallak, Rob & Assaker, Guy & O’Connor, Peter & Lee, Craig, 2018. "Firm performance in the upscale restaurant sector: The effects of resilience, creative self-efficacy, innovation and industry experience," Journal of Retailing and Consumer Services, Elsevier, vol. 40(C), pages 229-240.
    19. George Petrakos & Claudio Conversano & Gregory Farmakis & Francesco Mola & Roberta Siciliano & Photis Stavropoulos, 2004. "New ways of specifying data edits," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 167(2), pages 249-274, May.
    20. Sun, Yanqing & Sun, Mengru, 2021. "How peer influence mediates the effects of video games playing on adolescents’ aggressive behavior," Children and Youth Services Review, Elsevier, vol. 130(C).

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:mathme:v:68:y:2008:i:3:p:407-428. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.