IDEAS home Printed from https://ideas.repec.org/a/gam/jijerp/v19y2022i3p1307-d732482.html
   My bibliography  Save this article

Strategies for Imputation of High-Resolution Environmental Data in Clinical Randomized Controlled Trials

Author

Listed:
  • Yohan Kim

    (Institute for Sustainable Futures, University of Technology Sydney, 235 Jones Street, Ultimo, NSW 2007, Australia)

  • Scott Kelly

    (Institute for Sustainable Futures, University of Technology Sydney, 235 Jones Street, Ultimo, NSW 2007, Australia)

  • Deepu Krishnan

    (Institute for Sustainable Futures, University of Technology Sydney, 235 Jones Street, Ultimo, NSW 2007, Australia)

  • Jay Falletta

    (Institute for Sustainable Futures, University of Technology Sydney, 235 Jones Street, Ultimo, NSW 2007, Australia)

  • Kerryn Wilmot

    (Institute for Sustainable Futures, University of Technology Sydney, 235 Jones Street, Ultimo, NSW 2007, Australia)

Abstract

Time series data collected in clinical trials can have varying degrees of missingness, adding challenges during statistical analyses. An additional layer of complexity is introduced for missing data in randomized controlled trials (RCT), where researchers must remain blinded between intervention and control groups. Such restriction severely limits the applicability of conventional imputation methods that would utilize other participants’ data for improved performance. This paper explores and compares various methods to impute high-resolution temperature logger data in RCT settings. In addition to the conventional non-parametric approaches, we propose a spline regression (SR) approach that captures the dynamics of indoor temperature by time of day that is unique to each participant. We investigate how the inclusion of external temperature and energy use can improve the model performance. Results show that SR imputation results in 16% smaller root mean squared error (RMSE) compared to conventional imputation methods, with the gap widening to 22% when more than half of data is missing. The SR method is particularly useful in cases where missingness occurs simultaneously for multiple participants, such as concurrent battery failures. We demonstrate how proper modelling of periodic dynamics can lead to significantly improved imputation performance, even with limited data.

Suggested Citation

  • Yohan Kim & Scott Kelly & Deepu Krishnan & Jay Falletta & Kerryn Wilmot, 2022. "Strategies for Imputation of High-Resolution Environmental Data in Clinical Randomized Controlled Trials," IJERPH, MDPI, vol. 19(3), pages 1-17, January.
  • Handle: RePEc:gam:jijerp:v:19:y:2022:i:3:p:1307-:d:732482
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/1660-4601/19/3/1307/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/1660-4601/19/3/1307/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Kelly, Scott & Shipworth, Michelle & Shipworth, David & Gentry, Michael & Wright, Andrew & Pollitt, Michael & Crawford-Brown, Doug & Lomas, Kevin, 2013. "Predicting the diversity of internal temperatures from the English residential sector using panel methods," Applied Energy, Elsevier, vol. 102(C), pages 601-621.
    2. Jeremy Mennis & Michael Mason & Donna L. Coffman & Kevin Henry, 2018. "Geographic Imputation of Missing Activity Space Data from Ecological Momentary Assessment (EMA) GPS Positions," IJERPH, MDPI, vol. 15(12), pages 1-15, December.
    3. Jiang, R. & Murthy, D.N.P., 2011. "A study of Weibull shape parameter: Properties and significance," Reliability Engineering and System Safety, Elsevier, vol. 96(12), pages 1619-1626.
    4. Robert J. Hill & Michael Scholz, 2018. "Can Geospatial Data Improve House Price Indexes? A Hedonic Imputation Approach with Splines," Review of Income and Wealth, International Association for Research in Income and Wealth, vol. 64(4), pages 737-756, December.
    5. Maria Lucia Parrella & Giuseppina Albano & Michele La Rocca & Cira Perna, 2019. "Reconstructing missing data sequences in multivariate time series: an application to environmental data," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 28(2), pages 359-383, June.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Maria Lucia Parrella & Giuseppina Albano & Cira Perna & Michele La Rocca, 2021. "Bootstrap joint prediction regions for sequences of missing values in spatio-temporal datasets," Computational Statistics, Springer, vol. 36(4), pages 2917-2938, December.
    2. Julian Granna & Wolfgang Brunauer & Stefan Lang, 2022. "Proposing a global model to manage the bias-variance tradeoff in the context of hedonic house price models," Working Papers 2022-12, Faculty of Economics and Statistics, Universität Innsbruck.
    3. Dewan, Isha & Dijoux, Yann, 2015. "Modelling repairable systems with an early life under competing risks and asymmetric virtual age," Reliability Engineering and System Safety, Elsevier, vol. 144(C), pages 215-224.
    4. McKenna, R. & Hofmann, L. & Merkel, E. & Fichtner, W. & Strachan, N., 2016. "Analysing socioeconomic diversity and scaling effects on residential electricity load profiles in the context of low carbon technology uptake," Energy Policy, Elsevier, vol. 97(C), pages 13-26.
    5. Daniel Melser & Robert J. Hill, 2019. "Residential Real Estate, Risk, Return and Diversification: Some Empirical Evidence," The Journal of Real Estate Finance and Economics, Springer, vol. 59(1), pages 111-146, July.
    6. Robert J. Hill & Alicia N. Rambaldi & Michael Scholz, 2021. "Higher frequency hedonic property price indices: a state-space approach," Empirical Economics, Springer, vol. 61(1), pages 417-441, July.
    7. Eyre, Nick & Baruah, Pranab, 2015. "Uncertainties in future energy demand in UK residential heating," Energy Policy, Elsevier, vol. 87(C), pages 641-653.
    8. Lü, Xiaoshu & Lu, Tao & Kibert, Charles J. & Viljanen, Martti, 2015. "Modeling and forecasting energy consumption for heterogeneous buildings using a physical–statistical approach," Applied Energy, Elsevier, vol. 144(C), pages 261-275.
    9. Anthony Lepinteur & Sofie R. Waltl, 2020. "Tracking Owners’ Sentiments: Subjective Home Values, Expectations and House Price Dynamics," Department of Economics Working Papers wuwp299, Vienna University of Economics and Business, Department of Economics.
    10. Robert S. Martin, 2025. "Democratic Aggregation: Issues and Implications for Consumer Price Indexes," Review of Income and Wealth, International Association for Research in Income and Wealth, vol. 71(1), February.
    11. Jiang, R., 2013. "A tradeoff BX life and its applications," Reliability Engineering and System Safety, Elsevier, vol. 113(C), pages 1-6.
    12. Ahlfeldt, Gabriel M. & Heblich, Stephan & Seidel, Tobias, 2023. "Micro-geographic property price and rent indices," Regional Science and Urban Economics, Elsevier, vol. 98(C).
    13. Acitas, Sukru & Aladag, Cagdas Hakan & Senoglu, Birdal, 2019. "A new approach for estimating the parameters of Weibull distribution via particle swarm optimization: An application to the strengths of glass fibre data," Reliability Engineering and System Safety, Elsevier, vol. 183(C), pages 116-127.
    14. R. H. Ilyasov & V. A. Plotnikov, 2022. "Oil Production and Carbon Emissions: Spline Analysis of Relationships," Administrative Consulting, Russian Presidential Academy of National Economy and Public Administration. North-West Institute of Management., issue 5.
    15. E. Bothma & J. S. Allison & I. J. H. Visagie, 2022. "New classes of tests for the Weibull distribution using Stein’s method in the presence of random right censoring," Computational Statistics, Springer, vol. 37(4), pages 1751-1770, September.
    16. Lu, Heli & Liu, Guifang, 2014. "Spatial effects of carbon dioxide emissions from residential energy consumption: A county-level study using enhanced nocturnal lighting," Applied Energy, Elsevier, vol. 131(C), pages 297-306.
    17. Manzoor Ellahi & Ghulam Abbas & Irfan Khan & Paul Mario Koola & Mashood Nasir & Ali Raza & Umar Farooq, 2019. "Recent Approaches of Forecasting and Optimal Economic Dispatch to Overcome Intermittency of Wind and Photovoltaic (PV) Systems: A Review," Energies, MDPI, vol. 12(22), pages 1-30, November.
    18. Ardeshir Mahdavi & Christiane Berger & Hadeer Amin & Eleni Ampatzi & Rune Korsholm Andersen & Elie Azar & Verena M. Barthelmes & Matteo Favero & Jakob Hahn & Dolaana Khovalyg & Henrik N. Knudsen & Ale, 2021. "The Role of Occupants in Buildings’ Energy Performance Gap: Myth or Reality?," Sustainability, MDPI, vol. 13(6), pages 1-44, March.
    19. Wang, Shaojian & Fang, Chuanglin & Guan, Xingliang & Pang, Bo & Ma, Haitao, 2014. "Urbanisation, energy consumption, and carbon dioxide emissions in China: A panel data analysis of China’s provinces," Applied Energy, Elsevier, vol. 136(C), pages 738-749.
    20. Kiche J & Oscar Ngesa & George Orwa, 2019. "On Generalized Gamma Distribution and Its Application to Survival Data," International Journal of Statistics and Probability, Canadian Center of Science and Education, vol. 8(5), pages 85-102, September.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jijerp:v:19:y:2022:i:3:p:1307-:d:732482. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.