IDEAS home Printed from https://ideas.repec.org/a/gam/jftint/v13y2021i11p274-d666444.html
   My bibliography  Save this article

Analytics on Anonymity for Privacy Retention in Smart Health Data

Author

Listed:
  • Sevgi Arca

    (Department of Computer Science, Texas Tech University, Lubbock, TX 79409, USA)

  • Rattikorn Hewett

    (Department of Computer Science, Texas Tech University, Lubbock, TX 79409, USA)

Abstract

Advancements in smart technology, wearable and mobile devices, and Internet of Things, have made smart health an integral part of modern living to better individual healthcare and well-being. By enhancing self-monitoring, data collection and sharing among users and service providers, smart health can increase healthy lifestyles, timely treatments, and save lives. However, as health data become larger and more accessible to multiple parties, they become vulnerable to privacy attacks. One way to safeguard privacy is to increase users’ anonymity as anonymity increases indistinguishability making it harder for re-identification. Still the challenge is not only to preserve data privacy but also to ensure that the shared data are sufficiently informative to be useful. Our research studies health data analytics focusing on anonymity for privacy protection. This paper presents a multi-faceted analytical approach to (1) identifying attributes susceptible to information leakages by using entropy-based measure to analyze information loss, (2) anonymizing the data by generalization using attribute hierarchies, and (3) balancing between anonymity and informativeness by our anonymization technique that produces anonymized data satisfying a given anonymity requirement while optimizing data retention. Our anonymization technique is an automated Artificial Intelligent search based on two simple heuristics. The paper describes and illustrates the detailed approach and analytics including pre and post anonymization analytics. Experiments on published data are performed on the anonymization technique. Results, compared with other similar techniques, show that our anonymization technique gives the most effective data sharing solution, with respect to computational cost and balancing between anonymity and data retention.

Suggested Citation

  • Sevgi Arca & Rattikorn Hewett, 2021. "Analytics on Anonymity for Privacy Retention in Smart Health Data," Future Internet, MDPI, vol. 13(11), pages 1-20, October.
  • Handle: RePEc:gam:jftint:v:13:y:2021:i:11:p:274-:d:666444
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/1999-5903/13/11/274/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/1999-5903/13/11/274/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Luc Rocher & Julien M. Hendrickx & Yves-Alexandre de Montjoye, 2019. "Estimating the success of re-identifications in incomplete datasets using generative models," Nature Communications, Nature, vol. 10(1), pages 1-9, December.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. John R. J. Thompson & Longlong Feng & R. Mark Reesor & Chuck Grace, 2021. "Know Your Clients’ Behaviours: A Cluster Analysis of Financial Transactions," JRFM, MDPI, vol. 14(2), pages 1-29, January.
    2. Ron S. Jarmin & John M. Abowd & Robert Ashmead & Ryan Cumings-Menon & Nathan Goldschlag & Michael B. Hawes & Sallie Ann Keller & Daniel Kifer & Philip Leclerc & Jerome P. Reiter & Rolando A. Rodrígue, 2023. "An in-depth examination of requirements for disclosure risk assessment," Proceedings of the National Academy of Sciences, Proceedings of the National Academy of Sciences, vol. 120(43), pages 2220558120-, October.
    3. Ratul Das Chaudhury & Chongwoo Choe, 2023. "Digital Privacy: GDPR and Its Lessons for Australia," Australian Economic Review, The University of Melbourne, Melbourne Institute of Applied Economic and Social Research, vol. 56(2), pages 204-220, June.
    4. Rehse, Dominik & Tremöhlen, Felix, 2020. "Fostering participation in digital public health interventions: The case of digital contact tracing," ZEW Discussion Papers 20-076, ZEW - Leibniz Centre for European Economic Research.
    5. Tesary Lin & Sanjog Misra, 2022. "Frontiers: The Identity Fragmentation Bias," Marketing Science, INFORMS, vol. 41(3), pages 433-440, May.
    6. Atabey, Ayça & Pothong, Kruakae & Livingstone, Sonia, 2023. "Glossary of terms relating to children’s digital lives," LSE Research Online Documents on Economics 119728, London School of Economics and Political Science, LSE Library.
    7. German Data Forum RatSWD (ed.), 2020. "Data collection using new information technology," RatSWD Output Series, German Data Forum (RatSWD), volume 6, number 6-6en.
    8. Jeongwook Lee & Joon Jin Song & Yongku Kim & Jung In Seo, 2020. "Estimation and Prediction of Record Values Using Pivotal Quantities and Copulas," Mathematics, MDPI, vol. 8(10), pages 1-16, October.
    9. Miren Gutierrez & John Bryant, 2022. "The Fading Gloss of Data Science: Towards an Agenda that Faces the Challenges of Big Data for Development and Humanitarian Action," Development, Palgrave Macmillan;Society for International Deveopment, vol. 65(1), pages 80-93, March.
    10. Se-Ra Oh & Young-Duk Seo & Euijong Lee & Young-Gab Kim, 2021. "A Comprehensive Survey on Security and Privacy for Electronic Health Data," IJERPH, MDPI, vol. 18(18), pages 1-48, September.
    11. Carlo Giacomo Leo & Maria Rosaria Tumolo & Saverio Sabina & Riccardo Colella & Virginia Recchia & Giuseppe Ponzini & Dimitrios Ioannis Fotiadis & Antonella Bodini & Pierpaolo Mincarone, 2022. "Health Technology Assessment for In Silico Medicine: Social, Ethical and Legal Aspects," IJERPH, MDPI, vol. 19(3), pages 1-13, January.
    12. James Steele & Matthew Wade & Robert J. Copeland & Stuart Stokes & Rachel Stokes & Steven Mann, 2021. "The National ReferAll Database: An Open Dataset of Exercise Referral Schemes Across the UK," IJERPH, MDPI, vol. 18(9), pages 1-17, April.
    13. Heng Xu & Nan Zhang, 2022. "Implications of Data Anonymization on the Statistical Evidence of Disparity," Management Science, INFORMS, vol. 68(4), pages 2600-2618, April.
    14. Anastasia Roukouni & Gonçalo Homem de Almeida Correia, 2020. "Evaluation Methods for the Impacts of Shared Mobility: Classification and Critical Review," Sustainability, MDPI, vol. 12(24), pages 1-22, December.
    15. Till Koebe & Alejandra Arias-Salazar & Timo Schmid, 2023. "Releasing survey microdata with exact cluster locations and additional privacy safeguards," Palgrave Communications, Palgrave Macmillan, vol. 10(1), pages 1-13, December.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jftint:v:13:y:2021:i:11:p:274-:d:666444. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.