IDEAS home Printed from https://ideas.repec.org/a/nat/natcom/v16y2025i1d10.1038_s41467-024-55198-7.html
   My bibliography  Save this article

Machine learning derived retinal pigment score from ophthalmic imaging shows ethnicity is not biology

Author

Listed:
  • Anand E. Rajesh

    (University of Washington
    The Roger and Angie Karalis Johnson Retina Center)

  • Abraham Olvera-Barrios

    (Moorfields Eye Hospital NHS Foundation Trust & University College London Institute of Ophthalmology)

  • Alasdair N. Warwick

    (Moorfields Eye Hospital NHS Foundation Trust & University College London Institute of Ophthalmology
    University College London Institute of Cardiovascular Science)

  • Yue Wu

    (University of Washington
    The Roger and Angie Karalis Johnson Retina Center)

  • Kelsey V. Stuart

    (Moorfields Eye Hospital NHS Foundation Trust & University College London Institute of Ophthalmology)

  • Mahantesh I. Biradar

    (Moorfields Eye Hospital NHS Foundation Trust & University College London Institute of Ophthalmology)

  • Chuin Ying Ung

    (Guy’s and St Thomas’ NHS Foundation Trust)

  • Anthony P. Khawaja

    (Moorfields Eye Hospital NHS Foundation Trust & University College London Institute of Ophthalmology
    University of Cambridge)

  • Robert Luben

    (Moorfields Eye Hospital NHS Foundation Trust & University College London Institute of Ophthalmology
    University of Cambridge)

  • Paul J. Foster

    (Moorfields Eye Hospital NHS Foundation Trust & University College London Institute of Ophthalmology)

  • Charles R. Cleland

    (London School of Hygiene & Tropical Medicine
    Kilimanjaro Christian Medical Centre)

  • William U. Makupa

    (Kilimanjaro Christian Medical Centre)

  • Alastair K. Denniston

    (NIHR Birmingham Biomedical Research Centre)

  • Matthew J. Burton

    (Moorfields Eye Hospital NHS Foundation Trust & University College London Institute of Ophthalmology
    London School of Hygiene & Tropical Medicine)

  • Andrew Bastawrous

    (Kilimanjaro Christian Medical Centre
    PEEK Vision)

  • Pearse A. Keane

    (Moorfields Eye Hospital NHS Foundation Trust & University College London Institute of Ophthalmology)

  • Mark A. Chia

    (Moorfields Eye Hospital NHS Foundation Trust & University College London Institute of Ophthalmology)

  • Angus W. Turner

    (University of Western Australia)

  • Cecilia S. Lee

    (University of Washington
    The Roger and Angie Karalis Johnson Retina Center)

  • Adnan Tufail

    (Moorfields Eye Hospital NHS Foundation Trust & University College London Institute of Ophthalmology)

  • Aaron Y. Lee

    (University of Washington
    The Roger and Angie Karalis Johnson Retina Center)

  • Catherine Egan

    (Moorfields Eye Hospital NHS Foundation Trust & University College London Institute of Ophthalmology)

Abstract

Few metrics exist to describe phenotypic diversity within ophthalmic imaging datasets, with researchers often using ethnicity as a surrogate marker for biological variability. We derived a continuous, measured metric, the retinal pigment score (RPS), that quantifies the degree of pigmentation from a colour fundus photograph of the eye. RPS was validated using two large epidemiological studies with demographic and genetic data (UK Biobank and EPIC-Norfolk Study) and reproduced in a Tanzanian, an Australian, and a Chinese dataset. A genome-wide association study (GWAS) of RPS from UK Biobank identified 20 loci with known associations with skin, iris and hair pigmentation, of which eight were replicated in the EPIC-Norfolk cohort. There was a strong association between RPS and ethnicity, however, there was substantial overlap between each ethnicity and the respective distributions of RPS scores. RPS decouples traditional demographic variables from clinical imaging characteristics. RPS may serve as a useful metric to quantify the diversity of the training, validation, and testing datasets used in the development of AI algorithms to ensure adequate inclusion and explainability of the model performance, critical in evaluating all currently deployed AI models. The code to derive RPS is publicly available at: https://github.com/uw-biomedical-ml/retinal-pigmentation-score .

Suggested Citation

  • Anand E. Rajesh & Abraham Olvera-Barrios & Alasdair N. Warwick & Yue Wu & Kelsey V. Stuart & Mahantesh I. Biradar & Chuin Ying Ung & Anthony P. Khawaja & Robert Luben & Paul J. Foster & Charles R. Cle, 2025. "Machine learning derived retinal pigment score from ophthalmic imaging shows ethnicity is not biology," Nature Communications, Nature, vol. 16(1), pages 1-14, December.
  • Handle: RePEc:nat:natcom:v:16:y:2025:i:1:d:10.1038_s41467-024-55198-7
    DOI: 10.1038/s41467-024-55198-7
    as

    Download full text from publisher

    File URL: https://www.nature.com/articles/s41467-024-55198-7
    File Function: Abstract
    Download Restriction: no

    File URL: https://libkey.io/10.1038/s41467-024-55198-7?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Hannah Currant & Pirro Hysi & Tomas W Fitzgerald & Puya Gharahkhani & Pieter W M Bonnemaijer & Anne Senabouth & Alex W Hewitt & UK Biobank Eye and Vision Consortium & International Glaucoma Genetics C, 2021. "Genetic variation affects morphological retinal phenotypes extracted from UK Biobank optical coherence tomography images," PLOS Genetics, Public Library of Science, vol. 17(5), pages 1-27, May.
    2. Clare Bycroft & Colin Freeman & Desislava Petkova & Gavin Band & Lloyd T. Elliott & Kevin Sharp & Allan Motyer & Damjan Vukcevic & Olivier Delaneau & Jared O’Connell & Adrian Cortes & Samantha Welsh &, 2018. "The UK Biobank resource with deep phenotyping and genomic data," Nature, Nature, vol. 562(7726), pages 203-209, October.
    3. Adriana Buskin & Lili Zhu & Valeria Chichagova & Basudha Basu & Sina Mozaffari-Jovin & David Dolan & Alastair Droop & Joseph Collin & Revital Bronstein & Sudeep Mehrotra & Michael Farkas & Gerrit Hilg, 2018. "Disrupted alternative splicing for genes implicated in splicing and ciliogenesis causes PRPF31 retinitis pigmentosa," Nature Communications, Nature, vol. 9(1), pages 1-19, December.
    4. Guo Yin & Ya Xing Wang & Zhi Yun Zheng & Hua Yang & Liang Xu & Jost B Jonas & the Beijing Eye Study Group, 2012. "Ocular Axial Length and Its Associations in Chinese: The Beijing Eye Study," PLOS ONE, Public Library of Science, vol. 7(8), pages 1-8, August.
    5. Mitja I. Kurki & Juha Karjalainen & Priit Palta & Timo P. Sipilä & Kati Kristiansson & Kati M. Donner & Mary P. Reeve & Hannele Laivuori & Mervi Aavikko & Mari A. Kaunisto & Anu Loukola & Elisa Lahtel, 2023. "Author Correction: FinnGen provides genetic insights from a well-phenotyped isolated population," Nature, Nature, vol. 615(7952), pages 19-19, March.
    6. Kyoko Watanabe & Erdogan Taskesen & Arjen Bochoven & Danielle Posthuma, 2017. "Functional mapping and annotation of genetic associations with FUMA," Nature Communications, Nature, vol. 8(1), pages 1-11, December.
    7. Mitja I. Kurki & Juha Karjalainen & Priit Palta & Timo P. Sipilä & Kati Kristiansson & Kati M. Donner & Mary P. Reeve & Hannele Laivuori & Mervi Aavikko & Mari A. Kaunisto & Anu Loukola & Elisa Lahtel, 2023. "FinnGen provides genetic insights from a well-phenotyped isolated population," Nature, Nature, vol. 613(7944), pages 508-518, January.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Xiao-Yu He & Bang-Sheng Wu & Liu Yang & Yu Guo & Yue-Ting Deng & Ze-Yu Li & Chen-Jie Fei & Wei-Shi Liu & Yi-Jun Ge & Jujiao Kang & Jianfeng Feng & Wei Cheng & Qiang Dong & Jin-Tai Yu, 2024. "Genetic associations of protein-coding variants in venous thromboembolism," Nature Communications, Nature, vol. 15(1), pages 1-12, December.
    2. Caitlin E. Carey & Rebecca Shafee & Robbee Wedow & Amanda Elliott & Duncan S. Palmer & John Compitello & Masahiro Kanai & Liam Abbott & Patrick Schultz & Konrad J. Karczewski & Samuel C. Bryant & Caro, 2024. "Principled distillation of UK Biobank phenotype data reveals underlying structure in human variation," Nature Human Behaviour, Nature, vol. 8(8), pages 1599-1615, August.
    3. Bingxin Zhao & Yujue Li & Zirui Fan & Zhenyi Wu & Juan Shu & Xiaochen Yang & Yilin Yang & Xifeng Wang & Bingxuan Li & Xiyao Wang & Carlos Copana & Yue Yang & Jinjie Lin & Yun Li & Jason L. Stein & Joa, 2024. "Eye-brain connections revealed by multimodal retinal and brain imaging genetics," Nature Communications, Nature, vol. 15(1), pages 1-19, December.
    4. Jujiao Kang & Yue-Ting Deng & Bang-Sheng Wu & Wei-Shi Liu & Ze-Yu Li & Shitong Xiang & Liu Yang & Jia You & Xiaohong Gong & Tianye Jia & Jin-Tai Yu & Wei Cheng & Jianfeng Feng, 2024. "Whole exome sequencing analysis identifies genes for alcohol consumption," Nature Communications, Nature, vol. 15(1), pages 1-15, December.
    5. Xingjie Hao & Zhonghe Shao & Ning Zhang & Minghui Jiang & Xi Cao & Si Li & Yunlong Guan & Chaolong Wang, 2023. "Integrative genome-wide analyses identify novel loci associated with kidney stones and provide insights into its genetic architecture," Nature Communications, Nature, vol. 14(1), pages 1-12, December.
    6. Ruoyu Tian & Tian Ge & Hyeokmoon Kweon & Daniel B. Rocha & Max Lam & Jimmy Z. Liu & Kritika Singh & Daniel F. Levey & Joel Gelernter & Murray B. Stein & Ellen A. Tsai & Hailiang Huang & Christopher F., 2024. "Whole-exome sequencing in UK Biobank reveals rare genetic architecture for depression," Nature Communications, Nature, vol. 15(1), pages 1-12, December.
    7. Jonathan Mitchell & Niedzica Camacho & Patrick Shea & Konrad H. Stopsack & Vijai Joseph & Oliver S. Burren & Ryan S. Dhindsa & Abhishek Nag & Jacob E. Berchuck & Amanda O’Neill & Ali Abbasi & Anthony , 2025. "Assessing the contribution of rare protein-coding germline variants to prostate cancer risk and severity in 37,184 cases," Nature Communications, Nature, vol. 16(1), pages 1-11, December.
    8. Linda Ottensmann & Rubina Tabassum & Sanni E. Ruotsalainen & Mathias J. Gerl & Christian Klose & Elisabeth Widén & Kai Simons & Samuli Ripatti & Matti Pirinen, 2023. "Genome-wide association analysis of plasma lipidome identifies 495 genetic associations," Nature Communications, Nature, vol. 14(1), pages 1-15, December.
    9. Andras Gezsi & Sandra Auwera & Hannu Mäkinen & Nora Eszlari & Gabor Hullam & Tamas Nagy & Sarah Bonk & Rubèn González-Colom & Xenia Gonda & Linda Garvert & Teemu Paajanen & Zsofia Gal & Kevin Kirchner, 2024. "Unique genetic and risk-factor profiles in clusters of major depressive disorder-related multimorbidity trajectories," Nature Communications, Nature, vol. 15(1), pages 1-18, December.
    10. Alexander T. Williams & Jing Chen & Kayesha Coley & Chiara Batini & Abril Izquierdo & Richard Packer & Erik Abner & Stavroula Kanoni & David J. Shepherd & Robert C. Free & Edward J. Hollox & Nigel J. , 2023. "Genome-wide association study of thyroid-stimulating hormone highlights new genes, pathways and associations with thyroid disease," Nature Communications, Nature, vol. 14(1), pages 1-14, December.
    11. Dmitrii Usoltsev & Nikita Kolosov & Oxana Rotar & Alexander Loboda & Maria Boyarinova & Ekaterina Moguchaya & Ekaterina Kolesova & Anastasia Erina & Kristina Tolkunova & Valeriia Rezapova & Ivan Molot, 2024. "Complex trait susceptibilities and population diversity in a sample of 4,145 Russians," Nature Communications, Nature, vol. 15(1), pages 1-10, December.
    12. V. E. Jackson & Y. Wu & R. Bonelli & J. P. Owen & L. W. Scott & S. Farashi & Y. Kihara & M. L. Gantner & C. Egan & K. M. Williams & B. R. E. Ansell & A. Tufail & A. Y. Lee & M. Bahlo, 2025. "Multi-omic spatial effects on high-resolution AI-derived retinal thickness," Nature Communications, Nature, vol. 16(1), pages 1-19, December.
    13. William R. Reay & Dylan J. Kiltschewskij & Maria A. Biase & Zachary F. Gerring & Kousik Kundu & Praveen Surendran & Laura A. Greco & Erin D. Clarke & Clare E. Collins & Alison M. Mondul & Demetrius Al, 2024. "Genetic influences on circulating retinol and its relationship to human health," Nature Communications, Nature, vol. 15(1), pages 1-20, December.
    14. Chamlee Cho & Beomsu Kim & Dan Say Kim & Mi Yeong Hwang & Injeong Shim & Minku Song & Yeong Chan Lee & Sang-Hyuk Jung & Sung Kweon Cho & Woong-Yang Park & Woojae Myung & Bong-Jo Kim & Ron Do & Hyon K., 2024. "Large-scale cross-ancestry genome-wide meta-analysis of serum urate," Nature Communications, Nature, vol. 15(1), pages 1-17, December.
    15. Sang‑Hyuk Jung & Haemin Kim & Young Mi Jung & Manu Shivakumar & Brenda Xiao & Jaeyoung Kim & Beomjin Jang & Jae-Seung Yun & Hong-Hee Won & Chan-Wook Park & Joong Shin Park & Jong Kwan Jun & Dokyoon Ki, 2025. "Healthy lifestyle reduces cardiovascular risk in women with genetic predisposition to hypertensive disorders of pregnancy," Nature Communications, Nature, vol. 16(1), pages 1-11, December.
    16. Hui Chen & Zeyang Wang & Lihai Gong & Qixuan Wang & Wenyan Chen & Jia Wang & Xuelian Ma & Ruofan Ding & Xing Li & Xudong Zou & Mireya Plass & Cheng Lian & Ting Ni & Gong-Hong Wei & Wei Li & Lin Deng &, 2024. "A distinct class of pan-cancer susceptibility genes revealed by an alternative polyadenylation transcriptome-wide association study," Nature Communications, Nature, vol. 15(1), pages 1-16, December.
    17. Liang-Dar Hwang & Gabriel Cuellar-Partida & Loic Yengo & Jian Zeng & Jarkko Toivonen & Mikko Arvas & Robin N. Beaumont & Rachel M. Freathy & Gunn-Helen Moen & Nicole M. Warrington & David M. Evans, 2024. "DINGO: increasing the power of locus discovery in maternal and fetal genome-wide association studies of perinatal traits," Nature Communications, Nature, vol. 15(1), pages 1-14, December.
    18. Natalie DeForest & Yuqi Wang & Zhiyi Zhu & Jacqueline S. Dron & Ryan Koesterer & Pradeep Natarajan & Jason Flannick & Tiffany Amariuta & Gina M. Peloso & Amit R. Majithia, 2024. "Genome-wide discovery and integrative genomic characterization of insulin resistance loci using serum triglycerides to HDL-cholesterol ratio as a proxy," Nature Communications, Nature, vol. 15(1), pages 1-17, December.
    19. Shahram Bahrami & Kaja Nordengen & Jaroslav Rokicki & Alexey A. Shadrin & Zillur Rahman & Olav B. Smeland & Piotr P. Jaholkowski & Nadine Parker & Pravesh Parekh & Kevin S. O’Connell & Torbjørn Elvsås, 2024. "The genetic landscape of basal ganglia and implications for common brain disorders," Nature Communications, Nature, vol. 15(1), pages 1-14, December.
    20. Shiyu Zhang & Zheng Wang & Yijing Wang & Yixiao Zhu & Qiao Zhou & Xingxing Jian & Guihu Zhao & Jian Qiu & Kun Xia & Beisha Tang & Julian Mutz & Jinchen Li & Bin Li, 2024. "A metabolomic profile of biological aging in 250,341 individuals from the UK Biobank," Nature Communications, Nature, vol. 15(1), pages 1-19, December.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nat:natcom:v:16:y:2025:i:1:d:10.1038_s41467-024-55198-7. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.nature.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.