Author
Listed:
- Paulo Amaral
(INSPER Institute of Education and Research)
- Silvia Carbonell-Sala
(Centre for Genomic Regulation (CRG))
- Francisco M. Vega
(Stanford University School of Medicine
Tempus Labs)
- Tiago Faial
(Nature Genetics)
- Adam Frankish
(European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus)
- Thomas Gingeras
(Cold Spring Harbor Laboratory)
- Roderic Guigo
(Centre for Genomic Regulation (CRG)
Universitat Pompeu Fabra (UPF))
- Jennifer L. Harrow
(Centre for Genomics Research, Discovery Sciences, AstraZeneca)
- Artemis G. Hatzigeorgiou
(Universithy of Thessaly
Hellenic Pasteur Institute)
- Rory Johnson
(University College Dublin
University College Dublin
Bern University Hospital, University of Bern
University of Bern)
- Terence D. Murphy
(National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health)
- Mihaela Pertea
(Johns Hopkins University
Johns Hopkins University
Johns Hopkins University)
- Kim D. Pruitt
(National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health)
- Shashikant Pujar
(National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health)
- Hazuki Takahashi
(RIKEN Center for Integrative Medical Sciences)
- Igor Ulitsky
(Weizmann Institute of Science
Weizmann Institute of Science)
- Ales Varabyou
(Johns Hopkins University
Johns Hopkins University)
- Christine A. Wells
(The University of Melbourne)
- Mark Yandell
(University of Utah)
- Piero Carninci
(RIKEN Center for Integrative Medical Sciences
Human Technopole)
- Steven L. Salzberg
(Johns Hopkins University
Johns Hopkins University
Johns Hopkins University
Johns Hopkins University)
Abstract
Scientists have been trying to identify every gene in the human genome since the initial draft was published in 2001. In the years since, much progress has been made in identifying protein-coding genes, currently estimated to number fewer than 20,000, with an ever-expanding number of distinct protein-coding isoforms. Here we review the status of the human gene catalogue and the efforts to complete it in recent years. Beside the ongoing annotation of protein-coding genes, their isoforms and pseudogenes, the invention of high-throughput RNA sequencing and other technological breakthroughs have led to a rapid growth in the number of reported non-coding RNA genes. For most of these non-coding RNAs, the functional relevance is currently unclear; we look at recent advances that offer paths forward to identifying their functions and towards eventually completing the human gene catalogue. Finally, we examine the need for a universal annotation standard that includes all medically significant genes and maintains their relationships with different reference genomes for the use of the human gene catalogue in clinical settings.
Suggested Citation
Paulo Amaral & Silvia Carbonell-Sala & Francisco M. Vega & Tiago Faial & Adam Frankish & Thomas Gingeras & Roderic Guigo & Jennifer L. Harrow & Artemis G. Hatzigeorgiou & Rory Johnson & Terence D. Mur, 2023.
"The status of the human gene catalogue,"
Nature, Nature, vol. 622(7981), pages 41-47, October.
Handle:
RePEc:nat:nature:v:622:y:2023:i:7981:d:10.1038_s41586-023-06490-x
DOI: 10.1038/s41586-023-06490-x
Download full text from publisher
As the access to this document is restricted, you may want to search for a different version of it.
Corrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nat:nature:v:622:y:2023:i:7981:d:10.1038_s41586-023-06490-x. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
We have no bibliographic references for this item. You can help adding them by using this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.nature.com .
Please note that corrections may take a couple of weeks to filter through
the various RePEc services.