The status of the human gene catalogue.

Affiliations

INSPER Institute of Education and Research, Sao Paulo, Brazil.

Centre for Genomic Regulation (CRG), Barcelona, Spain.

Publication information

Nature. 2023 Oct;622(7981):41-47. doi: 10.1038/s41586-023-06490-x. Epub 2023 Oct 4.

DOI:10.1038/s41586-023-06490-x

PMID:37794265

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC10575709/

Abstract

Scientists have been trying to identify every gene in the human genome since the initial draft was published in 2001. In the years since, much progress has been made in identifying protein-coding genes, currently estimated to number fewer than 20,000, with an ever-expanding number of distinct protein-coding isoforms. Here we review the status of the human gene catalogue and the efforts to complete it in recent years. Beside the ongoing annotation of protein-coding genes, their isoforms and pseudogenes, the invention of high-throughput RNA sequencing and other technological breakthroughs have led to a rapid growth in the number of reported non-coding RNA genes. For most of these non-coding RNAs, the functional relevance is currently unclear; we look at recent advances that offer paths forward to identifying their functions and towards eventually completing the human gene catalogue. Finally, we examine the need for a universal annotation standard that includes all medically significant genes and maintains their relationships with different reference genomes for the use of the human gene catalogue in clinical settings.

Similar articles

The status of the human gene catalogue.

ArXiv. 2023 Mar 24:arXiv:2303.13996v1.

Creating reference gene annotation for the mouse C57BL6/J genome assembly.

Mamm Genome. 2015 Oct;26(9-10):366-78. doi: 10.1007/s00335-015-9583-x. Epub 2015 Jul 18.

Assessing the impact of human genome annotation choice on RNA-seq expression estimates.

BMC Bioinformatics. 2013;14 Suppl 11(Suppl 11):S8. doi: 10.1186/1471-2105-14-S11-S8. Epub 2013 Nov 4.

A draft map of the human proteome.

Nature. 2014 May 29;509(7502):575-81. doi: 10.1038/nature13302.

[Analysis, identification and correction of some errors of model refseqs appeared in NCBI Human Gene Database by in silico cloning and experimental verification of novel human genes].

Yi Chuan Xue Bao. 2004 May;31(5):431-43.

Comparison of RefSeq protein-coding regions in human and vertebrate genomes.

BMC Genomics. 2013 Sep 25;14:654. doi: 10.1186/1471-2164-14-654.

Comprehensive comparative homeobox gene annotation in human and mouse.

Database (Oxford). 2015 Sep 27;2015. doi: 10.1093/database/bav091. Print 2015.

Landscape of transcription in human cells.

Nature. 2012 Sep 6;489(7414):101-8. doi: 10.1038/nature11233.

Cited by

Enhancing transcriptome expression quantification through accurate assignment of long RNA sequencing reads with TranSigner.

Genome Biol. 2025 Aug 28;26(1):257. doi: 10.1186/s13059-025-03723-2.

Regulation of transcriptome plasticity by mTOR signaling pathway.

Exp Mol Med. 2025 Aug 14. doi: 10.1038/s12276-025-01508-y.

How multispecific molecules are transforming pharmacotherapy.

Nat Rev Drug Discov. 2025 Aug 1. doi: 10.1038/s41573-025-01262-w.

CRISPR-based functional genomics tools in vertebrate models.

Exp Mol Med. 2025 Jul;57(7):1355-1372. doi: 10.1038/s12276-025-01514-0. Epub 2025 Jul 31.

Targeting epigenetic regulators as a promising avenue to overcome cancer therapy resistance.

Signal Transduct Target Ther. 2025 Jul 18;10(1):219. doi: 10.1038/s41392-025-02266-z.

Residual DNA impurities in AAV vectors-nature and transcription.

Mol Ther Methods Clin Dev. 2025 Jun 4;33(3):101503. doi: 10.1016/j.omtm.2025.101503. eCollection 2025 Sep 11.

Non-Coding RNAs in Diagnostic Pathology of High-Grade Central Osteosarcoma.

Diagnostics (Basel). 2025 May 28;15(11):1355. doi: 10.3390/diagnostics15111355.

Harnessing miRNA dynamics in HIV-1-infected macrophages: Unveiling new targeted therapeutics using systems biology.

Comput Struct Biotechnol J. 2025 May 1;27:1754-1771. doi: 10.1016/j.csbj.2025.04.040. eCollection 2025.

The Dark Metabolome/Lipidome and In-Source Fragmentation.

Anal Sci Adv. 2025 May 14;6(1):e70012. doi: 10.1002/ansa.70012. eCollection 2025 Jun.

The Role of the p21-Activated Kinase Family in Tumor Immunity.

Int J Mol Sci. 2025 Apr 20;26(8):3885. doi: 10.3390/ijms26083885.

References

A draft human pangenome reference.

Nature. 2023 May;617(7960):312-324. doi: 10.1038/s41586-023-05896-x. Epub 2023 May 10.

Antisense-oligonucleotide-mediated perturbation of long non-coding RNA reveals functional features in stem cells and across cell types.

Cell Rep. 2022 Dec 27;41(13):111893. doi: 10.1016/j.celrep.2022.111893.

GENCODE: reference annotation for the human and mouse genomes in 2023.

Nucleic Acids Res. 2023 Jan 6;51(D1):D942-D949. doi: 10.1093/nar/gkac1071.

Developmental dynamics of RNA translation in the human brain.

Nat Neurosci. 2022 Oct;25(10):1353-1365. doi: 10.1038/s41593-022-01164-9. Epub 2022 Sep 28.

Transcriptome variation in human tissues revealed by long-read sequencing.

Nature. 2022 Aug;608(7922):353-359. doi: 10.1038/s41586-022-05035-y. Epub 2022 Aug 3.

Standardized annotation of translated open reading frames.

Nat Biotechnol. 2022 Jul;40(7):994-999. doi: 10.1038/s41587-022-01369-0.

A joint NCBI and EMBL-EBI transcript set for clinical genomics and research.

Nature. 2022 Apr;604(7905):310-315. doi: 10.1038/s41586-022-04558-8. Epub 2022 Apr 6.

The complete sequence of a human genome.

Science. 2022 Apr;376(6588):44-53. doi: 10.1126/science.abj6987. Epub 2022 Mar 31.

Annotating and prioritizing genomic variants using the Ensembl Variant Effect Predictor-A tutorial.

Hum Mutat. 2022 Aug;43(8):986-997. doi: 10.1002/humu.24298. Epub 2021 Dec 2.

Identification of high-confidence human poly(A) RNA isoform scaffolds using nanopore sequencing.

RNA. 2022 Feb;28(2):162-176. doi: 10.1261/rna.078703.121. Epub 2021 Nov 2.
