European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton CB10 1SD, United Kingdom; email:
Annu Rev Genomics Hum Genet. 2020 Aug 31;21:55-79. doi: 10.1146/annurev-genom-121119-083418. Epub 2020 May 18.
Our understanding of the human genome has continuously expanded since its draft publication in 2001. Over the years, novel assays have allowed us to progressively overlay layers of knowledge above the raw sequence of A's, T's, G's, and C's. The reference human genome sequence is now a complex knowledge base maintained under the shared stewardship of multiple specialist communities. Its complexity stems from the fact that it is simultaneously a template for transcription, a record of evolution, a vehicle for genetics, and a functional molecule. In short, the human genome serves as a frame of reference at the intersection of a diversity of scientific fields. In recent years, the progressive fall in sequencing costs has given increasing importance to the quality of the human reference genome, as hundreds of thousands of individuals are being sequenced yearly, often for clinical applications. Also, novel sequencing-based assays shed light on novel functions of the genome, especially with respect to gene expression regulation. Keeping the human genome annotation up to date and accurate is therefore an ongoing partnership between reference annotation projects and the greater community worldwide.
自 2001 年人类基因组草图公布以来,我们对人类基因组的认识不断扩展。多年来,新的检测方法使我们能够在原始的 A、T、G 和 C 序列之上逐步叠加知识层。参考人类基因组序列现在是一个复杂的知识库,由多个专业社区共同管理。它的复杂性源于这样一个事实,即它既是转录的模板,也是进化的记录,是遗传学的载体,也是一个功能分子。简而言之,人类基因组是多个科学领域交叉的参考框架。近年来,测序成本的逐步降低使得高质量的人类参考基因组变得越来越重要,因为每年有数十万人进行测序,其中许多是出于临床应用的目的。此外,基于新型测序的检测方法揭示了基因组的新功能,尤其是在基因表达调控方面。因此,不断更新和准确注释人类基因组是参考注释项目与全球更广泛社区之间的持续合作。