Suppr超能文献

人类基因组计划——概述

The Human Genome Project--an overview.

作者信息

Bentley D R

机构信息

The Sanger Centre, The Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK.

出版信息

Med Res Rev. 2000 May;20(3):189-96. doi: 10.1002/(sici)1098-1128(200005)20:3<189::aid-med2>3.0.co;2-#.

Abstract

The human genome sequence will underpin human biology and medicine in the next century, providing a single, essential reference to all genetic information. The international program to determine the complete DNA sequence (3,000 million bases) is well underway. As of January 2000, 50% of the sequence is available in the public domain. A comprehensive working draft is expected this year, and the entire sequence is projected to be finished in 2003. DNA sequencing is carried out on mapped, overlapping bacterial clones of 150-200 kb. The working draft comprises assembled unfinished sequence and is released immediately in the public domain. The draft sequence of each clone is then completed, by closing any remaining gaps and resolving any ambiguities, before the entire sequence is checked, annotated, and submitted to the public databases. The sequence of each clone is finished to an accuracy of >99.99%. The availability of a reference sequence of the genome provides the basis for studying the nature of sequence variation, particularly single nucleotide polymorphisms (SNPs), in human populations. SNP typing is a powerful tool for genetic analysis, and will enable us to uncover the association of loci at specific sites in the genome with many disease traits. SNPs occur at a frequency of approximately 1 SNP/kb throughout the genome when the sequence of any two individuals is compared. Programs to detect and map SNPs in the human genome are underway with the aim of establishing a SNP map of the genome during the next two years. The human genome sequence will provide a complete description of all the genes. Annotation of the sequence with the gene structures is achieved by a combination of computational analysis (predictive and homology-based) and experimental confirmation by cDNA sequencing. Detecting homologies between newly defined gene products and proteins of known function helps to postulate biochemical functions for them, which can then be tested. Establishing the association of specific genes with disease phenotypes by mutation screening, particularly for monogenic disorders, provides further assistance in defining the functions of some gene products, as well as helping to establish the cause of the disease. As our knowledge of gene sequences and sequence variation in populations increases, we will pinpoint more and more of the genes and proteins that are important in common, complex diseases. A more detailed understanding of the function of the human genome will be achieved as we identify sequences that control gene expression. Given the availability of gene sequences, the expression status of genes in particular tissues can be monitored in parallel. By comparing corresponding genomic sequences in different species (for example: man, mouse, chicken, and zebrafish), regions that have been highly conserved during evolution can be identified, many of which reflect conserved functions such as gene regulation. These approaches promise to greatly accelerate our interpretation of the human genome sequence.

摘要

人类基因组序列将成为下个世纪人类生物学和医学的基础,为所有遗传信息提供唯一的重要参考。测定完整DNA序列(30亿个碱基)的国际项目正在顺利进行。截至2000年1月,50%的序列已公开。预计今年将完成全面的工作草图,整个序列预计在2003年完成。DNA测序是在150 - 200 kb的定位重叠细菌克隆上进行的。工作草图包含组装好的未完成序列,并立即在公共领域发布。然后,在对整个序列进行检查、注释并提交到公共数据库之前,通过填补任何剩余的缺口并解决任何模糊之处,完成每个克隆的序列。每个克隆的序列完成后的准确率大于99.99%。基因组参考序列的可用性为研究人类群体中序列变异的本质,特别是单核苷酸多态性(SNP),提供了基础。SNP分型是一种强大的遗传分析工具,将使我们能够揭示基因组中特定位点的基因座与许多疾病特征之间的关联。当比较任何两个个体的序列时,SNP在整个基因组中的出现频率约为每千碱基1个SNP。检测和绘制人类基因组中SNP的项目正在进行中,目标是在未来两年内建立基因组的SNP图谱。人类基因组序列将提供所有基因的完整描述。通过计算分析(基于预测和同源性)与cDNA测序的实验确认相结合,实现对基因结构的序列注释。检测新定义的基因产物与已知功能蛋白质之间的同源性有助于推测它们的生化功能,然后可以进行测试。通过突变筛查,特别是针对单基因疾病,确定特定基因与疾病表型之间的关联,有助于进一步确定一些基因产物的功能,以及帮助确定疾病的病因。随着我们对群体中基因序列和序列变异的了解不断增加,我们将找出越来越多在常见复杂疾病中起重要作用的基因和蛋白质。当我们识别出控制基因表达的序列时,将对人类基因组的功能有更详细的了解。鉴于基因序列的可用性,可以同时监测特定组织中基因的表达状态。通过比较不同物种(例如:人类、小鼠、鸡和斑马鱼)的相应基因组序列,可以识别出在进化过程中高度保守的区域,其中许多区域反映了保守的功能,如基因调控。这些方法有望极大地加速我们对人类基因组序列的解读。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验