Ewing B, Green P
Department of Molecular Biotechnology, University of Washington, Seattle, Washington, USA.
Nat Genet. 2000 Jun;25(2):232-4. doi: 10.1038/76115.
The number of protein-coding genes in an organism provides a useful first measure of its molecular complexity. Single-celled prokaryotes and eukaryotes typically have a few thousand genes; for example, Escherichia coli has 4,300 and Saccharomyces cerevisiae has 6,000. Evolution of multicellularity appears to have been accompanied by a several-fold increase in gene number, the invertebrates Caenorhabditis elegans and Drosophila melanogaster having 19,000 and 13,600 genes, respectively. Here we estimate the number of human genes by comparing a set of human expressed sequence tag (EST) contigs with human chromosome 22 and with a non-redundant set of mRNA sequences. The two comparisons give mutually consistent estimates of approximately 35,000 genes, substantially lower than most previous estimates. Evolution of the increased physiological complexity of vertebrates may therefore have depended more on the combinatorial diversification of regulatory networks or alternative splicing than on a substantial increase in gene number.
生物体中蛋白质编码基因的数量为衡量其分子复杂性提供了一个有用的初步指标。单细胞原核生物和真核生物通常有几千个基因;例如,大肠杆菌有4300个基因,酿酒酵母有6000个基因。多细胞生物的进化似乎伴随着基因数量几倍的增加,无脊椎动物秀丽隐杆线虫和黑腹果蝇分别有19000个和13600个基因。在这里,我们通过将一组人类表达序列标签(EST)重叠群与人类22号染色体以及一组非冗余mRNA序列进行比较,来估计人类基因的数量。这两种比较给出了相互一致的估计,约为35000个基因,大大低于大多数先前的估计。因此,脊椎动物生理复杂性增加的进化可能更多地依赖于调控网络的组合多样化或可变剪接,而不是基因数量的大幅增加。