Lai Jinsheng, Dey Nrisingha, Kim Cheol-Soo, Bharti Arvind K, Rudd Stephen, Mayer Klaus F X, Larkins Brian A, Becraft Philip, Messing Joachim
Waksman Institute, Rutgers, The State University of New Jersey, Piscataway, New Jersey 08854, USA.
Genome Res. 2004 Oct;14(10A):1932-7. doi: 10.1101/gr.2780504.
The cereal endosperm is a major organ of the seed and an important component of the world's food supply. To understand the development and physiology of the endosperm of cereal seeds, we focused on the identification of genes expressed at various times during maize endosperm development. We constructed several cDNA libraries to identify full-length clones and subjected them to a twofold enrichment. A total of 23,348 high-quality sequence-reads from 5'- and 3'-ends of cDNAs were generated and assembled into a unigene set representing 5326 genes with paired sequence-reads. Additional sequencing yielded a total of 3160 (59%) completely sequenced, full-length cDNAs. From 5326 unigenes, 4139 (78%) can be aligned with 5367 predicted rice genes and by taking only the "best hit" be mapped to 3108 positions on the rice genome. The 22% unigenes not present in rice indicate a rapid change of gene content between rice and maize in only 50 million years. Differences in rice and maize gene numbers also suggest that maize has lost a large number of duplicated genes following tetraploidization. The larger number of gene copies in rice suggests that as many as 30% of its genes arose from gene amplification, which would extrapolate to a significant proportion of the estimated 44,027 candidate genes of its entire genome. Functional classification of the maize endosperm unigene set indicated that more than a fourth of the novel functionally assignable genes found in this study are involved in carbohydrate metabolism, consistent with its role as a storage organ.
谷物胚乳是种子的主要器官,也是全球食物供应的重要组成部分。为了解谷物种子胚乳的发育和生理学,我们重点研究了玉米胚乳发育过程中不同时期表达的基因。我们构建了几个cDNA文库以鉴定全长克隆,并对其进行了两倍富集。从cDNA的5'端和3'端共产生了23348条高质量序列读数,并组装成一个代表5326个具有配对序列读数的基因的单基因集。额外的测序共产生了3160个(59%)完全测序的全长cDNA。在5326个单基因中,4139个(78%)可以与5367个预测的水稻基因比对,仅取“最佳匹配”就可以定位到水稻基因组的3108个位置上。水稻中不存在的22%的单基因表明,在仅5000万年的时间里,水稻和玉米之间的基因含量发生了快速变化。水稻和玉米基因数量的差异也表明,玉米在四倍体化后失去了大量重复基因。水稻中较多的基因拷贝数表明,其多达30%的基因来自基因扩增,这可以推断出其整个基因组估计的44027个候选基因中有很大一部分。玉米胚乳单基因集的功能分类表明,本研究中发现的超过四分之一的新的可功能分类基因参与碳水化合物代谢,这与其作为储存器官的作用一致。