Verza Natalia C, E Silva Thaís Rezende, Neto Germano Cord, Nogueira Fábio T S, Fisch Paulo H, de Rosa Vincente E, Rebello Marcelo M, Vettore André L, da Silva Felipe Rodrigues, Arruda Paulo
Centro de Biologia Molecular e Engenharia Genética, Universidade Estadual de Campinas (UNICAMP), 13083-970, Campinas, SP, Brazil.
Plant Mol Biol. 2005 Sep;59(2):363-74. doi: 10.1007/s11103-005-8924-7.
The transcriptome-wide endosperm-preferred expression of maize genes was addressed by analyzing a large database of expressed sequence tags (ESTs). We generated 30,531 high quality sequence-reads from the 5'-ends of cDNA libraries from maize endosperm harvested at 10, 15, and 20 days after pollination. A further 196,900 maize sequence-reads retrieved from public databases were added to this endosperm collection to generate MAIZEST, a database with tools for data storage and analysis. MAIZEST contains 227,431 ESTs, one third of which represents developing endosperm and the remaining two-thirds represent transcripts from 49 cDNA libraries constructed from different organs and tissues. Assembling the MAIZEST ESTs generated 29,206 putative transcripts, of which a set of 4032 assembled sequences was composed exclusively of sequences derived from endosperm cDNA libraries. After sequence analysis using overlapping parameters, a sub-set of 2403 assembled sequences was functionally annotated and revealed a wide variety of putative new genes involved in endosperm development and metabolism.
通过分析一个庞大的表达序列标签(EST)数据库,研究了玉米基因在全转录组范围内胚乳偏好性表达的情况。我们从授粉后10天、15天和20天收获的玉米胚乳cDNA文库的5'端生成了30,531个高质量的序列读数。从公共数据库中检索到的另外196,900个玉米序列读数被添加到这个胚乳数据集中,以生成MAIZEST,这是一个具有数据存储和分析工具的数据库。MAIZEST包含227,431个EST,其中三分之一代表发育中的胚乳,其余三分之二代表从不同器官和组织构建的49个cDNA文库中的转录本。对MAIZEST EST进行组装产生了29,206个推定转录本,其中一组4032个组装序列仅由胚乳cDNA文库衍生的序列组成。在使用重叠参数进行序列分析后,对2403个组装序列的一个子集进行了功能注释,揭示了参与胚乳发育和代谢的各种推定新基因。