Chan Agnes P, Pertea Geo, Cheung Foo, Lee Dan, Zheng Li, Whitelaw Cathy, Pontaroli Ana C, SanMiguel Phillip, Yuan Yinan, Bennetzen Jeffrey, Barbazuk William Brad, Quackenbush John, Rabinowicz Pablo D
The Institute for Genomic Research, 9712 Medical Center Drive, Rockville, MD 20850, USA.
Nucleic Acids Res. 2006 Jan 1;34(Database issue):D771-6. doi: 10.1093/nar/gkj072.
Maize is a staple crop of the grass family and also an excellent model for plant genetics. Owing to the large size and repetitiveness of its genome, we previously investigated two approaches to accelerate gene discovery and genome analysis in maize: methylation filtration and high C(0)t selection. These techniques allow the construction of gene-enriched genomic libraries by minimizing repeat sequences due to either their methylation status or their copy number, yielding a 7-fold enrichment in genic sequences relative to a random genomic library. Approximately 900,000 gene-enriched reads from maize were generated and clustered into Assembled Zea mays (AZM) sequences. Here we report the current AZM release, which consists of approximately 298 Mb representing 243,807 sequence assemblies and singletons. In order to provide a repository of publicly available maize genomic sequences, we have created the TIGR Maize Database (http://maize.tigr.org). In this resource, we have assembled and annotated the AZMs and used available sequenced markers to anchor AZMs to maize chromosomes. We have constructed a maize repeat database and generated draft sequence assemblies of 287 maize bacterial artificial chromosome (BAC) clone sequences, which we annotated along with 172 additional publicly available BAC clones. All sequences, assemblies and annotations are available at the project website via web interfaces and FTP downloads.
玉米是禾本科的主要作物,也是植物遗传学的优秀模式植物。由于其基因组规模大且具有重复性,我们之前研究了两种加速玉米基因发现和基因组分析的方法:甲基化过滤和高Cot值选择。这些技术通过根据重复序列的甲基化状态或拷贝数将其最小化,从而构建富含基因的基因组文库,相对于随机基因组文库,基因序列的富集倍数达到7倍。我们生成了约90万个来自玉米的富含基因的 reads,并将其聚类为玉米组装序列(AZM)。在此,我们报告当前的AZM版本,它由约298 Mb组成,代表243,807个序列组装和单序列。为了提供一个公开可用的玉米基因组序列库,我们创建了TIGR玉米数据库(http://maize.tigr.org)。在这个资源库中,我们对AZM进行了组装和注释,并使用可用的测序标记将AZM定位到玉米染色体上。我们构建了一个玉米重复数据库,并生成了287个玉米细菌人工染色体(BAC)克隆序列的草图组装,我们对这些序列以及另外172个公开可用的BAC克隆进行了注释。所有序列、组装和注释都可通过项目网站的网页界面和FTP下载获得。