Whitelaw C A, Barbazuk W B, Pertea G, Chan A P, Cheung F, Lee Y, Zheng L, van Heeringen S, Karamycheva S, Bennetzen J L, SanMiguel P, Lakey N, Bedell J, Yuan Y, Budiman M A, Resnick A, Van Aken S, Utterback T, Riedmuller S, Williams M, Feldblyum T, Schubert K, Beachy R, Fraser C M, Quackenbush J
The Institute for Genomic Research (TIGR), 9712 Medical Center Drive, Rockville, MD 20850, USA.
Science. 2003 Dec 19;302(5653):2118-20. doi: 10.1126/science.1090047.
Approximately 80% of the maize genome comprises highly repetitive sequences interspersed with single-copy, gene-rich sequences, and standard genome sequencing strategies are not readily adaptable to this type of genome. Methodologies that enrich for genic sequences might more rapidly generate useful results from complex genomes. Equivalent numbers of clones from maize selected by techniques called methylation filtering and High C0t selection were sequenced to generate approximately 200,000 reads (approximately 132 megabases), which were assembled into contigs. Combination of the two techniques resulted in a sixfold reduction in the effective genome size and a fourfold increase in the gene identification rate in comparison to a nonenriched library.
大约80%的玉米基因组由高度重复序列与单拷贝、富含基因的序列穿插组成,标准的基因组测序策略并不容易适用于这种类型的基因组。富集基因序列的方法可能会更快地从复杂基因组中产生有用的结果。通过称为甲基化过滤和高Cot值选择的技术从玉米中选择等量的克隆进行测序,以产生约200,000条读数(约132兆碱基),这些读数被组装成重叠群。与未富集文库相比,两种技术的结合使有效基因组大小减少了六倍,基因识别率提高了四倍。