Thomas James W, Prasad Arjun B, Summers Tyrone J, Lee-Lin Shih-Queen, Maduro Valerie V B, Idol Jacquelyn R, Ryan Joseph F, Thomas Pamela J, McDowell Jennifer C, Green Eric D
Genome Technology Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, Maryland 20892, USA.
Genome Res. 2002 Aug;12(8):1277-85. doi: 10.1101/gr.283202.
Comparison is a fundamental tool for analyzing DNA sequence. Interspecies sequence comparison is particularly powerful for inferring genome function and is based on the simple premise that conserved sequences are likely to be important. Thus, the comparison of a genomic sequence with its orthologous counterpart from another species is increasingly becoming an integral component of genome analysis. In ideal situations, such comparisons are performed with orthologous sequences from multiple species. To facilitate multispecies comparative sequence analysis, a robust and scalable strategy for simultaneously constructing sequence-ready bacterial artificial chromosome (BAC) contig maps from targeted genomic regions has been developed. Central to this approach is the generation and utilization of "universal" oligonucleotide-based hybridization probes ("overgo" probes), which are designed from sequences that are highly conserved between distantly related species. Large collections of these probes are used en masse to screen BAC libraries from multiple species in parallel, with the isolated clones assembled into physical contig maps. To validate the effectiveness of this strategy, efforts were focused on the construction of BAC-based physical maps from multiple mammalian species (chimpanzee, baboon, cat, dog, cow, and pig). Using available human and mouse genomic sequence and a newly developed computer program to design the requisite probes, sequence-ready maps were constructed in all species for a series of targeted regions totaling approximately 16 Mb in the human genome. The described approach can be used to facilitate the multispecies comparative sequencing of targeted genomic regions and can be adapted for constructing BAC contig maps in other vertebrates.
比较是分析DNA序列的基本工具。种间序列比较对于推断基因组功能特别有效,其基于一个简单的前提,即保守序列可能很重要。因此,将基因组序列与其来自另一个物种的直系同源对应序列进行比较正日益成为基因组分析的一个组成部分。在理想情况下,这种比较是与来自多个物种的直系同源序列进行的。为了促进多物种比较序列分析,已经开发出一种强大且可扩展的策略,用于从目标基因组区域同时构建可用于测序的细菌人工染色体(BAC)重叠群图谱。这种方法的核心是生成和利用基于“通用”寡核苷酸的杂交探针(“重叠延伸”探针),这些探针是根据远缘物种之间高度保守的序列设计的。大量收集这些探针,并行用于筛选来自多个物种的BAC文库,将分离出的克隆组装成物理重叠群图谱。为了验证该策略的有效性,工作重点是构建来自多个哺乳动物物种(黑猩猩、狒狒、猫、狗、牛和猪)的基于BAC的物理图谱。利用现有的人类和小鼠基因组序列以及一个新开发的计算机程序来设计所需的探针,在所有物种中针对人类基因组中总共约16 Mb的一系列目标区域构建了可用于测序的图谱。所描述的方法可用于促进目标基因组区域的多物种比较测序,并可适用于在其他脊椎动物中构建BAC重叠群图谱。