Thomas J W, Touchman J W, Blakesley R W, Bouffard G G, Beckstrom-Sternberg S M, Margulies E H, Blanchette M, Siepel A C, Thomas P J, McDowell J C, Maskeri B, Hansen N F, Schwartz M S, Weber R J, Kent W J, Karolchik D, Bruen T C, Bevan R, Cutler D J, Schwartz S, Elnitski L, Idol J R, Prasad A B, Lee-Lin S-Q, Maduro V V B, Summers T J, Portnoy M E, Dietrich N L, Akhter N, Ayele K, Benjamin B, Cariaga K, Brinkley C P, Brooks S Y, Granite S, Guan X, Gupta J, Haghighi P, Ho S-L, Huang M C, Karlins E, Laric P L, Legaspi R, Lim M J, Maduro Q L, Masiello C A, Mastrian S D, McCloskey J C, Pearson R, Stantripop S, Tiongson E E, Tran J T, Tsurgeon C, Vogt J L, Walker M A, Wetherby K D, Wiggins L S, Young A C, Zhang L-H, Osoegawa K, Zhu B, Zhao B, Shu C L, De Jong P J, Lawrence C E, Smit A F, Chakravarti A, Haussler D, Green P, Miller W, Green E D
Genome Technology Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, Maryland 20892,USA.
Nature. 2003 Aug 14;424(6950):788-93. doi: 10.1038/nature01858.
The systematic comparison of genomic sequences from different organisms represents a central focus of contemporary genome analysis. Comparative analyses of vertebrate sequences can identify coding and conserved non-coding regions, including regulatory elements, and provide insight into the forces that have rendered modern-day genomes. As a complement to whole-genome sequencing efforts, we are sequencing and comparing targeted genomic regions in multiple, evolutionarily diverse vertebrates. Here we report the generation and analysis of over 12 megabases (Mb) of sequence from 12 species, all derived from the genomic region orthologous to a segment of about 1.8 Mb on human chromosome 7 containing ten genes, including the gene mutated in cystic fibrosis. These sequences show conservation reflecting both functional constraints and the neutral mutational events that shaped this genomic region. In particular, we identify substantial numbers of conserved non-coding segments beyond those previously identified experimentally, most of which are not detectable by pair-wise sequence comparisons alone. Analysis of transposable element insertions highlights the variation in genome dynamics among these species and confirms the placement of rodents as a sister group to the primates.
对来自不同生物体的基因组序列进行系统比较是当代基因组分析的核心重点。脊椎动物序列的比较分析可以识别编码区和保守的非编码区,包括调控元件,并深入了解塑造现代基因组的各种力量。作为全基因组测序工作的补充,我们正在对多种进化上不同的脊椎动物的靶向基因组区域进行测序和比较。在此,我们报告了来自12个物种的超过12兆碱基(Mb)的序列的生成和分析,所有这些序列均来自与人类7号染色体上一段约1.8 Mb片段直系同源的基因组区域,该片段包含十个基因,其中包括在囊性纤维化中发生突变的基因。这些序列显示出保守性,反映了功能限制以及塑造该基因组区域的中性突变事件。特别是,我们发现了大量先前通过实验未鉴定出的保守非编码片段,其中大多数仅通过成对序列比较无法检测到。对转座元件插入的分析突出了这些物种之间基因组动态的差异,并证实了啮齿动物作为灵长类动物姐妹群的位置。