Evans G A, Lewis K A
Molecular Genetics Laboratory, Salk Institute for Biological Studies, La Jolla, CA 92037.
Proc Natl Acad Sci U S A. 1989 Jul;86(13):5030-4. doi: 10.1073/pnas.86.13.5030.
A rapid and powerful approach for linking individual clones of a cosmid library and the assembly of a large physical map is presented, which depends on the simultaneous analysis of many cosmid clones for overlapping regions. This method uses cosmid vectors that contain endogenous bacteriophage T3 and T7 promoters to allow for the identification of overlapping clones through the synthesis of end-specific RNA probes. A genomic library is constructed and organized as an ordered matrix such that each clone is assigned an identifying coordinate. DNA from mixtures of cosmid clones is pooled such that each pool contains only one common member with any other pool, RNA probes are prepared from mixtures of cosmid clones, and groups of clones overlapping with the constituents of the mixtures are determined by hybridization. Pooled probes are most simply prepared by grouping clones according to the rows and columns of the library matrix. The pairwise comparison of data generated by the hybridization of mixed probes can be decoded by using simple algorithms that predict the order and linkage of all clones in the collection and organize them into predicted contigs. To demonstrate the feasibility of multiplexed analysis of cosmids, a genomic library was prepared from a mouse-human somatic cell hybrid that contains a portion of the long arm of human chromosome 11. Preparation, arrangement on a matrix, and analysis of pooled cosmid clones from this collection resulted in the detection of 1099 linked pairs of cosmids, which could be assembled into 315 contigs. Thus, with a minimal amount of effort, a substantial portion of this genomic region has been linked in multiple overlapping contigs. This method may have practical applications in the large-scale mapping and sequencing of mammalian genomes.
本文介绍了一种快速且强大的方法,用于连接黏粒文库的各个克隆并构建大型物理图谱,该方法依赖于对多个黏粒克隆的重叠区域进行同步分析。此方法使用含有内源性噬菌体T3和T7启动子的黏粒载体,通过合成末端特异性RNA探针来鉴定重叠克隆。构建基因组文库并将其组织成有序矩阵,以便为每个克隆分配一个识别坐标。将来自黏粒克隆混合物的DNA汇集起来,使得每个池与其他任何池仅共享一个共同成员,从黏粒克隆混合物中制备RNA探针,并通过杂交确定与混合物成分重叠的克隆组。汇集探针最简单的制备方法是根据文库矩阵的行和列对克隆进行分组。通过使用简单算法可以对混合探针杂交产生的数据进行成对比较,该算法可预测集合中所有克隆的顺序和连锁关系,并将它们组织成预测的重叠群。为了证明黏粒多重分析的可行性,从含有人类11号染色体长臂一部分的小鼠 - 人类体细胞杂种中制备了基因组文库。对该集合中的黏粒克隆进行制备、排列在矩阵上并分析,结果检测到1099对连锁的黏粒,这些黏粒可组装成315个重叠群。因此,只需付出最少的努力,该基因组区域的很大一部分就已通过多个重叠的重叠群连接起来。这种方法可能在哺乳动物基因组的大规模图谱绘制和测序中具有实际应用。