Giraudeau F, Petit E, Avet-Loiseau H, Hauck Y, Vergnaud G, Amarger V
Institut de Génétique et Microbiologie, Université Paris-Sud, 91405 Orsay CEDEX, France.
Genome Res. 1999 Jul;9(7):647-53.
Microsatellites and minisatellites are two classes of tandem repeat sequences differing in their size, mutation processes, and chromosomal distribution. The boundary between the two classes is not defined. We have developed a convenient, hybridization-based human library screening procedure able to detect long CA-rich sequences. Analysis of cosmid clones derived from a chromosome 1 library show that cross-hybridizing sequences tested are imperfect CA-rich sequences, some of them showing a minisatellite organization. All but one of the 13 positive chromosome 1 clones studied are localized in chromosomal bands to which minisatellites have previously been assigned, such as the 1pter cluster. To test the applicability of the procedure to minisatellite detection on a larger scale, we then used a large-insert whole-genome PAC library. Altogether, 22 new minisatellites have been identified in positive PAC and cosmid clones and 20 of them are telomeric. Among the 42 positive PAC clones localized within the human genome by FISH and/or linkage analysis, 25 (60%) are assigned to a terminal band of the karyotype, 4 (9%) are juxtacentromeric, and 13 (31%) are interstitial. The localization of at least two of the interstitial PAC clones corresponds to previously characterized minisatellite-containing regions and/or ancestrally telomeric bands, in agreement with this minisatellite-like distribution. The data obtained are in close agreement with the parallel investigation of human genome sequence data and suggest that long human (CA)s are imperfect CA repeats belonging to the minisatellite class of sequences. This approach provides a new tool to efficiently target genomic clones originating from subtelomeric domains, from which minisatellite sequences can readily be obtained. [The sequence data described in this paper have been submitted to the EMBL data library under accession nos. AJ000377-AJ000383.]
微卫星和小卫星是两类串联重复序列,它们在大小、突变过程和染色体分布上有所不同。这两类之间的界限尚无明确界定。我们开发了一种便捷的、基于杂交的人类文库筛选程序,能够检测富含CA的长序列。对来自1号染色体文库的黏粒克隆进行分析表明,所测试的交叉杂交序列是不完美的富含CA的序列,其中一些呈现小卫星结构。在研究的13个1号染色体阳性克隆中,除一个外,其余均定位于先前已确定有小卫星的染色体带,如1pter簇。为了在更大规模上测试该程序对小卫星检测的适用性,我们随后使用了一个大插入片段的全基因组PAC文库。总共在阳性PAC和黏粒克隆中鉴定出22个新的小卫星,其中20个位于端粒。通过荧光原位杂交(FISH)和/或连锁分析定位在人类基因组内的42个阳性PAC克隆中,25个(60%)定位于核型的末端带,4个(9%)位于着丝粒旁,13个(31%)位于间质区。至少两个间质PAC克隆的定位与先前表征的含小卫星区域和/或祖先端粒带相对应,与这种类似小卫星的分布一致。所获得的数据与对人类基因组序列数据的平行研究密切一致,并表明人类长(CA)序列是属于小卫星序列类别的不完美CA重复序列。这种方法提供了一种新工具,可有效靶向源自亚端粒区域的基因组克隆,从中可轻松获得小卫星序列。[本文所述的序列数据已提交至EMBL数据库,登录号为AJ000377 - AJ000383。]