Fondon J W, Mele G M, Brezinschek R I, Cummings D, Pande A, Wren J, O'Brien K M, Kupfer K C, Wei M H, Lerman M, Minna J D, Garner H R
McDermott Center for Human Growth and Development and the Center for Biomedical Inventions, The University of Texas Southwestern Medical Center, Dallas, TX 75235, USA.
Proc Natl Acad Sci U S A. 1998 Jun 23;95(13):7514-9. doi: 10.1073/pnas.95.13.7514.
A computational system for the prediction of polymorphic loci directly and efficiently from human genomic sequence was developed and verified. A suite of programs, collectively called POMPOUS (polymorphic marker prediction of ubiquitous simple sequences) detects tandem repeats ranging from dinucleotides up to 250 mers, scores them according to predicted level of polymorphism, and designs appropriate flanking primers for PCR amplification. This approach was validated on an approximately 750-kilobase region of human chromosome 3p21.3, involved in lung and breast carcinoma homozygous deletions. Target DNA from 36 paired B lymphoblastoid and lung cancer lines was amplified and allelotyped for 33 loci predicted by POMPOUS to be variable in repeat size. We found that among those 36 predominately Caucasian individuals 22 of the 33 (67%) predicted loci were polymorphic with an average heterozygosity of 0.42. Allele loss in this region was found in 27/36 (75%) of the tumor lines using these markers. POMPOUS provides the genetic researcher with an additional tool for the rapid and efficient identification of polymorphic markers, and through a World Wide Web site, investigators can use POMPOUS to identify polymorphic markers for their research. A catalog of 13,261 potential polymorphic markers and associated primer sets has been created from the analysis of 141,779,504 base pairs of human genomic sequence in GenBank. This data is available on our Web site (pompous.swmed.edu) and will be updated periodically as GenBank is expanded and algorithm accuracy is improved.
开发并验证了一种用于直接且高效地从人类基因组序列预测多态性位点的计算系统。一套程序,统称为POMPOUS(普遍简单序列的多态性标记预测),可检测从二核苷酸到250聚体的串联重复序列,根据预测的多态性水平对其进行评分,并设计合适的侧翼引物用于PCR扩增。这种方法在人类染色体3p21.3的约750千碱基区域得到验证,该区域与肺癌和乳腺癌的纯合缺失有关。对来自36对B淋巴母细胞系和肺癌细胞系的目标DNA进行扩增,并对POMPOUS预测重复序列大小可变的33个位点进行基因分型。我们发现,在这36名主要为白种人的个体中,33个预测位点中有22个(67%)具有多态性,平均杂合度为0.42。使用这些标记物,在27/36(75%)的肿瘤细胞系中发现了该区域的等位基因缺失。POMPOUS为基因研究人员提供了一种快速高效鉴定多态性标记物的额外工具,研究人员可通过万维网站点使用POMPOUS为其研究鉴定多态性标记物。通过对GenBank中141,779,504碱基对的人类基因组序列进行分析,创建了一个包含13,261个潜在多态性标记物及相关引物组的目录。该数据可在我们的网站(pompous.swmed.edu)上获取,并将随着GenBank的扩展和算法准确性的提高而定期更新。