Mohlke Karen L, Erdos Michael R, Scott Laura J, Fingerlin Tasha E, Jackson Anne U, Silander Kaisa, Hollstein Pablo, Boehnke Michael, Collins Francis S
Genome Technology Branch, National Human Genome Research Institute, Bethesda, MD 20892, USA.
Proc Natl Acad Sci U S A. 2002 Dec 24;99(26):16928-33. doi: 10.1073/pnas.262661399. Epub 2002 Dec 13.
To facilitate positional cloning of complex trait susceptibility loci, we are investigating methods to reduce the effort required to identify trait-associated alleles. We examined primer extension analysis by matrix-assisted laser desorptionionization time-of-flight mass spectrometry to screen single-nucleotide polymorphisms (SNPs) for association by using DNA pools. We tested whether this method can accurately estimate allele frequency differences between pools while maintaining the high-throughput nature of assay design, sample handling, and scoring. We follow up interesting allele frequency differences in pools by genotyping individuals. We tested DNA pools of 182, 228, and 499 individuals using 16 SNPs with minor allele frequencies 0.026-0.486 and allele frequency differences 0.001-0.108 that we had genotyped previously on individuals and 381 SNPs that we had not. Precision, as measured by the average standard deviation among 16 semidependent replicates, was 0.021 +/- 0.011 for the 16 SNPs and 0.018 +/- 0.008 for the 291381 SNPs used in further analysis. For the 16 SNPs, the average absolute error in predicting allele frequency differences between pools was 0.009; the largest errors were 0.031, 0.028, and 0.027. We determined that compensating for unequal peak heights in heterozygotes improved precision of allele frequency estimates but had only a very minor effect on accuracy of allele frequency differences between pools. Based on these data and assuming pools of 500 individuals, we conclude that at significance level 0.05 we would have 95% (82%) power to detect population allele frequency differences of 0.07 for control allele frequencies of 0.10 (0.50).
为便于对复杂性状易感基因座进行定位克隆,我们正在研究减少鉴定性状相关等位基因所需工作量的方法。我们通过基质辅助激光解吸电离飞行时间质谱法检测引物延伸分析,以利用DNA池筛选单核苷酸多态性(SNP)进行关联分析。我们测试了该方法能否在保持检测设计、样本处理和评分的高通量特性的同时,准确估计池间等位基因频率差异。我们通过对个体进行基因分型来追踪池间有趣的等位基因频率差异。我们使用16个次要等位基因频率为0.026 - 0.486且等位基因频率差异为0.001 - 0.108的SNP(这些SNP我们之前已对个体进行过基因分型)以及381个我们未曾检测过的SNP,对182、228和499个个体的DNA池进行了测试。通过16个半独立重复样本间的平均标准差衡量的精密度,对于用于进一步分析的16个SNP为0.021±0.011,对于291381个SNP为0.018±0.008。对于这16个SNP,预测池间等位基因频率差异的平均绝对误差为0.009;最大误差分别为0.031、0.028和0.027。我们确定,补偿杂合子中不等的峰高可提高等位基因频率估计的精密度,但对池间等位基因频率差异的准确性影响极小。基于这些数据并假设个体池为500个,我们得出结论,在显著性水平0.05时,对于对照等位基因频率为0.10(0.50)的情况,我们有95%(82%)的把握检测到0.