Zhang Weihua, Lau Winston, Hu Cheng, Kuo Tai-Yue
Section of Cancer Genetics, The Institute of Cancer Research, 15 Cotswold Road, Belmont, Sutton, Surrey SM2 5NG, UK.
Department of Cardiology, Ealing Hospital NHS Trust, Uxbridge Road, Southall, Middlesex, UB1 3HW, UK.
BMC Proc. 2007;1 Suppl 1(Suppl 1):S166. doi: 10.1186/1753-6561-1-s1-s166. Epub 2007 Dec 18.
We studied the impact of marker density on the accuracy of association mapping using Genetic Analysis Workshop 15 simulated dense single-nucleotide polymorphism (SNP) data on chromosome 6. A total of 1500 cases and 2000 unaffected controls genotyped for 17,820 SNPs were analyzed. We applied the approach that combines information from multiple SNPs under the framework of the Malecot model and composite likelihood to non-overlapping regions of the chromosome. We successfully detected the associations with disease Loci C and D and predicted their locations as small as zero distance to Locus C when it was "typed" and 112 kb from the untyped rare Locus D. Reducing marker density decreased the accuracy of location estimates. However, the predicted locations were robust to variations in the number of SNPs. Generally, the linkage disequilibrium (LD) map reflecting distances between markers in relation to LD produced higher accuracy than the physical map. We also demonstrated that SNP selection based on equal LD distance outperforms that based on equal physical distance or SNP tagging. Furthermore, ignoring rare SNPs diminished the ability to detect rare causal variants.
我们使用遗传分析研讨会15提供的6号染色体上模拟的密集单核苷酸多态性(SNP)数据,研究了标记密度对关联作图准确性的影响。对总共1500例病例和2000名未受影响的对照进行了17,820个SNP的基因分型分析。我们在马勒科特模型和复合似然框架下,将来自多个SNP的信息应用于染色体的非重叠区域。我们成功检测到与疾病位点C和D的关联,并预测它们的位置,当“分型”位点C时距离为零,而对于未分型的罕见位点D,距离为112 kb。降低标记密度会降低位置估计的准确性。然而,预测位置对SNP数量的变化具有鲁棒性。一般来说,反映标记间与连锁不平衡(LD)相关距离的LD图谱比物理图谱产生的准确性更高。我们还证明,基于相等LD距离的SNP选择优于基于相等物理距离或SNP标签的选择。此外,忽略罕见SNP会降低检测罕见因果变异的能力。