Hoh J, Wille A, Ott J
Laboratory of Statistical Genetics, Rockefeller University, New York, New York 10021, USA.
Genome Res. 2001 Dec;11(12):2115-9. doi: 10.1101/gr.204001.
The search for genes underlying complex traits has been difficult and often disappointing. The main reason for these difficulties is that several genes, each with rather small effect, might be interacting to produce the trait. Therefore, we must search the whole genome for a good chance to find these genes. Doing this with tens of thousands of SNP markers, however, greatly increases the overall probability of false-positive results, and current methods limiting such error probabilities to acceptable levels tend to reduce the power of detecting weak genes. Investigating large numbers of SNPs inevitably introduces errors (e.g., in genotyping), which will distort analysis results. Here we propose a simple strategy that circumvents many of these problems. We develop a set-association method to blend relevant sources of information such as allelic association and Hardy-Weinberg disequilibrium. Information is combined over multiple markers and genes in the genome, quality control is improved by trimming, and an appropriate testing strategy limits the overall false-positive rate. In contrast to other available methods, our method to detect association to sets of SNP markers in different genes in a real data application has shown remarkable success.
寻找复杂性状背后的基因一直困难重重,且常常令人失望。造成这些困难的主要原因是,多个基因(每个基因的效应都相当小)可能相互作用以产生该性状。因此,我们必须在整个基因组中进行搜索,才有机会找到这些基因。然而,使用数以万计的单核苷酸多态性(SNP)标记来进行搜索,会大大增加假阳性结果的总体概率,而目前将此类错误概率限制在可接受水平的方法往往会降低检测弱效应基因的能力。研究大量的SNP不可避免地会引入误差(例如基因分型中的误差),这会扭曲分析结果。在此,我们提出一种简单的策略,可规避许多此类问题。我们开发了一种集合关联方法,以融合诸如等位基因关联和哈迪-温伯格不平衡等相关信息源。信息在基因组中的多个标记和基因上进行整合,通过筛选提高质量控制,并且采用适当的测试策略限制总体假阳性率。与其他现有方法相比,我们在实际数据应用中检测与不同基因中的SNP标记集关联的方法已取得显著成功。