Jiang Yuan, Brennan Jennifer S, Calixte Rose, He Yunxiao, Nyirabahizi Epiphanie, Zhang Heping
Department of Epidemiology and Public Health, Yale School of Public Health, School of Medicine, Yale University, 60 College Street, PO Box 208034, New Haven, CT 06520-8034, USA.
BMC Proc. 2011 Nov 29;5 Suppl 9(Suppl 9):S102. doi: 10.1186/1753-6561-5-S9-S102.
Existing methods for analyzing rare variant data focus on collapsing a group of rare variants into a single common variant; collapsing is based on an intuitive function of the rare variant genotype information, such as an indicator function or a weighted sum. It is more natural, however, to take into account the single-nucleotide polymorphism (SNP) interactions informed directly by the data. We propose a novel tree-based method that automatically detects SNP interactions and generates candidate markers from the original pool of rare variants. In addition, we utilize the advantage of having 200 phenotype replications in the Genetic Analysis Workshop 17 data to assess the candidate markers by means of repeated logistic regressions. This new approach shows potential in the rare variant analysis. We correctly identify the association between gene FLT1 and phenotype Affect, although there exist other false positives in our results. Our analyses are performed without knowledge of the underlying simulating model.
现有的分析罕见变异数据的方法侧重于将一组罕见变异合并为一个单一的常见变异;合并基于罕见变异基因型信息的直观函数,如指示函数或加权和。然而,更自然的做法是考虑由数据直接告知的单核苷酸多态性(SNP)相互作用。我们提出了一种新颖的基于树的方法,该方法能自动检测SNP相互作用,并从原始的罕见变异库中生成候选标记。此外,我们利用遗传分析研讨会17数据中200个表型复制品的优势,通过重复逻辑回归来评估候选标记。这种新方法在罕见变异分析中显示出潜力。尽管我们的结果中存在其他假阳性,但我们正确地识别出了基因FLT1与表型“Affect”之间的关联。我们的分析是在不了解潜在模拟模型的情况下进行的。