University of California, Berkeley, 94720, USA.
Genes Immun. 2010 Apr;11(3):199-208. doi: 10.1038/gene.2009.110. Epub 2010 Jan 21.
Investigating genetic interactions (epistasis) has proven difficult despite the recent advances of both laboratory methods and statistical developments. With no 'best' statistical approach available, combining several analytical methods may be optimal for detecting epistatic interactions. Using a multi-stage analysis that incorporated supervised machine learning and methods of association testing, we investigated epistatic interactions with a well-established genetic factor (PTPN22 1858T) in a complex autoimmune disease (rheumatoid arthritis (RA)). Our analysis consisted of four principal stages: Stage I (data reduction)-identifying candidate chromosomal regions in 292 affected sibling pairs, by predicting PTPN22 concordance using multipoint identity-by-descent probabilities and a supervised machine learning algorithm (Random Forests); Stage II (extension analysis)-testing detailed genetic data within candidate chromosomal regions for epistasis with PTPN22 1858T in 677 cases and 750 controls using logistic regression; Stage III (replication analysis)-confirmation of epistatic interactions in 947 cases and 1756 controls; Stage IV (combined analysis)-a pooled analysis including all 1624 RA cases and 2506 control subjects for final estimates of effect size. A total of seven replicating epistatic interactions were identified. SNP variants within CDH13, MYO3A, CEP72 and near WFDC1 showed significant evidence for interaction with PTPN22, affecting susceptibility to RA.
尽管实验室方法和统计方法都有了最近的进展,但对遗传相互作用(上位性)的研究仍然很困难。由于没有最佳的统计方法,因此结合几种分析方法可能是检测上位性相互作用的最佳方法。我们使用多阶段分析,结合了监督机器学习和关联测试方法,在一种已确立的遗传因素(PTPN22 1858T)和一种复杂的自身免疫性疾病(类风湿关节炎(RA))中研究了上位性相互作用。我们的分析包括四个主要阶段:第一阶段(数据缩减)-通过使用多点身份相关概率和监督机器学习算法(随机森林)预测 PTPN22 一致性,在 292 对受影响的同胞对中确定候选染色体区域;第二阶段(扩展分析)-在 677 例病例和 750 例对照中使用逻辑回归在候选染色体区域内测试与 PTPN22 1858T 的详细遗传数据的上位性;第三阶段(复制分析)-在 947 例病例和 1756 例对照中确认上位性相互作用;第四阶段(综合分析)-对所有 1624 例 RA 病例和 2506 例对照进行合并分析,以最终估计效应大小。共发现了七个复制的上位性相互作用。CDH13、MYO3A、CEP72 和 WFDC1 附近的 SNP 变体与 PTPN22 存在显著的相互作用证据,影响 RA 的易感性。