School of Statistics and Management, Shanghai University of Finance and Economics, 777 Guoding Road, Shanghai 200433, China.
Department of Biostatistics, School of Public Health, Yale University, 60 College Street, New Haven, CT 06520, USA.
Stat Med. 2018 Feb 20;37(4):598-610. doi: 10.1002/sim.7523. Epub 2017 Oct 16.
Gene-gene (G×G) interactions have been shown to be critical for the fundamental mechanisms and development of complex diseases beyond main genetic effects. The commonly adopted marginal analysis is limited by considering only a small number of G factors at a time. With the "main effects, interactions" hierarchical constraint, many of the existing joint analysis methods suffer from prohibitively high computational cost. In this study, we propose a new method for identifying important G×G interactions under joint modeling. The proposed method adopts tensor regression to accommodate high data dimensionality and the penalization technique for selection. It naturally accommodates the strong hierarchical structure without imposing additional constraints, making optimization much simpler and faster than in the existing studies. It outperforms multiple alternatives in simulation. The analysis of The Cancer Genome Atlas (TCGA) data on lung cancer and melanoma demonstrates that it can identify markers with important implications and better prediction performance.
基因-基因(G×G)相互作用已被证明对于复杂疾病的基本机制和发展至关重要,超出了主要遗传效应的范围。常用的边缘分析方法受到限制,因为它一次只能考虑少数 G 因素。在“主效应、相互作用”层次约束下,许多现有的联合分析方法由于计算成本过高而受到限制。在这项研究中,我们提出了一种新的联合建模方法来识别重要的 G×G 相互作用。所提出的方法采用张量回归来适应高数据维度和惩罚技术进行选择。它自然适应了强大的层次结构,而无需施加额外的约束,使得优化比现有研究更简单、更快。它在模拟中优于多种替代方法。对肺癌和黑色素瘤的癌症基因组图谱(TCGA)数据的分析表明,它可以识别具有重要意义和更好预测性能的标志物。