Department of Biostatistics, Yale School of Public Health, New Haven, Connecticut, USA.
School of Statistics and Management, Shanghai University of Finance and Economics, Shanghai, China.
Biometrics. 2022 Dec;78(4):1542-1554. doi: 10.1111/biom.13526. Epub 2021 Aug 1.
Multiple types of molecular (genetic, genomic, epigenetic, etc.) measurements, environmental risk factors, and their interactions have been found to contribute to the outcomes and phenotypes of complex diseases. In each of the previous studies, only the interactions between one type of molecular measurement and environmental risk factors have been analyzed. In recent biomedical studies, multidimensional profiling, in which data from multiple types of molecular measurements are collected from the same subjects, is becoming popular. A myriad of recent studies have shown that collectively analyzing multiple types of molecular measurements is not only biologically sensible but also leads to improved estimation and prediction. In this study, we conduct an M-E interaction analysis, with M standing for multidimensional molecular measurements and E standing for environmental risk factors. This can accommodate multiple types of molecular measurements and sufficiently account for their overlapping as well as independent information. Extensive simulation shows that it outperforms several closely related alternatives. In the analysis of TCGA (The Cancer Genome Atlas) data on lung adenocarcinoma and cutaneous melanoma, we make some stable biological findings and achieve stable prediction.
多种类型的分子(遗传、基因组、表观遗传等)测量、环境风险因素及其相互作用已被发现对复杂疾病的结果和表型有影响。在前每一项研究中,只分析了一种类型的分子测量和环境风险因素之间的相互作用。在最近的生物医学研究中,多维分析越来越流行,即从同一受试者中收集多种类型的分子测量数据。最近的大量研究表明,综合分析多种类型的分子测量不仅在生物学上是合理的,而且还可以提高估计和预测的准确性。在这项研究中,我们进行了 M-E 相互作用分析,其中 M 代表多维分子测量,E 代表环境风险因素。这可以容纳多种类型的分子测量,并充分考虑它们的重叠和独立信息。广泛的模拟表明,它优于几个密切相关的替代方案。在对肺腺癌和皮肤黑色素瘤的 TCGA(癌症基因组图谱)数据的分析中,我们得出了一些稳定的生物学发现,并实现了稳定的预测。