• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用相互作用进行精准医学中的个体特质预测。

Empowering individual trait prediction using interactions for precision medicine.

机构信息

Institut für Medizinische Biometrie und Statistik, Universität zu Lübeck, Universitätsklinikum Schleswig-Holstein, Campus Lübeck, Lübeck, Germany.

出版信息

BMC Bioinformatics. 2021 Feb 18;22(1):74. doi: 10.1186/s12859-021-04011-z.

DOI:10.1186/s12859-021-04011-z
PMID:33602124
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7890638/
Abstract

BACKGROUND

One component of precision medicine is to construct prediction models with their predicitve ability as high as possible, e.g. to enable individual risk prediction. In genetic epidemiology, complex diseases like coronary artery disease, rheumatoid arthritis, and type 2 diabetes, have a polygenic basis and a common assumption is that biological and genetic features affect the outcome under consideration via interactions. In the case of omics data, the use of standard approaches such as generalized linear models may be suboptimal and machine learning methods are appealing to make individual predictions. However, most of these algorithms focus mostly on main or marginal effects of the single features in a dataset. On the other hand, the detection of interacting features is an active area of research in the realm of genetic epidemiology. One big class of algorithms to detect interacting features is based on the multifactor dimensionality reduction (MDR). Here, we further develop the model-based MDR (MB-MDR), a powerful extension of the original MDR algorithm, to enable interaction empowered individual prediction.

RESULTS

Using a comprehensive simulation study we show that our new algorithm (median AUC: 0.66) can use information hidden in interactions and outperforms two other state-of-the-art algorithms, namely the Random Forest (median AUC: 0.54) and Elastic Net (median AUC: 0.50), if interactions are present in a scenario of two pairs of two features having small effects. The performance of these algorithms is comparable if no interactions are present. Further, we show that our new algorithm is applicable to real data by comparing the performance of the three algorithms on a dataset of rheumatoid arthritis cases and healthy controls. As our new algorithm is not only applicable to biological/genetic data but to all datasets with discrete features, it may have practical implications in other research fields where interactions between features have to be considered as well, and we made our method available as an R package ( https://github.com/imbs-hl/MBMDRClassifieR ).

CONCLUSIONS

The explicit use of interactions between features can improve the prediction performance and thus should be included in further attempts to move precision medicine forward.

摘要

背景

精准医学的一个组成部分是构建预测能力尽可能高的预测模型,例如,实现个体风险预测。在遗传流行病学中,复杂疾病,如冠状动脉疾病、类风湿关节炎和 2 型糖尿病,具有多基因基础,一个常见的假设是,生物和遗传特征通过相互作用影响所考虑的结果。在组学数据的情况下,使用广义线性模型等标准方法可能不是最优的,机器学习方法很有吸引力,可以进行个体预测。然而,这些算法中的大多数主要关注数据集单个特征的主要或边际效应。另一方面,检测相互作用的特征是遗传流行病学领域的一个活跃研究领域。一类用于检测相互作用特征的算法是基于多因子降维(MDR)的。在这里,我们进一步开发了基于模型的 MDR(MB-MDR),这是原始 MDR 算法的强大扩展,以实现具有交互功能的个体预测。

结果

使用全面的模拟研究,我们表明,我们的新算法(中位数 AUC:0.66)可以利用隐藏在相互作用中的信息,如果在存在两对具有小效应的两个特征的情况下存在相互作用,其表现优于另外两种最先进的算法,即随机森林(中位数 AUC:0.54)和弹性网络(中位数 AUC:0.50)。如果不存在相互作用,则这些算法的性能相当。此外,我们通过比较三种算法在类风湿关节炎病例和健康对照数据集上的性能,表明我们的新算法适用于真实数据。由于我们的新算法不仅适用于生物/遗传数据,而且适用于具有离散特征的所有数据集,因此它可能在其他需要考虑特征之间相互作用的研究领域具有实际意义,我们还将该方法作为 R 包(https://github.com/imbs-hl/MBMDRClassifieR)提供。

结论

明确使用特征之间的相互作用可以提高预测性能,因此应该包含在进一步推动精准医学的尝试中。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0408/7890638/a62123479e74/12859_2021_4011_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0408/7890638/bbb722eb5fee/12859_2021_4011_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0408/7890638/06fda6bde716/12859_2021_4011_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0408/7890638/0c37496b7861/12859_2021_4011_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0408/7890638/44688b58a892/12859_2021_4011_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0408/7890638/5b7730dbe65f/12859_2021_4011_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0408/7890638/eb59b1349f98/12859_2021_4011_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0408/7890638/a62123479e74/12859_2021_4011_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0408/7890638/bbb722eb5fee/12859_2021_4011_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0408/7890638/06fda6bde716/12859_2021_4011_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0408/7890638/0c37496b7861/12859_2021_4011_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0408/7890638/44688b58a892/12859_2021_4011_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0408/7890638/5b7730dbe65f/12859_2021_4011_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0408/7890638/eb59b1349f98/12859_2021_4011_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0408/7890638/a62123479e74/12859_2021_4011_Fig7_HTML.jpg

相似文献

1
Empowering individual trait prediction using interactions for precision medicine.利用相互作用进行精准医学中的个体特质预测。
BMC Bioinformatics. 2021 Feb 18;22(1):74. doi: 10.1186/s12859-021-04011-z.
2
A Novel Detection Method for High-Order SNP Epistatic Interactions Based on Explicit-Encoding-Based Multitasking Harmony Search.基于显式编码的多任务协同搜索的新型高阶 SNP 上位性互作检测方法。
Interdiscip Sci. 2024 Sep;16(3):688-711. doi: 10.1007/s12539-024-00621-2. Epub 2024 Jul 2.
3
Machine learning algorithm-based risk prediction model of coronary artery disease.基于机器学习算法的冠状动脉疾病风险预测模型
Mol Biol Rep. 2018 Oct;45(5):901-910. doi: 10.1007/s11033-018-4236-2. Epub 2018 Jul 11.
4
Spatial rank-based multifactor dimensionality reduction to detect gene-gene interactions for multivariate phenotypes.基于空间秩的多因素降维分析检测多变量表型的基因-基因交互作用。
BMC Bioinformatics. 2021 Oct 4;22(1):480. doi: 10.1186/s12859-021-04395-y.
5
A comparative study on the unified model based multifactor dimensionality reduction methods for identifying gene-gene interactions associated with the survival phenotype.基于统一模型的多因素降维方法识别与生存表型相关的基因-基因相互作用的比较研究。
BioData Min. 2021 Mar 1;14(1):17. doi: 10.1186/s13040-021-00248-9.
6
Exploring gene-gene interaction in family-based data with an unsupervised machine learning method: EPISFA.基于无监督机器学习方法的家系数据基因-基因交互作用研究:EPISFA。
Genet Epidemiol. 2020 Nov;44(8):811-824. doi: 10.1002/gepi.22342. Epub 2020 Sep 1.
7
Accurate Prediction of Coronary Heart Disease for Patients With Hypertension From Electronic Health Records With Big Data and Machine-Learning Methods: Model Development and Performance Evaluation.利用大数据和机器学习方法从电子健康记录中准确预测高血压患者的冠心病:模型开发与性能评估
JMIR Med Inform. 2020 Jul 6;8(7):e17257. doi: 10.2196/17257.
8
[Detecting interaction for quantitative trait by generalized multifactor dimensionality reduction].[利用广义多因素降维法检测数量性状的基因互作]
Zhonghua Liu Xing Bing Xue Za Zhi. 2010 Aug;31(8):938-41.
9
Machine learning techniques for personalized breast cancer risk prediction: comparison with the BCRAT and BOADICEA models.机器学习技术在个体化乳腺癌风险预测中的应用:与 BCRAT 和 BOADICEA 模型的比较。
Breast Cancer Res. 2019 Jun 20;21(1):75. doi: 10.1186/s13058-019-1158-4.
10
The R-package GenomicTools for multifactor dimensionality reduction and the analysis of (exploratory) Quantitative Trait Loci.用于多因素降维和(探索性)数量性状位点分析的R包GenomicTools。
Comput Methods Programs Biomed. 2017 Nov;151:171-177. doi: 10.1016/j.cmpb.2017.08.012. Epub 2017 Aug 30.

引用本文的文献

1
Machine learning in Alzheimer's disease genetics.阿尔茨海默病遗传学中的机器学习
Nat Commun. 2025 Jul 22;16(1):6726. doi: 10.1038/s41467-025-61650-z.
2
Detecting genetic interactions with visible neural networks.检测与可见神经网络的基因相互作用。
Commun Biol. 2025 Jun 5;8(1):874. doi: 10.1038/s42003-025-08157-x.
3
* and DOCK1* gene-gene interactions associated with rheumatoid arthritis in the focal adhesion pathway.以及与类风湿关节炎相关的粘着斑通路中DOCK1基因与基因的相互作用。

本文引用的文献

1
The search for gene-gene interactions in genome-wide association studies: challenges in abundance of methods, practical considerations, and biological interpretation.全基因组关联研究中基因-基因相互作用的探索:方法众多带来的挑战、实际考量及生物学解释
Ann Transl Med. 2018 Apr;6(8):157. doi: 10.21037/atm.2018.04.05.
2
Using Full Genomic Information to Predict Disease: Breaking Down the Barriers Between Complex and Mendelian Diseases.利用全基因组信息预测疾病:打破复杂疾病和孟德尔疾病之间的障碍。
Annu Rev Genomics Hum Genet. 2018 Aug 31;19:289-301. doi: 10.1146/annurev-genom-083117-021136. Epub 2018 Apr 11.
3
What is precision medicine?
Front Genet. 2024 May 13;15:1375036. doi: 10.3389/fgene.2024.1375036. eCollection 2024.
4
Computational approaches in rheumatic diseases - Deciphering complex spatio-temporal cell interactions.风湿性疾病中的计算方法——解读复杂的时空细胞相互作用
Comput Struct Biotechnol J. 2023 Aug 6;21:4009-4020. doi: 10.1016/j.csbj.2023.08.005. eCollection 2023.
5
Dynamic effects of prognostic factors and individual survival prediction for amyotrophic lateral sclerosis disease.肌萎缩侧索硬化症预后因素的动态影响及个体生存预测。
Ann Clin Transl Neurol. 2023 Jun;10(6):892-903. doi: 10.1002/acn3.51771. Epub 2023 Apr 4.
6
Artificial Intelligence in Rheumatoid Arthritis: Current Status and Future Perspectives: A State-of-the-Art Review.类风湿关节炎中的人工智能:现状与未来展望:一篇最新综述
Rheumatol Ther. 2022 Oct;9(5):1249-1304. doi: 10.1007/s40744-022-00475-4. Epub 2022 Jul 18.
精准医学是什么?
Eur Respir J. 2017 Oct 19;50(4). doi: 10.1183/13993003.00391-2017. Print 2017 Oct.
4
Precision medicine for all? Challenges and opportunities for a precision medicine approach to critical illness.全民精准医疗?危重症精准医疗方法面临的挑战和机遇。
Crit Care. 2017 Oct 18;21(1):257. doi: 10.1186/s13054-017-1836-5.
5
The Effect of Splitting on Random Forests.分裂对随机森林的影响。
Mach Learn. 2015 Apr;99(1):75-118. doi: 10.1007/s10994-014-5451-2. Epub 2014 Jul 2.
6
Precision medicine: In need of guidance and surveillance.精准医学:需要指导和监管。
World J Gastroenterol. 2017 Jul 28;23(28):5045-5050. doi: 10.3748/wjg.v23.i28.5045.
7
A perspective on interaction effects in genetic association studies.遗传关联研究中交互作用效应的观点。
Genet Epidemiol. 2016 Dec;40(8):678-688. doi: 10.1002/gepi.21989. Epub 2016 Jul 7.
8
gammaMAXT: a fast multiple-testing correction algorithm.伽马MAXT:一种快速多重检验校正算法。
BioData Min. 2015 Nov 20;8:36. doi: 10.1186/s13040-015-0069-x. eCollection 2015.
9
A roadmap to multifactor dimensionality reduction methods.多因素降维方法路线图。
Brief Bioinform. 2016 Mar;17(2):293-308. doi: 10.1093/bib/bbv038. Epub 2015 Jun 24.
10
An efficient algorithm to perform multiple testing in epistasis screening.一种用于连锁分析中多重检验的高效算法。
BMC Bioinformatics. 2013 Apr 24;14:138. doi: 10.1186/1471-2105-14-138.