• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于无监督机器学习方法的家系数据基因-基因交互作用研究:EPISFA。

Exploring gene-gene interaction in family-based data with an unsupervised machine learning method: EPISFA.

机构信息

Department of Epidemiology and Biostatistics, School of Public Health, Peking University, Beijing, China.

Department of Epidemiology and Biostatistics, School of Public Health, Capital Medical University, Beijing, China.

出版信息

Genet Epidemiol. 2020 Nov;44(8):811-824. doi: 10.1002/gepi.22342. Epub 2020 Sep 1.

DOI:10.1002/gepi.22342
PMID:32869348
Abstract

Gene-gene interaction (G × G) is thought to fill the gap between the estimated heritability of complex diseases and the limited genetic proportion explained by identified single-nucleotide polymorphisms. The current tools for exploring G × G were often developed for case-control designs with less considerations for their applications in families. Family-based studies are robust against bias led from population stratification in genetic studies and helpful in understanding G × G. We proposed a new algorithm epistasis sparse factor analysis (EPISFA) and epistasis sparse factor analysis for linkage disequilibrium (EPISFA-LD) based on unsupervised machine learning to screen G × G. Extensive simulations were performed to compare EPISFA/EPISFA-LD with a classical family-based algorithm FAM-MDR (family-based multifactor dimensionality reduction). The results showed that EPISFA/EPISFA-LD is a tool of both high power and computational efficiency that could be applied in family designs and is applicable within high-dimensionality datasets. Finally, we applied EPISFA/EPISFA-LD to a real dataset drawn from the Fangshan/family-based Ischemic Stroke Study in China. Five pairs of G × G were discovered by EPISFA/EPISFA-LD, including three pairs verified by other algorithms (FAM-MDR and logistic), and an additional two pairs uniquely identified by EPISFA/EPISFA-LD only. The results from EPISFA might offer new insights for understanding the genetic etiology of complex diseases. EPISFA/EPISFA-LD was implemented in R. All relevant source code as well as simulated data could be freely downloaded from https://github.com/doublexism/episfa.

摘要

基因-基因相互作用(G×G)被认为填补了复杂疾病遗传率估计值与已识别的单核苷酸多态性所解释的遗传比例之间的空白。目前用于探索 G×G 的工具通常是为病例对照设计开发的,对其在家庭中的应用考虑较少。基于家庭的研究对于遗传研究中由于人群分层引起的偏倚具有稳健性,有助于理解 G×G。我们提出了一种新的算法——基于无监督机器学习的连锁不平衡的上位稀疏因子分析(EPISFA)和上位稀疏因子分析(EPISFA-LD),用于筛选 G×G。进行了广泛的模拟比较 EPISFA/EPISFA-LD 与经典的基于家庭的算法 FAM-MDR(基于家庭的多因子维度降低)。结果表明,EPISFA/EPISFA-LD 是一种具有高功效和计算效率的工具,可应用于家庭设计,并适用于高维数据集。最后,我们将 EPISFA/EPISFA-LD 应用于来自中国房山/基于家庭的缺血性中风研究的真实数据集。通过 EPISFA/EPISFA-LD 发现了五对 G×G,其中三对通过其他算法(FAM-MDR 和逻辑)验证,另外两对仅通过 EPISFA/EPISFA-LD 唯一识别。EPISFA 的结果可能为理解复杂疾病的遗传病因提供新的见解。EPISFA/EPISFA-LD 是用 R 实现的。所有相关的源代码以及模拟数据都可以从 https://github.com/doublexism/episfa 上免费下载。

相似文献

1
Exploring gene-gene interaction in family-based data with an unsupervised machine learning method: EPISFA.基于无监督机器学习方法的家系数据基因-基因交互作用研究:EPISFA。
Genet Epidemiol. 2020 Nov;44(8):811-824. doi: 10.1002/gepi.22342. Epub 2020 Sep 1.
2
Spatial rank-based multifactor dimensionality reduction to detect gene-gene interactions for multivariate phenotypes.基于空间秩的多因素降维分析检测多变量表型的基因-基因交互作用。
BMC Bioinformatics. 2021 Oct 4;22(1):480. doi: 10.1186/s12859-021-04395-y.
3
Confounding of linkage disequilibrium patterns in large scale DNA based gene-gene interaction studies.大规模基于DNA的基因-基因相互作用研究中连锁不平衡模式的混杂
BioData Min. 2019 Jun 10;12:11. doi: 10.1186/s13040-019-0199-7. eCollection 2019.
4
A Novel Detection Method for High-Order SNP Epistatic Interactions Based on Explicit-Encoding-Based Multitasking Harmony Search.基于显式编码的多任务协同搜索的新型高阶 SNP 上位性互作检测方法。
Interdiscip Sci. 2024 Sep;16(3):688-711. doi: 10.1007/s12539-024-00621-2. Epub 2024 Jul 2.
5
FAM-MDR: a flexible family-based multifactor dimensionality reduction technique to detect epistasis using related individuals.FAM-MDR:一种灵活的基于家系的多因子维度缩减技术,用于检测相关个体中的上位性。
PLoS One. 2010 Apr 22;5(4):e10304. doi: 10.1371/journal.pone.0010304.
6
A Belief Degree-Associated Fuzzy Multifactor Dimensionality Reduction Framework for Epistasis Detection.基于置信度关联的模糊多因子降维框架用于检测基因互作。
Methods Mol Biol. 2021;2212:307-323. doi: 10.1007/978-1-0716-0947-7_19.
7
SVM-based generalized multifactor dimensionality reduction approaches for detecting gene-gene interactions in family studies.基于支持向量机的广义多因子降维方法在家族研究中检测基因-基因相互作用。
Genet Epidemiol. 2012 Feb;36(2):88-98. doi: 10.1002/gepi.21602.
8
An unsupervised machine learning method for discovering patient clusters based on genetic signatures.基于遗传特征的无监督机器学习方法发现患者聚类。
J Biomed Inform. 2018 Sep;85:30-39. doi: 10.1016/j.jbi.2018.07.004. Epub 2018 Jul 29.
9
A balanced accuracy function for epistasis modeling in imbalanced datasets using multifactor dimensionality reduction.一种使用多因素降维方法在不平衡数据集中进行上位性建模的平衡准确率函数。
Genet Epidemiol. 2007 May;31(4):306-15. doi: 10.1002/gepi.20211.
10
Empowering individual trait prediction using interactions for precision medicine.利用相互作用进行精准医学中的个体特质预测。
BMC Bioinformatics. 2021 Feb 18;22(1):74. doi: 10.1186/s12859-021-04011-z.

引用本文的文献

1
A method for finding epistatic effects of maternal and fetal variants.一种寻找母体和胎儿变异上位效应的方法。
Front Genet. 2025 Mar 31;16:1420641. doi: 10.3389/fgene.2025.1420641. eCollection 2025.
2
GADGETS: a genetic algorithm for detecting epistasis using nuclear families.GADGETS:一种使用核家庭检测上位性的遗传算法。
Bioinformatics. 2022 Jan 27;38(4):1052-1058. doi: 10.1093/bioinformatics/btab766.