• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过模拟蒸发冷却网络分析在基因关联研究中捕捉相互作用效应谱。

Capturing the spectrum of interaction effects in genetic association studies by simulated evaporative cooling network analysis.

作者信息

McKinney Brett A, Crowe James E, Guo Jingyu, Tian Dehua

机构信息

Department of Genetics, University of Alabama School of Medicine, Birmingham, AL, USA.

出版信息

PLoS Genet. 2009 Mar;5(3):e1000432. doi: 10.1371/journal.pgen.1000432. Epub 2009 Mar 20.

DOI:10.1371/journal.pgen.1000432
PMID:19300503
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2653647/
Abstract

Evidence from human genetic studies of several disorders suggests that interactions between alleles at multiple genes play an important role in influencing phenotypic expression. Analytical methods for identifying Mendelian disease genes are not appropriate when applied to common multigenic diseases, because such methods investigate association with the phenotype only one genetic locus at a time. New strategies are needed that can capture the spectrum of genetic effects, from Mendelian to multifactorial epistasis. Random Forests (RF) and Relief-F are two powerful machine-learning methods that have been studied as filters for genetic case-control data due to their ability to account for the context of alleles at multiple genes when scoring the relevance of individual genetic variants to the phenotype. However, when variants interact strongly, the independence assumption of RF in the tree node-splitting criterion leads to diminished importance scores for relevant variants. Relief-F, on the other hand, was designed to detect strong interactions but is sensitive to large backgrounds of variants that are irrelevant to classification of the phenotype, which is an acute problem in genome-wide association studies. To overcome the weaknesses of these data mining approaches, we develop Evaporative Cooling (EC) feature selection, a flexible machine learning method that can integrate multiple importance scores while removing irrelevant genetic variants. To characterize detailed interactions, we construct a genetic-association interaction network (GAIN), whose edges quantify the synergy between variants with respect to the phenotype. We use simulation analysis to show that EC is able to identify a wide range of interaction effects in genetic association data. We apply the EC filter to a smallpox vaccine cohort study of single nucleotide polymorphisms (SNPs) and infer a GAIN for a collection of SNPs associated with adverse events. Our results suggest an important role for hubs in SNP disease susceptibility networks. The software is available at (http://sites.google.com/site/McKinneyLab/software).

摘要

对多种疾病进行的人类遗传学研究证据表明,多个基因的等位基因之间的相互作用在影响表型表达方面起着重要作用。用于识别孟德尔疾病基因的分析方法应用于常见的多基因疾病时并不适用,因为此类方法每次仅研究一个基因座与表型的关联。需要新的策略来捕捉从孟德尔遗传到多因素上位性的遗传效应谱。随机森林(RF)和Relief-F是两种强大的机器学习方法,由于它们在对单个遗传变异与表型的相关性进行评分时能够考虑多个基因的等位基因背景,因此已被研究用作遗传病例对照数据的筛选方法。然而,当变异强烈相互作用时,RF在树节点分裂标准中的独立性假设会导致相关变异的重要性得分降低。另一方面,Relief-F旨在检测强相互作用,但对与表型分类无关的大量变异背景敏感,这在全基因组关联研究中是一个严重问题。为了克服这些数据挖掘方法的弱点,我们开发了蒸发冷却(EC)特征选择方法,这是一种灵活的机器学习方法,它可以整合多个重要性得分,同时去除不相关的遗传变异。为了表征详细的相互作用,我们构建了一个遗传关联相互作用网络(GAIN),其边量化了变异之间相对于表型的协同作用。我们通过模拟分析表明,EC能够识别遗传关联数据中的广泛相互作用效应。我们将EC筛选方法应用于一项关于单核苷酸多态性(SNP)的天花疫苗队列研究,并推断出与不良事件相关的一组SNP的GAIN。我们的结果表明枢纽在SNP疾病易感性网络中起着重要作用。该软件可在(http://sites.google.com/site/McKinneyLab/software)获取。

相似文献

1
Capturing the spectrum of interaction effects in genetic association studies by simulated evaporative cooling network analysis.通过模拟蒸发冷却网络分析在基因关联研究中捕捉相互作用效应谱。
PLoS Genet. 2009 Mar;5(3):e1000432. doi: 10.1371/journal.pgen.1000432. Epub 2009 Mar 20.
2
Genome-wide association data classification and SNPs selection using two-stage quality-based Random Forests.使用基于质量的两阶段随机森林进行全基因组关联数据分类和单核苷酸多态性选择。
BMC Genomics. 2015;16 Suppl 2(Suppl 2):S5. doi: 10.1186/1471-2164-16-S2-S5. Epub 2015 Jan 21.
3
ReliefSeq: a gene-wise adaptive-K nearest-neighbor feature selection tool for finding gene-gene interactions and main effects in mRNA-Seq gene expression data.ReliefSeq:一种基于基因的自适应K近邻特征选择工具,用于在mRNA序列基因表达数据中寻找基因-基因相互作用和主效应。
PLoS One. 2013 Dec 10;8(12):e81527. doi: 10.1371/journal.pone.0081527. eCollection 2013.
4
Surfing a genetic association interaction network to identify modulators of antibody response to smallpox vaccine.利用遗传关联交互网络来鉴定天花疫苗抗体反应的调节剂。
Genes Immun. 2010 Dec;11(8):630-6. doi: 10.1038/gene.2010.37. Epub 2010 Jul 8.
5
Differential privacy-based evaporative cooling feature selection and classification with relief-F and random forests.基于差分隐私的 Relief-F 和随机森林蒸发冷却特征选择与分类。
Bioinformatics. 2017 Sep 15;33(18):2906-2913. doi: 10.1093/bioinformatics/btx298.
6
Encore: Genetic Association Interaction Network centrality pipeline and application to SLE exome data.再分析:遗传关联相互作用网络中心性分析管道及其在系统性红斑狼疮外显子组数据中的应用。
Genet Epidemiol. 2013 Sep;37(6):614-21. doi: 10.1002/gepi.21739. Epub 2013 Jun 5.
7
Random forests on Hadoop for genome-wide association studies of multivariate neuroimaging phenotypes.基于 Hadoop 的随机森林在多变量神经影像学表型全基因组关联研究中的应用。
BMC Bioinformatics. 2013;14 Suppl 16(Suppl 16):S6. doi: 10.1186/1471-2105-14-S16-S6. Epub 2013 Oct 22.
8
Evaluating the ability of tree-based methods and logistic regression for the detection of SNP-SNP interaction.评估基于树的方法和逻辑回归检测单核苷酸多态性(SNP)-SNP相互作用的能力。
Ann Hum Genet. 2009 May;73(Pt 3):360-9. doi: 10.1111/j.1469-1809.2009.00511.x. Epub 2009 Mar 8.
9
A Bayesian model for detection of high-order interactions among genetic variants in genome-wide association studies.一种用于在全基因组关联研究中检测基因变异间高阶相互作用的贝叶斯模型。
BMC Genomics. 2015 Nov 25;16:1011. doi: 10.1186/s12864-015-2217-6.
10
The Integration of Epistasis Network and Functional Interactions in a GWAS Implicates RXR Pathway Genes in the Immune Response to Smallpox Vaccine.全基因组关联研究中上位性网络与功能相互作用的整合表明视黄酸X受体(RXR)信号通路基因在天花疫苗免疫反应中的作用
PLoS One. 2016 Aug 11;11(8):e0158016. doi: 10.1371/journal.pone.0158016. eCollection 2016.

引用本文的文献

1
Gene age gap estimate (GAGE) for major depressive disorder: A penalized biological age model using gene expression.重度抑郁症的基因年龄差距估计(GAGE):一种使用基因表达的惩罚性生物学年龄模型。
Neurobiol Aging. 2025 Jul;151:13-21. doi: 10.1016/j.neurobiolaging.2025.01.012. Epub 2025 Apr 1.
2
Gene Age Gap Estimate (GAGE) for major depressive disorder: a penalized biological age model using gene expression.重度抑郁症的基因年龄差距估计(GAGE):一种使用基因表达的惩罚性生物学年龄模型。
bioRxiv. 2024 Nov 17:2024.09.03.610913. doi: 10.1101/2024.09.03.610913.
3
Identifying microbial drivers in biological phenotypes with a Bayesian network regression model.

本文引用的文献

1
Genetic basis for adverse events after smallpox vaccination.天花疫苗接种后不良事件的遗传基础。
J Infect Dis. 2008 Jul 1;198(1):16-22. doi: 10.1086/588670.
2
Common sequence variants in the LOXL1 gene confer susceptibility to exfoliation glaucoma.赖氨酰氧化酶样蛋白1(LOXL1)基因中的常见序列变异会增加剥脱性青光眼的易感性。
Science. 2007 Sep 7;317(5843):1397-400. doi: 10.1126/science.1146554. Epub 2007 Aug 9.
3
Risk alleles for multiple sclerosis identified by a genomewide study.一项全基因组研究确定的多发性硬化症风险等位基因。
使用贝叶斯网络回归模型识别生物表型中的微生物驱动因素。
Ecol Evol. 2024 May 20;14(5):e11039. doi: 10.1002/ece3.11039. eCollection 2024 May.
4
NSPA: characterizing the disease association of multiple genetic interactions at single-subject resolution.NSPA:在单个体分辨率下表征多个基因相互作用的疾病关联。
Bioinform Adv. 2023 Feb 7;3(1):vbad010. doi: 10.1093/bioadv/vbad010. eCollection 2023.
5
Novel HLA associations with outcomes of Mycobacterium tuberculosis exposure and sarcoidosis in individuals of African ancestry using nearest-neighbor feature selection.利用最近邻特征选择研究非洲裔人群中结核分枝杆菌暴露和结节病结局的新型 HLA 相关性。
Genet Epidemiol. 2022 Oct;46(7):463-474. doi: 10.1002/gepi.22490. Epub 2022 Jun 14.
6
Nearest-Neighbor Projected Distance Regression for Epistasis Detection in GWAS With Population Structure Correction.用于在群体结构校正的全基因组关联研究中检测上位性的最近邻投影距离回归
Front Genet. 2020 Jul 22;11:784. doi: 10.3389/fgene.2020.00784. eCollection 2020.
7
Nearest-neighbor Projected-Distance Regression (NPDR) for detecting network interactions with adjustments for multiple tests and confounding.最近邻投影距离回归 (NPDR) 用于检测网络交互,同时调整多重检验和混杂因素。
Bioinformatics. 2020 May 1;36(9):2770-2777. doi: 10.1093/bioinformatics/btaa024.
8
Discovering genetic interactions bridging pathways in genome-wide association studies.发现全基因组关联研究中连接途径的遗传相互作用。
Nat Commun. 2019 Sep 19;10(1):4274. doi: 10.1038/s41467-019-12131-7.
9
Multi-Level Model to Predict Antibody Response to Influenza Vaccine Using Gene Expression Interaction Network Feature Selection.使用基因表达相互作用网络特征选择预测流感疫苗抗体反应的多层次模型
Microorganisms. 2019 Mar 14;7(3):79. doi: 10.3390/microorganisms7030079.
10
Integrated machine learning pipeline for aberrant biomarker enrichment (i-mAB): characterizing clusters of differentiation within a compendium of systemic lupus erythematosus patients.用于异常生物标志物富集的集成机器学习管道(i-mAB):表征系统性红斑狼疮患者样本集中的分化簇
AMIA Annu Symp Proc. 2018 Dec 5;2018:1358-1367. eCollection 2018.
N Engl J Med. 2007 Aug 30;357(9):851-62. doi: 10.1056/NEJMoa073493. Epub 2007 Jul 29.
4
Variants conferring risk of atrial fibrillation on chromosome 4q25.位于4号染色体q25区域的增加心房颤动风险的变异体。
Nature. 2007 Jul 19;448(7151):353-7. doi: 10.1038/nature06007. Epub 2007 Jul 1.
5
Evaporative cooling feature selection for genotypic data involving interactions.针对涉及相互作用的基因型数据的蒸发冷却特征选择
Bioinformatics. 2007 Aug 15;23(16):2113-20. doi: 10.1093/bioinformatics/btm317. Epub 2007 Jun 22.
6
GAB2 alleles modify Alzheimer's risk in APOE epsilon4 carriers.GAB2等位基因改变APOE ε4携带者患阿尔茨海默病的风险。
Neuron. 2007 Jun 7;54(5):713-20. doi: 10.1016/j.neuron.2007.05.022.
7
A genome-wide association study identifies alleles in FGFR2 associated with risk of sporadic postmenopausal breast cancer.一项全基因组关联研究确定了FGFR2基因中的等位基因与散发性绝经后乳腺癌风险相关。
Nat Genet. 2007 Jul;39(7):870-4. doi: 10.1038/ng2075. Epub 2007 May 27.
8
Penalized logistic regression for detecting gene interactions.用于检测基因相互作用的惩罚逻辑回归
Biostatistics. 2008 Jan;9(1):30-50. doi: 10.1093/biostatistics/kxm010. Epub 2007 Apr 11.
9
Detection of gene x gene interactions in genome-wide association studies of human population data.在人类群体数据的全基因组关联研究中检测基因与基因的相互作用
Hum Hered. 2007;63(2):67-84. doi: 10.1159/000099179. Epub 2007 Feb 2.
10
Data simulation software for whole-genome association and other studies in human genetics.用于全基因组关联研究及人类遗传学其他研究的数据模拟软件。
Pac Symp Biocomput. 2006:499-510.