• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

全基因组关联研究中的机器学习。

Machine learning in genome-wide association studies.

机构信息

Institut für Medizinische Biometrie und Statistik, Universität zu Lübeck, Lübeck, Germany.

出版信息

Genet Epidemiol. 2009;33 Suppl 1:S51-7. doi: 10.1002/gepi.20473.

DOI:10.1002/gepi.20473
PMID:19924717
Abstract

Recently, genome-wide association studies have substantially expanded our knowledge about genetic variants that influence the susceptibility to complex diseases. Although standard statistical tests for each single-nucleotide polymorphism (SNP) separately are able to capture main genetic effects, different approaches are necessary to identify SNPs that influence disease risk jointly or in complex interactions. Experimental and simulated genome-wide SNP data provided by the Genetic Analysis Workshop 16 afforded an opportunity to analyze the applicability and benefit of several machine learning methods. Penalized regression, ensemble methods, and network analyses resulted in several new findings while known and simulated genetic risk variants were also identified. In conclusion, machine learning approaches are promising complements to standard single-and multi-SNP analysis methods for understanding the overall genetic architecture of complex human diseases. However, because they are not optimized for genome-wide SNP data, improved implementations and new variable selection procedures are required.

摘要

最近,全基因组关联研究大大扩展了我们对影响复杂疾病易感性的遗传变异的认识。虽然单独对每个单核苷酸多态性(SNP)进行标准统计检验能够捕捉主要的遗传效应,但需要采用不同的方法来识别共同或复杂相互作用影响疾病风险的 SNPs。遗传分析研讨会 16 提供的实验和模拟全基因组 SNP 数据为分析几种机器学习方法的适用性和益处提供了机会。惩罚回归、集成方法和网络分析产生了一些新的发现,同时也确定了已知和模拟的遗传风险变异。总之,机器学习方法是理解复杂人类疾病整体遗传结构的标准单 SNP 和多 SNP 分析方法的有前途的补充。然而,由于它们不是针对全基因组 SNP 数据进行优化的,因此需要改进实现和新的变量选择过程。

相似文献

1
Machine learning in genome-wide association studies.全基因组关联研究中的机器学习。
Genet Epidemiol. 2009;33 Suppl 1:S51-7. doi: 10.1002/gepi.20473.
2
Data mining, neural nets, trees--problems 2 and 3 of Genetic Analysis Workshop 15.数据挖掘、神经网络、决策树——遗传分析研讨会15的问题2和问题3。
Genet Epidemiol. 2007;31 Suppl 1:S51-60. doi: 10.1002/gepi.20280.
3
Bayesian variable and model selection methods for genetic association studies.用于基因关联研究的贝叶斯变量与模型选择方法。
Genet Epidemiol. 2009 Jan;33(1):27-37. doi: 10.1002/gepi.20353.
4
Machine learning approaches for the discovery of gene-gene interactions in disease data.机器学习方法在疾病数据中发现基因-基因相互作用。
Brief Bioinform. 2013 Mar;14(2):251-60. doi: 10.1093/bib/bbs024. Epub 2012 May 18.
5
Genotype distribution-based inference of collective effects in genome-wide association studies: insights to age-related macular degeneration disease mechanism.基于基因型分布推断全基因组关联研究中的集体效应:对年龄相关性黄斑变性疾病机制的见解
BMC Genomics. 2016 Aug 30;17(1):695. doi: 10.1186/s12864-016-2871-3.
6
How powerful are summary-based methods for identifying expression-trait associations under different genetic architectures?基于汇总数据的方法在不同遗传结构下识别表达性状关联的能力有多强?
Pac Symp Biocomput. 2018;23:228-239.
7
A deep hybrid model to detect multi-locus interacting SNPs in the presence of noise.一种用于在噪声存在的情况下检测多基因座相互作用 SNP 的深度混合模型。
Int J Med Inform. 2018 Nov;119:134-151. doi: 10.1016/j.ijmedinf.2018.09.003. Epub 2018 Sep 15.
8
[Current status of SNPs interaction in genome-wide association study].[全基因组关联研究中SNP相互作用的现状]
Yi Chuan. 2011 Sep;33(9):901-10. doi: 10.3724/sp.j.1005.2011.00901.
9
Ultrahigh-dimensional variable selection method for whole-genome gene-gene interaction analysis.超高维全基因组基因-基因交互分析的变量选择方法。
BMC Bioinformatics. 2012 May 3;13:72. doi: 10.1186/1471-2105-13-72.
10
Comparison of approaches for machine-learning optimization of neural networks for detecting gene-gene interactions in genetic epidemiology.遗传流行病学中用于检测基因-基因相互作用的神经网络机器学习优化方法的比较。
Genet Epidemiol. 2008 May;32(4):325-40. doi: 10.1002/gepi.20307.

引用本文的文献

1
Advancing precision oncology with AI-powered genomic analysis.通过人工智能驱动的基因组分析推动精准肿瘤学发展。
Front Pharmacol. 2025 Apr 30;16:1591696. doi: 10.3389/fphar.2025.1591696. eCollection 2025.
2
Learning genotype-phenotype associations from gaps in multi-species sequence alignments.从多物种序列比对的缺口处学习基因型-表型关联。
Brief Bioinform. 2024 Nov 22;26(1). doi: 10.1093/bib/bbaf022.
3
Identification of Parkinson's disease using MRI and genetic data from the PPMI cohort: an improved machine learning fusion approach.
利用帕金森病标志物倡议(PPMI)队列的MRI和基因数据识别帕金森病:一种改进的机器学习融合方法。
Front Aging Neurosci. 2025 Feb 4;17:1510192. doi: 10.3389/fnagi.2025.1510192. eCollection 2025.
4
Genome-wide association study (GWAS) uncovers candidate genes linked to the germination performance of bread wheat (Triticum aestivum L.) under salt stress.全基因组关联研究(GWAS)揭示了与盐胁迫下面包小麦(Triticum aestivum L.)发芽性能相关的候选基因。
BMC Genomics. 2025 Jan 6;26(1):5. doi: 10.1186/s12864-024-11188-z.
5
GWAS for identification of genomic regions and candidate genes in vegetable crops.GWAS 用于鉴定蔬菜作物中的基因组区域和候选基因。
Funct Integr Genomics. 2024 Oct 29;24(6):203. doi: 10.1007/s10142-024-01477-x.
6
Integrated Assays of Genome-Wide Association Study, Multi-Omics Co-Localization, and Machine Learning Associated Calcium Signaling Genes with Oilseed Rape Resistance to .全基因组关联研究、多组学共定位和机器学习综合分析与油菜籽抗 . 相关的钙信号基因
Int J Mol Sci. 2024 Jun 25;25(13):6932. doi: 10.3390/ijms25136932.
7
Reviewing the essential roles of remote phenotyping, GWAS and explainable AI in practical marker-assisted selection for drought-tolerant winter wheat breeding.回顾远程表型分析、全基因组关联研究(GWAS)以及可解释人工智能在耐旱冬小麦育种实际标记辅助选择中的重要作用。
Front Plant Sci. 2024 Apr 18;15:1319938. doi: 10.3389/fpls.2024.1319938. eCollection 2024.
8
Unravelling the Genetic Landscape of Hemiplegic Migraine: Exploring Innovative Strategies and Emerging Approaches.解析偏瘫性偏头痛的遗传图谱:探索创新策略与新兴方法。
Genes (Basel). 2024 Mar 31;15(4):443. doi: 10.3390/genes15040443.
9
Logistic regression and other statistical tools in diagnostic biomarker studies.逻辑回归和其他统计工具在诊断生物标志物研究中的应用。
Clin Transl Oncol. 2024 Sep;26(9):2172-2180. doi: 10.1007/s12094-024-03413-8. Epub 2024 Mar 26.
10
Application of SVR-Mediated GWAS for Identification of Durable Genetic Regions Associated with Soybean Seed Quality Traits.应用支持向量回归介导的全基因组关联研究鉴定与大豆种子品质性状相关的持久遗传区域
Plants (Basel). 2023 Jul 16;12(14):2659. doi: 10.3390/plants12142659.