• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

全基因组关联研究中的生物信息学挑战。

Bioinformatics challenges for genome-wide association studies.

机构信息

Department of Genetics, Department of Community and Family Medicine, Dartmouth Medical School, Lebanon, NH 03756, USA.

出版信息

Bioinformatics. 2010 Feb 15;26(4):445-55. doi: 10.1093/bioinformatics/btp713. Epub 2010 Jan 6.

DOI:10.1093/bioinformatics/btp713
PMID:20053841
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2820680/
Abstract

The sequencing of the human genome has made it possible to identify an informative set of >1 million single nucleotide polymorphisms (SNPs) across the genome that can be used to carry out genome-wide association studies (GWASs). The availability of massive amounts of GWAS data has necessitated the development of new biostatistical methods for quality control, imputation and analysis issues including multiple testing. This work has been successful and has enabled the discovery of new associations that have been replicated in multiple studies. However, it is now recognized that most SNPs discovered via GWAS have small effects on disease susceptibility and thus may not be suitable for improving health care through genetic testing. One likely explanation for the mixed results of GWAS is that the current biostatistical analysis paradigm is by design agnostic or unbiased in that it ignores all prior knowledge about disease pathobiology. Further, the linear modeling framework that is employed in GWAS often considers only one SNP at a time thus ignoring their genomic and environmental context. There is now a shift away from the biostatistical approach toward a more holistic approach that recognizes the complexity of the genotype-phenotype relationship that is characterized by significant heterogeneity and gene-gene and gene-environment interaction. We argue here that bioinformatics has an important role to play in addressing the complexity of the underlying genetic basis of common human diseases. The goal of this review is to identify and discuss those GWAS challenges that will require computational methods.

摘要

人类基因组测序使得在整个基因组中鉴定出 >100 万个有信息量的单核苷酸多态性(SNP)成为可能,这些 SNP 可用于进行全基因组关联研究(GWAS)。大量 GWAS 数据的出现,需要开发新的生物统计学方法来解决质量控制、缺失值推断和分析问题,包括多重检验。这项工作取得了成功,并发现了新的关联,这些关联在多个研究中得到了复制。然而,现在人们认识到,通过 GWAS 发现的大多数 SNP 对疾病易感性的影响很小,因此可能不适合通过基因测试来改善医疗保健。GWAS 结果喜忧参半的一个可能解释是,当前的生物统计学分析范式在设计上是不可知或无偏的,因为它忽略了所有关于疾病病理生物学的先验知识。此外,GWAS 中使用的线性建模框架通常一次只考虑一个 SNP,从而忽略了它们的基因组和环境背景。现在,人们正在从生物统计学方法向更全面的方法转变,这种方法认识到基因型-表型关系的复杂性,其特征是显著的异质性以及基因-基因和基因-环境相互作用。我们在这里认为,生物信息学在解决常见人类疾病潜在遗传基础的复杂性方面发挥着重要作用。本综述的目的是确定并讨论那些需要计算方法的 GWAS 挑战。

相似文献

1
Bioinformatics challenges for genome-wide association studies.全基因组关联研究中的生物信息学挑战。
Bioinformatics. 2010 Feb 15;26(4):445-55. doi: 10.1093/bioinformatics/btp713. Epub 2010 Jan 6.
2
Gene, pathway and network frameworks to identify epistatic interactions of single nucleotide polymorphisms derived from GWAS data.用于识别源自全基因组关联研究(GWAS)数据的单核苷酸多态性上位性相互作用的基因、通路和网络框架。
BMC Syst Biol. 2012;6 Suppl 3(Suppl 3):S15. doi: 10.1186/1752-0509-6-S3-S15. Epub 2012 Dec 17.
3
GWAS Integrator: a bioinformatics tool to explore human genetic associations reported in published genome-wide association studies.GWAS 整合器:一种生物信息学工具,用于探索已发表的全基因组关联研究中报告的人类遗传关联。
Eur J Hum Genet. 2011 Oct;19(10):1095-9. doi: 10.1038/ejhg.2011.91. Epub 2011 May 25.
4
Genome-wide association data classification and SNPs selection using two-stage quality-based Random Forests.使用基于质量的两阶段随机森林进行全基因组关联数据分类和单核苷酸多态性选择。
BMC Genomics. 2015;16 Suppl 2(Suppl 2):S5. doi: 10.1186/1471-2164-16-S2-S5. Epub 2015 Jan 21.
5
Multi-locus test conditional on confirmed effects leads to increased power in genome-wide association studies.多基因座条件检验导致全基因组关联研究的功效增加。
PLoS One. 2010 Nov 16;5(11):e15006. doi: 10.1371/journal.pone.0015006.
6
SNPranker 2.0: a gene-centric data mining tool for diseases associated SNP prioritization in GWAS.SNPranker 2.0:一种针对 GWAS 中疾病相关 SNP 优先级排序的基于基因的数据分析工具。
BMC Bioinformatics. 2013;14 Suppl 1(Suppl 1):S9. doi: 10.1186/1471-2105-14-S1-S9. Epub 2013 Jan 14.
7
GWAS analyzer: integrating genotype, phenotype and public annotation data for genome-wide association study analysis.GWAS 分析器:用于全基因组关联研究分析的基因型、表型和公共注释数据的整合。
Bioinformatics. 2010 Feb 15;26(4):560-4. doi: 10.1093/bioinformatics/btp714. Epub 2010 Jan 6.
8
Genome-wide association studies--data generation, storage, interpretation, and bioinformatics.全基因组关联研究——数据生成、存储、解释和生物信息学。
J Cardiovasc Transl Res. 2010 Jun;3(3):183-8. doi: 10.1007/s12265-010-9181-y. Epub 2010 Apr 8.
9
Translating genome wide association study results to associations among common diseases: in silico study with an electronic medical record.将全基因组关联研究结果转化为常见疾病之间的关联:基于电子病历的计算机模拟研究。
Int J Med Inform. 2013 Sep;82(9):864-74. doi: 10.1016/j.ijmedinf.2013.05.003. Epub 2013 Jun 3.
10
Biostatistical aspects of genome-wide association studies.全基因组关联研究的生物统计学方面
Biom J. 2008 Feb;50(1):8-28. doi: 10.1002/bimj.200710398.

引用本文的文献

1
SAGA (Simplified Association Genomewide Analyses): a user-friendly Pipeline to Democratize Genome-Wide Association Studies.SAGA(简化全基因组关联分析):一种使全基因组关联研究普及化的用户友好型流程。
bioRxiv. 2025 Aug 29:2025.08.25.672146. doi: 10.1101/2025.08.25.672146.
2
OsLB2.2 negatively regulates rice disease resistance at seedling stage in rice.OsLB2.2在水稻幼苗期对水稻抗病性起负调控作用。
Front Plant Sci. 2025 Aug 11;16:1629283. doi: 10.3389/fpls.2025.1629283. eCollection 2025.
3
Personalized Nutrition Strategies for Patients in the Intensive Care Unit: A Narrative Review on the Future of Critical Care Nutrition.

本文引用的文献

1
Sensible Initialization Using Expert Knowledge for Genome-Wide Analysis of Epistasis Using Genetic Programming.利用专家知识进行明智初始化,通过遗传编程对上位性进行全基因组分析
Genet Evol Comput Conf. 2009 May 18;2009:1289-1296. doi: 10.1109/CEC.2009.4983093.
2
Role for protein-protein interaction databases in human genetics.蛋白质-蛋白质相互作用数据库在人类遗传学中的作用。
Expert Rev Proteomics. 2009 Dec;6(6):647-59. doi: 10.1586/epr.09.86.
3
Enabling personal genomics with an explicit test of epistasis.通过明确的上位性检验实现个人基因组学。
重症监护病房患者的个性化营养策略:关于重症监护营养未来的叙述性综述
Nutrients. 2025 May 13;17(10):1659. doi: 10.3390/nu17101659.
4
Genetically Transitional Disease and the Road to Personalized Medicine.基因过渡性疾病与个性化医疗之路
Genes (Basel). 2025 Mar 30;16(4):401. doi: 10.3390/genes16040401.
5
Rethinking GWAS: how lessons from genetic screens and artificial intelligence could reveal biological mechanisms.重新审视全基因组关联研究:基因筛选和人工智能的经验教训如何揭示生物学机制。
Bioinformatics. 2025 Mar 29;41(4). doi: 10.1093/bioinformatics/btaf153.
6
How "Omics" Studies Contribute to a Better Understanding of Fuchs' Endothelial Corneal Dystrophy.“组学”研究如何有助于更好地理解富克斯角膜内皮营养不良。
Curr Issues Mol Biol. 2025 Feb 20;47(3):135. doi: 10.3390/cimb47030135.
7
Improving genetic variant identification for quantitative traits using ensemble learning-based approaches.使用基于集成学习的方法改进数量性状的遗传变异识别。
BMC Genomics. 2025 Mar 12;26(1):237. doi: 10.1186/s12864-025-11443-x.
8
The goldmine of GWAS summary statistics: a systematic review of methods and tools.全基因组关联研究汇总统计数据的宝库:方法与工具的系统综述
BioData Min. 2024 Sep 5;17(1):31. doi: 10.1186/s13040-024-00385-x.
9
Mental issues, internet addiction and quality of life predict burnout among Hungarian teachers: a machine learning analysis.精神问题、网络成瘾和生活质量预测匈牙利教师的倦怠:机器学习分析。
BMC Public Health. 2024 Aug 27;24(1):2322. doi: 10.1186/s12889-024-19797-9.
10
SlowMoMan: a web app for discovery of important features along user-drawn trajectories in 2D embeddings.慢动作人:一款用于在二维嵌入中沿着用户绘制的轨迹发现重要特征的网络应用程序。
Bioinform Adv. 2024 Jun 21;4(1):vbae095. doi: 10.1093/bioadv/vbae095. eCollection 2024.
Pac Symp Biocomput. 2010:327-36. doi: 10.1142/9789814295291_0035.
4
INTERSNP: genome-wide interaction analysis guided by a priori information.基于先验信息的全基因组交互分析
Bioinformatics. 2009 Dec 15;25(24):3275-81. doi: 10.1093/bioinformatics/btp596. Epub 2009 Oct 16.
5
Finding the missing heritability of complex diseases.寻找复杂疾病中缺失的遗传力。
Nature. 2009 Oct 8;461(7265):747-53. doi: 10.1038/nature08494.
6
Spatially uniform relieff (SURF) for computationally-efficient filtering of gene-gene interactions.用于计算高效过滤基因-基因相互作用的空间均匀化滤波器(SURF)。
BioData Min. 2009 Sep 22;2(1):5. doi: 10.1186/1756-0381-2-5.
7
Detecting purely epistatic multi-locus interactions by an omnibus permutation test on ensembles of two-locus analyses.通过对两基因座分析集合进行整体置换检验来检测纯上位多基因座相互作用。
BMC Bioinformatics. 2009 Sep 17;10:294. doi: 10.1186/1471-2105-10-294.
8
Epistasis and its implications for personal genetics.上位效应及其对个人遗传学的影响。
Am J Hum Genet. 2009 Sep;85(3):309-20. doi: 10.1016/j.ajhg.2009.08.006.
9
Polymorphisms in DNA repair genes, smoking, and bladder cancer risk: findings from the international consortium of bladder cancer.DNA修复基因多态性、吸烟与膀胱癌风险:来自国际膀胱癌联盟的研究结果
Cancer Res. 2009 Sep 1;69(17):6857-64. doi: 10.1158/0008-5472.CAN-09-1091. Epub 2009 Aug 25.
10
Replication by the Epistasis Project of the interaction between the genes for IL-6 and IL-10 in the risk of Alzheimer's disease.上位性项目对白细胞介素-6基因与白细胞介素-10基因相互作用在阿尔茨海默病风险中的复制研究。
J Neuroinflammation. 2009 Aug 23;6:22. doi: 10.1186/1742-2094-6-22.