• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

贝叶斯线性回归模型在复杂疾病基因集优先级中的评价。

Evaluation of Bayesian Linear Regression models for gene set prioritization in complex diseases.

机构信息

Center for Quantitative Genetics and Genomics, Aarhus University, Aarhus, Denmark.

Department of Biomedicine, Aarhus University, Aarhus, Denmark.

出版信息

PLoS Genet. 2024 Nov 4;20(11):e1011463. doi: 10.1371/journal.pgen.1011463. eCollection 2024 Nov.

DOI:10.1371/journal.pgen.1011463
PMID:39495786
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11563439/
Abstract

Genome-wide association studies (GWAS) provide valuable insights into the genetic architecture of complex traits, yet interpreting their results remains challenging due to the polygenic nature of most traits. Gene set analysis offers a solution by aggregating genetic variants into biologically relevant pathways, enhancing the detection of coordinated effects across multiple genes. In this study, we present and evaluate a gene set prioritization approach utilizing Bayesian Linear Regression (BLR) models to uncover shared genetic components among different phenotypes and facilitate biological interpretation. Through extensive simulations and analyses of real traits, we demonstrate the efficacy of the BLR model in prioritizing pathways for complex traits. Simulation studies reveal insights into the model's performance under various scenarios, highlighting the impact of factors such as the number of causal genes, proportions of causal variants, heritability, and disease prevalence. Comparative analyses with MAGMA (Multi-marker Analysis of GenoMic Annotation) demonstrate BLR's superior performance, especially in highly overlapped gene sets. Application of both single-trait and multi-trait BLR models to real data, specifically GWAS summary data for type 2 diabetes (T2D) and related phenotypes, identifies significant associations with T2D-related pathways. Furthermore, comparison between single- and multi-trait BLR analyses highlights the superior performance of the multi-trait approach in identifying associated pathways, showcasing increased statistical power when analyzing multiple traits jointly. Additionally, enrichment analysis with integrated data from various public resources supports our results, confirming significant enrichment of diabetes-related genes within the top T2D pathways resulting from the multi-trait analysis. The BLR model's ability to handle diverse genomic features, perform regularization, conduct variable selection, and integrate information from multiple traits, genders, and ancestries demonstrates its utility in understanding the genetic architecture of complex traits. Our study provides insights into the potential of the BLR model to prioritize gene sets, offering a flexible framework applicable to various datasets. This model presents opportunities for advancing personalized medicine by exploring the genetic underpinnings of multifactorial traits.

摘要

全基因组关联研究(GWAS)为复杂性状的遗传结构提供了有价值的见解,但由于大多数性状的多基因性质,解释其结果仍然具有挑战性。基因集分析通过将遗传变异聚集到生物相关的途径中,增强了对多个基因之间协调效应的检测,提供了一种解决方案。在这项研究中,我们提出并评估了一种利用贝叶斯线性回归(BLR)模型对不同表型之间共享遗传成分进行优先排序的基因集方法,并促进了生物学解释。通过对真实性状的广泛模拟和分析,我们证明了 BLR 模型在优先排序复杂性状途径方面的有效性。模拟研究揭示了模型在各种情况下的性能洞察,强调了因果基因数量、因果变异比例、遗传率和疾病患病率等因素的影响。与 MAGMA(多标记分析基因注释)的比较分析表明,BLR 具有优越的性能,特别是在高度重叠的基因集中。将单性状和多性状 BLR 模型应用于真实数据,特别是 2 型糖尿病(T2D)和相关表型的 GWAS 汇总数据,确定了与 T2D 相关途径的显著关联。此外,单性状和多性状 BLR 分析之间的比较突出了多性状分析在识别相关途径方面的优越性能,表明在联合分析多个性状时具有更高的统计能力。此外,利用来自各种公共资源的综合数据进行富集分析支持了我们的结果,证实了多性状分析中糖尿病相关基因在顶级 T2D 途径中的显著富集。BLR 模型能够处理多种基因组特征、进行正则化、进行变量选择以及整合来自多个性状、性别和祖源的信息,展示了其在理解复杂性状遗传结构方面的实用性。我们的研究为 BLR 模型在优先排序基因集方面的潜力提供了见解,为各种数据集提供了一个灵活的框架。该模型为探索多因素性状的遗传基础,提供了推进个性化医学的机会。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/032c/11563439/9b4efe0066b2/pgen.1011463.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/032c/11563439/cda49d495215/pgen.1011463.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/032c/11563439/c18455b2a8fc/pgen.1011463.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/032c/11563439/44f6bb769003/pgen.1011463.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/032c/11563439/1f9c6c92b0a5/pgen.1011463.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/032c/11563439/d125a533954e/pgen.1011463.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/032c/11563439/e9a79585121b/pgen.1011463.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/032c/11563439/868619d8ba68/pgen.1011463.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/032c/11563439/846b5d805815/pgen.1011463.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/032c/11563439/9b4efe0066b2/pgen.1011463.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/032c/11563439/cda49d495215/pgen.1011463.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/032c/11563439/c18455b2a8fc/pgen.1011463.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/032c/11563439/44f6bb769003/pgen.1011463.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/032c/11563439/1f9c6c92b0a5/pgen.1011463.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/032c/11563439/d125a533954e/pgen.1011463.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/032c/11563439/e9a79585121b/pgen.1011463.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/032c/11563439/868619d8ba68/pgen.1011463.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/032c/11563439/846b5d805815/pgen.1011463.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/032c/11563439/9b4efe0066b2/pgen.1011463.g009.jpg

相似文献

1
Evaluation of Bayesian Linear Regression models for gene set prioritization in complex diseases.贝叶斯线性回归模型在复杂疾病基因集优先级中的评价。
PLoS Genet. 2024 Nov 4;20(11):e1011463. doi: 10.1371/journal.pgen.1011463. eCollection 2024 Nov.
2
How powerful are summary-based methods for identifying expression-trait associations under different genetic architectures?基于汇总数据的方法在不同遗传结构下识别表达性状关联的能力有多强?
Pac Symp Biocomput. 2018;23:228-239.
3
A Multiple-Trait Bayesian Lasso for Genome-Enabled Analysis and Prediction of Complex Traits.用于基于基因组的复杂性状分析与预测的多性状贝叶斯套索法
Genetics. 2020 Feb;214(2):305-331. doi: 10.1534/genetics.119.302934. Epub 2019 Dec 26.
4
Leveraging polygenic enrichments of gene features to predict genes underlying complex traits and diseases.利用基因特征的多基因富集来预测复杂性状和疾病的潜在基因。
Nat Genet. 2023 Aug;55(8):1267-1276. doi: 10.1038/s41588-023-01443-6. Epub 2023 Jul 13.
5
Probabilistic integration of transcriptome-wide association studies and colocalization analysis identifies key molecular pathways of complex traits.全转录组关联研究和共定位分析的概率集成确定复杂性状的关键分子途径。
Am J Hum Genet. 2023 Jan 5;110(1):44-57. doi: 10.1016/j.ajhg.2022.12.002.
6
GUESS-ing polygenic associations with multiple phenotypes using a GPU-based evolutionary stochastic search algorithm.基于 GPU 的进化随机搜索算法对多种表型进行基因多效性关联推断。
PLoS Genet. 2013;9(8):e1003657. doi: 10.1371/journal.pgen.1003657. Epub 2013 Aug 8.
7
Improved functional mapping of complex trait heritability with GSA-MiXeR implicates biologically specific gene sets.利用 GSA-MiXeR 提高复杂性状遗传力的功能映射,提示了生物学特异性基因集。
Nat Genet. 2024 Jun;56(6):1310-1318. doi: 10.1038/s41588-024-01771-1. Epub 2024 Jun 3.
8
Investigation of multi-trait associations using pathway-based analysis of GWAS summary statistics.基于 GWAS 汇总统计数据的通路分析探究多性状关联。
BMC Genomics. 2019 Feb 4;20(Suppl 1):79. doi: 10.1186/s12864-018-5373-7.
9
Contrasting the Genetic Architecture of 30 Complex Traits from Summary Association Data.基于汇总关联数据对比30种复杂性状的遗传结构
Am J Hum Genet. 2016 Jul 7;99(1):139-53. doi: 10.1016/j.ajhg.2016.05.013. Epub 2016 Jun 23.
10
Haplotype function score improves biological interpretation and cross-ancestry polygenic prediction of human complex traits.单体型功能评分可改善人类复杂性状的生物学解释和跨血统多基因预测。
Elife. 2024 Apr 19;12:RP92574. doi: 10.7554/eLife.92574.

引用本文的文献

1
Physics-Informed Neural Networks with Unknown Partial Differential Equations: An Application in Multivariate Time Series.具有未知偏微分方程的物理信息神经网络:在多元时间序列中的应用
Entropy (Basel). 2025 Jun 26;27(7):682. doi: 10.3390/e27070682.
2
Prediction of Rice Plant Height Using Linear Regression Model by Pyramiding Plant Height-Related Alleles.利用株高相关等位基因聚合的线性回归模型预测水稻株高
Int J Mol Sci. 2025 Jun 28;26(13):6249. doi: 10.3390/ijms26136249.
3
Evaluation of Bayesian Linear Regression derived gene set test methods.

本文引用的文献

1
Improved functional mapping of complex trait heritability with GSA-MiXeR implicates biologically specific gene sets.利用 GSA-MiXeR 提高复杂性状遗传力的功能映射,提示了生物学特异性基因集。
Nat Genet. 2024 Jun;56(6):1310-1318. doi: 10.1038/s41588-024-01771-1. Epub 2024 Jun 3.
2
Expanded utility of the R package, qgg, with applications within genomic medicine.扩展 R 包 qgg 的实用功能,在基因组医学中应用。
Bioinformatics. 2023 Nov 1;39(11). doi: 10.1093/bioinformatics/btad656.
3
Type 1 and Type 2 Diabetes Mellitus: Commonalities, Differences and the Importance of Exercise and Nutrition.
贝叶斯线性回归衍生基因集检验方法的评估
BMC Genomics. 2024 Dec 23;25(1):1236. doi: 10.1186/s12864-024-11026-2.
1 型和 2 型糖尿病:共同之处、差异以及运动和营养的重要性。
Nutrients. 2023 Oct 7;15(19):4279. doi: 10.3390/nu15194279.
4
Histones: The critical players in innate immunity.组蛋白:先天免疫中的关键角色。
Front Immunol. 2022 Nov 21;13:1030610. doi: 10.3389/fimmu.2022.1030610. eCollection 2022.
5
Insights into the Genetics and Signaling Pathways in Maturity-Onset Diabetes of the Young.青少年起病的成年型糖尿病的遗传学与信号通路研究进展。
Int J Mol Sci. 2022 Oct 26;23(21):12910. doi: 10.3390/ijms232112910.
6
The NHGRI-EBI GWAS Catalog: knowledgebase and deposition resource.NHGRI-EBI GWAS 目录:知识库和存储资源。
Nucleic Acids Res. 2023 Jan 6;51(D1):D977-D985. doi: 10.1093/nar/gkac1010.
7
A saturated map of common genetic variants associated with human height.与人类身高相关的常见遗传变异的饱和图谱。
Nature. 2022 Oct;610(7933):704-712. doi: 10.1038/s41586-022-05275-y. Epub 2022 Oct 12.
8
Gene-environment correlations across geographic regions affect genome-wide association studies.基因-环境相关性在地理区域上的差异会影响全基因组关联研究。
Nat Genet. 2022 Sep;54(9):1345-1354. doi: 10.1038/s41588-022-01158-0. Epub 2022 Aug 22.
9
Serine biosynthesis as a novel therapeutic target for dilated cardiomyopathy.丝氨酸生物合成作为扩张型心肌病的一个新的治疗靶点。
Eur Heart J. 2022 Sep 21;43(36):3477-3489. doi: 10.1093/eurheartj/ehac305.
10
Mapping genomic loci implicates genes and synaptic biology in schizophrenia.基因组定位研究提示精神分裂症的发病与基因及突触生物学有关。
Nature. 2022 Apr;604(7906):502-508. doi: 10.1038/s41586-022-04434-5. Epub 2022 Apr 8.