• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种用于高维数量性状基因座发现的计算高效的贝叶斯看似不相关回归模型。

A computationally efficient Bayesian seemingly unrelated regressions model for high-dimensional quantitative trait loci discovery.

作者信息

Bottolo Leonardo, Banterle Marco, Richardson Sylvia, Ala-Korpela Mika, Järvelin Marjo-Riitta, Lewin Alex

机构信息

Department of Medical Genetics, University of Cambridge, Cambridge, UK.

The Alan Turing Institute, London, UK.

出版信息

J R Stat Soc Ser C Appl Stat. 2021 Aug;70(4):886-908. doi: 10.1111/rssc.12490. Epub 2021 May 8.

DOI:10.1111/rssc.12490
PMID:35001978
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7612194/
Abstract

Our work is motivated by the search for metabolite quantitative trait loci (QTL) in a cohort of more than 5000 people. There are 158 metabolites measured by NMR spectroscopy in the 31-year follow-up of the Northern Finland Birth Cohort 1966 (NFBC66). These metabolites, as with many multivariate phenotypes produced by high-throughput biomarker technology, exhibit strong correlation structures. Existing approaches for combining such data with genetic variants for multivariate QTL analysis generally ignore phenotypic correlations or make restrictive assumptions about the associations between phenotypes and genetic loci. We present a computationally efficient Bayesian seemingly unrelated regressions model for high-dimensional data, with cell-sparse variable selection and sparse graphical structure for covariance selection. Cell sparsity allows different phenotype responses to be associated with different genetic predictors and the graphical structure is used to represent the conditional dependencies between phenotype variables. To achieve feasible computation of the large model space, we exploit a factorisation of the covariance matrix. Applying the model to the NFBC66 data with 9000 directly genotyped single nucleotide polymorphisms, we are able to simultaneously estimate genotype-phenotype associations and the residual dependence structure among the metabolites. The R package BayesSUR with full documentation is available at https://cran.r-project.org/web/packages/BayesSUR/.

摘要

我们的工作旨在对5000多人的队列进行代谢物数量性状基因座(QTL)研究。在对1966年芬兰北部出生队列(NFBC66)进行的31年随访中,通过核磁共振波谱法测定了158种代谢物。这些代谢物与高通量生物标志物技术产生的许多多变量表型一样,呈现出很强的相关结构。现有的将此类数据与遗传变异相结合进行多变量QTL分析的方法通常会忽略表型相关性,或者对表型与基因座之间的关联做出限制性假设。我们提出了一种用于高维数据的计算高效的贝叶斯看似不相关回归模型,该模型具有细胞稀疏变量选择和用于协方差选择的稀疏图形结构。细胞稀疏性允许不同的表型反应与不同的遗传预测因子相关联,并且图形结构用于表示表型变量之间的条件依赖性。为了实现对大型模型空间的可行计算,我们利用了协方差矩阵的分解。将该模型应用于具有9000个直接基因分型单核苷酸多态性的NFBC66数据,我们能够同时估计基因型与表型的关联以及代谢物之间的残余依赖性结构。带有完整文档的R包BayesSUR可在https://cran.r-project.org/web/packages/BayesSUR/获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c317/7612194/9707f4739a41/EMS140622-f005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c317/7612194/444bad70af2f/EMS140622-f001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c317/7612194/6d6a7fd661c3/EMS140622-f002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c317/7612194/c02e498a34ac/EMS140622-f003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c317/7612194/a5311862733d/EMS140622-f004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c317/7612194/9707f4739a41/EMS140622-f005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c317/7612194/444bad70af2f/EMS140622-f001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c317/7612194/6d6a7fd661c3/EMS140622-f002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c317/7612194/c02e498a34ac/EMS140622-f003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c317/7612194/a5311862733d/EMS140622-f004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c317/7612194/9707f4739a41/EMS140622-f005.jpg

相似文献

1
A computationally efficient Bayesian seemingly unrelated regressions model for high-dimensional quantitative trait loci discovery.一种用于高维数量性状基因座发现的计算高效的贝叶斯看似不相关回归模型。
J R Stat Soc Ser C Appl Stat. 2021 Aug;70(4):886-908. doi: 10.1111/rssc.12490. Epub 2021 May 8.
2
Joint high-dimensional Bayesian variable and covariance selection with an application to eQTL analysis.联合高维贝叶斯变量与协方差选择及其在表达数量性状位点分析中的应用
Biometrics. 2013 Jun;69(2):447-57. doi: 10.1111/biom.12021. Epub 2013 Apr 22.
3
Bayesian quantitative trait loci mapping for multiple traits.多性状的贝叶斯数量性状基因座定位
Genetics. 2008 Aug;179(4):2275-89. doi: 10.1534/genetics.108.088427. Epub 2008 Aug 9.
4
R/qtlbim: QTL with Bayesian Interval Mapping in experimental crosses.R/qtlbim:实验杂交中用于贝叶斯区间定位的数量性状基因座
Bioinformatics. 2007 Mar 1;23(5):641-3. doi: 10.1093/bioinformatics/btm011. Epub 2007 Jan 19.
5
Multibreed genomic prediction using multitrait genomic residual maximum likelihood and multitask Bayesian variable selection.多品种基因组预测使用多性状基因组残差极大似然法和多任务贝叶斯变量选择。
J Dairy Sci. 2018 May;101(5):4279-4294. doi: 10.3168/jds.2017-13366. Epub 2018 Mar 15.
6
A Multiple-Trait Bayesian Lasso for Genome-Enabled Analysis and Prediction of Complex Traits.用于基于基因组的复杂性状分析与预测的多性状贝叶斯套索法
Genetics. 2020 Feb;214(2):305-331. doi: 10.1534/genetics.119.302934. Epub 2019 Dec 26.
7
A genome scan for quantitative trait loci influencing carcass, post-natal growth and reproductive traits in commercial Angus cattle.对影响商业安格斯牛胴体、产后生长和繁殖性状的数量性状基因座进行全基因组扫描。
Anim Genet. 2010 Dec;41(6):597-607. doi: 10.1111/j.1365-2052.2010.02063.x.
8
A Gaussian process model and Bayesian variable selection for mapping function-valued quantitative traits with incomplete phenotypic data.一种用于映射具有不完全表型数据的函数型数量性状的高斯过程模型和贝叶斯变量选择方法。
Bioinformatics. 2019 Oct 1;35(19):3684-3692. doi: 10.1093/bioinformatics/btz164.
9
Bayesian estimation of genetic parameters for multivariate threshold and continuous phenotypes and molecular genetic data in simulated horse populations using Gibbs sampling.使用吉布斯抽样对模拟马种群中的多变量阈值和连续表型以及分子遗传数据进行遗传参数的贝叶斯估计。
BMC Genet. 2007 May 9;8:19. doi: 10.1186/1471-2156-8-19.
10
A Genomic Bayesian Multi-trait and Multi-environment Model.一种基因组贝叶斯多性状多环境模型。
G3 (Bethesda). 2016 Sep 8;6(9):2725-44. doi: 10.1534/g3.116.032359.

引用本文的文献

1
A Bayesian multivariate hierarchical model for developing a treatment benefit index using mixed types of outcomes.利用混合类型结局构建治疗获益指数的贝叶斯多元层次模型。
BMC Med Res Methodol. 2024 Sep 27;24(1):218. doi: 10.1186/s12874-024-02333-z.
2
Inferring personal intake recommendations of phosphorous and potassium for end-stage renal failure patients by simulating with Bayesian hierarchical multivariate model.利用贝叶斯分层多元模型模拟推断终末期肾衰竭患者的磷和钾个人摄入推荐量。
PLoS One. 2024 Feb 6;19(2):e0291153. doi: 10.1371/journal.pone.0291153. eCollection 2024.
3
Improving Individualized Treatment Decisions: A Bayesian Multivariate Hierarchical Model for Developing a Treatment Benefit Index using Mixed Types of Outcomes.

本文引用的文献

1
Bayesian Variable Selection for Gaussian copula regression models.高斯Copula回归模型的贝叶斯变量选择
J Comput Graph Stat. 2020 Dec 10;30(3):578-593. doi: 10.1080/10618600.2020.1840997.
2
A Global-Local Approach for Detecting Hotspots in Multiple-Response Regression.一种用于检测多响应回归中热点的全局-局部方法。
Ann Appl Stat. 2020 Jun;14(2):905-928. doi: 10.1214/20-AOAS1332. Epub 2020 Jun 29.
3
EPISPOT: An epigenome-driven approach for detecting and interpreting hotspots in molecular QTL studies.EPISPOT:一种基于表观基因组的方法,用于检测和解释分子 QTL 研究中的热点。
改善个性化治疗决策:一种使用混合类型结果开发治疗效益指数的贝叶斯多变量层次模型
medRxiv. 2024 Jan 7:2023.11.17.23298711. doi: 10.1101/2023.11.17.23298711.
4
Fast and flexible joint fine-mapping of multiple traits via the Sum of Single Effects model.通过单效应模型的总和实现多个性状的快速灵活联合精细定位。
bioRxiv. 2024 Jun 18:2023.04.14.536893. doi: 10.1101/2023.04.14.536893.
Am J Hum Genet. 2021 Jun 3;108(6):983-1000. doi: 10.1016/j.ajhg.2021.04.010. Epub 2021 May 1.
4
Spatial disease mapping using directed acyclic graph auto-regressive (DAGAR) models.使用有向无环图自回归(DAGAR)模型进行空间疾病映射。
Bayesian Anal. 2019 Dec;14(4):1221-1244. doi: 10.1214/19-ba1177. Epub 2019 Oct 3.
5
Quantitative Serum Nuclear Magnetic Resonance Metabolomics in Large-Scale Epidemiology: A Primer on -Omic Technologies.大规模流行病学中的定量血清核磁共振代谢组学:“组学”技术入门
Am J Epidemiol. 2017 Nov 1;186(9):1084-1096. doi: 10.1093/aje/kwx016.
6
MWASTools: an R/bioconductor package for metabolome-wide association studies.MWASTools:一个用于代谢组学全基因组关联研究的 R/bioconductor 包。
Bioinformatics. 2018 Mar 1;34(5):890-892. doi: 10.1093/bioinformatics/btx477.
7
Efficient inference for genetic association studies with multiple outcomes.多结局遗传关联研究的高效推断
Biostatistics. 2017 Oct 1;18(4):618-636. doi: 10.1093/biostatistics/kxx007.
8
metaCCA: summary statistics-based multivariate meta-analysis of genome-wide association studies using canonical correlation analysis.metaCCA:基于全基因组关联研究汇总统计量,运用典型相关分析的多变量荟萃分析。
Bioinformatics. 2016 Jul 1;32(13):1981-9. doi: 10.1093/bioinformatics/btw052. Epub 2016 Feb 19.
9
Genome-wide study for circulating metabolites identifies 62 loci and reveals novel systemic effects of LPA.全基因组循环代谢物研究鉴定出62个基因座并揭示溶血磷脂酸的新系统效应。
Nat Commun. 2016 Mar 23;7:11122. doi: 10.1038/ncomms11122.
10
MT-HESS: an efficient Bayesian approach for simultaneous association detection in OMICS datasets, with application to eQTL mapping in multiple tissues.MT - HESS:一种用于在组学数据集中同时进行关联检测的高效贝叶斯方法,并应用于多个组织中的表达定量性状位点(eQTL)定位
Bioinformatics. 2016 Feb 15;32(4):523-32. doi: 10.1093/bioinformatics/btv568. Epub 2015 Oct 26.