• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种用于映射具有不完全表型数据的函数型数量性状的高斯过程模型和贝叶斯变量选择方法。

A Gaussian process model and Bayesian variable selection for mapping function-valued quantitative traits with incomplete phenotypic data.

机构信息

Department of Mathematics and Statistics and Organismal and Evolutionary Biology Research Programme, University of Helsinki, Helsinki, Finland.

CSIRO Agriculture & Food, GPO Box 1600, Canberra, ACT 2601, Australia.

出版信息

Bioinformatics. 2019 Oct 1;35(19):3684-3692. doi: 10.1093/bioinformatics/btz164.

DOI:10.1093/bioinformatics/btz164
PMID:30850830
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6761969/
Abstract

MOTIVATION

Recent advances in high dimensional phenotyping bring time as an extra dimension into the phenotypes. This promotes the quantitative trait locus (QTL) studies of function-valued traits such as those related to growth and development. Existing approaches for analyzing functional traits utilize either parametric methods or semi-parametric approaches based on splines and wavelets. However, very limited choices of software tools are currently available for practical implementation of functional QTL mapping and variable selection.

RESULTS

We propose a Bayesian Gaussian process (GP) approach for functional QTL mapping. We use GPs to model the continuously varying coefficients which describe how the effects of molecular markers on the quantitative trait are changing over time. We use an efficient gradient based algorithm to estimate the tuning parameters of GPs. Notably, the GP approach is directly applicable to the incomplete datasets having even larger than 50% missing data rate (among phenotypes). We further develop a stepwise algorithm to search through the model space in terms of genetic variants, and use a minimal increase of Bayesian posterior probability as a stopping rule to focus on only a small set of putative QTL. We also discuss the connection between GP and penalized B-splines and wavelets. On two simulated and three real datasets, our GP approach demonstrates great flexibility for modeling different types of phenotypic trajectories with low computational cost. The proposed model selection approach finds the most likely QTL reliably in tested datasets.

AVAILABILITY AND IMPLEMENTATION

Software and simulated data are available as a MATLAB package 'GPQTLmapping', and they can be downloaded from GitHub (https://github.com/jpvanhat/GPQTLmapping). Real datasets used in case studies are publicly available at QTL Archive.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

高维表型分析的最新进展将时间作为一个额外的维度引入到表型中。这促进了功能值性状(如与生长和发育相关的性状)的数量性状基因座(QTL)研究。现有的分析功能性状的方法要么利用参数方法,要么利用基于样条和小波的半参数方法。然而,目前用于功能 QTL 映射和变量选择的实用实现的软件工具选择非常有限。

结果

我们提出了一种用于功能 QTL 映射的贝叶斯高斯过程(GP)方法。我们使用 GPs 来建模描述分子标记对定量性状的影响随时间变化的连续变化系数。我们使用有效的基于梯度的算法来估计 GPs 的调整参数。值得注意的是,GP 方法可直接应用于具有超过 50%缺失数据率(在表型中)的不完整数据集。我们进一步开发了一种逐步算法,根据遗传变异搜索模型空间,并使用贝叶斯后验概率的最小增加作为停止规则,只关注一小部分假定的 QTL。我们还讨论了 GP 与惩罚 B 样条和小波之间的关系。在两个模拟数据集和三个真实数据集上,我们的 GP 方法以低计算成本展示了对不同类型的表型轨迹进行建模的极大灵活性。所提出的模型选择方法在测试数据集上可靠地找到了最可能的 QTL。

可用性和实现

软件和模拟数据作为 MATLAB 包“GPQTLmapping”提供,可从 GitHub(https://github.com/jpvanhat/GPQTLmapping)下载。案例研究中使用的真实数据集可在 QTL 档案中公开获得。

补充信息

补充数据可在生物信息学在线获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/536e/6761969/2315f5a03d9c/btz164f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/536e/6761969/f2b3311c2c7b/btz164f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/536e/6761969/014c864e3be9/btz164f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/536e/6761969/2d1bb723149f/btz164f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/536e/6761969/2315f5a03d9c/btz164f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/536e/6761969/f2b3311c2c7b/btz164f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/536e/6761969/014c864e3be9/btz164f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/536e/6761969/2d1bb723149f/btz164f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/536e/6761969/2315f5a03d9c/btz164f4.jpg

相似文献

1
A Gaussian process model and Bayesian variable selection for mapping function-valued quantitative traits with incomplete phenotypic data.一种用于映射具有不完全表型数据的函数型数量性状的高斯过程模型和贝叶斯变量选择方法。
Bioinformatics. 2019 Oct 1;35(19):3684-3692. doi: 10.1093/bioinformatics/btz164.
2
A Bayesian nonparametric approach for mapping dynamic quantitative traits.贝叶斯非参数方法用于映射动态定量性状。
Genetics. 2013 Aug;194(4):997-1016. doi: 10.1534/genetics.113.152736. Epub 2013 Jun 14.
3
Bayesian mapping of genomewide interacting quantitative trait loci for ordinal traits.用于有序性状的全基因组相互作用数量性状位点的贝叶斯定位
Genetics. 2007 Jul;176(3):1855-64. doi: 10.1534/genetics.107.071142. Epub 2007 May 16.
4
Mapping multiple quantitative trait Loci for ordinal traits.定位有序性状的多个数量性状基因座。
Behav Genet. 2004 Jan;34(1):3-15. doi: 10.1023/B:BEGE.0000009473.43185.43.
5
Bayesian B-spline mapping for dynamic quantitative traits.用于动态数量性状的贝叶斯B样条映射
Genet Res (Camb). 2012 Apr;94(2):85-95. doi: 10.1017/S0016672312000249.
6
A coordinate descent approach for sparse Bayesian learning in high dimensional QTL mapping and genome-wide association studies.一种用于高维 QTL 映射和全基因组关联研究中稀疏贝叶斯学习的坐标下降方法。
Bioinformatics. 2019 Nov 1;35(21):4327-4335. doi: 10.1093/bioinformatics/btz244.
7
Across population genomic prediction scenarios in which Bayesian variable selection outperforms GBLUP.在贝叶斯变量选择优于基因组最佳线性无偏预测(GBLUP)的群体基因组预测场景中。
BMC Genet. 2015 Dec 23;16:146. doi: 10.1186/s12863-015-0305-x.
8
Bayesian composite model space approach for mapping quantitative trait Loci in variance component model.贝叶斯复合模型空间方法在方差成分模型中定位数量性状位点
Behav Genet. 2009 May;39(3):337-46. doi: 10.1007/s10519-009-9259-y. Epub 2009 Mar 5.
9
A simple regression-based method to map quantitative trait loci underlying function-valued phenotypes.一种基于简单回归的方法,用于定位功能值表型背后的数量性状基因座。
Genetics. 2014 Aug;197(4):1409-16. doi: 10.1534/genetics.114.166306. Epub 2014 Jun 14.
10
Estimation of dynamic SNP-heritability with Bayesian Gaussian process models.基于贝叶斯高斯过程模型估计动态 SNP 遗传率。
Bioinformatics. 2020 Jun 1;36(12):3795-3802. doi: 10.1093/bioinformatics/btaa199.

引用本文的文献

1
Assessing genotype adaptability and stability in perennial forage breeding trials using random regression models for longitudinal dry matter yield data.使用随机回归模型对多年生牧草育种试验中纵向干物质产量数据评估基因型适应性和稳定性。
G3 (Bethesda). 2025 Mar 18;15(3). doi: 10.1093/g3journal/jkae306.
2
A Multilayer Interactome Network Constructed in a Forest Poplar Population Mediates the Pleiotropic Control of Complex Traits.在杨树群体中构建的多层互作组网络介导复杂性状的多效性控制。
Front Genet. 2021 Nov 12;12:769688. doi: 10.3389/fgene.2021.769688. eCollection 2021.
3
Pleiotropy and epistasis within and between signaling pathways defines the genetic architecture of fungal virulence.

本文引用的文献

1
Two-stage identification of SNP effects on dynamic poplar growth.基于两阶段方法鉴定 SNP 对杨树生长动态的影响。
Plant J. 2018 Jan;93(2):286-296. doi: 10.1111/tpj.13777. Epub 2017 Dec 28.
2
Performance Gains in Genome-Wide Association Studies for Longitudinal Traits via Modeling Time-varied effects.通过建模时变效应提高纵向性状全基因组关联研究的性能。
Sci Rep. 2017 Apr 4;7(1):590. doi: 10.1038/s41598-017-00638-2.
3
RNA-Seq Count Data Modelling by Grey Relational Analysis and Nonparametric Gaussian Process.基于灰色关联分析和非参数高斯过程的 RNA-Seq 计数数据建模
信号通路内部和之间的多效性和上位性定义了真菌毒力的遗传结构。
PLoS Genet. 2021 Jan 25;17(1):e1009313. doi: 10.1371/journal.pgen.1009313. eCollection 2021 Jan.
4
Integrating High-Throughput Phenotyping and Statistical Genomic Methods to Genetically Improve Longitudinal Traits in Crops.整合高通量表型分析和统计基因组方法以遗传改良作物的纵向性状。
Front Plant Sci. 2020 May 26;11:681. doi: 10.3389/fpls.2020.00681. eCollection 2020.
5
Estimation of dynamic SNP-heritability with Bayesian Gaussian process models.基于贝叶斯高斯过程模型估计动态 SNP 遗传率。
Bioinformatics. 2020 Jun 1;36(12):3795-3802. doi: 10.1093/bioinformatics/btaa199.
PLoS One. 2016 Oct 26;11(10):e0164766. doi: 10.1371/journal.pone.0164766. eCollection 2016.
4
Genome-wide association study of behavioral, physiological and gene expression traits in outbred CFW mice.远交CFW小鼠行为、生理和基因表达特征的全基因组关联研究。
Nat Genet. 2016 Aug;48(8):919-26. doi: 10.1038/ng.3609. Epub 2016 Jul 4.
5
FINEMAP: efficient variable selection using summary data from genome-wide association studies.精细定位:利用全基因组关联研究的汇总数据进行高效变量选择。
Bioinformatics. 2016 May 15;32(10):1493-501. doi: 10.1093/bioinformatics/btw018. Epub 2016 Jan 14.
6
Mapping Quantitative Trait Loci Underlying Function-Valued Traits Using Functional Principal Component Analysis and Multi-Trait Mapping.使用功能主成分分析和多性状定位法定位功能值性状的数量性状基因座
G3 (Bethesda). 2015 Nov 3;6(1):79-86. doi: 10.1534/g3.115.024133.
7
Dynamic Quantitative Trait Locus Analysis of Plant Phenomic Data.动态定量性状基因座分析在植物表型数据中的应用。
Trends Plant Sci. 2015 Dec;20(12):822-833. doi: 10.1016/j.tplants.2015.08.012. Epub 2015 Oct 5.
8
BAYESIAN GROUP LASSO FOR NONPARAMETRIC VARYING-COEFFICIENT MODELS WITH APPLICATION TO FUNCTIONAL GENOME-WIDE ASSOCIATION STUDIES.用于非参数变系数模型的贝叶斯组套索及其在全基因组关联研究中的应用
Ann Appl Stat. 2015 Jun;9(2):640-664. doi: 10.1214/15-AOAS808.
9
Genome-wide modeling of transcription kinetics reveals patterns of RNA production delays.转录动力学的全基因组建模揭示了RNA产生延迟的模式。
Proc Natl Acad Sci U S A. 2015 Oct 20;112(42):13115-20. doi: 10.1073/pnas.1420404112. Epub 2015 Oct 5.
10
Genetics of Rapid and Extreme Size Evolution in Island Mice.岛屿小鼠快速和极端体型进化的遗传学
Genetics. 2015 Sep;201(1):213-28. doi: 10.1534/genetics.115.177790. Epub 2015 Jul 20.