• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

概率基因型推断方法和集成加权套索用于 QTL 鉴定。

Probability genotype imputation method and integrated weighted lasso for QTL identification.

机构信息

Johann Bernoulli Institute for Mathematics and Computer Science, University of Groningen, Groningen 9747 AG, The Netherlands.

出版信息

BMC Genet. 2013 Dec 30;14:125. doi: 10.1186/1471-2156-14-125.

DOI:10.1186/1471-2156-14-125
PMID:24378210
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4126192/
Abstract

BACKGROUND

Many QTL studies have two common features: (1) often there is missing marker information, (2) among many markers involved in the biological process only a few are causal. In statistics, the second issue falls under the headings "sparsity" and "causal inference". The goal of this work is to develop a two-step statistical methodology for QTL mapping for markers with binary genotypes. The first step introduces a novel imputation method for missing genotypes. Outcomes of the proposed imputation method are probabilities which serve as weights to the second step, namely in weighted lasso. The sparse phenotype inference is employed to select a set of predictive markers for the trait of interest.

RESULTS

Simulation studies validate the proposed methodology under a wide range of realistic settings. Furthermore, the methodology outperforms alternative imputation and variable selection methods in such studies. The methodology was applied to an Arabidopsis experiment, containing 69 markers for 165 recombinant inbred lines of a F8 generation. The results confirm previously identified regions, however several new markers are also found. On the basis of the inferred ROC behavior these markers show good potential for being real, especially for the germination trait Gmax.

CONCLUSIONS

Our imputation method shows higher accuracy in terms of sensitivity and specificity compared to alternative imputation method. Also, the proposed weighted lasso outperforms commonly practiced multiple regression as well as the traditional lasso and adaptive lasso with three weighting schemes. This means that under realistic missing data settings this methodology can be used for QTL identification.

摘要

背景

许多 QTL 研究具有两个共同特征:(1)经常存在缺失的标记信息,(2)在涉及生物过程的众多标记中,只有少数是因果关系的。在统计学中,第二个问题属于“稀疏性”和“因果推断”的范畴。这项工作的目标是开发一种用于具有二态基因型标记的 QTL 作图的两步统计方法。第一步引入了一种新的缺失基因型插补方法。所提出的插补方法的结果是概率,它们作为权重应用于第二步,即加权套索。稀疏表型推断用于选择一组与感兴趣性状相关的预测标记。

结果

模拟研究在广泛的现实设置下验证了所提出的方法。此外,在这些研究中,该方法优于替代的插补和变量选择方法。该方法应用于拟南芥实验,该实验包含 69 个标记,用于 F8 代的 165 个重组自交系。结果证实了先前确定的区域,但也发现了几个新的标记。基于推断的 ROC 行为,这些标记显示出很好的潜力,特别是对于发芽性状 Gmax。

结论

与替代插补方法相比,我们的插补方法在灵敏度和特异性方面表现出更高的准确性。此外,所提出的加权套索优于常用的多元回归以及传统的套索和自适应套索,具有三种加权方案。这意味着在现实的缺失数据环境下,该方法可用于 QTL 鉴定。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32f5/4126192/65fad4264805/1471-2156-14-125-22.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32f5/4126192/f47a21cd926f/1471-2156-14-125-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32f5/4126192/f75d229281f9/1471-2156-14-125-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32f5/4126192/288b8e073720/1471-2156-14-125-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32f5/4126192/b5d4670d4eb3/1471-2156-14-125-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32f5/4126192/d8d192bb1d2f/1471-2156-14-125-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32f5/4126192/1061e2cdab80/1471-2156-14-125-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32f5/4126192/d4965b75a136/1471-2156-14-125-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32f5/4126192/190872a59403/1471-2156-14-125-8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32f5/4126192/e59eff8a1f70/1471-2156-14-125-9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32f5/4126192/19249dec2a24/1471-2156-14-125-10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32f5/4126192/65c21ea1e86c/1471-2156-14-125-11.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32f5/4126192/f951591da2f5/1471-2156-14-125-12.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32f5/4126192/0b199d5370e8/1471-2156-14-125-13.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32f5/4126192/c46c27402f24/1471-2156-14-125-14.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32f5/4126192/6c4626b9b8ce/1471-2156-14-125-15.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32f5/4126192/50c93cf7bb64/1471-2156-14-125-16.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32f5/4126192/d04a0bae04d3/1471-2156-14-125-17.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32f5/4126192/97d3af393a69/1471-2156-14-125-18.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32f5/4126192/59d779f3e082/1471-2156-14-125-19.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32f5/4126192/5e2eba831c71/1471-2156-14-125-20.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32f5/4126192/988cbd22f4c8/1471-2156-14-125-21.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32f5/4126192/65fad4264805/1471-2156-14-125-22.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32f5/4126192/f47a21cd926f/1471-2156-14-125-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32f5/4126192/f75d229281f9/1471-2156-14-125-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32f5/4126192/288b8e073720/1471-2156-14-125-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32f5/4126192/b5d4670d4eb3/1471-2156-14-125-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32f5/4126192/d8d192bb1d2f/1471-2156-14-125-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32f5/4126192/1061e2cdab80/1471-2156-14-125-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32f5/4126192/d4965b75a136/1471-2156-14-125-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32f5/4126192/190872a59403/1471-2156-14-125-8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32f5/4126192/e59eff8a1f70/1471-2156-14-125-9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32f5/4126192/19249dec2a24/1471-2156-14-125-10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32f5/4126192/65c21ea1e86c/1471-2156-14-125-11.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32f5/4126192/f951591da2f5/1471-2156-14-125-12.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32f5/4126192/0b199d5370e8/1471-2156-14-125-13.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32f5/4126192/c46c27402f24/1471-2156-14-125-14.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32f5/4126192/6c4626b9b8ce/1471-2156-14-125-15.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32f5/4126192/50c93cf7bb64/1471-2156-14-125-16.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32f5/4126192/d04a0bae04d3/1471-2156-14-125-17.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32f5/4126192/97d3af393a69/1471-2156-14-125-18.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32f5/4126192/59d779f3e082/1471-2156-14-125-19.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32f5/4126192/5e2eba831c71/1471-2156-14-125-20.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32f5/4126192/988cbd22f4c8/1471-2156-14-125-21.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32f5/4126192/65fad4264805/1471-2156-14-125-22.jpg

相似文献

1
Probability genotype imputation method and integrated weighted lasso for QTL identification.概率基因型推断方法和集成加权套索用于 QTL 鉴定。
BMC Genet. 2013 Dec 30;14:125. doi: 10.1186/1471-2156-14-125.
2
Mapping quantitative trait loci in noninbred mosquito crosses.在非近亲繁殖的蚊子杂交中定位数量性状基因座。
Genetics. 2006 Apr;172(4):2293-308. doi: 10.1534/genetics.105.050419. Epub 2006 Jan 16.
3
Dynamic quantitative trait loci analysis of seed reserve utilization during three germination stages in rice.水稻三个萌发阶段种子贮藏利用的动态数量性状位点分析。
PLoS One. 2013 Nov 11;8(11):e80002. doi: 10.1371/journal.pone.0080002. eCollection 2013.
4
Bayesian methods for quantitative trait loci mapping based on model selection: approximate analysis using the Bayesian information criterion.基于模型选择的数量性状基因座定位的贝叶斯方法:使用贝叶斯信息准则的近似分析
Genetics. 2001 Nov;159(3):1351-64. doi: 10.1093/genetics/159.3.1351.
5
Extreme QTL mapping of germination speed in Arabidopsis thaliana.拟南芥萌发速度的极端QTL定位
Mol Ecol. 2016 Sep;25(17):4177-96. doi: 10.1111/mec.13768.
6
Mapping quantitative trait loci using the marker regression and the interval mapping methods.使用标记回归和区间作图方法定位数量性状基因座。
Pak J Biol Sci. 2008 Feb 15;11(4):553-8. doi: 10.3923/pjbs.2008.553.558.
7
Mapping quantitative trait loci with dominant and missing markers in various crosses from two inbred lines.利用来自两个近交系的不同杂交组合中的显性和缺失标记定位数量性状基因座。
Genetica. 1997;101(1):47-58. doi: 10.1023/a:1018394410659.
8
Limitations of a two-step moment method for mapping linked multiple QTL.用于定位连锁多个数量性状基因座的两步矩法的局限性。
Genet Res. 2004 Dec;84(3):145-52. doi: 10.1017/s0016672304007207.
9
Mapping quantitative trait loci for traits defined as ratios.定位定义为比率的性状的数量性状基因座。
Genetica. 2008 Mar;132(3):323-9. doi: 10.1007/s10709-007-9175-0. Epub 2007 Aug 2.
10
Identification of quantitative trait loci associated with germination using chromosome segment substitution lines of rice (Oryza sativa L.).利用水稻染色体片段代换系鉴定与萌发相关的数量性状位点。
Theor Appl Genet. 2011 Aug;123(3):411-20. doi: 10.1007/s00122-011-1593-9. Epub 2011 Apr 22.

引用本文的文献

1
VARIABLE PRIORITIZATION IN NONLINEAR BLACK BOX METHODS: A GENETIC ASSOCIATION CASE STUDY.非线性黑箱方法中的变量优先级:一个基因关联案例研究
Ann Appl Stat. 2019 Jun;13(2):958-989. doi: 10.1214/18-aoas1222. Epub 2019 Jun 17.

本文引用的文献

1
Multiple imputation of missing phenotype data for QTL mapping.用于数量性状基因座定位的缺失表型数据的多重填补
Stat Appl Genet Mol Biol. 2011;10(1):Article 29. doi: 10.2202/1544-6115.1676.
2
Visualizing the genetic landscape of Arabidopsis seed performance.可视化拟南芥种子性能的遗传图谱。
Plant Physiol. 2012 Feb;158(2):570-89. doi: 10.1104/pp.111.186676. Epub 2011 Dec 12.
3
R/qtl: high-throughput multiple QTL mapping.R/qtl:高通量多 QTL 作图。
Bioinformatics. 2010 Dec 1;26(23):2990-2. doi: 10.1093/bioinformatics/btq565. Epub 2010 Oct 21.
4
Critical reasoning on causal inference in genome-wide linkage and association studies.全基因组连锁和关联研究中因果推断的批判性推理。
Trends Genet. 2010 Dec;26(12):493-8. doi: 10.1016/j.tig.2010.09.002. Epub 2010 Oct 15.
5
GERMINATOR: a software package for high-throughput scoring and curve fitting of Arabidopsis seed germination.GERMINATOR:一种用于拟南芥种子萌发高通量评分和曲线拟合的软件包。
Plant J. 2010 Apr 1;62(1):148-59. doi: 10.1111/j.1365-313X.2009.04116.x. Epub 2009 Dec 22.
6
Multiple-trait quantitative trait locus mapping with incomplete phenotypic data.利用不完整个表型数据进行多性状数量性状基因座定位
BMC Genet. 2008 Dec 5;9:82. doi: 10.1186/1471-2156-9-82.
7
A tutorial on statistical methods for population association studies.群体关联研究统计方法教程。
Nat Rev Genet. 2006 Oct;7(10):781-91. doi: 10.1038/nrg1916.
8
Multiple imputation of missing genotype data for unrelated individuals.无关个体缺失基因型数据的多重填补
Ann Hum Genet. 2006 May;70(Pt 3):372-81. doi: 10.1111/j.1529-8817.2005.00236.x.
9
Bay-0 x Shahdara recombinant inbred line population: a powerful tool for the genetic dissection of complex traits in Arabidopsis.Bay-0与Shahdara重组自交系群体:用于拟南芥复杂性状遗传剖析的强大工具。
Theor Appl Genet. 2002 May;104(6-7):1173-1184. doi: 10.1007/s00122-001-0825-9. Epub 2002 Feb 13.
10
The Ising model in physics and statistical genetics.物理学和统计遗传学中的伊辛模型。
Am J Hum Genet. 2001 Oct;69(4):853-62. doi: 10.1086/323419. Epub 2001 Aug 20.