Suppr超能文献

一种用于估计连续同源片段(IBD)函数的通用且高效的方法,该方法用于数量性状基因座(QTL)的基因组扫描。

A general and efficient method for estimating continuous IBD functions for use in genome scans for QTL.

作者信息

Besnier Francois, Carlborg Orjan

机构信息

Linnaeus Centre for Bioinformatics, Uppsala University, SE-75124 Uppsala, Sweden.

出版信息

BMC Bioinformatics. 2007 Nov 13;8:440. doi: 10.1186/1471-2105-8-440.

Abstract

BACKGROUND

Identity by descent (IBD) matrix estimation is a central component in mapping of Quantitative Trait Loci (QTL) using variance component models. A large number of algorithms have been developed for estimation of IBD between individuals in populations at discrete locations in the genome for use in genome scans to detect QTL affecting various traits of interest in experimental animal, human and agricultural pedigrees. Here, we propose a new approach to estimate IBD as continuous functions rather than as discrete values.

RESULTS

Estimation of IBD functions improved the computational efficiency and memory usage in genome scanning for QTL. We have explored two approaches to obtain continuous marker-bracket IBD-functions. By re-implementing an existing and fast deterministic IBD-estimation method, we show that this approach results in IBD functions that produces the exact same IBD as the original algorithm, but with a greater than 2-fold improvement of the computational efficiency and a considerably lower memory requirement for storing the resulting genome-wide IBD. By developing a general IBD function approximation algorithm, we show that it is possible to estimate marker-bracket IBD functions from IBD matrices estimated at marker locations by any existing IBD estimation algorithm. The general algorithm provides approximations that lead to QTL variance component estimates that even in worst-case scenarios are very similar to the true values. The approach of storing IBD as polynomial IBD-function was also shown to reduce the amount of memory required in genome scans for QTL.

CONCLUSION

In addition to direct improvements in computational and memory efficiency, estimation of IBD-functions is a fundamental step needed to develop and implement new efficient optimization algorithms for high precision localization of QTL. Here, we discuss and test two approaches for estimating IBD functions based on existing IBD estimation algorithms. Our approaches provide immediately useful techniques for use in single QTL analyses in the variance component QTL mapping framework. They will, however, be particularly useful in genome scans for multiple interacting QTL, where the improvements in both computational and memory efficiency are the key for successful development of efficient optimization algorithms to allow widespread use of this methodology.

摘要

背景

通过系谱同一性(IBD)矩阵估计是使用方差成分模型进行数量性状基因座(QTL)定位的核心组成部分。已经开发了大量算法,用于估计基因组中离散位置的群体中个体之间的IBD,以用于基因组扫描,以检测影响实验动物、人类和农业系谱中各种感兴趣性状的QTL。在此,我们提出一种新方法,将IBD估计为连续函数而非离散值。

结果

IBD函数估计提高了QTL基因组扫描中的计算效率和内存使用。我们探索了两种获得连续标记区间IBD函数的方法。通过重新实现一种现有的快速确定性IBD估计方法,我们表明该方法产生的IBD函数与原始算法产生的IBD完全相同,但计算效率提高了两倍以上,并且存储全基因组IBD所需的内存要求大大降低。通过开发一种通用的IBD函数近似算法,我们表明可以从任何现有IBD估计算法在标记位置估计的IBD矩阵中估计标记区间IBD函数。通用算法提供的近似值导致QTL方差成分估计值,即使在最坏情况下也与真实值非常相似。将IBD存储为多项式IBD函数的方法也被证明可以减少QTL基因组扫描所需的内存量。

结论

除了直接提高计算和内存效率外,IBD函数估计是开发和实施用于QTL高精度定位的新高效优化算法所需的基本步骤。在此,我们讨论并测试了基于现有IBD估计算法估计IBD函数的两种方法。我们的方法为方差成分QTL定位框架中的单QTL分析提供了立即可用的技术。然而,它们在多个相互作用QTL的基因组扫描中将特别有用,其中计算和内存效率的提高是成功开发高效优化算法以允许广泛使用该方法的关键。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a290/2194736/5738f4ef6f75/1471-2105-8-440-1.jpg

相似文献

1
2
Approximating identity-by-descent matrices using multiple haplotype configurations on pedigrees.
Genetics. 2005 Sep;171(1):365-76. doi: 10.1534/genetics.104.040337. Epub 2005 Jun 18.
3
Methodological aspects of the genetic dissection of gene expression.
Bioinformatics. 2005 May 15;21(10):2383-93. doi: 10.1093/bioinformatics/bti241.
4
A fast expectation-maximum algorithm for fine-scale QTL mapping.
Theor Appl Genet. 2012 Dec;125(8):1727-34. doi: 10.1007/s00122-012-1949-9. Epub 2012 Aug 4.
5
How to deal with genotype uncertainty in variance component quantitative trait loci analyses.
Genet Res (Camb). 2011 Oct;93(5):333-42. doi: 10.1017/S0016672311000152. Epub 2011 Jul 18.
6
Comparative analysis of haplotype association mapping algorithms.
BMC Bioinformatics. 2006 Feb 9;7:61. doi: 10.1186/1471-2105-7-61.
7
Simultaneous search for multiple QTL using the global optimization algorithm DIRECT.
Bioinformatics. 2004 Aug 12;20(12):1887-95. doi: 10.1093/bioinformatics/bth175. Epub 2004 Mar 25.
9
An IBD-based mixed model approach for QTL mapping in multiparental populations.
Theor Appl Genet. 2021 Nov;134(11):3643-3660. doi: 10.1007/s00122-021-03919-7. Epub 2021 Aug 3.
10
Efficient algorithms for quantitative trait loci mapping problems.
J Comput Biol. 2002;9(6):793-804. doi: 10.1089/10665270260518272.

引用本文的文献

1
A Mutation in DAOA Modifies the Age of Onset in PSEN1 E280A Alzheimer's Disease.
Neural Plast. 2016;2016:9760314. doi: 10.1155/2016/9760314. Epub 2016 Jan 5.
2
Genetic influences on brain gene expression in rats selected for tameness and aggression.
Genetics. 2014 Nov;198(3):1277-90. doi: 10.1534/genetics.114.168948. Epub 2014 Sep 3.
3
Identity-by-descent matrix decomposition using latent ancestral allele models.
Genetics. 2010 Jul;185(3):1045-57. doi: 10.1534/genetics.110.117390. Epub 2010 Apr 20.
4
Genetic architecture of tameness in a rat model of animal domestication.
Genetics. 2009 Jun;182(2):541-54. doi: 10.1534/genetics.109.102186. Epub 2009 Apr 10.

本文引用的文献

2
3
Epistasis: too often neglected in complex trait studies?
Nat Rev Genet. 2004 Aug;5(8):618-25. doi: 10.1038/nrg1407.
4
Simultaneous search for multiple QTL using the global optimization algorithm DIRECT.
Bioinformatics. 2004 Aug 12;20(12):1887-95. doi: 10.1093/bioinformatics/bth175. Epub 2004 Mar 25.
6
Merlin--rapid analysis of dense genetic maps using sparse gene flow trees.
Nat Genet. 2002 Jan;30(1):97-101. doi: 10.1038/ng786. Epub 2001 Dec 3.
7
A simple and rapid method for calculating identity-by-descent matrices using multiple markers.
Genet Sel Evol. 2001 Sep-Oct;33(5):453-71. doi: 10.1186/1297-9686-33-5-453.
9
Multipoint quantitative-trait linkage analysis in general pedigrees.
Am J Hum Genet. 1998 May;62(5):1198-211. doi: 10.1086/301844.
10
Power of variance component linkage analysis to detect epistasis.
Genet Epidemiol. 1997;14(6):1017-22. doi: 10.1002/(SICI)1098-2272(1997)14:6<1017::AID-GEPI76>3.0.CO;2-L.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验