• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

单倍型池:通过DNA池和系统发育建模改进单倍型频率估计

HAPLOPOOL: improving haplotype frequency estimation through DNA pools and phylogenetic modeling.

作者信息

Kirkpatrick Bonnie, Armendariz Carlos Santos, Karp Richard M, Halperin Eran

机构信息

Department of Electrical Engineering and Computer Sciences, UC Berkeley, CA, , USA.

出版信息

Bioinformatics. 2007 Nov 15;23(22):3048-55. doi: 10.1093/bioinformatics/btm435. Epub 2007 Sep 25.

DOI:10.1093/bioinformatics/btm435
PMID:17895275
Abstract

MOTIVATION

The search for genetic variants that are linked to complex diseases such as cancer, Parkinson's;, or Alzheimer's; disease, may lead to better treatments. Since haplotypes can serve as proxies for hidden variants, one method of finding the linked variants is to look for case-control associations between the haplotypes and disease. Finding these associations requires a high-quality estimation of the haplotype frequencies in the population. To this end, we present, HaploPool, a method of estimating haplotype frequencies from blocks of consecutive SNPs.

RESULTS

HaploPool leverages the efficiency of DNA pools and estimates the population haplotype frequencies from pools of disjoint sets, each containing two or three unrelated individuals. We study the trade-off between pooling efficiency and accuracy of haplotype frequency estimates. For a fixed genotyping budget, HaploPool performs favorably on pools of two individuals as compared with a state-of-the-art non-pooled phasing method, PHASE. Of independent interest, HaploPool can be used to phase non-pooled genotype data with an accuracy approaching that of PHASE. We compared our algorithm to three programs that estimate haplotype frequencies from pooled data. HaploPool is an order of magnitude more efficient (at least six times faster), and considerably more accurate than previous methods. In contrast to previous methods, HaploPool performs well with missing data, genotyping errors and long haplotype blocks (of between 5 and 25 SNPs).

摘要

动机

寻找与癌症、帕金森氏症或阿尔茨海默氏症等复杂疾病相关的基因变异,可能会带来更好的治疗方法。由于单倍型可以作为隐藏变异的替代物,一种寻找相关变异的方法是寻找单倍型与疾病之间的病例对照关联。找到这些关联需要对人群中的单倍型频率进行高质量的估计。为此,我们提出了HaploPool,一种从连续单核苷酸多态性(SNP)块估计单倍型频率的方法。

结果

HaploPool利用DNA池的效率,从不相交的集合池中估计人群单倍型频率,每个集合包含两个或三个不相关的个体。我们研究了池化效率与单倍型频率估计准确性之间的权衡。对于固定的基因分型预算,与一种先进的非池化定相方法PHASE相比,HaploPool在两个个体的池上表现良好。具有独立意义的是,可以使用HaploPool对非池化基因型数据进行定相,其准确性接近PHASE。我们将我们的算法与三个从池化数据估计单倍型频率的程序进行了比较。HaploPool比以前的方法效率高一个数量级(至少快六倍),并且准确性更高。与以前的方法不同,HaploPool在存在缺失数据、基因分型错误和长单倍型块(5至25个SNP)的情况下表现良好。

相似文献

1
HAPLOPOOL: improving haplotype frequency estimation through DNA pools and phylogenetic modeling.单倍型池:通过DNA池和系统发育建模改进单倍型频率估计
Bioinformatics. 2007 Nov 15;23(22):3048-55. doi: 10.1093/bioinformatics/btm435. Epub 2007 Sep 25.
2
PoooL: an efficient method for estimating haplotype frequencies from large DNA pools.PoooL:一种从大型DNA混合样本中估计单倍型频率的有效方法。
Bioinformatics. 2008 Sep 1;24(17):1942-8. doi: 10.1093/bioinformatics/btn324. Epub 2008 Jun 23.
3
[Estimation of haplotypes based on DNA pooling].基于DNA池的单倍型估计
Zhong Nan Da Xue Xue Bao Yi Xue Ban. 2011 May;36(5):457-60. doi: 10.3969/j.issn.1672-7347.2011.05.015.
4
On the use of DNA pooling to estimate haplotype frequencies.关于使用DNA池来估计单倍型频率。
Genet Epidemiol. 2003 Jan;24(1):74-82. doi: 10.1002/gepi.10195.
5
Estimate haplotype frequencies in pedigrees.估计系谱中的单倍型频率。
BMC Bioinformatics. 2006 Dec 12;7 Suppl 4(Suppl 4):S5. doi: 10.1186/1471-2105-7-S4-S5.
6
Haplotype reconstruction from genotype data using Imperfect Phylogeny.利用不完美系统发育从基因型数据中进行单倍型重建。
Bioinformatics. 2004 Aug 12;20(12):1842-9. doi: 10.1093/bioinformatics/bth149. Epub 2004 Feb 26.
7
HAPLORE: a program for haplotype reconstruction in general pedigrees without recombination.HAPLORE:一个用于在无重组的一般家系中进行单倍型重建的程序。
Bioinformatics. 2005 Jan 1;21(1):90-103. doi: 10.1093/bioinformatics/bth388. Epub 2004 Jul 1.
8
Maximum-parsimony haplotype frequencies inference based on a joint constrained sparse representation of pooled DNA.基于合并 DNA 的联合约束稀疏表示的最大简约单倍型频率推断。
BMC Bioinformatics. 2013 Sep 8;14:270. doi: 10.1186/1471-2105-14-270.
9
Preliminary analysis of a KIR haplotype estimation algorithm: a simulation study.杀伤细胞免疫球蛋白样受体(KIR)单倍型估计算法的初步分析:一项模拟研究
Tissue Antigens. 2007 Apr;69 Suppl 1:96-100. doi: 10.1111/j.1399-0039.2006.762_4.x.
10
Maximum-likelihood estimation of haplotype frequencies in nuclear families.核心家庭中单体型频率的最大似然估计。
Genet Epidemiol. 2004 Jul;27(1):21-32. doi: 10.1002/gepi.10323.

引用本文的文献

1
An efficient pipeline to generate data for studies in plastid population genomics and phylogeography.一种用于生成质体群体基因组学和系统地理学研究数据的高效流程。
Appl Plant Sci. 2017 Nov 14;5(11). doi: 10.3732/apps.1700053. eCollection 2017 Nov.
2
A sequential Monte Carlo framework for haplotype inference in CNV/SNP genotype data.用于CNV/SNP基因型数据单倍型推断的序贯蒙特卡罗框架。
EURASIP J Bioinform Syst Biol. 2014;2014(1):7. doi: 10.1186/1687-4153-2014-7. Epub 2014 Apr 24.
3
An EM algorithm based on an internal list for estimating haplotype distributions of rare variants from pooled genotype data.
基于内部列表的 EM 算法,用于从合并基因型数据估计罕见变异体的单体型分布。
BMC Genet. 2013 Sep 13;14:82. doi: 10.1186/1471-2156-14-82.
4
Maximum-parsimony haplotype frequencies inference based on a joint constrained sparse representation of pooled DNA.基于合并 DNA 的联合约束稀疏表示的最大简约单倍型频率推断。
BMC Bioinformatics. 2013 Sep 8;14:270. doi: 10.1186/1471-2105-14-270.
5
Maximum likelihood estimation of frequencies of known haplotypes from pooled sequence data.从合并的序列数据中估计已知单倍型频率的最大似然估计。
Mol Biol Evol. 2013 May;30(5):1145-58. doi: 10.1093/molbev/mst016. Epub 2013 Jan 30.
6
Cost-effective genome-wide estimation of allele frequencies from pooled DNA in Atlantic salmon (Salmo salar L.).从大西洋鲑鱼(Salmo salar L.)混合 DNA 中进行经济有效的全基因组等位基因频率估计。
BMC Genomics. 2013 Jan 16;14:12. doi: 10.1186/1471-2164-14-12.
7
Fast and accurate haplotype frequency estimation for large haplotype vectors from pooled DNA data.从混合 DNA 数据中快速准确估计大型单倍型向量的单倍型频率。
BMC Genet. 2012 Oct 30;13:94. doi: 10.1186/1471-2156-13-94.
8
Rapid inexpensive genome-wide association using pooled whole blood.利用混合全血进行快速、廉价的全基因组关联研究。
Genome Res. 2009 Nov;19(11):2075-80. doi: 10.1101/gr.094680.109. Epub 2009 Oct 3.
9
Multimarker analysis and imputation of multiple platform pooling-based genome-wide association studies.基于多平台混合样本的全基因组关联研究的多标记分析与推算
Bioinformatics. 2008 Sep 1;24(17):1896-902. doi: 10.1093/bioinformatics/btn333. Epub 2008 Jul 10.