• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于合并 DNA 的联合约束稀疏表示的最大简约单倍型频率推断。

Maximum-parsimony haplotype frequencies inference based on a joint constrained sparse representation of pooled DNA.

机构信息

Electrical Engineering Department, Columbia University, New York NY 10027, USA.

出版信息

BMC Bioinformatics. 2013 Sep 8;14:270. doi: 10.1186/1471-2105-14-270.

DOI:10.1186/1471-2105-14-270
PMID:24010487
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3847492/
Abstract

BACKGROUND

DNA pooling constitutes a cost effective alternative in genome wide association studies. In DNA pooling, equimolar amounts of DNA from different individuals are mixed into one sample and the frequency of each allele in each position is observed in a single genotype experiment. The identification of haplotype frequencies from pooled data in addition to single locus analysis is of separate interest within these studies as haplotypes could increase statistical power and provide additional insight.

RESULTS

We developed a method for maximum-parsimony haplotype frequency estimation from pooled DNA data based on the sparse representation of the DNA pools in a dictionary of haplotypes. Extensions to scenarios where data is noisy or even missing are also presented. The resulting method is first applied to simulated data based on the haplotypes and their associated frequencies of the AGT gene. We further evaluate our methodology on datasets consisting of SNPs from the first 7Mb of the HapMap CEU population. Noise and missing data were further introduced in the datasets in order to test the extensions of the proposed method. Both HIPPO and HAPLOPOOL were also applied to these datasets to compare performances.

CONCLUSIONS

We evaluate our methodology on scenarios where pooling is more efficient relative to individual genotyping; that is, in datasets that contain pools with a small number of individuals. We show that in such scenarios our methodology outperforms state-of-the-art methods such as HIPPO and HAPLOPOOL.

摘要

背景

在全基因组关联研究中,DNA 池化是一种具有成本效益的替代方法。在 DNA 池化中,将来自不同个体的等量 DNA 混合到一个样本中,并在单个基因型实验中观察每个位置的每个等位基因的频率。除了单一位点分析之外,从 pooled 数据中识别单倍型频率在这些研究中具有单独的意义,因为单倍型可以增加统计效力并提供额外的见解。

结果

我们开发了一种基于 haplotypes 字典中 DNA 池稀疏表示的从 pooled DNA 数据中进行最大简约单倍型频率估计的方法。还提出了针对数据存在噪声甚至缺失的情况的扩展。该方法首先应用于基于 AGT 基因的单倍型及其相关频率的模拟数据。我们进一步在包含来自 HapMap CEU 人群前 7Mb 的 SNPs 的数据集上评估我们的方法。为了测试所提出方法的扩展,在数据集上进一步引入了噪声和缺失数据。还将 HIPPO 和 HAPLOPOOL 应用于这些数据集以比较性能。

结论

我们在与个体基因分型相比更有效的 pooling 场景下评估我们的方法;也就是说,在包含少量个体的 pooled 的数据集上。我们表明,在这种情况下,我们的方法优于 HIPPO 和 HAPLOPOOL 等最先进的方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ea2d/3847492/46ad6b1f5a20/1471-2105-14-270-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ea2d/3847492/e9312c3bbf26/1471-2105-14-270-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ea2d/3847492/5f21306e6f53/1471-2105-14-270-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ea2d/3847492/4249cba06dc9/1471-2105-14-270-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ea2d/3847492/46ad6b1f5a20/1471-2105-14-270-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ea2d/3847492/e9312c3bbf26/1471-2105-14-270-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ea2d/3847492/5f21306e6f53/1471-2105-14-270-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ea2d/3847492/4249cba06dc9/1471-2105-14-270-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ea2d/3847492/46ad6b1f5a20/1471-2105-14-270-4.jpg

相似文献

1
Maximum-parsimony haplotype frequencies inference based on a joint constrained sparse representation of pooled DNA.基于合并 DNA 的联合约束稀疏表示的最大简约单倍型频率推断。
BMC Bioinformatics. 2013 Sep 8;14:270. doi: 10.1186/1471-2105-14-270.
2
HAPLOPOOL: improving haplotype frequency estimation through DNA pools and phylogenetic modeling.单倍型池:通过DNA池和系统发育建模改进单倍型频率估计
Bioinformatics. 2007 Nov 15;23(22):3048-55. doi: 10.1093/bioinformatics/btm435. Epub 2007 Sep 25.
3
PoooL: an efficient method for estimating haplotype frequencies from large DNA pools.PoooL:一种从大型DNA混合样本中估计单倍型频率的有效方法。
Bioinformatics. 2008 Sep 1;24(17):1942-8. doi: 10.1093/bioinformatics/btn324. Epub 2008 Jun 23.
4
Fast and accurate haplotype frequency estimation for large haplotype vectors from pooled DNA data.从混合 DNA 数据中快速准确估计大型单倍型向量的单倍型频率。
BMC Genet. 2012 Oct 30;13:94. doi: 10.1186/1471-2156-13-94.
5
On the use of DNA pooling to estimate haplotype frequencies.关于使用DNA池来估计单倍型频率。
Genet Epidemiol. 2003 Jan;24(1):74-82. doi: 10.1002/gepi.10195.
6
Efficiency of single-nucleotide polymorphism haplotype estimation from pooled DNA.从混合DNA中估计单核苷酸多态性单倍型的效率。
Proc Natl Acad Sci U S A. 2003 Jun 10;100(12):7225-30. doi: 10.1073/pnas.1237858100. Epub 2003 May 30.
7
Estimating population haplotype frequencies from pooled SNP data using incomplete database information.基于不完全的数据库信息,从合并的 SNP 数据中估计群体单体型频率。
Bioinformatics. 2009 Dec 15;25(24):3296-302. doi: 10.1093/bioinformatics/btp584. Epub 2009 Oct 27.
8
Cost-effective genome-wide estimation of allele frequencies from pooled DNA in Atlantic salmon (Salmo salar L.).从大西洋鲑鱼(Salmo salar L.)混合 DNA 中进行经济有效的全基因组等位基因频率估计。
BMC Genomics. 2013 Jan 16;14:12. doi: 10.1186/1471-2164-14-12.
9
An improved preprocessing algorithm for haplotype inference by pure parsimony.一种通过纯简约法进行单倍型推断的改进预处理算法。
J Bioinform Comput Biol. 2014 Aug;12(4):1450020. doi: 10.1142/S0219720014500206. Epub 2014 Aug 1.
10
[Estimation of haplotypes based on DNA pooling].基于DNA池的单倍型估计
Zhong Nan Da Xue Xue Bao Yi Xue Ban. 2011 May;36(5):457-60. doi: 10.3969/j.issn.1672-7347.2011.05.015.

引用本文的文献

1
Accurate Allele Frequencies from Ultra-low Coverage Pool-Seq Samples in Evolve-and-Resequence Experiments.从进化重测序实验的超低覆盖度池测序样本中获得准确的等位基因频率。
G3 (Bethesda). 2019 Dec 3;9(12):4159-4168. doi: 10.1534/g3.119.400755.

本文引用的文献

1
eALPS: estimating abundance levels in pooled sequencing using available genotyping data.eALPS:利用可用的基因分型数据估算混合测序中的丰度水平。
J Comput Biol. 2013 Nov;20(11):861-77. doi: 10.1089/cmb.2013.0105. Epub 2013 Oct 21.
2
Maximum likelihood estimation of frequencies of known haplotypes from pooled sequence data.从合并的序列数据中估计已知单倍型频率的最大似然估计。
Mol Biol Evol. 2013 May;30(5):1145-58. doi: 10.1093/molbev/mst016. Epub 2013 Jan 30.
3
A fast collapsed data method for estimating haplotype frequencies from pooled genotype data with applications to the study of rare variants.
一种快速的崩溃数据方法,用于从合并基因型数据中估计单倍型频率,应用于稀有变异体的研究。
Stat Med. 2013 Apr 15;32(8):1343-60. doi: 10.1002/sim.5540. Epub 2012 Aug 1.
4
Bacterial community reconstruction using compressed sensing.利用压缩感知进行细菌群落重建。
J Comput Biol. 2011 Nov;18(11):1723-41. doi: 10.1089/cmb.2011.0189. Epub 2011 Oct 14.
5
Estimating haplotype frequencies by combining data from large DNA pools with database information.通过将大型 DNA 池数据与数据库信息相结合来估计单倍型频率。
IEEE/ACM Trans Comput Biol Bioinform. 2011 Jan-Mar;8(1):36-44. doi: 10.1109/TCBB.2009.71.
6
A study of the efficiency of pooling in haplotype estimation.一种用于单体型估计的合并效率研究。
Bioinformatics. 2010 Oct 15;26(20):2556-63. doi: 10.1093/bioinformatics/btq492. Epub 2010 Aug 27.
7
Estimating population haplotype frequencies from pooled SNP data using incomplete database information.基于不完全的数据库信息,从合并的 SNP 数据中估计群体单体型频率。
Bioinformatics. 2009 Dec 15;25(24):3296-302. doi: 10.1093/bioinformatics/btp584. Epub 2009 Oct 27.
8
Computationally feasible estimation of haplotype frequencies from pooled DNA with and without Hardy-Weinberg equilibrium.在有和没有哈迪-温伯格平衡的情况下,从混合DNA中对单倍型频率进行计算上可行的估计。
Bioinformatics. 2009 Feb 1;25(3):379-86. doi: 10.1093/bioinformatics/btn623. Epub 2008 Dec 2.
9
PoooL: an efficient method for estimating haplotype frequencies from large DNA pools.PoooL:一种从大型DNA混合样本中估计单倍型频率的有效方法。
Bioinformatics. 2008 Sep 1;24(17):1942-8. doi: 10.1093/bioinformatics/btn324. Epub 2008 Jun 23.
10
HAPLOPOOL: improving haplotype frequency estimation through DNA pools and phylogenetic modeling.单倍型池:通过DNA池和系统发育建模改进单倍型频率估计
Bioinformatics. 2007 Nov 15;23(22):3048-55. doi: 10.1093/bioinformatics/btm435. Epub 2007 Sep 25.