• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

PoolHap:通过下一代测序从混合样本中推断单倍型频率。

PoolHap: inferring haplotype frequencies from pooled samples by next generation sequencing.

机构信息

Gregor Mendel Institute, Vienna, Austria.

出版信息

PLoS One. 2011 Jan 5;6(1):e15292. doi: 10.1371/journal.pone.0015292.

DOI:10.1371/journal.pone.0015292
PMID:21264334
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3016441/
Abstract

With the advance of next-generation sequencing (NGS) technologies, increasingly ambitious applications are becoming feasible. A particularly powerful one is the sequencing of polymorphic, pooled samples. The pool can be naturally occurring, as in the case of multiple pathogen strains in a blood sample, multiple types of cells in a cancerous tissue sample, or multiple isoforms of mRNA in a cell. In these cases, it's difficult or impossible to partition the subtypes experimentally before sequencing, and those subtype frequencies must hence be inferred. In addition, investigators may occasionally want to artificially pool the sample of a large number of individuals for reasons of cost-efficiency, e.g., when carrying out genetic mapping using bulked segregant analysis. Here we describe PoolHap, a computational tool for inferring haplotype frequencies from pooled samples when haplotypes are known. The key insight into why PoolHap works is that the large number of SNPs that come with genome-wide coverage can compensate for the uneven coverage across the genome. The performance of PoolHap is illustrated and discussed using simulated and real data. We show that PoolHap is able to accurately estimate the proportions of haplotypes with less than 2% error for 34-strain mixtures with 2X total coverage Arabidopsis thaliana whole genome polymorphism data. This method should facilitate greater biological insight into heterogeneous samples that are difficult or impossible to isolate experimentally. Software and users manual are freely available at http://arabidopsis.gmi.oeaw.ac.at/quan/poolhap/.

摘要

随着下一代测序(NGS)技术的进步,越来越多雄心勃勃的应用变得可行。其中一个特别强大的应用是对多态性、混合样本进行测序。该池可以是自然发生的,例如在血液样本中存在多种病原体菌株、在癌组织样本中存在多种类型的细胞、或在细胞中存在多种 mRNA 同工型的情况下。在这些情况下,在测序之前很难或不可能通过实验对亚类进行分区,因此必须推断这些亚类频率。此外,研究人员可能偶尔出于成本效益的原因希望人工混合大量个体的样本,例如,当使用 bulked segregant analysis 进行遗传图谱绘制时。在这里,我们描述了 PoolHap,这是一种当已知单倍型时从混合样本中推断单倍型频率的计算工具。PoolHap 之所以有效的关键见解是,基因组覆盖范围内大量的 SNP 可以弥补基因组覆盖不均的问题。使用模拟和真实数据说明了和讨论了 PoolHap 的性能。我们表明,PoolHap 能够以小于 2%的误差准确估计具有 2X 总覆盖度的拟南芥全基因组多态性数据的 34 株混合物的单倍型比例。该方法应该有助于更好地了解难以或不可能通过实验分离的异质样本的生物学特性。软件和用户手册可在 http://arabidopsis.gmi.oeaw.ac.at/quan/poolhap/ 免费获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/010c/3016441/d071db6319c8/pone.0015292.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/010c/3016441/337dc566c070/pone.0015292.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/010c/3016441/d071db6319c8/pone.0015292.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/010c/3016441/337dc566c070/pone.0015292.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/010c/3016441/d071db6319c8/pone.0015292.g002.jpg

相似文献

1
PoolHap: inferring haplotype frequencies from pooled samples by next generation sequencing.PoolHap:通过下一代测序从混合样本中推断单倍型频率。
PLoS One. 2011 Jan 5;6(1):e15292. doi: 10.1371/journal.pone.0015292.
2
Accurate estimation of haplotype frequency from pooled sequencing data and cost-effective identification of rare haplotype carriers by overlapping pool sequencing.基于合并测序数据的单体型频率精确估计和重叠池测序进行成本效益的罕见单体型携带者鉴定。
Bioinformatics. 2015 Feb 15;31(4):515-22. doi: 10.1093/bioinformatics/btu670. Epub 2014 Oct 9.
3
Ehapp2: Estimate haplotype frequencies from pooled sequencing data with prior database information.Ehapp2:利用先前的数据库信息从混合测序数据中估计单倍型频率。
J Bioinform Comput Biol. 2016 Aug;14(4):1650017. doi: 10.1142/S0219720016500177. Epub 2016 Apr 14.
4
Accurate Allele Frequencies from Ultra-low Coverage Pool-Seq Samples in Evolve-and-Resequence Experiments.从进化重测序实验的超低覆盖度池测序样本中获得准确的等位基因频率。
G3 (Bethesda). 2019 Dec 3;9(12):4159-4168. doi: 10.1534/g3.119.400755.
5
Computational Haplotype Inference from Pooled Samples.
Methods Mol Biol. 2017;1551:309-319. doi: 10.1007/978-1-4939-6750-6_15.
6
PDA: Pooled DNA analyzer.PDA:聚合DNA分析仪。
BMC Bioinformatics. 2006 Apr 28;7:233. doi: 10.1186/1471-2105-7-233.
7
How to optimize the precision of allele and haplotype frequency estimates using pooled-sequencing data.如何使用汇集测序数据优化等位基因和单倍型频率估计的精度。
Mol Ecol Resour. 2018 Mar;18(2):194-203. doi: 10.1111/1755-0998.12723. Epub 2017 Nov 4.
8
The next generation of molecular markers from massively parallel sequencing of pooled DNA samples.基于 DNA 样本池的高通量测序的下一代分子标记物。
Genetics. 2010 Sep;186(1):207-18. doi: 10.1534/genetics.110.114397. Epub 2010 May 10.
9
Pool-hmm: a Python program for estimating the allele frequency spectrum and detecting selective sweeps from next generation sequencing of pooled samples.Pool-hmm:一个用于从混合样本的下一代测序中估计等位基因频率谱和检测选择清除的 Python 程序。
Mol Ecol Resour. 2013 Mar;13(2):337-40. doi: 10.1111/1755-0998.12063. Epub 2013 Jan 11.
10
MixClone: a mixture model for inferring tumor subclonal populations.MixClone:一种用于推断肿瘤亚克隆群体的混合模型。
BMC Genomics. 2015;16 Suppl 2(Suppl 2):S1. doi: 10.1186/1471-2164-16-S2-S1. Epub 2015 Jan 21.

引用本文的文献

1
"Select and Resequence" Methods Enable a Genome-Wide Association Study of the Dimorphic Human Fungal Pathogen Coccidioides posadasii.“选择与重测序”方法助力对双态人类真菌病原体波萨达斯球孢子菌进行全基因组关联研究。
Genome Biol Evol. 2025 Jul 3;17(7). doi: 10.1093/gbe/evaf135.
2
Detecting complex infections in trypanosomatids using whole genome sequencing.利用全基因组测序检测原生动物中的复杂感染。
BMC Genomics. 2024 Oct 29;25(1):1011. doi: 10.1186/s12864-024-10862-6.
3
Haplotype based testing for a better understanding of the selective architecture.

本文引用的文献

1
Isolation, gene cloning and expression profile of a pathogen recognition protein: a serine proteinase homolog (Sp-SPH) involved in the antibacterial response in the crab Scylla paramamosain.病原菌识别蛋白(丝氨酸蛋白酶同源物 Sp-SPH)的分离、基因克隆及其在锯缘青蟹抗菌反应中的表达谱分析。
Dev Comp Immunol. 2010 Jul;34(7):741-8. doi: 10.1016/j.dci.2010.02.005. Epub 2010 Feb 20.
2
Population resequencing reveals local adaptation of Arabidopsis lyrata to serpentine soils.群体重测序揭示拟南芥对蛇纹石土壤的局部适应。
Nat Genet. 2010 Mar;42(3):260-3. doi: 10.1038/ng.515. Epub 2010 Jan 24.
3
Computational methods for discovering structural variation with next-generation sequencing.
基于单体型的检测以更好地理解选择架构。
BMC Bioinformatics. 2023 Aug 26;24(1):322. doi: 10.1186/s12859-023-05437-3.
4
Adaptation in Outbred Sexual Yeast is Repeatable, Polygenic and Favors Rare Haplotypes.异交性酵母的适应性是可重复的、多基因的,并有利于罕见的单倍型。
Mol Biol Evol. 2022 Dec 5;39(12). doi: 10.1093/molbev/msac248.
5
High-throughput estimation of allele frequencies using combined pooled-population sequencing and haplotype-based data processing.利用合并群体测序和基于单倍型的数据处理进行等位基因频率的高通量估计。
Plant Methods. 2022 Mar 21;18(1):34. doi: 10.1186/s13007-022-00852-8.
6
Powerful, efficient QTL mapping in Drosophila melanogaster using bulked phenotyping and pooled sequencing.利用表型混池和测序池进行黑腹果蝇中强大、高效的 QTL 作图。
Genetics. 2022 Mar 3;220(3). doi: 10.1093/genetics/iyab238.
7
Reconstruction of Microbial Haplotypes by Integration of Statistical and Physical Linkage in Scaffolding.通过在支架中整合统计和物理连锁信息重建微生物单倍型。
Mol Biol Evol. 2021 May 19;38(6):2660-2672. doi: 10.1093/molbev/msab037.
8
Detecting selected haplotype blocks in evolve and resequence experiments.检测进化和重测序实验中的选定单倍型块。
Mol Ecol Resour. 2021 Jan;21(1):93-109. doi: 10.1111/1755-0998.13244. Epub 2020 Sep 6.
9
Two Synthetic 18-Way Outcrossed Populations of Diploid Budding Yeast with Utility for Complex Trait Dissection.两个用于复杂性状剖析的合成十八交杂种二倍体芽殖酵母群体。
Genetics. 2020 Jun;215(2):323-342. doi: 10.1534/genetics.120.303202. Epub 2020 Apr 2.
10
Accurate Allele Frequencies from Ultra-low Coverage Pool-Seq Samples in Evolve-and-Resequence Experiments.从进化重测序实验的超低覆盖度池测序样本中获得准确的等位基因频率。
G3 (Bethesda). 2019 Dec 3;9(12):4159-4168. doi: 10.1534/g3.119.400755.
利用下一代测序技术发现结构变异的计算方法
Nat Methods. 2009 Nov;6(11 Suppl):S13-20. doi: 10.1038/nmeth.1374.
4
The Sequence Alignment/Map format and SAMtools.序列比对/映射格式和 SAMtools。
Bioinformatics. 2009 Aug 15;25(16):2078-9. doi: 10.1093/bioinformatics/btp352. Epub 2009 Jun 8.
5
Fast and accurate short read alignment with Burrows-Wheeler transform.使用Burrows-Wheeler变换进行快速准确的短读比对。
Bioinformatics. 2009 Jul 15;25(14):1754-60. doi: 10.1093/bioinformatics/btp324. Epub 2009 May 18.
6
Statistical inferences for isoform expression in RNA-Seq.RNA测序中异构体表达的统计推断。
Bioinformatics. 2009 Apr 15;25(8):1026-32. doi: 10.1093/bioinformatics/btp113. Epub 2009 Feb 25.
7
Mitochondrial DNA mutations and aging: devils in the details?线粒体DNA突变与衰老:细节中的魔鬼?
Trends Genet. 2009 Feb;25(2):91-8. doi: 10.1016/j.tig.2008.11.007. Epub 2008 Dec 26.
8
High-throughput isolation and mapping of C. elegans mutants susceptible to pathogen infection.秀丽隐杆线虫病原体感染易感突变体的高通量分离与定位
PLoS One. 2008 Aug 6;3(8):e2882. doi: 10.1371/journal.pone.0002882.
9
Isolation and identification of the human pathogen Pythium insidiosum from environmental samples collected in Thai agricultural areas.从泰国农业地区采集的环境样本中分离和鉴定人类病原体隐秘腐霉。
Med Mycol. 2008 Feb;46(1):41-52. doi: 10.1080/13693780701513840.
10
Prospective identification of tumorigenic breast cancer cells.致瘤性乳腺癌细胞的前瞻性鉴定。
Proc Natl Acad Sci U S A. 2003 Apr 1;100(7):3983-8. doi: 10.1073/pnas.0530291100. Epub 2003 Mar 10.