s-dePooler：从重叠 DNA 池确定多态性携带者。

s-dePooler: determination of polymorphism carriers from overlapping DNA pools.

机构信息

Research Department of Non-Coronary Heart Diseases, Almazov National Medical Research Center, Ministry of Health of Russia, 2 Akkuratova St., St. Petersburg, 197341, Russia.

All-Russia Research Institute for Agricultural Microbiology (ARRIAM), 3 Podbelsky Ch., St. Petersburg - Pushkin, 196608, Russia.

出版信息

BMC Bioinformatics. 2019 Jan 22;20(1):45. doi: 10.1186/s12859-019-2616-9.

DOI:10.1186/s12859-019-2616-9

PMID:30669964

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6343301/

Abstract

BACKGROUND

Samples pooling is a method widely used in studies to reduce costs and labour. DNA sample pooling combined with massive parallel sequencing is a powerful tool for discovering DNA variants (polymorphisms) in large analysing populations, which is the base of such research fields as Genome-Wide Association Studies, evolutionary and population studies, etc. Usage of overlapping pools where each sample is present in multiple pools can enhance the accuracy of polymorphism detection and allow identifying carriers of rare-variants. Surprisingly there is a lack of tools for result interpretation and carrier identification, i.e. for "depooling".

RESULTS

Here we present s-dePooler, the application for analysis of pooling experiments data. s-dePooler uses the variants information (VCF-file) and the pooling scheme to produce a list of candidate carriers for each polymorphism. We incorporated s-dePooler into a pipeline (dePoP) for automation of pooling analysis. The performance of the pipeline was tested on a synthetic dataset built using the 1000 Genomes Project data, resulting in the successful identification 97% of carriers of polymorphisms present in fewer than ~ 10% of carriers.

CONCLUSIONS

s-dePooler along with dePoP can be used to identify carriers of polymorphisms in overlapping pools, and is compatible with any pooling scheme with equivalent molar ratios of pooled samples. s-dePooler and dePoP with usage instructions and test data are freely available at https://github.com/lab9arriam/depop .

摘要

背景

样本池化是一种广泛应用于研究以降低成本和劳动力的方法。将 DNA 样本池化与大规模平行测序相结合，是在大型分析人群中发现 DNA 变体（多态性）的强大工具，这是全基因组关联研究、进化和群体研究等领域的基础。使用重叠池，其中每个样本存在于多个池中，可以提高多态性检测的准确性，并允许识别罕见变体的携带者。令人惊讶的是，缺乏用于结果解释和携带者识别的工具，即“解池化”。

结果

在这里，我们介绍了 s-dePooler，这是一种用于分析池化实验数据的应用程序。s-dePooler 使用变体信息（VCF 文件）和池化方案，为每个多态性生成候选携带者列表。我们将 s-dePooler 纳入了一个用于自动化池化分析的管道（dePoP）中。该管道的性能在使用 1000 基因组计划数据构建的合成数据集上进行了测试，成功识别了少于 ~10%携带者的多态性携带者的 97%。

结论

s-dePooler 与 dePoP 一起可用于识别重叠池中的多态性携带者，并且与任何具有等效摩尔比的池化方案兼容。s-dePooler 和 dePoP 带有使用说明和测试数据可在 https://github.com/lab9arriam/depop 上免费获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e930/6343301/a4d325fc7c06/12859_2019_2616_Fig1_HTML.jpg

相似文献

s-dePooler: determination of polymorphism carriers from overlapping DNA pools.

BMC Bioinformatics. 2019 Jan 22;20(1):45. doi: 10.1186/s12859-019-2616-9.

A statistical method for the detection of variants from next-generation resequencing of DNA pools.

Bioinformatics. 2010 Jun 15;26(12):i318-24. doi: 10.1093/bioinformatics/btq214.

Overlapping pools for high-throughput targeted resequencing.

Genome Res. 2009 Jul;19(7):1254-61. doi: 10.1101/gr.088559.108. Epub 2009 May 15.

Weighted pooling--practical and cost-effective techniques for pooled high-throughput sequencing.

Bioinformatics. 2012 Jun 15;28(12):i197-206. doi: 10.1093/bioinformatics/bts208.

Rare variant discovery and calling by sequencing pooled samples with overlaps.

Bioinformatics. 2013 Jan 1;29(1):29-38. doi: 10.1093/bioinformatics/bts645. Epub 2012 Oct 27.

Quantitative group testing-based overlapping pool sequencing to identify rare variant carriers.

BMC Bioinformatics. 2014 Jun 17;15:195. doi: 10.1186/1471-2105-15-195.

Identifying rare variants with optimal depth of coverage and cost-effective overlapping pool sequencing.

Genet Epidemiol. 2013 Dec;37(8):820-30. doi: 10.1002/gepi.21769. Epub 2013 Oct 28.

A two-dimensional pooling strategy for rare variant detection on next-generation sequencing platforms.

PLoS One. 2014 Apr 11;9(4):e93455. doi: 10.1371/journal.pone.0093455. eCollection 2014.

Variant identification in multi-sample pools by illumina genome analyzer sequencing.

J Biomol Tech. 2011 Jul;22(2):74-84.

SNPs/pools: a methodology for the identification of relevant SNPs in breast cancer epidemiology.

Oncol Rep. 2012 Feb;27(2):511-6. doi: 10.3892/or.2011.1523. Epub 2011 Oct 24.

本文引用的文献

Beta-Binomial Model for the Detection of Rare Mutations in Pooled Next-Generation Sequencing Experiments.

J Comput Biol. 2017 Apr;24(4):357-367. doi: 10.1089/cmb.2016.0106. Epub 2016 Sep 15.

A global reference for human genetic variation.

Nature. 2015 Oct 1;526(7571):68-74. doi: 10.1038/nature15393.

Computational framework for next-generation sequencing of heterogeneous viral populations using combinatorial pooling.

Bioinformatics. 2015 Mar 1;31(5):682-90. doi: 10.1093/bioinformatics/btu726. Epub 2014 Oct 29.

Identifying rare variants with optimal depth of coverage and cost-effective overlapping pool sequencing.

Genet Epidemiol. 2013 Dec;37(8):820-30. doi: 10.1002/gepi.21769. Epub 2013 Oct 28.

A statistical framework for SNP calling, mutation discovery, association mapping and population genetical parameter estimation from sequencing data.

Bioinformatics. 2011 Nov 1;27(21):2987-93. doi: 10.1093/bioinformatics/btr509. Epub 2011 Sep 8.

The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data.

Genome Res. 2010 Sep;20(9):1297-303. doi: 10.1101/gr.107524.110. Epub 2010 Jul 19.

DNA Sudoku--harnessing high-throughput sequencing for multiplexed specimen analysis.

Genome Res. 2009 Jul;19(7):1243-53. doi: 10.1101/gr.092957.109. Epub 2009 May 15.

Overlapping pools for high-throughput targeted resequencing.

Genome Res. 2009 Jul;19(7):1254-61. doi: 10.1101/gr.088559.108. Epub 2009 May 15.

A new pooling strategy for high-throughput screening: the Shifted Transversal Design.

BMC Bioinformatics. 2006 Jan 19;7:28. doi: 10.1186/1471-2105-7-28.

Theoretical analysis of library screening using a N-dimensional pooling strategy.

Nucleic Acids Res. 1991 Nov 25;19(22):6241-7. doi: 10.1093/nar/19.22.6241.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

s-dePooler：从重叠 DNA 池确定多态性携带者。

s-dePooler: determination of polymorphism carriers from overlapping DNA pools.

机构信息

出版信息

BACKGROUND

RESULTS

CONCLUSIONS

背景

结果

结论

相似文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

本文引用的文献