• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

s-dePooler:从重叠 DNA 池确定多态性携带者。

s-dePooler: determination of polymorphism carriers from overlapping DNA pools.

机构信息

Research Department of Non-Coronary Heart Diseases, Almazov National Medical Research Center, Ministry of Health of Russia, 2 Akkuratova St., St. Petersburg, 197341, Russia.

All-Russia Research Institute for Agricultural Microbiology (ARRIAM), 3 Podbelsky Ch., St. Petersburg - Pushkin, 196608, Russia.

出版信息

BMC Bioinformatics. 2019 Jan 22;20(1):45. doi: 10.1186/s12859-019-2616-9.

DOI:10.1186/s12859-019-2616-9
PMID:30669964
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6343301/
Abstract

BACKGROUND

Samples pooling is a method widely used in studies to reduce costs and labour. DNA sample pooling combined with massive parallel sequencing is a powerful tool for discovering DNA variants (polymorphisms) in large analysing populations, which is the base of such research fields as Genome-Wide Association Studies, evolutionary and population studies, etc. Usage of overlapping pools where each sample is present in multiple pools can enhance the accuracy of polymorphism detection and allow identifying carriers of rare-variants. Surprisingly there is a lack of tools for result interpretation and carrier identification, i.e. for "depooling".

RESULTS

Here we present s-dePooler, the application for analysis of pooling experiments data. s-dePooler uses the variants information (VCF-file) and the pooling scheme to produce a list of candidate carriers for each polymorphism. We incorporated s-dePooler into a pipeline (dePoP) for automation of pooling analysis. The performance of the pipeline was tested on a synthetic dataset built using the 1000 Genomes Project data, resulting in the successful identification 97% of carriers of polymorphisms present in fewer than ~ 10% of carriers.

CONCLUSIONS

s-dePooler along with dePoP can be used to identify carriers of polymorphisms in overlapping pools, and is compatible with any pooling scheme with equivalent molar ratios of pooled samples. s-dePooler and dePoP with usage instructions and test data are freely available at https://github.com/lab9arriam/depop .

摘要

背景

样本池化是一种广泛应用于研究以降低成本和劳动力的方法。将 DNA 样本池化与大规模平行测序相结合,是在大型分析人群中发现 DNA 变体(多态性)的强大工具,这是全基因组关联研究、进化和群体研究等领域的基础。使用重叠池,其中每个样本存在于多个池中,可以提高多态性检测的准确性,并允许识别罕见变体的携带者。令人惊讶的是,缺乏用于结果解释和携带者识别的工具,即“解池化”。

结果

在这里,我们介绍了 s-dePooler,这是一种用于分析池化实验数据的应用程序。s-dePooler 使用变体信息(VCF 文件)和池化方案,为每个多态性生成候选携带者列表。我们将 s-dePooler 纳入了一个用于自动化池化分析的管道(dePoP)中。该管道的性能在使用 1000 基因组计划数据构建的合成数据集上进行了测试,成功识别了少于 ~10%携带者的多态性携带者的 97%。

结论

s-dePooler 与 dePoP 一起可用于识别重叠池中的多态性携带者,并且与任何具有等效摩尔比的池化方案兼容。s-dePooler 和 dePoP 带有使用说明和测试数据可在 https://github.com/lab9arriam/depop 上免费获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e930/6343301/a4d325fc7c06/12859_2019_2616_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e930/6343301/a4d325fc7c06/12859_2019_2616_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e930/6343301/a4d325fc7c06/12859_2019_2616_Fig1_HTML.jpg

相似文献

1
s-dePooler: determination of polymorphism carriers from overlapping DNA pools.s-dePooler:从重叠 DNA 池确定多态性携带者。
BMC Bioinformatics. 2019 Jan 22;20(1):45. doi: 10.1186/s12859-019-2616-9.
2
A statistical method for the detection of variants from next-generation resequencing of DNA pools.一种用于从 DNA 池的下一代重测序中检测变异的统计方法。
Bioinformatics. 2010 Jun 15;26(12):i318-24. doi: 10.1093/bioinformatics/btq214.
3
Overlapping pools for high-throughput targeted resequencing.用于高通量靶向重测序的重叠文库。
Genome Res. 2009 Jul;19(7):1254-61. doi: 10.1101/gr.088559.108. Epub 2009 May 15.
4
Weighted pooling--practical and cost-effective techniques for pooled high-throughput sequencing.加权池化——高通量测序池化的实用且具有成本效益的技术。
Bioinformatics. 2012 Jun 15;28(12):i197-206. doi: 10.1093/bioinformatics/bts208.
5
Rare variant discovery and calling by sequencing pooled samples with overlaps.重叠测序池样本进行罕见变异发现和调用。
Bioinformatics. 2013 Jan 1;29(1):29-38. doi: 10.1093/bioinformatics/bts645. Epub 2012 Oct 27.
6
Quantitative group testing-based overlapping pool sequencing to identify rare variant carriers.基于定量分组检测的重叠池测序技术,用于鉴定罕见变异携带者。
BMC Bioinformatics. 2014 Jun 17;15:195. doi: 10.1186/1471-2105-15-195.
7
Identifying rare variants with optimal depth of coverage and cost-effective overlapping pool sequencing.采用最佳覆盖深度和具有成本效益的重叠池测序鉴定罕见变异。
Genet Epidemiol. 2013 Dec;37(8):820-30. doi: 10.1002/gepi.21769. Epub 2013 Oct 28.
8
A two-dimensional pooling strategy for rare variant detection on next-generation sequencing platforms.基于下一代测序平台的稀有变异检测的二维池化策略。
PLoS One. 2014 Apr 11;9(4):e93455. doi: 10.1371/journal.pone.0093455. eCollection 2014.
9
Variant identification in multi-sample pools by illumina genome analyzer sequencing.通过Illumina基因组分析仪测序对多样本池进行变异鉴定。
J Biomol Tech. 2011 Jul;22(2):74-84.
10
SNPs/pools: a methodology for the identification of relevant SNPs in breast cancer epidemiology.单核苷酸多态性/池:一种用于鉴定乳腺癌流行病学中相关单核苷酸多态性的方法。
Oncol Rep. 2012 Feb;27(2):511-6. doi: 10.3892/or.2011.1523. Epub 2011 Oct 24.

本文引用的文献

1
Beta-Binomial Model for the Detection of Rare Mutations in Pooled Next-Generation Sequencing Experiments.用于在混合下一代测序实验中检测罕见突变的贝塔-二项式模型。
J Comput Biol. 2017 Apr;24(4):357-367. doi: 10.1089/cmb.2016.0106. Epub 2016 Sep 15.
2
A global reference for human genetic variation.人类遗传变异的全球参考。
Nature. 2015 Oct 1;526(7571):68-74. doi: 10.1038/nature15393.
3
Computational framework for next-generation sequencing of heterogeneous viral populations using combinatorial pooling.基于组合池化的异质病毒群体下一代测序计算框架
Bioinformatics. 2015 Mar 1;31(5):682-90. doi: 10.1093/bioinformatics/btu726. Epub 2014 Oct 29.
4
Identifying rare variants with optimal depth of coverage and cost-effective overlapping pool sequencing.采用最佳覆盖深度和具有成本效益的重叠池测序鉴定罕见变异。
Genet Epidemiol. 2013 Dec;37(8):820-30. doi: 10.1002/gepi.21769. Epub 2013 Oct 28.
5
A statistical framework for SNP calling, mutation discovery, association mapping and population genetical parameter estimation from sequencing data.一种用于从测序数据中进行 SNP 调用、突变发现、关联映射和群体遗传参数估计的统计框架。
Bioinformatics. 2011 Nov 1;27(21):2987-93. doi: 10.1093/bioinformatics/btr509. Epub 2011 Sep 8.
6
The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data.基因组分析工具包:一种用于分析下一代 DNA 测序数据的 MapReduce 框架。
Genome Res. 2010 Sep;20(9):1297-303. doi: 10.1101/gr.107524.110. Epub 2010 Jul 19.
7
DNA Sudoku--harnessing high-throughput sequencing for multiplexed specimen analysis.DNA数独——利用高通量测序进行多重样本分析。
Genome Res. 2009 Jul;19(7):1243-53. doi: 10.1101/gr.092957.109. Epub 2009 May 15.
8
Overlapping pools for high-throughput targeted resequencing.用于高通量靶向重测序的重叠文库。
Genome Res. 2009 Jul;19(7):1254-61. doi: 10.1101/gr.088559.108. Epub 2009 May 15.
9
A new pooling strategy for high-throughput screening: the Shifted Transversal Design.一种用于高通量筛选的新型合并策略:移位横向设计。
BMC Bioinformatics. 2006 Jan 19;7:28. doi: 10.1186/1471-2105-7-28.
10
Theoretical analysis of library screening using a N-dimensional pooling strategy.使用N维合并策略进行文库筛选的理论分析。
Nucleic Acids Res. 1991 Nov 25;19(22):6241-7. doi: 10.1093/nar/19.22.6241.