• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

appreci8:一个集成了 8 种工具的精确变异调用管道。

appreci8: a pipeline for precise variant calling integrating 8 tools.

机构信息

Institute of Medical Informatics, University of Münster, Münster, Germany.

Department of Medicine Solna, Karolinska Institutet, Stockholm, Sweden.

出版信息

Bioinformatics. 2018 Dec 15;34(24):4205-4212. doi: 10.1093/bioinformatics/bty518.

DOI:10.1093/bioinformatics/bty518
PMID:29945233
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6289140/
Abstract

MOTIVATION

The application of next-generation sequencing in research and particularly in clinical routine requires valid variant calling results. However, evaluation of several commonly used tools has pointed out that not a single tool meets this requirement. False positive as well as false negative calls necessitate additional experiments and extensive manual work. Intelligent combination and output filtration of different tools could significantly improve the current situation.

RESULTS

We developed appreci8, an automatic variant calling pipeline for calling single nucleotide variants and short indels by combining and filtering the output of eight open-source variant calling tools, based on a novel artifact- and polymorphism score. Appreci8 was trained on two data sets from patients with myelodysplastic syndrome, covering 165 Illumina samples. Subsequently, appreci8's performance was tested on five independent data sets, covering 513 samples. Variation in sequencing platform, target region and disease entity was considered. All calls were validated by re-sequencing on the same platform, a different platform or expert-based review. Sensitivity of appreci8 ranged between 0.93 and 1.00, while positive predictive value ranged between 0.65 and 1.00. In all cases, appreci8 showed superior performance compared to any evaluated alternative approach.

AVAILABILITY AND IMPLEMENTATION

Appreci8 is freely available at https://hub.docker.com/r/wwuimi/appreci8/. Sequencing data (BAM files) of the 678 patients analyzed with appreci8 have been deposited into the NCBI Sequence Read Archive (BioProjectID: 388411; https://www.ncbi.nlm.nih.gov/bioproject/PRJNA388411).

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

下一代测序在研究中的应用,尤其是在临床常规中,需要有效的变异调用结果。然而,对几种常用工具的评估指出,没有一种工具能够满足这一要求。假阳性和假阴性调用需要额外的实验和广泛的人工工作。不同工具的智能组合和输出过滤可以显著改善当前的情况。

结果

我们开发了 appreci8,这是一种通过组合和过滤 8 种开源变异调用工具的输出,基于新的伪影和多态性评分,用于调用单核苷酸变异和短插入缺失的自动变异调用管道。Appreci8 是在两个来自骨髓增生异常综合征患者的数据集上进行训练的,涵盖了 165 个 Illumina 样本。随后,在五个独立的数据集上测试了 appreci8 的性能,涵盖了 513 个样本。考虑了测序平台、目标区域和疾病实体的变化。所有的调用都通过在同一平台、不同平台或基于专家的审查上重新测序进行了验证。Appreci8 的敏感性在 0.93 到 1.00 之间,而阳性预测值在 0.65 到 1.00 之间。在所有情况下,Appreci8 的表现都优于任何评估的替代方法。

可用性和实现

Appreci8 可在 https://hub.docker.com/r/wwuimi/appreci8/ 免费获得。使用 appreci8 分析的 678 名患者的测序数据(BAM 文件)已被存入 NCBI 序列读取档案(生物项目 ID:388411;https://www.ncbi.nlm.nih.gov/bioproject/PRJNA388411)。

补充信息

补充数据可在《生物信息学》在线获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c71c/6289140/2e391979a9d9/bty518f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c71c/6289140/c1e1c7643764/bty518f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c71c/6289140/4e05f364dd6d/bty518f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c71c/6289140/a9d75bc2d59f/bty518f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c71c/6289140/2e391979a9d9/bty518f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c71c/6289140/c1e1c7643764/bty518f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c71c/6289140/4e05f364dd6d/bty518f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c71c/6289140/a9d75bc2d59f/bty518f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c71c/6289140/2e391979a9d9/bty518f4.jpg

相似文献

1
appreci8: a pipeline for precise variant calling integrating 8 tools.appreci8:一个集成了 8 种工具的精确变异调用管道。
Bioinformatics. 2018 Dec 15;34(24):4205-4212. doi: 10.1093/bioinformatics/bty518.
2
Evaluating Variant Calling Tools for Non-Matched Next-Generation Sequencing Data.评估用于非配对下一代测序数据的变异调用工具。
Sci Rep. 2017 Feb 24;7:43169. doi: 10.1038/srep43169.
3
AMLVaran: a software approach to implement variant analysis of targeted NGS sequencing data in an oncological care setting.AMLVaran:一种在肿瘤护理环境中对靶向NGS测序数据进行变异分析的软件方法。
BMC Med Genomics. 2020 Feb 4;13(1):17. doi: 10.1186/s12920-020-0668-3.
4
tarSVM: Improving the accuracy of variant calls derived from microfluidic PCR-based targeted next generation sequencing using a support vector machine.tarSVM:使用支持向量机提高基于微流控PCR的靶向新一代测序得出的变异检测准确性。
BMC Bioinformatics. 2016 Jun 10;17(1):233. doi: 10.1186/s12859-016-1108-4.
5
VIPER: a web application for rapid expert review of variant calls.VIPER:一个用于快速专家变异调用审查的网络应用程序。
Bioinformatics. 2018 Jun 1;34(11):1928-1929. doi: 10.1093/bioinformatics/bty022.
6
Consensus Genotyper for Exome Sequencing (CGES): improving the quality of exome variant genotypes.外显子组测序一致性基因分型器(CGES):提高外显子组变异基因型的质量
Bioinformatics. 2015 Jan 15;31(2):187-93. doi: 10.1093/bioinformatics/btu591. Epub 2014 Sep 29.
7
Using genotype array data to compare multi- and single-sample variant calls and improve variant call sets from deep coverage whole-genome sequencing data.利用基因型阵列数据比较多样本和单样本变异检测结果,并改进来自深度覆盖全基因组测序数据的变异检测集。
Bioinformatics. 2017 Apr 15;33(8):1147-1153. doi: 10.1093/bioinformatics/btw786.
8
Performance evaluation of pipelines for mapping, variant calling and interval padding, for the analysis of NGS germline panels.用于分析NGS种系基因检测板的映射、变异位点检测和区间填充流程的性能评估。
BMC Bioinformatics. 2021 Apr 28;22(1):218. doi: 10.1186/s12859-021-04144-1.
9
GLM-based optimization of NGS data analysis: A case study of Roche 454, Ion Torrent PGM and Illumina NextSeq sequencing data.基于广义线性模型的二代测序数据分析优化:以罗氏454、Ion Torrent PGM和Illumina NextSeq测序数据为例
PLoS One. 2017 Feb 21;12(2):e0171983. doi: 10.1371/journal.pone.0171983. eCollection 2017.
10
VariantMetaCaller: automated fusion of variant calling pipelines for quantitative, precision-based filtering.变异元调用器:用于基于定量、精确性筛选的变异调用流程的自动融合。
BMC Genomics. 2015 Oct 28;16:875. doi: 10.1186/s12864-015-2050-y.

引用本文的文献

1
Somatic mutations and DNA methylation identify a subgroup of poor prognosis within lower-risk myelodysplastic syndromes.体细胞突变和DNA甲基化可识别低危骨髓增生异常综合征中预后不良的一个亚组。
Hemasphere. 2025 Jan 22;9(1):e70073. doi: 10.1002/hem3.70073. eCollection 2025 Jan.
2
Dynamic microfluidic single-cell screening identifies pheno-tuning compounds to potentiate tuberculosis therapy.动态微流控单细胞筛选鉴定出表型调节化合物,以增强结核病治疗效果。
Nat Commun. 2024 May 16;15(1):4175. doi: 10.1038/s41467-024-48269-2.
3
Structural, topological, and functional characterization of transmembrane proteins TMEM213, 207, 116, 72 and 30B provides a potential link to ccRCC etiology.

本文引用的文献

1
ISOWN: accurate somatic mutation identification in the absence of normal tissue controls.ISOWN:在无正常组织对照的情况下准确识别体细胞突变
Genome Med. 2017 Jun 29;9(1):59. doi: 10.1186/s13073-017-0446-9.
2
Mutation matters in precision medicine: A future to believe in.突变在精准医学中至关重要:一个值得期待的未来。
Cancer Treat Rev. 2017 Apr;55:136-149. doi: 10.1016/j.ctrv.2017.03.002. Epub 2017 Mar 16.
3
Evaluating Variant Calling Tools for Non-Matched Next-Generation Sequencing Data.评估用于非配对下一代测序数据的变异调用工具。
跨膜蛋白TMEM213、207、116、72和30B的结构、拓扑和功能特征为透明细胞肾细胞癌(ccRCC)的病因提供了潜在联系。
Am J Cancer Res. 2023 May 15;13(5):1863-1883. eCollection 2023.
4
Resources and tools for rare disease variant interpretation.罕见病变异解读的资源与工具。
Front Mol Biosci. 2023 May 10;10:1169109. doi: 10.3389/fmolb.2023.1169109. eCollection 2023.
5
Simple combination of multiple somatic variant callers to increase accuracy.多种体细胞变异 caller 的简单组合可提高准确性。
Sci Rep. 2023 May 25;13(1):8463. doi: 10.1038/s41598-023-34925-y.
6
Performance comparisons between clustering models for reconstructing NGS results from technical replicates.用于从技术重复样本中重建二代测序结果的聚类模型之间的性能比较。
Front Genet. 2023 Mar 16;14:1148147. doi: 10.3389/fgene.2023.1148147. eCollection 2023.
7
Clonal Evolution at First Sight: A Combined Visualization of Diverse Diagnostic Methods Improves Understanding of Leukemic Progression.初窥克隆进化:多种诊断方法的联合可视化有助于加深对白血病进展的理解。
Front Oncol. 2022 Jul 8;12:888114. doi: 10.3389/fonc.2022.888114. eCollection 2022.
8
Exploring Current Challenges and Perspectives for Automatic Reconstruction of Clonal Evolution.探索克隆进化自动重建的当前挑战和观点。
Cancer Genomics Proteomics. 2022 Mar-Apr;19(2):194-204. doi: 10.21873/cgp.20314.
9
Divergent Effects of EZH1 and EZH2 Protein Expression on the Prognosis of Patients with T-Cell Lymphomas.EZH1和EZH2蛋白表达对T细胞淋巴瘤患者预后的不同影响。
Biomedicines. 2021 Dec 5;9(12):1842. doi: 10.3390/biomedicines9121842.
10
Comparison of Open-access Databases for Clinical Variant Interpretation in Cancer: A Case Study of MDS/AML.癌症临床变异解读开放获取数据库的比较:以骨髓增生异常综合征/急性髓系白血病为例
Cancer Genomics Proteomics. 2021 Mar-Apr;18(2):157-166. doi: 10.21873/cgp.20250.
Sci Rep. 2017 Feb 24;7:43169. doi: 10.1038/srep43169.
4
GLM-based optimization of NGS data analysis: A case study of Roche 454, Ion Torrent PGM and Illumina NextSeq sequencing data.基于广义线性模型的二代测序数据分析优化:以罗氏454、Ion Torrent PGM和Illumina NextSeq测序数据为例
PLoS One. 2017 Feb 21;12(2):e0171983. doi: 10.1371/journal.pone.0171983. eCollection 2017.
5
Dermatologist-level classification of skin cancer with deep neural networks.基于深度神经网络的皮肤癌皮肤科医生级分类。
Nature. 2017 Feb 2;542(7639):115-118. doi: 10.1038/nature21056. Epub 2017 Jan 25.
6
Sanger Confirmation Is Required to Achieve Optimal Sensitivity and Specificity in Next-Generation Sequencing Panel Testing.在新一代测序 panel 检测中,需要进行桑格验证以实现最佳的灵敏度和特异性。
J Mol Diagn. 2016 Nov;18(6):923-932. doi: 10.1016/j.jmoldx.2016.07.006. Epub 2016 Oct 6.
7
Analysis of protein-coding genetic variation in 60,706 humans.对60706名人类的蛋白质编码基因变异进行分析。
Nature. 2016 Aug 18;536(7616):285-91. doi: 10.1038/nature19057.
8
SiNVICT: ultra-sensitive detection of single nucleotide variants and indels in circulating tumour DNA.SiNVICT:循环肿瘤 DNA 中单核苷酸变异和插入缺失的超灵敏检测。
Bioinformatics. 2017 Jan 1;33(1):26-34. doi: 10.1093/bioinformatics/btw536. Epub 2016 Aug 16.
9
Towards precision medicine.迈向精准医学。
Nat Rev Genet. 2016 Aug 16;17(9):507-22. doi: 10.1038/nrg.2016.86.
10
The Ensembl gene annotation system.Ensembl基因注释系统。
Database (Oxford). 2016 Jun 23;2016. doi: 10.1093/database/baw093. Print 2016.