• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

DNAscan2:一种功能多样、可扩展且易于使用的人类下一代测序数据分析管道。

DNAscan2: a versatile, scalable, and user-friendly analysis pipeline for human next-generation sequencing data.

机构信息

Department of Basic and Clinical Neuroscience, King's College London, London, United Kingdom.

Department of Biostatistics and Health Informatics, King's College London, London, United Kingdom.

出版信息

Bioinformatics. 2023 Apr 3;39(4). doi: 10.1093/bioinformatics/btad152.

DOI:10.1093/bioinformatics/btad152
PMID:37010501
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10112953/
Abstract

SUMMARY

The current widespread adoption of next-generation sequencing (NGS) in all branches of basic research and clinical genetics fields means that users with highly variable informatics skills, computing facilities and application purposes need to process, analyse, and interpret NGS data. In this landscape, versatility, scalability, and user-friendliness are key characteristics for an NGS analysis software. We developed DNAscan2, a highly flexible, end-to-end pipeline for the analysis of NGS data, which (i) can be used for the detection of multiple variant types, including SNVs, small indels, transposable elements, short tandem repeats, and other large structural variants; (ii) covers all standard steps of NGS analysis, from quality control of raw data and genome alignment to variant calling, annotation, and generation of reports for the interpretation and prioritization of results; (iii) is highly adaptable as it can be deployed and run via either a graphic user interface for non-bioinformaticians and a command line tool for personal computer usage; (iv) is scalable as it can be executed in parallel as a Snakemake workflow, and; (v) is computationally efficient by minimizing RAM and CPU time requirements.

AVAILABILITY AND IMPLEMENTATION

DNAscan2 is implemented in Python3 and is available at https://github.com/KHP-Informatics/DNAscanv2.

摘要

摘要

目前,下一代测序(NGS)在基础研究和临床遗传学领域的各个分支中得到了广泛应用,这意味着具有高度变化的信息学技能、计算设施和应用目的的用户需要处理、分析和解释 NGS 数据。在这种情况下,多功能性、可扩展性和用户友好性是 NGS 分析软件的关键特征。我们开发了 DNAscan2,这是一个用于分析 NGS 数据的高度灵活的端到端管道,它 (i) 可用于检测多种变体类型,包括 SNVs、小插入缺失、转座元件、短串联重复序列和其他大型结构变体;(ii) 涵盖 NGS 分析的所有标准步骤,从原始数据的质量控制和基因组比对到变体调用、注释以及为解释和优先化结果生成报告;(iii) 高度适应,因为它可以通过图形用户界面部署和运行,适用于非生物信息学人员,也可以通过命令行工具在个人计算机上使用;(iv) 可扩展性强,因为它可以作为 Snakemake 工作流程并行执行;(v) 通过最小化 RAM 和 CPU 时间需求,计算效率高。

可用性和实现

DNAscan2 是用 Python3 实现的,可在 https://github.com/KHP-Informatics/DNAscanv2 上获得。

相似文献

1
DNAscan2: a versatile, scalable, and user-friendly analysis pipeline for human next-generation sequencing data.DNAscan2:一种功能多样、可扩展且易于使用的人类下一代测序数据分析管道。
Bioinformatics. 2023 Apr 3;39(4). doi: 10.1093/bioinformatics/btad152.
2
DNAscan: personal computer compatible NGS analysis, annotation and visualisation.DNAscan:个人计算机兼容的 NGS 分析、注释和可视化。
BMC Bioinformatics. 2019 Apr 27;20(1):213. doi: 10.1186/s12859-019-2791-8.
3
WEP: a high-performance analysis pipeline for whole-exome data.WEP:一种用于全外显子组数据的高性能分析管道。
BMC Bioinformatics. 2013;14 Suppl 7(Suppl 7):S11. doi: 10.1186/1471-2105-14-S7-S11. Epub 2013 Apr 22.
4
NGS-QCbox and Raspberry for Parallel, Automated and Rapid Quality Control Analysis of Large-Scale Next Generation Sequencing (Illumina) Data.用于大规模新一代测序(Illumina)数据并行、自动化和快速质量控制分析的NGS-QCbox与树莓派
PLoS One. 2015 Oct 13;10(10):e0139868. doi: 10.1371/journal.pone.0139868. eCollection 2015.
5
mInDel: a high-throughput and efficient pipeline for genome-wide InDel marker development.mInDel:一种用于全基因组插入缺失标记开发的高通量高效流程
BMC Genomics. 2016 Apr 14;17:290. doi: 10.1186/s12864-016-2614-5.
6
AMLVaran: a software approach to implement variant analysis of targeted NGS sequencing data in an oncological care setting.AMLVaran:一种在肿瘤护理环境中对靶向NGS测序数据进行变异分析的软件方法。
BMC Med Genomics. 2020 Feb 4;13(1):17. doi: 10.1186/s12920-020-0668-3.
7
AUSPP: A universal short-read pre-processing package.AUSPP:一个通用的短读长预处理程序包。
J Bioinform Comput Biol. 2019 Dec;17(6):1950037. doi: 10.1142/S0219720019500379.
8
grenepipe: a flexible, scalable and reproducible pipeline to automate variant calling from sequence reads. grenepipe:一个灵活、可扩展且可重复的管道,用于从序列读取中自动进行变体调用。
Bioinformatics. 2022 Oct 14;38(20):4809-4811. doi: 10.1093/bioinformatics/btac600.
9
VarGenius executes cohort-level DNA-seq variant calling and annotation and allows to manage the resulting data through a PostgreSQL database.VarGenius 执行队列级别的 DNA 测序变异调用和注释,并允许通过 PostgreSQL 数据库管理产生的数据。
BMC Bioinformatics. 2018 Dec 12;19(1):477. doi: 10.1186/s12859-018-2532-4.
10
COSAP: Comparative Sequencing Analysis Platform.COSAP:比较测序分析平台。
BMC Bioinformatics. 2024 Mar 26;25(1):130. doi: 10.1186/s12859-024-05756-z.

引用本文的文献

1
hDNApipe: streamlining human genome analysis and interpretation with an intuitive and user-friendly interface.hDNApipe:通过直观且用户友好的界面简化人类基因组分析与解读。
NAR Genom Bioinform. 2025 Jun 26;7(2):lqaf088. doi: 10.1093/nargab/lqaf088. eCollection 2025 Jun.
2
Investigating the Performance of Oxford Nanopore Long-Read Sequencing with Respect to Illumina Microarrays and Short-Read Sequencing.研究牛津纳米孔长读长测序相对于Illumina微阵列和短读长测序的性能。
Int J Mol Sci. 2025 May 8;26(10):4492. doi: 10.3390/ijms26104492.
3
aPEAch: Automated Pipeline for End-to-End Analysis of Epigenomic and Transcriptomic Data.aPEAch:用于表观基因组和转录组数据端到端分析的自动化流程
Biology (Basel). 2024 Jul 2;13(7):492. doi: 10.3390/biology13070492.
4
MIRACUM-Pipe: An Adaptable Pipeline for Next-Generation Sequencing Analysis, Reporting, and Visualization for Clinical Decision Making.MIRACUM-Pipe:一种适用于下一代测序分析、报告及可视化以支持临床决策的流程。
Cancers (Basel). 2023 Jul 1;15(13):3456. doi: 10.3390/cancers15133456.

本文引用的文献

1
AnnotSV and knotAnnotSV: a web server for human structural variations annotations, ranking and analysis.AnnotSV 和 knotAnnotSV:一个用于人类结构变异注释、排名和分析的网络服务器。
Nucleic Acids Res. 2021 Jul 2;49(W1):W21-W28. doi: 10.1093/nar/gkab402.
2
Parliament2: Accurate structural variant calling at scale.Parliament2:大规模精确的结构变异调用。
Gigascience. 2020 Dec 21;9(12). doi: 10.1093/gigascience/giaa145.
3
A structural variation reference for medical and population genetics.医学和人群遗传学的结构变异参考
Nature. 2020 May;581(7809):444-451. doi: 10.1038/s41586-020-2287-8. Epub 2020 May 27.
4
ExpansionHunter Denovo: a computational method for locating known and novel repeat expansions in short-read sequencing data.ExpansionHunter Denovo:一种在短读测序数据中定位已知和新的重复扩展的计算方法。
Genome Biol. 2020 Apr 28;21(1):102. doi: 10.1186/s13059-020-02017-z.
5
Graph-based genome alignment and genotyping with HISAT2 and HISAT-genotype.基于图的基因组比对和基因分型与 HISAT2 和 HISAT-genotype。
Nat Biotechnol. 2019 Aug;37(8):907-915. doi: 10.1038/s41587-019-0201-4. Epub 2019 Aug 2.
6
DNAscan: personal computer compatible NGS analysis, annotation and visualisation.DNAscan:个人计算机兼容的 NGS 分析、注释和可视化。
BMC Bioinformatics. 2019 Apr 27;20(1):213. doi: 10.1186/s12859-019-2791-8.
7
ALSgeneScanner: a pipeline for the analysis and interpretation of DNA sequencing data of ALS patients.ALSgeneScanner:用于分析和解释 ALS 患者 DNA 测序数据的管道。
Amyotroph Lateral Scler Frontotemporal Degener. 2019 May;20(3-4):207-215. doi: 10.1080/21678421.2018.1562553. Epub 2019 Mar 5.
8
Strelka2: fast and accurate calling of germline and somatic variants.Strelka2:快速准确地调用种系和体细胞变异。
Nat Methods. 2018 Aug;15(8):591-594. doi: 10.1038/s41592-018-0051-x. Epub 2018 Jul 16.
9
DNAp: A Pipeline for DNA-seq Data Analysis.DNAp:用于 DNA 测序数据分析的管道。
Sci Rep. 2018 May 1;8(1):6793. doi: 10.1038/s41598-018-25022-6.
10
AnnotSV: an integrated tool for structural variations annotation.AnnotSV:一个用于结构变异注释的集成工具。
Bioinformatics. 2018 Oct 15;34(20):3572-3574. doi: 10.1093/bioinformatics/bty304.