• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

Metapipeline-DNA:一个全面的种系和体细胞基因组学Nextflow流程。

Metapipeline-DNA: A Comprehensive Germline & Somatic Genomics Nextflow Pipeline.

作者信息

Patel Yash, Zhu Chenghao, Yamaguchi Takafumi N, Wang Nicholas K, Wiltsie Nicholas, Zeltser Nicole, Gonzalez Alfredo E, Winata Helena K, Pan Yu, Mootor Mohammed Faizal Eeman, Sanders Timothy, Fitz-Gibbon Sorel T, Kandoth Cyriac, Livingstone Julie, Liu Lydia Y, Carlin Benjamin, Holmes Aaron, Oh Jieun, Sahrmann John, Tao Shu, Eng Stefan, Hugh-White Rupert, Pashminehazar Kiarod, Park Andrew, Beshlikyan Arpi, Jordan Madison, Wu Selina, Tian Mao, Arbet Jaron, Neilsen Beth, Haas Roni, Bugh Yuan Zhe, Kim Gina, Salmingo Joseph, Zhang Wenshu, Anand Aakarsh, Hwang Edward, Neiman-Golden Anna, Steinberg Philippa, Zhao Wenyan, Anand Prateek, Agrawal Raag, Tsai Brandon L, Boutros Paul C

出版信息

bioRxiv. 2025 Apr 25:2024.09.04.611267. doi: 10.1101/2024.09.04.611267.

DOI:10.1101/2024.09.04.611267
PMID:39282325
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11398472/
Abstract

SUMMARY

The price, quality and throughout of DNA sequencing continue to improve. Algorithmic innovations have allowed inference of a growing range of features from DNA sequencing data, quantifying nuclear, mitochondrial and evolutionary aspects of both germline and somatic genomes. To automate analyses of the full range of genomic characteristics, we created an extensible Nextflow meta-pipeline called metapipeline-DNA. Metapipeline-DNA analyzes targeted and whole-genome sequencing data from raw reads through pre-processing, feature detection by multiple algorithms, quality-control and data- visualization. Each step can be run independently and is supported robust software engineering including automated failure-recovery, robust testing and consistent verifications of inputs, outputs and parameters. Metapipeline-DNA is cloud-compatible and highly configurable, with options to subset and optimize each analysis. Metapipeline-DNA facilitates high-scale, comprehensive analysis of DNA sequencing data.

AVAILABILITY

Metapipeline-DNA is an open-source Nextflow pipeline under the GPLv2 license and is available at https://github.com/uclahs-cds/metapipeline-DNA .

摘要

摘要

DNA测序的价格、质量和通量持续提高。算法创新使得能够从DNA测序数据中推断出越来越多的特征,对种系和体细胞基因组的核、线粒体及进化方面进行量化。为了实现对全基因组特征的自动化分析,我们创建了一个名为metapipeline-DNA的可扩展Nextflow元管道。metapipeline-DNA可分析来自原始读数的靶向和全基因组测序数据,包括预处理、通过多种算法进行特征检测、质量控制和数据可视化。每个步骤都可以独立运行,并得到强大的软件工程支持,包括自动故障恢复、严格测试以及对输入、输出和参数的一致性验证。metapipeline-DNA与云兼容且高度可配置,具有对每个分析进行子集化和优化的选项。metapipeline-DNA有助于对DNA测序数据进行大规模、全面的分析。

可用性

metapipeline-DNA是一个遵循GPLv2许可的开源Nextflow管道,可在https://github.com/uclahs-cds/metapipeline-DNA获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/705d/12051667/ae5f47bbb78c/nihpp-2024.09.04.611267v4-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/705d/12051667/0ba92660cfce/nihpp-2024.09.04.611267v4-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/705d/12051667/2f546f211171/nihpp-2024.09.04.611267v4-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/705d/12051667/ae5f47bbb78c/nihpp-2024.09.04.611267v4-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/705d/12051667/0ba92660cfce/nihpp-2024.09.04.611267v4-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/705d/12051667/2f546f211171/nihpp-2024.09.04.611267v4-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/705d/12051667/ae5f47bbb78c/nihpp-2024.09.04.611267v4-f0003.jpg

相似文献

1
Metapipeline-DNA: A Comprehensive Germline & Somatic Genomics Nextflow Pipeline.Metapipeline-DNA:一个全面的种系和体细胞基因组学Nextflow流程。
bioRxiv. 2025 Apr 25:2024.09.04.611267. doi: 10.1101/2024.09.04.611267.
2
NFTest: automated testing of Nextflow pipelines.NFTest:用于 Nextflow 管道的自动化测试。
Bioinformatics. 2024 Feb 1;40(2). doi: 10.1093/bioinformatics/btae081.
3
PipeVal: light-weight extensible tool for file validation.PipeVal:轻量级可扩展的文件验证工具。
Bioinformatics. 2024 Feb 1;40(2). doi: 10.1093/bioinformatics/btae079.
4
Bactopia: a Flexible Pipeline for Complete Analysis of Bacterial Genomes.Bactopia:用于细菌基因组全面分析的灵活流程
mSystems. 2020 Aug 4;5(4):e00190-20. doi: 10.1128/mSystems.00190-20.
5
AssemblyQC: a Nextflow pipeline for reproducible reporting of assembly quality.组装质量控制 (AssemblyQC):用于可重复报告组装质量的 Nextflow 管道。
Bioinformatics. 2024 Aug 2;40(8). doi: 10.1093/bioinformatics/btae477.
6
Sarek: A portable workflow for whole-genome sequencing analysis of germline and somatic variants.萨雷克:用于种系和体细胞变异的全基因组测序分析的便携式工作流程。
F1000Res. 2020 Jan 29;9:63. doi: 10.12688/f1000research.16665.2. eCollection 2020.
7
DolphinNext: a distributed data processing platform for high throughput genomics.海豚下一代:一个用于高通量基因组学的分布式数据处理平台。
BMC Genomics. 2020 Apr 19;21(1):310. doi: 10.1186/s12864-020-6714-x.
8
mettannotator: a comprehensive and scalable Nextflow annotation pipeline for prokaryotic assemblies.Mettannotator:一种用于原核生物组装的全面且可扩展的Nextflow注释管道。
Bioinformatics. 2025 Feb 4;41(2). doi: 10.1093/bioinformatics/btaf037.
9
snpQT: flexible, reproducible, and comprehensive quality control and imputation of genomic data.snpQT:用于基因组数据的灵活、可重复和全面的质量控制和 imputation。
F1000Res. 2021 Jul 14;10:567. doi: 10.12688/f1000research.53821.2. eCollection 2021.
10
RatesTools: a Nextflow pipeline for detecting de novo germline mutations in pedigree sequence data.RatesTools:一种用于检测家系序列数据中新生种系突变的Nextflow管道。
Bioinformatics. 2023 Jan 1;39(1). doi: 10.1093/bioinformatics/btac784.

本文引用的文献

1
Small variant benchmark from a complete assembly of X and Y chromosomes.来自X和Y染色体完整组装的小变异基准。
Nat Commun. 2025 Jan 8;16(1):497. doi: 10.1038/s41467-024-55710-z.
2
Comprehensive genome analysis and variant detection at scale using DRAGEN.使用DRAGEN进行大规模的全基因组分析和变异检测。
Nat Biotechnol. 2024 Oct 25. doi: 10.1038/s41587-024-02382-1.
3
MHC Hammer reveals genetic and non-genetic HLA disruption in cancer evolution.MHC Hammer 揭示了癌症进化过程中遗传和非遗传 HLA 破坏。
Nat Genet. 2024 Oct;56(10):2121-2131. doi: 10.1038/s41588-024-01883-8. Epub 2024 Oct 2.
4
Crowd-sourced benchmarking of single-sample tumor subclonal reconstruction.单样本肿瘤亚克隆重建的众包基准测试
Nat Biotechnol. 2025 Apr;43(4):581-592. doi: 10.1038/s41587-024-02250-y. Epub 2024 Jun 11.
5
NFTest: automated testing of Nextflow pipelines.NFTest:用于 Nextflow 管道的自动化测试。
Bioinformatics. 2024 Feb 1;40(2). doi: 10.1093/bioinformatics/btae081.
6
PipeVal: light-weight extensible tool for file validation.PipeVal:轻量级可扩展的文件验证工具。
Bioinformatics. 2024 Feb 1;40(2). doi: 10.1093/bioinformatics/btae079.
7
Insights for precision oncology from the integration of genomic and clinical data of 13,880 tumors from the 100,000 Genomes Cancer Programme.从 10 万基因组癌症计划中 13880 个肿瘤的基因组和临床数据集成中获得的精准肿瘤学见解。
Nat Med. 2024 Jan;30(1):279-289. doi: 10.1038/s41591-023-02682-0. Epub 2024 Jan 11.
8
Association of Mitochondrial DNA Copy Number With Brain MRI Markers and Cognitive Function: A Meta-analysis of Community-Based Cohorts.线粒体 DNA 拷贝数与脑 MRI 标志物和认知功能的关联:基于社区队列的荟萃分析。
Neurology. 2023 May 2;100(18):e1930-e1943. doi: 10.1212/WNL.0000000000207157. Epub 2023 Mar 16.
9
MuSE: A Novel Approach to Mutation Calling with Sample-Specific Error Modeling.MuSE:一种具有样本特异性错误建模的新突变调用方法。
Methods Mol Biol. 2022;2493:21-27. doi: 10.1007/978-1-0716-2293-3_2.
10
The Galaxy platform for accessible, reproducible and collaborative biomedical analyses: 2022 update.Galaxy 平台:用于可访问、可重复和协作的生物医学分析:2022 更新。
Nucleic Acids Res. 2022 Jul 5;50(W1):W345-W351. doi: 10.1093/nar/gkac247.