• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

SeqMule:用于分析人类外显子组/基因组测序数据的自动化流程

SeqMule: automated pipeline for analysis of human exome/genome sequencing data.

作者信息

Guo Yunfei, Ding Xiaolei, Shen Yufeng, Lyon Gholson J, Wang Kai

机构信息

Zilkha Neurogenetic Institute, University of Southern California, Los Angeles, CA 90033, USA.

Department of Preventive Medicine, Keck School of Medicine, University of Southern California, Los Angeles, CA 90032, USA.

出版信息

Sci Rep. 2015 Sep 18;5:14283. doi: 10.1038/srep14283.

DOI:10.1038/srep14283
PMID:26381817
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4585643/
Abstract

Next-generation sequencing (NGS) technology has greatly helped us identify disease-contributory variants for Mendelian diseases. However, users are often faced with issues such as software compatibility, complicated configuration, and no access to high-performance computing facility. Discrepancies exist among aligners and variant callers. We developed a computational pipeline, SeqMule, to perform automated variant calling from NGS data on human genomes and exomes. SeqMule integrates computational-cluster-free parallelization capability built on top of the variant callers, and facilitates normalization/intersection of variant calls to generate consensus set with high confidence. SeqMule integrates 5 alignment tools, 5 variant calling algorithms and accepts various combinations all by one-line command, therefore allowing highly flexible yet fully automated variant calling. In a modern machine (2 Intel Xeon X5650 CPUs, 48 GB memory), when fast turn-around is needed, SeqMule generates annotated VCF files in a day from a 30X whole-genome sequencing data set; when more accurate calling is needed, SeqMule generates consensus call set that improves over single callers, as measured by both Mendelian error rate and consistency. SeqMule supports Sun Grid Engine for parallel processing, offers turn-key solution for deployment on Amazon Web Services, allows quality check, Mendelian error check, consistency evaluation, HTML-based reports. SeqMule is available at http://seqmule.openbioinformatics.org.

摘要

下一代测序(NGS)技术极大地帮助我们识别孟德尔疾病的致病变异。然而,用户经常面临软件兼容性、配置复杂以及无法使用高性能计算设备等问题。比对器和变异检测工具之间存在差异。我们开发了一个计算流程SeqMule,用于对人类基因组和外显子组的NGS数据进行自动变异检测。SeqMule整合了基于变异检测工具的无计算集群并行化能力,并促进变异检测的标准化/交叉分析,以生成高可信度的一致性集合。SeqMule整合了5种比对工具、5种变异检测算法,并且通过一行命令接受各种组合,因此允许高度灵活但完全自动化的变异检测。在一台现代机器(2个英特尔至强X5650 CPU,48GB内存)上,当需要快速周转时,SeqMule能在一天内从30X全基因组测序数据集中生成注释的VCF文件;当需要更准确的检测时,SeqMule生成的一致性检测集合在孟德尔错误率和一致性方面均优于单个检测工具。SeqMule支持使用Sun Grid Engine进行并行处理,为在亚马逊网络服务上的部署提供一站式解决方案,允许进行质量检查、孟德尔错误检查、一致性评估以及基于HTML的报告。SeqMule可在http://seqmule.openbioinformatics.org获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ea01/4585643/02e20636b136/srep14283-f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ea01/4585643/9e28e8f28da3/srep14283-f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ea01/4585643/95b9b3205341/srep14283-f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ea01/4585643/a1693201ea94/srep14283-f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ea01/4585643/dd364aaa4bc4/srep14283-f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ea01/4585643/d8480e06e0c1/srep14283-f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ea01/4585643/02e20636b136/srep14283-f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ea01/4585643/9e28e8f28da3/srep14283-f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ea01/4585643/95b9b3205341/srep14283-f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ea01/4585643/a1693201ea94/srep14283-f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ea01/4585643/dd364aaa4bc4/srep14283-f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ea01/4585643/d8480e06e0c1/srep14283-f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ea01/4585643/02e20636b136/srep14283-f6.jpg

相似文献

1
SeqMule: automated pipeline for analysis of human exome/genome sequencing data.SeqMule:用于分析人类外显子组/基因组测序数据的自动化流程
Sci Rep. 2015 Sep 18;5:14283. doi: 10.1038/srep14283.
2
SNooPer: a machine learning-based method for somatic variant identification from low-pass next-generation sequencing.SNooPer:一种基于机器学习从低深度下一代测序中识别体细胞变异的方法。
BMC Genomics. 2016 Nov 14;17(1):912. doi: 10.1186/s12864-016-3281-2.
3
Challenges in exome analysis by LifeScope and its alternative computational pipelines.LifeScope及其替代计算流程在全外显子组分析中的挑战。
BMC Res Notes. 2015 Sep 7;8:421. doi: 10.1186/s13104-015-1385-4.
4
Fastq2vcf: a concise and transparent pipeline for whole-exome sequencing data analyses.Fastq2vcf:用于全外显子组测序数据分析的简洁且透明的流程
BMC Res Notes. 2015 Mar 8;8:72. doi: 10.1186/s13104-015-1027-x.
5
Impact of post-alignment processing in variant discovery from whole exome data.全外显子数据变异发现中比对后处理的影响
BMC Bioinformatics. 2016 Oct 3;17(1):403. doi: 10.1186/s12859-016-1279-z.
6
Variant callers for next-generation sequencing data: a comparison study.下一代测序数据的变异调用者:一项比较研究。
PLoS One. 2013 Sep 27;8(9):e75619. doi: 10.1371/journal.pone.0075619. eCollection 2013.
7
From Wet-Lab to Variations: Concordance and Speed of Bioinformatics Pipelines for Whole Genome and Whole Exome Sequencing.从湿实验室到变异:全基因组和全外显子组测序的生物信息学流程的一致性和速度
Hum Mutat. 2016 Dec;37(12):1263-1271. doi: 10.1002/humu.23114. Epub 2016 Sep 26.
8
CoVaCS: a consensus variant calling system.CoVaCS:一个共识变异调用系统。
BMC Genomics. 2018 Feb 5;19(1):120. doi: 10.1186/s12864-018-4508-1.
9
An analytical workflow for accurate variant discovery in highly divergent regions.一种用于在高度分化区域进行准确变异发现的分析流程。
BMC Genomics. 2016 Sep 2;17(1):703. doi: 10.1186/s12864-016-3045-z.
10
VariantMetaCaller: automated fusion of variant calling pipelines for quantitative, precision-based filtering.变异元调用器:用于基于定量、精确性筛选的变异调用流程的自动融合。
BMC Genomics. 2015 Oct 28;16:875. doi: 10.1186/s12864-015-2050-y.

引用本文的文献

1
MOLGENIS VIP: an end-to-end DNA variant interpretation pipeline for research and diagnostics configurable to support rapid implementation of new methods.MOLGENIS VIP:一个用于研究和诊断的端到端DNA变异解读流程,可配置以支持新方法的快速实施。
NAR Genom Bioinform. 2025 Jun 23;7(2):lqaf087. doi: 10.1093/nargab/lqaf087. eCollection 2025 Jun.
2
Whole-exome sequencing to identify causative variants in juvenile sudden cardiac death.全外显子组测序鉴定青少年心源性猝死的致病突变。
Hum Genomics. 2024 Sep 16;18(1):102. doi: 10.1186/s40246-024-00657-x.
3
Focal adhesion kinase-YAP signaling axis drives drug-tolerant persister cells and residual disease in lung cancer.

本文引用的文献

1
Exome sequencing on malignant meningiomas identified mutations in neurofibromatosis type 2 (NF2) and meningioma 1 (MN1) genes.对恶性脑膜瘤进行外显子组测序,发现了2型神经纤维瘤病(NF2)和脑膜瘤1(MN1)基因的突变。
Discov Med. 2014 Dec;18(101):301-311.
2
"Genotype-first" approaches on a curious case of idiopathic progressive cognitive decline.针对一例特发性进行性认知衰退的奇特病例的“基因型优先”方法。
BMC Med Genomics. 2014 Dec 3;7:66. doi: 10.1186/s12920-014-0066-9.
3
Long-range PCR in next-generation sequencing: comparison of six enzymes and evaluation on the MiSeq sequencer.
粘着斑激酶-YAP信号轴驱动肺癌中的药物耐受持久性细胞和残留疾病。
Nat Commun. 2024 May 3;15(1):3741. doi: 10.1038/s41467-024-47423-0.
4
Exploring Regorafenib Responsiveness and Uncovering Molecular Mechanisms in Recurrent Glioblastoma Tumors through Longitudinal In Vitro Sampling.通过纵向体外采样探索复发性胶质母细胞瘤肿瘤对瑞戈非尼的反应性及揭示其分子机制。
Cells. 2024 Mar 11;13(6):487. doi: 10.3390/cells13060487.
5
Kuura-An automated workflow for analyzing WES and WGS data.Kuura—一种用于分析 WES 和 WGS 数据的自动化工作流程。
PLoS One. 2024 Jan 18;19(1):e0296785. doi: 10.1371/journal.pone.0296785. eCollection 2024.
6
Contribution of uniparental disomy to fetal growth restriction: a whole-exome sequencing series in a prenatal setting.单亲二体性对胎儿生长受限的影响:产前全外显子组测序系列研究。
Sci Rep. 2024 Jan 2;14(1):238. doi: 10.1038/s41598-023-50584-5.
7
Innovative Approach to Isolate and Characterize Glioblastoma Circulating Tumor Cells and Correlation with Tumor Mutational Status.创新方法分离和鉴定胶质母细胞瘤循环肿瘤细胞及其与肿瘤突变状态的相关性。
Int J Mol Sci. 2023 Jun 14;24(12):10147. doi: 10.3390/ijms241210147.
8
Identification of New Potential Prognostic and Predictive Markers in High-Grade Osteosarcoma Using Whole Exome Sequencing.利用全外显子测序鉴定高级别骨肉瘤中的新潜在预后和预测标志物。
Int J Mol Sci. 2023 Jun 13;24(12):10086. doi: 10.3390/ijms241210086.
9
Whole Animal Genome Sequencing: user-friendly, rapid, containerized pipelines for processing, variant discovery, and annotation of short-read whole genome sequencing data.全基因组测序:用户友好、快速、集装箱化的流程,用于处理、发现变体和注释短读长全基因组测序数据。
G3 (Bethesda). 2023 Aug 9;13(8). doi: 10.1093/g3journal/jkad117.
10
Resources and tools for rare disease variant interpretation.罕见病变异解读的资源与工具。
Front Mol Biosci. 2023 May 10;10:1169109. doi: 10.3389/fmolb.2023.1169109. eCollection 2023.
新一代测序中的长片段PCR:六种酶的比较及在MiSeq测序仪上的评估
Sci Rep. 2014 Jul 18;4:5737. doi: 10.1038/srep05737.
4
DeNovoGear: de novo indel and point mutation discovery and phasing.DeNovoGear:从头缺失和点突变发现及相位分析。
Nat Methods. 2013 Oct;10(10):985-7. doi: 10.1038/nmeth.2611. Epub 2013 Aug 25.
5
Harnessing virtual machines to simplify next-generation DNA sequencing analysis.利用虚拟机简化下一代 DNA 测序分析。
Bioinformatics. 2013 Sep 1;29(17):2075-83. doi: 10.1093/bioinformatics/btt352. Epub 2013 Jun 20.
6
Low concordance of multiple variant-calling pipelines: practical implications for exome and genome sequencing.多种变异calling 管道一致性低:外显子组和基因组测序的实际影响。
Genome Med. 2013 Mar 27;5(3):28. doi: 10.1186/gm432. eCollection 2013.
7
Rare variant detection using family-based sequencing analysis.基于家系测序分析的罕见变异检测。
Proc Natl Acad Sci U S A. 2013 Mar 5;110(10):3985-90. doi: 10.1073/pnas.1222158110. Epub 2013 Feb 20.
8
A survey of tools for variant analysis of next-generation genome sequencing data.下一代基因组测序数据变异分析工具综述。
Brief Bioinform. 2014 Mar;15(2):256-78. doi: 10.1093/bib/bbs086. Epub 2013 Jan 21.
9
An integrated map of genetic variation from 1,092 human genomes.1092 个人类基因组遗传变异的综合图谱。
Nature. 2012 Nov 1;491(7422):56-65. doi: 10.1038/nature11632.
10
A likelihood-based framework for variant calling and de novo mutation detection in families.基于可能性的框架,用于家族中的变异调用和从头突变检测。
PLoS Genet. 2012;8(10):e1002944. doi: 10.1371/journal.pgen.1002944. Epub 2012 Oct 4.