• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用 GROM 实现快速基因组变异检测。

Lightning-fast genome variant detection with GROM.

机构信息

Department of Biology, Center for Computational and Integrative Biology, Rutgers University, 315 Penn St, Camden 08102, NJ, USA.

出版信息

Gigascience. 2017 Oct 1;6(10):1-7. doi: 10.1093/gigascience/gix091.

DOI:10.1093/gigascience/gix091
PMID:29048532
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5737730/
Abstract

Current human whole genome sequencing projects produce massive amounts of data, often creating significant computational challenges. Different approaches have been developed for each type of genome variant and method of its detection, necessitating users to run multiple algorithms to find variants. We present Genome Rearrangement OmniMapper (GROM), a novel comprehensive variant detection algorithm accepting aligned read files as input and finding SNVs, indels, structural variants (SVs), and copy number variants (CNVs). We show that GROM outperforms state-of-the-art methods on 7 validated benchmarks using 2 whole genome sequencing (WGS) data sets. Additionally, GROM boasts lightning-fast run times, analyzing a 50× WGS human data set (NA12878) on commonly available computer hardware in 11 minutes, more than an order of magnitude (up to 72 times) faster than tools detecting a similar range of variants. Addressing the needs of big data analysis, GROM combines in 1 algorithm SNV, indel, SV, and CNV detection, providing superior speed, sensitivity, and precision. GROM is also able to detect CNVs, SNVs, and indels in non-paired-read WGS libraries, as well as SNVs and indels in whole exome or RNA sequencing data sets.

摘要

当前的人类全基因组测序项目产生了大量的数据,这常常带来巨大的计算挑战。针对每种类型的基因组变异和检测方法,都开发了不同的方法,这使得用户需要运行多个算法来寻找变异。我们提出了基因组重排全景图(Genome Rearrangement OmniMapper,GROM),这是一种新颖的综合变异检测算法,它接受已对齐的读取文件作为输入,并可发现单核苷酸变异(SNV)、插入缺失(indel)、结构变异(SV)和拷贝数变异(CNV)。我们展示了 GROM 在使用 2 个全基因组测序(WGS)数据集的 7 个经过验证的基准测试中优于最先进的方法。此外,GROM 具有闪电般的运行速度,可在常见的计算机硬件上分析 50×WGS 人类数据集(NA12878),运行时间为 11 分钟,比检测类似变异范围的工具快一个数量级(快 72 倍)以上。为满足大数据分析的需求,GROM 将 SNV、indel、SV 和 CNV 检测组合在 1 个算法中,提供卓越的速度、灵敏度和精度。GROM 还能够检测非配对读取 WGS 文库中的 CNV、SNV 和 indel,以及全外显子或 RNA 测序数据集中的 SNV 和 indel。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/50ef/5737730/b1cad335de7f/gix091fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/50ef/5737730/cf035b6dffe3/gix091fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/50ef/5737730/30ea53d003c0/gix091fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/50ef/5737730/fa38cecad655/gix091fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/50ef/5737730/1efec99b22ca/gix091fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/50ef/5737730/b1cad335de7f/gix091fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/50ef/5737730/cf035b6dffe3/gix091fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/50ef/5737730/30ea53d003c0/gix091fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/50ef/5737730/fa38cecad655/gix091fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/50ef/5737730/1efec99b22ca/gix091fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/50ef/5737730/b1cad335de7f/gix091fig5.jpg

相似文献

1
Lightning-fast genome variant detection with GROM.利用 GROM 实现快速基因组变异检测。
Gigascience. 2017 Oct 1;6(10):1-7. doi: 10.1093/gigascience/gix091.
2
From cytogenetics to cytogenomics: whole-genome sequencing as a first-line test comprehensively captures the diverse spectrum of disease-causing genetic variation underlying intellectual disability.从细胞遗传学到细胞基因组学:全基因组测序作为一线检测手段,全面捕捉到导致智力障碍的遗传变异的多样谱。
Genome Med. 2019 Nov 7;11(1):68. doi: 10.1186/s13073-019-0675-1.
3
GROM-RD: resolving genomic biases to improve read depth detection of copy number variants.GROM-RD:解决基因组偏倚以提高拷贝数变异的读深度检测。
PeerJ. 2015 Mar 17;3:e836. doi: 10.7717/peerj.836. eCollection 2015.
4
Performance of copy number variants detection based on whole-genome sequencing by DNBSEQ platforms.基于 DNBSEQ 平台的全基因组测序的拷贝数变异检测性能。
BMC Bioinformatics. 2020 Nov 11;21(1):518. doi: 10.1186/s12859-020-03859-x.
5
Whole-genome sequencing is more powerful than whole-exome sequencing for detecting exome variants.在检测外显子变异方面,全基因组测序比全外显子测序更强大。
Proc Natl Acad Sci U S A. 2015 Apr 28;112(17):5473-8. doi: 10.1073/pnas.1418631112. Epub 2015 Mar 31.
6
Evaluation of the performance of copy number variant prediction tools for the detection of deletions from whole genome sequencing data.评估拷贝数变异预测工具在检测全基因组测序数据中缺失方面的性能。
J Biomed Inform. 2019 Jun;94:103174. doi: 10.1016/j.jbi.2019.103174. Epub 2019 Apr 6.
7
Quality control and integration of genotypes from two calling pipelines for whole genome sequence data in the Alzheimer's disease sequencing project.全基因组序列数据阿尔茨海默病测序项目中两种调用管道基因型的质量控制和整合。
Genomics. 2019 Jul;111(4):808-818. doi: 10.1016/j.ygeno.2018.05.004. Epub 2018 May 29.
8
VISTA: an integrated framework for structural variant discovery.VISTA:一个用于结构变异发现的集成框架。
Brief Bioinform. 2024 Jul 25;25(5). doi: 10.1093/bib/bbae462.
9
KoVariome: Korean National Standard Reference Variome database of whole genomes with comprehensive SNV, indel, CNV, and SV analyses.KoVariome:韩国全基因组标准参考变异组数据库,包含全面的单核苷酸变异、插入缺失、拷贝数变异和结构变异分析。
Sci Rep. 2018 Apr 4;8(1):5677. doi: 10.1038/s41598-018-23837-x.
10
Comparison of commercially available whole-genome sequencing kits for variant detection in circulating cell-free DNA.比较商用的循环游离 DNA 变异检测全基因组测序试剂盒。
Sci Rep. 2020 Apr 10;10(1):6190. doi: 10.1038/s41598-020-63102-8.

引用本文的文献

1
Structural Variants: Mechanisms, Mapping, and Interpretation in Human Genetics.结构变异:人类遗传学中的机制、定位与解读
Genes (Basel). 2025 Jul 29;16(8):905. doi: 10.3390/genes16080905.
2
Leveraging Whole-Genome Resequencing to Uncover Genetic Diversity and Promote Conservation Strategies for Ruminants in Asia.利用全基因组重测序揭示亚洲反刍动物的遗传多样性并促进保护策略
Animals (Basel). 2025 Mar 13;15(6):831. doi: 10.3390/ani15060831.
3
VISTA: an integrated framework for structural variant discovery.VISTA:一个用于结构变异发现的集成框架。

本文引用的文献

1
A reference data set of 5.4 million phased human variants validated by genetic inheritance from sequencing a three-generation 17-member pedigree.通过对一个包含17名成员的三代家系进行测序,经遗传继承验证的540万个定相人类变异的参考数据集。
Genome Res. 2017 Jan;27(1):157-164. doi: 10.1101/gr.210500.116. Epub 2016 Nov 30.
2
Deep sequencing of 10,000 human genomes.一万个人类基因组的深度测序。
Proc Natl Acad Sci U S A. 2016 Oct 18;113(42):11901-11906. doi: 10.1073/pnas.1613365113. Epub 2016 Oct 4.
3
Evaluating the necessity of PCR duplicate removal from next-generation sequencing data and a comparison of approaches.
Brief Bioinform. 2024 Jul 25;25(5). doi: 10.1093/bib/bbae462.
4
A comprehensive benchmarking of WGS-based deletion structural variant callers.基于 WGS 的缺失结构变异调用器的综合基准测试。
Brief Bioinform. 2022 Jul 18;23(4). doi: 10.1093/bib/bbac221.
5
Chasing breeding footprints through structural variations in Cucumis melo and wild relatives.追寻甜瓜及其野生近缘属植物结构变异中的繁殖足迹。
G3 (Bethesda). 2021 Jan 18;11(1). doi: 10.1093/g3journal/jkaa038.
6
Structural variant identification and characterization.结构变异的识别与表征
Chromosome Res. 2020 Mar;28(1):31-47. doi: 10.1007/s10577-019-09623-z. Epub 2020 Jan 6.
7
Structural variants in 3000 rice genomes.3000 份水稻基因组中的结构变异。
Genome Res. 2019 May;29(5):870-880. doi: 10.1101/gr.241240.118. Epub 2019 Apr 16.
8
ARIADNA: machine learning method for ancient DNA variant discovery.ARIADNA:古 DNA 变异发现的机器学习方法。
DNA Res. 2018 Dec 1;25(6):619-627. doi: 10.1093/dnares/dsy029.
评估从下一代测序数据中去除PCR重复的必要性及方法比较。
BMC Bioinformatics. 2016 Jul 25;17 Suppl 7(Suppl 7):239. doi: 10.1186/s12859-016-1097-3.
4
Long-read sequencing and de novo assembly of a Chinese genome.长读测序和中国基因组的从头组装。
Nat Commun. 2016 Jun 30;7:12065. doi: 10.1038/ncomms12065.
5
China's bid to be a DNA superpower.中国力争成为DNA超级大国。
Nature. 2016 Jun 23;534(7608):462-3. doi: 10.1038/534462a.
6
Extensive sequencing of seven human genomes to characterize benchmark reference materials.对七个人类基因组进行广泛测序以表征基准参考材料。
Sci Data. 2016 Jun 7;3:160025. doi: 10.1038/sdata.2016.25.
7
svclassify: a method to establish benchmark structural variant calls.svclassify:一种建立基准结构变异调用的方法。
BMC Genomics. 2016 Jan 16;17:64. doi: 10.1186/s12864-016-2366-2.
8
Impact of GSTM1, GSTT1 and GSTP1 gene polymorphism and risk of ARV-associated hepatotoxicity in HIV-infected individuals and its modulation.谷胱甘肽S-转移酶M1、谷胱甘肽S-转移酶T1和谷胱甘肽S-转移酶P1基因多态性对HIV感染个体抗逆转录病毒药物相关肝毒性风险的影响及其调节作用
Pharmacogenomics J. 2017 Jan;17(1):53-60. doi: 10.1038/tpj.2015.88. Epub 2015 Dec 15.
9
Manta: rapid detection of structural variants and indels for germline and cancer sequencing applications.Manta:用于种系和癌症测序应用的结构变异和插入缺失的快速检测。
Bioinformatics. 2016 Apr 15;32(8):1220-2. doi: 10.1093/bioinformatics/btv710. Epub 2015 Dec 8.
10
SpeedSeq: ultra-fast personal genome analysis and interpretation.SpeedSeq:超快速个人基因组分析与解读
Nat Methods. 2015 Oct;12(10):966-8. doi: 10.1038/nmeth.3505. Epub 2015 Aug 10.