• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

Bivartect:通过直接读取比较实现准确且节省内存的断点检测。

Bivartect: accurate and memory-saving breakpoint detection by direct read comparison.

机构信息

Department of RNA Biology and Neuroscience, Graduate School of Medicine, Osaka University, Suita, Osaka 565-0871, Japan.

出版信息

Bioinformatics. 2020 May 1;36(9):2725-2730. doi: 10.1093/bioinformatics/btaa059.

DOI:10.1093/bioinformatics/btaa059
PMID:31985791
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7203739/
Abstract

MOTIVATION

Genetic variant calling with high-throughput sequencing data has been recognized as a useful tool for better understanding of disease mechanism and detection of potential off-target sites in genome editing. Since most of the variant calling algorithms rely on initial mapping onto a reference genome and tend to predict many variant candidates, variant calling remains challenging in terms of predicting variants with low false positives.

RESULTS

Here we present Bivartect, a simple yet versatile variant caller based on direct comparison of short sequence reads between normal and mutated samples. Bivartect can detect not only single nucleotide variants but also insertions/deletions, inversions and their complexes. Bivartect achieves high predictive performance with an elaborate memory-saving mechanism, which allows Bivartect to run on a computer with a single node for analyzing small omics data. Tests with simulated benchmark and real genome-editing data indicate that Bivartect was comparable to state-of-the-art variant callers in positive predictive value for detection of single nucleotide variants, even though it yielded a substantially small number of candidates. These results suggest that Bivartect, a reference-free approach, will contribute to the identification of germline mutations as well as off-target sites introduced during genome editing with high accuracy.

AVAILABILITY AND IMPLEMENTATION

Bivartect is implemented in C++ and available along with in silico simulated data at https://github.com/ykat0/bivartect.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

高通量测序数据的遗传变异调用已被认为是更好地理解疾病机制和检测基因组编辑中潜在脱靶位点的有用工具。由于大多数变异调用算法依赖于初始映射到参考基因组,并倾向于预测许多变异候选者,因此在预测低假阳性变异方面,变异调用仍然具有挑战性。

结果

在这里,我们提出了 Bivartect,这是一种基于正常和突变样本之间短序列读取直接比较的简单而通用的变异调用器。Bivartect 不仅可以检测单核苷酸变异,还可以检测插入/缺失、倒位及其复合物。Bivartect 通过精心设计的节省内存的机制实现了高预测性能,允许 Bivartect 在单个节点的计算机上运行,用于分析小型组学数据。使用模拟基准和真实基因组编辑数据的测试表明,Bivartect 在检测单核苷酸变异的阳性预测值方面与最先进的变异调用器相当,尽管它产生的候选者数量要少得多。这些结果表明,Bivartect 是一种无参考的方法,将有助于以高精度识别生殖系突变以及基因组编辑过程中引入的脱靶位点。

可用性和实现

Bivartect 是用 C++编写的,并在 https://github.com/ykat0/bivartect 上提供了虚拟模拟数据。

补充信息

补充数据可在 Bioinformatics 在线获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5872/7203739/78405df8d7a8/btaa059f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5872/7203739/18cf5ab43aa5/btaa059f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5872/7203739/89c5f1bca129/btaa059f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5872/7203739/839e621e1f65/btaa059f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5872/7203739/13af9b7905bf/btaa059f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5872/7203739/78405df8d7a8/btaa059f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5872/7203739/18cf5ab43aa5/btaa059f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5872/7203739/89c5f1bca129/btaa059f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5872/7203739/839e621e1f65/btaa059f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5872/7203739/13af9b7905bf/btaa059f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5872/7203739/78405df8d7a8/btaa059f5.jpg

相似文献

1
Bivartect: accurate and memory-saving breakpoint detection by direct read comparison.Bivartect:通过直接读取比较实现准确且节省内存的断点检测。
Bioinformatics. 2020 May 1;36(9):2725-2730. doi: 10.1093/bioinformatics/btaa059.
2
Calling known variants and identifying new variants while rapidly aligning sequence data.在快速对齐序列数据的同时,调用已知变异体并识别新变异体。
J Dairy Sci. 2019 Apr;102(4):3216-3229. doi: 10.3168/jds.2018-15172. Epub 2019 Feb 14.
3
Discovery of tandem and interspersed segmental duplications using high-throughput sequencing.利用高通量测序发现串联和散在的片段重复。
Bioinformatics. 2019 Oct 15;35(20):3923-3930. doi: 10.1093/bioinformatics/btz237.
4
dv-trio: a family-based variant calling pipeline using DeepVariant.dv-trio:一种基于家系的使用 DeepVariant 的变异calling 流程。
Bioinformatics. 2020 Jun 1;36(11):3549-3551. doi: 10.1093/bioinformatics/btaa116.
5
Whisper: read sorting allows robust mapping of DNA sequencing data.低语:读排序允许对 DNA 测序数据进行稳健的映射。
Bioinformatics. 2019 Jun 1;35(12):2043-2050. doi: 10.1093/bioinformatics/bty927.
6
SNVSniffer: an integrated caller for germline and somatic single-nucleotide and indel mutations.SNVSniffer:一种用于种系和体细胞单核苷酸及插入缺失突变的综合检测工具。
BMC Syst Biol. 2016 Aug 1;10 Suppl 2(Suppl 2):47. doi: 10.1186/s12918-016-0300-5.
7
ReliableGenome: annotation of genomic regions with high/low variant calling concordance.可靠基因组:具有高/低变异检测一致性的基因组区域注释。
Bioinformatics. 2017 Jan 15;33(2):155-160. doi: 10.1093/bioinformatics/btw587. Epub 2016 Sep 7.
8
SiNVICT: ultra-sensitive detection of single nucleotide variants and indels in circulating tumour DNA.SiNVICT:循环肿瘤 DNA 中单核苷酸变异和插入缺失的超灵敏检测。
Bioinformatics. 2017 Jan 1;33(1):26-34. doi: 10.1093/bioinformatics/btw536. Epub 2016 Aug 16.
9
NGSEP3: accurate variant calling across species and sequencing protocols.NGSEP3:跨物种和测序协议的准确变异调用。
Bioinformatics. 2019 Nov 1;35(22):4716-4723. doi: 10.1093/bioinformatics/btz275.
10
Discovery and genotyping of novel sequence insertions in many sequenced individuals.在许多测序个体中发现和基因分型新的序列插入。
Bioinformatics. 2017 Jul 15;33(14):i161-i169. doi: 10.1093/bioinformatics/btx254.

引用本文的文献

1
Detecting gene breakpoints in noisy genome sequences using position-annotated colored de-Bruijn graphs.利用位置注释的有色 de-Bruijn 图检测嘈杂基因组序列中的基因断点。
BMC Bioinformatics. 2023 Jun 5;24(1):235. doi: 10.1186/s12859-023-05371-4.
2
Temporal convolutional network for a Fast DNA mutation detection in breast cancer data.用于乳腺癌数据中快速 DNA 突变检测的时间卷积网络。
PLoS One. 2023 May 25;18(5):e0285981. doi: 10.1371/journal.pone.0285981. eCollection 2023.

本文引用的文献

1
Kevlar: A Mapping-Free Framework for Accurate Discovery of De Novo Variants.凯夫拉尔:一种用于准确发现新生变异的无映射框架。
iScience. 2019 Aug 30;18:28-36. doi: 10.1016/j.isci.2019.07.032. Epub 2019 Jul 23.
2
Strelka2: fast and accurate calling of germline and somatic variants.Strelka2:快速准确地调用种系和体细胞变异。
Nat Methods. 2018 Aug;15(8):591-594. doi: 10.1038/s41592-018-0051-x. Epub 2018 Jul 16.
3
Association mapping from sequencing reads using -mers.基于 -mers 的测序reads 的关联作图。
Elife. 2018 Jun 13;7:e32920. doi: 10.7554/eLife.32920.
4
CRISPR off-target analysis in genetically engineered rats and mice.基因工程大鼠和小鼠中的 CRISPR 脱靶分析。
Nat Methods. 2018 Jul;15(7):512-514. doi: 10.1038/s41592-018-0011-5. Epub 2018 May 21.
5
Mapping-free variant calling using haplotype reconstruction from k-mer frequencies.基于 k- -mer 频率的单倍型重构进行无图谱变异调用。
Bioinformatics. 2018 May 15;34(10):1659-1665. doi: 10.1093/bioinformatics/btx753.
6
FastGT: an alignment-free method for calling common SNVs directly from raw sequencing reads.FastGT:一种从原始测序读段中直接调用常见单核苷酸变异(SNVs)的无需比对方法。
Sci Rep. 2017 May 31;7(1):2537. doi: 10.1038/s41598-017-02487-5.
7
Exome sequencing in the knockin mice generated using the CRISPR/Cas system.利用 CRISPR/Cas 系统生成的基因敲入小鼠的外显子组测序。
Sci Rep. 2016 Oct 4;6:34703. doi: 10.1038/srep34703.
8
VarDict: a novel and versatile variant caller for next-generation sequencing in cancer research.VarDict:一种用于癌症研究中下一代测序的新型多功能变异检测工具。
Nucleic Acids Res. 2016 Jun 20;44(11):e108. doi: 10.1093/nar/gkw227. Epub 2016 Apr 7.
9
Manta: rapid detection of structural variants and indels for germline and cancer sequencing applications.Manta:用于种系和癌症测序应用的结构变异和插入缺失的快速检测。
Bioinformatics. 2016 Apr 15;32(8):1220-2. doi: 10.1093/bioinformatics/btv710. Epub 2015 Dec 8.
10
An integrated map of structural variation in 2,504 human genomes.2504个人类基因组结构变异的整合图谱。
Nature. 2015 Oct 1;526(7571):75-81. doi: 10.1038/nature15394.