• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

BatAlign:一种用于测序读段精确比对的增量方法。

BatAlign: an incremental method for accurate alignment of sequencing reads.

作者信息

Lim Jing-Quan, Tennakoon Chandana, Guan Peiyong, Sung Wing-Kin

机构信息

Department of Computer Science, National University of Singapore, Singapore 117417 Laboratory of Cancer Epigenome, Division of Medical Sciences, National Cancer Centre Singapore, Singapore 169610.

Department of Computer Science, National University of Singapore, Singapore 117417 NUS Graduate School for Integrative Sciences and Engineering, (CeLS), #05-01, 28 Medical Drive, Singapore 117456 Department of Computational and Systems Biology, Genome Institute of Singapore, Singapore 138672 UAE University, PO Box 17551, Al Ain, UAE.

出版信息

Nucleic Acids Res. 2015 Sep 18;43(16):e107. doi: 10.1093/nar/gkv533. Epub 2015 Jul 13.

DOI:10.1093/nar/gkv533
PMID:26170239
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4652746/
Abstract

Structural variations (SVs) play a crucial role in genetic diversity. However, the alignments of reads near/across SVs are made inaccurate by the presence of polymorphisms. BatAlign is an algorithm that integrated two strategies called 'Reverse-Alignment' and 'Deep-Scan' to improve the accuracy of read-alignment. In our experiments, BatAlign was able to obtain the highest F-measures in read-alignments on mismatch-aberrant, indel-aberrant, concordantly/discordantly paired and SV-spanning data sets. On real data, the alignments of BatAlign were able to recover 4.3% more PCR-validated SVs with 73.3% less callings. These suggest BatAlign to be effective in detecting SVs and other polymorphic-variants accurately using high-throughput data. BatAlign is publicly available at https://goo.gl/a6phxB.

摘要

结构变异(SVs)在遗传多样性中起着至关重要的作用。然而,由于多态性的存在,SVs附近/跨越SVs的 reads 的比对变得不准确。BatAlign是一种整合了“反向比对”和“深度扫描”这两种策略的算法,以提高 reads 比对的准确性。在我们的实验中,BatAlign在错配异常、插入缺失异常、一致/不一致配对和跨越SVs的数据集的 reads 比对中能够获得最高的F值。在真实数据上,BatAlign的比对能够以少73.3%的调用次数多找回4.3%经PCR验证的SVs。这些表明BatAlign在使用高通量数据准确检测SVs和其他多态性变异方面是有效的。BatAlign可在https://goo.gl/a6phxB上公开获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2307/4652746/18bcaaae52a5/gkv533fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2307/4652746/a43add81e150/gkv533fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2307/4652746/6272347d9233/gkv533fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2307/4652746/aba79b4ada40/gkv533fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2307/4652746/f06e2f2c236c/gkv533fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2307/4652746/ced88e243d52/gkv533fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2307/4652746/18bcaaae52a5/gkv533fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2307/4652746/a43add81e150/gkv533fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2307/4652746/6272347d9233/gkv533fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2307/4652746/aba79b4ada40/gkv533fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2307/4652746/f06e2f2c236c/gkv533fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2307/4652746/ced88e243d52/gkv533fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2307/4652746/18bcaaae52a5/gkv533fig6.jpg

相似文献

1
BatAlign: an incremental method for accurate alignment of sequencing reads.BatAlign:一种用于测序读段精确比对的增量方法。
Nucleic Acids Res. 2015 Sep 18;43(16):e107. doi: 10.1093/nar/gkv533. Epub 2015 Jul 13.
2
PRISM: pair-read informed split-read mapping for base-pair level detection of insertion, deletion and structural variants.PRISM:基于双读信息的分读比对算法,用于检测插入、缺失和结构变异的碱基对水平。
Bioinformatics. 2012 Oct 15;28(20):2576-83. doi: 10.1093/bioinformatics/bts484. Epub 2012 Jul 31.
3
Robust and exact structural variation detection with paired-end and soft-clipped alignments: SoftSV compared with eight algorithms.利用双端和软剪切比对进行稳健且精确的结构变异检测:SoftSV与八种算法的比较
Brief Bioinform. 2016 Jan;17(1):51-62. doi: 10.1093/bib/bbv028. Epub 2015 May 20.
4
BIMA V3: an aligner customized for mate pair library sequencing.BIMA V3:一种专用于 mate pair 文库测序的接头。
Bioinformatics. 2014 Jun 1;30(11):1627-9. doi: 10.1093/bioinformatics/btu078. Epub 2014 Feb 12.
5
Detection of structural variants involving repetitive regions in the reference genome.检测参考基因组中涉及重复区域的结构变异。
J Comput Biol. 2014 Mar;21(3):219-33. doi: 10.1089/cmb.2013.0129. Epub 2014 Feb 19.
6
HQAlign: aligning nanopore reads for SV detection using current-level modeling.HQAlign:使用电流水平建模对齐纳米孔读取以进行 SV 检测。
Bioinformatics. 2023 Oct 3;39(10). doi: 10.1093/bioinformatics/btad580.
7
Comparison of multiple algorithms to reliably detect structural variants in pears.比较多种算法以可靠地检测梨中的结构变异。
BMC Genomics. 2020 Jan 20;21(1):61. doi: 10.1186/s12864-020-6455-x.
8
SRPRISM (Single Read Paired Read Indel Substitution Minimizer): an efficient aligner for assemblies with explicit guarantees.SRPRISM(单读配对读插入缺失替换最小化器):具有明确保证的组装的高效对齐器。
Gigascience. 2020 Apr 1;9(4). doi: 10.1093/gigascience/giaa023.
9
Whole-genome sequencing with long reads reveals complex structure and origin of structural variation in human genetic variations and somatic mutations in cancer.全基因组测序与长读长揭示了人类遗传变异和癌症体细胞突变中结构变异的复杂结构和起源。
Genome Med. 2021 Apr 29;13(1):65. doi: 10.1186/s13073-021-00883-1.
10
Detection and visualization of complex structural variants from long reads.从长读中检测和可视化复杂结构变体。
BMC Bioinformatics. 2018 Dec 21;19(Suppl 20):508. doi: 10.1186/s12859-018-2539-x.

引用本文的文献

1
Benchmarking DNA methylation analysis of 14 alignment algorithms for whole genome bisulfite sequencing in mammals.哺乳动物全基因组亚硫酸氢盐测序的14种比对算法的DNA甲基化分析基准测试
Comput Struct Biotechnol J. 2022 Aug 27;20:4704-4716. doi: 10.1016/j.csbj.2022.08.051. eCollection 2022.
2
An integrated package for bisulfite DNA methylation data analysis with Indel-sensitive mapping.用于具有插入缺失敏感映射的亚硫酸氢盐 DNA 甲基化数据分析的集成包。
BMC Bioinformatics. 2019 Jan 22;20(1):47. doi: 10.1186/s12859-018-2593-4.
3
Performance evaluation method for read mapping tool in clinical panel sequencing.

本文引用的文献

1
Decoding complex patterns of genomic rearrangement in hepatocellular carcinoma.解析肝细胞癌中基因组重排的复杂模式
Genomics. 2014 Feb-Mar;103(2-3):189-203. doi: 10.1016/j.ygeno.2014.01.003. Epub 2014 Jan 21.
2
SSW library: an SIMD Smith-Waterman C/C++ library for use in genomic applications.SSW 库:一个用于基因组应用的 SIMD Smith-Waterman C/C++ 库。
PLoS One. 2013 Dec 4;8(12):e82138. doi: 10.1371/journal.pone.0082138. eCollection 2013.
3
RSVSim: an R/Bioconductor package for the simulation of structural variations.
临床Panel测序中读段比对工具的性能评估方法
Genes Genomics. 2018;40(2):189-197. doi: 10.1007/s13258-017-0621-9. Epub 2017 Nov 9.
4
AlignerBoost: A Generalized Software Toolkit for Boosting Next-Gen Sequencing Mapping Accuracy Using a Bayesian-Based Mapping Quality Framework.AlignerBoost:一种基于贝叶斯映射质量框架提高下一代测序映射准确性的通用软件工具包。
PLoS Comput Biol. 2016 Oct 5;12(10):e1005096. doi: 10.1371/journal.pcbi.1005096. eCollection 2016 Oct.
5
Suitability of Different Mapping Algorithms for Genome-Wide Polymorphism Scans with Pool-Seq Data.不同映射算法对Pool-Seq数据全基因组多态性扫描的适用性
G3 (Bethesda). 2016 Nov 8;6(11):3507-3515. doi: 10.1534/g3.116.034488.
RSVSim:一个用于模拟结构变异的 R/Bioconductor 包。
Bioinformatics. 2013 Jul 1;29(13):1679-81. doi: 10.1093/bioinformatics/btt198. Epub 2013 Apr 25.
4
Fast and accurate read mapping with approximate seeds and multiple backtracking.使用近似种子和多次回溯实现快速准确的读映射。
Nucleic Acids Res. 2013 Apr;41(7):e78. doi: 10.1093/nar/gkt005. Epub 2013 Jan 28.
5
Long read alignment based on maximal exact match seeds.基于最大精确匹配种子的长读比对。
Bioinformatics. 2012 Sep 15;28(18):i318-i324. doi: 10.1093/bioinformatics/bts414.
6
YAHA: fast and flexible long-read alignment with optimal breakpoint detection.YAHA:快速灵活的长读比对,具有最佳断点检测功能。
Bioinformatics. 2012 Oct 1;28(19):2417-24. doi: 10.1093/bioinformatics/bts456. Epub 2012 Jul 24.
7
Fast and accurate read alignment for resequencing.快速准确的重测序读对齐。
Bioinformatics. 2012 Sep 15;28(18):2366-73. doi: 10.1093/bioinformatics/bts450. Epub 2012 Jul 18.
8
BatMis: a fast algorithm for k-mismatch mapping.BatMis:一种快速的 k-错配映射算法。
Bioinformatics. 2012 Aug 15;28(16):2122-8. doi: 10.1093/bioinformatics/bts339. Epub 2012 Jun 10.
9
ART: a next-generation sequencing read simulator.ART:一种新一代测序读模拟程序。
Bioinformatics. 2012 Feb 15;28(4):593-4. doi: 10.1093/bioinformatics/btr708. Epub 2011 Dec 23.
10
Identification of genomic indels and structural variations using split reads.利用拆分读识别基因组插入/缺失和结构变异。
BMC Genomics. 2011 Jul 25;12:375. doi: 10.1186/1471-2164-12-375.