• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

短读序列数据中的微缺失/插入检测。

Microindel detection in short-read sequence data.

机构信息

Institute for Medical Genetics, Charité-Universitätsmedizin Berlin, 13353 Berlin.

出版信息

Bioinformatics. 2010 Mar 15;26(6):722-9. doi: 10.1093/bioinformatics/btq027. Epub 2010 Feb 9.

DOI:10.1093/bioinformatics/btq027
PMID:20144947
Abstract

MOTIVATION

Several recent studies have demonstrated the effectiveness of resequencing and single nucleotide variant (SNV) detection by deep short-read sequencing platforms. While several reliable algorithms are available for automated SNV detection, the automated detection of microindels in deep short-read data presents a new bioinformatics challenge.

RESULTS

We systematically analyzed how the short-read mapping tools MAQ, Bowtie, Burrows-Wheeler alignment tool (BWA), Novoalign and RazerS perform on simulated datasets that contain indels and evaluated how indels affect error rates in SNV detection. We implemented a simple algorithm to compute the equivalent indel region eir, which can be used to process the alignments produced by the mapping tools in order to perform indel calling. Using simulated data that contains indels, we demonstrate that indel detection works well on short-read data: the detection rate for microindels (<4 bp) is >90%. Our study provides insights into systematic errors in SNV detection that is based on ungapped short sequence read alignments. Gapped alignments of short sequence reads can be used to reduce this error and to detect microindels in simulated short-read data. A comparison with microindels automatically identified on the ABI Sanger and Roche 454 platform indicates that microindel detection from short sequence reads identifies both overlapping and distinct indels.

CONTACT

peter.krawitz@googlemail.com; peter.robinson@charite.de

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

最近的几项研究表明,深度短读测序平台在重测序和单核苷酸变体 (SNV) 检测方面非常有效。虽然有几个可靠的算法可用于自动 SNV 检测,但在深度短读数据中自动检测微缺失和微插入则是一个新的生物信息学挑战。

结果

我们系统地分析了 MAQ、Bowtie、Burrows-Wheeler 比对工具 (BWA)、Novoalign 和 RazerS 等短读映射工具在包含缺失和插入的模拟数据集上的性能,并评估了缺失和插入对 SNV 检测错误率的影响。我们实现了一种简单的算法来计算等效插入缺失区域 eir,可用于处理映射工具生成的比对结果,以执行插入缺失调用。使用包含插入缺失的模拟数据,我们证明了插入缺失在短读数据上的检测效果良好:微缺失 (<4 bp) 的检测率>90%。我们的研究提供了基于未加缺口短序列读比对的 SNV 检测系统误差的见解。短序列读的加缺口比对可用于减少这种错误,并检测模拟短读数据中的微缺失。与 ABI Sanger 和 Roche 454 平台自动识别的微缺失的比较表明,短序列读取的微缺失检测可识别重叠和独特的缺失。

联系方式

peter.krawitz@googlemail.com;peter.robinson@charite.de

补充信息

补充数据可在“Bioinformatics”在线获取。

相似文献

1
Microindel detection in short-read sequence data.短读序列数据中的微缺失/插入检测。
Bioinformatics. 2010 Mar 15;26(6):722-9. doi: 10.1093/bioinformatics/btq027. Epub 2010 Feb 9.
2
A universal algorithm for de novo decrypting of heterozygous indel sequences: a tool for personalized medicine.一种用于从头解密杂合插入缺失序列的通用算法:个性化医疗的工具。
Clin Chim Acta. 2008 Mar;389(1-2):7-13. doi: 10.1016/j.cca.2007.11.011. Epub 2007 Nov 23.
3
Analysis of high-throughput sequencing data.高通量测序数据的分析
Methods Mol Biol. 2011;678:1-11. doi: 10.1007/978-1-60761-682-5_1.
4
Correction of sequencing errors in a mixed set of reads.纠正混合读取集中的测序错误。
Bioinformatics. 2010 May 15;26(10):1284-90. doi: 10.1093/bioinformatics/btq151. Epub 2010 Apr 8.
5
Optimal spliced alignments of short sequence reads.短序列 reads 的最优剪接比对。
Bioinformatics. 2008 Aug 15;24(16):i174-80. doi: 10.1093/bioinformatics/btn300.
6
Benchmarking next-generation transcriptome sequencing for functional and evolutionary genomics.下一代转录组测序在功能和进化基因组学中的基准测试。
Mol Biol Evol. 2009 Dec;26(12):2731-44. doi: 10.1093/molbev/msp188. Epub 2009 Aug 25.
7
Reptile: representative tiling for short read error correction.爬行动物:简称短读错误纠正的代表性平铺。
Bioinformatics. 2010 Oct 15;26(20):2526-33. doi: 10.1093/bioinformatics/btq468. Epub 2010 Aug 16.
8
Fast and accurate short read alignment with Burrows-Wheeler transform.使用Burrows-Wheeler变换进行快速准确的短读比对。
Bioinformatics. 2009 Jul 15;25(14):1754-60. doi: 10.1093/bioinformatics/btp324. Epub 2009 May 18.
9
EDAR: an efficient error detection and removal algorithm for next generation sequencing data.EDAR:一种用于下一代测序数据的高效错误检测与去除算法。
J Comput Biol. 2010 Nov;17(11):1549-60. doi: 10.1089/cmb.2010.0127. Epub 2010 Oct 25.
10
Comparative analysis of algorithms for next-generation sequencing read alignment.下一代测序读段比对算法的比较分析。
Bioinformatics. 2011 Oct 15;27(20):2790-6. doi: 10.1093/bioinformatics/btr477. Epub 2011 Aug 19.

引用本文的文献

1
Tracing the evolution of sequencing into the era of genomic medicine.追溯测序技术在基因组医学时代的发展历程。
Nat Rev Genet. 2025 Aug 15. doi: 10.1038/s41576-025-00884-5.
2
Comparative evaluation of SNVs, indels, and structural variations detected with short- and long-read sequencing data.利用短读长和长读长测序数据检测到的单核苷酸变异(SNV)、插入缺失(indel)和结构变异的比较评估。
Hum Genome Var. 2024 Apr 17;11(1):18. doi: 10.1038/s41439-024-00276-x.
3
VarSCAT: A computational tool for sequence context annotations of genomic variants.VarSCAT:一个用于基因组变异序列上下文注释的计算工具。
PLoS Comput Biol. 2023 Aug 11;19(8):e1010727. doi: 10.1371/journal.pcbi.1010727. eCollection 2023 Aug.
4
Identification of the Mutation in .鉴定. 中的突变。
Cells. 2022 Nov 3;11(21):3484. doi: 10.3390/cells11213484.
5
Performance evaluation of pipelines for mapping, variant calling and interval padding, for the analysis of NGS germline panels.用于分析NGS种系基因检测板的映射、变异位点检测和区间填充流程的性能评估。
BMC Bioinformatics. 2021 Apr 28;22(1):218. doi: 10.1186/s12859-021-04144-1.
6
Rare and de novo coding variants in chromodomain genes in Chiari I malformation.Chiari I 畸形中染色质结构域基因的罕见和新生编码变异。
Am J Hum Genet. 2021 Jan 7;108(1):100-114. doi: 10.1016/j.ajhg.2020.12.001. Epub 2020 Dec 21.
7
Comparative assessments of indel annotations in healthy and cancer genomes with next-generation sequencing data.基于下一代测序数据的健康和癌症基因组中插入缺失注释的比较评估。
BMC Med Genomics. 2020 Nov 10;13(1):170. doi: 10.1186/s12920-020-00818-6.
8
regulates the action of nitrogen-containing bisphosphonates on bone.调节含氮双膦酸盐对骨骼的作用。
Sci Transl Med. 2020 May 20;12(544). doi: 10.1126/scitranslmed.aav9166.
9
Hypermutator Pseudomonas aeruginosa Exploits Multiple Genetic Pathways To Develop Multidrug Resistance during Long-Term Infections in the Airways of Cystic Fibrosis Patients.高突变铜绿假单胞菌在囊性纤维化患者气道中长期感染期间利用多种遗传途径发展出多药耐药性。
Antimicrob Agents Chemother. 2020 Apr 21;64(5). doi: 10.1128/AAC.02142-19.
10
UPS-indel: a Universal Positioning System for Indels.UPS-indel:一种用于插入缺失变异的通用定位系统。
Sci Rep. 2017 Oct 26;7(1):14106. doi: 10.1038/s41598-017-14400-1.