文献检索文档翻译深度研究
Suppr Zotero 插件Zotero 插件
邀请有礼套餐&价格历史记录

新学期,新优惠

限时优惠:9月1日-9月22日

30天高级会员仅需29元

1天体验卡首发特惠仅需5.99元

了解详情
不再提醒
插件&应用
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
高级版
套餐订阅购买积分包
AI 工具
文献检索文档翻译深度研究
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2025

短读序列数据中的微缺失/插入检测。

Microindel detection in short-read sequence data.

机构信息

Institute for Medical Genetics, Charité-Universitätsmedizin Berlin, 13353 Berlin.

出版信息

Bioinformatics. 2010 Mar 15;26(6):722-9. doi: 10.1093/bioinformatics/btq027. Epub 2010 Feb 9.


DOI:10.1093/bioinformatics/btq027
PMID:20144947
Abstract

MOTIVATION: Several recent studies have demonstrated the effectiveness of resequencing and single nucleotide variant (SNV) detection by deep short-read sequencing platforms. While several reliable algorithms are available for automated SNV detection, the automated detection of microindels in deep short-read data presents a new bioinformatics challenge. RESULTS: We systematically analyzed how the short-read mapping tools MAQ, Bowtie, Burrows-Wheeler alignment tool (BWA), Novoalign and RazerS perform on simulated datasets that contain indels and evaluated how indels affect error rates in SNV detection. We implemented a simple algorithm to compute the equivalent indel region eir, which can be used to process the alignments produced by the mapping tools in order to perform indel calling. Using simulated data that contains indels, we demonstrate that indel detection works well on short-read data: the detection rate for microindels (<4 bp) is >90%. Our study provides insights into systematic errors in SNV detection that is based on ungapped short sequence read alignments. Gapped alignments of short sequence reads can be used to reduce this error and to detect microindels in simulated short-read data. A comparison with microindels automatically identified on the ABI Sanger and Roche 454 platform indicates that microindel detection from short sequence reads identifies both overlapping and distinct indels. CONTACT: peter.krawitz@googlemail.com; peter.robinson@charite.de SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

摘要

动机:最近的几项研究表明,深度短读测序平台在重测序和单核苷酸变体 (SNV) 检测方面非常有效。虽然有几个可靠的算法可用于自动 SNV 检测,但在深度短读数据中自动检测微缺失和微插入则是一个新的生物信息学挑战。

结果:我们系统地分析了 MAQ、Bowtie、Burrows-Wheeler 比对工具 (BWA)、Novoalign 和 RazerS 等短读映射工具在包含缺失和插入的模拟数据集上的性能,并评估了缺失和插入对 SNV 检测错误率的影响。我们实现了一种简单的算法来计算等效插入缺失区域 eir,可用于处理映射工具生成的比对结果,以执行插入缺失调用。使用包含插入缺失的模拟数据,我们证明了插入缺失在短读数据上的检测效果良好:微缺失 (<4 bp) 的检测率>90%。我们的研究提供了基于未加缺口短序列读比对的 SNV 检测系统误差的见解。短序列读的加缺口比对可用于减少这种错误,并检测模拟短读数据中的微缺失。与 ABI Sanger 和 Roche 454 平台自动识别的微缺失的比较表明,短序列读取的微缺失检测可识别重叠和独特的缺失。

联系方式:peter.krawitz@googlemail.com;peter.robinson@charite.de

补充信息:补充数据可在“Bioinformatics”在线获取。

相似文献

[1]
Microindel detection in short-read sequence data.

Bioinformatics. 2010-2-9

[2]
A universal algorithm for de novo decrypting of heterozygous indel sequences: a tool for personalized medicine.

Clin Chim Acta. 2008-3

[3]
Analysis of high-throughput sequencing data.

Methods Mol Biol. 2011

[4]
Correction of sequencing errors in a mixed set of reads.

Bioinformatics. 2010-4-8

[5]
Optimal spliced alignments of short sequence reads.

Bioinformatics. 2008-8-15

[6]
Benchmarking next-generation transcriptome sequencing for functional and evolutionary genomics.

Mol Biol Evol. 2009-8-25

[7]
Reptile: representative tiling for short read error correction.

Bioinformatics. 2010-8-16

[8]
Fast and accurate short read alignment with Burrows-Wheeler transform.

Bioinformatics. 2009-7-15

[9]
EDAR: an efficient error detection and removal algorithm for next generation sequencing data.

J Comput Biol. 2010-11

[10]
Comparative analysis of algorithms for next-generation sequencing read alignment.

Bioinformatics. 2011-8-19

引用本文的文献

[1]
Tracing the evolution of sequencing into the era of genomic medicine.

Nat Rev Genet. 2025-8-15

[2]
Comparative evaluation of SNVs, indels, and structural variations detected with short- and long-read sequencing data.

Hum Genome Var. 2024-4-17

[3]
VarSCAT: A computational tool for sequence context annotations of genomic variants.

PLoS Comput Biol. 2023-8

[4]
Identification of the Mutation in .

Cells. 2022-11-3

[5]
Performance evaluation of pipelines for mapping, variant calling and interval padding, for the analysis of NGS germline panels.

BMC Bioinformatics. 2021-4-28

[6]
Rare and de novo coding variants in chromodomain genes in Chiari I malformation.

Am J Hum Genet. 2021-1-7

[7]
Comparative assessments of indel annotations in healthy and cancer genomes with next-generation sequencing data.

BMC Med Genomics. 2020-11-10

[8]
regulates the action of nitrogen-containing bisphosphonates on bone.

Sci Transl Med. 2020-5-20

[9]
Hypermutator Pseudomonas aeruginosa Exploits Multiple Genetic Pathways To Develop Multidrug Resistance during Long-Term Infections in the Airways of Cystic Fibrosis Patients.

Antimicrob Agents Chemother. 2020-4-21

[10]
UPS-indel: a Universal Positioning System for Indels.

Sci Rep. 2017-10-26

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

推荐工具

医学文档翻译智能文献检索