• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过多次比对纠正短读中的错误。

Correcting errors in short reads by multiple alignments.

机构信息

Department of Computer Science, Helsinki Institute for Information Technology HIIT, University of Helsinki, Helsinki, Finland.

出版信息

Bioinformatics. 2011 Jun 1;27(11):1455-61. doi: 10.1093/bioinformatics/btr170. Epub 2011 Apr 5.

DOI:10.1093/bioinformatics/btr170
PMID:21471014
Abstract

MOTIVATION

Current sequencing technologies produce a large number of erroneous reads. The sequencing errors present a major challenge in utilizing the data in de novo sequencing projects as assemblers have difficulties in dealing with errors.

RESULTS

We present Coral which corrects sequencing errors by forming multiple alignments. Unlike previous tools for error correction, Coral can utilize also bases distant from the error in the correction process because the whole read is present in the alignment. Coral is easily adjustable to reads produced by different sequencing technologies like Illumina Genome Analyzer and Roche/454 Life Sciences sequencing platforms because the sequencing error model can be defined by the user. We show that our method is able to reduce the error rate of reads more than previous methods.

AVAILABILITY

The source code of Coral is freely available at http://www.cs.helsinki.fi/u/lmsalmel/coral/.

摘要

动机

当前的测序技术会产生大量错误的读取。测序错误在从头测序项目中利用数据时带来了重大挑战,因为组装器难以处理错误。

结果

我们提出了 Coral,它通过形成多个比对来纠正测序错误。与以前用于纠错的工具不同,Coral 可以在纠错过程中利用远离错误的碱基,因为整个读取都存在于比对中。Coral 可以轻松地调整到不同测序技术产生的读取,如 Illumina Genome Analyzer 和 Roche/454 Life Sciences 测序平台,因为用户可以定义测序错误模型。我们表明,我们的方法能够比以前的方法更有效地降低读取错误率。

可用性

Coral 的源代码可在 http://www.cs.helsinki.fi/u/lmsalmel/coral/ 上免费获得。

相似文献

1
Correcting errors in short reads by multiple alignments.通过多次比对纠正短读中的错误。
Bioinformatics. 2011 Jun 1;27(11):1455-61. doi: 10.1093/bioinformatics/btr170. Epub 2011 Apr 5.
2
Correction of sequencing errors in a mixed set of reads.纠正混合读取集中的测序错误。
Bioinformatics. 2010 May 15;26(10):1284-90. doi: 10.1093/bioinformatics/btq151. Epub 2010 Apr 8.
3
Accurate self-correction of errors in long reads using de Bruijn graphs.使用德布鲁因图对长读段中的错误进行准确的自我校正。
Bioinformatics. 2017 Mar 15;33(6):799-806. doi: 10.1093/bioinformatics/btw321.
4
Blue: correcting sequencing errors using consensus and context.蓝色:使用一致性和上下文来纠正测序错误。
Bioinformatics. 2014 Oct;30(19):2723-32. doi: 10.1093/bioinformatics/btu368. Epub 2014 Jun 11.
5
QuorUM: An Error Corrector for Illumina Reads.QuorUM:Illumina测序读数的纠错工具
PLoS One. 2015 Jun 17;10(6):e0130821. doi: 10.1371/journal.pone.0130821. eCollection 2015.
6
Illumina error correction near highly repetitive DNA regions improves de novo genome assembly.Illumina 纠错技术在高度重复 DNA 区域的应用提高了从头基因组组装的质量。
BMC Bioinformatics. 2019 Jun 3;20(1):298. doi: 10.1186/s12859-019-2906-2.
7
Improve homology search sensitivity of PacBio data by correcting frameshifts.通过校正移码来提高PacBio数据的同源性搜索灵敏度。
Bioinformatics. 2016 Sep 1;32(17):i529-i537. doi: 10.1093/bioinformatics/btw458.
8
In search of perfect reads.寻找完美的读数。
BMC Bioinformatics. 2015;16 Suppl 17(Suppl 17):S7. doi: 10.1186/1471-2105-16-S17-S7. Epub 2015 Dec 7.
9
Correcting errors in shotgun sequences.校正鸟枪法测序中的错误。
Nucleic Acids Res. 2003 Aug 1;31(15):4663-72. doi: 10.1093/nar/gkg653;.
10
Optimal spliced alignments of short sequence reads.短序列 reads 的最优剪接比对。
Bioinformatics. 2008 Aug 15;24(16):i174-80. doi: 10.1093/bioinformatics/btn300.

引用本文的文献

1
MAFcounter: an efficient tool for counting the occurrences of k-mers in MAF files.MAFcounter:一种用于统计MAF文件中k-mer出现次数的高效工具。
BMC Bioinformatics. 2025 May 30;26(1):142. doi: 10.1186/s12859-025-06172-7.
2
MAFcounter: An efficient tool for counting the occurrences of k-mers in MAF files.MAFcounter:一种用于统计MAF文件中k-mer出现次数的高效工具。
ArXiv. 2024 Nov 29:arXiv:2411.19427v1.
3
A survey of k-mer methods and applications in bioinformatics.生物信息学中k-mer方法及其应用综述。
Comput Struct Biotechnol J. 2024 May 21;23:2289-2303. doi: 10.1016/j.csbj.2024.05.025. eCollection 2024 Dec.
4
Enhanced bovine genome annotation through integration of transcriptomics and epi-transcriptomics datasets facilitates genomic biology.通过转录组学和表观转录组学数据集的整合,增强牛基因组注释,从而促进基因组生物学研究。
Gigascience. 2024 Jan 2;13. doi: 10.1093/gigascience/giae019.
5
MAC-ErrorReads: machine learning-assisted classifier for filtering erroneous NGS reads.MAC-ErrorReads:一种基于机器学习的分类器,用于过滤错误的 NGS 读取。
BMC Bioinformatics. 2024 Feb 7;25(1):61. doi: 10.1186/s12859-024-05681-1.
6
Methods to improve the accuracy of next-generation sequencing.提高下一代测序准确性的方法。
Front Bioeng Biotechnol. 2023 Jan 20;11:982111. doi: 10.3389/fbioe.2023.982111. eCollection 2023.
7
Genome sequence assembly algorithms and misassembly identification methods.基因组序列组装算法和错误组装识别方法。
Mol Biol Rep. 2022 Nov;49(11):11133-11148. doi: 10.1007/s11033-022-07919-8. Epub 2022 Sep 23.
8
Lightweight Pattern Matching Method for DNA Sequencing in Internet of Medical Things.物联网中 DNA 测序的轻量级模式匹配方法。
Comput Intell Neurosci. 2022 Sep 8;2022:6980335. doi: 10.1155/2022/6980335. eCollection 2022.
9
CARE 2.0: reducing false-positive sequencing error corrections using machine learning.CARE 2.0:利用机器学习减少假阳性测序错误纠正。
BMC Bioinformatics. 2022 Jun 13;23(1):227. doi: 10.1186/s12859-022-04754-3.
10
Aberration-corrected ultrafine analysis of miRNA reads at single-base resolution: a k-mer lattice approach.单碱基分辨率下 miRNA reads 的畸变校正超微分析:一种 k-mer 格点方法。
Nucleic Acids Res. 2021 Oct 11;49(18):e106. doi: 10.1093/nar/gkab610.