EARRINGS: an efficient and accurate adapter trimmer entails no a priori adapter sequences.

Authors

Wang Ting-Hsuan, Huang Cheng-Ching, Hung Jui-Hung

Affiliation

Department of Computer Science, College of Computer Science, National Chiao Tung University, National Yang Ming Chiao Tung University, Hsinchu 30010, Taiwan.

Publication information

Bioinformatics. 2021 Jul 27;37(13):1846-1852. doi: 10.1093/bioinformatics/btab025.

DOI: 10.1093/bioinformatics/btab025
PMID: 33459339
Abstract

MOTIVATION

Cross-sample comparisons and large-scale meta-analyses based on next-generation sequencing (NGS) involve replicable and universal data preprocessing, including the removal of adapter fragments from contaminated reads (i.e. adapter trimming). Modern adapter trimmers require users to provide candidate adapter sequences for each sample, but these are sometimes unavailable or incorrectly documented in repositories such as GEO or SRA. Large-scale meta-analyses are therefore jeopardized by suboptimal adapter trimming.

RESULTS

Here we introduce a set of fast and accurate adapter detection and trimming algorithms that require no a priori adapter sequences. The algorithms are implemented in modern C++ and use SIMD and multithreading for speed. Our experiments and benchmarks show that the implementation, EARRINGS, reaches accuracy comparable to, and throughput higher than, existing adapter trimmers without being given any hint of the adapter sequences. EARRINGS is particularly useful in meta-analyses of large batches of datasets and can be incorporated into sequence analysis pipelines at any scale.
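
The abstract does not say how adapters are found without prior knowledge, so the following is not the EARRINGS algorithm. It is a minimal Python sketch of one general idea for paired-end data, under stated assumptions: when the sequenced insert is shorter than the read length, read 1 begins with the insert and read 2 begins with its reverse complement, so whatever follows the overlapping part in each read is adapter. The function names, the `min_len` and `max_mismatch_rate` parameters, and the example sequences are all hypothetical and chosen only for illustration.

```python
"""Toy adapter-free trimming for one paired-end read pair (illustration only)."""
from typing import Optional, Tuple

_COMPLEMENT = str.maketrans("ACGTN", "TGCAN")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an upper-case DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def hamming(a: str, b: str) -> int:
    """Count mismatching positions between two equal-length strings."""
    return sum(x != y for x, y in zip(a, b))


def find_insert_length(read1: str, read2: str, min_len: int = 10,
                       max_mismatch_rate: float = 0.1) -> Optional[int]:
    """Guess the insert length k from the reads alone.

    If the insert has length k, read1[:k] should equal the reverse
    complement of read2[:k]. Candidates are scanned from longest to
    shortest, and the first one within the mismatch budget is returned.
    Returns None when no candidate of at least min_len bases fits.
    """
    limit = min(len(read1), len(read2))
    for k in range(limit, min_len - 1, -1):
        allowed = int(k * max_mismatch_rate)  # assumed error tolerance
        if hamming(read1[:k], reverse_complement(read2[:k])) <= allowed:
            return k
    return None


def trim_pair(read1: str, read2: str, **kwargs) -> Tuple[str, str, str, str]:
    """Return (trimmed read1, trimmed read2, removed tail1, removed tail2)."""
    k = find_insert_length(read1, read2, **kwargs)
    if k is None:  # no overlap found: leave both reads untouched
        return read1, read2, "", ""
    return read1[:k], read2[:k], read1[k:], read2[k:]


if __name__ == "__main__":
    # Hypothetical 22-base insert followed by 8 bases of an adapter-like tail.
    insert = "ACGTTGCAAGGCTTAGCCATGA"
    tail = "AGATCGGA"
    read1 = insert + tail
    read2 = reverse_complement(insert) + tail
    print(trim_pair(read1, read2))
```

The removed tails from many read pairs could then be pooled to infer a consensus adapter sequence for the sample. A production tool would also need to handle quality scores, indels, and single-end data, none of which this sketch attempts.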

AVAILABILITY AND IMPLEMENTATION

EARRINGS is open-source software and is available at https://github.com/jhhung/EARRINGS.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

Similar articles

1. EARRINGS: an efficient and accurate adapter trimmer entails no a priori adapter sequences.
   Bioinformatics. 2021 Jul 27;37(13):1846-1852. doi: 10.1093/bioinformatics/btab025.
2. Ktrim: an extra-fast and accurate adapter- and quality-trimmer for sequencing data.
   Bioinformatics. 2020 Jun 1;36(11):3561-3562. doi: 10.1093/bioinformatics/btaa171.
3. SeqPurge: highly-sensitive adapter trimming for paired-end NGS data.
   BMC Bioinformatics. 2016 May 10;17:208. doi: 10.1186/s12859-016-1069-7.
4. PEAT: an intelligent and efficient paired-end sequencing adapter trimming algorithm.
   BMC Bioinformatics. 2015;16 Suppl 1(Suppl 1):S2. doi: 10.1186/1471-2105-16-S1-S2. Epub 2015 Jan 21.
5. Skewer: a fast and accurate adapter trimmer for next-generation sequencing paired-end reads.
   BMC Bioinformatics. 2014 Jun 12;15:182. doi: 10.1186/1471-2105-15-182.
6. An Efficient Trimming Algorithm based on Multi-Feature Fusion Scoring Model for NGS Data.
   IEEE/ACM Trans Comput Biol Bioinform. 2020 May-Jun;17(3):728-738. doi: 10.1109/TCBB.2019.2897558. Epub 2019 Feb 5.
7. Atropos: specific, sensitive, and speedy trimming of sequencing reads.
   PeerJ. 2017 Aug 30;5:e3720. doi: 10.7717/peerj.3720. eCollection 2017.
8. Software for pre-processing Illumina next-generation sequencing short read sequences.
   Source Code Biol Med. 2014 May 3;9:8. doi: 10.1186/1751-0473-9-8. eCollection 2014.
9. Sequence-matching adapter trimmers generate consistent quality and assembly metrics for Illumina sequencing of RNA viruses.
   BMC Res Notes. 2024 Oct 14;17(1):308. doi: 10.1186/s13104-024-06951-0.
10. Porechop_ABI: discovering unknown adapters in Oxford Nanopore Technology sequencing reads for downstream trimming.
    Bioinform Adv. 2022 Nov 21;3(1):vbac085. doi: 10.1093/bioadv/vbac085. eCollection 2023.

Cited by

1. Effects of overexpression on lignin and cell wall characteristics in transgenic hybrid aspen.
   Front Plant Sci. 2025 Mar 28;16:1543168. doi: 10.3389/fpls.2025.1543168. eCollection 2025.
2. FindAdapt: A python package for fast and accurate adapter detection in small RNA sequencing.
   PLoS Comput Biol. 2024 Jan 22;20(1):e1011786. doi: 10.1371/journal.pcbi.1011786. eCollection 2024 Jan.
3. High genome heterozygosity revealed vegetative propagation over the sea in Moso bamboo.
   BMC Genomics. 2023 Jun 24;24(1):348. doi: 10.1186/s12864-023-09428-9.
4. MiR34 contributes to spinal muscular atrophy and AAV9-mediated delivery of MiR34a ameliorates the motor deficits in SMA mice.
   Mol Ther Nucleic Acids. 2023 Mar 15;32:144-160. doi: 10.1016/j.omtn.2023.03.005. eCollection 2023 Jun 13.