• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

对读段修剪对Illumina二代测序数据分析的影响进行的广泛评估。

An extensive evaluation of read trimming effects on Illumina NGS data analysis.

作者信息

Del Fabbro Cristian, Scalabrin Simone, Morgante Michele, Giorgi Federico M

机构信息

Institute of Applied Genomics, Udine, Italy.

IGA Technology Services, Udine, Italy.

出版信息

PLoS One. 2013 Dec 23;8(12):e85024. doi: 10.1371/journal.pone.0085024. eCollection 2013.

DOI:10.1371/journal.pone.0085024
PMID:24376861
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3871669/
Abstract

Next Generation Sequencing is having an extremely strong impact in biological and medical research and diagnostics, with applications ranging from gene expression quantification to genotyping and genome reconstruction. Sequencing data is often provided as raw reads which are processed prior to analysis 1 of the most used preprocessing procedures is read trimming, which aims at removing low quality portions while preserving the longest high quality part of a NGS read. In the current work, we evaluate nine different trimming algorithms in four datasets and three common NGS-based applications (RNA-Seq, SNP calling and genome assembly). Trimming is shown to increase the quality and reliability of the analysis, with concurrent gains in terms of execution time and computational resources needed.

摘要

新一代测序技术正在对生物学和医学研究及诊断产生极其强大的影响,其应用范围从基因表达定量到基因分型和基因组重建。测序数据通常以原始读数的形式提供,在分析之前需要进行处理。最常用的预处理程序之一是读段修剪,其目的是去除低质量部分,同时保留NGS读段中最长的高质量部分。在当前的工作中,我们在四个数据集和三个基于NGS的常见应用(RNA测序、单核苷酸多态性(SNP)检测和基因组组装)中评估了九种不同的修剪算法。结果表明,修剪可提高分析的质量和可靠性,同时在执行时间和所需计算资源方面也有相应的提升。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/18dd/3871669/04bba81d750a/pone.0085024.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/18dd/3871669/89b6f9854566/pone.0085024.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/18dd/3871669/26c3de0d1619/pone.0085024.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/18dd/3871669/3b6348364268/pone.0085024.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/18dd/3871669/23c85cd241a3/pone.0085024.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/18dd/3871669/db8230dcbd8a/pone.0085024.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/18dd/3871669/04bba81d750a/pone.0085024.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/18dd/3871669/89b6f9854566/pone.0085024.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/18dd/3871669/26c3de0d1619/pone.0085024.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/18dd/3871669/3b6348364268/pone.0085024.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/18dd/3871669/23c85cd241a3/pone.0085024.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/18dd/3871669/db8230dcbd8a/pone.0085024.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/18dd/3871669/04bba81d750a/pone.0085024.g006.jpg

相似文献

1
An extensive evaluation of read trimming effects on Illumina NGS data analysis.对读段修剪对Illumina二代测序数据分析的影响进行的广泛评估。
PLoS One. 2013 Dec 23;8(12):e85024. doi: 10.1371/journal.pone.0085024. eCollection 2013.
2
Read trimming has minimal effect on bacterial SNP-calling accuracy.reads 修剪对细菌 SNP 调用准确性的影响最小。
Microb Genom. 2020 Dec;6(12). doi: 10.1099/mgen.0.000434. Epub 2020 Dec 11.
3
Software for pre-processing Illumina next-generation sequencing short read sequences.用于预处理Illumina下一代测序短读序列的软件。
Source Code Biol Med. 2014 May 3;9:8. doi: 10.1186/1751-0473-9-8. eCollection 2014.
4
Optimal Block-Based Trimming for Next Generation Sequencing.基于最优分块的下一代测序裁剪。
IEEE/ACM Trans Comput Biol Bioinform. 2018 Mar-Apr;15(2):364-376. doi: 10.1109/TCBB.2017.2696525. Epub 2017 Apr 24.
5
QC-Chain: fast and holistic quality control method for next-generation sequencing data.QC-Chain:一种用于下一代测序数据的快速且全面的质量控制方法。
PLoS One. 2013;8(4):e60234. doi: 10.1371/journal.pone.0060234. Epub 2013 Apr 2.
6
A comparative study of k-spectrum-based error correction methods for next-generation sequencing data analysis.基于k谱的下一代测序数据分析纠错方法的比较研究。
Hum Genomics. 2016 Jul 25;10 Suppl 2(Suppl 2):20. doi: 10.1186/s40246-016-0068-0.
7
Colib'read on galaxy: a tools suite dedicated to biological information extraction from raw NGS reads.Colib'read on galaxy:一个致力于从原始NGS读段中提取生物信息的工具套件。
Gigascience. 2016 Feb 11;5:9. doi: 10.1186/s13742-015-0105-2. eCollection 2016.
8
Trimming and Validation of Illumina Short Reads Using Trimmomatic, Trinity Assembly, and Assessment of RNA-Seq Data.使用 Trimmomatic、Trinity 组装进行 Illumina 短读段的修剪和验证,以及 RNA-Seq 数据的评估。
Methods Mol Biol. 2022;2443:211-232. doi: 10.1007/978-1-0716-2067-0_11.
9
Sequence-matching adapter trimmers generate consistent quality and assembly metrics for Illumina sequencing of RNA viruses.序列匹配接头修剪器为 Illumina 测序的 RNA 病毒生成一致的质量和组装指标。
BMC Res Notes. 2024 Oct 14;17(1):308. doi: 10.1186/s13104-024-06951-0.
10
An Efficient Trimming Algorithm based on Multi-Feature Fusion Scoring Model for NGS Data.基于多特征融合评分模型的 NGS 数据高效修剪算法。
IEEE/ACM Trans Comput Biol Bioinform. 2020 May-Jun;17(3):728-738. doi: 10.1109/TCBB.2019.2897558. Epub 2019 Feb 5.

引用本文的文献

1
Phylogenetic comparative analysis of functional morphology sheds light on the evolution of seasonal migration in nightingale-thrushes (Turdidae: Catharus).功能形态学的系统发育比较分析揭示了夜鸫(鸫科:卡氏鸫属)季节性迁徙的演化过程。
Sci Rep. 2025 Jul 17;15(1):25951. doi: 10.1038/s41598-025-11396-x.
2
Integration of Bulk RNA-seq Pipeline Metrics for Assessing Low-Quality Samples.整合批量RNA测序流程指标以评估低质量样本
Res Sq. 2025 Jul 3:rs.3.rs-6976695. doi: 10.21203/rs.3.rs-6976695/v1.
3
Genetic diversity and fine-scale spatial genetic structure of European beech populations along an elevational gradient.

本文引用的文献

1
GAM-NGS: genomic assemblies merger for next generation sequencing.GAM-NGS:用于下一代测序的基因组组装合并。
BMC Bioinformatics. 2013;14 Suppl 7(Suppl 7):S6. doi: 10.1186/1471-2105-14-S7-S6. Epub 2013 Apr 22.
2
The high-quality draft genome of peach (Prunus persica) identifies unique patterns of genetic diversity, domestication and genome evolution.桃(Prunus persica)高质量基因组草图揭示了独特的遗传多样性、驯化和基因组进化模式。
Nat Genet. 2013 May;45(5):487-94. doi: 10.1038/ng.2586. Epub 2013 Mar 24.
3
Comparative study of RNA-seq- and microarray-derived coexpression networks in Arabidopsis thaliana.
沿海拔梯度的欧洲山毛榉种群的遗传多样性和精细尺度空间遗传结构
Heredity (Edinb). 2025 Jun 26. doi: 10.1038/s41437-025-00776-8.
4
A loss-of-function of ZmWRKY125 induced by CRISPR/Cas9 improves resistance against Fusarium verticillioides in maize kernels.CRISPR/Cas9诱导的ZmWRKY125功能丧失提高了玉米籽粒对轮枝镰孢菌的抗性。
Plant Cell Rep. 2025 Jun 17;44(7):144. doi: 10.1007/s00299-025-03544-4.
5
Transcriptome-Wide Analysis of Brain Cancer Initiated by Polarity Disruption in Type II Neuroblasts.对II型神经母细胞极性破坏引发的脑癌进行全转录组分析。
Int J Mol Sci. 2025 May 26;26(11):5115. doi: 10.3390/ijms26115115.
6
Preservation of milk in liquid nitrogen during sample collection does not affect the RNA quality for RNA-seq analysis.在样本采集过程中将牛奶保存在液氮中不会影响用于RNA测序分析的RNA质量。
BMC Genomics. 2025 May 24;26(1):525. doi: 10.1186/s12864-025-11707-6.
7
Metagenomic insights into microbial diversity and potential pathogenic transmission in poultry farm environments of Bangladesh.孟加拉国家禽养殖场环境中微生物多样性和潜在病原体传播的宏基因组学见解
BMC Microbiol. 2025 May 22;25(1):318. doi: 10.1186/s12866-025-03970-0.
8
Circular PVT1 promotes cardiac fibroblast activation interacting with miR-30a-5p and miR-125b-5p.环状PVT1通过与miR-30a-5p和miR-125b-5p相互作用促进心脏成纤维细胞活化。
Cell Death Dis. 2025 Apr 21;16(1):325. doi: 10.1038/s41419-025-07652-7.
9
Virulence plasmids in edema disease: Insights from whole-genome analysis of porcine O139:H1 Shiga toxin-producing (STEC) strains.水肿病中的毒力质粒:来自猪源O139:H1产志贺毒素大肠杆菌(STEC)菌株全基因组分析的见解
Front Cell Infect Microbiol. 2025 Mar 20;15:1528408. doi: 10.3389/fcimb.2025.1528408. eCollection 2025.
10
Phytic acid is an available phosphorus source for maize plants in juvenile phase belonging to two populations with different breeding backgrounds.植酸是两个具有不同育种背景群体的玉米幼苗期植株可利用的磷源。
BMC Plant Biol. 2025 Apr 3;25(1):425. doi: 10.1186/s12870-025-06431-y.
拟南芥 RNA-seq 和微阵列衍生共表达网络的比较研究。
Bioinformatics. 2013 Mar 15;29(6):717-24. doi: 10.1093/bioinformatics/btt053. Epub 2013 Feb 1.
4
De novo transcriptome assembly in chili pepper (Capsicum frutescens) to identify genes involved in the biosynthesis of capsaicinoids.利用从头转录组组装方法鉴定辣椒(Capsicum frutescens)中参与辣椒素生物合成的基因。
PLoS One. 2013;8(1):e48156. doi: 10.1371/journal.pone.0048156. Epub 2013 Jan 22.
5
Symptomatic atherosclerosis is associated with an altered gut metagenome.有症状的动脉粥样硬化与肠道微生物组的改变有关。
Nat Commun. 2012;3:1245. doi: 10.1038/ncomms2266.
6
Microevolution of a zoonotic Helicobacter population colonizing the stomach of a human host before and after failed treatment.在人类宿主的胃部中定植的一种人畜共患的幽门螺旋杆菌种群的微进化,在治疗失败前后。
Genome Biol Evol. 2012;4(12):1310-5. doi: 10.1093/gbe/evs107.
7
Next-generation sequencing in the clinic: are we ready?临床中的下一代测序:我们准备好了吗?
Nat Rev Genet. 2012 Nov;13(11):818-24. doi: 10.1038/nrg3357.
8
A comprehensive comparison of RNA-Seq-based transcriptome analysis from reads to differential gene expression and cross-comparison with microarrays: a case study in Saccharomyces cerevisiae.基于 RNA-Seq 的转录组分析从读段到差异基因表达的全面比较,以及与微阵列的交叉比较:以酿酒酵母为例的研究。
Nucleic Acids Res. 2012 Nov 1;40(20):10084-97. doi: 10.1093/nar/gks804. Epub 2012 Sep 10.
9
An integrated encyclopedia of DNA elements in the human genome.人类基因组中 DNA 元件的综合百科全书。
Nature. 2012 Sep 6;489(7414):57-74. doi: 10.1038/nature11247.
10
RobiNA: a user-friendly, integrated software solution for RNA-Seq-based transcriptomics.RobiNA:一个基于 RNA-Seq 的转录组学的用户友好、集成的软件解决方案。
Nucleic Acids Res. 2012 Jul;40(Web Server issue):W622-7. doi: 10.1093/nar/gks540. Epub 2012 Jun 8.