AUSPP: A universal short-read pre-processing package.

Authors

Gao Lei, Wu Cong, Liu Lin

Affiliation

The Key Laboratory of Plant Epigenetics of Guangdong Province, College of Life Sciences and Oceanography, Shenzhen University, Shenzhen, P. R. China.

Publication information

J Bioinform Comput Biol. 2019 Dec;17(6):1950037. doi: 10.1142/S0219720019500379.

DOI: 10.1142/S0219720019500379
PMID: 32019418
Abstract

There are many short-read aligners that can map short reads to a reference genome/sequence, and most of them can directly accept a FASTQ file as the input query file. However, the raw data usually need to be pre-processed. Few software programs specialize in pre-processing raw data generated by a variety of next-generation sequencing (NGS) technologies. Here, we present AUSPP, a Perl script-based pipeline for pre-processing and automatic mapping of NGS short reads. This pipeline encompasses quality control, adaptor trimming, collapsing of reads, structural RNA removal, length selection, read mapping, and normalized wiggle file creation. It facilitates the processing from raw data to genome mapping and is therefore a powerful tool for the steps before meta-analysis. Most importantly, since AUSPP has default processing pipeline settings for many types of NGS data, most of the time, users will simply need to provide the raw data and genome. AUSPP is portable and easy to install, and the source codes are freely available at https://github.com/highlei/AUSPP.
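
Three of the steps listed in the abstract (adaptor trimming, length selection, and collapsing of reads) can be illustrated with a minimal Python sketch. This is not AUSPP's Perl code or its interface: the function names, the adaptor sequence, the 18 to 30 nt length window, and the reads-per-million normalization are all assumptions chosen for illustration only.

```python
from collections import Counter


def trim_adaptor(read: str, adaptor: str, min_overlap: int = 5) -> str:
    """Strip a 3' adaptor (hypothetical helper, exact matching only).

    A full adaptor is cut wherever it first occurs; otherwise a partial
    adaptor prefix of at least `min_overlap` bases is cut from the read end.
    """
    idx = read.find(adaptor)
    if idx != -1:
        return read[:idx]
    longest = min(len(adaptor), len(read))
    # Try the longest possible partial match first.
    for k in range(longest, max(min_overlap, 1) - 1, -1):
        if read.endswith(adaptor[:k]):
            return read[:-k]
    return read


def preprocess(reads, adaptor, min_len=18, max_len=30):
    """Trim, length-select, and collapse reads into sequence -> count."""
    trimmed = (trim_adaptor(r, adaptor) for r in reads)
    selected = (r for r in trimmed if min_len <= len(r) <= max_len)
    return Counter(selected)


def reads_per_million(counts):
    """Scale collapsed counts to reads per million (assumed normalization)."""
    total = sum(counts.values())
    if total == 0:
        return {}
    return {seq: n * 1_000_000 / total for seq, n in counts.items()}


if __name__ == "__main__":
    adaptor = "TGGAATTCTCGGGTGCCAAGG"  # illustrative adaptor sequence
    insert = "ACGTACGTACGTACGTACGT"
    reads = [
        insert + adaptor[:10],  # partial adaptor at the 3' end
        insert + adaptor,       # full adaptor
        "ACGT" + adaptor,       # too short after trimming, so dropped
    ]
    counts = preprocess(reads, adaptor)
    print(counts)
    print(reads_per_million(counts))
```

Collapsing identical sequences before mapping means each distinct sequence is aligned only once, with its count carried along for later normalization. A real pipeline would also need quality filtering and mismatch-tolerant adaptor matching, which this sketch omits.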

Similar articles

1
AUSPP: A universal short-read pre-processing package.
J Bioinform Comput Biol. 2019 Dec;17(6):1950037. doi: 10.1142/S0219720019500379.
2
NGS-QCbox and Raspberry for Parallel, Automated and Rapid Quality Control Analysis of Large-Scale Next Generation Sequencing (Illumina) Data.
PLoS One. 2015 Oct 13;10(10):e0139868. doi: 10.1371/journal.pone.0139868. eCollection 2015.
3
Software for pre-processing Illumina next-generation sequencing short read sequences.
Source Code Biol Med. 2014 May 3;9:8. doi: 10.1186/1751-0473-9-8. eCollection 2014.
4
AlignerBoost: A Generalized Software Toolkit for Boosting Next-Gen Sequencing Mapping Accuracy Using a Bayesian-Based Mapping Quality Framework.
PLoS Comput Biol. 2016 Oct 5;12(10):e1005096. doi: 10.1371/journal.pcbi.1005096. eCollection 2016 Oct.
5
Fully automated pipeline for detection of sex linked genes using RNA-Seq data.
BMC Bioinformatics. 2015 Mar 11;16(1):78. doi: 10.1186/s12859-015-0509-0.
6
SeqAssist: a novel toolkit for preliminary analysis of next-generation sequencing data.
BMC Bioinformatics. 2014;15 Suppl 11(Suppl 11):S10. doi: 10.1186/1471-2105-15-S11-S10. Epub 2014 Oct 21.
7
SPARTA: Simple Program for Automated reference-based bacterial RNA-seq Transcriptome Analysis.
BMC Bioinformatics. 2016 Feb 4;17:66. doi: 10.1186/s12859-016-0923-y.
8
Evaluation and assessment of read-mapping by multiple next-generation sequencing aligners based on genome-wide characteristics.
Genomics. 2017 Jul;109(3-4):186-191. doi: 10.1016/j.ygeno.2017.03.001. Epub 2017 Mar 9.
9
HSA: a heuristic splice alignment tool.
BMC Syst Biol. 2013;7 Suppl 2(Suppl 2):S10. doi: 10.1186/1752-0509-7-S2-S10. Epub 2013 Dec 17.
10
Accurate estimation of short read mapping quality for next-generation genome sequencing.
Bioinformatics. 2012 Sep 15;28(18):i349-i355. doi: 10.1093/bioinformatics/bts408.

Cited by

1
Integrated Analysis of Transcriptome and Small RNAome Reveals the Regulatory Network for Rapid Growth in .
Int J Mol Sci. 2022 Sep 13;23(18):10596. doi: 10.3390/ijms231810596.
2
TRANS-ACTING SIRNA3-derived short interfering RNAs confer cleavage of mRNAs in rice.
Plant Physiol. 2022 Jan 20;188(1):347-362. doi: 10.1093/plphys/kiab452.