AUSPP: A universal short-read pre-processing package.

Authors

Gao Lei, Wu Cong, Liu Lin

Affiliation

The Key Laboratory of Plant Epigenetics of Guangdong Province, College of Life Sciences and Oceanography, Shenzhen University, Shenzhen, P. R. China.

Publication information

J Bioinform Comput Biol. 2019 Dec;17(6):1950037. doi: 10.1142/S0219720019500379.


PMID: 32019418

Abstract

Many short-read aligners can map short reads to a reference genome or sequence, and most accept a FASTQ file directly as the query input. However, raw data usually need to be pre-processed first, and few software programs specialize in pre-processing raw data generated by the variety of next-generation sequencing (NGS) technologies. Here, we present AUSPP, a Perl script-based pipeline for pre-processing and automatic mapping of NGS short reads. The pipeline encompasses quality control, adaptor trimming, collapsing of reads, structural RNA removal, length selection, read mapping, and normalized wiggle file creation. It carries the processing from raw data through genome mapping and is therefore a powerful tool for the steps before meta-analysis. Most importantly, because AUSPP has default pipeline settings for many types of NGS data, users will usually need to provide only the raw data and the genome. AUSPP is portable and easy to install, and the source code is freely available at https://github.com/highlei/AUSPP.
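To make three of the listed stages concrete (adaptor trimming, length selection, and collapsing of reads), the following is a minimal Python sketch. It is not AUSPP's code, which is written in Perl; the function names, the exact-match trimming rule, the `min_overlap` threshold, and the 18 to 30 nt length window are all assumptions chosen for illustration, and the adaptor string is only an example value.

```python
from collections import Counter


def trim_adaptor(read: str, adaptor: str, min_overlap: int = 5) -> str:
    """Remove a 3' adaptor from a read (illustrative rule, exact matches only).

    A full adaptor is cut wherever it first occurs; otherwise a prefix of the
    adaptor of at least `min_overlap` bases is cut if it ends the read.
    """
    idx = read.find(adaptor)
    if idx != -1:
        return read[:idx]
    # Try the longest partial adaptor first, down to the minimum overlap.
    longest = min(len(adaptor) - 1, len(read))
    for k in range(longest, min_overlap - 1, -1):
        if read.endswith(adaptor[:k]):
            return read[:-k]
    return read


def preprocess(reads, adaptor, min_len=18, max_len=30):
    """Trim, length-select, and collapse reads into sequence -> count."""
    trimmed = (trim_adaptor(r, adaptor) for r in reads)
    kept = (r for r in trimmed if min_len <= len(r) <= max_len)
    # Collapsing: identical sequences are merged and their copies counted.
    return Counter(kept)


if __name__ == "__main__":
    adaptor = "TGGAATTCTCGGGTGCCAAGG"  # example value, an assumption
    insert = "ACGTACGTACGTACGTACGT"
    reads = [insert + adaptor, insert + adaptor[:12], "ACGT" + adaptor]
    for seq, count in preprocess(reads, adaptor).items():
        print(seq, count)
```

Collapsing before mapping means each distinct sequence is aligned once and its count carried along, which is why the sketch returns a counter rather than a list of reads.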
