• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

QuickRNASeq将大规模RNA测序数据分析提升到了一个新的自动化和交互式可视化水平。

QuickRNASeq lifts large-scale RNA-seq data analyses to the next level of automation and interactive visualization.

作者信息

Zhao Shanrong, Xi Li, Quan Jie, Xi Hualin, Zhang Ying, von Schack David, Vincent Michael, Zhang Baohong

机构信息

PharmaTherapeutics Clinical R&D, Pfizer Worldwide Research and Development, Cambridge, MA, 02139, USA.

Computational Sciences Center of Emphasis, Pfizer Worldwide Research and Development, Cambridge, MA, 02139, USA.

出版信息

BMC Genomics. 2016 Jan 8;17:39. doi: 10.1186/s12864-015-2356-9.

DOI:10.1186/s12864-015-2356-9
PMID:26747388
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4706714/
Abstract

BACKGROUND

RNA sequencing (RNA-seq), a next-generation sequencing technique for transcriptome profiling, is being increasingly used, in part driven by the decreasing cost of sequencing. Nevertheless, the analysis of the massive amounts of data generated by large-scale RNA-seq remains a challenge. Multiple algorithms pertinent to basic analyses have been developed, and there is an increasing need to automate the use of these tools so as to obtain results in an efficient and user friendly manner. Increased automation and improved visualization of the results will help make the results and findings of the analyses readily available to experimental scientists.

RESULTS

By combing the best open source tools developed for RNA-seq data analyses and the most advanced web 2.0 technologies, we have implemented QuickRNASeq, a pipeline for large-scale RNA-seq data analyses and visualization. The QuickRNASeq workflow consists of three main steps. In Step #1, each individual sample is processed, including mapping RNA-seq reads to a reference genome, counting the numbers of mapped reads, quality control of the aligned reads, and SNP (single nucleotide polymorphism) calling. Step #1 is computationally intensive, and can be processed in parallel. In Step #2, the results from individual samples are merged, and an integrated and interactive project report is generated. All analyses results in the report are accessible via a single HTML entry webpage. Step #3 is the data interpretation and presentation step. The rich visualization features implemented here allow end users to interactively explore the results of RNA-seq data analyses, and to gain more insights into RNA-seq datasets. In addition, we used a real world dataset to demonstrate the simplicity and efficiency of QuickRNASeq in RNA-seq data analyses and interactive visualizations. The seamless integration of automated capabilites with interactive visualizations in QuickRNASeq is not available in other published RNA-seq pipelines.

CONCLUSION

The high degree of automation and interactivity in QuickRNASeq leads to a substantial reduction in the time and effort required prior to further downstream analyses and interpretation of the analyses findings. QuickRNASeq advances primary RNA-seq data analyses to the next level of automation, and is mature for public release and adoption.

摘要

背景

RNA测序(RNA-seq)是一种用于转录组分析的新一代测序技术,随着测序成本的降低,其应用越来越广泛。然而,对大规模RNA-seq产生的海量数据进行分析仍然是一项挑战。已经开发了多种与基础分析相关的算法,并且越来越需要自动化使用这些工具,以便以高效且用户友好的方式获得结果。提高自动化程度并改善结果的可视化将有助于使分析结果和发现易于被实验科学家获取。

结果

通过结合为RNA-seq数据分析开发的最佳开源工具和最先进的Web 2.0技术,我们实现了QuickRNASeq,这是一个用于大规模RNA-seq数据分析和可视化的流程。QuickRNASeq工作流程包括三个主要步骤。在步骤1中,对每个单独的样本进行处理,包括将RNA-seq读数映射到参考基因组、计算映射读数的数量、对齐读数的质量控制以及单核苷酸多态性(SNP)检测。步骤1计算量很大,可以并行处理。在步骤2中,合并各个样本的结果,并生成一份综合且交互式的项目报告。报告中的所有分析结果都可以通过一个HTML入口网页访问。步骤3是数据解释和呈现步骤。这里实现的丰富可视化功能允许最终用户交互式地探索RNA-seq数据分析结果,并更深入地了解RNA-seq数据集。此外,我们使用一个实际数据集来证明QuickRNASeq在RNA-seq数据分析和交互式可视化方面的简单性和效率。QuickRNASeq中自动化功能与交互式可视化的无缝集成在其他已发表的RNA-seq流程中是不存在的。

结论

QuickRNASeq的高度自动化和交互性使得在对分析结果进行进一步下游分析和解释之前所需的时间和精力大幅减少。QuickRNASeq将初级RNA-seq数据分析提升到了更高的自动化水平,并已成熟可供公众发布和采用。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a6bc/4706714/c894416d7bae/12864_2015_2356_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a6bc/4706714/ebf733b13549/12864_2015_2356_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a6bc/4706714/3e11b2c63fff/12864_2015_2356_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a6bc/4706714/10526354a392/12864_2015_2356_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a6bc/4706714/e567771bbf62/12864_2015_2356_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a6bc/4706714/8b98551da202/12864_2015_2356_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a6bc/4706714/c894416d7bae/12864_2015_2356_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a6bc/4706714/ebf733b13549/12864_2015_2356_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a6bc/4706714/3e11b2c63fff/12864_2015_2356_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a6bc/4706714/10526354a392/12864_2015_2356_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a6bc/4706714/e567771bbf62/12864_2015_2356_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a6bc/4706714/8b98551da202/12864_2015_2356_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a6bc/4706714/c894416d7bae/12864_2015_2356_Fig6_HTML.jpg

相似文献

1
QuickRNASeq lifts large-scale RNA-seq data analyses to the next level of automation and interactive visualization.QuickRNASeq将大规模RNA测序数据分析提升到了一个新的自动化和交互式可视化水平。
BMC Genomics. 2016 Jan 8;17:39. doi: 10.1186/s12864-015-2356-9.
2
QuickRNASeq: Guide for Pipeline Implementation and for Interactive Results Visualization.快速RNA测序:流程实施及交互式结果可视化指南
Methods Mol Biol. 2018;1751:57-70. doi: 10.1007/978-1-4939-7710-9_4.
3
SPARTA: Simple Program for Automated reference-based bacterial RNA-seq Transcriptome Analysis.SPARTA:用于基于参考的细菌RNA测序转录组自动分析的简单程序。
BMC Bioinformatics. 2016 Feb 4;17:66. doi: 10.1186/s12859-016-0923-y.
4
QuickMIRSeq: a pipeline for quick and accurate quantification of both known miRNAs and isomiRs by jointly processing multiple samples from microRNA sequencing.QuickMIRSeq:一种通过联合处理来自微小RNA测序的多个样本,对已知微小RNA和异源微小RNA进行快速准确定量的流程。
BMC Bioinformatics. 2017 Mar 20;18(1):180. doi: 10.1186/s12859-017-1601-4.
5
Fully automated pipeline for detection of sex linked genes using RNA-Seq data.使用RNA测序数据检测性连锁基因的全自动流程
BMC Bioinformatics. 2015 Mar 11;16(1):78. doi: 10.1186/s12859-015-0509-0.
6
iSeq: Web-Based RNA-seq Data Analysis and Visualization.iSeq:基于网络的RNA测序数据分析与可视化
Methods Mol Biol. 2018;1754:167-181. doi: 10.1007/978-1-4939-7717-8_10.
7
VIPER: Visualization Pipeline for RNA-seq, a Snakemake workflow for efficient and complete RNA-seq analysis.VIPER:RNA-seq 可视化管道,一个 Snakemake 工作流程,用于高效完整的 RNA-seq 分析。
BMC Bioinformatics. 2018 Apr 12;19(1):135. doi: 10.1186/s12859-018-2139-9.
8
QuickNGS elevates Next-Generation Sequencing data analysis to a new level of automation.QuickNGS将下一代测序数据分析提升到了一个新的自动化水平。
BMC Genomics. 2015 Jul 1;16(1):487. doi: 10.1186/s12864-015-1695-x.
9
SimBA: A methodology and tools for evaluating the performance of RNA-Seq bioinformatic pipelines.SimBA:一种用于评估RNA测序生物信息学流程性能的方法和工具。
BMC Bioinformatics. 2017 Sep 29;18(1):428. doi: 10.1186/s12859-017-1831-5.
10
CANEapp: a user-friendly application for automated next generation transcriptomic data analysis.CANEapp:一款用于自动化下一代转录组数据分析的用户友好型应用程序。
BMC Genomics. 2016 Jan 13;17:49. doi: 10.1186/s12864-015-2346-y.

引用本文的文献

1
A comprehensive workflow for optimizing RNA-seq data analysis.一种优化 RNA-seq 数据分析的综合工作流程。
BMC Genomics. 2024 Jun 24;25(1):631. doi: 10.1186/s12864-024-10414-y.
2
DNA methylation of insulin signaling pathways is associated with HOMA2-IR in primary myoblasts from older adults.胰岛素信号通路的 DNA 甲基化与老年人原代成肌细胞中 HOMA2-IR 相关。
Skelet Muscle. 2023 Oct 28;13(1):17. doi: 10.1186/s13395-023-00326-y.
3
transXpress: a Snakemake pipeline for streamlined de novo transcriptome assembly and annotation.

本文引用的文献

1
Union Exon Based Approach for RNA-Seq Gene Quantification: To Be or Not to Be?基于外显子联合的RNA测序基因定量方法:何去何从?
PLoS One. 2015 Nov 11;10(11):e0141910. doi: 10.1371/journal.pone.0141910. eCollection 2015.
2
Mapping RNA-seq Reads with STAR.使用STAR对RNA测序读数进行比对
Curr Protoc Bioinformatics. 2015 Sep 3;51:11.14.1-11.14.19. doi: 10.1002/0471250953.bi1114s51.
3
Comparison of stranded and non-stranded RNA-seq transcriptome profiling and investigation of gene overlap.链特异性和非链特异性RNA测序转录组分析比较及基因重叠研究
transXpress:用于简化从头转录组组装和注释的 SnakeMake 管道。
BMC Bioinformatics. 2023 Apr 4;24(1):133. doi: 10.1186/s12859-023-05254-8.
4
Molecular and Functional Characterization of Human Intestinal Organoids and Monolayers for Modeling Epithelial Barrier.人类肠类器官和单层细胞的分子和功能特征用于上皮屏障建模。
Inflamm Bowel Dis. 2023 Feb 1;29(2):195-206. doi: 10.1093/ibd/izac212.
5
Searchlight: automated bulk RNA-seq exploration and visualisation using dynamically generated R scripts.Searchlight:使用动态生成的 R 脚本进行自动化批量 RNA-seq 探索和可视化。
BMC Bioinformatics. 2021 Aug 19;22(1):411. doi: 10.1186/s12859-021-04321-2.
6
A versatile workflow to integrate RNA-seq genomic and transcriptomic data into mechanistic models of signaling pathways.一种灵活的工作流程,可将 RNA-seq 基因组和转录组数据集成到信号通路的机制模型中。
PLoS Comput Biol. 2021 Feb 11;17(2):e1008748. doi: 10.1371/journal.pcbi.1008748. eCollection 2021 Feb.
7
The Adipocyte Acquires a Fibroblast-Like Transcriptional Signature in Response to a High Fat Diet.脂肪细胞在高脂肪饮食的刺激下呈现出成纤维细胞样的转录特征。
Sci Rep. 2020 Feb 11;10(1):2380. doi: 10.1038/s41598-020-59284-w.
8
Thrombospondin-I is a critical modulator in non-alcoholic steatohepatitis (NASH).血栓反应蛋白-1 是非酒精性脂肪性肝炎(NASH)的关键调节因子。
PLoS One. 2019 Dec 31;14(12):e0226854. doi: 10.1371/journal.pone.0226854. eCollection 2019.
9
BISR-RNAseq: an efficient and scalable RNAseq analysis workflow with interactive report generation.BISR-RNAseq:一个高效、可扩展的 RNAseq 分析工作流程,具有交互式报告生成功能。
BMC Bioinformatics. 2019 Dec 20;20(Suppl 24):670. doi: 10.1186/s12859-019-3251-1.
10
deficient microglia exhibit a premature aging transcriptome.功能失调的小胶质细胞表现出过早衰老的转录组。
Life Sci Alliance. 2019 Dec 2;2(6). doi: 10.26508/lsa.201900453. Print 2019 Dec.
BMC Genomics. 2015 Sep 3;16(1):675. doi: 10.1186/s12864-015-1876-7.
4
Comparative assessment of methods for the computational inference of transcript isoform abundance from RNA-seq data.从RNA测序数据计算推断转录本异构体丰度方法的比较评估
Genome Biol. 2015 Jul 23;16(1):150. doi: 10.1186/s13059-015-0702-5.
5
QuickNGS elevates Next-Generation Sequencing data analysis to a new level of automation.QuickNGS将下一代测序数据分析提升到了一个新的自动化水平。
BMC Genomics. 2015 Jul 1;16(1):487. doi: 10.1186/s12864-015-1695-x.
6
Human genomics. The human transcriptome across tissues and individuals.人类基因组学。跨组织和个体的人类转录组。
Science. 2015 May 8;348(6235):660-5. doi: 10.1126/science.aaa0355.
7
Human genomics. The Genotype-Tissue Expression (GTEx) pilot analysis: multitissue gene regulation in humans.人类基因组学。基因型-组织表达(GTEx)试点分析:人类多组织基因调控
Science. 2015 May 8;348(6235):648-60. doi: 10.1126/science.1262110. Epub 2015 May 7.
8
A comprehensive evaluation of ensembl, RefSeq, and UCSC annotations in the context of RNA-seq read mapping and gene quantification.在RNA测序读段映射和基因定量的背景下,对Ensembl、RefSeq和UCSC注释进行全面评估。
BMC Genomics. 2015 Feb 18;16(1):97. doi: 10.1186/s12864-015-1308-8.
9
Gene Ontology Consortium: going forward.基因本体论联盟:展望未来。
Nucleic Acids Res. 2015 Jan;43(Database issue):D1049-56. doi: 10.1093/nar/gku1179. Epub 2014 Nov 26.
10
RNA-Seq Data: A Complexity Journey.RNA测序数据:一段复杂性之旅。
Comput Struct Biotechnol J. 2014 Sep 17;11(19):123-30. doi: 10.1016/j.csbj.2014.09.004. eCollection 2014 Sep.