
RTCGAToolbox: a new tool for exporting TCGA Firehose data.

Author information

Samur Mehmet Kemal

Affiliations

Department of Biostatistics and Computational Biology, Dana-Farber Cancer Institute and Harvard School of Public Health, Boston, Massachusetts, United States of America; Lebow Institute of Myeloma Therapeutics and Jerome Lipper Multiple Myeloma Center, Dana-Farber Cancer Institute and Harvard Medical School, Boston, Massachusetts, United States of America.

Publication information

PLoS One. 2014 Sep 2;9(9):e106397. doi: 10.1371/journal.pone.0106397. eCollection 2014.

DOI:10.1371/journal.pone.0106397
PMID:25181531
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC4152273/
Abstract

BACKGROUND & OBJECTIVE: Managing data from large-scale projects (such as The Cancer Genome Atlas (TCGA)) for further analysis is an important and time-consuming step for research projects. Several efforts, such as the Firehose project, make TCGA pre-processed data publicly available via web services and data portals, but this information must be managed, downloaded and prepared for subsequent steps. We have developed an open-source and extensible R-based data client for pre-processed data from Firehose, and demonstrate its use with sample case studies. Results show that our RTCGAToolbox can facilitate data management for researchers interested in working with TCGA data. The RTCGAToolbox can also be integrated with other analysis pipelines for further data processing.

AVAILABILITY AND IMPLEMENTATION

The RTCGAToolbox is open-source and licensed under the GNU General Public License Version 2.0. All documentation and source code for RTCGAToolbox are freely available at http://mksamur.github.io/RTCGAToolbox/ for Linux and Mac OS X operating systems.


