TCGAbiolinks：一个用于对TCGA数据进行综合分析的R/Bioconductor软件包。

TCGAbiolinks: an R/Bioconductor package for integrative analysis of TCGA data.

作者信息

Colaprico Antonio, Silva Tiago C, Olsen Catharina, Garofano Luciano, Cava Claudia, Garolini Davide, Sabedot Thais S, Malta Tathiane M, Pagnotta Stefano M, Castiglioni Isabella, Ceccarelli Michele, Bontempi Gianluca, Noushmehr Houtan

机构信息

Interuniversity Institute of Bioinformatics in Brussels (IB), Brussels, Belgium Machine Learning Group (MLG), Department d'Informatique, Université libre de Bruxelles (ULB), Brussels, Belgium.

Department of Genetics Ribeirão Preto Medical School, University of São Paulo, Ribeirão Preto, São Paulo, Brazil Center for Integrative Systems Biology - CISBi, NAP/USP, Ribeirão Preto, São Paulo, Brazil.

出版信息

Nucleic Acids Res. 2016 May 5;44(8):e71. doi: 10.1093/nar/gkv1507. Epub 2015 Dec 23.

DOI:10.1093/nar/gkv1507

PMID:26704973

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4856967/

Abstract

The Cancer Genome Atlas (TCGA) research network has made public a large collection of clinical and molecular phenotypes of more than 10 000 tumor patients across 33 different tumor types. Using this cohort, TCGA has published over 20 marker papers detailing the genomic and epigenomic alterations associated with these tumor types. Although many important discoveries have been made by TCGA's research network, opportunities still exist to implement novel methods, thereby elucidating new biological pathways and diagnostic markers. However, mining the TCGA data presents several bioinformatics challenges, such as data retrieval and integration with clinical data and other molecular data types (e.g. RNA and DNA methylation). We developed an R/Bioconductor package called TCGAbiolinks to address these challenges and offer bioinformatics solutions by using a guided workflow to allow users to query, download and perform integrative analyses of TCGA data. We combined methods from computer science and statistics into the pipeline and incorporated methodologies developed in previous TCGA marker studies and in our own group. Using four different TCGA tumor types (Kidney, Brain, Breast and Colon) as examples, we provide case studies to illustrate examples of reproducibility, integrative analysis and utilization of different Bioconductor packages to advance and accelerate novel discoveries.

摘要

癌症基因组图谱（TCGA）研究网络公开了大量来自33种不同肿瘤类型的10000多名肿瘤患者的临床和分子表型数据。利用这一队列，TCGA发表了20多篇标志性论文，详细阐述了与这些肿瘤类型相关的基因组和表观基因组改变。尽管TCGA研究网络已经取得了许多重要发现，但仍有机会采用新方法，从而阐明新的生物学途径和诊断标志物。然而，挖掘TCGA数据面临着一些生物信息学挑战，如数据检索以及与临床数据和其他分子数据类型（如RNA和DNA甲基化）的整合。我们开发了一个名为TCGAbiolinks的R/Bioconductor软件包来应对这些挑战，并通过使用一个有指导的工作流程提供生物信息学解决方案，以允许用户查询、下载和对TCGA数据进行综合分析。我们将计算机科学和统计学方法整合到流程中，并纳入了先前TCGA标志物研究以及我们自己团队所开发的方法。以四种不同的TCGA肿瘤类型（肾、脑、乳腺和结肠）为例，我们提供案例研究，以说明可重复性、综合分析以及利用不同的Bioconductor软件包推进和加速新发现的实例。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5ff3/4856967/ebeb1f9c41bb/gkv1507fig1.jpg

相似文献

TCGAbiolinks: an R/Bioconductor package for integrative analysis of TCGA data.

Nucleic Acids Res. 2016 May 5;44(8):e71. doi: 10.1093/nar/gkv1507. Epub 2015 Dec 23.

: Analyze cancer genomics and epigenomics data using Bioconductor packages.

F1000Res. 2016 Jun 29;5:1542. doi: 10.12688/f1000research.8923.2. eCollection 2016.

New functionalities in the TCGAbiolinks package for the study and integration of cancer data from GDC and GTEx.

PLoS Comput Biol. 2019 Mar 5;15(3):e1006701. doi: 10.1371/journal.pcbi.1006701. eCollection 2019 Mar.

InterSIM: Simulation tool for multiple integrative 'omic datasets'.

Comput Methods Programs Biomed. 2016 May;128:69-74. doi: 10.1016/j.cmpb.2016.02.011. Epub 2016 Feb 27.

Identification of Gene Expression Pattern Related to Breast Cancer Survival Using Integrated TCGA Datasets and Genomic Tools.

Biomed Res Int. 2015;2015:878546. doi: 10.1155/2015/878546. Epub 2015 Oct 20.

PanCancer insights from The Cancer Genome Atlas: the pathologist's perspective.

J Pathol. 2018 Apr;244(5):512-524. doi: 10.1002/path.5028. Epub 2018 Feb 22.

A Practical Guide to The Cancer Genome Atlas (TCGA).

Methods Mol Biol. 2016;1418:111-41. doi: 10.1007/978-1-4939-3578-9_6.

MethCNA: a database for integrating genomic and epigenomic data in human cancer.

BMC Genomics. 2018 Feb 13;19(1):138. doi: 10.1186/s12864-018-4525-0.

BEclear: Batch Effect Detection and Adjustment in DNA Methylation Data.

PLoS One. 2016 Aug 25;11(8):e0159921. doi: 10.1371/journal.pone.0159921. eCollection 2016.

Exploring TCGA Pan-Cancer data at the UCSC Cancer Genomics Browser.

Sci Rep. 2013 Oct 2;3:2652. doi: 10.1038/srep02652.

引用本文的文献

Weighted overlapping group lasso for integrating prior network knowledge into gene set analysis.

BMC Bioinformatics. 2025 Sep 1;26(1):226. doi: 10.1186/s12859-025-06170-9.

Predictive value of MHC-related genes in cervical cancer: implications for immunotherapy and prognostic nomogram development.

Discov Oncol. 2025 Sep 1;16(1):1662. doi: 10.1007/s12672-025-03460-9.

Overexpression of mG writers METTL1 and BUD23 confers oncogenicity in kidney renal clear cell carcinoma.

J Pathol. 2025 Sep;267(1):1-9. doi: 10.1002/path.6453. Epub 2025 Jul 18.

Machine learning-based identification of diagnostic and prognostic mitotic cell cycle genes in hepatocellular carcinoma.

PLoS One. 2025 Aug 28;20(8):e0331118. doi: 10.1371/journal.pone.0331118. eCollection 2025.

Identification of Epigenetic Regulatory Networks of Gene Methylation-miRNA-Transcription Factor Feed-Forward Loops in Basal-like Breast Cancer.

Cells. 2025 Aug 10;14(16):1235. doi: 10.3390/cells14161235.

Exploring the multifaceted roles of glutamate oxaloacetate transaminase 1 as a biomarker and therapeutic target in colorectal cancer and pan-cancer analyses.

Discov Oncol. 2025 Aug 23;16(1):1600. doi: 10.1007/s12672-025-03461-8.

One-carbon metabolic pathway is a novel molecular signature for CD44-positive intestinal-type gastric cancer.

Cell Death Discov. 2025 Aug 23;11(1):399. doi: 10.1038/s41420-025-02704-5.

Bioinformatics mining and experimental validation of prognostic biomarkers in colorectal cancer.

Discov Oncol. 2025 Aug 22;16(1):1596. doi: 10.1007/s12672-025-03301-9.

Disulfidptosis-associated gene signature predicts prognosis and radioresistance in NSCLC.

Transl Oncol. 2025 Aug 20;61:102496. doi: 10.1016/j.tranon.2025.102496.

SARDH in the 1-C metabolism sculpts the T-cell fate and serves as a potential cancer therapeutic target.

Cell Mol Immunol. 2025 Aug 20. doi: 10.1038/s41423-025-01331-5.

本文引用的文献

Molecular Profiling Reveals Biologically Discrete Subsets and Pathways of Progression in Diffuse Glioma.

Cell. 2016 Jan 28;164(3):550-63. doi: 10.1016/j.cell.2015.12.028.

The expanding role of primary care in cancer control.

Lancet Oncol. 2015 Sep;16(12):1231-72. doi: 10.1016/S1470-2045(15)00205-3.

Comprehensive, Integrative Genomic Analysis of Diffuse Lower-Grade Gliomas.

N Engl J Med. 2015 Jun 25;372(26):2481-98. doi: 10.1056/NEJMoa1402121. Epub 2015 Jun 10.

Inferring regulatory element landscapes and transcription factor networks from cancer methylomes.

Genome Biol. 2015 May 21;16(1):105. doi: 10.1186/s13059-015-0668-3.

Orchestrating high-throughput genomic analysis with Bioconductor.

Nat Methods. 2015 Feb;12(2):115-21. doi: 10.1038/nmeth.3252.

limma powers differential expression analyses for RNA-sequencing and microarray studies.

Nucleic Acids Res. 2015 Apr 20;43(7):e47. doi: 10.1093/nar/gkv007. Epub 2015 Jan 20.

Integrated genomic characterization of papillary thyroid carcinoma.

Cell. 2014 Oct 23;159(3):676-90. doi: 10.1016/j.cell.2014.09.050.

The UCSC Cancer Genomics Browser: update 2015.

Nucleic Acids Res. 2015 Jan;43(Database issue):D812-7. doi: 10.1093/nar/gku1073. Epub 2014 Nov 11.

The 'dnet' approach promotes emerging research on cancer patient survival.

Genome Med. 2014 Aug 26;6(8):64. doi: 10.1186/s13073-014-0064-8. eCollection 2014.

RTCGAToolbox: a new tool for exporting TCGA Firehose data.

PLoS One. 2014 Sep 2;9(9):e106397. doi: 10.1371/journal.pone.0106397. eCollection 2014.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

TCGAbiolinks：一个用于对TCGA数据进行综合分析的R/Bioconductor软件包。

TCGAbiolinks: an R/Bioconductor package for integrative analysis of TCGA data.

作者信息

机构信息

Interuniversity Institute of Bioinformatics in Brussels (IB), Brussels, Belgium Machine Learning Group (MLG), Department d'Informatique, Université libre de Bruxelles (ULB), Brussels, Belgium.

出版信息

Nucleic Acids Res. 2016 May 5;44(8):e71. doi: 10.1093/nar/gkv1507. Epub 2015 Dec 23.

DOI:10.1093/nar/gkv1507

PMID:26704973

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4856967/

Abstract

摘要

TCGAbiolinks：一个用于对TCGA数据进行综合分析的R/Bioconductor软件包。

TCGAbiolinks: an R/Bioconductor package for integrative analysis of TCGA data.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

TCGAbiolinks：一个用于对TCGA数据进行综合分析的R/Bioconductor软件包。

TCGAbiolinks: an R/Bioconductor package for integrative analysis of TCGA data.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献