Chan Soon-Shiong Institute of Molecular Medicine at Windber, Windber, PA 15963, USA.
Nationwide Children's Hospital, Columbus, OH 43205, USA.
Cell. 2018 Apr 5;173(2):400-416.e11. doi: 10.1016/j.cell.2018.02.052.
For a decade, The Cancer Genome Atlas (TCGA) program collected clinicopathologic annotation data along with multi-platform molecular profiles of more than 11,000 human tumors across 33 different cancer types. TCGA clinical data contain key features representing the democratized nature of the data collection process. To ensure proper use of this large clinical dataset associated with genomic features, we developed a standardized dataset named the TCGA Pan-Cancer Clinical Data Resource (TCGA-CDR), which includes four major clinical outcome endpoints. In addition to detailing major challenges and statistical limitations encountered during the effort of integrating the acquired clinical data, we present a summary that includes endpoint usage recommendations for each cancer type. These TCGA-CDR findings appear to be consistent with cancer genomics studies independent of the TCGA effort and provide opportunities for investigating cancer biology using clinical correlates at an unprecedented scale.
在过去的十年中,癌症基因组图谱(TCGA)计划收集了超过 11000 个人类肿瘤的临床病理注释数据以及 33 种不同癌症类型的多平台分子谱。TCGA 临床数据包含代表数据收集过程民主化性质的关键特征。为了确保正确使用与基因组特征相关的这个大型临床数据集,我们开发了一个名为 TCGA 泛癌临床数据资源(TCGA-CDR)的标准化数据集,其中包括四个主要的临床结局终点。除了详细说明在整合所获得的临床数据过程中遇到的主要挑战和统计限制之外,我们还提供了一份总结,其中包括每种癌症类型的终点使用建议。这些 TCGA-CDR 的发现似乎与独立于 TCGA 工作的癌症基因组学研究一致,并为使用前所未有的规模的临床相关性研究癌症生物学提供了机会。
Sci Rep. 2013-10-2
BMC Med Genomics. 2018-8-8
BMC Bioinformatics. 2025-9-2
Genes (Basel). 2025-8-21
Acta Neuropathol. 2016-5-9
Cell. 2015-11-5
Cell. 2015-6-18
N Engl J Med. 2015-6-25