Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, USA.
Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, USA.
Genome Biol. 2018 Sep 25;19(1):142. doi: 10.1186/s13059-018-1511-4.
Previous approaches to defining subtypes of colorectal carcinoma (CRC) and other cancers based on transcriptomes have assumed the existence of discrete subtypes. We analyze gene expression patterns of colorectal tumors from a large number of patients to test this assumption and propose an approach to identify potentially a continuum of subtypes that are present across independent studies and cohorts.
We examine the assumption of discrete CRC subtypes by integrating 18 published gene expression datasets and > 3700 patients, and contrary to previous reports, find no evidence to support the existence of discrete transcriptional subtypes. Using a meta-analysis approach to identify co-expression patterns present in multiple datasets, we identify and define robust, continuously varying subtype scores to represent CRC transcriptomes. The subtype scores are consistent with established subtypes (including microsatellite instability and previously proposed discrete transcriptome subtypes), but better represent overall transcriptional activity than do discrete subtypes. The scores are also better predictors of tumor location, stage, grade, and times of disease-free survival than discrete subtypes. Gene set enrichment analysis reveals that the subtype scores characterize T-cell function, inflammation response, and cyclin-dependent kinase regulation of DNA replication.
We find no evidence to support discrete subtypes of the CRC transcriptome and instead propose two validated scores to better characterize a continuity of CRC transcriptomes.
以前基于转录组定义结直肠癌(CRC)和其他癌症亚型的方法假设存在离散的亚型。我们分析了大量患者的结直肠肿瘤的基因表达模式,以检验这一假设,并提出了一种方法来识别可能存在于独立研究和队列中的连续亚型。
我们通过整合 18 个已发表的基因表达数据集和 >3700 名患者,检验了 CRC 亚型的离散假设,与之前的报告相反,没有证据支持存在离散转录亚型。我们使用荟萃分析方法来识别多个数据集中存在的共表达模式,从而确定并定义稳健的、连续变化的亚型分数来代表 CRC 的转录组。与已建立的亚型(包括微卫星不稳定性和以前提出的离散转录组亚型)一致,但比离散亚型更好地代表整体转录活性。与离散亚型相比,评分也更好地预测了肿瘤位置、分期、分级和无病生存时间。基因集富集分析显示,亚型评分描述了 T 细胞功能、炎症反应和细胞周期蛋白依赖性激酶对 DNA 复制的调控。
我们没有发现支持 CRC 转录组离散亚型的证据,而是提出了两个经过验证的评分,以更好地描述 CRC 转录组的连续性。