Charité-Universitätsmedizin Berlin, corporate member of Freie Universität Berlin, Humboldt-Universität zu Berlin and Berlin Institute of Health, Institute of Pathology, Berlin, Germany.
Charité Comprehensive Cancer Center (CCCC), Berlin, Germany.
Mod Pathol. 2019 Jun;32(6):855-865. doi: 10.1038/s41379-019-0207-y. Epub 2019 Feb 5.
Pulmonary enteric adenocarcinoma is a rare non-small cell lung cancer subtype. It is poorly characterized and cannot be distinguished from metastatic colorectal or upper gastrointestinal adenocarcinomas by means of routine pathological methods. As DNA methylation patterns are known to be highly tissue specific, we aimed to develop a methylation-based algorithm to differentiate these entities. To this end, genome-wide methylation profiles of 600 primary pulmonary, colorectal, and upper gastrointestinal adenocarcinomas obtained from The Cancer Genome Atlas and the Gene Expression Omnibus database were used as a reference cohort to train a machine learning algorithm. The resulting classifier correctly classified all samples from a validation cohort consisting of 680 primary pulmonary, colorectal and upper gastrointestinal adenocarcinomas, demonstrating the ability of the algorithm to reliably distinguish these three entities. We then analyzed methylation data of 15 pulmonary enteric adenocarcinomas as well as four pulmonary metastases and four primary colorectal adenocarcinomas with the algorithm. All 15 pulmonary enteric adenocarcinomas were reliably classified as primary pulmonary tumors and all four metastases as well as all four primary colorectal cancer samples were identified as colorectal adenocarcinomas. In a t-distributed stochastic neighbor embedding analysis, the pulmonary enteric adenocarcinoma samples did not form a separate methylation subclass but rather diffusely intermixed with other pulmonary cancers. Additional characterization of the pulmonary enteric adenocarcinoma series using fluorescence in situ hybridization, next-generation sequencing and copy number analysis revealed KRAS mutations in nine of 15 samples (60%) and a high number of structural chromosomal changes. Except for an unusually high rate of chromosome 20 gain (67%), the molecular data was mostly reminiscent of standard pulmonary adenocarcinomas. In conclusion, we provide sound evidence of the pulmonary origin of pulmonary enteric adenocarcinomas and in addition provide a publicly available machine learning-based algorithm to reliably distinguish these tumors from metastatic colorectal cancer.
肺肠型腺癌是一种罕见的非小细胞肺癌亚型。它的特征不明显,通过常规病理方法无法与转移性结直肠或上消化道腺癌区分。由于 DNA 甲基化模式具有高度的组织特异性,我们旨在开发一种基于甲基化的算法来区分这些实体。为此,我们使用来自癌症基因组图谱和基因表达综合数据库的 600 例原发性肺、结直肠和上消化道腺癌的全基因组甲基化谱作为参考队列来训练机器学习算法。该分类器正确分类了由 680 例原发性肺、结直肠和上消化道腺癌组成的验证队列中的所有样本,证明了该算法可靠地区分这三种实体的能力。然后,我们使用该算法分析了 15 例肺肠型腺癌以及 4 例肺转移和 4 例原发性结直肠腺癌的甲基化数据。所有 15 例肺肠型腺癌均可靠地被分类为原发性肺肿瘤,而所有 4 例转移和所有 4 例原发性结直肠腺癌样本均被鉴定为结直肠腺癌。在 t 分布随机邻域嵌入分析中,肺肠型腺癌样本没有形成单独的甲基化亚类,而是与其他肺癌样本混合在一起。使用荧光原位杂交、下一代测序和拷贝数分析对肺肠型腺癌系列进行进一步的特征描述,发现 15 例样本中有 9 例(60%)存在 KRAS 突变和大量结构性染色体改变。除了染色体 20 增益异常高(67%)外,分子数据大多与标准肺腺癌相似。总之,我们提供了肺肠型腺癌来源于肺的有力证据,并提供了一种基于机器学习的算法,可用于可靠地区分这些肿瘤与转移性结直肠癌。