Lu Liangqun, Daigle Bernie J
Departments of Biological Sciences and Computer Science, The University of Memphis, Memphis, TN, USA.
PeerJ. 2020 Mar 12;8:e8668. doi: 10.7717/peerj.8668. eCollection 2020.
Histopathological images contain rich phenotypic descriptions of the molecular processes underlying disease progression. Convolutional neural networks, state-of-the-art image analysis techniques in computer vision, automatically learn representative features from such images which can be useful for disease diagnosis, prognosis, and subtyping. Hepatocellular carcinoma (HCC) is the sixth most common type of primary liver malignancy. Despite the high mortality rate of HCC, little previous work has made use of CNN models to explore the use of histopathological images for prognosis and clinical survival prediction of HCC. We applied three pre-trained CNN models-VGG 16, Inception V3 and ResNet 50-to extract features from HCC histopathological images. Sample visualization and classification analyses based on these features showed a very clear separation between cancer and normal samples. In a univariate Cox regression analysis, 21.4% and 16% of image features on average were significantly associated with overall survival (OS) and disease-free survival (DFS), respectively. We also observed significant correlations between these features and integrated biological pathways derived from gene expression and copy number variation. Using an elastic net regularized Cox Proportional Hazards model of OS constructed from Inception image features, we obtained a concordance index (C-index) of 0.789 and a significant log-rank test ( = 7.6E-18). We also performed unsupervised classification to identify HCC subgroups from image features. The optimal two subgroups discovered using Inception model image features showed significant differences in both overall (C-index = 0.628 and = 7.39E-07) and DFS (C-index = 0.558 and = 0.012). Our work demonstrates the utility of extracting image features using pre-trained models by using them to build accurate prognostic models of HCC as well as highlight significant correlations between these features, clinical survival, and relevant biological pathways. Image features extracted from HCC histopathological images using the pre-trained CNN models VGG 16, Inception V3 and ResNet 50 can accurately distinguish normal and cancer samples. Furthermore, these image features are significantly correlated with survival and relevant biological pathways.
组织病理学图像包含了有关疾病进展潜在分子过程的丰富表型描述。卷积神经网络作为计算机视觉中最先进的图像分析技术,能自动从此类图像中学习具有代表性的特征,这些特征对疾病诊断、预后评估和亚型分类很有用。肝细胞癌(HCC)是原发性肝脏恶性肿瘤中第六常见的类型。尽管HCC死亡率很高,但此前很少有研究利用卷积神经网络模型来探索组织病理学图像在HCC预后和临床生存预测中的应用。我们应用了三种预训练的卷积神经网络模型——VGG 16、Inception V3和ResNet 50——从HCC组织病理学图像中提取特征。基于这些特征的样本可视化和分类分析显示,癌症样本和正常样本之间有非常明显的区分。在单变量Cox回归分析中,平均分别有21.4%和16%的图像特征与总生存期(OS)和无病生存期(DFS)显著相关。我们还观察到这些特征与源自基因表达和拷贝数变异的综合生物学途径之间存在显著相关性。使用由Inception图像特征构建的OS弹性网络正则化Cox比例风险模型,我们获得了0.789的一致性指数(C指数)和显著的对数秩检验( = 7.6E - 18)。我们还进行了无监督分类,以从图像特征中识别HCC亚组。使用Inception模型图像特征发现的最佳两个亚组在总生存期(C指数 = 0.628和 = 7.39E - 07)和DFS(C指数 = 0.558和 = 0.012)方面均显示出显著差异。我们的工作证明了使用预训练模型提取图像特征的实用性,通过这些特征构建准确的HCC预后模型,并突出了这些特征、临床生存和相关生物学途径之间的显著相关性。使用预训练的卷积神经网络模型VGG 16、Inception V3和ResNet 50从HCC组织病理学图像中提取的图像特征可以准确区分正常样本和癌症样本。此外,这些图像特征与生存和相关生物学途径显著相关。