Yu Wei-Hsiang, Li Chih-Hao, Wang Ren-Ching, Yeh Chao-Yuan, Chuang Shih-Sung
aetherAI, Co., Ltd., Taipei 115, Taiwan.
Department of Pathology, Chang Gung Memorial Hospital, Kaohsiung 83301, Taiwan.
Cancers (Basel). 2021 Oct 30;13(21):5463. doi: 10.3390/cancers13215463.
The aim of this study was to investigate the feasibility of using machine learning techniques based on morphological features in classifying two subtypes of primary intestinal T-cell lymphomas (PITLs) defined according to the WHO criteria: monomorphic epitheliotropic intestinal T-cell lymphoma (MEITL) versus intestinal T-cell lymphoma, not otherwise specified (ITCL-NOS), which is considered a major challenge for pathological diagnosis. A total of 40 histopathological whole-slide images (WSIs) from 40 surgically resected PITL cases were used as the dataset for model training and testing. A deep neural network was trained to detect and segment the nuclei of lymphocytes. Quantitative nuclear morphometrics were further computed from these predicted contours. A decision-tree-based machine learning algorithm, XGBoost, was then trained to classify PITL cases into two disease subtypes using these nuclear morphometric features. The deep neural network achieved an average precision of 0.881 in the cell segmentation work. In terms of classifying MEITL versus ITCL-NOS, the XGBoost model achieved an area under receiver operating characteristic curve (AUC) of 0.966. Our research demonstrated an accurate, human-interpretable approach to using machine learning algorithms for reducing the high dimensionality of image features and classifying T cell lymphomas that present challenges in morphologic diagnosis. The quantitative nuclear morphometric features may lead to further discoveries concerning the relationship between cellular phenotype and disease status.
本研究的目的是探讨基于形态学特征的机器学习技术在对根据世界卫生组织标准定义的原发性肠道T细胞淋巴瘤(PITL)的两种亚型进行分类中的可行性:单形性亲上皮性肠道T细胞淋巴瘤(MEITL)与未另行指定的肠道T细胞淋巴瘤(ITCL-NOS),这被认为是病理诊断的一项重大挑战。来自40例手术切除的PITL病例的40张组织病理学全切片图像(WSI)被用作模型训练和测试的数据集。训练了一个深度神经网络来检测和分割淋巴细胞的细胞核。从这些预测轮廓中进一步计算定量核形态计量学。然后使用这些核形态计量学特征,训练基于决策树的机器学习算法XGBoost,将PITL病例分类为两种疾病亚型。深度神经网络在细胞分割工作中实现了0.881的平均精度。在对MEITL与ITCL-NOS进行分类方面,XGBoost模型在受试者工作特征曲线(AUC)下的面积为0.966。我们的研究展示了一种准确的、可人工解释的方法,即使用机器学习算法来降低图像特征的高维度,并对在形态学诊断中存在挑战的T细胞淋巴瘤进行分类。定量核形态计量学特征可能会带来关于细胞表型与疾病状态之间关系的进一步发现。