Ko Soo Min, Shin Jae-Ik, Hong Yiyu, Kim Hyunji, Sohn Insuk, Lee Ji-Young, Han Hyo-Jeong, Jeong Da Som, Lee Yerin, Son Woo-Chan
Department of Medical Science, AMIST, Asan Medical Center, University of Ulsan College of Medicine, Seoul, Republic of Korea.
Department of R&D Center, Arontier Co., Ltd., Seoul, Republic of Korea.
Front Med (Lausanne). 2025 Jul 4;12:1629036. doi: 10.3389/fmed.2025.1629036. eCollection 2025.
Metabolic dysfunction-associated steatohepatitis (MASH) is a significant liver disease that can lead to cirrhosis and liver cancer. Accurate assessment of liver fibrosis is crucial for diagnosis, prognosis, and informed treatment decision-making. Staging of liver fibrosis in MASH is based on Kleiner's score, which categorizes fibrosis based on its location within the liver as observed microscopically. This scoring system is part of a standard clinical research network and relies heavily on the expertise of pathologists.
This study utilized Sirius Red-stained whole slide images of liver tissue obtained from various MASH animal models to develop deep learning (DL) models for scoring liver fibrosis, with a focus on the criteria outlined in Kleiner's score. We created a trainable and testable dataset of whole-slide images of the liver, consisting of 999,711 patch images derived from 914 whole-slide images. The performance of the multi-class classification model was evaluated using the kappa statistic, area under the precision-recall curve (AUPRC), area under the receiver operating characteristic curve (AUROC), and Matthews correlation coefficient (MCC).
To address challenges in clinical subclassification, a 5-class classification model was initially applied; the model achieved moderate agreement. A more refined 7-class model was subsequently developed, which outperformed the 5-class classification model. The enhanced subclassification significantly improved classification performance, as evidenced by the superior AUROC and AUPRC values of the 7-class model.
This study highlights that DL models for scoring liver fibrosis can support expert pathologists in staging liver fibrosis in preclinical animal studies.
代谢功能障碍相关脂肪性肝炎(MASH)是一种严重的肝脏疾病,可导致肝硬化和肝癌。准确评估肝纤维化对于诊断、预后以及明智的治疗决策至关重要。MASH中肝纤维化的分期基于克莱纳评分,该评分根据显微镜下观察到的纤维化在肝脏内的位置对其进行分类。该评分系统是标准临床研究网络的一部分,严重依赖病理学家的专业知识。
本研究利用从各种MASH动物模型获得的天狼星红染色肝脏组织全切片图像,开发用于肝纤维化评分的深度学习(DL)模型,重点关注克莱纳评分中概述的标准。我们创建了一个可训练和可测试的肝脏全切片图像数据集,由来自914张全切片图像的999,711个贴片图像组成。使用kappa统计量、精确召回曲线下面积(AUPRC)、受试者操作特征曲线下面积(AUROC)和马修斯相关系数(MCC)评估多类分类模型的性能。
为应对临床亚分类中的挑战,最初应用了一个5类分类模型;该模型达成了中等一致性。随后开发了一个更精细的7类模型,其表现优于5类分类模型。如7类模型更高的AUROC和AUPRC值所示,增强的亚分类显著提高了分类性能。
本研究强调,用于肝纤维化评分的DL模型可在临床前动物研究中支持专家病理学家对肝纤维化进行分期。