Horbas Hannah, Bauer Marcus, Eckert Alexander, Bethmann Daniel, Wilfer Andreas, Seliger Barbara, Wickenhauser Claudia
Institute of Pathology, University Hospital Halle, Martin-Luther-University Halle-Wittenberg, Halle (Saale), Germany.
Clinic for Oral and Maxillofacial Surgery, University Hospital of the Paracelsus Medical Private University, Nürnberg, Germany.
J Histochem Cytochem. 2025 May 15:221554251335698. doi: 10.1369/00221554251335698.
Gold standard for immunohistochemical analyses is the manual assessment by two specialist pathologists. This process is time-consuming, highly dependent on the respective evaluator and often difficult to reproduce. The use of image analysis software, such as ImageJ, QuPath, or CellProfiler, which employ machine learning and/or deep learning mechanisms to perform biomarker analyses, offers a potential solution to these problems. The objective of our study is to evaluate whether digital assessment using the open-source software QuPath is comparable to manual evaluation and to examine the inter-evaluator variability between the two manual evaluators and two software-based evaluations. Six tissue microarrays (TMAs) were constructed for a cohort of 309 patients with primary oral squamous cell carcinoma (OSCC). The tumor tissue and corresponding non-lesional squamous epithelial mucosa specimen were immunohistochemically stained for the biomarkers Ki67, as a nuclear marker; the epidermal growth factor receptor (EGF-R), as a membranous marker; and the major histocompatibility complex class I (MHC-I) heavy chain (HC) expressed on the membrane and in the cytoplasm. The staining pattern was analyzed by two experienced, independent manual evaluators and by QuPath. The percentage of positive cells, for Ki67, and the histoscore (H-score) based on the percentage of positive cells and their staining intensity, for EGF-R and MHC-I, were determined as final values. The results yielded high to excellent spearman correlation coefficients for all three biomarkers (<0.001) in lesional and non-lesional tissues. The Bland-Altman plots demonstrated a high degree of agreement between manual and software-based analysis, as well as inter-evaluator variability demonstrating a high comparability of the evaluation methods. However, a prerequisite for a proper software-based analysis is an accurate, time-consuming annotation of the single specimen, which requires users with a comprehensive understanding of histology and extensive training in QuPath. Once these requirements are met, the software-based analysis offers advantages for large-scale biomarker studies due to objective and reproducible comparability of the stainings leading to a greater accuracy as well as the reuse of established conditions across similar analyses without requiring further operator input.
免疫组织化学分析的金标准是由两名专业病理学家进行手动评估。这个过程耗时、高度依赖各自的评估者且往往难以重复。使用图像分析软件,如ImageJ、QuPath或CellProfiler,它们采用机器学习和/或深度学习机制来进行生物标志物分析,为这些问题提供了一个潜在的解决方案。我们研究的目的是评估使用开源软件QuPath进行数字评估是否与手动评估相当,并检验两名手动评估者之间以及两种基于软件的评估之间的评估者间变异性。为一组309例原发性口腔鳞状细胞癌(OSCC)患者构建了六个组织微阵列(TMA)。对肿瘤组织和相应的非病变鳞状上皮黏膜标本进行免疫组织化学染色,检测生物标志物:作为核标志物的Ki67;作为膜标志物的表皮生长因子受体(EGF-R);以及在膜和细胞质中表达的主要组织相容性复合体I类(MHC-I)重链(HC)。由两名经验丰富、独立的手动评估者和QuPath对染色模式进行分析。将Ki67的阳性细胞百分比以及基于EGF-R和MHC-I的阳性细胞百分比及其染色强度的组织学评分(H-score)确定为最终值。结果显示,在病变和非病变组织中,所有三种生物标志物的斯皮尔曼相关系数都很高至极好(<0.001)。布兰德-奥特曼图显示手动分析和基于软件的分析之间高度一致,以及评估者间变异性表明评估方法具有高度可比性。然而,基于软件的正确分析的一个先决条件是对单个标本进行准确、耗时的注释,这需要对组织学有全面理解并在QuPath方面接受广泛培训的用户。一旦满足这些要求,基于软件的分析由于染色的客观和可重复可比性而具有优势,从而提高准确性,并且在类似分析中可以重复使用既定条件而无需进一步的操作员输入,这对于大规模生物标志物研究很有好处。