Moyes Andrew, Gault Richard, Zhang Kun, Ming Ji, Crookes Danny, Wang Jing
School of Electronics, Electrical Engineering and Computer Science, Queen's University, Belfast, 18 Malone Road, BT9 6RT, UK.
School of Electrical Engineering, Nantong University, Nantong, China.
Med Image Anal. 2023 Jan;83:102640. doi: 10.1016/j.media.2022.102640. Epub 2022 Sep 27.
Domain shift is a problem commonly encountered when developing automated histopathology pipelines. The performance of machine learning models such as convolutional neural networks within automated histopathology pipelines is often diminished when applying them to novel data domains due to factors arising from differing staining and scanning protocols. The Dual-Channel Auto-Encoder (DCAE) model was previously shown to produce feature representations that are less sensitive to appearance variation introduced by different digital slide scanners. In this work, the Multi-Channel Auto-Encoder (MCAE) model is presented as an extension to DCAE which learns from more than two domains of data. Experimental results show that the MCAE model produces feature representations that are less sensitive to inter-domain variations than the comparative StaNoSA method when tested on a novel synthetic dataset. This was apparent when applying the MCAE, DCAE, and StaNoSA models to three different classification tasks from unseen domains. The results of this experiment show the MCAE model out performs the other models. These results show that the MCAE model is able to generalise better to novel data, including data from unseen domains, than existing approaches by actively learning normalised feature representations.
域转移是开发自动化组织病理学流水线时常见的问题。由于不同的染色和扫描协议所产生的因素,当将诸如卷积神经网络等机器学习模型应用于新的数据域时,自动化组织病理学流水线中的模型性能往往会下降。双通道自动编码器(DCAE)模型先前已被证明能够生成对不同数字载玻片扫描仪引入的外观变化不太敏感的特征表示。在这项工作中,多通道自动编码器(MCAE)模型作为DCAE的扩展被提出,它从两个以上的数据域进行学习。实验结果表明,在一个新的合成数据集上进行测试时,MCAE模型生成的特征表示比对比的StaNoSA方法对域间变化更不敏感。当将MCAE、DCAE和StaNoSA模型应用于来自未知域的三个不同分类任务时,这一点很明显。该实验结果表明MCAE模型的性能优于其他模型。这些结果表明,与现有方法相比,MCAE模型能够通过主动学习归一化特征表示,更好地推广到新数据,包括来自未知域的数据。