Artificial Intelligence & Computer Vision Lab, Department of Computer Science, Cochin University of Science and Technology, Kochi, 682022, Kerala, India.
Artificial Intelligence & Computer Vision Lab, Department of Computer Science, Cochin University of Science and Technology, Kochi, 682022, Kerala, India.
Artif Intell Med. 2020 Mar;103:101805. doi: 10.1016/j.artmed.2020.101805. Epub 2020 Jan 25.
Breast cancer is the most prevalent invasive type of cancer among women. The mortality rate of the disease can be reduced considerably through timely prognosis and felicitous treatment planning, by utilizing the computer aided detection and diagnosis techniques. With the advent of whole slide image (WSI) scanners for digitizing the histopathological tissue samples, there is a drastic increase in the availability of digital histopathological images. However, these samples are often unlabeled and hence they need labeling to be done through manual annotations by domain experts and experienced pathologists. But this annotation process required for acquiring high quality large labeled training set for nuclear atypia scoring is a tedious, expensive and time consuming job. Active learning techniques have achieved widespread acceptance in reducing this human effort in annotating the data samples. In this paper, we explore the possibilities of active learning on nuclear pleomorphism scoring over a non-Euclidean framework, the Riemannian manifold. Active learning technique adopted for the cancer grading is in the batch-mode framework, that adaptively identifies the apt batch size along with the batch of instances to be queried, following a submodular optimization framework. Samples for annotation are selected considering the diversity and redundancy between the pair of samples, based on the kernelized Riemannian distance measures such as log-Euclidean metrics and the two Bregman divergences - Stein and Jeffrey divergences. Results of the adaptive Batch Mode Active Learning on the Riemannian metric show a superior performance when compared with the state-of-the-art techniques for breast cancer nuclear pleomorphism scoring, as it makes use of the information from the unlabeled samples.
乳腺癌是女性中最常见的侵袭性癌症类型。通过利用计算机辅助检测和诊断技术,可以及时进行预后和制定恰当的治疗计划,大大降低该疾病的死亡率。随着用于对组织病理样本进行数字化的全玻片图像 (WSI) 扫描仪的出现,数字病理图像的可用性大大增加。然而,这些样本通常是未标记的,因此需要通过领域专家和经验丰富的病理学家进行手动注释来进行标记。但是,对于核异型性评分来说,获取高质量的大型标记训练集所需的这种注释过程既繁琐、昂贵又耗时。主动学习技术已被广泛接受,可用于减少数据样本注释的人力投入。在本文中,我们探讨了在非欧几里得框架——黎曼流形上进行核异型性评分的主动学习的可能性。所采用的用于癌症分级的主动学习技术是在批处理模式框架中,根据子模优化框架,自适应地确定合适的批处理大小以及要查询的实例批处理。基于核黎曼距离度量(如对数欧几里得度量和两种 Bregman 散度——Stein 散度和 Jeffrey 散度),根据样本对之间的多样性和冗余性选择用于注释的样本。与乳腺癌核异型性评分的最新技术相比,在黎曼度量上的自适应批量主动学习的结果表现出更好的性能,因为它利用了未标记样本的信息。