Shen Zhuoyan, Simard Mikaël, Brand Douglas, Andrei Vanghelita, Al-Khader Ali, Oumlil Fatine, Trevers Katherine, Butters Thomas, Haefliger Simon, Kara Eleanna, Amary Fernanda, Tirabosco Roberto, Cool Paul, Royle Gary, Hawkins Maria A, Flanagan Adrienne M, Collins-Fekete Charles-Antoine
Department of Medical Physics and Biomedical Engineering, University College London, London, UK.
Department of Radiotherapy, University College London Hospitals NHS Foundation Trust, London, UK.
Commun Biol. 2024 Dec 19;7(1):1674. doi: 10.1038/s42003-024-07398-6.
Mitotic activity is an important feature for grading several cancer types. However, counting mitotic figures (cells in division) is a time-consuming and laborious task prone to inter-observer variation. Inaccurate recognition of MFs can lead to incorrect grading and hence potential suboptimal treatment. This study presents an artificial intelligence-based approach to detect mitotic figures in digitised whole-slide images stained with haematoxylin and eosin. Advances in this area are hampered by the small size and variety of datasets available. To address this, we create the largest dataset of mitotic figures (N = 74,620), combining an in-house dataset of soft tissue tumours with five open-source datasets. We then employ a two-stage framework, named the Optimised Mitoses Generator Network (OMG-Net), to identify mitotic figures. This framework first deploys the Segment Anything Model to automatically outline cells, followed by an adapted ResNet18 that distinguishes mitotic figures. OMG-Net achieves an F1 score of 0.84 in detecting pan-cancer mitotic figures, including human breast carcinoma, neuroendocrine tumours, and melanoma. It outperforms previous state-of-the-art models in hold-out test sets. To summarise, our study introduces a generalisable data creation and curation pipeline and a high-performance detection model, which can largely contribute to the field of computer-aided mitotic figure detection.
有丝分裂活性是几种癌症类型分级的重要特征。然而,计数有丝分裂象(处于分裂期的细胞)是一项耗时费力的任务,容易出现观察者间的差异。对有丝分裂象的错误识别会导致分级错误,从而可能导致治疗欠佳。本研究提出了一种基于人工智能的方法,用于在苏木精和伊红染色的数字化全切片图像中检测有丝分裂象。该领域的进展受到可用数据集规模小和种类少的阻碍。为了解决这个问题,我们创建了最大的有丝分裂象数据集(N = 74,620),将软组织肿瘤的内部数据集与五个开源数据集相结合。然后,我们采用了一个名为优化有丝分裂生成器网络(OMG-Net)的两阶段框架来识别有丝分裂象。该框架首先部署“分割一切模型”自动勾勒细胞轮廓,随后使用一个经过改编的ResNet18来区分有丝分裂象。OMG-Net在检测泛癌有丝分裂象(包括人类乳腺癌、神经内分泌肿瘤和黑色素瘤)时,F1分数达到0.84。在留出测试集中,它优于先前的最先进模型。总之,我们的研究引入了一个可推广的数据创建和管理流程以及一个高性能检测模型,这可以极大地推动计算机辅助有丝分裂象检测领域的发展。