Wu Shaoxu, Hong Guibin, Xu Abai, Zeng Hong, Chen Xulin, Wang Yun, Luo Yun, Wu Peng, Liu Cundong, Jiang Ning, Dang Qiang, Yang Cheng, Liu Bohao, Shen Runnan, Chen Zeshi, Liao Chengxiao, Lin Zhen, Wang Jin, Lin Tianxin
Department of Urology, Sun Yat-sen University, Guangzhou, Guangdong, China; Guangdong Provincial Key Laboratory of Malignant Tumour Epigenetics and Gene Regulation, Sun Yat-sen University, Guangzhou, Guangdong, China; Guangdong Provincial Clinical Research Centre for Urological Diseases, Guangzhou, Guangdong, China.
Department of Urology, Sun Yat-sen University, Guangzhou, Guangdong, China.
Lancet Oncol. 2023 Apr;24(4):360-370. doi: 10.1016/S1470-2045(23)00061-X. Epub 2023 Mar 6.
Accurate lymph node staging is important for the diagnosis and treatment of patients with bladder cancer. We aimed to develop a lymph node metastases diagnostic model (LNMDM) on whole slide images and to assess the clinical effect of an artificial intelligence-assisted (AI) workflow.
In this retrospective, multicentre, diagnostic study in China, we included consecutive patients with bladder cancer who had radical cystectomy and pelvic lymph node dissection, and from whom whole slide images of lymph node sections were available, for model development. We excluded patients with non-bladder cancer and concurrent surgery, or low-quality images. Patients from two hospitals (Sun Yat-sen Memorial Hospital of Sun Yat-sen University and Zhujiang Hospital of Southern Medical University, Guangzhou, Guangdong, China) were assigned before a cutoff date to a training set and after the date to internal validation sets for each hospital. Patients from three other hospitals (the Third Affiliated Hospital of Sun Yat-sen University, Nanfang Hospital of Southern Medical University, and the Third Affiliated Hospital of Southern Medical University, Guangzhou, Guangdong, China) were included as external validation sets. A validation subset of challenging cases from the five validation sets was used to compare performance between the LNMDM and pathologists, and two other datasets (breast cancer from the CAMELYON16 dataset and prostate cancer from the Sun Yat-sen Memorial Hospital of Sun Yat-sen University) were collected for a multi-cancer test. The primary endpoint was diagnostic sensitivity in the four prespecified groups (ie, the five validation sets, a single-lymph-node test set, the multi-cancer test set, and the subset for a performance comparison between the LNMDM and pathologists).
Between Jan 1, 2013 and Dec 31, 2021, 1012 patients with bladder cancer had radical cystectomy and pelvic lymph node dissection and were included (8177 images and 20 954 lymph nodes). We excluded 14 patients (165 images) with concurrent non-bladder cancer and also excluded 21 low-quality images. We included 998 patients and 7991 images (881 [88%] men; 117 [12%] women; median age 64 years [IQR 56-72]; ethnicity data not available; 268 [27%] with lymph node metastases) to develop the LNMDM. The area under the curve (AUC) for accurate diagnosis of the LNMDM ranged from 0·978 (95% CI 0·960-0·996) to 0·998 (0·996-1·000) in the five validation sets. Performance comparisons between the LNMDM and pathologists showed that the diagnostic sensitivity of the model (0·983 [95% CI 0·941-0·998]) substantially exceeded that of both junior pathologists (0·906 [0·871-0·934]) and senior pathologists (0·947 [0·919-0·968]), and that AI assistance improved sensitivity for both junior (from 0·906 without AI to 0·953 with AI) and senior (from 0·947 to 0·986) pathologists. In the multi-cancer test, the LNMDM maintained an AUC of 0·943 (95% CI 0·918-0·969) in breast cancer images and 0·922 (0·884-0·960) in prostate cancer images. In 13 patients, the LNMDM detected tumour micrometastases that had been missed by pathologists who had previously classified these patients' results as negative. Receiver operating characteristic curves showed that the LNMDM would enable pathologists to exclude 80-92% of negative slides while maintaining 100% sensitivity in clinical application.
We developed an AI-based diagnostic model that did well in detecting lymph node metastases, particularly micrometastases. The LNMDM showed substantial potential for clinical applications in improving the accuracy and efficiency of pathologists' work.
National Natural Science Foundation of China, the Science and Technology Planning Project of Guangdong Province, the National Key Research and Development Programme of China, and the Guangdong Provincial Clinical Research Centre for Urological Diseases.
准确的淋巴结分期对于膀胱癌患者的诊断和治疗至关重要。我们旨在基于全切片图像开发一种淋巴结转移诊断模型(LNMDM),并评估人工智能辅助(AI)工作流程的临床效果。
在这项中国的回顾性、多中心诊断研究中,我们纳入了连续接受根治性膀胱切除术和盆腔淋巴结清扫术且有淋巴结切片全切片图像的膀胱癌患者,用于模型开发。我们排除了患有非膀胱癌和同期进行其他手术的患者,以及图像质量差的患者。在截止日期之前,将来自两家医院(中山大学孙逸仙纪念医院和南方医科大学珠江医院,中国广东广州)的患者分配到训练集,截止日期之后的患者分配到各医院的内部验证集。来自其他三家医院(中山大学附属第三医院、南方医科大学南方医院和南方医科大学附属第三医院,中国广东广州)的患者作为外部验证集。从五个验证集中选取具有挑战性的病例组成一个验证子集,用于比较LNMDM和病理学家的表现,并收集另外两个数据集(来自CAMELYON16数据集的乳腺癌和中山大学孙逸仙纪念医院的前列腺癌)进行多癌测试。主要终点是四个预先设定组(即五个验证集、单个淋巴结测试集、多癌测试集以及用于比较LNMDM和病理学家表现的子集)中的诊断敏感性。
2013年1月1日至2021年12月31日期间,1012例膀胱癌患者接受了根治性膀胱切除术和盆腔淋巴结清扫术并被纳入研究(8177张图像和20954个淋巴结)。我们排除了14例(165张图像)患有同期非膀胱癌的患者,还排除了21张质量差的图像。我们纳入了998例患者和7991张图像(881例[88%]男性;117例[12%]女性;中位年龄64岁[IQR 56 - 72];种族数据未提供;268例[27%]有淋巴结转移)来开发LNMDM。在五个验证集中,LNMDM准确诊断的曲线下面积(AUC)范围为0.978(95%CI 0.960 - 0.996)至0.998(0.996 - 1.000)。LNMDM与病理学家的表现比较表明,该模型的诊断敏感性(0.983[95%CI 0.941 - 0.998])显著超过初级病理学家(0.906[0.871 - 0.934])和高级病理学家(0.947[0.919 - 0.968]),并且AI辅助提高了初级(从无AI时的0.906提高到有AI时的0.953)和高级(从0.947提高到0.986)病理学家的敏感性。在多癌测试中,LNMDM在乳腺癌图像中的AUC为0.943(95%CI 0.918 - 0.969),在前列腺癌图像中的AUC为0.922(0.884 - 0.960)。在13例患者中,LNMDM检测到了病理学家之前将这些患者结果分类为阴性而遗漏的肿瘤微转移。受试者操作特征曲线表明,LNMDM能够使病理学家在临床应用中排除80 - 92%的阴性切片,同时保持100%的敏感性。
我们开发了一种基于AI的诊断模型,该模型在检测淋巴结转移,尤其是微转移方面表现良好。LNMDM在提高病理学家工作的准确性和效率方面具有巨大的临床应用潜力。
中国国家自然科学基金、广东省科技计划项目、中国国家重点研发计划以及广东省泌尿外科疾病临床研究中心。