• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

评估一种基于人工智能的宫颈癌筛查视觉检测方法的可推广性。

Assessing generalizability of an AI-based visual test for cervical cancer screening.

作者信息

Ahmed Syed Rakin, Egemen Didem, Befano Brian, Rodriguez Ana Cecilia, Jeronimo Jose, Desai Kanan, Teran Carolina, Alfaro Karla, Fokom-Domgue Joel, Charoenkwan Kittipat, Mungo Chemtai, Luckett Rebecca, Saidu Rakiya, Raiol Taina, Ribeiro Ana, Gage Julia C, de Sanjose Silvia, Kalpathy-Cramer Jayashree, Schiffman Mark

机构信息

Athinoula A. Martinos Center for Biomedical Imaging, Department of Radiology, Massachusetts General Hospital, Boston, Massachusetts, United States of America.

Harvard Graduate Program in Biophysics, Harvard Medical School, Harvard University, Cambridge, Massachusetts, United States of America.

出版信息

PLOS Digit Health. 2024 Oct 2;3(10):e0000364. doi: 10.1371/journal.pdig.0000364. eCollection 2024 Oct.

DOI:10.1371/journal.pdig.0000364
PMID:39356713
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11446437/
Abstract

A number of challenges hinder artificial intelligence (AI) models from effective clinical translation. Foremost among these challenges is the lack of generalizability, which is defined as the ability of a model to perform well on datasets that have different characteristics from the training data. We recently investigated the development of an AI pipeline on digital images of the cervix, utilizing a multi-heterogeneous dataset of 9,462 women (17,013 images) and a multi-stage model selection and optimization approach, to generate a diagnostic classifier able to classify images of the cervix into "normal", "indeterminate" and "precancer/cancer" (denoted as "precancer+") categories. In this work, we investigate the performance of this multiclass classifier on external data not utilized in training and internal validation, to assess the generalizability of the classifier when moving to new settings. We assessed both the classification performance and repeatability of our classifier model across the two axes of heterogeneity present in our dataset: image capture device and geography, utilizing both out-of-the-box inference and retraining with external data. Our results demonstrate that device-level heterogeneity affects our model performance more than geography-level heterogeneity. Classification performance of our model is strong on images from a new geography without retraining, while incremental retraining with inclusion of images from a new device progressively improves classification performance on that device up to a point of saturation. Repeatability of our model is relatively unaffected by data heterogeneity and remains strong throughout. Our work supports the need for optimized retraining approaches that address data heterogeneity (e.g., when moving to a new device) to facilitate effective use of AI models in new settings.

摘要

一些挑战阻碍了人工智能(AI)模型在临床中的有效应用。这些挑战中最主要的是缺乏通用性,通用性被定义为模型在与训练数据具有不同特征的数据集上良好运行的能力。我们最近研究了一种针对子宫颈数字图像的AI流程开发,利用了一个包含9462名女性(17013张图像)的多异构数据集以及一种多阶段模型选择和优化方法,以生成一个能够将子宫颈图像分类为“正常”、“不确定”和“癌前病变/癌症”(表示为“癌前病变+”)类别的诊断分类器。在这项工作中,我们研究了这个多类分类器在训练和内部验证中未使用的外部数据上的性能,以评估该分类器在应用于新环境时的通用性。我们通过开箱即用的推理以及使用外部数据进行再训练,评估了分类器模型在数据集存在的两个异构轴(图像采集设备和地理位置)上的分类性能和可重复性。我们的结果表明,设备级异构性比地理级异构性对我们模型性能的影响更大。在不进行再训练的情况下,我们的模型在来自新地理位置的图像上分类性能很强,而通过纳入来自新设备的图像进行增量再训练,在该设备上的分类性能会逐步提高,直至达到饱和点。我们模型的可重复性相对不受数据异构性的影响,并且始终保持很强。我们的工作支持需要优化再训练方法来解决数据异构性(例如,当迁移到新设备时),以促进在新环境中有效使用AI模型。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/35fb/11446437/6d98488065db/pdig.0000364.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/35fb/11446437/734039b5e7e3/pdig.0000364.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/35fb/11446437/27e591f4d4b7/pdig.0000364.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/35fb/11446437/f3d091c22097/pdig.0000364.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/35fb/11446437/6d98488065db/pdig.0000364.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/35fb/11446437/734039b5e7e3/pdig.0000364.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/35fb/11446437/27e591f4d4b7/pdig.0000364.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/35fb/11446437/f3d091c22097/pdig.0000364.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/35fb/11446437/6d98488065db/pdig.0000364.g004.jpg

相似文献

1
Assessing generalizability of an AI-based visual test for cervical cancer screening.评估一种基于人工智能的宫颈癌筛查视觉检测方法的可推广性。
PLOS Digit Health. 2024 Oct 2;3(10):e0000364. doi: 10.1371/journal.pdig.0000364. eCollection 2024 Oct.
2
Generalizable deep neural networks for image quality classification of cervical images.用于宫颈图像质量分类的通用深度神经网络。
Sci Rep. 2025 Feb 21;15(1):6312. doi: 10.1038/s41598-025-90024-0.
3
Generative artificial intelligence to produce high-fidelity blastocyst-stage embryo images.生成式人工智能生成高保真囊胚期胚胎图像。
Hum Reprod. 2024 Jun 3;39(6):1197-1207. doi: 10.1093/humrep/deae064.
4
Role of sureness in evaluating AI/CADx: Lesion-based repeatability of machine learning classification performance on breast MRI.Surety 在评估 AI/CADx 中的作用:基于病灶的机器学习分类性能在乳腺 MRI 上的重复性。
Med Phys. 2024 Mar;51(3):1812-1821. doi: 10.1002/mp.16673. Epub 2023 Aug 21.
5
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
6
Validation of artificial intelligence prediction models for skin cancer diagnosis using dermoscopy images: the 2019 International Skin Imaging Collaboration Grand Challenge.基于皮肤镜图像的皮肤癌诊断人工智能预测模型验证:2019 年国际皮肤成像协作挑战赛。
Lancet Digit Health. 2022 May;4(5):e330-e339. doi: 10.1016/S2589-7500(22)00021-8.
7
Development and validation of a deep learning pipeline to diagnose ovarian masses using ultrasound screening: a retrospective multicenter study.用于超声筛查诊断卵巢肿块的深度学习流程的开发与验证:一项回顾性多中心研究
EClinicalMedicine. 2024 Nov 19;78:102923. doi: 10.1016/j.eclinm.2024.102923. eCollection 2024 Dec.
8
Validation in Zambia of a cervical screening strategy including HPV genotyping and artificial intelligence (AI)-based automated visual evaluation.赞比亚对一项包括人乳头瘤病毒(HPV)基因分型和基于人工智能(AI)的自动视觉评估的宫颈癌筛查策略的验证。
Infect Agent Cancer. 2023 Oct 16;18(1):61. doi: 10.1186/s13027-023-00536-5.
9
Reproducible and clinically translatable deep neural networks for cervical screening.可重现且可临床转化的用于宫颈癌筛查的深度神经网络。
Sci Rep. 2023 Dec 8;13(1):21772. doi: 10.1038/s41598-023-48721-1.
10
AI-Based Identification Method for Cervical Transformation Zone Within Digital Colposcopy: Development and Multicenter Validation Study.基于人工智能的数字化阴道镜检查中宫颈转化区识别方法:开发与多中心验证研究
JMIR Cancer. 2025 Mar 31;11:e69672. doi: 10.2196/69672.

引用本文的文献

1
Generalizable deep neural networks for image quality classification of cervical images.用于宫颈图像质量分类的通用深度神经网络。
Sci Rep. 2025 Feb 21;15(1):6312. doi: 10.1038/s41598-025-90024-0.
2
Utility of colposcopy for the screening and management of cervical cancer in Africa: a cross-sectional analysis of providers' training and practices.阴道镜检查在非洲宫颈癌筛查与管理中的应用:提供者培训与实践的横断面分析
BMC Health Serv Res. 2024 Dec 18;24(1):1619. doi: 10.1186/s12913-024-11982-1.
3
The Future of Cervical Cancer Screening.

本文引用的文献

1
Reproducible and clinically translatable deep neural networks for cervical screening.可重现且可临床转化的用于宫颈癌筛查的深度神经网络。
Sci Rep. 2023 Dec 8;13(1):21772. doi: 10.1038/s41598-023-48721-1.
2
Artificial intelligence-based image analysis in clinical testing: lessons from cervical cancer screening.基于人工智能的图像分析在临床检测中的应用:宫颈癌筛查的经验教训。
J Natl Cancer Inst. 2024 Jan 10;116(1):26-33. doi: 10.1093/jnci/djad202.
3
Inconsistent Partitioning and Unproductive Feature Associations Yield Idealized Radiomic Models.
宫颈癌筛查的未来。
Int J Womens Health. 2024 Oct 23;16:1715-1731. doi: 10.2147/IJWH.S474571. eCollection 2024.
4
Design of the HPV-automated visual evaluation (PAVE) study: Validating a novel cervical screening strategy.HPV 自动化视觉评估(PAVE)研究设计:验证一种新的宫颈癌筛查策略。
Elife. 2024 Jan 15;12:RP91469. doi: 10.7554/eLife.91469.
5
Validation in Zambia of a cervical screening strategy including HPV genotyping and artificial intelligence (AI)-based automated visual evaluation.赞比亚对一项包括人乳头瘤病毒(HPV)基因分型和基于人工智能(AI)的自动视觉评估的宫颈癌筛查策略的验证。
Infect Agent Cancer. 2023 Oct 16;18(1):61. doi: 10.1186/s13027-023-00536-5.
不一致的分区和非生产性特征关联产生理想化的放射组学模型。
Radiology. 2023 Apr;307(1):e220715. doi: 10.1148/radiol.220715. Epub 2022 Dec 20.
4
Improving the repeatability of deep learning models with Monte Carlo dropout.利用蒙特卡洛随机失活提高深度学习模型的可重复性。
NPJ Digit Med. 2022 Nov 18;5(1):174. doi: 10.1038/s41746-022-00709-3.
5
Global Cancer Statistics 2020: GLOBOCAN Estimates of Incidence and Mortality Worldwide for 36 Cancers in 185 Countries.《全球癌症统计数据 2020:全球 185 个国家和地区 36 种癌症的发病率和死亡率估计》。
CA Cancer J Clin. 2021 May;71(3):209-249. doi: 10.3322/caac.21660. Epub 2021 Feb 4.
6
Detection of high-risk human papillomavirus (HPV) by the novel AmpFire isothermal HPV assay among pregnant women in Pemba Island, Tanzania.新型 AmpFire 等温 HPV 检测法在坦桑尼亚奔巴岛孕妇中检测高危型人乳头瘤病毒(HPV)。
Pan Afr Med J. 2020 Oct 27;37:183. doi: 10.11604/pamj.2020.37.183.23367. eCollection 2020.
7
Accuracy and Efficiency of Deep-Learning-Based Automation of Dual Stain Cytology in Cervical Cancer Screening.深度学习自动化双重染色细胞学在宫颈癌筛查中的准确性和效率。
J Natl Cancer Inst. 2021 Jan 4;113(1):72-79. doi: 10.1093/jnci/djaa066.
8
High-performance medicine: the convergence of human and artificial intelligence.高性能医学:人机智能融合。
Nat Med. 2019 Jan;25(1):44-56. doi: 10.1038/s41591-018-0300-7. Epub 2019 Jan 7.
9
Cardiologist-level arrhythmia detection and classification in ambulatory electrocardiograms using a deep neural network.使用深度神经网络在动态心电图中进行心脏病学家级别的心律失常检测和分类。
Nat Med. 2019 Jan;25(1):65-69. doi: 10.1038/s41591-018-0268-3. Epub 2019 Jan 7.
10
Is It Time to Move Beyond Visual Inspection With Acetic Acid for Cervical Cancer Screening?醋酸目视检查用于宫颈癌筛查是否已过时?
Glob Health Sci Pract. 2018 Jun 29;6(2):242-246. doi: 10.9745/GHSP-D-18-00206. Print 2018 Jun 27.