Generalizable deep neural networks for image quality classification of cervical images.

Authors

Ahmed Syed Rakin, Befano Brian, Egemen Didem, Rodriguez Ana Cecilia, Desai Kanan T, Jeronimo Jose, Ajenifuja Kayode O, Clark Christopher, Perkins Rebecca, Campos Nicole G, Inturrisi Federica, Wentzensen Nicolas, Han Paul, Guillen Diego, Norman Judy, Goldstein Andrew T, Madeleine Margaret M, Donastorg Yeycy, Schiffman Mark, de Sanjose Silvia, Kalpathy-Cramer Jayashree

Affiliations

Athinoula A. Martinos Center for Biomedical Imaging, Department of Radiology, Massachusetts General Hospital, Boston, MA, 02129, USA.

Harvard Graduate Program in Biophysics, Harvard Medical School, Harvard University, Cambridge, MA, 02115, USA.

Publication information

Sci Rep. 2025 Feb 21;15(1):6312. doi: 10.1038/s41598-025-90024-0.

DOI: 10.1038/s41598-025-90024-0
PMID: 39984572
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC11845747/

Abstract

Successful translation of artificial intelligence (AI) models into clinical practice, across clinical domains, is frequently hindered by the lack of image quality control. Diagnostic models are often trained on images with no denotation of image quality in the training data; this, in turn, can lead to misclassifications by these models when implemented in the clinical setting. In the case of cervical images, quality classification is a crucial task to ensure accurate detection of precancerous lesions or cancer; this is true for both gynecologic-oncologists' (manual) and diagnostic AI models' (automated) predictions. Factors that impact the quality of a cervical image include but are not limited to blur, poor focus, poor light, noise, obscured view of the cervix due to mucus and/or blood, improper position, and over- and/or under-exposure. Utilizing a multi-level image quality ground truth denoted by providers, we generated an image quality classifier following a multi-stage model selection process that investigated several key design choices on a multi-heterogenous "SEED" dataset of 40,534 images. We subsequently validated the best model on an external dataset ("EXT"), comprising 1,340 images captured using a different device and acquired in different geographies from "SEED". We assessed the relative impact of various axes of data heterogeneity, including device, geography, and ground-truth rater on model performance. Our best performing model achieved an area under the receiver operating characteristics curve (AUROC) of 0.92 (low quality, LQ vs. rest) and 0.93 (high quality, HQ vs. rest), and a minimal total %extreme misclassification (%EM) of 2.8% on the internal validation set. Our model also generalized well externally, achieving corresponding AUROCs of 0.83 and 0.82, and %EM of 3.9% when tested out-of-the-box on the external validation ("EXT") set. 
Additionally, our model was geography agnostic with no meaningful difference in performance across geographies, did not exhibit catastrophic forgetting upon retraining with new data, and mimicked the overall/average ground truth rater behavior well. Our work represents one of the first efforts at generating and externally validating an image quality classifier across multiple axes of data heterogeneity to aid in visual diagnosis of cervical precancer and cancer. We hope that this will motivate the accompaniment of adequate guardrails for AI-based pipelines to account for image quality and generalizability concerns.
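
The two headline metrics above, one-vs-rest AUROC and total %EM, can be illustrated with a small sketch. It is not the authors' evaluation code. It assumes three ordinal classes (low, intermediate, high quality) and takes an "extreme misclassification" to mean a low-quality image predicted as high quality or the reverse, counted as a share of all images; the abstract does not spell out the exact definition or denominator, so the paper may normalize differently. The label encoding, function names, and toy probabilities are all hypothetical.

```python
from typing import List, Sequence, Tuple

# Hypothetical label encoding (not from the paper): low, intermediate, high quality.
LQ, IQ, HQ = 0, 1, 2


def one_vs_rest_auroc(y_true: Sequence[int], scores: Sequence[float], positive: int) -> float:
    """AUROC for `positive` vs. rest, computed as the probability that a random
    positive example scores higher than a random negative one (ties count 0.5)."""
    pos = [s for y, s in zip(y_true, scores) if y == positive]
    neg = [s for y, s in zip(y_true, scores) if y != positive]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative example")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


def percent_extreme_misclassification(y_true: Sequence[int], y_pred: Sequence[int]) -> float:
    """Assumed definition: percentage of all images where LQ is predicted as HQ
    or HQ is predicted as LQ. The paper's normalization may differ."""
    extreme = sum(1 for t, p in zip(y_true, y_pred) if {t, p} == {LQ, HQ})
    return 100.0 * extreme / len(y_true)


# Toy data: true labels and made-up class probabilities (p_LQ, p_IQ, p_HQ).
y_true: List[int] = [LQ, LQ, LQ, IQ, IQ, HQ, HQ, HQ]
probs: List[Tuple[float, float, float]] = [
    (0.80, 0.15, 0.05),
    (0.60, 0.30, 0.10),
    (0.10, 0.20, 0.70),  # true LQ scored as HQ: an extreme misclassification
    (0.30, 0.50, 0.20),
    (0.20, 0.45, 0.35),
    (0.05, 0.25, 0.70),
    (0.10, 0.30, 0.60),
    (0.25, 0.35, 0.40),
]

# Predicted class is the one with the highest probability.
y_pred = [max(range(3), key=lambda k: p[k]) for p in probs]

auroc_lq = one_vs_rest_auroc(y_true, [p[LQ] for p in probs], positive=LQ)
auroc_hq = one_vs_rest_auroc(y_true, [p[HQ] for p in probs], positive=HQ)
pct_em = percent_extreme_misclassification(y_true, y_pred)

print(f"AUROC (LQ vs. rest): {auroc_lq:.3f}")
print(f"AUROC (HQ vs. rest): {auroc_hq:.3f}")
print(f"%EM: {pct_em:.1f}")
```

The pairwise AUROC loop is quadratic in the number of images, which is fine for a toy example; for datasets the size of "SEED", a rank-based implementation from an established library would be the more practical choice.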

Figures

Fig. 1: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e239/11845747/6904133efd60/41598_2025_90024_Fig1_HTML.jpg
Fig. 2: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e239/11845747/4eb6d0e417ee/41598_2025_90024_Fig2_HTML.jpg
Fig. 3: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e239/11845747/06c1e5c7a265/41598_2025_90024_Fig3_HTML.jpg
Fig. 4: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e239/11845747/05e5c757048f/41598_2025_90024_Fig4_HTML.jpg
Fig. 5: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e239/11845747/a4923f8ebc8e/41598_2025_90024_Fig5_HTML.jpg
Fig. 6: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e239/11845747/4ef2ab990756/41598_2025_90024_Fig6_HTML.jpg
Fig. 7: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e239/11845747/85a4966ecf0e/41598_2025_90024_Fig7_HTML.jpg
Fig. 8: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e239/11845747/3a8851af264a/41598_2025_90024_Fig8_HTML.jpg
Fig. 9: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e239/11845747/07892c0d5a7d/41598_2025_90024_Fig9_HTML.jpg
