
Machine learning-based statistical analysis for early stage detection of cervical cancer.

Affiliations

Department of Software Engineering (SWE), Daffodil International University (DIU), Sukrabad, Dhaka, 1207, Bangladesh.

Department of Electrical and Computer Engineering, University of Saskatchewan, 57 Campus Drive, Saskatoon, SK, S7N 5A9, Canada; Group of Biophotomatiχ, Department of Information and Communication Technology, Mawlana Bhashani Science and Technology University, Santosh, Tangail, 1902, Bangladesh.

Publication information

Comput Biol Med. 2021 Dec;139:104985. doi: 10.1016/j.compbiomed.2021.104985. Epub 2021 Oct 28.

DOI: 10.1016/j.compbiomed.2021.104985
PMID: 34735942
Abstract

Cervical cancer (CC) is one of the most common cancers in women and remains a significant cause of mortality, particularly in less developed countries, although it can be treated effectively if detected at an early stage. This study aimed to identify efficient machine-learning classification models for detecting early-stage CC from clinical data. We obtained a CC dataset from the Kaggle data repository that contains four class attributes: biopsy, cytology, Hinselmann, and Schiller. The dataset was split into four datasets, one per class attribute. Three feature transformation methods (logarithmic, sine function, and Z-score) were applied to each dataset. Several supervised machine learning algorithms were assessed for their classification performance. A Random Tree (RT) algorithm provided the best classification accuracy for the biopsy (98.33%) and cytology (98.65%) data, whereas Random Forest (RF) and instance-based k-nearest neighbor (IBk) provided the best performance for Hinselmann (99.16%) and Schiller (98.58%), respectively. Among the feature transformation methods, the logarithmic transformation gave the best performance for the biopsy dataset, whereas the sine function was superior for cytology. Both the logarithmic and sine transformations performed best for the Hinselmann dataset, while Z-score was best for the Schiller dataset. Various feature selection techniques (FST) were applied to the transformed datasets to identify and prioritize important risk factors. The outcomes of this study indicate that, with appropriate system design and tuning, machine learning classification methods can detect CC accurately and efficiently in its early stages using clinical data.
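
The three feature transformations named in the abstract (logarithmic, sine, and Z-score) can be illustrated with a minimal NumPy sketch. This is not the authors' code: the abstract does not give the exact formulas, so the `log(1 + x)` offset, the per-column Z-score, the handling of constant columns, and the toy feature matrix `X` are all assumptions made for illustration.

```python
import numpy as np


def log_transform(X):
    """Element-wise log(1 + x). The +1 offset is an assumption so that
    zero-valued features stay finite; the abstract only says 'log'."""
    return np.log1p(np.asarray(X, dtype=float))


def sine_transform(X):
    """Element-wise sine of each feature value."""
    return np.sin(np.asarray(X, dtype=float))


def zscore_transform(X):
    """Per-column standardization: (x - mean) / std.
    Constant columns (std == 0) are mapped to zeros (an assumption)."""
    X = np.asarray(X, dtype=float)
    mean = X.mean(axis=0)
    std = X.std(axis=0)
    safe_std = np.where(std == 0, 1.0, std)
    return (X - mean) / safe_std


# Hypothetical toy data: rows are patients, columns are clinical features.
X = np.array([
    [18.0, 0.0, 1.0],
    [25.0, 2.0, 1.0],
    [40.0, 5.0, 1.0],
])

transformed = {
    "log": log_transform(X),
    "sine": sine_transform(X),
    "zscore": zscore_transform(X),
}

for name, values in transformed.items():
    print(name)
    print(np.round(values, 3))
```

Each transformed matrix would then be passed to the same set of classifiers, so that accuracy can be compared across transformations for each of the four target attributes.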

