• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过创新的堆叠集成学习模型改善甲状腺疾病诊断

Improving thyroid disorder diagnosis via innovative stacking ensemble learning model.

作者信息

Hassan Ayesha, Ramzan Shabana, Raza Ali, Munwar Iqbal Muhammad, Smerat Aseel, Latif Fitriyani Norma, Syafrudin Muhammad, Won Lee Seung

机构信息

Department of Computer Science & IT, Government Sadiq College Women University Bahawalpur, Punjab, Pakistan.

Department of Software Engineering, The University of Lahore, Lahore, Punjab, Pakistan.

出版信息

Digit Health. 2025 Jun 6;11:20552076251341430. doi: 10.1177/20552076251341430. eCollection 2025 Jan-Dec.

DOI:10.1177/20552076251341430
PMID:40487882
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12144368/
Abstract

OBJECTIVE

Hypothyroidism, hyperthyroidism, thyroid nodules, and other thyroid disorders are common around the world, affect millions of people worldwide, and untreated health conditions may lead to serious health issues. An accurate and timely diagnosis serves as crucial for proper management and medication. This study utilizes a dataset from the UCI machine-learning repository to put forward the comprehensive machine-learning technique for diagnosing thyroid disorders.

METHODS

The proposed methodology involved exploratory data analysis and preparation, which included handling missing values, encoding categorical values, and selecting features. The synthetic minority over-sampling technique technique is utilized to overcome the problem of class imbalance. Five advanced machine learning (ML) algorithms, logistic regression, support vector machine, decision tree, random forest, and gradient boosting are employed to develop predictive models. Further, an innovative stacking ensemble method is proposed with the help of four applied models. The results from these models are aggregated, and logistic regression serves as a meta-learner.

RESULTS

A 10-fold cross-validation technique is utilized to ensure robust model evaluation and reduce the risk of overfitting by using one test set for each subset and training on the rest of the subsets. The ensemble model attained an accuracy of 99.86%, outperforming individual models.

CONCLUSION

These results reveal the capability of ML, especially ensemble approaches, to enhance accurate and timely diagnosis of thyroid disorders.

摘要

目的

甲状腺功能减退、甲状腺功能亢进、甲状腺结节及其他甲状腺疾病在全球范围内普遍存在,影响着全球数百万人,未经治疗的健康状况可能导致严重的健康问题。准确及时的诊断对于正确管理和用药至关重要。本研究利用来自加州大学欧文分校机器学习存储库的数据集,提出用于诊断甲状腺疾病的综合机器学习技术。

方法

所提出的方法包括探索性数据分析和准备,其中包括处理缺失值、对分类值进行编码以及选择特征。利用合成少数过采样技术来克服类不平衡问题。采用逻辑回归、支持向量机、决策树、随机森林和梯度提升这五种先进的机器学习(ML)算法来开发预测模型。此外,借助四个应用模型提出了一种创新的堆叠集成方法。对这些模型的结果进行汇总,逻辑回归作为元学习器。

结果

采用10折交叉验证技术,通过对每个子集使用一个测试集并在其余子集上进行训练,确保稳健的模型评估并降低过拟合风险。集成模型的准确率达到99.86%,优于单个模型。

结论

这些结果揭示了机器学习,尤其是集成方法,在增强甲状腺疾病准确及时诊断方面的能力。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65d6/12144368/7cbb83c682cf/10.1177_20552076251341430-fig14.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65d6/12144368/7f671b844865/10.1177_20552076251341430-fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65d6/12144368/7dc43ed91a62/10.1177_20552076251341430-fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65d6/12144368/b594632ef8f0/10.1177_20552076251341430-fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65d6/12144368/7706dd57e039/10.1177_20552076251341430-fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65d6/12144368/5475d420ef28/10.1177_20552076251341430-fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65d6/12144368/46964d2f9bcd/10.1177_20552076251341430-fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65d6/12144368/13b716ff57e2/10.1177_20552076251341430-fig7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65d6/12144368/a6d7713ddb95/10.1177_20552076251341430-fig8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65d6/12144368/c034c6028677/10.1177_20552076251341430-fig9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65d6/12144368/e45704a4f19e/10.1177_20552076251341430-fig10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65d6/12144368/05116bddb811/10.1177_20552076251341430-fig11.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65d6/12144368/d815d81e9392/10.1177_20552076251341430-fig12.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65d6/12144368/941c4b95c91a/10.1177_20552076251341430-fig13.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65d6/12144368/7cbb83c682cf/10.1177_20552076251341430-fig14.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65d6/12144368/7f671b844865/10.1177_20552076251341430-fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65d6/12144368/7dc43ed91a62/10.1177_20552076251341430-fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65d6/12144368/b594632ef8f0/10.1177_20552076251341430-fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65d6/12144368/7706dd57e039/10.1177_20552076251341430-fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65d6/12144368/5475d420ef28/10.1177_20552076251341430-fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65d6/12144368/46964d2f9bcd/10.1177_20552076251341430-fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65d6/12144368/13b716ff57e2/10.1177_20552076251341430-fig7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65d6/12144368/a6d7713ddb95/10.1177_20552076251341430-fig8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65d6/12144368/c034c6028677/10.1177_20552076251341430-fig9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65d6/12144368/e45704a4f19e/10.1177_20552076251341430-fig10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65d6/12144368/05116bddb811/10.1177_20552076251341430-fig11.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65d6/12144368/d815d81e9392/10.1177_20552076251341430-fig12.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65d6/12144368/941c4b95c91a/10.1177_20552076251341430-fig13.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65d6/12144368/7cbb83c682cf/10.1177_20552076251341430-fig14.jpg

相似文献

1
Improving thyroid disorder diagnosis via innovative stacking ensemble learning model.通过创新的堆叠集成学习模型改善甲状腺疾病诊断
Digit Health. 2025 Jun 6;11:20552076251341430. doi: 10.1177/20552076251341430. eCollection 2025 Jan-Dec.
2
An efficient ensemble based machine learning approach for predicting Chronic Kidney Disease.一种基于集成的高效机器学习方法用于预测慢性肾脏病。
Curr Med Imaging. 2023 May 8. doi: 10.2174/1573405620666230508104538.
3
Can Predictive Modeling Tools Identify Patients at High Risk of Prolonged Opioid Use After ACL Reconstruction?预测模型工具能否识别 ACL 重建术后阿片类药物使用时间延长的高风险患者?
Clin Orthop Relat Res. 2020 Jul;478(7):0-1618. doi: 10.1097/CORR.0000000000001251.
4
Machine learning-based predictive models for perioperative major adverse cardiovascular events in patients with stable coronary artery disease undergoing noncardiac surgery.基于机器学习的预测模型用于接受非心脏手术的稳定冠状动脉疾病患者围手术期主要不良心血管事件的预测
Comput Methods Programs Biomed. 2025 Mar;260:108561. doi: 10.1016/j.cmpb.2024.108561. Epub 2024 Dec 13.
5
Supervised Machine Learning Models for Predicting Sepsis-Associated Liver Injury in Patients With Sepsis: Development and Validation Study Based on a Multicenter Cohort Study.用于预测脓毒症患者脓毒症相关肝损伤的监督式机器学习模型:基于多中心队列研究的开发与验证研究
J Med Internet Res. 2025 May 26;27:e66733. doi: 10.2196/66733.
6
Exploring Ensemble Learning Techniques for Infant Mortality Prediction: A Technical Analysis of XGBoost Stacking AdaBoost and Bagging Models.探索用于婴儿死亡率预测的集成学习技术:XGBoost、堆叠、AdaBoost和装袋模型的技术分析
Birth Defects Res. 2025 Feb;117(2):e2443. doi: 10.1002/bdr2.2443.
7
RDET stacking classifier: a novel machine learning based approach for stroke prediction using imbalance data.RDET堆叠分类器:一种基于机器学习的新型方法,用于利用不平衡数据进行中风预测。
PeerJ Comput Sci. 2023 Nov 21;9:e1684. doi: 10.7717/peerj-cs.1684. eCollection 2023.
8
Joint modeling strategy for using electronic medical records data to build machine learning models: an example of intracerebral hemorrhage.利用电子病历数据构建机器学习模型的联合建模策略:以脑出血为例。
BMC Med Inform Decis Mak. 2022 Oct 25;22(1):278. doi: 10.1186/s12911-022-02018-x.
9
Ensemble machine learning model trained on a new synthesized dataset generalizes well for stress prediction using wearable devices.在新合成数据集上训练的集成机器学习模型,对于使用可穿戴设备进行压力预测具有良好的泛化能力。
J Biomed Inform. 2023 Dec;148:104556. doi: 10.1016/j.jbi.2023.104556. Epub 2023 Dec 2.
10
[Construction of a machine learning ensemble prediction model for gas chromatographic retention index on stationary phases with different polarities].[基于不同极性固定相的气相色谱保留指数构建机器学习集成预测模型]
Se Pu. 2025 Apr 8;43(4):355-362. doi: 10.3724/SP.J.1123.2024.07014.

本文引用的文献

1
An improved deep convolutional neural network-based YouTube video classification using textual features.一种基于改进的深度卷积神经网络并利用文本特征的YouTube视频分类方法。
Heliyon. 2024 Aug 10;10(16):e35812. doi: 10.1016/j.heliyon.2024.e35812. eCollection 2024 Aug 30.
2
Novel glassbox based explainable boosting machine for fault detection in electrical power transmission system.基于新型玻璃盒的可解释增强机在输配电系统故障检测中的应用。
PLoS One. 2024 Aug 28;19(8):e0309459. doi: 10.1371/journal.pone.0309459. eCollection 2024.
3
Optimized virtual reality design through user immersion level detection with novel feature fusion and explainable artificial intelligence.
通过具有新颖特征融合和可解释人工智能的用户沉浸水平检测实现优化的虚拟现实设计。
PeerJ Comput Sci. 2024 Jul 19;10:e2150. doi: 10.7717/peerj-cs.2150. eCollection 2024.
4
Integrated Diagnostics of Thyroid Nodules.甲状腺结节的综合诊断
Cancers (Basel). 2024 Jan 11;16(2):311. doi: 10.3390/cancers16020311.
5
Thyroid Disease Prediction Using Selective Features and Machine Learning Techniques.利用选择性特征和机器学习技术进行甲状腺疾病预测
Cancers (Basel). 2022 Aug 13;14(16):3914. doi: 10.3390/cancers14163914.
6
Thyroid nodules.甲状腺结节。
Best Pract Res Clin Endocrinol Metab. 2014 Mar;28(2):245-77. doi: 10.1016/j.beem.2013.08.007. Epub 2013 Aug 30.
7
SMOTE for high-dimensional class-imbalanced data.过采样处理高维类别不平衡数据。
BMC Bioinformatics. 2013 Mar 22;14:106. doi: 10.1186/1471-2105-14-106.
8
A three-stage expert system based on support vector machines for thyroid disease diagnosis.基于支持向量机的甲状腺疾病诊断三阶段专家系统。
J Med Syst. 2012 Jun;36(3):1953-63. doi: 10.1007/s10916-011-9655-8. Epub 2011 Feb 1.
9
Thyroid incidentalomas. Prevalence, diagnosis, significance, and management.甲状腺偶发瘤。患病率、诊断、意义及管理。
Endocrinol Metab Clin North Am. 2000 Mar;29(1):187-203. doi: 10.1016/s0889-8529(05)70123-7.