• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用特征选择技术预测地中海贫血:一项比较分析。

Predicting Thalassemia Using Feature Selection Techniques: A Comparative Analysis.

作者信息

Saleem Muniba, Aslam Waqar, Lali Muhammad Ikram Ullah, Rauf Hafiz Tayyab, Nasr Emad Abouel

机构信息

Department of Computer Science & Information Technology, The Government Sadiq College Women University Bahawalpur, Bahawalpur 63100, Pakistan.

Department of Information Security, The Islamia University of Bahawalpur, Bahawalpur 63100, Pakistan.

出版信息

Diagnostics (Basel). 2023 Nov 14;13(22):3441. doi: 10.3390/diagnostics13223441.

DOI:10.3390/diagnostics13223441
PMID:37998577
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10670018/
Abstract

Thalassemia represents one of the most common genetic disorders worldwide, characterized by defects in hemoglobin synthesis. The affected individuals suffer from malfunctioning of one or more of the four globin genes, leading to chronic hemolytic anemia, an imbalance in the hemoglobin chain ratio, iron overload, and ineffective erythropoiesis. Despite the challenges posed by this condition, recent years have witnessed significant advancements in diagnosis, therapy, and transfusion support, significantly improving the prognosis for thalassemia patients. This research empirically evaluates the efficacy of models constructed using classification methods and explores the effectiveness of relevant features that are derived using various machine-learning techniques. Five feature selection approaches, namely Chi-Square (χ2), Exploratory Factor Score (EFS), tree-based Recursive Feature Elimination (RFE), gradient-based RFE, and Linear Regression Coefficient, were employed to determine the optimal feature set. Nine classifiers, namely K-Nearest Neighbors (KNN), Decision Trees (DT), Gradient Boosting Classifier (GBC), Linear Regression (LR), AdaBoost, Extreme Gradient Boosting (XGB), Random Forest (RF), Light Gradient Boosting Machine (LGBM), and Support Vector Machine (SVM), were utilized to evaluate the performance. The χ2 method achieved accuracy, registering 91.56% precision, 91.04% recall, and 92.65% f-score when aligned with the LR classifier. Moreover, the results underscore that amalgamating over-sampling with Synthetic Minority Over-sampling Technique (SMOTE), RFE, and 10-fold cross-validation markedly elevates the detection accuracy for αT patients. Notably, the Gradient Boosting Classifier (GBC) achieves 93.46% accuracy, 93.89% recall, and 92.72% F1 score.

摘要

地中海贫血是全球最常见的遗传性疾病之一,其特征是血红蛋白合成缺陷。受影响的个体一个或多个四个珠蛋白基因功能异常,导致慢性溶血性贫血、血红蛋白链比例失衡、铁过载和无效红细胞生成。尽管这种疾病带来了诸多挑战,但近年来在诊断、治疗和输血支持方面取得了显著进展,显著改善了地中海贫血患者的预后。本研究通过实证评估使用分类方法构建的模型的疗效,并探索使用各种机器学习技术得出的相关特征的有效性。采用了五种特征选择方法,即卡方检验(χ2)、探索性因子得分(EFS)、基于树的递归特征消除(RFE)、基于梯度的RFE和线性回归系数,以确定最佳特征集。使用了九个分类器,即K近邻(KNN)、决策树(DT)、梯度提升分类器(GBC)、线性回归(LR)、AdaBoost、极端梯度提升(XGB)、随机森林(RF)、轻量级梯度提升机(LGBM)和支持向量机(SVM)来评估性能。当与LR分类器结合时,χ2方法的准确率达到91.56%,召回率为91.04%,F值为92.65%。此外,结果强调将过采样与合成少数过采样技术(SMOTE)、RFE和10折交叉验证相结合,可显著提高αT患者的检测准确率。值得注意的是,梯度提升分类器(GBC)的准确率达到93.46%,召回率为93.89%,F1值为92.72%。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fb68/10670018/714ec5187f5f/diagnostics-13-03441-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fb68/10670018/714ec5187f5f/diagnostics-13-03441-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fb68/10670018/714ec5187f5f/diagnostics-13-03441-g001.jpg

相似文献

1
Predicting Thalassemia Using Feature Selection Techniques: A Comparative Analysis.使用特征选择技术预测地中海贫血:一项比较分析。
Diagnostics (Basel). 2023 Nov 14;13(22):3441. doi: 10.3390/diagnostics13223441.
2
Artificial intelligence in clinical care amidst COVID-19 pandemic: A systematic review.COVID-19大流行期间临床护理中的人工智能:一项系统综述。
Comput Struct Biotechnol J. 2021;19:2833-2850. doi: 10.1016/j.csbj.2021.05.010. Epub 2021 May 7.
3
Classification and prediction of spinal disease based on the SMOTE-RFE-XGBoost model.基于SMOTE-RFE-XGBoost模型的脊柱疾病分类与预测
PeerJ Comput Sci. 2023 Mar 10;9:e1280. doi: 10.7717/peerj-cs.1280. eCollection 2023.
4
Screening of COVID-19 based on the extracted radiomics features from chest CT images.基于胸部 CT 图像提取的放射组学特征对 COVID-19 进行筛查。
J Xray Sci Technol. 2021;29(2):229-243. doi: 10.3233/XST-200831.
5
Union With Recursive Feature Elimination: A Feature Selection Framework to Improve the Classification Performance of Multicategory Causes of Death in Colorectal Cancer.基于递归特征消除的特征选择框架,提高结直肠癌多死因分类性能
Lab Invest. 2024 Mar;104(3):100320. doi: 10.1016/j.labinv.2023.100320. Epub 2023 Dec 28.
6
Machine learning-based models to predict the conversion of normal blood pressure to hypertension within 5-year follow-up.基于机器学习的模型预测正常血压在 5 年内转为高血压。
PLoS One. 2024 Mar 14;19(3):e0300201. doi: 10.1371/journal.pone.0300201. eCollection 2024.
7
Performance Analysis of Conventional Machine Learning Algorithms for Identification of Chronic Kidney Disease in Type 1 Diabetes Mellitus Patients.用于识别1型糖尿病患者慢性肾病的传统机器学习算法的性能分析
Diagnostics (Basel). 2021 Dec 3;11(12):2267. doi: 10.3390/diagnostics11122267.
8
Non-Contrasted CT Radiomics for SAH Prognosis Prediction.用于蛛网膜下腔出血预后预测的非增强CT影像组学
Bioengineering (Basel). 2023 Aug 16;10(8):967. doi: 10.3390/bioengineering10080967.
9
Diagnosing Coronary Artery Disease on the Basis of Hard Ensemble Voting Optimization.基于硬集合投票优化的冠状动脉疾病诊断。
Medicina (Kaunas). 2022 Nov 28;58(12):1745. doi: 10.3390/medicina58121745.
10
Joint modeling strategy for using electronic medical records data to build machine learning models: an example of intracerebral hemorrhage.利用电子病历数据构建机器学习模型的联合建模策略:以脑出血为例。
BMC Med Inform Decis Mak. 2022 Oct 25;22(1):278. doi: 10.1186/s12911-022-02018-x.

引用本文的文献

1
Advanced molecular approaches to thalassemia disorder and the selection of molecular-level diagnostic testing in resource-limited settings.地中海贫血症的先进分子方法以及资源有限环境下分子水平诊断测试的选择
Hematol Transfus Cell Ther. 2025 Jun 14;47(3):103860. doi: 10.1016/j.htct.2025.103860.
2
Assessing knowledge on thalassemia for prevention and management practices among the tribal population of Sitteri Panchayat, Dharmapuri District, Tamil Nadu, India.评估印度泰米尔纳德邦达摩布里区西泰里村潘查亚特部落人群关于地中海贫血预防和管理措施的知识。
J Family Med Prim Care. 2025 Mar;14(3):1098-1103. doi: 10.4103/jfmpc.jfmpc_1526_24. Epub 2025 Mar 25.

本文引用的文献

1
TT@MHA: A machine learning-based webpage tool for discriminating thalassemia trait from microcytic hypochromic anemia patients.基于机器学习的网页工具,用于区分地中海贫血和小细胞低色素性贫血患者。
Clin Chim Acta. 2023 May 1;545:117368. doi: 10.1016/j.cca.2023.117368. Epub 2023 Apr 29.
2
Predicting thalassemia using deep neural network based on red blood cell indices.基于红细胞指数的深度学习神经网络预测地中海贫血
Clin Chim Acta. 2023 Mar 15;543:117329. doi: 10.1016/j.cca.2023.117329. Epub 2023 Apr 3.
3
Prediction of [Formula: see text]-Thalassemia carriers using complete blood count features.
应用全血细胞计数特征预测 [公式:见正文]-地中海贫血携带者。
Sci Rep. 2022 Nov 21;12(1):19999. doi: 10.1038/s41598-022-22011-8.
4
DeepThal: A Deep Learning-Based Framework for the Large-Scale Prediction of the α-Thalassemia Trait Using Red Blood Cell Parameters.DeepThal:一种基于深度学习的框架,用于利用红细胞参数大规模预测α地中海贫血特征
J Clin Med. 2022 Oct 26;11(21):6305. doi: 10.3390/jcm11216305.
5
From Unit to Dose: A Machine Learning Approach for Precise Prediction of Hemoglobin and Iron Content in Individual Packed Red Blood Cell Units.从单位到剂量:一种用于精确预测单个袋装红细胞单位中血红蛋白和铁含量的机器学习方法。
Adv Sci (Weinh). 2022 Dec;9(36):e2204077. doi: 10.1002/advs.202204077. Epub 2022 Nov 4.
6
Non-Transfusion-Dependent Thalassemia: A Panoramic Review.非输血依赖型地中海贫血症:全景综述。
Medicina (Kaunas). 2022 Oct 21;58(10):1496. doi: 10.3390/medicina58101496.
7
Deep Learning Assisted Automated Assessment of Thalassaemia from Haemoglobin Electrophoresis Images.基于血红蛋白电泳图像的深度学习辅助地中海贫血自动评估
Diagnostics (Basel). 2022 Oct 3;12(10):2405. doi: 10.3390/diagnostics12102405.
8
Luspatercept: A New Tool for the Treatment of Anemia Related to β-Thalassemia, Myelodysplastic Syndromes and Primary Myelofibrosis.鲁索替尼:治疗β地中海贫血、骨髓增生异常综合征和原发性骨髓纤维化相关贫血的新工具。
Diseases. 2022 Oct 9;10(4):85. doi: 10.3390/diseases10040085.
9
Increased myocardial extracellular volume is associated with myocardial iron overload and heart failure in thalassemia major.在重型地中海贫血中,心肌细胞外容量增加与心肌铁过载及心力衰竭相关。
Eur Radiol. 2023 Feb;33(2):1266-1276. doi: 10.1007/s00330-022-09120-8. Epub 2022 Sep 6.
10
Advances in screening of thalassaemia.地中海贫血症的筛查进展。
Clin Chim Acta. 2022 Sep 1;534:176-184. doi: 10.1016/j.cca.2022.08.001. Epub 2022 Aug 3.