• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

稀疏正则化通过逻辑回归增强基因选择和白血病亚型分类。

Sparsity regularization enhances gene selection and leukemia subtype classification via logistic regression.

作者信息

Mahmood Nozad Hussein, Kadir Dler Hussein

机构信息

Department of Statistics and information, College of Administration and Economics, Salahaddin University-Erbil, Erbil, Iraq; Cihan University Sulaimaniya Research Center (CUSRC), Cihan University Sulaimaniya, Sulaymaniyah City, Kurdistan Region, Iraq.

Department of Statistics and information, College of Administration and Economics, Salahaddin University-Erbil, Erbil, Iraq.

出版信息

Leuk Res. 2025 Mar;150:107663. doi: 10.1016/j.leukres.2025.107663. Epub 2025 Feb 11.

DOI:10.1016/j.leukres.2025.107663
PMID:39954557
Abstract

This study investigated the application of sparsity regularization methods to improve the classification of leukemia subtypes using high-dimensional gene expression data. Multinomial logistic regression models with the sparsity methods of Ridge, Lasso, and Elastic Net regularizations were employed to address overfitting and dimensionality issues while enhancing model interpretability. The study used a leukemia cancer dataset from the Curated Microarray Database (CuMiDa), which included gene expression data for 16,383 genes across 281 samples representing seven different types of leukemia. The statistical metrics of Accuracy, Kappa statistics, AUC, and F1-score were used to measure the models' implementation. Besides, the effectiveness and ability of each method in gene selection and dimensional reduction of the models were discussed. Elastic Net regularization was a better technique than the Ridge and Lasso based on overall classification performance; it also reached the highest accuracy along with Kappa values. On the other hand, both Lasso and Elastic Net were making more effective feature selections, creating sparse models that could efficiently discriminate leukemia subtypes. In this way, the results highlighted that the inclusion of sparsity regularization could enhance knowledge and accuracy in such a challenging task of subclass leukemia classification, enabling much more tailored treatments.

摘要

本研究探讨了稀疏正则化方法在利用高维基因表达数据改善白血病亚型分类中的应用。采用具有岭回归、套索回归和弹性网络正则化等稀疏方法的多项逻辑回归模型来解决过拟合和维度问题,同时增强模型的可解释性。该研究使用了来自精心策划的微阵列数据库(CuMiDa)的白血病癌症数据集,其中包括代表七种不同类型白血病的281个样本中16383个基因的基因表达数据。使用准确率、卡帕统计量、AUC和F1分数等统计指标来衡量模型的实施情况。此外,还讨论了每种方法在模型的基因选择和降维方面的有效性和能力。基于整体分类性能,弹性网络正则化是比岭回归和套索回归更好的技术;它还达到了最高的准确率以及卡帕值。另一方面,套索回归和弹性网络都能更有效地进行特征选择,创建能够有效区分白血病亚型的稀疏模型。通过这种方式,结果突出表明,在白血病亚类分类这一具有挑战性的任务中,纳入稀疏正则化可以提高认知度和准确率,从而实现更具针对性的治疗。

相似文献

1
Sparsity regularization enhances gene selection and leukemia subtype classification via logistic regression.稀疏正则化通过逻辑回归增强基因选择和白血病亚型分类。
Leuk Res. 2025 Mar;150:107663. doi: 10.1016/j.leukres.2025.107663. Epub 2025 Feb 11.
2
Sparse logistic regression with a L1/2 penalty for gene selection in cancer classification.基于 L1/2 罚项的稀疏逻辑回归在癌症分类中的基因选择。
BMC Bioinformatics. 2013 Jun 19;14:198. doi: 10.1186/1471-2105-14-198.
3
Evaluating key predictors of breast cancer through survival: a comparison of AFT frailty models with LASSO, ridge, and elastic net regularization.通过生存分析评估乳腺癌的关键预测因素:AFT脆弱模型与LASSO、岭回归和弹性网络正则化的比较
BMC Cancer. 2025 Apr 11;25(1):665. doi: 10.1186/s12885-025-14040-z.
4
Regularized logistic regression with adjusted adaptive elastic net for gene selection in high dimensional cancer classification.用于高维癌症分类中基因选择的带调整自适应弹性网络的正则化逻辑回归
Comput Biol Med. 2015 Dec 1;67:136-45. doi: 10.1016/j.compbiomed.2015.10.008. Epub 2015 Oct 24.
5
Gene selection in cancer classification using sparse logistic regression with Bayesian regularization.使用带贝叶斯正则化的稀疏逻辑回归进行癌症分类中的基因选择。
Bioinformatics. 2006 Oct 1;22(19):2348-55. doi: 10.1093/bioinformatics/btl386. Epub 2006 Jul 14.
6
Sparse Logistic Regression With Penalty for Emotion Recognition in Electroencephalography Classification.用于脑电图分类中情感识别的带惩罚项的稀疏逻辑回归
Front Neuroinform. 2020 Aug 7;14:29. doi: 10.3389/fninf.2020.00029. eCollection 2020.
7
Sparse logistic regression with Lp penalty for biomarker identification.用于生物标志物识别的具有Lp惩罚的稀疏逻辑回归。
Stat Appl Genet Mol Biol. 2007;6:Article6. doi: 10.2202/1544-6115.1248. Epub 2007 Feb 10.
8
Overcoming the inadaptability of sparse group lasso for data with various group structures by stacking.通过堆叠克服具有各种群组结构的数据的稀疏群组套索的不适应性。
Bioinformatics. 2022 Mar 4;38(6):1542-1549. doi: 10.1093/bioinformatics/btab848.
9
Identification of biomarkers predictive of metastasis development in early-stage colorectal cancer using network-based regularization.基于网络正则化的方法识别早期结直肠癌转移发展的生物标志物。
BMC Bioinformatics. 2023 Jan 16;24(1):17. doi: 10.1186/s12859-022-05104-z.
10
Elastic SCAD as a novel penalization method for SVM classification tasks in high-dimensional data.弹性 SCAD 作为一种新的惩罚方法,用于高维数据中的 SVM 分类任务。
BMC Bioinformatics. 2011 May 9;12:138. doi: 10.1186/1471-2105-12-138.

引用本文的文献

1
Constructing an early warning model for elderly sepsis patients based on machine learning.基于机器学习构建老年脓毒症患者预警模型。
Sci Rep. 2025 Mar 27;15(1):10580. doi: 10.1038/s41598-025-95604-8.