• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于微阵列数据的局部降维的 L1 正则化特征选择方法。

A L1-regularized feature selection method for local dimension reduction on microarray data.

机构信息

Department of Electronic Engineering, Xiamen University, Fujian 361005, China; Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518000, China.

Department of Electronic Engineering, Xiamen University, Fujian 361005, China.

出版信息

Comput Biol Chem. 2017 Apr;67:92-101. doi: 10.1016/j.compbiolchem.2016.12.010. Epub 2016 Dec 31.

DOI:10.1016/j.compbiolchem.2016.12.010
PMID:28064045
Abstract

Dimension reduction is a crucial technique in machine learning and data mining, which is widely used in areas of medicine, bioinformatics and genetics. In this paper, we propose a two-stage local dimension reduction approach for classification on microarray data. In first stage, a new L1-regularized feature selection method is defined to remove irrelevant and redundant features and to select the important features (biomarkers). In the next stage, PLS-based feature extraction is implemented on the selected features to extract synthesis features that best reflect discriminating characteristics for classification. The suitability of the proposal is demonstrated in an empirical study done with ten widely used microarray datasets, and the results show its effectiveness and competitiveness compared with four state-of-the-art methods. The experimental results on St Jude dataset shows that our method can be effectively applied to microarray data analysis for subtype prediction and the discovery of gene coexpression.

摘要

降维是机器学习和数据挖掘中的一项关键技术,广泛应用于医学、生物信息学和遗传学等领域。在本文中,我们提出了一种两阶段的局部降维方法,用于微阵列数据的分类。在第一阶段,定义了一种新的 L1 正则化特征选择方法,以去除不相关和冗余的特征,并选择重要的特征(生物标志物)。在下一阶段,在选择的特征上实现基于 PLS 的特征提取,以提取最佳反映分类区分特征的综合特征。该提案在对十个广泛使用的微阵列数据集进行的实证研究中得到了验证,结果表明与四种最先进的方法相比,它具有有效性和竞争力。在 St Jude 数据集上的实验结果表明,我们的方法可以有效地应用于微阵列数据分析,用于亚类预测和基因共表达的发现。

相似文献

1
A L1-regularized feature selection method for local dimension reduction on microarray data.基于微阵列数据的局部降维的 L1 正则化特征选择方法。
Comput Biol Chem. 2017 Apr;67:92-101. doi: 10.1016/j.compbiolchem.2016.12.010. Epub 2016 Dec 31.
2
A centroid-based gene selection method for microarray data classification.一种基于质心的微阵列数据分类基因选择方法。
J Theor Biol. 2016 Jul 7;400:32-41. doi: 10.1016/j.jtbi.2016.03.034. Epub 2016 Apr 4.
3
The feature selection bias problem in relation to high-dimensional gene data.与高维基因数据相关的特征选择偏差问题。
Artif Intell Med. 2016 Jan;66:63-71. doi: 10.1016/j.artmed.2015.11.001. Epub 2015 Nov 14.
4
PLS dimension reduction for classification with microarray data.用于微阵列数据分类的偏最小二乘降维法
Stat Appl Genet Mol Biol. 2004;3:Article33. doi: 10.2202/1544-6115.1075. Epub 2004 Nov 23.
5
Designing a hybrid dimension reduction for improving the performance of Amharic news document classification.设计一种混合降维方法以提高阿姆哈拉语新闻文档分类的性能。
PLoS One. 2021 May 21;16(5):e0251902. doi: 10.1371/journal.pone.0251902. eCollection 2021.
6
Improving PLS-RFE based gene selection for microarray data classification.改进基于偏最小二乘回归特征消除法的基因选择用于微阵列数据分类
Comput Biol Med. 2015 Jul;62:14-24. doi: 10.1016/j.compbiomed.2015.04.011. Epub 2015 Apr 17.
7
Partial least squares dimension reduction for microarray gene expression data with a censored response.具有删失响应的微阵列基因表达数据的偏最小二乘降维法
Math Biosci. 2005 Jan;193(1):119-37. doi: 10.1016/j.mbs.2004.10.007. Epub 2005 Jan 22.
8
Selecting subsets of newly extracted features from PCA and PLS in microarray data analysis.在微阵列数据分析中从主成分分析(PCA)和偏最小二乘法(PLS)中选择新提取特征的子集。
BMC Genomics. 2008 Sep 16;9 Suppl 2(Suppl 2):S24. doi: 10.1186/1471-2164-9-S2-S24.
9
Kernelized partial least squares for feature reduction and classification of gene microarray data.用于基因微阵列数据特征约简与分类的核偏最小二乘法
BMC Syst Biol. 2011;5 Suppl 3(Suppl 3):S13. doi: 10.1186/1752-0509-5-S3-S13. Epub 2011 Dec 23.
10
An efficient data preprocessing approach for large scale medical data mining.一种用于大规模医学数据挖掘的高效数据预处理方法。
Technol Health Care. 2015;23(2):153-60. doi: 10.3233/THC-140887.

引用本文的文献

1
Multi-task machine learning for transfusion decision support in acute upper gastrointestinal bleeding: a novel ensemble approach with clinical validation.用于急性上消化道出血输血决策支持的多任务机器学习:一种经过临床验证的新型集成方法
J Transl Med. 2025 Sep 2;23(1):979. doi: 10.1186/s12967-025-06995-1.
2
PPIGCF: A Protein-Protein Interaction-Based Gene Correlation Filter for Optimal Gene Selection.PPIGCF:一种基于蛋白质相互作用的基因关联滤波器,用于最优基因选择。
Genes (Basel). 2023 May 10;14(5):1063. doi: 10.3390/genes14051063.
3
Mental Health Identification of Children and Young Adults in a Pandemic Using Machine Learning Classifiers.
使用机器学习分类器对大流行期间儿童和青年的心理健康进行识别
Front Psychol. 2022 Jul 29;13:947856. doi: 10.3389/fpsyg.2022.947856. eCollection 2022.
4
A Simple and Effective Approach Based on a Multi-Level Feature Selection for Automated Parkinson's Disease Detection.一种基于多级特征选择的简单有效自动帕金森病检测方法。
J Pers Med. 2022 Jan 6;12(1):55. doi: 10.3390/jpm12010055.
5
ILRC: a hybrid biomarker discovery algorithm based on improved L1 regularization and clustering in microarray data.ILRC:一种基于改进的L1正则化和微阵列数据聚类的混合生物标志物发现算法。
BMC Bioinformatics. 2021 Oct 22;22(1):514. doi: 10.1186/s12859-021-04443-7.
6
Machine Learning Based Computational Gene Selection Models: A Survey, Performance Evaluation, Open Issues, and Future Research Directions.基于机器学习的计算基因选择模型:综述、性能评估、开放问题及未来研究方向
Front Genet. 2020 Dec 10;11:603808. doi: 10.3389/fgene.2020.603808. eCollection 2020.
7
Gene selection for microarray data classification via subspace learning and manifold regularization.基于子空间学习和流形正则化的基因选择在微阵列数据分类中的应用。
Med Biol Eng Comput. 2018 Jul;56(7):1271-1284. doi: 10.1007/s11517-017-1751-6. Epub 2017 Dec 19.