• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

Subspace learning using low-rank latent representation learning and perturbation theorem: Unsupervised gene selection.

作者信息

Moslemi Amir, Naeini Fariborz Baghaei

机构信息

Department of Physics, Toronto Metropolitan University, Ontario, Canada; School of Software Design & Data Science, Seneca Polytechnic, Toronto, ON, M4N 3M5, Canada; Physical Sciences, Sunnybrook Health Sciences Centre, Toronto, ON, M4N 3M5, Canada.

Faculty of Engineering, Computing and the Environment, Kingston University, Penrhyn Road Campus, Kingston Upon Thames, London, KT1 2EE, UK.

出版信息

Comput Biol Med. 2025 Feb;185:109567. doi: 10.1016/j.compbiomed.2024.109567. Epub 2024 Dec 14.

DOI:10.1016/j.compbiomed.2024.109567
PMID:39675215
Abstract

In recent years, gene expression data analysis has gained growing significance in the fields of machine learning and computational biology. Typically, microarray gene datasets exhibit a scenario where the number of features exceeds the number of samples, resulting in an ill-posed and underdetermined equation system. The presence of redundant features in high-dimensional data leads to suboptimal performance and increased computational time for learning algorithms. Although feature extraction and feature selection are two approaches that can be employed to deal with this challenge, feature selection has greater interpretability ability which causes it to receive more attention. In this study, we propose an unsupervised feature selection which is based on pseudo label latent representation learning and perturbation theory. In the first step, pseudo labels are extracted and constructed using latent representation learning. In the second step, the least square problem is solved for original data matrix and perturbed data matrix. Features are clustered based on the similarity between the original data matrix and the perturbed data matrix using k-means. In the last step, features in each subcluster are ranked based on information gain criterion. To showcase the efficacy of the proposed approach, numerical experiments were carried out on six benchmark microarray datasets and two RNA-Sequencing benchmark datasets. The outcomes indicate that the proposed technique surpasses eight state-of-the-art unsupervised feature selection methods in both clustering accuracy and normalized mutual information.

摘要

相似文献

1
Subspace learning using low-rank latent representation learning and perturbation theorem: Unsupervised gene selection.
Comput Biol Med. 2025 Feb;185:109567. doi: 10.1016/j.compbiomed.2024.109567. Epub 2024 Dec 14.
2
Dual regularized subspace learning using adaptive graph learning and rank constraint: Unsupervised feature selection on gene expression microarray datasets.基于自适应图学习和秩约束的双重正则化子空间学习:基因表达微阵列数据集上的无监督特征选择。
Comput Biol Med. 2023 Dec;167:107659. doi: 10.1016/j.compbiomed.2023.107659. Epub 2023 Nov 4.
3
Unsupervised feature selection via latent representation learning and manifold regularization.基于潜在表示学习和流形正则化的无监督特征选择。
Neural Netw. 2019 Sep;117:163-178. doi: 10.1016/j.neunet.2019.04.015. Epub 2019 May 22.
4
Hierarchical gene selection and genetic fuzzy system for cancer microarray data classification.用于癌症微阵列数据分类的分层基因选择与遗传模糊系统
PLoS One. 2015 Mar 30;10(3):e0120364. doi: 10.1371/journal.pone.0120364. eCollection 2015.
5
An Unsupervised Feature Extraction Approach Based on Self-Expression.基于自表达的无监督特征提取方法。
Big Data. 2023 Feb;11(1):18-34. doi: 10.1089/big.2021.0420. Epub 2022 May 10.
6
Feature weight estimation for gene selection: a local hyperlinear learning approach.特征权重估计在基因选择中的应用:一种局部超线性学习方法。
BMC Bioinformatics. 2014 Mar 14;15:70. doi: 10.1186/1471-2105-15-70.
7
Subspace learning using structure learning and non-convex regularization: Hybrid technique with mushroom reproduction optimization in gene selection.使用结构学习和非凸正则化的子空间学习:在基因选择中使用蘑菇繁殖优化的混合技术。
Comput Biol Med. 2023 Sep;164:107309. doi: 10.1016/j.compbiomed.2023.107309. Epub 2023 Jul 31.
8
CGRclust: Chaos Game Representation for twin contrastive clustering of unlabelled DNA sequences.CGRclust:用于未标记DNA序列双对比聚类的混沌游戏表示法
BMC Genomics. 2024 Dec 18;25(1):1214. doi: 10.1186/s12864-024-11135-y.
9
A general adaptive unsupervised feature selection with auto-weighting.一种具有自动加权的通用自适应无监督特征选择方法。
Neural Netw. 2025 Jan;181:106840. doi: 10.1016/j.neunet.2024.106840. Epub 2024 Oct 31.
10
Unsupervised feature selection based on variance-covariance subspace distance.基于方差协方差子空间距离的无监督特征选择
Neural Netw. 2023 Sep;166:188-203. doi: 10.1016/j.neunet.2023.06.018. Epub 2023 Jun 24.

引用本文的文献

1
Hybrid Feature Selection for Predicting Chemotherapy Response in Locally Advanced Breast Cancer Using Clinical and CT Radiomics Features: Integration of Matrix Rank and Genetic Algorithm.利用临床和CT影像组学特征进行局部晚期乳腺癌化疗反应预测的混合特征选择:矩阵秩与遗传算法的整合
Cancers (Basel). 2025 Aug 23;17(17):2738. doi: 10.3390/cancers17172738.