• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种基于类别不平衡感知的 Relief 算法,用于使用微阵列基因表达数据进行肿瘤分类。

A class imbalance-aware Relief algorithm for the classification of tumors using microarray gene expression data.

机构信息

College of Information Science and Engineering Hunan University Changsha, China.

College of Information Science and Engineering Hunan University Changsha, China.

出版信息

Comput Biol Chem. 2019 Jun;80:121-127. doi: 10.1016/j.compbiolchem.2019.03.017. Epub 2019 Mar 24.

DOI:10.1016/j.compbiolchem.2019.03.017
PMID:30947070
Abstract

DNA microarray data has been widely used in cancer research due to the significant advantage helped to successfully distinguish between tumor classes. However, typical gene expression data usually presents a high-dimensional imbalanced characteristic, which poses severe challenge for traditional machine learning methods to construct a robust classifier performing well on both the minority and majority classes. As one of the most successful feature weighting techniques, Relief is considered to particularly suit to handle high-dimensional problems. Unfortunately, almost all relief-based methods have not taken the class imbalance distribution into account. This study identifies that existing Relief-based algorithms may underestimate the features with the discernibility ability of minority classes, and ignore the distribution characteristic of minority class samples. As a result, an additional bias towards being classified into the majority classes can be introduced. To this end, a new method, named imRelief, is proposed for efficiently handling high-dimensional imbalanced gene expression data. imRelief can correct the bias towards to the majority classes, and consider the scattered distributional characteristic of minority class samples in the process of estimating feature weights. This way, imRelief has the ability to reward the features which perform well at separating the minority classes from other classes. Experiments on four microarray gene expression data sets demonstrate the effectiveness of imRelief in both feature weighting and feature subset selection applications.

摘要

DNA 微阵列数据由于在成功区分肿瘤类别方面具有显著优势,因此在癌症研究中得到了广泛应用。然而,典型的基因表达数据通常呈现出高维不平衡的特征,这对传统的机器学习方法构建在少数类和多数类上都能很好地执行的稳健分类器构成了严峻挑战。作为最成功的特征加权技术之一,Relief 被认为特别适合处理高维问题。不幸的是,几乎所有基于 Relief 的方法都没有考虑到类不平衡分布。本研究发现,现有的基于 Relief 的算法可能低估了具有少数类可辨别能力的特征,并且忽略了少数类样本的分布特征。结果,可能会引入对分类为多数类的额外偏差。为此,提出了一种名为 imRelief 的新方法,用于有效地处理高维不平衡基因表达数据。imRelief 可以纠正偏向多数类的偏差,并在估计特征权重的过程中考虑少数类样本的分散分布特征。这样,imRelief 就有能力奖励那些在区分少数类和其他类方面表现良好的特征。在四个微阵列基因表达数据集上的实验表明,imRelief 在特征加权和特征子集选择应用中都具有有效性。

相似文献

1
A class imbalance-aware Relief algorithm for the classification of tumors using microarray gene expression data.一种基于类别不平衡感知的 Relief 算法,用于使用微阵列基因表达数据进行肿瘤分类。
Comput Biol Chem. 2019 Jun;80:121-127. doi: 10.1016/j.compbiolchem.2019.03.017. Epub 2019 Mar 24.
2
C-HMOSHSSA: Gene selection for cancer classification using multi-objective meta-heuristic and machine learning methods.C-HMOSHSSA:使用多目标元启发式和机器学习方法进行癌症分类的基因选择。
Comput Methods Programs Biomed. 2019 Sep;178:219-235. doi: 10.1016/j.cmpb.2019.06.029. Epub 2019 Jun 29.
3
The feature selection bias problem in relation to high-dimensional gene data.与高维基因数据相关的特征选择偏差问题。
Artif Intell Med. 2016 Jan;66:63-71. doi: 10.1016/j.artmed.2015.11.001. Epub 2015 Nov 14.
4
Feature weight estimation for gene selection: a local hyperlinear learning approach.特征权重估计在基因选择中的应用:一种局部超线性学习方法。
BMC Bioinformatics. 2014 Mar 14;15:70. doi: 10.1186/1471-2105-15-70.
5
Class prediction for high-dimensional class-imbalanced data.高维类别不平衡数据的类别预测。
BMC Bioinformatics. 2010 Oct 20;11:523. doi: 10.1186/1471-2105-11-523.
6
Chaotic genetic algorithm for gene selection and classification problems.用于基因选择与分类问题的混沌遗传算法。
OMICS. 2009 Oct;13(5):407-20. doi: 10.1089/omi.2009.0007.
7
Stable gene selection from microarray data via sample weighting.基于样本加权的基因芯片数据中稳定基因的选择。
IEEE/ACM Trans Comput Biol Bioinform. 2012 Jan-Feb;9(1):262-72. doi: 10.1109/TCBB.2011.47. Epub 2011 Mar 3.
8
A centroid-based gene selection method for microarray data classification.一种基于质心的微阵列数据分类基因选择方法。
J Theor Biol. 2016 Jul 7;400:32-41. doi: 10.1016/j.jtbi.2016.03.034. Epub 2016 Apr 4.
9
Tumor classification ranking from microarray data.基于微阵列数据的肿瘤分类排名
BMC Genomics. 2008 Sep 16;9 Suppl 2(Suppl 2):S21. doi: 10.1186/1471-2164-9-S2-S21.
10
Class-imbalanced classifiers for high-dimensional data.高维数据的不平衡分类器。
Brief Bioinform. 2013 Jan;14(1):13-26. doi: 10.1093/bib/bbs006. Epub 2012 Mar 9.

引用本文的文献

1
Navigating the microarray landscape: a comprehensive review of feature selection techniques and their applications.探索微阵列领域:特征选择技术及其应用的全面综述
Front Big Data. 2025 Jul 10;8:1624507. doi: 10.3389/fdata.2025.1624507. eCollection 2025.