• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

半监督最大鉴别局部边缘的基因选择。

Semi-Supervised Maximum Discriminative Local Margin for Gene Selection.

机构信息

College of Information Science and Engineering, Hunan University, Changsha, Hunan, 410082, China.

School of Computer and Information Science, Hunan Institute of Technology, Hengyang, 412002, China.

出版信息

Sci Rep. 2018 Jun 5;8(1):8619. doi: 10.1038/s41598-018-26806-6.

DOI:10.1038/s41598-018-26806-6
PMID:29872069
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5988834/
Abstract

In the present study, we introduce a novel semi-supervised method called the semi-supervised maximum discriminative local margin (semiMM) for gene selection in expression data. The semiMM is a "filter" approach that exploits local structure, variance, and mutual information. We first constructed a local nearest neighbour graph and divided this information into within-class and between-class local nearest neighbour graphs by weighing the edge between the two data points. The semiMM aims to discover the most discriminative features for classification via maximizing the local margin between the within-class and between-class data, the variance of all data, and the mutual information of features with class labels. Experiments on five publicly available gene expression datasets revealed the effectiveness of the proposed method compared to three state-of-the-art feature selection algorithms.

摘要

在本研究中,我们引入了一种新的半监督方法,称为半监督最大判别局部边界(semiMM),用于表达数据中的基因选择。semiMM 是一种“过滤”方法,利用局部结构、方差和互信息。我们首先构建了一个局部最近邻图,并通过给两个数据点之间的边加权来将此信息分为类内和类间局部最近邻图。semiMM 的目的是通过最大化类内和类间数据之间的局部边界、所有数据的方差以及特征与类别标签的互信息,来发现最具判别力的分类特征。在五个公开可用的基因表达数据集上的实验表明,与三种最先进的特征选择算法相比,该方法是有效的。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e21/5988834/e6a9a7c5b51d/41598_2018_26806_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e21/5988834/b0e90d934935/41598_2018_26806_Figa_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e21/5988834/e6a9a7c5b51d/41598_2018_26806_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e21/5988834/b0e90d934935/41598_2018_26806_Figa_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8e21/5988834/e6a9a7c5b51d/41598_2018_26806_Fig3_HTML.jpg

相似文献

1
Semi-Supervised Maximum Discriminative Local Margin for Gene Selection.半监督最大鉴别局部边缘的基因选择。
Sci Rep. 2018 Jun 5;8(1):8619. doi: 10.1038/s41598-018-26806-6.
2
Gene Selection Using Locality Sensitive Laplacian Score.使用局部敏感拉普拉斯分数的基因选择
IEEE/ACM Trans Comput Biol Bioinform. 2014 Nov-Dec;11(6):1146-56. doi: 10.1109/TCBB.2014.2328334.
3
Discriminative semi-supervised feature selection via manifold regularization.基于流形正则化的判别式半监督特征选择
IEEE Trans Neural Netw. 2010 Jul;21(7):1033-47. doi: 10.1109/TNN.2010.2047114. Epub 2010 Jun 21.
4
Feature Subset Selection for Cancer Classification Using Weight Local Modularity.基于权重局部模块度的癌症分类特征子集选择
Sci Rep. 2016 Oct 5;6:34759. doi: 10.1038/srep34759.
5
Unified Simultaneous Clustering and Feature Selection for Unlabeled and Labeled Data.针对未标记和已标记数据的统一同步聚类与特征选择
IEEE Trans Neural Netw Learn Syst. 2018 Dec;29(12):6083-6098. doi: 10.1109/TNNLS.2018.2818444. Epub 2018 Apr 20.
6
Enhanced manifold regularization for semi-supervised classification.用于半监督分类的增强流形正则化
J Opt Soc Am A Opt Image Sci Vis. 2016 Jun 1;33(6):1207-13. doi: 10.1364/JOSAA.33.001207.
7
Pairwise Constraint-Guided Sparse Learning for Feature Selection.基于成对约束的稀疏学习特征选择。
IEEE Trans Cybern. 2016 Jan;46(1):298-310. doi: 10.1109/TCYB.2015.2401733. Epub 2015 Jul 6.
8
Sparse maximum margin discriminant analysis for feature extraction and gene selection on gene expression data.基于基因表达数据的稀疏最大间隔判别分析的特征提取和基因选择。
Comput Biol Med. 2013 Aug 1;43(7):933-41. doi: 10.1016/j.compbiomed.2013.04.018. Epub 2013 May 4.
9
Robust and efficient identification of biomarkers by classifying features on graphs.通过对图上的特征进行分类实现稳健且高效的生物标志物识别。
Bioinformatics. 2008 Sep 15;24(18):2023-9. doi: 10.1093/bioinformatics/btn383. Epub 2008 Jul 24.
10
Clinically driven semi-supervised class discovery in gene expression data.基因表达数据中基于临床驱动的半监督类别发现
Bioinformatics. 2008 Aug 15;24(16):i90-7. doi: 10.1093/bioinformatics/btn279.

引用本文的文献

1
A review on advancements in feature selection and feature extraction for high-dimensional NGS data analysis.一篇关于高通量测序数据分析中特征选择和特征提取进展的综述。
Funct Integr Genomics. 2024 Aug 19;24(5):139. doi: 10.1007/s10142-024-01415-x.
2
Machine Learning Based Computational Gene Selection Models: A Survey, Performance Evaluation, Open Issues, and Future Research Directions.基于机器学习的计算基因选择模型:综述、性能评估、开放问题及未来研究方向
Front Genet. 2020 Dec 10;11:603808. doi: 10.3389/fgene.2020.603808. eCollection 2020.
3
A Hybrid Gene Selection Method Based on ReliefF and Ant Colony Optimization Algorithm for Tumor Classification.

本文引用的文献

1
MicroRNAs and complex diseases: from experimental results to computational models.微小 RNA 与复杂疾病:从实验结果到计算模型。
Brief Bioinform. 2019 Mar 22;20(2):515-539. doi: 10.1093/bib/bbx130.
2
Multiple similarly effective solutions exist for biomedical feature selection and classification problems.对于生物医学特征选择和分类问题,存在多种效果相似的解决方案。
Sci Rep. 2017 Oct 9;7(1):12830. doi: 10.1038/s41598-017-13184-8.
3
Global network random walk for predicting potential human lncRNA-disease associations.基于全局网络随机游走的人类 lncRNA-疾病关联预测方法
基于 ReliefF 和蚁群优化算法的混合基因选择方法在肿瘤分类中的应用。
Sci Rep. 2019 Jun 20;9(1):8978. doi: 10.1038/s41598-019-45223-x.
Sci Rep. 2017 Sep 29;7(1):12442. doi: 10.1038/s41598-017-12763-z.
4
Elitist Binary Wolf Search Algorithm for Heuristic Feature Selection in High-Dimensional Bioinformatics Datasets.精英二元狼搜索算法在高维生物信息学数据集启发式特征选择中的应用。
Sci Rep. 2017 Jun 28;7(1):4354. doi: 10.1038/s41598-017-04037-5.
5
New multilocus linkage disequilibrium measure for tag SNP selection.用于标签单核苷酸多态性选择的新的多位点连锁不平衡度量方法。
J Bioinform Comput Biol. 2017 Feb;15(1):1750001. doi: 10.1142/S0219720017500019.
6
Drug Repositioning through Systematic Mining of Gene Coexpression Networks in Cancer.通过系统挖掘癌症基因共表达网络进行药物重新定位。
PLoS One. 2016 Nov 8;11(11):e0165059. doi: 10.1371/journal.pone.0165059. eCollection 2016.
7
Identification of apolipoprotein using feature selection technique.利用特征选择技术鉴定载脂蛋白。
Sci Rep. 2016 Jul 22;6:30441. doi: 10.1038/srep30441.
8
NLLSS: Predicting Synergistic Drug Combinations Based on Semi-supervised Learning.NLLSS:基于半监督学习预测协同药物组合
PLoS Comput Biol. 2016 Jul 14;12(7):e1004975. doi: 10.1371/journal.pcbi.1004975. eCollection 2016 Jul.
9
Long non-coding RNAs and complex diseases: from experimental results to computational models.长链非编码RNA与复杂疾病:从实验结果到计算模型
Brief Bioinform. 2017 Jul 1;18(4):558-576. doi: 10.1093/bib/bbw060.
10
Proteomic analysis of Plasmodium falciparum parasites from patients with cerebral and uncomplicated malaria.对患有脑型疟疾和非复杂性疟疾患者的恶性疟原虫进行蛋白质组学分析。
Sci Rep. 2016 Jun 1;6:26773. doi: 10.1038/srep26773.