• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种用于预测蛋白质结构类别的新型特征选择方法。

A novel feature selection method to predict protein structural class.

机构信息

Department of Automation, Xiamen University, Xiamen 361005, Fujian, China; School of Information Technology, York University, Toronto M3J 1P3, Canada.

School of Information Technology, York University, Toronto M3J 1P3, Canada.

出版信息

Comput Biol Chem. 2018 Oct;76:118-129. doi: 10.1016/j.compbiolchem.2018.06.007. Epub 2018 Jul 2.

DOI:10.1016/j.compbiolchem.2018.06.007
PMID:29990791
Abstract

Integrating various features from different protein properties helps to improve the prediction accuracy of protein structural class but need to deal with the corresponding integrated high-dimensional data. Thus, the feature selection process used to select the informative features from the integrated features also becomes an indispensable key step. This paper proposes a novel feature selection method, Partial-Maximum-Correlation-Information based Recursive Feature Elimination (PMCI-RFE), to quickly select the best feature subset from the integrated high-dimensional protein features set to improve the prediction performance of protein structural class. PMCI-RFE can also be used to find different types of informative features to further analyze some biological relationships. The proposed PMCI-RFE method uses the correlation information between the feature space and class encoding space to select informative features based on the idea of orthogonal component projection in the feature space. The experimental results on six widely used benchmark datasets show that PMCI-RFE is a fast and effective method compare to other four state-of-the-art feature selection methods, which indeed can make full use of different protein property information and improve the predictability of protein structural class.

摘要

整合来自不同蛋白质特性的各种特征有助于提高蛋白质结构类别预测的准确性,但需要处理相应的集成高维数据。因此,用于从集成特征中选择信息特征的特征选择过程也成为一个不可或缺的关键步骤。本文提出了一种新的特征选择方法,基于偏最大相关信息的递归特征消除(PMCI-RFE),用于从集成的高维蛋白质特征集中快速选择最佳特征子集,以提高蛋白质结构类别预测的性能。PMCI-RFE 还可以用于寻找不同类型的信息特征,以进一步分析一些生物学关系。所提出的 PMCI-RFE 方法基于特征空间中正交分量投影的思想,利用特征空间和类别编码空间之间的相关信息来选择信息特征。在六个广泛使用的基准数据集上的实验结果表明,与其他四种最先进的特征选择方法相比,PMCI-RFE 是一种快速有效的方法,它确实可以充分利用不同的蛋白质特性信息,提高蛋白质结构类别的可预测性。

相似文献

1
A novel feature selection method to predict protein structural class.一种用于预测蛋白质结构类别的新型特征选择方法。
Comput Biol Chem. 2018 Oct;76:118-129. doi: 10.1016/j.compbiolchem.2018.06.007. Epub 2018 Jul 2.
2
PSSP-RFE: accurate prediction of protein structural class by recursive feature extraction from PSI-BLAST profile, physical-chemical property and functional annotations.PSSP-RFE:通过从PSI-BLAST序列谱、物理化学性质和功能注释中进行递归特征提取来准确预测蛋白质结构类别。
PLoS One. 2014 Mar 27;9(3):e92863. doi: 10.1371/journal.pone.0092863. eCollection 2014.
3
A highly accurate protein structural class prediction approach using auto cross covariance transformation and recursive feature elimination.一种使用自动交叉协方差变换和递归特征消除的高精度蛋白质结构类别预测方法。
Comput Biol Chem. 2015 Dec;59 Pt A:95-100. doi: 10.1016/j.compbiolchem.2015.08.012. Epub 2015 Sep 2.
4
Decision Variants for the Automatic Determination of Optimal Feature Subset in RF-RFE.用于在随机森林-递归特征消除中自动确定最优特征子集的决策变体
Genes (Basel). 2018 Jun 15;9(6):301. doi: 10.3390/genes9060301.
5
Effective hybrid feature selection using different bootstrap enhances cancers classification performance.使用不同的自助法进行有效的混合特征选择可提高癌症分类性能。
BioData Min. 2022 Sep 30;15(1):24. doi: 10.1186/s13040-022-00304-y.
6
A Feature and Algorithm Selection Method for Improving the Prediction of Protein Structural Class.一种用于改进蛋白质结构类预测的特征与算法选择方法
Comb Chem High Throughput Screen. 2017;20(7):612-621. doi: 10.2174/1386207320666170314103147.
7
MinE-RFE: determine the optimal subset from RFE by minimizing the subset-accuracy-defined energy.MinE-RFE:通过最小化子集精度定义的能量来确定 RFE 中的最优子集。
Brief Bioinform. 2020 Mar 23;21(2):687-698. doi: 10.1093/bib/bbz021.
8
Prediction of protein structural class using tri-gram probabilities of position-specific scoring matrix and recursive feature elimination.利用位置特异性评分矩阵的三元概率和递归特征消除预测蛋白质结构类别。
Amino Acids. 2015 Mar;47(3):461-8. doi: 10.1007/s00726-014-1878-9. Epub 2015 Jan 13.
9
Recursive gene selection based on maximum margin criterion: a comparison with SVM-RFE.基于最大间隔准则的递归基因选择:与支持向量机递归特征消除法的比较
BMC Bioinformatics. 2006 Dec 25;7:543. doi: 10.1186/1471-2105-7-543.
10
Prediction of Protein Structural Class Based on Gapped-Dipeptides and a Recursive Feature Selection Approach.基于带间隙二肽和递归特征选择方法的蛋白质结构类预测
Int J Mol Sci. 2015 Dec 24;17(1):15. doi: 10.3390/ijms17010015.

引用本文的文献

1
Feature selection with the Fisher score followed by the Maximal Clique Centrality algorithm can accurately identify the hub genes of hepatocellular carcinoma.基于 Fisher 得分的特征选择,再结合最大团中心度算法,可以准确识别肝细胞癌的枢纽基因。
Sci Rep. 2019 Nov 21;9(1):17283. doi: 10.1038/s41598-019-53471-0.