结合多种聚类方法进行蛋白质结构预测。

Combining multiple clusterings for protein structure prediction.

作者信息

Sakar C Okan, Kursun Olcay, Seker Huseyin, Gurgen Fikret

出版信息

Int J Data Min Bioinform. 2014;10(2):162-74. doi: 10.1504/ijdmb.2014.064012.

DOI:10.1504/ijdmb.2014.064012

Abstract

Computational annotation and prediction of protein structure is very important in the post-genome era due to existence of many different proteins, most of which are yet to be verified. Mutual information based feature selection methods can be used in selecting such minimal yet predictive subsets of features. However, as protein features are organised into natural partitions, individual feature selection that ignores the presence of these views, dismantles them, and treats their variables intermixed along with those of others at best results in a complex un-interpretable predictive system for such multi-view datasets. In this paper, instead of selecting a subset of individual features, each feature subset is passed through a clustering step so that it is represented in discrete form using the cluster indices; this makes mutual information based methods applicable to view-selection. We present our experimental results on a multi-view protein dataset that are used to predict protein structure.

摘要

在后基因组时代，由于存在众多不同的蛋白质，且其中大多数尚未得到验证，蛋白质结构的计算注释和预测非常重要。基于互信息的特征选择方法可用于选择此类最小但具有预测性的特征子集。然而，由于蛋白质特征被组织成自然分区，忽略这些视图存在的单个特征选择会拆解它们，并将其变量与其他变量混合处理，这充其量会为这类多视图数据集产生一个复杂且难以解释的预测系统。在本文中，不是选择单个特征的子集，而是将每个特征子集经过聚类步骤，以便使用聚类索引以离散形式表示；这使得基于互信息的方法适用于视图选择。我们展示了在用于预测蛋白质结构的多视图蛋白质数据集上的实验结果。

相似文献

Combining multiple clusterings for protein structure prediction.结合多种聚类方法进行蛋白质结构预测。

Int J Data Min Bioinform. 2014;10(2):162-74. doi: 10.1504/ijdmb.2014.064012.

Prediction of protein structure classes with flexible neural tree.使用灵活神经树预测蛋白质结构类别。

Biomed Mater Eng. 2014;24(6):3797-806. doi: 10.3233/BME-141209.

Support vector machines for prediction of dihedral angle regions.用于预测二面角区域的支持向量机

Bioinformatics. 2006 Dec 15;22(24):3009-15. doi: 10.1093/bioinformatics/btl489. Epub 2006 Sep 27.

Adding some SPICE to DAS.给数据采集系统增添一些特色。

Bioinformatics. 2005 Sep 1;21 Suppl 2(Suppl 2):ii40-1. doi: 10.1093/bioinformatics/bti1106.

Probabilistic multi-class multi-kernel learning: on protein fold recognition and remote homology detection.概率多类多核学习：用于蛋白质折叠识别和远程同源性检测

Bioinformatics. 2008 May 15;24(10):1264-70. doi: 10.1093/bioinformatics/btn112. Epub 2008 Mar 31.

CHORAL: a differential geometry approach to the prediction of the cores of protein structures.CHORAL：一种用于预测蛋白质结构核心的微分几何方法。

Bioinformatics. 2005 Oct 1;21(19):3719-25. doi: 10.1093/bioinformatics/bti595. Epub 2005 Jul 26.

Modeling protein loops with knowledge-based prediction of sequence-structure alignment.基于知识的序列-结构比对预测对蛋白质环进行建模。

Bioinformatics. 2007 Nov 1;23(21):2836-42. doi: 10.1093/bioinformatics/btm456. Epub 2007 Sep 7.

Protein homology detection with biologically inspired features and interpretable statistical models.

Int J Data Min Bioinform. 2008;2(2):157-75. doi: 10.1504/ijdmb.2008.019096.

Prediction of protein structural class with Rough Sets.基于粗糙集的蛋白质结构类预测

BMC Bioinformatics. 2006 Jan 14;7:20. doi: 10.1186/1471-2105-7-20.

Improved method for predicting beta-turn using support vector machine.使用支持向量机预测β-转角的改进方法。

Bioinformatics. 2005 May 15;21(10):2370-4. doi: 10.1093/bioinformatics/bti358. Epub 2005 Mar 29.

引用本文的文献

Machine Learning-Based Radiomics for Prediction of Epidermal Growth Factor Receptor Mutations in Lung Adenocarcinoma.基于机器学习的放射组学预测肺腺癌表皮生长因子受体突变。

Dis Markers. 2022 May 7;2022:2056837. doi: 10.1155/2022/2056837. eCollection 2022.

Covid19-Mexican-Patients' Dataset (Covid19MPD) Classification and Prediction Using Feature Importance.使用特征重要性对新冠疫情墨西哥患者数据集（Covid19MPD）进行分类和预测

Concurr Comput. 2022 Feb 15;34(4):e6675. doi: 10.1002/cpe.6675. Epub 2021 Oct 16.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

结合多种聚类方法进行蛋白质结构预测。

Combining multiple clusterings for protein structure prediction.

作者信息

出版信息

相似文献

引用本文的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献