结合网络和功能特征预测人类蛋白质亚细胞定位

Predicting Human Protein Subcellular Locations by Using a Combination of Network and Function Features.

作者信息

Chen Lei, Li ZhanDong, Zeng Tao, Zhang Yu-Hang, Zhang ShiQi, Huang Tao, Cai Yu-Dong

机构信息

School of Life Sciences, Shanghai University, Shanghai, China.

College of Information Engineering, Shanghai Maritime University, Shanghai, China.

出版信息

Front Genet. 2021 Nov 5;12:783128. doi: 10.3389/fgene.2021.783128. eCollection 2021.

DOI:10.3389/fgene.2021.783128

PMID:34804131

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8603309/

Abstract

Given the limitation of technologies, the subcellular localizations of proteins are difficult to identify. Predicting the subcellular localization and the intercellular distribution patterns of proteins in accordance with their specific biological roles, including validated functions, relationships with other proteins, and even their specific sequence characteristics, is necessary. The computational prediction of protein subcellular localizations can be performed on the basis of the sequence and the functional characteristics. In this study, the protein-protein interaction network, functional annotation of proteins and a group of direct proteins with known subcellular localization were used to construct models. To build efficient models, several powerful machine learning algorithms, including two feature selection methods, four classification algorithms, were employed. Some key proteins and functional terms were discovered, which may provide important contributions for determining protein subcellular locations. Furthermore, some quantitative rules were established to identify the potential subcellular localizations of proteins. As the first prediction model that uses direct protein annotation information (i.e., functional features) and STRING-based protein-protein interaction network (i.e., network features), our computational model can help promote the development of predictive technologies on subcellular localizations and provide a new approach for exploring the protein subcellular localization patterns and their potential biological importance.

摘要

鉴于技术的局限性，蛋白质的亚细胞定位很难确定。根据蛋白质的特定生物学作用，包括已验证的功能、与其他蛋白质的关系，甚至其特定的序列特征，预测蛋白质的亚细胞定位和细胞间分布模式是必要的。蛋白质亚细胞定位的计算预测可以基于序列和功能特征来进行。在本研究中，利用蛋白质-蛋白质相互作用网络、蛋白质功能注释以及一组已知亚细胞定位的直接蛋白质来构建模型。为了构建高效的模型，采用了几种强大的机器学习算法，包括两种特征选择方法和四种分类算法。发现了一些关键蛋白质和功能术语，这可能为确定蛋白质亚细胞定位提供重要贡献。此外，还建立了一些定量规则来识别蛋白质潜在的亚细胞定位。作为第一个使用直接蛋白质注释信息（即功能特征）和基于STRING的蛋白质-蛋白质相互作用网络（即网络特征）的预测模型，我们的计算模型有助于推动亚细胞定位预测技术的发展，并为探索蛋白质亚细胞定位模式及其潜在的生物学重要性提供一种新方法。

Suppr 超能文献

文献检索

文件翻译

深度研究

Suppr 超能文献

文献检索

文件翻译

深度研究

结合网络和功能特征预测人类蛋白质亚细胞定位

Predicting Human Protein Subcellular Locations by Using a Combination of Network and Function Features.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

结合网络和功能特征预测人类蛋白质亚细胞定位

Predicting Human Protein Subcellular Locations by Using a Combination of Network and Function Features.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献