• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

结合网络和功能特征预测人类蛋白质亚细胞定位

Predicting Human Protein Subcellular Locations by Using a Combination of Network and Function Features.

作者信息

Chen Lei, Li ZhanDong, Zeng Tao, Zhang Yu-Hang, Zhang ShiQi, Huang Tao, Cai Yu-Dong

机构信息

School of Life Sciences, Shanghai University, Shanghai, China.

College of Information Engineering, Shanghai Maritime University, Shanghai, China.

出版信息

Front Genet. 2021 Nov 5;12:783128. doi: 10.3389/fgene.2021.783128. eCollection 2021.

DOI:10.3389/fgene.2021.783128
PMID:34804131
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8603309/
Abstract

Given the limitation of technologies, the subcellular localizations of proteins are difficult to identify. Predicting the subcellular localization and the intercellular distribution patterns of proteins in accordance with their specific biological roles, including validated functions, relationships with other proteins, and even their specific sequence characteristics, is necessary. The computational prediction of protein subcellular localizations can be performed on the basis of the sequence and the functional characteristics. In this study, the protein-protein interaction network, functional annotation of proteins and a group of direct proteins with known subcellular localization were used to construct models. To build efficient models, several powerful machine learning algorithms, including two feature selection methods, four classification algorithms, were employed. Some key proteins and functional terms were discovered, which may provide important contributions for determining protein subcellular locations. Furthermore, some quantitative rules were established to identify the potential subcellular localizations of proteins. As the first prediction model that uses direct protein annotation information (i.e., functional features) and STRING-based protein-protein interaction network (i.e., network features), our computational model can help promote the development of predictive technologies on subcellular localizations and provide a new approach for exploring the protein subcellular localization patterns and their potential biological importance.

摘要

鉴于技术的局限性,蛋白质的亚细胞定位很难确定。根据蛋白质的特定生物学作用,包括已验证的功能、与其他蛋白质的关系,甚至其特定的序列特征,预测蛋白质的亚细胞定位和细胞间分布模式是必要的。蛋白质亚细胞定位的计算预测可以基于序列和功能特征来进行。在本研究中,利用蛋白质-蛋白质相互作用网络、蛋白质功能注释以及一组已知亚细胞定位的直接蛋白质来构建模型。为了构建高效的模型,采用了几种强大的机器学习算法,包括两种特征选择方法和四种分类算法。发现了一些关键蛋白质和功能术语,这可能为确定蛋白质亚细胞定位提供重要贡献。此外,还建立了一些定量规则来识别蛋白质潜在的亚细胞定位。作为第一个使用直接蛋白质注释信息(即功能特征)和基于STRING的蛋白质-蛋白质相互作用网络(即网络特征)的预测模型,我们的计算模型有助于推动亚细胞定位预测技术的发展,并为探索蛋白质亚细胞定位模式及其潜在的生物学重要性提供一种新方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d57/8603309/828cb47ad1d5/fgene-12-783128-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d57/8603309/54ba1a3686b1/fgene-12-783128-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d57/8603309/48934066be8d/fgene-12-783128-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d57/8603309/9ca3567e99c7/fgene-12-783128-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d57/8603309/a2f012dc48a7/fgene-12-783128-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d57/8603309/853622c81a37/fgene-12-783128-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d57/8603309/828cb47ad1d5/fgene-12-783128-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d57/8603309/54ba1a3686b1/fgene-12-783128-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d57/8603309/48934066be8d/fgene-12-783128-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d57/8603309/9ca3567e99c7/fgene-12-783128-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d57/8603309/a2f012dc48a7/fgene-12-783128-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d57/8603309/853622c81a37/fgene-12-783128-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5d57/8603309/828cb47ad1d5/fgene-12-783128-g006.jpg

相似文献

1
Predicting Human Protein Subcellular Locations by Using a Combination of Network and Function Features.结合网络和功能特征预测人类蛋白质亚细胞定位
Front Genet. 2021 Nov 5;12:783128. doi: 10.3389/fgene.2021.783128. eCollection 2021.
2
Subcellular Localization Prediction of Human Proteins Using Multifeature Selection Methods.基于多特征选择方法的人类蛋白质亚细胞定位预测。
Biomed Res Int. 2022 Sep 12;2022:3288527. doi: 10.1155/2022/3288527. eCollection 2022.
3
FGsub: Fusarium graminearum protein subcellular localizations predicted from primary structures.FGsub:根据一级结构预测的禾谷镰刀菌蛋白质亚细胞定位
BMC Syst Biol. 2010 Sep 13;4 Suppl 2(Suppl 2):S12. doi: 10.1186/1752-0509-4-S2-S12.
4
Predicting protein subcellular location with network embedding and enrichment features.利用网络嵌入和富集特征预测蛋白质亚细胞定位。
Biochim Biophys Acta Proteins Proteom. 2020 Oct;1868(10):140477. doi: 10.1016/j.bbapap.2020.140477. Epub 2020 Jun 25.
5
Hum-mPLoc 3.0: prediction enhancement of human protein subcellular localization through modeling the hidden correlations of gene ontology and functional domain features.Hum-mPLoc 3.0:通过对基因本体和功能域特征的隐藏相关性进行建模来增强人类蛋白质亚细胞定位预测
Bioinformatics. 2017 Mar 15;33(6):843-853. doi: 10.1093/bioinformatics/btw723.
6
ProLoc-GO: utilizing informative Gene Ontology terms for sequence-based prediction of protein subcellular localization.ProLoc-GO:利用信息丰富的基因本体术语进行基于序列的蛋白质亚细胞定位预测。
BMC Bioinformatics. 2008 Feb 1;9:80. doi: 10.1186/1471-2105-9-80.
7
Identification of Protein Subcellular Localization With Network and Functional Embeddings.利用网络和功能嵌入识别蛋白质亚细胞定位
Front Genet. 2021 Jan 20;11:626500. doi: 10.3389/fgene.2020.626500. eCollection 2020.
8
Prediction of Protein Subcellular Localization Based on Fusion of Multi-view Features.基于多视图特征融合的蛋白质亚细胞定位预测。
Molecules. 2019 Mar 6;24(5):919. doi: 10.3390/molecules24050919.
9
Subcellular localization prediction for human internal and organelle membrane proteins with projected gene ontology scores.使用预测的基因本体评分对人类细胞内和细胞器膜蛋白进行亚细胞定位预测。
J Theor Biol. 2012 Nov 21;313:61-7. doi: 10.1016/j.jtbi.2012.08.016. Epub 2012 Aug 23.
10
Identify ncRNA Subcellular Localization via Graph Regularized k-Local Hyperplane Distance Nearest Neighbor Model on Multi-Kernel Learning.基于多核学习的图正则化k-局部超平面距离最近邻模型识别非编码RNA亚细胞定位
IEEE/ACM Trans Comput Biol Bioinform. 2022 Nov-Dec;19(6):3517-3529. doi: 10.1109/TCBB.2021.3107621. Epub 2022 Dec 8.

引用本文的文献

1
Artificial intelligence-driven reverse vaccinology for vaccine: Prioritizing epitope-based candidates.用于疫苗的人工智能驱动的反向疫苗学:基于表位的候选疫苗的优先级确定。
Front Mol Biosci. 2024 Aug 13;11:1442158. doi: 10.3389/fmolb.2024.1442158. eCollection 2024.
2
A Review for Artificial Intelligence Based Protein Subcellular Localization.基于人工智能的蛋白质亚细胞定位研究综述
Biomolecules. 2024 Mar 27;14(4):409. doi: 10.3390/biom14040409.
3
Identification of Protein-Protein Interaction Associated Functions Based on Gene Ontology.

本文引用的文献

1
iMPT-FDNPL: Identification of Membrane Protein Types with Functional Domains and a Natural Language Processing Approach.iMPT-FDNPL:基于功能域和自然语言处理方法识别膜蛋白类型。
Comput Math Methods Med. 2021 Oct 11;2021:7681497. doi: 10.1155/2021/7681497. eCollection 2021.
2
Identification of Protein Subcellular Localization With Network and Functional Embeddings.利用网络和功能嵌入识别蛋白质亚细胞定位
Front Genet. 2021 Jan 20;11:626500. doi: 10.3389/fgene.2020.626500. eCollection 2020.
3
Determining protein-protein functional associations by functional rules based on gene ontology and KEGG pathway.
基于基因本体论鉴定蛋白质-蛋白质相互作用相关功能。
Protein J. 2024 Jun;43(3):477-486. doi: 10.1007/s10930-024-10180-6. Epub 2024 Mar 4.
4
HMGB1 is involved in viral replication and the inflammatory response in coxsackievirus A16-infected 16HBE cells via proteomic analysis and identification.通过蛋白质组学分析和鉴定发现,HMGB1 参与柯萨奇病毒 A16 感染的 16HBE 细胞中的病毒复制和炎症反应。
Virol J. 2023 Aug 9;20(1):178. doi: 10.1186/s12985-023-02150-8.
5
Global quantitative proteomic analysis profiles of host protein expression in response to enterovirus A71 infection in bronchial epithelial cells based on tandem mass tag (TMT) peptide labeling coupled with LC-MS/MS uncovers the key role of proteasome in virus replication.基于串联质量标签 (TMT) 肽标记与 LC-MS/MS 的联合应用,对支气管上皮细胞中肠道病毒 A71 感染后宿主蛋白表达的全球定量蛋白质组学分析图谱进行了研究,结果揭示了蛋白酶体在病毒复制中的关键作用。
Virus Res. 2023 Jun;330:199118. doi: 10.1016/j.virusres.2023.199118. Epub 2023 Apr 18.
6
Recent Advances in the Prediction of Subcellular Localization of Proteins and Related Topics.蛋白质亚细胞定位预测及相关主题的最新进展
Front Bioinform. 2022 May 19;2:910531. doi: 10.3389/fbinf.2022.910531. eCollection 2022.
基于基因本体论和 KEGG 通路的功能规则确定蛋白质-蛋白质功能关联。
Biochim Biophys Acta Proteins Proteom. 2021 Jun;1869(6):140621. doi: 10.1016/j.bbapap.2021.140621. Epub 2021 Feb 6.
4
Detecting the Multiomics Signatures of Factor-Specific Inflammatory Effects on Airway Smooth Muscles.检测因子特异性炎症对气道平滑肌影响的多组学特征
Front Genet. 2021 Jan 13;11:599970. doi: 10.3389/fgene.2020.599970. eCollection 2020.
5
Identifying Transcriptomic Signatures and Rules for SARS-CoV-2 Infection.确定新型冠状病毒感染的转录组特征和规律
Front Cell Dev Biol. 2021 Jan 11;8:627302. doi: 10.3389/fcell.2020.627302. eCollection 2020.
6
iATC-FRAKEL: a simple multi-label web server for recognizing anatomical therapeutic chemical classes of drugs with their fingerprints only.iATC-FRAKEL:一个简单的多标签网络服务器,仅使用药物的指纹识别其解剖治疗化学类别。
Bioinformatics. 2020 Jun 1;36(11):3568-3569. doi: 10.1093/bioinformatics/btaa166.
7
Human protein paucimannosylation: cues from the eukaryotic kingdoms.人源寡甘露糖基化蛋白:真核生物王国的线索。
Biol Rev Camb Philos Soc. 2019 Dec;94(6):2068-2100. doi: 10.1111/brv.12548. Epub 2019 Aug 14.
8
Mitochondrial Disorders of the Nervous System: A Review.神经系统的线粒体疾病:综述
Clin Pediatr (Phila). 2019 Apr;58(4):381-394. doi: 10.1177/0009922818821890. Epub 2019 Jan 4.
9
miR-378 suppresses the proliferation, migration and invasion of colon cancer cells by inhibiting SDAD1.微小RNA-378通过抑制SDAD1抑制结肠癌细胞的增殖、迁移和侵袭。
Cell Mol Biol Lett. 2017 Jul 17;22:12. doi: 10.1186/s11658-017-0041-5. eCollection 2017.
10
A subcellular map of the human proteome.人类蛋白质组的亚细胞图谱。
Science. 2017 May 26;356(6340). doi: 10.1126/science.aal3321. Epub 2017 May 11.