• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于免疫组化图像和蛋白质序列的双信号特征空间映射蛋白质亚细胞定位。

Dual-Signal Feature Spaces Map Protein Subcellular Locations Based on Immunohistochemistry Image and Protein Sequence.

机构信息

School of Communications and Electronics, Jiangxi Science and Technology Normal University, Nanchang 330038, China.

School of Computer Science and Technology, Huazhong University of Science and Technology, Wuhan 430074, China.

出版信息

Sensors (Basel). 2023 Nov 7;23(22):9014. doi: 10.3390/s23229014.

DOI:10.3390/s23229014
PMID:38005402
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10675401/
Abstract

Protein is one of the primary biochemical macromolecular regulators in the compartmental cellular structure, and the subcellular locations of proteins can therefore provide information on the function of subcellular structures and physiological environments. Recently, data-driven systems have been developed to predict the subcellular location of proteins based on protein sequence, immunohistochemistry (IHC) images, or immunofluorescence (IF) images. However, the research on the fusion of multiple protein signals has received little attention. In this study, we developed a dual-signal computational protocol by incorporating IHC images into protein sequences to learn protein subcellular localization. Three major steps can be summarized as follows in this protocol: first, a benchmark database that includes 281 proteins sorted out from 4722 proteins of the Human Protein Atlas (HPA) and Swiss-Prot database, which is involved in the endoplasmic reticulum (ER), Golgi apparatus, cytosol, and nucleoplasm; second, discriminative feature operators were first employed to quantitate protein image-sequence samples that include IHC images and protein sequence; finally, the feature subspace of different protein signals is absorbed to construct multiple sub-classifiers via dimensionality reduction and binary relevance (BR), and multiple confidence derived from multiple sub-classifiers is adopted to decide subcellular location by the centralized voting mechanism at the decision layer. The experimental results indicated that the dual-signal model embedded IHC images and protein sequences outperformed the single-signal models with accuracy, precision, and recall of 75.41%, 80.38%, and 74.38%, respectively. It is enlightening for further research on protein subcellular location prediction under multi-signal fusion of protein.

摘要

蛋白质是细胞结构区室化的主要生物化学大分子调控因子之一,因此蛋白质的亚细胞定位可以提供有关亚细胞结构和生理环境功能的信息。最近,已经开发了基于蛋白质序列、免疫组织化学(IHC)图像或免疫荧光(IF)图像来预测蛋白质亚细胞定位的数据驱动系统。然而,关于融合多种蛋白质信号的研究却很少受到关注。在这项研究中,我们通过将 IHC 图像纳入蛋白质序列中,开发了一种双信号计算方案,用于学习蛋白质亚细胞定位。该方案可以概括为以下三个主要步骤:首先,从人类蛋白质图谱(HPA)和 Swiss-Prot 数据库的 4722 种蛋白质中筛选出 281 种蛋白质,建立一个基准数据库,这些蛋白质涉及内质网(ER)、高尔基体、细胞质和核质;其次,首先使用判别特征算子对包括 IHC 图像和蛋白质序列的蛋白质图像序列样本进行量化;最后,通过降维和二进制相关性(BR)吸收不同蛋白质信号的特征子空间,通过决策层的集中投票机制,利用多个子分类器的多个置信度来决定亚细胞位置。实验结果表明,嵌入 IHC 图像和蛋白质序列的双信号模型在准确性、精度和召回率方面均优于单信号模型,分别为 75.41%、80.38%和 74.38%。这对于进一步研究蛋白质在多信号融合下的亚细胞定位预测具有启发意义。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb47/10675401/5bab26db449c/sensors-23-09014-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb47/10675401/1aa36b2f8ad4/sensors-23-09014-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb47/10675401/ae63ecc94601/sensors-23-09014-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb47/10675401/cd8f17b70a09/sensors-23-09014-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb47/10675401/15419693d33c/sensors-23-09014-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb47/10675401/f5122515990f/sensors-23-09014-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb47/10675401/7281c9a97e85/sensors-23-09014-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb47/10675401/5bab26db449c/sensors-23-09014-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb47/10675401/1aa36b2f8ad4/sensors-23-09014-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb47/10675401/ae63ecc94601/sensors-23-09014-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb47/10675401/cd8f17b70a09/sensors-23-09014-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb47/10675401/15419693d33c/sensors-23-09014-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb47/10675401/f5122515990f/sensors-23-09014-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb47/10675401/7281c9a97e85/sensors-23-09014-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cb47/10675401/5bab26db449c/sensors-23-09014-g007.jpg

相似文献

1
Dual-Signal Feature Spaces Map Protein Subcellular Locations Based on Immunohistochemistry Image and Protein Sequence.基于免疫组化图像和蛋白质序列的双信号特征空间映射蛋白质亚细胞定位。
Sensors (Basel). 2023 Nov 7;23(22):9014. doi: 10.3390/s23229014.
2
Hum-PLoc: a novel ensemble classifier for predicting human protein subcellular localization.Hum-PLoc:一种用于预测人类蛋白质亚细胞定位的新型集成分类器。
Biochem Biophys Res Commun. 2006 Aug 18;347(1):150-7. doi: 10.1016/j.bbrc.2006.06.059. Epub 2006 Jun 21.
3
Prediction of protein subcellular locations by support vector machines using compositions of amino acids and amino acid pairs.利用氨基酸组成和氨基酸对,通过支持向量机预测蛋白质亚细胞定位。
Bioinformatics. 2003 Sep 1;19(13):1656-63. doi: 10.1093/bioinformatics/btg222.
4
HAR_Locator: a novel protein subcellular location prediction model of immunohistochemistry images based on hybrid attention modules and residual units.HAR定位器:一种基于混合注意力模块和残差单元的免疫组织化学图像新型蛋白质亚细胞定位预测模型
Front Mol Biosci. 2023 Aug 17;10:1171429. doi: 10.3389/fmolb.2023.1171429. eCollection 2023.
5
Supervised learning method for the prediction of subcellular localization of proteins using amino acid and amino acid pair composition.使用氨基酸和氨基酸对组成预测蛋白质亚细胞定位的监督学习方法。
BMC Genomics. 2008;9 Suppl 1(Suppl 1):S16. doi: 10.1186/1471-2164-9-S1-S16.
6
Euk-PLoc: an ensemble classifier for large-scale eukaryotic protein subcellular location prediction.Euk-PLoc:一种用于大规模真核生物蛋白质亚细胞定位预测的集成分类器。
Amino Acids. 2007 Jul;33(1):57-67. doi: 10.1007/s00726-006-0478-8. Epub 2007 Jan 19.
7
Predicting eukaryotic protein subcellular location by fusing optimized evidence-theoretic K-Nearest Neighbor classifiers.通过融合优化的证据理论K近邻分类器预测真核生物蛋白质亚细胞定位
J Proteome Res. 2006 Aug;5(8):1888-97. doi: 10.1021/pr060167c.
8
CE-PLoc: an ensemble classifier for predicting protein subcellular locations by fusing different modes of pseudo amino acid composition.CE-PLoc:一种通过融合不同模式的伪氨基酸组成来预测蛋白质亚细胞位置的集成分类器。
Comput Biol Chem. 2011 Aug 10;35(4):218-29. doi: 10.1016/j.compbiolchem.2011.05.003. Epub 2011 May 27.
9
Multi-label learning with fuzzy hypergraph regularization for protein subcellular location prediction.基于模糊超图正则化的多标签学习用于蛋白质亚细胞定位预测
IEEE Trans Nanobioscience. 2014 Dec;13(4):438-47. doi: 10.1109/TNB.2014.2341111. Epub 2014 Jul 31.
10
iLoc-Hum: using the accumulation-label scale to predict subcellular locations of human proteins with both single and multiple sites.iLoc-Hum:利用累积标签量表预测具有单一位点和多个位点的人类蛋白质的亚细胞定位。
Mol Biosyst. 2012 Feb;8(2):629-41. doi: 10.1039/c1mb05420a. Epub 2011 Dec 1.

引用本文的文献

1
A Review for Artificial Intelligence Based Protein Subcellular Localization.基于人工智能的蛋白质亚细胞定位研究综述
Biomolecules. 2024 Mar 27;14(4):409. doi: 10.3390/biom14040409.

本文引用的文献

1
Recent Advances in the Prediction of Subcellular Localization of Proteins and Related Topics.蛋白质亚细胞定位预测及相关主题的最新进展
Front Bioinform. 2022 May 19;2:910531. doi: 10.3389/fbinf.2022.910531. eCollection 2022.
2
Multi-scale deep learning for the imbalanced multi-label protein subcellular localization prediction based on immunohistochemistry images.基于免疫组化图像的不平衡多标签蛋白质亚细胞定位预测的多尺度深度学习。
Bioinformatics. 2022 Apr 28;38(9):2602-2611. doi: 10.1093/bioinformatics/btac123.
3
Multiple Protein Subcellular Locations Prediction Based on Deep Convolutional Neural Networks with Self-Attention Mechanism.
基于具有自注意力机制的深度卷积神经网络的多种蛋白质亚细胞定位预测。
Interdiscip Sci. 2022 Jun;14(2):421-438. doi: 10.1007/s12539-021-00496-7. Epub 2022 Jan 23.
4
Learning protein subcellular localization multi-view patterns from heterogeneous data of imaging, sequence and networks.从成像、序列和网络的异构数据中学习蛋白质亚细胞定位多视图模式。
Brief Bioinform. 2022 Mar 10;23(2). doi: 10.1093/bib/bbab539.
5
Subcellular Transcriptomics and Proteomics: A Comparative Methods Review.亚细胞转录组学和蛋白质组学:比较方法综述。
Mol Cell Proteomics. 2022 Feb;21(2):100186. doi: 10.1016/j.mcpro.2021.100186. Epub 2021 Dec 16.
6
DULoc: quantitatively unmixing protein subcellular location patterns in immunofluorescence images based on deep learning features.DULoc:基于深度学习特征对免疫荧光图像中的蛋白质亚细胞定位模式进行定量解混。
Bioinformatics. 2022 Jan 12;38(3):827-833. doi: 10.1093/bioinformatics/btab730.
7
Predicting subcellular location of protein with evolution information and sequence-based deep learning.利用进化信息和基于序列的深度学习预测蛋白质的亚细胞定位。
BMC Bioinformatics. 2021 Oct 22;22(Suppl 10):515. doi: 10.1186/s12859-021-04404-0.
8
Incorporating label correlations into deep neural networks to classify protein subcellular location patterns in immunohistochemistry images.将标签相关性纳入深度神经网络以分类免疫组化图像中的蛋白质亚细胞位置模式。
Proteins. 2022 Feb;90(2):493-503. doi: 10.1002/prot.26244. Epub 2021 Oct 1.
9
PScL-HDeep: image-based prediction of protein subcellular location in human tissue using ensemble learning of handcrafted and deep learned features with two-layer feature selection.PScL-HDeep:基于图像的人类组织蛋白亚细胞定位预测,使用基于手工和深度学习特征的两层特征选择的集成学习方法。
Brief Bioinform. 2021 Nov 5;22(6). doi: 10.1093/bib/bbab278.
10
Self-evoluting framework of deep convolutional neural network for multilocus protein subcellular localization.用于多定位点蛋白质亚细胞定位的深度卷积神经网络的自进化框架。
Med Biol Eng Comput. 2020 Dec;58(12):3017-3038. doi: 10.1007/s11517-020-02275-w. Epub 2020 Oct 20.