• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

从成像、序列和网络的异构数据中学习蛋白质亚细胞定位多视图模式。

Learning protein subcellular localization multi-view patterns from heterogeneous data of imaging, sequence and networks.

作者信息

Wang Ge, Xue Min-Qi, Shen Hong-Bin, Xu Ying-Ying

机构信息

School of Biomedical Engineering and Guangdong Provincial Key Laboratory of Medical Image Processing, Southern Medical University, Guangzhou 510515, China.

Guangdong Province Engineering Laboratory for Medical Imaging and Diagnostic Technology, Southern Medical University, Guangzhou 510515, China.

出版信息

Brief Bioinform. 2022 Mar 10;23(2). doi: 10.1093/bib/bbab539.

DOI:10.1093/bib/bbab539
PMID:35018423
Abstract

Location proteomics seeks to provide automated high-resolution descriptions of protein location patterns within cells. Many efforts have been undertaken in location proteomics over the past decades, thereby producing plenty of automated predictors for protein subcellular localization. However, most of these predictors are trained solely from high-throughput microscopic images or protein amino acid sequences alone. Unifying heterogeneous protein data sources has yet to be exploited. In this paper, we present a pipeline called sequence, image, network-based protein subcellular locator (SIN-Locator) that constructs a multi-view description of proteins by integrating multiple data types including images of protein expression in cells or tissues, amino acid sequences and protein-protein interaction networks, to classify the patterns of protein subcellular locations. Proteins were encoded by both handcrafted features and deep learning features, and multiple combining methods were implemented. Our experimental results indicated that optimal integrations can considerately enhance the classification accuracy, and the utility of SIN-Locator has been demonstrated through applying to new released proteins in the human protein atlas. Furthermore, we also investigate the contribution of different data sources and influence of partial absence of data. This work is anticipated to provide clues for reconciliation and combination of multi-source data for protein location analysis.

摘要

定位蛋白质组学旨在提供细胞内蛋白质定位模式的自动化高分辨率描述。在过去几十年里,定位蛋白质组学领域已经开展了许多工作,从而产生了大量用于蛋白质亚细胞定位的自动化预测工具。然而,这些预测工具大多仅基于高通量显微镜图像或蛋白质氨基酸序列进行训练。尚未对统一的异构蛋白质数据源进行开发利用。在本文中,我们提出了一种名为基于序列、图像、网络的蛋白质亚细胞定位器(SIN-Locator)的流程,该流程通过整合多种数据类型(包括细胞或组织中蛋白质表达的图像、氨基酸序列和蛋白质-蛋白质相互作用网络)来构建蛋白质的多视图描述,以对蛋白质亚细胞定位模式进行分类。蛋白质由手工特征和深度学习特征进行编码,并实施了多种组合方法。我们的实验结果表明,最优整合能够显著提高分类准确率,并且通过应用于人类蛋白质图谱中新发布的蛋白质,已证明了SIN-Locator的实用性。此外,我们还研究了不同数据源的贡献以及数据部分缺失的影响。这项工作有望为蛋白质定位分析多源数据的协调与组合提供线索。

相似文献

1
Learning protein subcellular localization multi-view patterns from heterogeneous data of imaging, sequence and networks.从成像、序列和网络的异构数据中学习蛋白质亚细胞定位多视图模式。
Brief Bioinform. 2022 Mar 10;23(2). doi: 10.1093/bib/bbab539.
2
Improving Protein Subcellular Location Classification by Incorporating Three-Dimensional Structure Information.通过整合三维结构信息来改进蛋白质亚细胞定位分类。
Biomolecules. 2021 Oct 29;11(11):1607. doi: 10.3390/biom11111607.
3
Incorporating label correlations into deep neural networks to classify protein subcellular location patterns in immunohistochemistry images.将标签相关性纳入深度神经网络以分类免疫组化图像中的蛋白质亚细胞位置模式。
Proteins. 2022 Feb;90(2):493-503. doi: 10.1002/prot.26244. Epub 2021 Oct 1.
4
HAR_Locator: a novel protein subcellular location prediction model of immunohistochemistry images based on hybrid attention modules and residual units.HAR定位器:一种基于混合注意力模块和残差单元的免疫组织化学图像新型蛋白质亚细胞定位预测模型
Front Mol Biosci. 2023 Aug 17;10:1171429. doi: 10.3389/fmolb.2023.1171429. eCollection 2023.
5
Boosting accuracy of automated classification of fluorescence microscope images for location proteomics.提高用于定位蛋白质组学的荧光显微镜图像自动分类的准确性。
BMC Bioinformatics. 2004 Jun 18;5:78. doi: 10.1186/1471-2105-5-78.
6
Location proteomics: systematic determination of protein subcellular location.定位蛋白质组学:蛋白质亚细胞定位的系统测定
Methods Mol Biol. 2009;500:313-32. doi: 10.1007/978-1-59745-525-1_11.
7
Bioimage-Based Prediction of Protein Subcellular Location in Human Tissue with Ensemble Features and Deep Networks.基于生物图像的人类组织中蛋白质亚细胞位置的集成特征和深度网络预测。
IEEE/ACM Trans Comput Biol Bioinform. 2020 Nov-Dec;17(6):1966-1980. doi: 10.1109/TCBB.2019.2917429. Epub 2020 Dec 8.
8
A framework for the automated analysis of subcellular patterns in human protein atlas images.人类蛋白质图谱图像中亚细胞模式自动分析框架。
J Proteome Res. 2008 Jun;7(6):2300-8. doi: 10.1021/pr7007626. Epub 2008 Apr 25.
9
Automated classification of protein subcellular localization in immunohistochemistry images to reveal biomarkers in colon cancer.利用免疫组织化学图像中的蛋白质亚细胞定位的自动分类来揭示结肠癌中的生物标志物。
BMC Bioinformatics. 2020 Sep 9;21(1):398. doi: 10.1186/s12859-020-03731-y.
10
Prediction of Protein Subcellular Localization Based on Fusion of Multi-view Features.基于多视图特征融合的蛋白质亚细胞定位预测。
Molecules. 2019 Mar 6;24(5):919. doi: 10.3390/molecules24050919.

引用本文的文献

1
Multi-modal contrastive learning of subcellular organization using DICE.基于 DICE 的亚细胞组织多模态对比学习。
Bioinformatics. 2024 Sep 1;40(Suppl 2):ii105-ii110. doi: 10.1093/bioinformatics/btae387.
2
TransGCN: a semi-supervised graph convolution network-based framework to infer protein translocations in spatio-temporal proteomics.TransGCN:一种基于半监督图卷积网络的框架,用于推断时空蛋白质组学中的蛋白质易位。
Brief Bioinform. 2024 Jan 22;25(2). doi: 10.1093/bib/bbae055.
3
Automated identification of protein expression intensity and classification of protein cellular locations in mouse brain regions from immunofluorescence images.
从免疫荧光图像中自动识别小鼠脑区的蛋白质表达强度和蛋白质细胞定位分类。
Med Biol Eng Comput. 2024 Apr;62(4):1105-1119. doi: 10.1007/s11517-023-02985-x. Epub 2023 Dec 27.
4
Dual-Signal Feature Spaces Map Protein Subcellular Locations Based on Immunohistochemistry Image and Protein Sequence.基于免疫组化图像和蛋白质序列的双信号特征空间映射蛋白质亚细胞定位。
Sensors (Basel). 2023 Nov 7;23(22):9014. doi: 10.3390/s23229014.
5
PScL-2LSAESM: bioimage-based prediction of protein subcellular localization by integrating heterogeneous features with the two-level SAE-SM and mean ensemble method.PScL-2LSAESM:基于生物图像的蛋白质亚细胞定位预测,通过整合异质特征与两级 SAE-SM 和均值集成方法。
Bioinformatics. 2023 Jan 1;39(1). doi: 10.1093/bioinformatics/btac727.