• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

相似文献

1
PScL-HDeep: image-based prediction of protein subcellular location in human tissue using ensemble learning of handcrafted and deep learned features with two-layer feature selection.PScL-HDeep:基于图像的人类组织蛋白亚细胞定位预测,使用基于手工和深度学习特征的两层特征选择的集成学习方法。
Brief Bioinform. 2021 Nov 5;22(6). doi: 10.1093/bib/bbab278.
2
PScL-DDCFPred: an ensemble deep learning-based approach for characterizing multiclass subcellular localization of human proteins from bioimage data.PScL-DDCFPred:一种基于集成深度学习的方法,用于从生物图像数据中描述人类蛋白质的多类亚细胞定位。
Bioinformatics. 2022 Aug 10;38(16):4019-4026. doi: 10.1093/bioinformatics/btac432.
3
PScL-2LSAESM: bioimage-based prediction of protein subcellular localization by integrating heterogeneous features with the two-level SAE-SM and mean ensemble method.PScL-2LSAESM:基于生物图像的蛋白质亚细胞定位预测,通过整合异质特征与两级 SAE-SM 和均值集成方法。
Bioinformatics. 2023 Jan 1;39(1). doi: 10.1093/bioinformatics/btac727.
4
Computer-assisted lip diagnosis on Traditional Chinese Medicine using multi-class support vector machines.基于多类支持向量机的中医唇诊计算机辅助诊断。
BMC Complement Altern Med. 2012 Aug 16;12:127. doi: 10.1186/1472-6882-12-127.
5
Combining handcrafted features with latent variables in machine learning for prediction of radiation-induced lung damage.将机器学习中的手工特征与潜在变量相结合,以预测放射性肺损伤。
Med Phys. 2019 May;46(5):2497-2511. doi: 10.1002/mp.13497. Epub 2019 Apr 8.
6
Protein subcellular localization prediction using multiple kernel learning based support vector machine.基于多核学习支持向量机的蛋白质亚细胞定位预测
Mol Biosyst. 2017 Mar 28;13(4):785-795. doi: 10.1039/c6mb00860g.
7
TargetDBP: Accurate DNA-Binding Protein Prediction Via Sequence-Based Multi-View Feature Learning.目标 DBP:基于序列的多视图特征学习的准确 DNA 结合蛋白预测。
IEEE/ACM Trans Comput Biol Bioinform. 2020 Jul-Aug;17(4):1419-1429. doi: 10.1109/TCBB.2019.2893634. Epub 2019 Jan 18.
8
iAPSL-IF: Identification of Apoptosis Protein Subcellular Location Using Integrative Features Captured from Amino Acid Sequences.iAPSL-IF:利用从氨基酸序列中提取的综合特征识别细胞凋亡蛋白亚细胞定位。
Int J Mol Sci. 2018 Apr 13;19(4):1190. doi: 10.3390/ijms19041190.
9
Prediction of nuclear proteins using nuclear translocation signals proposed by probabilistic latent semantic indexing.基于概率潜在语义索引的核转位信号预测核蛋白。
BMC Bioinformatics. 2012;13 Suppl 17(Suppl 17):S13. doi: 10.1186/1471-2105-13-S17-S13. Epub 2012 Dec 13.
10
An efficient model selection for linear discriminant function-based recursive feature elimination.基于线性判别函数的递归特征消除的有效模型选择。
J Biomed Inform. 2022 May;129:104070. doi: 10.1016/j.jbi.2022.104070. Epub 2022 Apr 15.

引用本文的文献

1
Screening biomarkers related to cholesterol metabolism in osteoarthritis based on transcriptomics.基于转录组学筛选骨关节炎中与胆固醇代谢相关的生物标志物
Sci Rep. 2025 Jul 1;15(1):21218. doi: 10.1038/s41598-025-04550-y.
2
Combining lipidomics and machine learning to identify lipid biomarkers for nonsyndromic cleft lip with palate.结合脂质组学和机器学习以鉴定非综合征性唇腭裂的脂质生物标志物。
JCI Insight. 2025 May 8;10(9). doi: 10.1172/jci.insight.186629.
3
TargetCLP: clathrin proteins prediction combining transformed and evolutionary scale modeling-based multi-view features via weighted feature integration approach.TargetCLP:通过加权特征整合方法结合基于变换和进化尺度建模的多视图特征进行网格蛋白蛋白质预测。
Brief Bioinform. 2024 Nov 22;26(1). doi: 10.1093/bib/bbaf026.
4
A Review for Artificial Intelligence Based Protein Subcellular Localization.基于人工智能的蛋白质亚细胞定位研究综述
Biomolecules. 2024 Mar 27;14(4):409. doi: 10.3390/biom14040409.
5
Leveraging a meta-learning approach to advance the accuracy of Na blocking peptides prediction.利用元学习方法提高 Na 阻断肽预测的准确性。
Sci Rep. 2024 Feb 23;14(1):4463. doi: 10.1038/s41598-024-55160-z.
6
Dual-Signal Feature Spaces Map Protein Subcellular Locations Based on Immunohistochemistry Image and Protein Sequence.基于免疫组化图像和蛋白质序列的双信号特征空间映射蛋白质亚细胞定位。
Sensors (Basel). 2023 Nov 7;23(22):9014. doi: 10.3390/s23229014.
7
Empirical comparison and analysis of machine learning-based approaches for druggable protein identification.基于机器学习的可成药蛋白识别方法的实证比较与分析
EXCLI J. 2023 Aug 29;22:915-927. doi: 10.17179/excli2023-6410. eCollection 2023.
8
HAR_Locator: a novel protein subcellular location prediction model of immunohistochemistry images based on hybrid attention modules and residual units.HAR定位器:一种基于混合注意力模块和残差单元的免疫组织化学图像新型蛋白质亚细胞定位预测模型
Front Mol Biosci. 2023 Aug 17;10:1171429. doi: 10.3389/fmolb.2023.1171429. eCollection 2023.
9
TROLLOPE: A novel sequence-based stacked approach for the accelerated discovery of linear T-cell epitopes of hepatitis C virus.特罗洛普:一种基于新型序列的堆叠方法,用于加速发现丙型肝炎病毒的线性 T 细胞表位。
PLoS One. 2023 Aug 25;18(8):e0290538. doi: 10.1371/journal.pone.0290538. eCollection 2023.
10
StackTTCA: a stacking ensemble learning-based framework for accurate and high-throughput identification of tumor T cell antigens.StackTTCA:一种基于堆叠集成学习的框架,用于准确、高通量地鉴定肿瘤 T 细胞抗原。
BMC Bioinformatics. 2023 Jul 28;24(1):301. doi: 10.1186/s12859-023-05421-x.

本文引用的文献

1
Accurate prediction of multi-label protein subcellular localization through multi-view feature learning with RBRL classifier.通过使用 RBRL 分类器的多视图特征学习实现多标签蛋白质亚细胞定位的准确预测。
Brief Bioinform. 2021 Sep 2;22(5). doi: 10.1093/bib/bbab012.
2
DeepPPSite: A deep learning-based model for analysis and prediction of phosphorylation sites using efficient sequence information.DeepPPSite:一种基于深度学习的模型,用于利用有效的序列信息分析和预测磷酸化位点。
Anal Biochem. 2021 Jan 1;612:113955. doi: 10.1016/j.ab.2020.113955. Epub 2020 Sep 16.
3
Bioimage-Based Prediction of Protein Subcellular Location in Human Tissue with Ensemble Features and Deep Networks.基于生物图像的人类组织中蛋白质亚细胞位置的集成特征和深度网络预测。
IEEE/ACM Trans Comput Biol Bioinform. 2020 Nov-Dec;17(6):1966-1980. doi: 10.1109/TCBB.2019.2917429. Epub 2020 Dec 8.
4
TargetDBP: Accurate DNA-Binding Protein Prediction Via Sequence-Based Multi-View Feature Learning.目标 DBP:基于序列的多视图特征学习的准确 DNA 结合蛋白预测。
IEEE/ACM Trans Comput Biol Bioinform. 2020 Jul-Aug;17(4):1419-1429. doi: 10.1109/TCBB.2019.2893634. Epub 2019 Jan 18.
5
Bioimage Classification with Handcrafted and Learned Features.基于手工特征和学习特征的生物图像分类
IEEE/ACM Trans Comput Biol Bioinform. 2018 Mar 30. doi: 10.1109/TCBB.2018.2821127.
6
Using Chou's general PseAAC to analyze the evolutionary relationship of receptor associated proteins (RAP) with various folding patterns of protein domains.使用周的通用伪氨基酸组成方法来分析受体相关蛋白(RAP)与蛋白质结构域各种折叠模式之间的进化关系。
J Theor Biol. 2018 May 14;445:62-74. doi: 10.1016/j.jtbi.2018.02.008. Epub 2018 Feb 22.
7
ATPbind: Accurate Protein-ATP Binding Site Prediction by Combining Sequence-Profiling and Structure-Based Comparisons.ATPbind:通过序列特征分析与结构比较相结合的方法进行准确的蛋白质-ATP 结合位点预测。
J Chem Inf Model. 2018 Feb 26;58(2):501-510. doi: 10.1021/acs.jcim.7b00397. Epub 2018 Feb 8.
8
iKcr-PseEns: Identify lysine crotonylation sites in histone proteins with pseudo components and ensemble classifier.iKcr-PseEns:使用伪组件和集成分类器鉴定组蛋白中的赖氨酸巴豆酰化位点。
Genomics. 2018 Sep;110(5):239-246. doi: 10.1016/j.ygeno.2017.10.008. Epub 2017 Nov 17.
9
pLoc-mHum: predict subcellular localization of multi-location human proteins via general PseAAC to winnow out the crucial GO information.pLoc-mHum:通过通用 PseAAC 预测多定位人类蛋白质的亚细胞定位,以筛选出关键的 GO 信息。
Bioinformatics. 2018 May 1;34(9):1448-1456. doi: 10.1093/bioinformatics/btx711.
10
pLoc-mGneg: Predict subcellular localization of Gram-negative bacterial proteins by deep gene ontology learning via general PseAAC.pLoc-mGneg:通过基于通用伪氨基酸组成的深度基因本体学习预测革兰氏阴性菌蛋白质的亚细胞定位。
Genomics. 2017 Oct 6. doi: 10.1016/j.ygeno.2017.10.002.

PScL-HDeep:基于图像的人类组织蛋白亚细胞定位预测,使用基于手工和深度学习特征的两层特征选择的集成学习方法。

PScL-HDeep: image-based prediction of protein subcellular location in human tissue using ensemble learning of handcrafted and deep learned features with two-layer feature selection.

机构信息

Nanjing University of Science and Technology, China.

School of Computer Science and Engineering, Nanjing University of Science and Technology, China.

出版信息

Brief Bioinform. 2021 Nov 5;22(6). doi: 10.1093/bib/bbab278.

DOI:10.1093/bib/bbab278
PMID:34337652
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8574991/
Abstract

Protein subcellular localization plays a crucial role in characterizing the function of proteins and understanding various cellular processes. Therefore, accurate identification of protein subcellular location is an important yet challenging task. Numerous computational methods have been proposed to predict the subcellular location of proteins. However, most existing methods have limited capability in terms of the overall accuracy, time consumption and generalization power. To address these problems, in this study, we developed a novel computational approach based on human protein atlas (HPA) data, referred to as PScL-HDeep, for accurate and efficient image-based prediction of protein subcellular location in human tissues. We extracted different handcrafted and deep learned (by employing pretrained deep learning model) features from different viewpoints of the image. The step-wise discriminant analysis (SDA) algorithm was applied to generate the optimal feature set from each original raw feature set. To further obtain a more informative feature subset, support vector machine-based recursive feature elimination with correlation bias reduction (SVM-RFE + CBR) feature selection algorithm was applied to the integrated feature set. Finally, the classification models, namely support vector machine with radial basis function (SVM-RBF) and support vector machine with linear kernel (SVM-LNR), were learned on the final selected feature set. To evaluate the performance of the proposed method, a new gold standard benchmark training dataset was constructed from the HPA databank. PScL-HDeep achieved the maximum performance on 10-fold cross validation test on this dataset and showed a better efficacy over existing predictors. Furthermore, we also illustrated the generalization ability of the proposed method by conducting a stringent independent validation test.

摘要

蛋白质亚细胞定位在描述蛋白质功能和理解各种细胞过程中起着至关重要的作用。因此,准确识别蛋白质亚细胞位置是一项重要但具有挑战性的任务。已经提出了许多计算方法来预测蛋白质的亚细胞位置。然而,大多数现有的方法在整体准确性、时间消耗和泛化能力方面都有一定的局限性。为了解决这些问题,在本研究中,我们开发了一种基于人类蛋白质图谱(HPA)数据的新型计算方法,称为 PScL-HDeep,用于准确、高效地预测人类组织中蛋白质的亚细胞位置。我们从图像的不同视角提取了不同的手工和深度学习(通过使用预先训练的深度学习模型)特征。逐步判别分析(SDA)算法被应用于从每个原始原始特征集中生成最优特征集。为了进一步获得更具信息量的特征子集,基于支持向量机的递归特征消除与相关偏置减少(SVM-RFE+CBR)特征选择算法被应用于集成特征集。最后,支持向量机的径向基函数(SVM-RBF)和支持向量机的线性核(SVM-LNR)的分类模型被应用于最终选择的特征集上进行学习。为了评估所提出方法的性能,我们从 HPA 数据库构建了一个新的黄金标准基准训练数据集。PScL-HDeep 在该数据集上的 10 折交叉验证测试中达到了最大性能,并在现有预测器中表现出更好的效果。此外,我们还通过进行严格的独立验证测试说明了该方法的泛化能力。