• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过将序列特征纳入 Chou 的 PseAAC 来预测多标签蛋白质的亚细胞定位。

Predicting subcellular localization of multi-label proteins by incorporating the sequence features into Chou's PseAAC.

机构信息

Department of Computer Science, Abdul Wali Khan University Mardan, Pakistan.

Department of Computer Science, Abdul Wali Khan University Mardan, Pakistan.

出版信息

Genomics. 2019 Dec;111(6):1325-1332. doi: 10.1016/j.ygeno.2018.09.004. Epub 2018 Sep 7.

DOI:10.1016/j.ygeno.2018.09.004
PMID:30196077
Abstract

The emergence of numerous genome projects has made the experimental classification of the protein localization almost impossible due to the exponential increase in the number of protein samples. However, most of the applications are merely developed for single-plex and completely ignored the presence of one protein at two or more locations in a cell. In this regard, few attempts were carried out to target Multi-label protein localizations; consequently, undesirable accuracies are achieved. This paper presents a novel approach, in which a discrete feature extraction method is fused with physicochemical properties of amino acids by using Chou's general form of Pseudo Amino Acid Composition. The technique is tested on two benchmark datasets namely: Gpos-mploc and Virus-mPLoc. The empirical results demonstrated that the proposed method yields better results via two examined classifiers i.e. ML-KNN and Rank-SVM. It is established that the proposed model has improved values in all performance measures considered for the comparison.

摘要

由于蛋白质样本数量呈指数级增长,众多基因组项目的出现使得蛋白质定位的实验分类几乎变得不可能。然而,大多数应用程序仅仅是为单plex 开发的,完全忽略了一个蛋白质在细胞中存在于两个或更多位置的情况。在这方面,很少有尝试针对多标签蛋白质定位;因此,得到的准确性不理想。本文提出了一种新方法,其中通过使用 Chou 的通用形式的伪氨基酸组成,将离散特征提取方法与氨基酸的理化性质融合在一起。该技术在两个基准数据集 Gpos-mploc 和 Virus-mPLoc 上进行了测试。实验结果表明,该方法通过两种经过检验的分类器(即 ML-KNN 和 Rank-SVM)产生了更好的结果。已经确定,所提出的模型在所有考虑的比较性能指标中都具有改进的值。

相似文献

1
Predicting subcellular localization of multi-label proteins by incorporating the sequence features into Chou's PseAAC.通过将序列特征纳入 Chou 的 PseAAC 来预测多标签蛋白质的亚细胞定位。
Genomics. 2019 Dec;111(6):1325-1332. doi: 10.1016/j.ygeno.2018.09.004. Epub 2018 Sep 7.
2
pLoc_bal-mVirus: Predict Subcellular Localization of Multi-Label Virus Proteins by Chou's General PseAAC and IHTS Treatment to Balance Training Dataset.pLoc_bal-mVirus:基于周式广义伪氨基酸组成和用于平衡训练数据集的迭代启发式阈值选择处理预测多标签病毒蛋白的亚细胞定位
Med Chem. 2019;15(5):496-509. doi: 10.2174/1573406415666181217114710.
3
Prediction of protein subcellular multi-localization based on the general form of Chou's pseudo amino acid composition.基于周氏伪氨基酸组成通用形式的蛋白质亚细胞多定位预测
Protein Pept Lett. 2012 Apr;19(4):375-87. doi: 10.2174/092986612799789369.
4
Virus-ECC-mPLoc: a multi-label predictor for predicting the subcellular localization of virus proteins with both single and multiple sites based on a general form of Chou's pseudo amino acid composition.病毒-ECC-mPLoc:一种基于周氏伪氨基酸组成的通用形式,用于预测具有单一位点和多个位点的病毒蛋白亚细胞定位的多标签预测器。
Protein Pept Lett. 2013 Mar;20(3):309-17. doi: 10.2174/0929866511320030009.
5
Using a novel AdaBoost algorithm and Chou's Pseudo amino acid composition for predicting protein subcellular localization.使用一种新颖的AdaBoost算法和周氏伪氨基酸组成来预测蛋白质亚细胞定位。
Protein Pept Lett. 2011 Dec;18(12):1219-25. doi: 10.2174/092986611797642797.
6
Prediction of bacterial protein subcellular localization by incorporating various features into Chou's PseAAC and a backward feature selection approach.通过将各种特征纳入周氏伪氨基酸组成并采用反向特征选择方法预测细菌蛋白质亚细胞定位
Biochimie. 2014 Sep;104:100-7. doi: 10.1016/j.biochi.2014.06.001. Epub 2014 Jun 11.
7
iMem-2LSAAC: A two-level model for discrimination of membrane proteins and their types by extending the notion of SAAC into chou's pseudo amino acid composition.iMem-2LSAAC:一种通过将SAAC概念扩展到周氏伪氨基酸组成来区分膜蛋白及其类型的两级模型。
J Theor Biol. 2018 Apr 7;442:11-21. doi: 10.1016/j.jtbi.2018.01.008. Epub 2018 Jan 11.
8
Prediction of protein subcellular localization with oversampling approach and Chou's general PseAAC.基于过采样方法和周式广义伪氨基酸组成预测蛋白质亚细胞定位
J Theor Biol. 2018 Jan 21;437:239-250. doi: 10.1016/j.jtbi.2017.10.030. Epub 2017 Oct 31.
9
A two-stage SVM method to predict membrane protein types by incorporating amino acid classifications and physicochemical properties into a general form of Chou's PseAAC.一种两阶段支持向量机方法,通过将氨基酸分类和物理化学性质纳入到 Chou 的 PseAAC 的一般形式中,来预测膜蛋白类型。
J Theor Biol. 2014 Mar 7;344:31-9. doi: 10.1016/j.jtbi.2013.11.017. Epub 2013 Dec 4.
10
Predicting the Subcellular Localization of Proteins with Multiple Sites Based on Multiple Features Fusion.基于多特征融合预测具有多个位点的蛋白质的亚细胞定位
IEEE/ACM Trans Comput Biol Bioinform. 2016 Jan-Feb;13(1):36-42. doi: 10.1109/TCBB.2015.2485207. Epub 2015 Oct 6.

引用本文的文献

1
A Review for Artificial Intelligence Based Protein Subcellular Localization.基于人工智能的蛋白质亚细胞定位研究综述
Biomolecules. 2024 Mar 27;14(4):409. doi: 10.3390/biom14040409.
2
Drug-target interaction prediction based on protein features, using wrapper feature selection.基于蛋白质特征的药物-靶标相互作用预测,使用包装器特征选择。
Sci Rep. 2023 Mar 3;13(1):3594. doi: 10.1038/s41598-023-30026-y.
3
Multiple Protein Subcellular Locations Prediction Based on Deep Convolutional Neural Networks with Self-Attention Mechanism.基于具有自注意力机制的深度卷积神经网络的多种蛋白质亚细胞定位预测。
Interdiscip Sci. 2022 Jun;14(2):421-438. doi: 10.1007/s12539-021-00496-7. Epub 2022 Jan 23.
4
iTAGPred: A Two-Level Prediction Model for Identification of Angiogenesis and Tumor Angiogenesis Biomarkers.iTAGPred:一种用于识别血管生成和肿瘤血管生成生物标志物的两级预测模型。
Appl Bionics Biomech. 2021 Sep 27;2021:2803147. doi: 10.1155/2021/2803147. eCollection 2021.
5
MULocDeep: A deep-learning framework for protein subcellular and suborganellar localization prediction with residue-level interpretation.MULocDeep:一种用于蛋白质亚细胞和亚细胞器定位预测并具有残基水平解释的深度学习框架。
Comput Struct Biotechnol J. 2021 Aug 18;19:4825-4839. doi: 10.1016/j.csbj.2021.08.027. eCollection 2021.
6
Machine and Deep Learning for Prediction of Subcellular Localization.机器和深度学习在预测亚细胞定位中的应用。
Methods Mol Biol. 2021;2361:249-261. doi: 10.1007/978-1-0716-1641-3_15.
7
MirLocPredictor: A ConvNet-Based Multi-Label MicroRNA Subcellular Localization Predictor by Incorporating k-Mer Positional Information.MirLocPredictor:一种基于卷积神经网络的多标签 miRNA 亚细胞定位预测方法,通过整合 k-mer 位置信息。
Genes (Basel). 2020 Dec 9;11(12):1475. doi: 10.3390/genes11121475.
8
Identification of 4-carboxyglutamate residue sites based on position based statistical feature and multiple classification.基于位置的统计特征和多分类识别 4-羧基谷氨酸残基位点
Sci Rep. 2020 Oct 9;10(1):16913. doi: 10.1038/s41598-020-73107-y.
9
Subcellular location prediction of apoptosis proteins using two novel feature extraction methods based on evolutionary information and LDA.基于进化信息和 LDA 的两种新特征提取方法对凋亡蛋白的亚细胞定位预测
BMC Bioinformatics. 2020 May 24;21(1):212. doi: 10.1186/s12859-020-3539-1.
10
Identifying Antifreeze Proteins Based on Key Evolutionary Information.基于关键进化信息鉴定抗冻蛋白
Front Bioeng Biotechnol. 2020 Mar 26;8:244. doi: 10.3389/fbioe.2020.00244. eCollection 2020.