• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

ATGPred-FL:基于特征表示学习的自噬蛋白序列预测。

ATGPred-FL: sequence-based prediction of autophagy proteins with feature representation learning.

机构信息

Yangtze Delta Region Institute (Quzhou), University of Electronic Science and Technology of China, Quzhou, China.

School of Applied Chemistry and Biological Technology, Shenzhen Polytechnic, 7098 Liuxian Street, Shenzhen, 518055, China.

出版信息

Amino Acids. 2022 May;54(5):799-809. doi: 10.1007/s00726-022-03145-5. Epub 2022 Mar 14.

DOI:10.1007/s00726-022-03145-5
PMID:35286461
Abstract

Autophagy plays an important role in biological evolution and is regulated by many autophagy proteins. Accurate identification of autophagy proteins is crucially important to reveal their biological functions. Due to the expense and labor cost of experimental methods, it is urgent to develop automated, accurate and reliable sequence-based computational tools to enable the identification of novel autophagy proteins among numerous proteins and peptides. For this purpose, a new predictor named ATGPred-FL was proposed for the efficient identification of autophagy proteins. We investigated various sequence-based feature descriptors and adopted the feature learning method to generate corresponding, more informative probability features. Then, a two-step feature selection strategy based on accuracy was utilized to remove irrelevant and redundant features, leading to the most discriminative 14-dimensional feature set. The final predictor was built using a support vector machine classifier, which performed favorably on both the training and testing sets with accuracy values of 94.40% and 90.50%, respectively. ATGPred-FL is the first ATG machine learning predictor based on protein primary sequences. We envision that ATGPred-FL will be an effective and useful tool for autophagy protein identification, and it is available for free at http://lab.malab.cn/~acy/ATGPred-FL , the source code and datasets are accessible at https://github.com/jiaoshihu/ATGPred .

摘要

自噬在生物进化中起着重要作用,受许多自噬蛋白的调控。准确识别自噬蛋白对于揭示其生物学功能至关重要。由于实验方法的费用和劳动力成本,迫切需要开发自动化、准确和可靠的基于序列的计算工具,以便在众多蛋白质和肽中识别新的自噬蛋白。为此,我们提出了一种名为 ATGPred-FL 的新型预测器,用于有效地识别自噬蛋白。我们研究了各种基于序列的特征描述符,并采用特征学习方法生成相应的、更具信息量的概率特征。然后,我们采用基于准确性的两步特征选择策略来去除不相关和冗余的特征,从而得到最具判别力的 14 维特征集。最终的预测器是使用支持向量机分类器构建的,在训练集和测试集上的准确率分别为 94.40%和 90.50%,表现良好。ATGPred-FL 是第一个基于蛋白质一级序列的 ATG 机器学习预测器。我们预计 ATGPred-FL 将成为自噬蛋白鉴定的有效且有用的工具,它可以在 http://lab.malab.cn/~acy/ATGPred-FL 上免费获得,源代码和数据集可在 https://github.com/jiaoshihu/ATGPred 上获得。

相似文献

1
ATGPred-FL: sequence-based prediction of autophagy proteins with feature representation learning.ATGPred-FL:基于特征表示学习的自噬蛋白序列预测。
Amino Acids. 2022 May;54(5):799-809. doi: 10.1007/s00726-022-03145-5. Epub 2022 Mar 14.
2
ACPred-FL: a sequence-based predictor using effective feature representation to improve the prediction of anti-cancer peptides.ACPred-FL:一种基于序列的预测器,使用有效的特征表示来提高抗癌肽的预测能力。
Bioinformatics. 2018 Dec 1;34(23):4007-4016. doi: 10.1093/bioinformatics/bty451.
3
EnsembleDL-ATG: Identifying autophagy proteins by integrating their sequence and evolutionary information using an ensemble deep learning framework.集成深度学习自噬蛋白识别方法(EnsembleDL-ATG):使用集成深度学习框架整合自噬蛋白的序列和进化信息来识别自噬蛋白。
Comput Struct Biotechnol J. 2023 Sep 29;21:4836-4848. doi: 10.1016/j.csbj.2023.09.036. eCollection 2023.
4
Sequence-Based Prediction with Feature Representation Learning and Biological Function Analysis of Channel Proteins.基于序列的通道蛋白特征表示学习和生物学功能预测。
Front Biosci (Landmark Ed). 2022 Jun 2;27(6):177. doi: 10.31083/j.fbl2706177.
5
Identification of plant vacuole proteins by exploiting deep representation learning features.利用深度表征学习特征鉴定植物液泡蛋白
Comput Struct Biotechnol J. 2022 Jun 8;20:2921-2927. doi: 10.1016/j.csbj.2022.06.002. eCollection 2022.
6
Meta-iPVP: a sequence-based meta-predictor for improving the prediction of phage virion proteins using effective feature representation.Meta-iPVP:一种基于序列的元预测器,用于使用有效的特征表示来改进噬菌体衣壳蛋白的预测。
J Comput Aided Mol Des. 2020 Oct;34(10):1105-1116. doi: 10.1007/s10822-020-00323-z. Epub 2020 Jun 16.
7
Enhancer-FRL: Improved and Robust Identification of Enhancers and Their Activities Using Feature Representation Learning.增强子-FRL:利用特征表示学习改进并稳健识别增强子及其活性
IEEE/ACM Trans Comput Biol Bioinform. 2023 Mar-Apr;20(2):967-975. doi: 10.1109/TCBB.2022.3204365. Epub 2023 Apr 3.
8
Comparative analysis and prediction of quorum-sensing peptides using feature representation learning and machine learning algorithms.使用特征表示学习和机器学习算法对群体感应肽进行比较分析和预测。
Brief Bioinform. 2020 Jan 17;21(1):106-119. doi: 10.1093/bib/bby107.
9
iTTCA-RF: a random forest predictor for tumor T cell antigens.iTTCA-RF:一种用于肿瘤 T 细胞抗原的随机森林预测器。
J Transl Med. 2021 Oct 27;19(1):449. doi: 10.1186/s12967-021-03084-x.
10
UMPred-FRL: A New Approach for Accurate Prediction of Umami Peptides Using Feature Representation Learning.UMPred-FRL:一种使用特征表示学习准确预测鲜味肽的新方法。
Int J Mol Sci. 2021 Dec 4;22(23):13124. doi: 10.3390/ijms222313124.

引用本文的文献

1
Explainable AI to unveil cellular autophagy dynamics.可解释人工智能将揭示细胞自噬动力学。
PLoS One. 2025 Sep 11;20(9):e0331045. doi: 10.1371/journal.pone.0331045. eCollection 2025.
2
PLM-ATG: Identification of Autophagy Proteins by Integrating Protein Language Model Embeddings with PSSM-Based Features.PLM-ATG:通过将蛋白质语言模型嵌入与基于位置特异性得分矩阵的特征相结合来鉴定自噬蛋白
Molecules. 2025 Apr 10;30(8):1704. doi: 10.3390/molecules30081704.
3
Therapeutic peptides identification via kernel risk sensitive loss-based k-nearest neighbor model and multi-Laplacian regularization.

本文引用的文献

1
CoV-Spectrum: analysis of globally shared SARS-CoV-2 data to identify and characterize new variants.CoV-Spectrum:对全球共享的 SARS-CoV-2 数据进行分析,以识别和描述新的变体。
Bioinformatics. 2022 Mar 4;38(6):1735-1737. doi: 10.1093/bioinformatics/btab856.
2
JLAN: medical code prediction via joint learning attention networks and denoising mechanism.JLAN:基于联合学习注意力网络和去噪机制的医疗编码预测。
BMC Bioinformatics. 2021 Dec 13;22(1):590. doi: 10.1186/s12859-021-04520-x.
3
PreTP-EL: prediction of therapeutic peptides based on ensemble learning.
基于核风险敏感损失的 k-最近邻模型和多重拉普拉斯正则化的治疗性肽鉴定。
Brief Bioinform. 2024 Sep 23;25(6). doi: 10.1093/bib/bbae534.
4
Autophagy and machine learning: Unanswered questions.自噬作用与机器学习:未解之谜。
Biochim Biophys Acta Mol Basis Dis. 2024 Aug;1870(6):167263. doi: 10.1016/j.bbadis.2024.167263. Epub 2024 May 25.
5
Accurately identifying hemagglutinin using sequence information and machine learning methods.使用序列信息和机器学习方法准确识别血凝素。
Front Med (Lausanne). 2023 Oct 31;10:1281880. doi: 10.3389/fmed.2023.1281880. eCollection 2023.
6
EnsembleDL-ATG: Identifying autophagy proteins by integrating their sequence and evolutionary information using an ensemble deep learning framework.集成深度学习自噬蛋白识别方法(EnsembleDL-ATG):使用集成深度学习框架整合自噬蛋白的序列和进化信息来识别自噬蛋白。
Comput Struct Biotechnol J. 2023 Sep 29;21:4836-4848. doi: 10.1016/j.csbj.2023.09.036. eCollection 2023.
PreTP-EL:基于集成学习的治疗性肽预测。
Brief Bioinform. 2021 Nov 5;22(6). doi: 10.1093/bib/bbab358.
4
Leveraging the attention mechanism to improve the identification of DNA N6-methyladenine sites.利用注意力机制提高 DNA N6-甲基腺嘌呤位点的识别。
Brief Bioinform. 2021 Nov 5;22(6). doi: 10.1093/bib/bbab351.
5
Review of unsupervised pretraining strategies for molecules representation.分子表示的无监督预训练策略综述。
Brief Funct Genomics. 2021 Sep 11;20(5):323-332. doi: 10.1093/bfgp/elab036.
6
iAMP-CA2L: a new CNN-BiLSTM-SVM classifier based on cellular automata image for identifying antimicrobial peptides and their functional types.iAMP-CA2L:一种基于细胞自动机图像的新型 CNN-BiLSTM-SVM 分类器,用于识别抗菌肽及其功能类型。
Brief Bioinform. 2021 Nov 5;22(6). doi: 10.1093/bib/bbab209.
7
iBLP: An XGBoost-Based Predictor for Identifying Bioluminescent Proteins.iBLP:一种基于 XGBoost 的生物发光蛋白鉴定预测器。
Comput Math Methods Med. 2021 Jan 7;2021:6664362. doi: 10.1155/2021/6664362. eCollection 2021.
8
PROBselect: accurate prediction of protein-binding residues from proteins sequences via dynamic predictor selection.PROBselect:通过动态预测器选择从蛋白质序列中准确预测蛋白质结合残基。
Bioinformatics. 2020 Dec 30;36(Suppl_2):i735-i744. doi: 10.1093/bioinformatics/btaa806.
9
A Novel Adaptive Linear Neuron Based on DNA Strand Displacement Reaction Network.一种基于 DNA 链置换反应网络的新型自适应线性神经元。
IEEE/ACM Trans Comput Biol Bioinform. 2022 May-Jun;19(3):1424-1434. doi: 10.1109/TCBB.2020.3045567. Epub 2022 Jun 3.
10
Prediction of bio-sequence modifications and the associations with diseases.生物序列修饰的预测及其与疾病的关联。
Brief Funct Genomics. 2021 Mar 2;20(1):1-18. doi: 10.1093/bfgp/elaa023.