HybridKla: a hybrid deep learning framework for lactylation site prediction.

Author information

Ning Wanshan, Qin Feibo, Zhou Ziwei, Yang Hang, Li Chentan, Guo Yaping

Affiliations

Institute for Clinical Medical Research, The First Affiliated Hospital of Xiamen University, School of Medicine, Xiamen University, Xiamen, Fujian 361003, China.

Department of Pathophysiology, School of Basic Medical Sciences, Zhengzhou University, Zhengzhou, Henan 450001, China.

Publication information

Brief Bioinform. 2025 Jul 2;26(4). doi: 10.1093/bib/bbaf375.

Abstract

Lysine lactylation (Kla), a novel lactate-derived post-translational modification, is involved in a myriad of biological processes and complex diseases. While several computational methods have been developed to identify Kla sites, these approaches remain limited by small datasets. In this work, we collected 23 984 Kla sites in 7297 proteins from the literature to construct a benchmark dataset. Leveraging recent advances in feature encoding, we tailored a multi-feature hybrid system that integrates eight complementary feature-encoding strategies derived from two automated encoders and a composition-based module. Combining the hybrid system with deep learning, we present a newly designed predictor named HybridKla, which achieves an area under the curve (AUC) value of 0.8460. Compared to existing tools, HybridKla achieved a >28.90% improvement in AUC (0.8460 versus 0.6563). We also conducted a proteome-wide search and provided a systematic prediction of Kla sites. The user-friendly online service of HybridKla is freely accessible for academic research at http://transkla.zzu.edu.cn/.
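The ">28.90%" figure appears to be a relative gain over the comparison AUC rather than a difference in percentage points. The short check below works through that arithmetic using only the two AUC values quoted in the abstract. The function and variable names are illustrative, and reading the figure as a relative gain is an assumption, not something the abstract states.

```python
def relative_improvement(new: float, baseline: float) -> float:
    """Return the gain of `new` over `baseline` as a percentage of `baseline`."""
    return (new - baseline) / baseline * 100.0


# AUC values quoted in the abstract.
hybridkla_auc = 0.8460
existing_tool_auc = 0.6563

# Absolute difference in AUC units (not a percentage).
absolute_gain = hybridkla_auc - existing_tool_auc

# Relative gain, assuming the abstract's percentage is measured
# against the existing tool's AUC.
relative_gain = relative_improvement(hybridkla_auc, existing_tool_auc)

print(f"absolute AUC gain: {absolute_gain:.4f}")
print(f"relative AUC gain: {relative_gain:.2f}%")
```

Under this reading, the absolute AUC difference is about 0.19, and dividing it by 0.6563 gives a relative gain just above 28.90%, which is consistent with the ">" in the abstract.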
