• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

WGAN-GP_Glu:一种基于双生成器-瓦瑟斯坦生成对抗网络和梯度惩罚算法的半监督模型,用于戊二酰化位点识别。

WGAN-GP_Glu: A semi-supervised model based on double generator-Wasserstein GAN with gradient penalty algorithm for glutarylation site identification.

作者信息

Ning Qiao, Qi Zedong

机构信息

Information Science and Technology, Dalian Maritime University, Dalian, Liaoning, China; The School of Artificial Intelligence and Computer Science, Jiangnan University, Wuxi, China; Neusoft Education Technology Group, Dalian, China; Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, Jilin University, Changchun, 130012, China.

Information Science and Technology, Dalian Maritime University, Dalian, Liaoning, China.

出版信息

Comput Biol Med. 2025 Jan;184:109328. doi: 10.1016/j.compbiomed.2024.109328. Epub 2024 Nov 14.

DOI:10.1016/j.compbiomed.2024.109328
PMID:39541896
Abstract

As an important post-translational modification, glutarylation plays a crucial role in a variety of cellular functions. Recently, diverse computational methods for glutarylation site identification have been proposed. However, the class imbalance problem due to data noise and uncertainty of non-glutarylation sites remains a great challenge. In this article, we propose a novel semi-supervised learning algorithm, called WGAN-GP_Glu, for identifying reliable non-glutarylation lysine sites from those without glutarylation annotation. WGAN-GP_Glu method is a multi-module framework algorithm, which mainly includes a reliable negative sample selection module, a deep feature extraction module, and a glutarylation site prediction module. In reliable negative sample selection module, we design an improved method of Wasserstein GAN with Gradient Penalty (WGAN-GP), named ReliableWGAN-GP, including three parts, two generators G1, G2 and a discriminator D, which can select reliable non-glutarylation samples from a great number of unlabeled samples. Generator G1 is utilized to generate noise data from unlabeled samples. For generator G2, both the positive sample and the noise data are used as inputs to improve the discriminant capability of discriminator D. Then, convolutional neural network and bidirectional long short-term memory network combined with attention mechanism are utilized to extract deep features for glutarylation samples and reliable non-glutarylation samples. Finally, a glutarylation site prediction module based on the three-layer fully connected layer is designed to make class predictions for samples. The sensitivity, specificity, accuracy and Matthew correlation coefficient of WGAN-GP_Glu on the independent test data set reach 90.58 %, 95.82 %, 94.44 % and 0.8645, respectively, which surpassed the existing methods for glutarylation sites prediction. Therefore, WGAN-GP_Glu can serve as a powerful tool in identifying glutarylation sites and the ReliableWGAN-GP algorithm is effective in selecting reliable negative samples. The data and code are available at https://github.com/xbbxhbc/WGAN-GP_Glu.git.

摘要

作为一种重要的翻译后修饰,戊二酰化在多种细胞功能中发挥着关键作用。近年来,已提出了多种用于戊二酰化位点识别的计算方法。然而,由于数据噪声和非戊二酰化位点的不确定性导致的类不平衡问题仍然是一个巨大的挑战。在本文中,我们提出了一种新颖的半监督学习算法,称为WGAN-GP_Glu,用于从没有戊二酰化注释的位点中识别可靠的非戊二酰化赖氨酸位点。WGAN-GP_Glu方法是一种多模块框架算法,主要包括可靠负样本选择模块、深度特征提取模块和戊二酰化位点预测模块。在可靠负样本选择模块中,我们设计了一种改进的带梯度惩罚的Wasserstein生成对抗网络(WGAN-GP)方法,称为可靠WGAN-GP,包括三个部分,两个生成器G1、G2和一个判别器D,它可以从大量未标记样本中选择可靠的非戊二酰化样本。生成器G1用于从未标记样本中生成噪声数据。对于生成器G2,正样本和噪声数据都用作输入,以提高判别器D的判别能力。然后,利用卷积神经网络和结合注意力机制的双向长短期记忆网络为戊二酰化样本和可靠的非戊二酰化样本提取深度特征。最后,设计了一个基于三层全连接层的戊二酰化位点预测模块对样本进行类别预测。WGAN-GP_Glu在独立测试数据集上的灵敏度、特异性、准确率和马修相关系数分别达到90.58%、95.82%、94.44%和0.8645,超过了现有的戊二酰化位点预测方法。因此,WGAN-GP_Glu可以作为识别戊二酰化位点的有力工具,并且可靠WGAN-GP算法在选择可靠负样本方面是有效的。数据和代码可在https://github.com/xbbxhbc/WGAN-GP_Glu.git获取。

相似文献

1
WGAN-GP_Glu: A semi-supervised model based on double generator-Wasserstein GAN with gradient penalty algorithm for glutarylation site identification.WGAN-GP_Glu:一种基于双生成器-瓦瑟斯坦生成对抗网络和梯度惩罚算法的半监督模型,用于戊二酰化位点识别。
Comput Biol Med. 2025 Jan;184:109328. doi: 10.1016/j.compbiomed.2024.109328. Epub 2024 Nov 14.
2
FCCCSR_Glu: a semi-supervised learning model based on FCCCSR algorithm for prediction of glutarylation sites.FCCCSR_Glu:一种基于 FCCCSR 算法的半监督学习模型,用于预测谷氨酰化位点。
Brief Bioinform. 2022 Nov 19;23(6). doi: 10.1093/bib/bbac421.
3
A Novel Method for Identification of Glutarylation Sites Combining Borderline-SMOTE With Tomek Links Technique in Imbalanced Data.一种结合边缘-SMOTE 与 Tomek 链接技术的不平衡数据谷氨酰化位点鉴定新方法
IEEE/ACM Trans Comput Biol Bioinform. 2022 Sep-Oct;19(5):2632-2641. doi: 10.1109/TCBB.2021.3095482. Epub 2022 Oct 10.
4
Computational Identification of Lysine Glutarylation Sites Using Positive-Unlabeled Learning.利用正无标记学习法对赖氨酸戊二酰化位点进行计算识别
Curr Genomics. 2020 Apr;21(3):204-211. doi: 10.2174/1389202921666200511072327.
5
DeepDN_iGlu: prediction of lysine glutarylation sites based on attention residual learning method and DenseNet.DeepDN_iGlu:基于注意力残差学习方法和 DenseNet 的赖氨酸瓜氨酸化位点预测。
Math Biosci Eng. 2023 Jan;20(2):2815-2830. doi: 10.3934/mbe.2023132. Epub 2022 Dec 1.
6
RF-GlutarySite: a random forest based predictor for glutarylation sites.RF-GlutarySite:基于随机森林的谷氨酰化位点预测器。
Mol Omics. 2019 Jun 1;15(3):189-204. doi: 10.1039/c9mo00028c. Epub 2019 Apr 26.
7
Deepro-Glu: combination of convolutional neural network and Bi-LSTM models using ProtBert and handcrafted features to identify lysine glutarylation sites.Deepro-Glu:使用 ProtBert 和手工特征的卷积神经网络和 Bi-LSTM 模型组合,以识别赖氨酸谷氨酰化位点。
Brief Bioinform. 2023 Mar 19;24(2). doi: 10.1093/bib/bbac631.
8
Accurately Predicting Glutarylation Sites Using Sequential Bi-Peptide-Based Evolutionary Features.基于序贯双肽进化特征准确预测谷氨酰化位点。
Genes (Basel). 2020 Aug 31;11(9):1023. doi: 10.3390/genes11091023.
9
Protein secondary structure prediction based on Wasserstein generative adversarial networks and temporal convolutional networks with convolutional block attention modules.基于瓦瑟斯坦生成对抗网络、带有卷积块注意力模块的时间卷积网络的蛋白质二级结构预测
Math Biosci Eng. 2023 Jan;20(2):2203-2218. doi: 10.3934/mbe.2023102. Epub 2022 Nov 17.
10
Generative AI with WGAN-GP for boosting seizure detection accuracy.用于提高癫痫发作检测准确性的带有 Wasserstein 生成对抗网络梯度惩罚的生成式人工智能。
Front Artif Intell. 2024 Oct 2;7:1437315. doi: 10.3389/frai.2024.1437315. eCollection 2024.