O-glycosylation site prediction for Homo sapiens by combining properties and sequence features with support vector machine.

Authors

Zhu Yan, Yin Shuwan, Zheng Jia, Shi Yixia, Jia Cangzhi

Affiliations

School of Science, Dalian Maritime University, Dalian 116026, P. R. China.

School of Mathematics and Statistics, Lingnan Normal University, Zhanjiang 524048, P. R. China.

Publication information

J Bioinform Comput Biol. 2022 Feb;20(1):2150029. doi: 10.1142/S0219720021500293. Epub 2021 Nov 19.

DOI: 10.1142/S0219720021500293
PMID: 34806952

Abstract

O-glycosylation is a protein posttranslational modification important in regulating almost all cells. It is related to a large number of physiological and pathological phenomena. Recognizing O-glycosylation sites is the key to further investigating the molecular mechanism of protein posttranslational modification. This study aimed to collect a reliable dataset on Homo sapiens and develop an O-glycosylation predictor for Homo sapiens, named Captor, through multiple features. A random undersampling method and a synthetic minority oversampling technique were employed to deal with imbalanced data. In addition, the Kruskal-Wallis (K-W) test was adopted to optimize feature vectors and improve the performance of the model. After a comprehensive comparison of various classifiers from traditional machine learning and deep learning, a support vector machine was chosen for its optimal performance and used to train and optimize the final prediction model. On the independent test set, Captor outperformed the existing O-glycosylation tool, suggesting that Captor could provide more instructive guidance for further experimental research on O-glycosylation. The source code and datasets are available at https://github.com/YanZhu06/Captor/.
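
Two of the preprocessing steps named in the abstract, random undersampling and Kruskal-Wallis feature ranking, can be illustrated with a minimal pure-Python sketch. This is not the Captor code: the function names `random_undersample`, `kruskal_wallis_h`, and `rank_features` are hypothetical, the labels are assumed to be 0 (non-site) and 1 (site), and the oversampling and support vector machine steps are left out. In practice these steps would more likely be done with an established statistics or machine learning library.

```python
import random


def random_undersample(samples, labels, seed=0):
    """Randomly keep the same number of samples from every class
    (the size of the smallest class)."""
    rng = random.Random(seed)
    by_class = {}
    for sample, label in zip(samples, labels):
        by_class.setdefault(label, []).append(sample)
    n_keep = min(len(group) for group in by_class.values())
    out_x, out_y = [], []
    for label, group in by_class.items():
        for sample in rng.sample(group, n_keep):
            out_x.append(sample)
            out_y.append(label)
    return out_x, out_y


def kruskal_wallis_h(groups):
    """Kruskal-Wallis H statistic with average ranks and tie correction."""
    pooled = sorted((v, g) for g, vals in enumerate(groups) for v in vals)
    n_total = len(pooled)
    rank_sums = [0.0] * len(groups)
    tie_term = 0
    i = 0
    while i < n_total:
        j = i
        while j < n_total and pooled[j][0] == pooled[i][0]:
            j += 1
        avg_rank = (i + 1 + j) / 2  # mean of the 1-based ranks i+1 .. j
        for k in range(i, j):
            rank_sums[pooled[k][1]] += avg_rank
        t = j - i
        tie_term += t ** 3 - t
        i = j
    h = 12 / (n_total * (n_total + 1)) * sum(
        r * r / len(vals) for r, vals in zip(rank_sums, groups)
    ) - 3 * (n_total + 1)
    correction = 1 - tie_term / (n_total ** 3 - n_total)
    # correction is 0 only when every value is identical: no separation.
    return h / correction if correction > 0 else 0.0


def rank_features(samples, labels):
    """Order feature indices from most to least class-discriminative by H."""
    scores = []
    for f in range(len(samples[0])):
        pos = [s[f] for s, y in zip(samples, labels) if y == 1]
        neg = [s[f] for s, y in zip(samples, labels) if y == 0]
        scores.append(kruskal_wallis_h([pos, neg]))
    return sorted(range(len(scores)), key=lambda f: -scores[f])
```

A feature whose values separate the two classes well receives a large H and is ranked first; a classifier could then be trained on the top-ranked features of the balanced set.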


Similar articles

1
O-glycosylation site prediction for Homo sapiens by combining properties and sequence features with support vector machine.
J Bioinform Comput Biol. 2022 Feb;20(1):2150029. doi: 10.1142/S0219720021500293. Epub 2021 Nov 19.
2
PredGly: predicting lysine glycation sites for Homo sapiens based on XGboost feature optimization.
Bioinformatics. 2019 Aug 15;35(16):2749-2756. doi: 10.1093/bioinformatics/bty1043.
3
Computational Prediction of N- and O-Linked Glycosylation Sites for Human and Mouse Proteins.
Methods Mol Biol. 2022;2499:177-186. doi: 10.1007/978-1-0716-2317-6_9.
4
Glypred: Lysine Glycation Site Prediction via CCU-LightGBM-BiLSTM Framework with Multi-Head Attention Mechanism.
J Chem Inf Model. 2024 Aug 26;64(16):6699-6711. doi: 10.1021/acs.jcim.4c01034. Epub 2024 Aug 9.
5
iRNA-2OM: A Sequence-Based Predictor for Identifying 2'-O-Methylation Sites in Homo sapiens.
J Comput Biol. 2018 Nov;25(11):1266-1277. doi: 10.1089/cmb.2018.0004. Epub 2018 Aug 16.
6
UbNiRF: A Hybrid Framework Based on Null Importances and Random Forest that Combines Multiple Features to Predict Ubiquitination Sites in and .
Front Biosci (Landmark Ed). 2024 May 21;29(5):197. doi: 10.31083/j.fbl2905197.
7
HOTGpred: Enhancing human O-linked threonine glycosylation prediction using integrated pretrained protein language model-based features and multi-stage feature selection approach.
Comput Biol Med. 2024 Sep;179:108859. doi: 10.1016/j.compbiomed.2024.108859. Epub 2024 Jul 18.
8
Glycosylation site prediction using ensembles of Support Vector Machine classifiers.
BMC Bioinformatics. 2007 Nov 9;8:438. doi: 10.1186/1471-2105-8-438.
9
Combining handcrafted features with latent variables in machine learning for prediction of radiation-induced lung damage.
Med Phys. 2019 May;46(5):2497-2511. doi: 10.1002/mp.13497. Epub 2019 Apr 8.
10
Positive-unlabelled learning of glycosylation sites in the human proteome.
BMC Bioinformatics. 2019 Mar 6;20(1):112. doi: 10.1186/s12859-019-2700-1.

Cited by

1
DeepO-GlcNAc: a web server for prediction of protein O-GlcNAcylation sites using deep learning combined with attention mechanism.
Front Cell Dev Biol. 2024 Oct 10;12:1456728. doi: 10.3389/fcell.2024.1456728. eCollection 2024.
2
Prediction of human O-linked glycosylation sites using stacked generalization and embeddings from pre-trained protein language model.
Bioinformatics. 2024 Nov 1;40(11). doi: 10.1093/bioinformatics/btae643.
3
Polypeptide N-acetylgalactosaminyltransferase (GalNAc-T) isozyme surface charge governs charge substrate preferences to modulate mucin type O-glycosylation.
Glycobiology. 2023 Oct 30;33(10):817-836. doi: 10.1093/glycob/cwad066.