• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用从一级序列计算出的特征识别抗氧化蛋白的机器学习模型

Machine Learning Model for Identifying Antioxidant Proteins Using Features Calculated from Primary Sequences.

作者信息

Ho Thanh Lam Luu, Le Ngoc Hoang, Van Tuan Le, Tran Ban Ho, Nguyen Khanh Hung Truong, Nguyen Ngan Thi Kim, Huu Dang Luong, Le Nguyen Quoc Khanh

机构信息

International Master/PhD Program in Medicine, College of Medicine, Taipei Medical University, Taipei City 110, Taiwan.

Children's Hospital 2, Ho Chi Minh City 700000, Vietnam.

出版信息

Biology (Basel). 2020 Oct 6;9(10):325. doi: 10.3390/biology9100325.

DOI:10.3390/biology9100325
PMID:33036150
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7599600/
Abstract

Antioxidant proteins are involved importantly in many aspects of cellular life activities. They protect the cell and DNA from oxidative substances (such as peroxide, nitric oxide, oxygen-free radicals, etc.) which are known as reactive oxygen species (ROS). Free radical generation and antioxidant defenses are opposing factors in the human body and the balance between them is necessary to maintain a healthy body. An unhealthy routine or the degeneration of age can break the balance, leading to more ROS than antioxidants, causing damage to health. In general, the antioxidant mechanism is the combination of antioxidant molecules and ROS in a one-electron reaction. Creating computational models to promptly identify antioxidant candidates is essential in supporting antioxidant detection experiments in the laboratory. In this study, we proposed a machine learning-based model for this prediction purpose from a benchmark set of sequencing data. The experiments were conducted by using 10-fold cross-validation on the training process and validated by three different independent datasets. Different machine learning and deep learning algorithms have been evaluated on an optimal set of sequence features. Among them, Random Forest has been identified as the best model to identify antioxidant proteins with the highest performance. Our optimal model achieved high accuracy of 84.6%, as well as a balance in sensitivity (81.5%) and specificity (85.1%) for antioxidant protein identification on the training dataset. The performance results from different independent datasets also showed the significance in our model compared to previously published works on antioxidant protein identification.

摘要

抗氧化蛋白在细胞生命活动的许多方面都起着重要作用。它们保护细胞和DNA免受氧化物质(如过氧化物、一氧化氮、氧自由基等)的侵害,这些氧化物质被称为活性氧(ROS)。自由基的产生和抗氧化防御是人体中的相反因素,它们之间的平衡对于维持身体健康是必要的。不健康的生活习惯或年龄的增长会打破这种平衡,导致ROS多于抗氧化剂,从而损害健康。一般来说,抗氧化机制是抗氧化分子与ROS在单电子反应中的结合。创建计算模型以快速识别抗氧化候选物对于支持实验室中的抗氧化检测实验至关重要。在本研究中,我们从一组测序数据基准集中提出了一种基于机器学习的模型用于此预测目的。实验在训练过程中采用10折交叉验证进行,并通过三个不同的独立数据集进行验证。在一组最优的序列特征上评估了不同的机器学习和深度学习算法。其中,随机森林被确定为识别抗氧化蛋白性能最高的最佳模型。我们的最优模型在训练数据集上识别抗氧化蛋白时达到了84.6%的高精度,以及灵敏度(81.5%)和特异性(85.1%)的平衡。与之前发表的关于抗氧化蛋白识别的工作相比,不同独立数据集的性能结果也显示了我们模型的重要性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6d27/7599600/d30748a2468a/biology-09-00325-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6d27/7599600/678679f35196/biology-09-00325-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6d27/7599600/106290f7a473/biology-09-00325-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6d27/7599600/d13b72e59765/biology-09-00325-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6d27/7599600/d30748a2468a/biology-09-00325-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6d27/7599600/678679f35196/biology-09-00325-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6d27/7599600/106290f7a473/biology-09-00325-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6d27/7599600/d13b72e59765/biology-09-00325-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6d27/7599600/d30748a2468a/biology-09-00325-g004.jpg

相似文献

1
Machine Learning Model for Identifying Antioxidant Proteins Using Features Calculated from Primary Sequences.利用从一级序列计算出的特征识别抗氧化蛋白的机器学习模型
Biology (Basel). 2020 Oct 6;9(10):325. doi: 10.3390/biology9100325.
2
DP-BINDER: machine learning model for prediction of DNA-binding proteins by fusing evolutionary and physicochemical information.DP-BINDER:一种通过融合进化和物理化学信息来预测 DNA 结合蛋白的机器学习模型。
J Comput Aided Mol Des. 2019 Jul;33(7):645-658. doi: 10.1007/s10822-019-00207-x. Epub 2019 May 23.
3
Mechanisms of Oxidative Damage and Their Impact on Contracting Muscle氧化损伤机制及其对收缩肌肉的影响
4
Prediction of antioxidant proteins using hybrid feature representation method and random forest.基于混合特征表示方法和随机森林的抗氧化蛋白预测。
Genomics. 2020 Nov;112(6):4666-4674. doi: 10.1016/j.ygeno.2020.08.016. Epub 2020 Aug 17.
5
DP-AOP: A novel SVM-based antioxidant proteins identifier.DP-AOP:一种新颖的基于 SVM 的抗氧化蛋白鉴定方法。
Int J Biol Macromol. 2023 Aug 30;247:125499. doi: 10.1016/j.ijbiomac.2023.125499. Epub 2023 Jul 4.
6
CRlncRC: a machine learning-based method for cancer-related long noncoding RNA identification using integrated features.CRlncRC:一种基于机器学习的方法,利用整合特征识别癌症相关长链非编码RNA
BMC Med Genomics. 2018 Dec 31;11(Suppl 6):120. doi: 10.1186/s12920-018-0436-9.
7
PeNGaRoo, a combined gradient boosting and ensemble learning framework for predicting non-classical secreted proteins.PeNGaRoo,一种组合梯度提升和集成学习框架,用于预测非经典分泌蛋白。
Bioinformatics. 2020 Feb 1;36(3):704-712. doi: 10.1093/bioinformatics/btz629.
8
Machine learning models to predict the progression from early to late stages of papillary renal cell carcinoma.机器学习模型预测肾乳头状细胞癌早期到晚期的进展。
Comput Biol Med. 2018 Sep 1;100:92-99. doi: 10.1016/j.compbiomed.2018.06.030. Epub 2018 Jun 28.
9
Machine learning-based antioxidant protein identification model: Progress and evaluation.基于机器学习的抗氧化蛋白鉴定模型:进展与评估。
J Cell Biochem. 2023 Nov;124(11):1825-1834. doi: 10.1002/jcb.30491. Epub 2023 Oct 25.
10
Identifying Antioxidant Proteins by Using Amino Acid Composition and Protein-Protein Interactions.利用氨基酸组成和蛋白质-蛋白质相互作用鉴定抗氧化蛋白
Front Cell Dev Biol. 2020 Oct 29;8:591487. doi: 10.3389/fcell.2020.591487. eCollection 2020.

引用本文的文献

1
Oxidative stress in cancer: from tumor and microenvironment remodeling to therapeutic frontiers.癌症中的氧化应激:从肿瘤与微环境重塑到治疗前沿
Mol Cancer. 2025 Aug 22;24(1):219. doi: 10.1186/s12943-025-02375-x.
2
Potential Mechanisms of Lactate Dehydrogenase and Bovine Serum Albumin Proteins as Antioxidants: A Mixed Experimental-Computational Study.乳酸脱氢酶和牛血清白蛋白作为抗氧化剂的潜在机制:一项实验与计算相结合的研究
Biochem Res Int. 2025 Feb 10;2025:9638644. doi: 10.1155/bri/9638644. eCollection 2025.
3
Smart estimation of protective antioxidant enzymes' activity in savory (Satureja rechingeri L.) under drought stress and soil amendments.

本文引用的文献

1
XGBoost Improves Classification of MGMT Promoter Methylation Status in IDH1 Wildtype Glioblastoma.XGBoost算法改善了异柠檬酸脱氢酶1(IDH1)野生型胶质母细胞瘤中O6-甲基鸟嘌呤-DNA甲基转移酶(MGMT)启动子甲基化状态的分类。
J Pers Med. 2020 Sep 15;10(3):128. doi: 10.3390/jpm10030128.
2
Identifying Antioxidant Proteins by Combining Multiple Methods.通过多种方法结合鉴定抗氧化蛋白
Front Bioeng Biotechnol. 2020 Jul 23;8:858. doi: 10.3389/fbioe.2020.00858. eCollection 2020.
3
Using deep neural networks and biological subwords to detect protein S-sulfenylation sites.
干旱胁迫和土壤改良条件下香薄荷(Satureja rechingeri L.)中保护性抗氧化酶活性的智能估算
BMC Plant Biol. 2025 Jan 6;25(1):19. doi: 10.1186/s12870-024-06044-x.
4
Machine Learning Framework for Conotoxin Class and Molecular Target Prediction.用于 Conotoxin 类和分子靶标预测的机器学习框架。
Toxins (Basel). 2024 Nov 3;16(11):475. doi: 10.3390/toxins16110475.
5
StackedEnC-AOP: prediction of antioxidant proteins using transform evolutionary and sequential features based multi-scale vector with stacked ensemble learning.StackedEnC-AOP:基于多尺度向量的转换进化和序列特征与堆叠集成学习预测抗氧化蛋白。
BMC Bioinformatics. 2024 Aug 4;25(1):256. doi: 10.1186/s12859-024-05884-6.
6
Conotoxin Prediction: New Features to Increase Prediction Accuracy.毒素预测:提高预测准确性的新特征。
Toxins (Basel). 2023 Nov 3;15(11):641. doi: 10.3390/toxins15110641.
7
ROSes-FINDER: a multi-task deep learning framework for accurate prediction of microorganism reactive oxygen species scavenging enzymes.ROSes-FINDER:一种用于准确预测微生物活性氧清除酶的多任务深度学习框架。
Front Microbiol. 2023 Sep 7;14:1245805. doi: 10.3389/fmicb.2023.1245805. eCollection 2023.
8
Identification of S1PR4 as an immune modulator for favorable prognosis in HNSCC through machine learning.通过机器学习鉴定S1PR4作为头颈部鳞状细胞癌预后良好的免疫调节剂。
iScience. 2023 Aug 19;26(9):107693. doi: 10.1016/j.isci.2023.107693. eCollection 2023 Sep 15.
9
Bridging artificial intelligence and fucoxanthin for the recovery and quantification from microalgae.将人工智能和岩藻黄质结合起来,从微藻中回收和定量分析。
Bioengineered. 2023 Dec;14(1):2244232. doi: 10.1080/21655979.2023.2244232.
10
Predicting delayed methotrexate elimination in pediatric acute lymphoblastic leukemia patients: an innovative web-based machine learning tool developed through a multicenter, retrospective analysis.预测儿科急性淋巴细胞白血病患者甲氨蝶呤清除延迟:一项通过多中心回顾性分析开发的创新型基于网络的机器学习工具。
BMC Med Inform Decis Mak. 2023 Aug 3;23(1):148. doi: 10.1186/s12911-023-02248-7.
利用深度神经网络和生物子词检测蛋白质 S-亚磺化位点。
Brief Bioinform. 2021 May 20;22(3). doi: 10.1093/bib/bbaa128.
4
AOPs-SVM: A Sequence-Based Classifier of Antioxidant Proteins Using a Support Vector Machine.AOPs-SVM:一种基于序列的使用支持向量机的抗氧化蛋白分类器。
Front Bioeng Biotechnol. 2019 Sep 18;7:224. doi: 10.3389/fbioe.2019.00224. eCollection 2019.
5
iLearn: an integrated platform and meta-learner for feature engineering, machine-learning analysis and modeling of DNA, RNA and protein sequence data.iLearn:一个集成平台和元学习者,用于 DNA、RNA 和蛋白质序列数据的特征工程、机器学习分析和建模。
Brief Bioinform. 2020 May 21;21(3):1047-1057. doi: 10.1093/bib/bbz041.
6
Prediction of antioxidant proteins by incorporating statistical moments based features into Chou's PseAAC.基于统计矩特征的 Chou's PseAAC 算法预测抗氧化蛋白
J Theor Biol. 2019 Jul 21;473:1-8. doi: 10.1016/j.jtbi.2019.04.019. Epub 2019 Apr 18.
7
SeqSVM: A Sequence-Based Support Vector Machine Method for Identifying Antioxidant Proteins.SeqSVM:一种基于序列的支持向量机方法,用于识别抗氧化蛋白。
Int J Mol Sci. 2018 Jun 15;19(6):1773. doi: 10.3390/ijms19061773.
8
iFeature: a Python package and web server for features extraction and selection from protein and peptide sequences.iFeature:一个用于从蛋白质和肽序列中提取和选择特征的 Python 包和网络服务器。
Bioinformatics. 2018 Jul 15;34(14):2499-2502. doi: 10.1093/bioinformatics/bty140.
9
Endogenous non-enzymatic antioxidants in the human body.人体内的内源性非酶抗氧化剂。
Adv Med Sci. 2018 Mar;63(1):68-78. doi: 10.1016/j.advms.2017.05.005. Epub 2017 Aug 17.
10
Oxidative Stress, Inflammation, and Vascular Aging in Hypertension.高血压中的氧化应激、炎症与血管衰老
Hypertension. 2017 Oct;70(4):660-667. doi: 10.1161/HYPERTENSIONAHA.117.07802. Epub 2017 Aug 7.