• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

SIMLIN:一种基于多阶段集成学习模型的人类蛋白质组中 S-亚磺酰化预测的生物信息学工具。

SIMLIN: a bioinformatics tool for prediction of S-sulphenylation in the human proteome based on multi-stage ensemble-learning models.

机构信息

Monash Centre for Data Science, Faculty of Information Technology, Monash University, Melbourne, VIC, 3800, Australia.

Division of Cancer Epidemiology, Cancer Council Victoria, Melbourne, VIC, 3004, Australia.

出版信息

BMC Bioinformatics. 2019 Nov 21;20(1):602. doi: 10.1186/s12859-019-3178-6.

DOI:10.1186/s12859-019-3178-6
PMID:31752668
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6868744/
Abstract

BACKGROUND

S-sulphenylation is a ubiquitous protein post-translational modification (PTM) where an S-hydroxyl (-SOH) bond is formed via the reversible oxidation on the Sulfhydryl group of cysteine (C). Recent experimental studies have revealed that S-sulphenylation plays critical roles in many biological functions, such as protein regulation and cell signaling. State-of-the-art bioinformatic advances have facilitated high-throughput in silico screening of protein S-sulphenylation sites, thereby significantly reducing the time and labour costs traditionally required for the experimental investigation of S-sulphenylation.

RESULTS

In this study, we have proposed a novel hybrid computational framework, termed SIMLIN, for accurate prediction of protein S-sulphenylation sites using a multi-stage neural-network based ensemble-learning model integrating both protein sequence derived and protein structural features. Benchmarking experiments against the current state-of-the-art predictors for S-sulphenylation demonstrated that SIMLIN delivered competitive prediction performance. The empirical studies on the independent testing dataset demonstrated that SIMLIN achieved 88.0% prediction accuracy and an AUC score of 0.82, which outperforms currently existing methods.

CONCLUSIONS

In summary, SIMLIN predicts human S-sulphenylation sites with high accuracy thereby facilitating biological hypothesis generation and experimental validation. The web server, datasets, and online instructions are freely available at http://simlin.erc.monash.edu/ for academic purposes.

摘要

背景

S-亚磺酰化是一种普遍存在的蛋白质翻译后修饰(PTM),其中通过半胱氨酸(C)的巯基的可逆氧化形成 S-羟基(-SOH)键。最近的实验研究表明,S-亚磺酰化在许多生物功能中起着关键作用,如蛋白质调节和细胞信号转导。最先进的生物信息学进展促进了蛋白质 S-亚磺酰化位点的高通量计算机筛选,从而大大减少了传统上对 S-亚磺酰化进行实验研究所需的时间和劳动力成本。

结果

在这项研究中,我们提出了一种新的混合计算框架,称为 SIMLIN,用于使用基于多阶段神经网络的集成学习模型准确预测蛋白质 S-亚磺酰化位点,该模型集成了蛋白质序列衍生和蛋白质结构特征。针对 S-亚磺酰化的最新预测器进行的基准实验表明,SIMLIN 提供了有竞争力的预测性能。在独立测试数据集上的实证研究表明,SIMLIN 实现了 88.0%的预测准确率和 0.82 的 AUC 得分,优于现有方法。

结论

总之,SIMLIN 可以高精度地预测人类 S-亚磺酰化位点,从而促进生物学假设的生成和实验验证。该网络服务器、数据集和在线说明可在学术目的免费使用:http://simlin.erc.monash.edu/。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/806c/6868744/95b7f9d52db7/12859_2019_3178_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/806c/6868744/a28d4cd4742c/12859_2019_3178_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/806c/6868744/f22a4ff0ee5c/12859_2019_3178_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/806c/6868744/64a934177530/12859_2019_3178_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/806c/6868744/aeacd581d542/12859_2019_3178_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/806c/6868744/95b7f9d52db7/12859_2019_3178_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/806c/6868744/a28d4cd4742c/12859_2019_3178_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/806c/6868744/f22a4ff0ee5c/12859_2019_3178_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/806c/6868744/64a934177530/12859_2019_3178_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/806c/6868744/aeacd581d542/12859_2019_3178_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/806c/6868744/95b7f9d52db7/12859_2019_3178_Fig5_HTML.jpg

相似文献

1
SIMLIN: a bioinformatics tool for prediction of S-sulphenylation in the human proteome based on multi-stage ensemble-learning models.SIMLIN:一种基于多阶段集成学习模型的人类蛋白质组中 S-亚磺酰化预测的生物信息学工具。
BMC Bioinformatics. 2019 Nov 21;20(1):602. doi: 10.1186/s12859-019-3178-6.
2
Positive-unlabelled learning of glycosylation sites in the human proteome.人类蛋白质组中天冬酰胺糖基化位点的阳性无标记学习。
BMC Bioinformatics. 2019 Mar 6;20(1):112. doi: 10.1186/s12859-019-2700-1.
3
Large-scale comparative assessment of computational predictors for lysine post-translational modification sites.大规模比较评估赖氨酸翻译后修饰位点的计算预测因子。
Brief Bioinform. 2019 Nov 27;20(6):2267-2290. doi: 10.1093/bib/bby089.
4
SOHPRED: a new bioinformatics tool for the characterization and prediction of human S-sulfenylation sites.SOHPRED:一种用于表征和预测人类S-亚磺酰化位点的新型生物信息学工具。
Mol Biosyst. 2016 Aug 16;12(9):2849-58. doi: 10.1039/c6mb00314a.
5
MDD-SOH: exploiting maximal dependence decomposition to identify S-sulfenylation sites with substrate motifs.MDD-SOH:利用最大依赖分解来识别具有底物基序的S-亚磺酰化位点。
Bioinformatics. 2016 Jan 15;32(2):165-72. doi: 10.1093/bioinformatics/btv558. Epub 2015 Sep 26.
6
DeepSSPred: A Deep Learning Based Sulfenylation Site Predictor Via a Novel nSegmented Optimize Federated Feature Encoder.DeepSSPred:一种基于深度学习的新型 nSegmented Optimize 联邦特征编码器的硫化位点预测器。
Protein Pept Lett. 2021;28(6):708-721. doi: 10.2174/0929866527666201202103411.
7
GlycoMine: a machine learning-based approach for predicting N-, C- and O-linked glycosylation in the human proteome.糖基分析软件(GlycoMine):一种基于机器学习的方法,用于预测人类蛋白质组中的 N-、C-和 O-糖基化。
Bioinformatics. 2015 May 1;31(9):1411-9. doi: 10.1093/bioinformatics/btu852. Epub 2015 Jan 6.
8
Comprehensive review and assessment of computational methods for predicting RNA post-transcriptional modification sites from RNA sequences.全面综述和评估基于 RNA 序列预测 RNA 转录后修饰位点的计算方法。
Brief Bioinform. 2020 Sep 25;21(5):1676-1696. doi: 10.1093/bib/bbz112.
9
nhKcr: a new bioinformatics tool for predicting crotonylation sites on human nonhistone proteins based on deep learning.nhKcr:一种基于深度学习的预测人源非组蛋白上琥珀酰化修饰位点的新生物信息学工具。
Brief Bioinform. 2021 Nov 5;22(6). doi: 10.1093/bib/bbab146.
10
DeepCSO: A Deep-Learning Network Approach to Predicting Cysteine S-Sulphenylation Sites.DeepCSO:一种用于预测半胱氨酸S-亚磺酰化位点的深度学习网络方法。
Front Cell Dev Biol. 2020 Dec 1;8:594587. doi: 10.3389/fcell.2020.594587. eCollection 2020.

引用本文的文献

1
CysModDB: a comprehensive platform with the integration of manually curated resources and analysis tools for cysteine posttranslational modifications.CysModDB:一个综合平台,整合了手动注释资源和半胱氨酸翻译后修饰分析工具。
Brief Bioinform. 2022 Nov 19;23(6). doi: 10.1093/bib/bbac460.
2
Computational prediction of species-specific yeast DNA replication origin via iterative feature representation.通过迭代特征表示计算预测物种特异性酵母 DNA 复制原点。
Brief Bioinform. 2021 Jul 20;22(4). doi: 10.1093/bib/bbaa304.
3
Computational prediction and interpretation of both general and specific types of promoters in Escherichia coli by exploiting a stacked ensemble-learning framework.

本文引用的文献

1
DeepCleave: a deep learning predictor for caspase and matrix metalloprotease substrates and cleavage sites.DeepCleave:用于半胱天冬酶和基质金属蛋白酶底物及切割位点的深度学习预测器。
Bioinformatics. 2020 Feb 15;36(4):1057-1065. doi: 10.1093/bioinformatics/btz721.
2
A comprehensive review and performance evaluation of bioinformatics tools for HLA class I peptide-binding prediction.HLA 类 I 肽结合预测的生物信息学工具的综合评价与性能评估。
Brief Bioinform. 2020 Jul 15;21(4):1119-1135. doi: 10.1093/bib/bbz051.
3
iLearn: an integrated platform and meta-learner for feature engineering, machine-learning analysis and modeling of DNA, RNA and protein sequence data.
利用堆叠集成学习框架对大肠杆菌中的一般和特定类型启动子进行计算预测和解释。
Brief Bioinform. 2021 Mar 22;22(2):2126-2140. doi: 10.1093/bib/bbaa049.
iLearn:一个集成平台和元学习者,用于 DNA、RNA 和蛋白质序列数据的特征工程、机器学习分析和建模。
Brief Bioinform. 2020 May 21;21(3):1047-1057. doi: 10.1093/bib/bbz041.
4
Encodings and models for antimicrobial peptide classification for multi-resistant pathogens.用于多重耐药病原体抗菌肽分类的编码与模型
BioData Min. 2019 Mar 4;12:7. doi: 10.1186/s13040-019-0196-x. eCollection 2019.
5
Positive-unlabelled learning of glycosylation sites in the human proteome.人类蛋白质组中天冬酰胺糖基化位点的阳性无标记学习。
BMC Bioinformatics. 2019 Mar 6;20(1):112. doi: 10.1186/s12859-019-2700-1.
6
Twenty years of bioinformatics research for protease-specific substrate and cleavage site prediction: a comprehensive revisit and benchmarking of existing methods.二十年来蛋白酶特异性底物和切割位点预测的生物信息学研究:对现有方法的全面回顾和基准测试。
Brief Bioinform. 2019 Nov 27;20(6):2150-2166. doi: 10.1093/bib/bby077.
7
Acetaminophen-induced -nitrosylation and -sulfenylation signalling in 3D cultured hepatocarcinoma cell spheroids.对乙酰氨基酚诱导的三维培养肝癌细胞球体中的亚硝基化和亚磺酰化信号传导
Toxicol Res (Camb). 2016 Mar 2;5(3):905-920. doi: 10.1039/c5tx00469a. eCollection 2016 May 1.
8
SVM-SulfoSite: A support vector machine based predictor for sulfenylation sites.SVM-SulfoSite:一种基于支持向量机的巯基化位点预测器。
Sci Rep. 2018 Jul 26;8(1):11288. doi: 10.1038/s41598-018-29126-x.
9
Quokka: a comprehensive tool for rapid and accurate prediction of kinase family-specific phosphorylation sites in the human proteome.Quokka:一种全面的工具,可快速准确地预测人类蛋白质组中激酶家族特异性磷酸化位点。
Bioinformatics. 2018 Dec 15;34(24):4223-4231. doi: 10.1093/bioinformatics/bty522.
10
PredCSO: an ensemble method for the prediction of S-sulfenylation sites in proteins.PredCSO:一种用于预测蛋白质中 S-亚磺酰化位点的集成方法。
Mol Omics. 2018 Aug 6;14(4):257-265. doi: 10.1039/c8mo00089a.