• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

MEDUSA:从序列预测蛋白质柔性。

MEDUSA: Prediction of Protein Flexibility from Sequence.

机构信息

Université de Paris, Inserm UMR_S 1134 - BIGR, INTS, 6 rue Alexandre Cabanel, 75015 Paris, France; Laboratoire d'Excellence GR-Ex, 75015 Paris, France.

Université de Paris, Inserm UMR_S 1134 - BIGR, INTS, 6 rue Alexandre Cabanel, 75015 Paris, France; Laboratoire d'Excellence GR-Ex, 75015 Paris, France.

出版信息

J Mol Biol. 2021 May 28;433(11):166882. doi: 10.1016/j.jmb.2021.166882. Epub 2021 Feb 20.

DOI:10.1016/j.jmb.2021.166882
PMID:33972018
Abstract

Information on the protein flexibility is essential to understand crucial molecular mechanisms such as protein stability, interactions with other molecules and protein functions in general. B-factor obtained in the X-ray crystallography experiments is the most common flexibility descriptor available for the majority of the resolved protein structures. Since the gap between the number of the resolved protein structures and available protein sequences is continuously growing, it is important to provide computational tools for protein flexibility prediction from amino acid sequence. In the current study, we report a Deep Learning based protein flexibility prediction tool MEDUSA (https://www.dsimb.inserm.fr/MEDUSA). MEDUSA uses evolutionary information extracted from protein homologous sequences and amino acid physico-chemical properties as input for a convolutional neural network to assign a flexibility class to each protein sequence position. Trained on a non-redundant dataset of X-ray structures, MEDUSA provides flexibility prediction in two, three and five classes. MEDUSA is freely available as a web-server providing a clear visualization of the prediction results as well as a standalone utility (https://github.com/DSIMB/medusa). Analysis of the MEDUSA output allows a user to identify the potentially highly deformable protein regions and general dynamic properties of the protein.

摘要

蛋白质柔性的信息对于理解关键的分子机制至关重要,如蛋白质稳定性、与其他分子的相互作用以及蛋白质的一般功能。在 X 射线晶体学实验中获得的 B 因子是大多数已解析蛋白质结构中最常用的柔性描述符。由于已解析的蛋白质结构数量与可用的蛋白质序列之间的差距不断扩大,因此提供从氨基酸序列预测蛋白质柔性的计算工具非常重要。在本研究中,我们报告了一种基于深度学习的蛋白质柔性预测工具 MEDUSA(https://www.dsimb.inserm.fr/MEDUSA)。MEDUSA 使用从蛋白质同源序列中提取的进化信息和氨基酸物理化学性质作为输入,用于卷积神经网络,为每个蛋白质序列位置分配一个柔性类别。经过 X 射线结构的非冗余数据集训练,MEDUSA 提供了两类、三类和五类的柔性预测。MEDUSA 作为一个网络服务器免费提供,它提供了预测结果的清晰可视化,以及一个独立的实用程序(https://github.com/DSIMB/medusa)。对 MEDUSA 输出的分析允许用户识别潜在的高度可变形蛋白质区域和蛋白质的一般动态特性。

相似文献

1
MEDUSA: Prediction of Protein Flexibility from Sequence.MEDUSA:从序列预测蛋白质柔性。
J Mol Biol. 2021 May 28;433(11):166882. doi: 10.1016/j.jmb.2021.166882. Epub 2021 Feb 20.
2
Medusa: Software to build and analyze ensembles of genome-scale metabolic network reconstructions.美杜莎:用于构建和分析基因组规模代谢网络重建集合的软件。
PLoS Comput Biol. 2020 Apr 29;16(4):e1007847. doi: 10.1371/journal.pcbi.1007847. eCollection 2020 Apr.
3
FLEXc: protein flexibility prediction using context-based statistics, predicted structural features, and sequence information.FLEXc:利用基于上下文的统计信息、预测的结构特征和序列信息进行蛋白质柔性预测。
BMC Bioinformatics. 2016 Aug 31;17 Suppl 8(Suppl 8):281. doi: 10.1186/s12859-016-1117-3.
4
PredyFlexy: flexibility and local structure prediction from sequence.PredyFlexy:基于序列的柔性和局部结构预测。
Nucleic Acids Res. 2012 Jul;40(Web Server issue):W317-22. doi: 10.1093/nar/gks482. Epub 2012 Jun 11.
5
ProtDCal-Suite: A web server for the numerical codification and functional analysis of proteins.ProtDCal-Suite:一个用于蛋白质数值编码和功能分析的网络服务器。
Protein Sci. 2019 Sep;28(9):1734-1743. doi: 10.1002/pro.3673.
6
PATSIM: Prediction and analysis of protein sequences using hybrid Knuth-Morris Pratt (KMP) and Boyer-Moore (BM) algorithm.PATSIM:使用混合的克努特-莫里斯-普拉特(KMP)算法和博耶-摩尔(BM)算法对蛋白质序列进行预测和分析。
Gene. 2018 May 30;657:50-59. doi: 10.1016/j.gene.2018.02.069. Epub 2018 Mar 6.
7
PiPred - a deep-learning method for prediction of π-helices in protein sequences.PiPred-一种用于预测蛋白质序列中π-螺旋的深度学习方法。
Sci Rep. 2019 May 3;9(1):6888. doi: 10.1038/s41598-019-43189-4.
8
DeepMito: accurate prediction of protein sub-mitochondrial localization using convolutional neural networks.DeepMito:使用卷积神经网络准确预测蛋白质亚线粒体定位
Bioinformatics. 2020 Jan 1;36(1):56-64. doi: 10.1093/bioinformatics/btz512.
9
MEDUSA: large scale automatic selection and visual assessment of PCR primer pairs.美杜莎:PCR引物对的大规模自动筛选与可视化评估
Bioinformatics. 2001 Jul;17(7):656-7. doi: 10.1093/bioinformatics/17.7.656.
10
Protein secondary structure prediction improved by recurrent neural networks integrated with two-dimensional convolutional neural networks.通过与二维卷积神经网络集成的循环神经网络改进蛋白质二级结构预测。
J Bioinform Comput Biol. 2018 Oct;16(5):1850021. doi: 10.1142/S021972001850021X.

引用本文的文献

1
Cytoplasmic Isocitrate Dehydrogenase Neomorphic G98N and R133H Mutants Produce the Oncometabolite 2-Hydroxyglutarate.细胞质异柠檬酸脱氢酶的新形态G98N和R133H突变体产生致癌代谢物2-羟基戊二酸。
Int J Mol Sci. 2025 Aug 25;26(17):8238. doi: 10.3390/ijms26178238.
2
PEGASUS: Prediction of MD-derived protein flexibility from sequence.PEGASUS:从序列预测基于分子动力学的蛋白质柔韧性
Protein Sci. 2025 Aug;34(8):e70221. doi: 10.1002/pro.70221.
3
Persistent Sheaf Laplacian Analysis of Protein Flexibility.蛋白质柔韧性的持久层拉普拉斯分析
J Phys Chem B. 2025 May 1;129(17):4169-4178. doi: 10.1021/acs.jpcb.5c01287. Epub 2025 Apr 22.
4
Virtual Screening and Molecular Dynamics of Cytokine-Drug Complexes for Atherosclerosis Therapy.用于动脉粥样硬化治疗的细胞因子-药物复合物的虚拟筛选与分子动力学
Int J Mol Sci. 2025 Mar 24;26(7):2931. doi: 10.3390/ijms26072931.
5
Multiscale Differential Geometry Learning for Protein Flexibility Analysis.用于蛋白质灵活性分析的多尺度微分几何学习
J Comput Chem. 2025 Mar 15;46(7):e70073. doi: 10.1002/jcc.70073.
6
The 'very moment' when UDG recognizes a flipped-out uracil base in dsDNA.尿嘧啶DNA糖苷酶(UDG)识别双链DNA中翻转出来的尿嘧啶碱基的“那一刻”。
Sci Rep. 2025 Mar 7;15(1):7993. doi: 10.1038/s41598-025-91705-6.
7
mdCATH: A Large-Scale MD Dataset for Data-Driven Computational Biophysics.mdCATH:一个用于数据驱动计算生物物理学的大规模分子动力学数据集。
ArXiv. 2024 Dec 3:arXiv:2407.14794v2.
8
mdCATH: A Large-Scale MD Dataset for Data-Driven Computational Biophysics.mdCATH:用于数据驱动计算生物物理学的大规模 MD 数据集。
Sci Data. 2024 Nov 28;11(1):1299. doi: 10.1038/s41597-024-04140-z.
9
Assessment of Four Theoretical Approaches to Predict Protein Flexibility in the Crystal Phase and Solution.评估四种理论方法预测晶体相和溶液中蛋白质柔性。
J Chem Theory Comput. 2024 Sep 10;20(17):7667-7681. doi: 10.1021/acs.jctc.4c00754. Epub 2024 Aug 22.
10
In silico functional, structural and pathogenicity analysis of missense single nucleotide polymorphisms in human MCM6 gene.在人 MCM6 基因中单核苷酸错义多态性的计算机功能、结构和致病性分析。
Sci Rep. 2024 May 21;14(1):11607. doi: 10.1038/s41598-024-62299-2.