• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

PySFD:从多个模拟集合中检测到的显著特征差异中获得全面的分子见解。

PySFD: comprehensive molecular insights from significant feature differences detected among many simulated ensembles.

机构信息

Department of Mathematics and Computer Science, Computational Molecular Biology Group, Arnimallee 6, 14195 Berlin, Germany.

出版信息

Bioinformatics. 2019 May 1;35(9):1588-1590. doi: 10.1093/bioinformatics/bty818.

DOI:10.1093/bioinformatics/bty818
PMID:30247628
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6499238/
Abstract

MOTIVATION

Many modeling analyses of molecular dynamics (MD) simulations are based on a definition of states that can be (groups of) clusters of simulation frames in a feature space composed of molecular coordinates. With increasing dimension of this feature space (due to the increasing size or complexity of a simulated molecule), it becomes very difficult to cluster the underlying MD data and estimate a statistically robust model. To mitigate this "curse of dimensionality", one can reduce the feature space, e.g., with principal component or time-lagged independent component analysis transformations, focusing the analysis on the most important modes of transitions. In practice, however, all these reduction strategies may neglect important molecular details that are susceptible to experimental verification.

RESULTS

To recover such molecular details, I have developed PySFD (Significant Feature Differences analyzer for Python), a multi-processing software package that efficiently selects significantly different features of any user-defined feature type among potentially many different simulated state ensembles, such as meta-stable states of a Markov State Model (MSM). Applying PySFD on MSMs of an aggregate of 300 microseconds MD simulations recently performed on the major histocompatibility complex class II (MHCII) protein, I demonstrate how this toolkit can extract and visualize valuable mechanistic information from big MD simulation data, e.g., in form of networks of dynamic interaction changes connecting functionally relevant sites of a protein complex.

AVAILABILITY AND IMPLEMENTATION

PySFD is freely available under the L-GPL license at https://github.com/markovmodel/PySFD.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

许多分子动力学 (MD) 模拟的建模分析都是基于状态的定义,这些状态可以是分子坐标构成的特征空间中的模拟帧 (帧群)。随着特征空间维度的增加(由于模拟分子的尺寸或复杂性增加),对基础 MD 数据进行聚类和估计具有稳健统计模型变得非常困难。为了缓解这种“维度诅咒”,可以减少特征空间,例如通过主成分或时滞独立成分分析变换,将分析重点放在最重要的转变模式上。然而,在实践中,所有这些降维策略都可能忽略了易于实验验证的重要分子细节。

结果

为了恢复这些分子细节,我开发了 PySFD(用于 Python 的显著特征差异分析器),这是一个多处理软件包,可以有效地选择任何用户定义的特征类型之间的潜在许多不同模拟状态集合中的显著不同的特征,例如马尔可夫状态模型 (MSM) 的亚稳态。在最近对主要组织相容性复合物 II (MHCII) 蛋白进行的 300 微秒 MD 模拟的 MSM 上应用 PySFD,我展示了这个工具包如何从大型 MD 模拟数据中提取和可视化有价值的机制信息,例如以连接蛋白质复合物功能相关位点的动态相互作用变化网络的形式。

可用性和实现

PySFD 可在 L-GPL 许可证下免费获得,网址为 https://github.com/markovmodel/PySFD。

补充信息

补充数据可在 Bioinformatics 在线获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/55da/6499238/65d7db273920/bty818f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/55da/6499238/65d7db273920/bty818f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/55da/6499238/65d7db273920/bty818f1.jpg

相似文献

1
PySFD: comprehensive molecular insights from significant feature differences detected among many simulated ensembles.PySFD:从多个模拟集合中检测到的显著特征差异中获得全面的分子见解。
Bioinformatics. 2019 May 1;35(9):1588-1590. doi: 10.1093/bioinformatics/bty818.
2
FATSLiM: a fast and robust software to analyze MD simulations of membranes.FATSLiM:一款用于分析膜的分子动力学模拟的快速且强大的软件。
Bioinformatics. 2017 Jan 1;33(1):133-134. doi: 10.1093/bioinformatics/btw563. Epub 2016 Aug 29.
3
MasterMSM: A Package for Constructing Master Equation Models of Molecular Dynamics.MasterMSM:用于构建分子动力学主方程模型的软件包。
J Chem Inf Model. 2019 Sep 23;59(9):3625-3629. doi: 10.1021/acs.jcim.9b00468. Epub 2019 Aug 19.
4
MD DaVis: interactive data visualization of protein molecular dynamics.MD DaVis:蛋白质分子动力学的交互式数据可视化。
Bioinformatics. 2022 Jun 13;38(12):3299-3301. doi: 10.1093/bioinformatics/btac314.
5
PyFeat: a Python-based effective feature generation tool for DNA, RNA and protein sequences.PyFeat:一个基于 Python 的用于 DNA、RNA 和蛋白质序列的有效特征生成工具。
Bioinformatics. 2019 Oct 1;35(19):3831-3833. doi: 10.1093/bioinformatics/btz165.
6
MODE-TASK: large-scale protein motion tools.模式任务:大规模蛋白质运动工具。
Bioinformatics. 2018 Nov 1;34(21):3759-3763. doi: 10.1093/bioinformatics/bty427.
7
gmxapi: a high-level interface for advanced control and extension of molecular dynamics simulations.gmxapi:高级接口,用于对分子动力学模拟进行高级控制和扩展。
Bioinformatics. 2018 Nov 15;34(22):3945-3947. doi: 10.1093/bioinformatics/bty484.
8
MDContactCom: a tool to identify differences of protein molecular dynamics from two MD simulation trajectories in terms of interresidue contacts.MDContactCom:一种工具,可用于根据残基间的接触来识别两个 MD 模拟轨迹中蛋白质分子动力学的差异。
Bioinformatics. 2021 Dec 22;38(1):273-274. doi: 10.1093/bioinformatics/btab538.
9
MDSCAN: RMSD-based HDBSCAN clustering of long molecular dynamics.MDSCAN:基于均方根偏差的长分子动力学HDBSCAN聚类
Bioinformatics. 2022 Nov 30;38(23):5191-5198. doi: 10.1093/bioinformatics/btac666.
10
RapidRMSD: rapid determination of RMSDs corresponding to motions of flexible molecules.RapidRMSD:对应柔性分子运动的 RMSD 的快速确定。
Bioinformatics. 2018 Aug 15;34(16):2757-2765. doi: 10.1093/bioinformatics/bty160.

引用本文的文献

1
MHC-II dynamics are maintained in HLA-DR allotypes to ensure catalyzed peptide exchange.MHC-II 动力学在 HLA-DR 同种异型中得以维持,以确保催化肽交换。
Nat Chem Biol. 2023 Oct;19(10):1196-1204. doi: 10.1038/s41589-023-01316-3. Epub 2023 May 4.

本文引用的文献

1
pyHVis3D: visualising molecular simulation deduced H-bond networks in 3D: application to T-cell receptor interactions.pyHVis3D:在 3D 中可视化分子模拟推断的氢键网络:在 T 细胞受体相互作用中的应用。
Bioinformatics. 2018 Jun 1;34(11):1941-1943. doi: 10.1093/bioinformatics/btx842.
2
Global Langevin model of multidimensional biomolecular dynamics.多维生物分子动力学的全局朗之万模型
J Chem Phys. 2016 Nov 14;145(18):184114. doi: 10.1063/1.4967341.
3
MHC class II complexes sample intermediate states along the peptide exchange pathway.
MHC Ⅱ类分子复合物沿着肽交换途径取样中间状态。
Nat Commun. 2016 Nov 9;7:13224. doi: 10.1038/ncomms13224.
4
Hierarchical Time-Lagged Independent Component Analysis: Computing Slow Modes and Reaction Coordinates for Large Molecular Systems.分层时间滞后独立成分分析:计算大分子系统的慢模式和反应坐标
J Chem Theory Comput. 2016 Dec 13;12(12):6118-6129. doi: 10.1021/acs.jctc.6b00738. Epub 2016 Nov 22.
5
Computational approaches to detect allosteric pathways in transmembrane molecular machines.检测跨膜分子机器中变构途径的计算方法。
Biochim Biophys Acta. 2016 Jul;1858(7 Pt B):1652-62. doi: 10.1016/j.bbamem.2016.01.010. Epub 2016 Jan 22.
6
Automated Event Detection and Activity Monitoring in Long Molecular Dynamics Simulations.长分子动力学模拟中的自动事件检测与活动监测
J Chem Theory Comput. 2009 Oct 13;5(10):2595-605. doi: 10.1021/ct900229u.
7
On-the-Fly Learning and Sampling of Ligand Binding by High-Throughput Molecular Simulations.通过高通量分子模拟实现配体结合的即时学习与采样
J Chem Theory Comput. 2014 May 13;10(5):2064-9. doi: 10.1021/ct400919u. Epub 2014 Apr 4.
8
Protein conformational plasticity and complex ligand-binding kinetics explored by atomistic simulations and Markov models.通过原子模拟和马尔可夫模型探索蛋白质构象的可塑性和复杂配体结合的动力学。
Nat Commun. 2015 Jul 2;6:7653. doi: 10.1038/ncomms8653.
9
Mechanism of the Association between Na+ Binding and Conformations at the Intracellular Gate in Neurotransmitter:Sodium Symporters.神经递质-钠同向转运体胞内门控处钠离子结合与构象之间的关联机制
J Biol Chem. 2015 May 29;290(22):13992-4003. doi: 10.1074/jbc.M114.625343. Epub 2015 Apr 13.
10
Allosteric signalling in the outer membrane translocation domain of PapC usher.PapC 外膜转运结构域中的变构信号传导。
Elife. 2014 Oct 28;3:e03532. doi: 10.7554/eLife.03532.