使用PENSA对生物分子构象集合进行系统分析。

Systematic analysis of biomolecular conformational ensembles with PENSA.

作者信息

Vögele Martin, Thomson Neil J, Truong Sang T, McAvity Jasper, Zachariae Ulrich, Dror Ron O

机构信息

Department of Computer Science, Stanford University, Stanford, California 94305, USA.

Department of Molecular and Cellular Physiology, Stanford University School of Medicine, Stanford, California 94305, USA.

出版信息

J Chem Phys. 2025 Jan 7;162(1). doi: 10.1063/5.0235544.

DOI:10.1063/5.0235544

PMID:39745157

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11698571/

Abstract

Atomic-level simulations are widely used to study biomolecules and their dynamics. A common goal in such studies is to compare simulations of a molecular system under several conditions-for example, with various mutations or bound ligands-in order to identify differences between the molecular conformations adopted under these conditions. However, the large amount of data produced by simulations of ever larger and more complex systems often renders it difficult to identify the structural features that are relevant to a particular biochemical phenomenon. We present a flexible software package named Python ENSemble Analysis (PENSA) that enables a comprehensive and thorough investigation into biomolecular conformational ensembles. It provides featurization and feature transformations that allow for a complete representation of biomolecules such as proteins and nucleic acids, including water and ion binding sites, thus avoiding the bias that would come with manual feature selection. PENSA implements methods to systematically compare the distributions of molecular features across ensembles to find the significant differences between them and identify regions of interest. It also includes a novel approach to quantify the state-specific information between two regions of a biomolecule, which allows, for example, tracing information flow to identify allosteric pathways. PENSA also comes with convenient tools for loading data and visualizing results, making them quick to process and easy to interpret. PENSA is an open-source Python library maintained at https://github.com/drorlab/pensa along with an example workflow and a tutorial. We demonstrate its usefulness in real-world examples by showing how it helps us determine molecular mechanisms efficiently.

摘要

原子水平的模拟被广泛用于研究生物分子及其动力学。此类研究的一个常见目标是比较分子系统在几种条件下的模拟结果——例如，具有各种突变或结合配体的情况——以便识别在这些条件下所采用的分子构象之间的差异。然而，由越来越大且越来越复杂的系统的模拟产生的大量数据常常使得难以识别与特定生化现象相关的结构特征。我们提出了一个名为Python集成分析（PENSA）的灵活软件包，它能够对生物分子构象集合进行全面而深入的研究。它提供了特征化和特征转换，能够完整地表示蛋白质和核酸等生物分子，包括水和离子结合位点，从而避免了手动特征选择可能带来的偏差。PENSA实现了系统比较集合中分子特征分布的方法，以找到它们之间的显著差异并识别感兴趣的区域。它还包括一种新颖的方法来量化生物分子两个区域之间的状态特异性信息，例如，这允许追踪信息流以识别变构途径。PENSA还附带了用于加载数据和可视化结果的便捷工具，使其处理快速且易于解释。PENSA是一个开源的Python库，可在https://github.com/drorlab/pensa上获取，同时还有一个示例工作流程和教程。我们通过展示它如何帮助我们高效地确定分子机制，在实际例子中证明了它的实用性。

相似文献

Systematic analysis of biomolecular conformational ensembles with PENSA.使用PENSA对生物分子构象集合进行系统分析。

J Chem Phys. 2025 Jan 7;162(1). doi: 10.1063/5.0235544.

Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.系统性药理学治疗慢性斑块状银屑病：网络荟萃分析。

Cochrane Database Syst Rev. 2021 Apr 19;4(4):CD011535. doi: 10.1002/14651858.CD011535.pub4.

Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.慢性斑块状银屑病的全身药理学治疗：一项网状Meta分析。

Cochrane Database Syst Rev. 2020 Jan 9;1(1):CD011535. doi: 10.1002/14651858.CD011535.pub3.

Adapting Safety Plans for Autistic Adults with Involvement from the Autism Community.在自闭症群体的参与下为成年自闭症患者调整安全计划。

Autism Adulthood. 2025 May 28;7(3):293-302. doi: 10.1089/aut.2023.0124. eCollection 2025 Jun.

Survivor, family and professional experiences of psychosocial interventions for sexual abuse and violence: a qualitative evidence synthesis.性虐待和暴力的心理社会干预的幸存者、家庭和专业人员的经验：定性证据综合。

Cochrane Database Syst Rev. 2022 Oct 4;10(10):CD013648. doi: 10.1002/14651858.CD013648.pub2.

Immunogenicity and seroefficacy of pneumococcal conjugate vaccines: a systematic review and network meta-analysis.肺炎球菌结合疫苗的免疫原性和血清效力：系统评价和网络荟萃分析。

Health Technol Assess. 2024 Jul;28(34):1-109. doi: 10.3310/YWHA3079.

Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.慢性斑块状银屑病的全身药理学治疗：一项网状荟萃分析。

Cochrane Database Syst Rev. 2017 Dec 22;12(12):CD011535. doi: 10.1002/14651858.CD011535.pub2.

Nivolumab for adults with Hodgkin's lymphoma (a rapid review using the software RobotReviewer).纳武单抗用于成人霍奇金淋巴瘤（使用RobotReviewer软件进行的快速综述）

Cochrane Database Syst Rev. 2018 Jul 12;7(7):CD012556. doi: 10.1002/14651858.CD012556.pub2.

Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。

Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.

The quantity, quality and findings of network meta-analyses evaluating the effectiveness of GLP-1 RAs for weight loss: a scoping review.评估胰高血糖素样肽-1受体激动剂（GLP-1 RAs）减肥效果的网状Meta分析的数量、质量及结果：一项范围综述

Health Technol Assess. 2025 Jun 25:1-73. doi: 10.3310/SKHT8119.

引用本文的文献

A cautious user's guide in applying HMMs to physical systems.一份关于将隐马尔可夫模型应用于物理系统的谨慎用户指南。

ArXiv. 2025 Jun 6:arXiv:2506.05707v1.

Machine Learning of Molecular Dynamics Simulations Provides Insights into the Modulation of Viral Capsid Assembly.分子动力学模拟的机器学习为病毒衣壳组装的调控提供了见解。

J Chem Inf Model. 2025 May 26;65(10):4844-4853. doi: 10.1021/acs.jcim.5c00274. Epub 2025 May 8.

Can Deep Learning Blind Docking Methods be Used to Predict Allosteric Compounds?深度学习盲对接方法可用于预测变构化合物吗？

J Chem Inf Model. 2025 Apr 14;65(7):3737-3748. doi: 10.1021/acs.jcim.5c00331. Epub 2025 Apr 1.

In silico characterization of the gating and selectivity mechanism of the human TPC2 cation channel.人TPC2阳离子通道门控和选择性机制的计算机模拟表征

J Gen Physiol. 2025 May 5;157(3). doi: 10.1085/jgp.202313506. Epub 2025 Feb 21.

PEG-mCherry interactions beyond classical macromolecular crowding.聚乙二醇（PEG）与单体红色荧光蛋白（mCherry）的相互作用超越了经典的大分子拥挤效应。

Protein Sci. 2025 Mar;34(3):e5235. doi: 10.1002/pro.5235.

本文引用的文献

Mechanism of negative μ-opioid receptor modulation by sodium ions.钠离子对μ-阿片受体负性调节的机制。

Structure. 2025 Jan 2;33(1):196-205.e2. doi: 10.1016/j.str.2024.10.023. Epub 2024 Nov 12.

EnGens: a computational framework for generation and analysis of representative protein conformational ensembles.EnGens：用于生成和分析代表性蛋白质构象集合的计算框架。

Brief Bioinform. 2023 Jul 20;24(4). doi: 10.1093/bib/bbad242.

A cooperative knock-on mechanism underpins Ca2+-selective cation permeation in TRPV channels.协同敲入机制为 TRPV 通道中钙离子选择性阳离子渗透提供了基础。

J Gen Physiol. 2023 May 1;155(5). doi: 10.1085/jgp.202213226. Epub 2023 Mar 21.

GPCR systems pharmacology: a different perspective on the development of biased therapeutics.GPCR 系统药理学：偏向性治疗药物开发的新视角。

Am J Physiol Cell Physiol. 2022 May 1;322(5):C887-C895. doi: 10.1152/ajpcell.00449.2021. Epub 2022 Feb 23.

GPCR activation mechanisms across classes and macro/microscales.跨类和宏/微观尺度的 G 蛋白偶联受体激活机制。

Nat Struct Mol Biol. 2021 Nov;28(11):879-888. doi: 10.1038/s41594-021-00674-7. Epub 2021 Nov 10.

Naturally Occurring Genetic Variants in the Oxytocin Receptor Alter Receptor Signaling Profiles.催产素受体中的自然发生的基因变异改变受体信号转导谱。

ACS Pharmacol Transl Sci. 2021 Sep 8;4(5):1543-1555. doi: 10.1021/acsptsci.1c00095. eCollection 2021 Oct 8.

Time-Lagged Independent Component Analysis of Random Walks and Protein Dynamics.随机漫步和蛋白质动力学的时滞独立成分分析。

J Chem Theory Comput. 2021 Sep 14;17(9):5766-5776. doi: 10.1021/acs.jctc.1c00273. Epub 2021 Aug 27.

Extended magnesium and calcium force field parameters for accurate ion-nucleic acid interactions in biomolecular simulations.扩展镁和钙的力场参数以实现生物分子模拟中离子-核酸相互作用的精确模拟。

J Chem Phys. 2021 May 7;154(17):171102. doi: 10.1063/5.0048113.

Deep learning the structural determinants of protein biochemical properties by comparing structural ensembles with DiffNets.通过将结构集合与 DiffNets 进行比较，深度学习蛋白质生化性质的结构决定因素。

Nat Commun. 2021 May 21;12(1):3023. doi: 10.1038/s41467-021-23246-1.

ProDy 2.0: increased scale and scope after 10 years of protein dynamics modelling with Python.ProDy 2.0：使用Python进行蛋白质动力学建模10年后规模和范围的扩大

Bioinformatics. 2021 Oct 25;37(20):3657-3659. doi: 10.1093/bioinformatics/btab187.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验