PSICalc：一种识别和排列整个蛋白质结构中关键非近端相互依赖性的新方法。

PSICalc: a novel approach to identifying and ranking critical non-proximal interdependencies within the overall protein structure.

作者信息

Townsley Thomas D, Wilson James T, Akers Harrison, Bryant Timothy, Cordova Salvador, Wallace T L, Durston Kirk K, Deweese Joseph E

机构信息

Department of Computational Sciences, College of Computing & Technology, Lipscomb University, Nashville, TN 37204, USA.

Department of Pharmaceutical Sciences, College of Pharmacy and Health Sciences, Lipscomb University, Nashville, TN 37204, USA.

出版信息

Bioinform Adv. 2022 Aug 18;2(1):vbac058. doi: 10.1093/bioadv/vbac058. eCollection 2022.

DOI:10.1093/bioadv/vbac058

PMID:36699404

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9710643/

Abstract

MOTIVATION

AlphaFold has been a major advance in predicting protein structure, but still leaves the problem of determining which sub-molecular components of a protein are essential for it to carry out its function within the cell. Direct coupling analysis predicts two- and three-amino acid contacts, but there may be essential interdependencies that are not proximal within the 3D structure. The problem to be addressed is to design a computational method that locates and ranks essential non-proximal interdependencies within a protein involving five or more amino acids, using large, multiple sequence alignments (MSAs) for both globular and intrinsically unstructured proteins.

RESULTS

We developed PSICalc (Protein Subdomain Interdependency Calculator), a laptop-friendly, pattern-discovery, bioinformatics software tool that analyzes large MSAs for both structured and unstructured proteins, locates both proximal and non-proximal inter-dependent sites, and clusters them into pairwise (second order), third-order and higher-order clusters using a k-modes approach, and provides ranked results within minutes. To aid in visualizing these interdependencies, we developed a graphical user interface that displays these subdomain relationships as a polytree graph. To demonstrate, we provide examples of both proximal and non-proximal interdependencies documented for eukaryotic topoisomerase II including between the unstructured C-terminal domain and the N-terminal domain.

AVAILABILITY AND IMPLEMENTATION

https://github.com/jdeweeselab/psicalc-package.

SUPPLEMENTARY INFORMATION

Supplementary data are available at online.

摘要

动机

AlphaFold在预测蛋白质结构方面取得了重大进展，但仍存在确定蛋白质的哪些亚分子成分对其在细胞内发挥功能至关重要的问题。直接耦合分析可预测两个和三个氨基酸之间的接触，但可能存在在三维结构中不相邻的关键相互依赖性。要解决的问题是设计一种计算方法，该方法使用针对球状蛋白和内在无序蛋白的大型多序列比对（MSA），来定位和排列蛋白质中涉及五个或更多氨基酸的关键非相邻相互依赖性。

结果

我们开发了PSICalc（蛋白质亚结构域相互依赖性计算器），这是一种便于在笔记本电脑上使用的、基于模式发现的生物信息学软件工具，可分析结构化和非结构化蛋白质的大型MSA，定位相邻和非相邻的相互依赖位点，并使用k-模式方法将它们聚类为成对（二阶）、三阶和更高阶聚类，并在几分钟内提供排名结果。为了帮助直观显示这些相互依赖性，我们开发了一个图形用户界面，将这些亚结构域关系显示为多树图。为了进行演示，我们提供了真核拓扑异构酶II中记录的相邻和非相邻相互依赖性的示例，包括在非结构化的C末端结构域和N末端结构域之间的相互依赖性。

可用性和实现方式

https://github.com/jdeweeselab/psicalc-package。

补充信息

补充数据可在网上获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8836/9710643/9dc8b0fc3652/vbac058f1.jpg

相似文献

PSICalc: a novel approach to identifying and ranking critical non-proximal interdependencies within the overall protein structure.PSICalc：一种识别和排列整个蛋白质结构中关键非近端相互依赖性的新方法。

Bioinform Adv. 2022 Aug 18;2(1):vbac058. doi: 10.1093/bioadv/vbac058. eCollection 2022.

Bioinformatic Analysis of Topoisomerase IIα Reveals Interdomain Interdependencies and Critical C-Terminal Domain Residues.拓扑异构酶 IIα 的生物信息学分析揭示了结构域间的相互依赖性和关键 C 末端结构域残基。

Int J Mol Sci. 2024 May 23;25(11):5674. doi: 10.3390/ijms25115674.

Statistical discovery of site inter-dependencies in sub-molecular hierarchical protein structuring.亚分子层次蛋白质结构中位点相互依赖性的统计发现

EURASIP J Bioinform Syst Biol. 2012 Jul 13;2012(1):8. doi: 10.1186/1687-4153-2012-8.

Highly significant improvement of protein sequence alignments with AlphaFold2.使用 AlphaFold2 大幅提高蛋白质序列比对的精确度。

Bioinformatics. 2022 Nov 15;38(22):5007-5011. doi: 10.1093/bioinformatics/btac625.

ResPRE: high-accuracy protein contact prediction by coupling precision matrix with deep residual neural networks.ResPRE：通过结合精度矩阵和深度残差神经网络进行高精度蛋白质接触预测。

Bioinformatics. 2019 Nov 1;35(22):4647-4655. doi: 10.1093/bioinformatics/btz291.

NX4: a web-based visualization of large multiple sequence alignments.NX4：一个基于网络的大型多重序列比对可视化工具。

Bioinformatics. 2019 Nov 1;35(22):4800-4802. doi: 10.1093/bioinformatics/btz457.

Folding the unfoldable: using AlphaFold to explore spurious proteins.折叠不可折叠之物：利用AlphaFold探索假蛋白

Bioinform Adv. 2022 Jan 9;2(1):vbab043. doi: 10.1093/bioadv/vbab043. eCollection 2022.

iS-CellR: a user-friendly tool for analyzing and visualizing single-cell RNA sequencing data.iS-CellR：一个用于分析和可视化单细胞 RNA 测序数据的用户友好工具。

Bioinformatics. 2018 Dec 15;34(24):4305-4306. doi: 10.1093/bioinformatics/bty517.

pydca v1.0: a comprehensive software for direct coupling analysis of RNA and protein sequences.pydca v1.0：用于 RNA 和蛋白质序列直接耦联分析的综合软件。

Bioinformatics. 2020 Apr 1;36(7):2264-2265. doi: 10.1093/bioinformatics/btz892.

MAGUS: Multiple sequence Alignment using Graph clUStering.MAGUS：基于图聚类的多重序列比对。

Bioinformatics. 2021 Jul 19;37(12):1666-1672. doi: 10.1093/bioinformatics/btaa992.

引用本文的文献

Mutagenesis of Intrinsically Disordered Domain Impacts Topoisomerase IIα Catalytic Activity.内在无序结构域的诱变影响拓扑异构酶IIα的催化活性。

Int J Mol Sci. 2025 Apr 11;26(8):3604. doi: 10.3390/ijms26083604.

Int J Mol Sci. 2024 May 23;25(11):5674. doi: 10.3390/ijms25115674.

In vivo functional phenotypes from a computational epistatic model of evolution.从进化的计算上位性模型中得出的体内功能表型。

Proc Natl Acad Sci U S A. 2024 Feb 6;121(6):e2308895121. doi: 10.1073/pnas.2308895121. Epub 2024 Jan 29.

No Time to Relax and Unwind: Exploration of Topoisomerases and a Growing Field of Study.无暇放松：拓扑异构酶的探索与不断发展的研究领域。

Int J Mol Sci. 2023 Aug 23;24(17):13080. doi: 10.3390/ijms241713080.

本文引用的文献

Exploration of the Role of the C-Terminal Domain of Human DNA Topoisomerase IIα in Catalytic Activity.人DNA拓扑异构酶IIα C末端结构域在催化活性中的作用探索

ACS Omega. 2021 Sep 30;6(40):25892-25903. doi: 10.1021/acsomega.1c02083. eCollection 2021 Oct 12.

Highly accurate protein structure prediction with AlphaFold.利用 AlphaFold 进行高精度蛋白质结构预测。

Nature. 2021 Aug;596(7873):583-589. doi: 10.1038/s41586-021-03819-2. Epub 2021 Jul 15.

Intrinsically disordered proteins and biomolecular condensates as drug targets.内在无序蛋白质和生物分子凝聚物作为药物靶点。

Curr Opin Chem Biol. 2021 Jun;62:90-100. doi: 10.1016/j.cbpa.2021.02.009. Epub 2021 Mar 31.

hoDCA: higher order direct-coupling analysis.高阶直接耦合分析（hoDCA）。

BMC Bioinformatics. 2018 Dec 29;19(1):546. doi: 10.1186/s12859-018-2583-6.

Evolution-Based Functional Decomposition of Proteins.基于进化的蛋白质功能分解

PLoS Comput Biol. 2016 Jun 2;12(6):e1004817. doi: 10.1371/journal.pcbi.1004817. eCollection 2016 Jun.

Classification of intrinsically disordered regions and proteins.内在无序区域和蛋白质的分类

Chem Rev. 2014 Jul 9;114(13):6589-631. doi: 10.1021/cr400525m. Epub 2014 Apr 29.

Statistical discovery of site inter-dependencies in sub-molecular hierarchical protein structuring.亚分子层次蛋白质结构中位点相互依赖性的统计发现

EURASIP J Bioinform Syst Biol. 2012 Jul 13;2012(1):8. doi: 10.1186/1687-4153-2012-8.

Genomics-aided structure prediction.基于基因组学的结构预测。

Proc Natl Acad Sci U S A. 2012 Jun 26;109(26):10340-5. doi: 10.1073/pnas.1207864109. Epub 2012 Jun 12.

Intrinsically unstructured proteins and their functions.内在无序蛋白质及其功能。

Nat Rev Mol Cell Biol. 2005 Mar;6(3):197-208. doi: 10.1038/nrm1589.

Statistical analysis of residue variability in cytochrome c.细胞色素c中残基变异性的统计分析

J Mol Biol. 1976 Apr 5;102(2):287-95. doi: 10.1016/s0022-2836(76)80054-x.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

PSICalc：一种识别和排列整个蛋白质结构中关键非近端相互依赖性的新方法。

PSICalc: a novel approach to identifying and ranking critical non-proximal interdependencies within the overall protein structure.

作者信息

机构信息

出版信息

MOTIVATION

RESULTS

AVAILABILITY AND IMPLEMENTATION

SUPPLEMENTARY INFORMATION

动机

结果

可用性和实现方式

补充信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献