COCαDA - a fast and scalable algorithm for interatomic contact detection in proteins using Cα distance matrices.

Author information

Lemos Rafael Pereira, Mariano Diego, Silveira Sabrina De Azevedo, de Melo-Minardi Raquel C

Affiliations

Laboratory of Bioinformatics and Systems, Department of Computer Science, Federal University of Minas Gerais, Belo Horizonte, Brazil.

Laboratory of Bioinformatics, Visualization and Systems, Department of Informatics, Federal University of Viçosa, Viçosa, Brazil.

Publication information

Front Bioinform. 2025 Sep 1;5:1630078. doi: 10.3389/fbinf.2025.1630078. eCollection 2025.

DOI:10.3389/fbinf.2025.1630078

PMID:40959146

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC12433948/

Abstract

Protein interatomic contacts, defined by spatial proximity and physicochemical complementarity at atomic resolution, are fundamental to characterizing molecular interactions and bonding. Methods for calculating contacts are generally categorized as cutoff-dependent, which rely on Euclidean distances, or cutoff-independent, which use Delaunay and Voronoi tessellations. While cutoff-dependent methods are recognized for their simplicity, completeness, and reliability, traditional implementations remain computationally expensive, posing significant scalability challenges in the current Big Data era of bioinformatics. Here, we introduce COCαDA (COntact search pruning by Cα Distance Analysis), a Python-based command-line tool that improves search pruning in large-scale interatomic protein contact analysis using alpha-carbon (Cα) distance matrices. COCαDA detects intra- and inter-chain contacts and classifies them into seven types: hydrogen bonds; disulfide bonds; hydrophobic effects; attractive, repulsive, and salt-bridge interactions; and aromatic stackings. To evaluate our tool, we compared it with three traditional approaches from the literature: all-against-all atom distance calculation ("brute-force"), static Cα distance cutoff (SC), and Biopython's NeighborSearch class (NS). COCαDA outperformed the other methods, running on average 6x faster than advanced data structures such as the k-d trees used by NS, while also being simpler to implement and fully customizable. The tool supports exploratory and large-scale analyses of interatomic contacts in proteins in a simple and efficient manner, and its results can be integrated with other tools and pipelines. COCαDA is freely available at https://github.com/LBS-UFMG/COCaDA.
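The general idea of pruning a contact search with a Cα distance matrix can be illustrated with a minimal sketch. This is not the COCαDA implementation: the abstract does not describe how the tool chooses its cutoffs, so the sketch uses a single static Cα cutoff (closer in spirit to the SC baseline) on synthetic coordinates. All function names, the 4.5 Å atom cutoff, and the 2.0 Å maximum atom offset are assumptions made for illustration. If every atom lies within `max_offset` of its residue's Cα, the triangle inequality guarantees that a Cα cutoff of `cutoff + 2 * max_offset` misses no inter-residue contact.

```python
import numpy as np


def toy_structure(n_res=30, atoms_per_res=4, max_offset=2.0, seed=0):
    """Synthetic 'protein': atom 0 of each residue is its CA; the other
    atoms lie within max_offset angstroms of that CA (hypothetical data)."""
    rng = np.random.default_rng(seed)
    ca = rng.uniform(0.0, 30.0, size=(n_res, 1, 3))
    direction = rng.normal(size=(n_res, atoms_per_res, 3))
    direction /= np.linalg.norm(direction, axis=-1, keepdims=True)
    radius = rng.uniform(0.0, max_offset, size=(n_res, atoms_per_res, 1))
    offsets = direction * radius
    offsets[:, 0, :] = 0.0  # the CA sits exactly at the residue centre
    coords = (ca + offsets).reshape(-1, 3)
    residue_of_atom = np.repeat(np.arange(n_res), atoms_per_res)
    ca_index = np.arange(n_res) * atoms_per_res
    return coords, residue_of_atom, ca_index


def pairwise_distances(a, b):
    """Euclidean distance between every row of a and every row of b."""
    return np.linalg.norm(a[:, None, :] - b[None, :, :], axis=-1)


def brute_force_contacts(coords, residue_of_atom, cutoff):
    """All-against-all search; keeps atom pairs from different residues."""
    dist = pairwise_distances(coords, coords)
    different = residue_of_atom[:, None] != residue_of_atom[None, :]
    i, j = np.where(np.triu((dist <= cutoff) & different, k=1))
    return set(zip(i.tolist(), j.tolist()))


def ca_pruned_contacts(coords, residue_of_atom, ca_index, cutoff, ca_cutoff):
    """Compare atoms only for residue pairs whose CA atoms are close.

    Returns the contact set and the number of atom pairs actually checked.
    """
    ca_dist = pairwise_distances(coords[ca_index], coords[ca_index])
    atoms_by_res = [np.flatnonzero(residue_of_atom == r)
                    for r in range(len(ca_index))]
    contacts, checked = set(), 0
    res_a, res_b = np.where(np.triu(ca_dist <= ca_cutoff, k=1))
    for a, b in zip(res_a.tolist(), res_b.tolist()):
        ia, ib = atoms_by_res[a], atoms_by_res[b]
        dist = pairwise_distances(coords[ia], coords[ib])
        checked += dist.size
        for x, y in zip(*np.where(dist <= cutoff)):
            p, q = int(ia[x]), int(ib[y])
            contacts.add((min(p, q), max(p, q)))
    return contacts, checked


if __name__ == "__main__":
    coords, residue_of_atom, ca_index = toy_structure()
    cutoff, max_offset = 4.5, 2.0  # assumed values, in angstroms
    pruned, checked = ca_pruned_contacts(
        coords, residue_of_atom, ca_index, cutoff, cutoff + 2 * max_offset)
    exact = brute_force_contacts(coords, residue_of_atom, cutoff)
    total = len(coords) * (len(coords) - 1) // 2
    print(f"same contacts: {pruned == exact}; "
          f"atom pairs checked: {checked} of {total}")
```

The pruned search builds only a residue-by-residue matrix up front and skips every atom-level comparison between residues whose Cα atoms are far apart, which is where the savings over the all-against-all approach come from in this sketch.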


https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e91b/12433948/ff95854a94b4/fbinf-05-1630078-g001.jpg
