Krishnan Sowmya Ramaswamy, Pandey Nishtha, Srinivasan Rajgopal, Roy Arijit
TCS Research (Life Sciences division), Tata Consultancy Services, Hyderabad, 500081, India.
Sci Data. 2025 Aug 26;12(1):1489. doi: 10.1038/s41597-025-05829-5.
Enzymes are essential biological catalysts that drive nearly all biochemical reactions. Understanding their efficiency and specificity involves studying enzyme kinetics, particularly the parameters k and K. However, there is limited data linking these kinetic parameters with the three-dimensional (3D) structures of enzyme-substrate complexes. Since enzyme function is determined by its structure, such mapping enhances insight into structural basis of enzymatic function and supports applications in enzyme design, synthetic biology and metabolic engineering. To address this critical gap, this work presents SKiD (Structure-oriented Kinetics Dataset), a comprehensive, structured dataset integrating k and K values with the corresponding 3D structural data. This is accomplished by integrating data from existing bioinformatics resources using automated programs to process the data and enhancing it with computational predictions. The erroneous data encountered during data integration is manually resolved. Metadata such as literature and assay conditions (e.g., pH and temperature) are preserved. The 3D coordinates of the modelled enzyme-substrate complexes are provided along with their UniProtKB identifier.
酶是驱动几乎所有生物化学反应的重要生物催化剂。了解它们的效率和特异性涉及研究酶动力学,特别是参数k和K。然而,将这些动力学参数与酶-底物复合物的三维(3D)结构联系起来的数据有限。由于酶的功能由其结构决定,这种映射增强了对酶功能结构基础的理解,并支持在酶设计、合成生物学和代谢工程中的应用。为了填补这一关键空白,这项工作提出了SKiD(面向结构的动力学数据集),这是一个将k和K值与相应的3D结构数据整合在一起的全面、结构化的数据集。这是通过使用自动化程序处理数据并通过计算预测对其进行增强,从而整合现有生物信息学资源的数据来实现的。数据整合过程中遇到的错误数据会手动解决。诸如文献和测定条件(如pH和温度)等元数据会被保留。提供了建模的酶-底物复合物的3D坐标及其UniProtKB标识符。