Department of Computer Science, University of Northern Iowa, Cedar Falls, IA, 50614, USA.
Sci Rep. 2023 Jan 18;13(1):959. doi: 10.1038/s41598-023-27995-5.
Past research in computational systems biology has focused more on the development and applications of advanced statistical and numerical optimization techniques and much less on understanding the geometry of the biological space. By representing biological entities as points in a low dimensional Euclidean space, state-of-the-art methods for drug-target interaction (DTI) prediction implicitly assume the flat geometry of the biological space. In contrast, recent theoretical studies suggest that biological systems exhibit tree-like topology with a high degree of clustering. As a consequence, embedding a biological system in a flat space leads to distortion of distances between biological objects. Here, we present a novel matrix factorization methodology for drug-target interaction prediction that uses hyperbolic space as the latent biological space. When benchmarked against classical, Euclidean methods, hyperbolic matrix factorization exhibits superior accuracy while lowering embedding dimension by an order of magnitude. We see this as additional evidence that the hyperbolic geometry underpins large biological networks.
过去的计算系统生物学研究更多地集中在开发和应用先进的统计和数值优化技术上,而对理解生物空间的几何结构关注较少。通过将生物实体表示为低维欧几里得空间中的点,药物 - 靶点相互作用(DTI)预测的最新方法隐含地假设生物空间的平坦几何形状。相比之下,最近的理论研究表明,生物系统具有高度聚类的树状拓扑结构。因此,将生物系统嵌入平坦空间会导致生物对象之间的距离失真。在这里,我们提出了一种新的药物 - 靶点相互作用预测矩阵分解方法,该方法将双曲空间用作潜在的生物空间。与经典的欧几里得方法相比,双曲矩阵分解在降低一个数量级的嵌入维度的同时表现出更高的准确性。我们认为这是双曲几何支撑大型生物网络的又一证据。