Department of Biostatistics and Bioinformatics, Duke University, Durham, North Carolina, USA.
Department of Computer Science, Tufts University, Medford, Massachusetts, USA.
J Comput Biol. 2024 Oct;31(10):990-1007. doi: 10.1089/cmb.2024.0673. Epub 2024 Sep 24.
The IsoRank algorithm of Singh, Xu, and Berger was a pioneering algorithmic advance that applied spectral methods to the problem of cross-species global alignment of biological networks. We develop a new IsoRank approximation that exploits the mathematical properties of IsoRank's linear system to solve the problem in quadratic time with respect to the maximum size of the two protein-protein interaction (PPI) networks. We further propose a refinement to this initial approximation so that the updated result is even closer to the original IsoRank formulation while remaining computationally inexpensive. In experiments on synthetic and real PPI networks with various proposed metrics to measure alignment quality, we find the results of our approximate IsoRank are nearly as accurate as the original IsoRank. In fact, for functional enrichment-based measures of global network alignment quality, our approximation performs better than the exact IsoRank, which is doubtless because it is more robust to the noise of missing or incorrect edges. It also performs competitively against two more recent global network alignment algorithms. We also present an analogous approximation to IsoRankN, which extends the network alignment to more than two species.
辛格、徐和伯杰的 IsoRank 算法是一项开创性的算法进步,它将谱方法应用于跨物种生物网络全局比对的问题。我们开发了一种新的 IsoRank 逼近方法,利用 IsoRank 线性系统的数学性质,以二次时间复杂度解决了这个问题,该复杂度与两个蛋白质-蛋白质相互作用(PPI)网络的最大规模有关。我们进一步对这个初始逼近进行了改进,以便更新的结果更接近原始的 IsoRank 公式,同时保持计算上的低成本。在使用各种建议的度量标准对合成和真实 PPI 网络进行实验时,我们发现我们的近似 IsoRank 的结果几乎与原始 IsoRank 一样准确。事实上,对于基于功能富集的全局网络比对质量度量标准,我们的逼近方法比精确的 IsoRank 表现更好,这无疑是因为它对缺失或错误边的噪声更具鲁棒性。它在与另外两个最近的全局网络比对算法的竞争中也表现出色。我们还提出了 IsoRankN 的类似逼近方法,它将网络比对扩展到两个以上的物种。