Franke R, Ivanova G
Institute of Computer Science, Humboldt-Universität zu Berlin, Unter den Linden 6, 10099 Berlin, Germany.
J Biomed Inform. 2014 Feb;47:62-70. doi: 10.1016/j.jbi.2013.09.005. Epub 2013 Sep 21.
A large number of topics in biology, medicine, neuroscience, psychology and sociology can be generally described via complex networks in order to investigate fundamental questions of structure, connectivity, information exchange and causality. Especially, research on biological networks like functional spatiotemporal brain activations and changes, caused by neuropsychiatric pathologies, is promising. Analyzing those so-called complex networks, the calculation of meaningful measures can be very long-winded depending on their size and structure. Even worse, in many labs only standard desktop computers are accessible to perform those calculations. Numerous investigations on complex networks regard huge but sparsely connected network structures, where most network nodes are connected to only a few others. Currently, there are several libraries available to tackle this kind of networks. A problem arises when not only a few big and sparse networks have to be analyzed, but hundreds or thousands of smaller and conceivably dense networks (e.g. in measuring brain activation over time). Then every minute per network is crucial. For these cases there several possibilities to use standard hardware more efficiently. It is not sufficient to apply just standard algorithms for dense graph characteristics. This article introduces the new library FALCON developed especially for the exploration of dense complex networks. Currently, it offers 12 different measures (like clustering coefficients), each for undirected-unweighted, undirected-weighted and directed-unweighted networks. It uses a multi-core approach in combination with comprehensive code and hardware optimizations. There is an alternative massively parallel GPU implementation for the most time-consuming measures, too. Finally, a comparing benchmark is integrated to support the choice of the most suitable library for a particular network issue.
生物学、医学、神经科学、心理学和社会学中的大量主题通常可以通过复杂网络来描述,以便研究结构、连通性、信息交换和因果关系等基本问题。特别是,对生物网络的研究很有前景,比如由神经精神病理学引起的功能性时空大脑激活和变化。分析这些所谓的复杂网络时,根据其规模和结构,计算有意义的度量可能会非常繁琐。更糟糕的是,在许多实验室中,只有标准的台式计算机可用于执行这些计算。众多关于复杂网络的研究都涉及巨大但连接稀疏的网络结构,其中大多数网络节点仅与少数其他节点相连。目前,有几个库可用于处理这类网络。当不仅要分析少数几个大的稀疏网络,而且要分析数百或数千个较小且可能密集的网络时(例如在测量大脑随时间的激活情况时),问题就出现了。在这些情况下,有几种方法可以更有效地使用标准硬件。仅仅应用针对密集图特征的标准算法是不够的。本文介绍了专门为探索密集复杂网络而开发的新库FALCON。目前,它提供12种不同的度量(如聚类系数),每种度量适用于无向无加权、无向加权和有向无加权网络。它采用多核方法,并结合了全面的代码和硬件优化。对于最耗时的度量,还有一种替代的大规模并行GPU实现。最后,集成了一个比较基准,以支持为特定网络问题选择最合适的库。