Kamuhanda Dany, Cui Mengtian, Tessone Claudio J
UZH Blockchain Center, University of Zurich, 8050 Zurich, Switzerland.
Blockchain & Distributed Ledger Technologies Group, Department of Informatics, University of Zurich, 8050 Zurich, Switzerland.
Entropy (Basel). 2023 Jul 16;25(7):1069. doi: 10.3390/e25071069.
Community detection is widely used in social networks to uncover groups of related vertices (nodes). In cryptocurrency transaction networks, community detection can help identify users that are most related to known illegal users. However, there are challenges in applying community detection in cryptocurrency transaction networks: (1) the use of pseudonymous addresses that are not directly linked to personal information make it difficult to interpret the detected communities; (2) on Bitcoin, a user usually owns multiple Bitcoin addresses, and nodes in transaction networks do not always represent users. Existing works on cluster analysis on Bitcoin transaction networks focus on addressing the later using different heuristics to cluster addresses that are controlled by the same user. This research focuses on illegal community detection containing one or more illegal Bitcoin addresses. We first investigate the structure of Bitcoin transaction networks and suitable community detection methods, then collect a set of illegal addresses and use them to label the detected communities. The results show that 0.06% of communities from daily transaction networks contain one or more illegal addresses when 2,313,344 illegal addresses are used to label the communities. The results also show that distance-based clustering methods and other methods depending on them, such as network representation learning, are not suitable for Bitcoin transaction networks while community quality optimization and label-propagation-based methods are the most suitable.
社区检测在社交网络中被广泛用于发现相关顶点(节点)组。在加密货币交易网络中,社区检测有助于识别与已知非法用户关系最为密切的用户。然而,在加密货币交易网络中应用社区检测存在一些挑战:(1)使用与个人信息无直接关联的匿名地址使得难以解释检测到的社区;(2)在比特币网络中,一个用户通常拥有多个比特币地址,并且交易网络中的节点并不总是代表用户。现有关于比特币交易网络聚类分析的工作主要集中于通过不同启发式方法对同一用户控制的地址进行聚类来解决后一个问题。本研究聚焦于包含一个或多个非法比特币地址的非法社区检测。我们首先研究比特币交易网络的结构和合适的社区检测方法,然后收集一组非法地址并用它们来标记检测到的社区。结果表明,当使用2313344个非法地址标记社区时,日常交易网络中0.06%的社区包含一个或多个非法地址。结果还表明,基于距离的聚类方法以及依赖于它们的其他方法,如网络表示学习,不适用于比特币交易网络,而基于社区质量优化和标签传播的方法是最合适的。