Lee Wesley, McCormick Tyler H, Neil Joshua, Sodja Cole, Cui Yanran
Department of Statistics, University of Washington, Seattle, DC.
Department of Statistics and Department of Sociology, University of Washington, Seattle, DC.
Technometrics. 2022;64(2):241-252. doi: 10.1080/00401706.2021.1952900. Epub 2021 Dec 16.
We develop a real-time anomaly detection method for directed activity on large, sparse networks. We model the propensity for future activity using a dynamic logistic model with interaction terms for sender- and receiver-specific latent factors in addition to sender- and receiver-specific popularity scores; deviations from this underlying model constitute potential anomalies. Latent nodal attributes are estimated via a variational Bayesian approach and may change over time, representing natural shifts in network activity. Estimation is augmented with a case-control approximation to take advantage of the sparsity of the network and reduces computational complexity from ( ) to (), where is the number of nodes and is the number of observed edges. We run our algorithm on network event records collected from an enterprise network of over 25,000 computers and are able to identify a red team attack with half the detection rate required of the model without latent interaction terms.
我们开发了一种用于大型稀疏网络上定向活动的实时异常检测方法。我们使用动态逻辑模型对未来活动的倾向进行建模,该模型除了发送者和接收者特定的流行度得分外,还包含发送者和接收者特定潜在因素的交互项;与这个基础模型的偏差构成潜在异常。潜在节点属性通过变分贝叶斯方法进行估计,并且可能随时间变化,代表网络活动的自然变化。估计通过病例对照近似进行增强,以利用网络的稀疏性,并将计算复杂度从()降低到(),其中是节点数量,是观察到的边的数量。我们在从一个拥有超过25000台计算机的企业网络收集的网络事件记录上运行我们的算法,并且能够以比没有潜在交互项的模型所需检测率低一半的速度识别出一次红队攻击。