0


SpotLight:检测流图中的异常

SpotLight: Detecting Anomalies in Streaming Graphs
课程网址: http://videolectures.net/kdd2018_eswaran_detecting_anomalies/  
主讲教师: Dhivya Eswaran
开课单位: 卡内基梅隆大学
开课时间: 2018-11-23
课程语种: 英语
中文简介:
我们如何从电子邮件或交通日志中发现有趣的事件?我们如何从IP-IP通信数据中检测端口扫描或拒绝服务攻击?一般来说,给定一系列加权、有向或二分图,每个图都总结了一个时间窗口中的活动快照,我们如何使用亚线性存储器近实时地发现包含大型密集子图(例如,近二分图)突然出现或消失的异常图?为此,我们提出了一种称为SpotLight的基于随机草图的方法,该方法确保异常图形在草图空间中以高概率映射到远离“正常”实例的位置,以便选择适当的参数。在真实世界数据集上进行的大量实验表明,SpotLight(a)与现有方法相比,精度提高了至少8.4%,(b)速度快,可以在几分钟内处理数百万条边,(c)与边的数量和草图尺寸成线性比例,(d)在实践中产生了有趣的发现。
课程简介: How do we spot interesting events from e-mail or transportation logs? How can we detect port scan or denial of service attacks from IP-IP communication data? In general, given a sequence of weighted, directed or bipartite graphs, each summarizing a snapshot of activity in a time window, how can we spot anomalous graphs containing the sudden appearance or disappearance of large dense subgraphs (e.g., near bicliques) in near real-time using sublinear memory? To this end, we propose a randomized sketching-based approach called SpotLight, which guarantees that an anomalous graph is mapped ‘far’ away from ‘normal’ instances in the sketch space with high probability for appropriate choice of parameters. Extensive experiments on real-world datasets show that SpotLight (a) improves accuracy by at least 8.4% compared to prior approaches, (b) is fast and can process millions of edges within a few minutes, (c) scales linearly with the number of edges and sketching dimensions and (d) leads to interesting discoveries in practice.
关 键 词: SpotLight; 检测流图中的异常; 从电子邮件或交通日志中挖掘; 真实世界数据集
课程来源: 视频讲座网
数据采集: 2023-03-09:cyh
最后编审: 2023-05-15:cyh
阅读次数: 19