0


ANAPSID: 一种适用于SPARQL端点的自适应查询处理引擎

ANAPSID: An Adaptive Query Processing Engine for SPARQL Endpoints
课程网址: http://videolectures.net/iswc2011_acosta_vidal_anapsid/  
主讲教师: Maribel Acosta; Maria Esther Vidal
开课单位: 卡尔斯鲁厄理工学院
开课时间: 信息不详。欢迎您在右侧留言补充。
课程语种: 英语
中文简介:
按照链接数据的设计规则,支持远程查询处理的可用SPARQL端点的数量正在快速增长;但是,由于缺乏适应性,查询执行可能经常失败。首先,按照传统的“先优化后执行”模式确定的固定计划可能由于端点可用性而超时。第二,由于通常实现阻塞运算符,端点查询引擎无法以增量方式生成结果,并且如果数据源停止发送数据,可能会被阻塞。我们提供了一个用于SPARQL端点的自适应查询引擎anapsid,它使查询执行调度程序适应数据可用性和运行时条件。Anapsid提供物理SPARQL操作符,用于检测源何时被阻塞或数据流量是否激增,并且,操作员可能会在数据从源到达时快速生成结果。此外,anapsid操作符实现主内存替换策略,将以前计算的匹配项移动到辅助内存,避免重复。我们比较了与RDF存储和端点相关的Anapsid性能,并观察到,在某些情况下,Anapsid以超过一个数量级的速度加快了执行时间。
课程简介: Following the design rules of Linked Data, the number of available SPARQL endpoints that support remote query processing is quickly growing; however, because of the lack of adaptivity, query executions may frequently be unsuccessful. First, fixed plans identified following the traditional optimize-then execute paradigm, may timeout as a consequence of endpoint availability. Second, because blocking operators are usually implemented, endpoint query engines are not able to incrementally produce results, and may become blocked if data sources stop sending data. We present ANAPSID, an adaptive query engine for SPARQL endpoints that adapts query execution schedulers to data availability and run-time conditions. ANAPSID provides physical SPARQL operators that detect when a source becomes blocked or data traffic is bursty, and opportunistically, the operators produce results as quickly as data arrives from the sources. Additionally, ANAPSID operators implement main memory replacement policies to move previously computed matches to secondary memory avoiding duplicates. We compared ANAPSID performance with respect to RDF stores and endpoints, and observed that ANAPSID speeds up execution time, in some cases, in more than one order of magnitude.
关 键 词: 关联数据; 端点查询引擎; SPARQL端点
课程来源: 视频讲座网
最后编审: 2019-12-05:cwx
阅读次数: 30