Spade: A Modular Framework for Analytical Exploration of RDF Graphs

Abstract : RDF data is complex; exploring it is hard, and can be done through many different metaphors. We have developed and propose to demonstrate Spade, a tool helping users discover meaningful content of an RDF graph by showing them the results of aggregation (OLAP-style) queries automatically identified from the data. Spade chooses aggregates that are visually interesting, a property formally based on statistic properties of the aggregation query results. While well understood for relational data, such exploration raises multiple challenges for RDF: facts, dimensions and measures have to be identified (as opposed to known beforehand); as there are more candidate aggregates, assessing their interestingness can be very costly; finally, ontologies bring novel specific challenges but also novel opportunities, enabling ontology-driven exploration from an aggregate initially proposed by the system. Spade is a generic, extensible framework, which we instantiated with: (i) novel methods for enumerating candidate measures and dimensions in the vast space of possibilities provided by an RDF graph; (ii) a set of aggregate inter-estingness functions; (iii) ontology-based interactive exploration , and (iv) efficient early-stop techniques for estimating the interestingness of an aggregate query. The demonstration will comprise interactive scenarios on a variety of large, interesting RDF graphs. PVLDB Reference Format: Y. Diao, P. Guzewicz, I. Manolescu, M. Mazuran. Spade: A Modular Framework for Multi-Dimensional RDF Exploration. PVLDB, 12(12): xxxx-yyyy, 2019.
Document type :
Conference papers
Complete list of metadatas
Contributor : Paweł Guzewicz <>
Submitted on : Tuesday, June 11, 2019 - 5:07:46 PM
Last modification on : Friday, June 14, 2019 - 1:48:47 AM


Files produced by the author(s)




Yanlei Diao, Paweł Guzewicz, Ioana Manolescu, Mirjana Mazuran. Spade: A Modular Framework for Analytical Exploration of RDF Graphs. 45th International Conference on Very Large Data Bases, Aug 2019, Los Angeles, United States. ⟨10.14778/xxxxxxx.xxxxxxx⟩. ⟨hal-02152844⟩



Record views


Files downloads