Authors
Ziawasch Abedjan, Toni Grütze, Anja Jentzsch, Felix Naumann
Publication date
2014/3/31
Conference
2014 IEEE 30th International Conference on Data Engineering
Pages
1198-1201
Publisher
IEEE
Description
Before reaping the benefits of open data to add value to an organizations internal data, such new, external datasets must be analyzed and understood already at the basic level of data types, constraints, value patterns etc. Such data profiling, already difficult for large relational data sources, is even more challenging for RDF datasets, the preferred data model for linked open data. We present ProLod++, a novel tool for various profiling and mining tasks to understand and ultimately improve open RDF data. ProLod++ comprises various traditional data profiling tasks, adapted to the RDF data model. In addition, it features many specific profiling results for open data, such as schema discovery for user-generated attributes, association rule discovery to uncover synonymous predicates, and uniqueness discovery along ontology hierarchies. ProLod++ is highly efficient, allowing interactive profiling for users interested in …
Total citations
20132014201520162017201820192020202120222023202414816911688654
Scholar articles
Z Abedjan, T Grütze, A Jentzsch, F Naumann - 2014 IEEE 30th International Conference on Data …, 2014