Motivation
Exploratory search includes methods to efficiently extract knowledge from data repositories, even if we do not know what exactly we are looking for, nor how to precisely describe our needs. The need for new and effective exploratory search methods is particularly relevant given the current abundance and richness of today’s large datasets (e.g., Linked Open Datasets). In common exploratory settings, the user progressively acquires the knowledge by issuing a sequence of generic queries to gather intelligence about the data.
The existing body of work in data analysis, assumes the user is willing to pose several well defined or structured queries to the underlying database in order to progressively gather the required information. This assumption stems from the intuition that the user is accustomed to data analysis techniques. Yet, this assumption is not always true.
Objectives
We survey the main approaches for exploratory queries, highlighting the main differences among data models, and presenting indepth insights of the current status of research in this area. The final goal is to provide a comprehensive overview of novel data-management techniques that can empower advanced exploratory search systems.
In particular, we will highlight the existing example-based methods that have been already studied to improve knowledge graph search, SPARQL query formulation, and data exploration of RDF data. Moreover, we aim to present techniques that have been studied in other research areas and that could be suc- cessfully applied in the Semantic Web domain.
Topic Summary
The first and second part of the tutorial introduce the broad topic of data exploration, highlighting the hardness of query languages for simple users and advocating the need of different query methods.
The third part of the tutorial discusses the current main techniques for textual, and graph data, with an excursus on relational data as well in order to provide a complete picture on the power of the approach.
The fourth part of the tutorial focuses on the latest developments of machine learning to progressively discover user intention.
Outline
- Introduction, motivation, and formulation
- The origin: Example-based approaches for structured data (relational)
- Example-based approaches for semi-structured and unstructured data (graphs and text)
- Learning methods based on examples
- Challenges and Discussion