Optimization for Active Learning-based Interactive Database Exploration

Abstract : There is an increasing gap between the fast growth of data and the limited human ability to comprehend data. Consequently, there has been a growing demand of data management tools that can bridge this gap and help the user retrieve high-value content from data more effectively. In this work, we aim to build interactive data exploration as a new database service, using an approach called "explore-by-example". In particular, we cast the explore-by-example problem in a principled "active learning" framework, and bring the properties of important classes of database queries to bear on the design of new algorithms and optimizations for active learning-based database exploration. These new techniques allow the database system to overcome fundamental limitations of traditional active learning, in particular, the slow convergence problem. Evaluation results using real-world datasets and user interest patterns show that our new system significantly outperforms state-of-the-art active learning techniques and data exploration systems in accuracy while achieving desired efficiency for interactive performance.
Type de document :
Rapport
[Technical Report] Ecole Polytechnique; University of Massachusetts Amherst. 2018
Liste complète des métadonnées

Littérature citée [39 références]  Voir  Masquer  Télécharger

https://hal.inria.fr/hal-01870560
Contributeur : Enhui Huang <>
Soumis le : vendredi 7 septembre 2018 - 19:20:57
Dernière modification le : mercredi 12 septembre 2018 - 17:16:05

Fichier

TR_ide.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

  • HAL Id : hal-01870560, version 1

Citation

Enhui Huang, Liping Peng, Luciano Di Palma, Ahmed Abdelkafi, Anna Liu, et al.. Optimization for Active Learning-based Interactive Database Exploration. [Technical Report] Ecole Polytechnique; University of Massachusetts Amherst. 2018. 〈hal-01870560〉

Partager

Métriques

Consultations de la notice

46

Téléchargements de fichiers

13