The Researcher's Guide to the Data Deluge: Querying a Scientific Database in Just a Few Seconds

Publication Date: 
30/08/2011
Authors: 
Martin Kersten, Stratos Idreos, Stefan Manegold, Erietta Liarou

This paper was published in Proceedings of the 37th International Conference on Very Large Data Bases (VLDB2011), Seattle, WA, USA.

Award: VLDB 2011 Challenges & Visions track best paper award

Abstract: 

There is a clear need for interactive exploration of extremely large databases, especially in the area of scientific data management where ingestion of multiple Terabytes on a daily basis is foreseen. Unfor- tunately, current data management technology is not well-suited for such overwhelming demands.There is a clear need for interactive exploration of extremely large databases, especially in the area of scientific data management where ingestion of multiple Terabytes on a daily basis is foreseen. Unfor- tunately, current data management technology is not well-suited for such overwhelming demands.

In light of these challenges, we should rethink some of the strict requirements database systems adopted in the past. We envision that next generation database systems should interpret queries by their intent, rather than as a contract carved in stone for complete and correct answers. The result set should aid the user in un- derstanding the database's content and provide guidance to con- tinue the data exploration journey. A scientist can stepwise explore deeper and deeper into the database, and stop when the result con- tent and quality reaches his satisfaction point. At the same time, response times should be close to instant such that they allow a scientist to interact with the system and explore the data in a con- textualized way.

Several research directions are carved out to realize this vision. They range from engineering a novel database kernel where speed rather than completeness is the first class citizen, up to refusing to process a costly query in the first place, but providing advice on how to reformulate it instead, or even providing alternatives the system believes might be relevant for the exploration patterns ob- served.

AttachmentSize
CWI_VLDB2011.pdf129.38 KB

Partners

People

Alexander Marchuk
A.P. Ershov Institute of Informatics Systems
Alice Carpentier
Semantic Technology Institute, University of Innsbruck
Alina Dia Miron
Recognos Romania
Andreas Harth
AIFB Institute, Karlsruhe Institute of Technology
Anna Fensel
Semantic Technology Institute, University of Innsbruck
Barry Norton
AIFB Institute, Karlsruhe Institute of Technology
Benedikt Kämpgen
AIFB Institute, Karlsruhe Institute of Technology
Carlos Juiz
Universitat de les Illes Balears
Carolina Fortuna
Jozef Stefan Institute
Chris Bizer
Freie Universität Berlin
Daniel Fuleki
StrateGO Hungary - Creative Media Innovation Cluster
Daniele DellAglio
CEFRIEL
David Norheim
Computas
Dieter Fensel
Semantic Technology Institute, University of Innsbruck
Dumitru Roman
Stiftelsen SINTEF
Elena Simperl
AIFB Institute, Karlsruhe Institute of Technology
Francois Scharffe
University of Montpellier
Frank van Harmelen
Vrije Universiteit Amsterdam
Freddy Priyatna
Universidad Politécnica de Madrid
Giorgos Flouris
Foundation for Research and Technology Hellas
Graham Hench
Semantic Technology Institute International
Grigoris Antoniou
Foundation for Research and Technology Hellas
Ioana Ciuciu
Semantics Technology and Applications Research Laboratory
Irini Fundulaki
Foundation for Research and Technology Hellas
John Domingue
The Open University
Karl Aberer
Ecole Polytechnique Fédérale de Lausanne
Leonel Ruiz Miyares
Centre for Applied Linguistics
Lyndon Nixon
Semantic Technology Institute International
Marko Grobelnik
Jozef Stefan Institute
Marta Corubolo
CEFRIEL
Martin Kersten
Centrum Wiskunde & Informatica
Neil Chue Hong, EPPC
University of Edinburgh
Oscar Corcho
Universidad Politécnica de Madrid
Pablo Mendes
Freie Universität Berlin
Paolo Bouquet
Università degli Studi di Trento
Peter Mika
Yahoo Research Barcelona
Rajendra Akerkar
Western Norway Research Institute
Roberto García
Universitat de Lleida
Simeona Pellkvist
Semantic Technology Institute International
Simone Contessa
CEFRIEL
Snorri Gudmundsson
IceStat
Stefano Fumeo
CEFRIEL
Steffen Stadtmuller
AIFB Institute, Karlsruhe Institute of Technology
Thomas Bauereiss
Semantic Technology Institute, University of Innsbruck
Ying Zhang
Centrum Wiskunde & Informatica
York Sure
Leibniz Institute for the Social Sciences
Zoltan Miklos
Ecole Polytechnique Fédérale de Lausanne