SODA: Generating SQL for Business Users release_n6rvykhg4nfxpoqcs73wjcqrne

by Lukas Blunschi, Claudio Jossen, Donald Kossman, Magdalini Mori, Kurt Stockinger

Released as a article .

2012  

Abstract

The purpose of data warehouses is to enable business analysts to make better decisions. Over the years the technology has matured and data warehouses have become extremely successful. As a consequence, more and more data has been added to the data warehouses and their schemas have become increasingly complex. These systems still work great in order to generate pre-canned reports. However, with their current complexity, they tend to be a poor match for non tech-savvy business analysts who need answers to ad-hoc queries that were not anticipated. This paper describes the design, implementation, and experience of the SODA system (Search over DAta Warehouse). SODA bridges the gap between the business needs of analysts and the technical complexity of current data warehouses. SODA enables a Google-like search experience for data warehouses by taking keyword queries of business users and automatically generating executable SQL. The key idea is to use a graph pattern matching algorithm that uses the metadata model of the data warehouse. Our results with real data from a global player in the financial services industry show that SODA produces queries with high precision and recall, and makes it much easier for business users to interactively explore highly-complex data warehouses.
In text/plain format

Archived Files and Locations

application/pdf  443.4 kB
file_r3qkcrxsfzhrhntx2gkfs4jygm
arxiv.org (repository)
web.archive.org (webarchive)
application/pdf  443.4 kB
file_6exwqanz4rdfhgvcdj3ogvkoqe
archive.org (archive)
Read Archived PDF
Preserved and Accessible
Type  article
Stage   submitted
Date   2012-06-30
Version   v1
Language   en ?
arXiv  1207.0134v1
Work Entity
access all versions, variants, and formats of this works (eg, pre-prints)
Catalog Record
Revision: 0d51ca26-c038-42c1-8177-d3ebf5bea427
API URL: JSON