Article in Proceedings INPROC-2008-04

BibliographyJing, Lu; Bernhard, Mitschang: A Constraint-Aware Query Optimizer for Web-based Data Integration.
In: Proceedings of the Fourth International Conference on Web Information Systems and Technologies, May 4-7, 2008..
University of Stuttgart, Faculty of Computer Science, Electrical Engineering, and Information Technology.
pp. 1-6, english.
Funchal, Madeira, Portugal: Conference Proceedings, May 4, 2008.
Article in Proceedings (Conference Paper).
CR-SchemaH.3.5 (Online Information Services)
Abstract

Web has brought forth opportunities to connect information sources across all types of boundaries. The information sources include databases, XML documents, and other unstructured sources. Data integration is to combine data residing at different sources and providing the user with a unified view of these data. Currently users are expecting more efficient services from such data integration systems. Indeed, querying multiple data sources scattered on the web encounters many barriers for achieving efficiency due to the heterogeneity and autonomy of the information sources. This paper describes a query optimizer, which uses constraints to semantically optimize the queries. The optimizer first translates constraints from data sources into constraints expressed at the global level, e.g., in the common schema, and stores them in the constraint repository, again, at the global level. Then the optimizer can use semantic query optimization technologies including detection of empty results, join elimination, and predicate elimination to generate a more efficient but semantically equivalent query for the user. The optmizer is published as a web service and can be invoked by many data integration systems. We carry out experiments using our semantic query optimizer and first results show that performance can be greatly improved.

Department(s)University of Stuttgart, Institute of Parallel and Distributed Systems, Applications of Parallel and Distributed Systems
Project(s)DIS-CS
Entry dateDecember 19, 2007
   Publ. Department   Publ. Institute   Publ. Computer Science