Article in Proceedings INPROC-2004-42

BibliographySchwarz, Thomas; Iofcea, Markus; Grossmann, Matthias; Hönle, Nicola; Nicklas, Daniela; Mitschang, Bernhard: On Efficiently Processing Nearest Neighbor Queries in a Loosely Coupled Set of Data Sources.
In: ACM (ed.): Proceedings of the 12th ACM International Symposium on Advances in Geographic Information System (ACM GIS 004), Washington D.C., November 12-13, 2004.
University of Stuttgart : Collaborative Research Center SFB 627 (Nexus: World Models for Mobile Context-Based Systems).
english.
?, November 12, 2004.
Article in Proceedings (Conference Paper).
CR-SchemaH.2.4 (Database Management Systems)
H.2.8 (Database Applications)
H.3.3 (Information Search and Retrieval)
KeywordsData integration, distributed query processing, federated database system, kNN, nearest neighbors, parallel query processing
Abstract

We propose a family of algorithms for processing nearest neighbor (NN) queries in an integration middleware that provides federated access to numerous loosely coupled, autonomous data sources connected through the internet. Previous approaches for parallel and distributed NN queries considered all data sources as relevant, or determined the relevant ones in a single step by exploiting additional knowledge on object counts per data source. We propose a different approach that does not require such detailed statistics about the distribution of the data. It iteratively enlarges and shrinks the set of relevant data sources. Our experiments show that this yields considerable performance benefits with regard to both response time and effort. Additionally, we propose to use only moderate parallelism instead of querying all relevant data sources at the same time. This allows us to trade a slightly increased response time for a lot less effort, hence maximizing the cost profit ratio, as we show in our experiments. Thus, the proposed algorithms clearly extend the set of NN algorithms known so far.

Full text and
other links
PDF (473180 Bytes)
Contactthomas.schwarz@informatik.uni-stuttgart.de
Department(s)University of Stuttgart, Institute of Parallel and Distributed Systems, Applications of Parallel and Distributed Systems
Project(s)SFB-627, B1 (University of Stuttgart, Institute of Parallel and Distributed Systems, Applications of Parallel and Distributed Systems)
Entry dateNovember 9, 2004
   Publ. Department   Publ. Institute   Publ. Computer Science