Article in Proceedings INPROC-2005-06

BibliographyJakob, Mihály; Grossmann, Matthias; Nicklas, Daniela; Mitschang, Bernhard: DCbot: Finding Spatial Information on the Web.
In: Proceedings of the 10th International Conference on Database Systems for Advanced Applications (DASFAA 2005).
University of Stuttgart : Collaborative Research Center SFB 627 (Nexus: World Models for Mobile Context-Based Systems).
german.
Beijing: ??, April 2005.
Article in Proceedings (Conference Paper).
CR-SchemaH.2.8 (Database Applications)
H.3.3 (Information Search and Retrieval)
H.5.4 (Hypertext/Hypermedia)
Abstract

The WWW provides an overwhelming amount of information, which spatially indexed can be a valuable additional data source for location- based applications. By manually building a spatial index, only a fraction of the available resources can be covered. This paper introduces a system for the automatic mapping of web pages to geographical locations. Our web robot uses several sets of domain specific keywords, lexical context rules, that are automatically learned, and a hierarchical catalogue of geographical locations that provides exact geographical coordinates for locations. Spatially indexed web pages are used to construct Geographical Web Portals, which can be accessed by different location-based applications. In addition, we present experimental results demonstrating the quantity and the quality of automatically indexed web pages.

Full text and
other links
Nexus-Homepage
Department(s)University of Stuttgart, Institute of Parallel and Distributed Systems, Applications of Parallel and Distributed Systems
Project(s)SFB-627, B1 (University of Stuttgart, Institute of Parallel and Distributed Systems, Applications of Parallel and Distributed Systems)
Entry dateDecember 2, 2004
   Publ. Department   Publ. Institute   Publ. Computer Science