Diploma Thesis DIP-3038

BibliographyBeck, Daniel: Design and Prototypical implementation of a Linguistic Search Engine.
University of Stuttgart, Faculty of Computer Science, Electrical Engineering, and Information Technology, Diploma Thesis No. 3038 (2010).
83 pages, english.
CR-SchemaE.4 (Data Coding and Information Theory)
H.3.1 (Content Analysis and Indexing)
H.3.3 (Information Search and Retrieval)
Abstract

Design and Prototypical Implementation of a Linguistic Search Engine

Goal of this thesis was to develop PigLing, a linguistic search engine. It both serves as a tool for learners of English as a second language, and provides linguists with functionality for analyzing linguistic phenomena. For these purposes, PigLing provides a powerful query language to formulate phrase queries. PigLing supports two different collections of text (corpora) and can be extended to support even more corpora.

Its prominent feature is support for half-context classes as part of search queries: Terms are assigned to these half-context classes based on their left and right half-contexts, i.e. the terms directly preceding and succeeding occurrences of a specific term. Users can specify half-context wildcards, requiring that terms assigned to specific half-context classes must appear at a certain position.

After submitting a query, users can begin to analyze the first results; further results are computed in the background and presented afterwards. The web interface also allows browsing half-context classes and terms' assignment to these classes to facilitate iterative query construction.

Full text and
other links
PDF (1303330 Bytes)
Access to students' publications restricted to the faculty due to current privacy regulations
Department(s)University of Stuttgart, Institute of Visualisation and Interactive Systems, Visualisation and Interactive Systems
Superviser(s)Schütze, Hinrich
Entry dateFebruary 3, 2011
   Publ. Computer Science