Bachelor Thesis BCLR-2021-67

BibliographyKotchourko, Serge: Optimizing human annotation of word usage graphs in a realistic simulation environment.
University of Stuttgart, Faculty of Computer Science, Electrical Engineering, and Information Technology, Bachelor Thesis No. 67 (2021).
72 pages, english.
Abstract

Word Usage Graphs (WUGs) are an approach of representing relations between word usage pairs, where each word usage is considered as a node and the weighted undirected edge between such a pair represents its semantic proximity. This shifts problems of Computational Linguistics into the graph problem space. There is only little research into how such WUGs can be annotated efficiently and effectively. Therefore, we build a simulation to test a broad range of sampling, clustering and stopping procedures with respect to their impact on finding good solutions. We show that it is possible to simulate graphs which share characteristics close to the observed WUGs. Based on this we are able to scrutinize various annotation procedures and are able to extract their advantages and disadvantages for the annotation process.

Full text and
other links
Volltext
Department(s)University of Stuttgart, Institute for Natural Language Processing
Superviser(s)Schulte im Walde, Prof. Sabine; Schlechtweg, Dominik
Entry dateJanuary 18, 2022
   Publ. Computer Science