Masterarbeit MSTR-2015-07

Estel, Marcel: Taxonomy Extension through Synonym Discovery in Integrated Data.
Universität Stuttgart, Fakultät Informatik, Elektrotechnik und Informationstechnik, Masterarbeit (2015).
101 Seiten, englisch.

Semantic resources are of great value for a lot advanced tasks in Natural Language Processing such as Information Extraction or text classification. However the maintenance of such resources presents great challenges. This thesis examines approaches to automatically extend an automotive domain taxonomy by learning synonyms and thus improving coverage. These domain-specific synonyms are learned directly from text in car repair reports. We evaluate two distributional semantic models as approach to model semantic similarity. Due to the difficulty of the task traditional methods have not yet examined how proper synonym candidates can be selected from text. Our proposed method produces satisfying results, but synonym discovery is still a difficult task and cannot be fully automated. The quality control of a human editor is still necessary.

Volltext und
andere Links
PDF (1312775 Bytes)
Zugriff auf studentische Arbeiten aufgrund vorherrschender Datenschutzbestimmungen nur innerhalb der Fakultät möglich
Abteilung(en)Universität Stuttgart, Institut für Parallele und Verteilte Systeme, Anwendersoftware
BetreuerMitschang, Prof. Bernhard; Ammann, Prof. Eckhard; Kassner, Laura
Eingabedatum30. Juli 2018
   Publ. Abteilung   Publ. Institut   Publ. Informatik