Artikel in Tagungsband INPROC-2022-03

Bibliograph.
Daten
Spieß, Marco; Reimann, Peter; Weber, Christian; Mitschang, Bernhard: Analysis of Incremental Learning andWindowing to handle Combined Dataset Shifts on Binary Classification for Product Failure Prediction.
In: Proceedings of the 24th International Conference on Enterprise Information Systems (ICEIS 2022).
Universität Stuttgart, Fakultät Informatik, Elektrotechnik und Informationstechnik.
englisch.
SciTePress, April 2022.
Artikel in Tagungsband (Konferenz-Beitrag).
CR-Klassif.H.2.8 (Database Applications)
KeywordsBinary Classification; Dataset Shift; Incremental Learning; Product Failure Prediction; Windowing.
Kurzfassung

Dataset Shifts (DSS) are known to cause poor predictive performance in supervised machine learning tasks. We present a challenging binary classification task for a real-world use case of product failure prediction. The target is to predict whether a product, e. g., a truck may fail during the warranty period. However, building a satisfactory classifier is difficult, because the characteristics of underlying training data entail two kinds of DSS. First, the distribution of product configurations may change over time, leading to a covariate shift. Second, products gradually fail at different points in time, so that the labels in training data may change, which may a concept shift. Further, both DSS show a trade-off relationship, i. e., addressing one of them may imply negative impacts on the other one. We discuss the results of an experimental study to investigate how different approaches to addressing DSS perform when they are faced with both a covariate and a concept shift. Thereby, we prove that existing approaches, e. g., incremental learning and windowing, especially suffer from the trade-off between both DSS. Nevertheless, we come up with a solution for a data-driven classifier that yields better results than a baseline solution that does not address DSS.

Abteilung(en)Universität Stuttgart, Institut für Parallele und Verteilte Systeme, Anwendersoftware
Projekt(e)GSaME-NFG
Eingabedatum23. März 2022
   Publ. Abteilung   Publ. Institut   Publ. Informatik