Artikel in Zeitschrift ART-2023-04

Bibliograph.
Daten
Villanueva Zacarias, Alejandro Gabriel; Reimann, Peter; Weber, Christian; Mitschang, Bernhard: AssistML: An Approach to Manage, Recommend and Reuse ML Solutions.
In: International Journal of Data Science and Analytics (JDSA).
Universität Stuttgart, Fakultät Informatik, Elektrotechnik und Informationstechnik.
englisch.
Springer Nature, 17. Juli 2023.
Artikel in Zeitschrift.
CR-Klassif.H.2.8 (Database Applications)
KeywordsMeta-learning; Machine learning; AutoML; Metadata; Recommender systems
Kurzfassung

The adoption of machine learning (ML) in organizations is characterized by the use of multiple ML software components. When building ML systems out of these software components, citizen data scientists face practical requirements which go beyond the known challenges of ML, e.g., data engineering or parameter optimization. They are expected to quickly identify ML system options that strike a suitable trade-off across multiple performance criteria. These options also need to be understandable for non-technical users. Addressing these practical requirements represents a problem for citizen data scientists with limited ML experience. This calls for a concept to help them identify suitable ML software combinations. Related work, e.g., AutoML systems, are not responsive enough or cannot balance different performance criteria. This paper explains how AssistML, a novel concept to recommend ML solutions, i.e., software systems with ML models, can be used as an alternative for predictive use cases. Our concept collects and preprocesses metadata of existing ML solutions to quickly identify the ML solutions that can be reused in a new use case. We implement AssistML and evaluate it with two exemplary use cases. Results show that AssistML can recommend ML solutions in line with users’ performance preferences in seconds. Compared to AutoML, AssistML offers citizen data scientists simpler, intuitively explained ML solutions in considerably less time. Moreover, these solutions perform similarly or even better than AutoML models.

Abteilung(en)Universität Stuttgart, Institut für Parallele und Verteilte Systeme, Anwendersoftware
Projekt(e)GSaME-NFG
Eingabedatum26. Juli 2023
   Publ. Abteilung   Publ. Institut   Publ. Informatik