Article in Proceedings INPROC-2024-04

BibliographySchneider, Jan; Lutsch, Arnold; Gröger, Christoph; Schwarz, Holger; Mitschang, Bernhard: First Experiences on the Application of Lakehouses in Industrial Practice.
In: Störl, Uta (ed.): Proceedings of the 35th GI-Workshop on Foundations of Databases (Grundlagen von Datenbanken), Herdecke, Germany.
University of Stuttgart, Faculty of Computer Science, Electrical Engineering, and Information Technology.
CEUR Workshop Proceedings; 3710, pp. 3-8, english.
CEUR Workshop Proceedings, June 25, 2024.
ISBN: 1613-0073.
Article in Proceedings (Workshop Paper).
CorporationGesellschaft für Informatik
CR-SchemaH.3.4 (Information Storage and Retrieval Systems and Software)
H.4.2 (Information Systems Applications Types of Systems)
KeywordsData Lakehouse; Data Platform; Platform Architecture; Data Analytics; Case Study; Industry Experience
Abstract

In recent years, so-called lakehouses have emerged as a new type of data platform that intends to combine characteristics of data warehouses and data lakes. Although companies started to employ the associated concepts and technologies as part of their analytics architectures, little is known about their practical medium- and long-term experiences as well as proven architectural decisions. Additionally, there is only limited knowledge about how lakehouses can be utilized effectively in an industrial context. Hence, it remains unclear under which circumstances lakehouses represent a viable alternative to conventional data platforms. To address this gap, we conducted a case study on a real-world industrial case, in which manufacturing data needs to be managed and analytically exploited. Within the scope of this case, a dedicated analytics department has been testing and leveraging a lakehouse approach for several months in a productive environment with high data volumes and various types of analytical workloads. The paper at hand presents the results of our within-case analyses and focuses on the industrial setting of the case as well as the architecture of the utilized lakehouse. This way, it provides preliminary insights on the application of lakehouses in industrial practice and refers to useful architectural decisions.

Full text and
other links
PDF
Department(s)University of Stuttgart, Institute of Parallel and Distributed Systems, Applications of Parallel and Distributed Systems
Project(s)Architekturen &
Technologien für Datenplattformen
Entry dateAugust 30, 2024
   Publ. Department   Publ. Institute   Publ. Computer Science