Article in Proceedings INPROC-2024-06

Bibliography: Stach, Christoph; Li, Yunxuan; Schuiki, Laura; Mitschang, Bernhard: LALO—A Virtual Data Lake Zone for Composing Tailor-Made Data Products on Demand.
In: Strauss, Christine (ed.); Amagasa, Toshiyuki (ed.); Manco, Giuseppe (ed.); Kotsis, Gabriele (ed.); Tjoa, A Min (ed.); Khalil, Ismail (ed.): Proceedings of the 35th International Conference on Database and Expert Systems Applications (DEXA 2024).
University of Stuttgart, Faculty of Computer Science, Electrical Engineering, and Information Technology.
Lecture Notes in Computer Science; 14911, pp. 288-305, English.
Cham: Springer, August 2024.
ISBN: 978-3-031-68311-4; ISSN: 0302-9743; DOI: 10.1007/978-3-031-68312-1_22.
Article in Proceedings (Conference Paper).
CR-Schema: H.2.7 (Database Administration)
E.2 (Data Storage Representations)
H.3.3 (Information Search and Retrieval)
H.2.8 (Database Applications)
Keywords: Data Product; Virtual Data Lake Zone; Data Stream Adaptation
Abstract

The emerging paradigm of data products, which has become increasingly popular recently due to the rise of data meshes and data marketplaces, also poses unprecedented challenges for data management. Current data architectures, namely data warehouses and data lakes, are not able to meet these challenges adequately. In particular, these architectures are not designed for a just-in-time provision of highly customized data products tailored perfectly to the needs of customers. In this paper, we therefore present a virtual data lake zone for composing tailor-made data products on demand, called LALO. LALO uses data streaming technologies to enable just-in-time composing of data products without allocating storage space in the data architecture permanently. In order to enable customers to tailor data products to their needs, LALO uses a novel mechanism that enables live adaptation of data streams. Evaluation results show that the overhead for such an adaptation is negligible. Therefore, LALO represents an efficient solution for the appropriate handling of data products, both in terms of storage space and runtime.

Contact: Send an email to <christoph.stach@ipvs.uni-stuttgart.de>.
Department(s): University of Stuttgart, Institute of Parallel and Distributed Systems, Applications of Parallel and Distributed Systems
Project(s): SofDCar
Entry date: August 31, 2024