Artikel in Tagungsband INPROC-2021-11

Bibliograph.
Daten
Stach, Christoph; Bräcker, Julia; Eichler, Rebecca; Giebler, Corinna; Mitschang, Bernhard: Demand-Driven Data Provisioning in Data Lakes: BARENTS - A Tailorable Data Preparation Zone.
In: Indrawan-Santiago, Maria (Hrsg); Pardede, Eric (Hrsg); Salvadori, Ivan Luiz (Hrsg); Steinbauer, Matthias (Hrsg); Khalil, Ismail (Hrsg); Kotsis, Gabriele (Hrsg): Proceedings of the 23rd International Conference on Information Integration and Web-based Applications & Services (iiWAS2021); Linz, Austria, November 29-December 1, 2021.
Universität Stuttgart, Fakultät Informatik, Elektrotechnik und Informationstechnik.
S. 1-12, englisch.
New York, NY, United States: Association for Computing Machinery (ACM), November 2021.
ISBN: 978-1-4503-9556-4/21/11; DOI: 10.1145/3487664.3487784.
Artikel in Tagungsband (Konferenz-Beitrag).
CR-Klassif.H.2.7 (Database Administration)
E.2 (Data Storage Representations)
H.3.3 (Information Search and Retrieval)
H.2.8 (Database Applications)
Keywordsdata pre-processing; data transformation; knowledge modeling; ontology; data management; Data Lakes; zone model; food analysis
Kurzfassung

Data has never been as significant as it is today. It can be acquired virtually at will on any subject. Yet, this poses new challenges towards data management, especially in terms of storage (data is not consumed during processing, i.e., the data volume keeps growing), flexibility (new applications emerge), and operability (analysts are no IT experts). The goal has to be a demand-driven data provisioning, i.e., the right data must be available in the right form at the right time. Therefore, we introduce a tailorable data preparation zone for Data Lakes called BARENTS. It enables users to model in an ontology how to derive information from data and assign the information to use cases. The data is automatically processed based on this model and the refined data is made available to the appropriate use cases. Here, we focus on a resource-efficient data management strategy. BARENTS can be embedded seamlessly into established Big Data infrastructures, e.g., Data Lakes.

KontaktSenden Sie eine E-Mail an christoph.stach@ipvs.uni-stuttgart.de.
Abteilung(en)Universität Stuttgart, Institut für Parallele und Verteilte Systeme, Anwendersoftware
Eingabedatum17. Oktober 2021
   Publ. Abteilung   Publ. Institut   Publ. Informatik