Article in Proceedings INPROC-2018-14

BibliographyGiebler, Corinna; Stach, Christoph; Schwarz, Holger; Mitschang, Bernhard: BRAID - A Hybrid Processing Architecture for Big Data.
In: Proceedings of the 7th International Conference on Data Science, Technology and Applications (DATA 2018).
University of Stuttgart, Faculty of Computer Science, Electrical Engineering, and Information Technology.
pp. 1-8, english.
INSTICC Press, July 2018.
Article in Proceedings (Conference Paper).
CR-SchemaD.2.11 (Software Engineering Software Architectures)
H.2.4 (Database Management Systems)
H.2.8 (Database Applications)
KeywordsBig Data; IoT; Batch Processing; Stream Processing; Lambda Architecture; Kappa Architecture
Abstract

The Internet of Things is applied in many domains and collects vast amounts of data. This data provides access to a lot of knowledge when analyzed comprehensively. However, advanced analysis techniques such as predictive or prescriptive analytics require access to both, history data, i.e., long-term persisted data, and real-time data as well as a joint view on both types of data. State-of-the-art hybrid processing architectures for big data - namely, the Lambda and the Kappa Architecture - support the processing of history data and real-time data. However, they lack of a tight coupling of the two processing modes. That is, the user has to do a lot of work manually in order to enable a comprehensive analysis of the data. For instance, the user has to combine the results of both processing modes or apply knowledge from one processing mode to the other. Therefore, we introduce a novel hybrid processing architecture for big data, called BRAID. BRAID intertwines the processing of history data and real-time data by adding communication channels between the batch engine and the stream engine. This enables to carry out comprehensive analyses automatically at a reasonable overhead.

ContactSenden Sie eine e-Mail an Corinna.Giebler@ipvs.uni-stuttgart.de
Department(s)University of Stuttgart, Institute of Parallel and Distributed Systems, Applications of Parallel and Distributed Systems
Project(s)PATRON
Entry dateMay 24, 2018
   Publ. Department   Publ. Institute   Publ. Computer Science