Journal article ART-2011-14

Reimann, Peter; Schwarz, Holger; Mitschang, Bernhard: Design, Implementation, and Evaluation of a Tight Integration of Database and Workflow Engines.
In: Laender, Alberto H. F. (ed.); Moro, Mirella M. (ed.): Journal of Information and Data Management. Vol. 2(3).
Universität Stuttgart, Fakultät Informatik, Elektrotechnik und Informationstechnik.
pp. 353-368, English.
SBC - Brazilian Computer Society, October 3, 2011.
ISSN: 2178-7107.
CR classification: D.2.11 (Software Engineering Software Architectures)
H.2.8 (Database Applications)
H.4.1 (Office Automation)
Keywords: Data-Intensive Workflow; Improved Local Data Processing; Scientific Workflow; Simulation Workflow

Accessing and processing huge amounts of heterogeneous and distributed data are some of the major challenges of data-intensive workflows. Traditionally, the descriptions of such workflows focus on their data flow. Nevertheless, control-flow-oriented workflow languages are increasingly adapted to the needs of data-intensive workflows. This provides a common level of abstraction for both data-intensive workflows and classical orchestration workflows, e.g., business workflows, which then enables a comprehensive optimization across all workflows. However, the problem still remains that workflows described in control-flow-oriented languages tend to be less efficient for data-intensive processes compared to specialized data-flow-oriented approaches. In this paper, we propose a new kind of optimization targeted at data-intensive workflows that are described in control-flow-oriented languages. We show how to improve efficiency of such workflows by introducing various techniques that partition the local data processing tasks to be performed during workflow execution in an improved way. These data processing tasks are either assigned to the workflow engine or to the tightly integrated local database engine. We evaluate the effectiveness of these techniques by means of various test scenarios.

Contact: Peter Reimann
Department(s): Universität Stuttgart, Institut für Parallele und Verteilte Systeme, Anwendersoftware
Project(s): SimTech - DP4SW
Entry date: August 30, 2011