Article in Journal ART-2011-14

BibliographyReimann, Peter; Schwarz, Holger; Mitschang, Bernhard: Design, Implementation, and Evaluation of a Tight Integration of Database and Workflow Engines.
In: Laender, Alberto H. F. (ed.); Moro, Mirella M. (ed.): Journal of Information and Data Management. Vol. 2(3).
University of Stuttgart, Faculty of Computer Science, Electrical Engineering, and Information Technology.
pp. 353-368, english.
SBC - Brazilian Computer Society, October 3, 2011.
ISSN: 2178-7107.
Article in Journal.
CR-SchemaD.2.11 (Software Engineering Software Architectures)
H.2.8 (Database Applications)
H.4.1 (Office Automation)
KeywordsData-Intensive Workflow; Improved Local Data Processing; Scientific Workflow; Simulation Workflow
Abstract

Accessing and processing huge amounts of heterogeneous and distributed data are some of the major challenges of data-intensive workflows. Traditionally, the descriptions of such workflows focus on their data flow. Nevertheless, control-flow-oriented workflow languages are increasingly adapted to the needs of data-intensive workflows. This provides a common level of abstraction for both data-intensive workflows and classical orchestration workflows, e.g., business workflows, which then enables a comprehensive optimization across all workflows. However, the problem still remains that workflows described in control-flow-oriented languages tend to be less efficient for data-intensive processes compared to specialized data-flow-oriented approaches. In this paper, we propose a new kind of optimization targeted at data-intensive workflows that are described in control-flow-oriented languages. We show how to improve efficiency of such workflows by introducing various techniques that partition the local data processing tasks to be performed during workflow execution in an improved way. These data processing tasks are either assigned to the workflow engine or to the tightly integrated local database engine. We evaluate the effectiveness of these techniques by means of various test scenarios.

ContactPeter Reimann Peter.Reimann@ipvs.uni-stuttgart.de
Department(s)University of Stuttgart, Institute of Parallel and Distributed Systems, Applications of Parallel and Distributed Systems
Project(s)SimTech - DP4SW
Entry dateAugust 30, 2011
   Publ. Institute   Publ. Computer Science