To main content

A Survey of Big Data Pipeline Orchestration Tools from the Perspective of the DataCloud Project

Abstract

This paper presents a survey of existing tools for Big Data pipeline orchestration based on a comparative framework developed in the DataCloud project. We propose criteria for evaluating the tools to support reusability, flexible pipeline communication modes, and separa- tion of concerns in Big Data pipeline descriptions. This survey aims to identify research and technological gaps and to recommend approaches for filling them. Further work in the DataCloud project is oriented to- wards the design, implementation, and practical evaluation of the rec- ommended approaches.

Category

Academic literature review

Language

English

Author(s)

Affiliation

  • SINTEF Digital / Sustainable Communication Technologies
  • Royal Institute of Technology

Date

15.12.2021

Year

2021

Published in

CEUR Workshop Proceedings

Page(s)

63 - 78

View this publication at Norwegian Research Information Repository