Towards a cloud native Big Data platform using MiCADO

Abdelkhalik Mosa, Tamas Kiss, Gabriele Pierantoni, James DesLauriers, Dimitrios Kagialis, Gabor Terstyanszky

Research output: Chapter in Book/Report/Conference proceedingConference contribution book

2 Citations (Scopus)
17 Downloads (Pure)

Abstract

In the big data era, creating self-managing scalable platforms for running big data applications is a fundamental task. Such self-managing and self-healing platforms involve a proper reaction to hardware (e.g., cluster nodes) and software (e.g., big data tools) failures, besides a dynamic resizing of the allocated resources based on overload and underload situations and scaling policies. The distributed and stateful nature of big data platforms (e.g., Hadoop-based cluster) makes the management of these platforms a challenging task. This paper aims to design and implement a scalable cloud native Hadoopbased big data platform using MiCADO, an open-source, and a highly customisable multi-cloud orchestration and auto-scaling framework for Docker containers, orchestrated by Kubernetes. The proposed MiCADO-based big data platform automates the deployment and enables an automatic horizontal scaling (in and out) of the underlying cloud infrastructure. The empirical evaluation of the MiCADO-based big data platform demonstrates how easy, efficient, and fast it is to deploy and undeploy Hadoop clusters of different sizes. Additionally, it shows how the platform can automatically be scaled based on user-defined policies (such as CPU-based scaling).
Original languageEnglish
Title of host publication2020 19th International Symposium on Parallel and Distributed Computing (ISPDC)
Place of PublicationPiscataway, NJ
PublisherIEEE
Pages118-125
Number of pages8
ISBN (Electronic)9781728189468
ISBN (Print)9781728189475
DOIs
Publication statusPublished - 22 Sept 2020
Externally publishedYes
Event2020 19th International Symposium on Parallel and Distributed Computing (ISPDC) - Warsaw, Poland
Duration: 5 Jul 20208 Jul 2020

Conference

Conference2020 19th International Symposium on Parallel and Distributed Computing (ISPDC)
Period5/07/208/07/20

Keywords

  • Big Data
  • Cloud computing
  • Containers
  • Software
  • Tools
  • Monitoring
  • Task analysis

Fingerprint

Dive into the research topics of 'Towards a cloud native Big Data platform using MiCADO'. Together they form a unique fingerprint.

Cite this