Have a personal or library account? Click to login
Spark-Based Digital Factory Design Cover
Open Access
|Aug 2022

Abstract

Big data processing often uses the paradigm of parallelism by computing directly on top of the distributed data storage. The existing big data workflows unify the data processing practices to utilize the cloud’s native computational potentials to offer advanced machine learning and BI capabilities. Spark is an open-source massively parallel in-memory data processing framework, the current state-of-the-art. The primary approach is to break down the job into granular-level executed tasks, enabling parallelization. In the discussed case study, through IoT – cloud solutions, the plant data can be converted into an analyzable form to let the farther machine learning modules produce added value. To maximize the efficiency of the processing and accumulation, cloud-based components are introduced. Based on the data insights, the appropriate operative actions can be taken. The cost and performance optimization methods were also discussed in the study. Through achieving higher degree of digitalization, the control over the production increased.

DOI: https://doi.org/10.2478/aei-2022-0008 | Journal eISSN: 1338-3957 | Journal ISSN: 1335-8243
Language: English
Page range: 19 - 26
Submitted on: Apr 4, 2022
Accepted on: Jun 20, 2022
Published on: Aug 12, 2022
Published by: Technical University of Košice
In partnership with: Paradigm Publishing Services
Publication frequency: 4 issues per year

© 2022 István Pölöskei, published by Technical University of Košice
This work is licensed under the Creative Commons Attribution 4.0 License.