For this project, my primary responsibility was to set up the CI/CD pipeline and automate database deployment. My part aimed to automate the deployment of MySQL and MongoDB, run the ETL process ...
Today, at its annual Data + AI Summit, Databricks announced that it is open-sourcing its core declarative ETL framework as Apache Spark Declarative Pipelines, making it available to the entire Apache ...
When it comes to handling data, it’s not just about collecting it – it’s about getting it into the right place, in the right format, and ready to use. That’s where ETL tools come in. These tools help ...
Abstract: The research focuses on the process of creating a data warehouse to meet the decision-making needs of a Greek beverage company. The data cover the period from 2018 to 2022. The developed ...
Abstract: The extract, transform, load (ETL) process is an important element of database (DB) and data warehouse design. The article proposes the use of a multi-agent approach to build ...
This project implements a complete data architecture pipeline for a retail business, covering ETL, NoSQL, and data warehousing. It demonstrates robust data cleaning, schema design, and analytics using ...