This project implements a Region-Based Partitioning Hadoop MapReduce pipeline. It reads store performance data from a CSV file stored in HDFS and calculates per-region statistics: ...
This project implements a Region-Based Partitioning Hadoop MapReduce pipeline. It reads store performance data from a CSV file stored in HDFS and calculates per-region statistics: ...