CASE STUDY
Fixing Historical Data and Realtime Streaming Aggregation
About The Client
Our client is one of the leading beverage manufacturers operating and fulfilling the demands of millions of customers across the USA. As their business grows, the client is targeted to automate most of their supply chain workload for their 40 manufacturing plants.
Industry | Manufacturing
Solutions | Azure, Databricks
Location | USA
Business Challenges
- Aggregating data manually to control data volume.
- Generating CSV files manually.
- Fixing historical data for missing dates.
- Generating aggregated reporting in real-time.
Business Solutions
- Spark API to get better performance with parallel processing capabilities.
- Pipelines to create or update new tables with pivoted values for ML use cases.
- Python script to consume data from Confluent running on all devices and aggregating data.
- Key-vault to assign read-only privileges to specific ML team users who want to access data.