Design
Scaling a Sydney D2C Cosmetics Startup's Data Pipeline
Overview
What this challenge is about.
You are tasked with designing a cloud-based data pipeline for GlowUp. The pipeline must ingest real-time user events (page views, purchases, returns) from web and mobile apps, process them with Apache Spark or similar, and store results in a NoSQL database for fast querying. Constraints: use AWS or GCP free tier or low-cost services, ensure data consistency for inventory, and provide a cost estimate. Success means delivering a system architecture diagram, a cost analysis, and a proof-of-concept using sample data.
The Brief
What you'll do, and what you'll demonstrate.
Design a scalable, real-time data pipeline using cloud and big data technologies to replace the failing batch system.
Earning criteria — what you'll demonstrate
- Apply cloud computing concepts to design a scalable data pipeline
- Use Apache Spark or similar for distributed data processing
- Select appropriate NoSQL database for real-time analytics
- Evaluate cost-performance trade-offs in cloud architectures
Program Fit
Where this fits in your program.
Sharpens the same skills your degree expects you to demonstrate.
Skills
Skills you'll demonstrate.
Each one shows up on your verified credential.
Careers
Roles this prepares you for.
Real titles. Real skill bridges. Pick the one closest to your trajectory.
Career mappings coming soon.