Spark
If you like applying Spark, every challenge here gives you a chance to practice it on a real industry brief.
- AnalysisAdvancedNew
Cost-Profile a Spark Job at Scale and Cut the Bill in Half
Receive the PySpark job (around 1,800 lines), 5 nights of Spark UI + EMR metrics, and the EMR cluster config. Profile to find the top 3 cost drivers (likely candidates: skewed j…
- Spark
- Cost Optimization
- Etl Pipelines
Big Data and Data-Intensive Systems - DesignAdvancedNew
Migrate a 200TB Data Lake from Parquet to Iceberg
Receive an inventory of the 200TB hot tier (around 1,200 tables, around 38 PB of historical data referenced), the current Spark + Trino read patterns, and 6 months of schema-cha…
- Iceberg
- Parquet
- Data Lake
Big Data and Data-Intensive Systems - AnalysisAdvancedNew
Frequent-Itemset Mining on a Grocery Retailer's Basket History
Load 18 months of basket-level transaction data (Parquet, around 92 GB) into a Spark cluster. Run FP-growth at support thresholds tuned per category (food vs household vs fresh)…
- Frequent Itemset Mining
- Fp Growth
- Spark
Data Mining and Information Retrieval
How it works
From brief to credential, in six steps.
Step 01
Browse challenges aligned to your studies.
Step 02
Accept the one that fits your goals.
Step 03
Work through it with AI Copilot guidance.
Step 04
Submit for structured evaluation.
Step 05
Earn a verified credential.
Step 06
Add it to LinkedIn with one click.
Industry teams behind a decade of practitioner briefs
Hiring from this pool?
Sponsor a challenge and meet candidates through actual work.
Industry teams can shape briefs around the skills they hire for, then evaluate students on rubric-scored deliverables — not resumes.



















































































