Overview
What this challenge is about.
You receive four inputs: the slow SQL query (around 600 lines), its Trino EXPLAIN ANALYZE output, the underlying Iceberg table schemas (including partition specs and sort orders), and 30 days of execution metrics. Start by profiling the query to identify the stages that cost the most; likely candidates are broadcast joins on large tables, missing partition pruning, and redundant CTEs. Then propose four to six optimizations spanning query rewrites, table-level changes (Z-ordering, compaction), and infrastructure tweaks (worker concurrency, dynamic filtering). Finally, implement the optimizations on a copy of the warehouse, measure runtime and cost before and after, and write a 6-page optimization-pattern memo that other analysts can apply to similar queries.
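As a rough illustration of the profiling step, the sketch below ranks stages by their share of total CPU time. It assumes the per-stage CPU figures have already been copied out of the EXPLAIN ANALYZE output by hand; the `StageMetric` type, the stage names, and every number are hypothetical and do not come from the real query.

```python
from dataclasses import dataclass


@dataclass
class StageMetric:
    """CPU time for one stage, transcribed from EXPLAIN ANALYZE (hypothetical shape)."""
    name: str
    cpu_seconds: float


def rank_stages(stages, top_n=3):
    """Return (name, share_of_total_cpu) for the top_n stages, most expensive first."""
    total = sum(s.cpu_seconds for s in stages)
    if total <= 0:
        return []
    ranked = sorted(stages, key=lambda s: s.cpu_seconds, reverse=True)
    return [(s.name, s.cpu_seconds / total) for s in ranked[:top_n]]


# Illustrative numbers only; replace with values from the actual plan.
stages = [
    StageMetric("scan_large_table", 900.0),
    StageMetric("broadcast_join", 600.0),
    StageMetric("cte_recompute", 300.0),
    StageMetric("final_aggregate", 200.0),
]

for name, share in rank_stages(stages):
    print(f"{name}: {share:.0%} of total CPU")
```

Ranking by share rather than by absolute time keeps attention on the few stages where an optimization can move the overall runtime, which is where the four to six proposals are best spent.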
The Brief
What you'll do, and what you'll demonstrate.
Cut a 42-minute, USD 18-per-run Trino query to under 5 minutes and under USD 4 per run, then document the optimization pattern for analyst reuse.
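To make the targets concrete: going from 42 minutes to under 5 is a speedup of more than 8.4x (a runtime cut of at least about 88%), and going from USD 18 to under USD 4 is a cost cut of at least about 78%. The small sketch below checks a measured run against those thresholds; the function names are illustrative and the measured values in the example are made up.

```python
BASELINE_RUNTIME_MIN = 42.0   # from the brief
BASELINE_COST_USD = 18.0      # from the brief
TARGET_RUNTIME_MIN = 5.0      # run must finish in under this
TARGET_COST_USD = 4.0         # run must cost less than this


def reduction(before, after):
    """Fractional reduction relative to the baseline, e.g. 0.5 means halved."""
    return (before - after) / before


def meets_targets(runtime_min, cost_usd):
    """True only if both the runtime and the cost target are met."""
    return runtime_min < TARGET_RUNTIME_MIN and cost_usd < TARGET_COST_USD


# Minimum improvement implied by the brief.
print(f"speedup needed: > {BASELINE_RUNTIME_MIN / TARGET_RUNTIME_MIN:.1f}x")
print(f"runtime cut needed: > {reduction(BASELINE_RUNTIME_MIN, TARGET_RUNTIME_MIN):.1%}")
print(f"cost cut needed: > {reduction(BASELINE_COST_USD, TARGET_COST_USD):.1%}")

# Hypothetical measured run after optimization.
print(meets_targets(runtime_min=4.5, cost_usd=3.6))
```

Both conditions have to hold at once: a run that finishes in 4 minutes but costs USD 5 still misses the brief.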
Earning criteria — what you'll demonstrate
- Read and act on Trino EXPLAIN ANALYZE output
- Apply table-format optimizations (Z-ordering, compaction, partitioning) on Iceberg
- Quantify cost per query honestly across the stack, counting table-maintenance and infrastructure costs as well as query compute
- Document optimization patterns analysts can apply themselves
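One way to read "honestly" in the cost criterion is that recurring table-maintenance work, such as compaction jobs, should be charged back to the queries that benefit from it. The sketch below shows that amortization under a deliberately simple assumed model (per-run compute plus a monthly maintenance cost spread evenly over the month's runs); the function name and all figures are hypothetical.

```python
def cost_per_run(compute_usd, maintenance_usd_per_month, runs_per_month):
    """Per-run cost including an even share of monthly maintenance spend.

    Assumed model, not a billing formula: maintenance is spread evenly
    across every run of the query in the month.
    """
    if runs_per_month <= 0:
        raise ValueError("runs_per_month must be positive")
    return compute_usd + maintenance_usd_per_month / runs_per_month


# Illustrative figures only.
optimized = cost_per_run(compute_usd=3.20,
                         maintenance_usd_per_month=30.0,
                         runs_per_month=60)
print(f"cost per run including maintenance: USD {optimized:.2f}")
```

With these made-up inputs the compute cost alone looks comfortably under the USD 4 target, while the amortized figure is noticeably closer to it; reporting only the first number would overstate the saving.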
Program Fit
Where this fits in your program.
Sharpens the same skills your degree expects you to demonstrate.
Skills
Skills you'll demonstrate.
Each one shows up on your verified credential.
Careers
Roles this prepares you for.
Real titles. Real skill bridges. Pick the one closest to your trajectory.
Career mappings coming soon.