Reliability Engineering
If you like applying Reliability Engineering, every challenge here gives you a chance to practice it on a real industry brief.
- Strategy · Intermediate · New
Toil Audit + Automation Sprint for a Platform Team
Week 1-2: every team member logs every toil instance for 10 working days (timestamp, category, duration). Categorize using the Google SRE toil taxonomy (manual, repetitive, auto…
- Toil Reduction
- Automation
- Reliability Engineering
Site Reliability Engineering
- Analysis · Intermediate · New
Postmortem and Action-Item Tracking for a 3-Service Outage
Rewrite the existing postmortem in the Google SRE blameless format: timeline (anchored to UTC), what went well, what went poorly, where we got lucky, action items (S.M.A.R.T…
- Postmortems
- Blameless Culture
- Action Item Tracking
Site Reliability Engineering
- Analysis · Intermediate · New
Define SLOs and Error Budgets for a Real-Time Trading API
Pull 90 days of per-endpoint API latency and error data from Prometheus (anonymized exports provided). Propose Service Level Indicators (SLIs) for 3 services × 2 SLI types (availa…
- SLO Definition
- Error Budgets
- SLI Design
Site Reliability Engineering
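To give a feel for the arithmetic behind the SLO and error budget brief above, here is a minimal sketch in Python. It assumes a request-based availability SLI (good requests divided by total requests); the function name, the 99.9% target, and the request counts are hypothetical and are not taken from the brief, whose full text is truncated on this page.

```python
def error_budget(slo_target: float, total_requests: int, failed_requests: int) -> dict:
    """Summarize error budget usage for a request-based availability SLO.

    slo_target is a fraction strictly between 0 and 1 (e.g. 0.999 for 99.9%).
    All inputs are illustrative; real numbers would come from your metrics export.
    """
    if not 0.0 < slo_target < 1.0:
        raise ValueError("slo_target must be strictly between 0 and 1")
    if total_requests <= 0:
        raise ValueError("total_requests must be positive")
    if not 0 <= failed_requests <= total_requests:
        raise ValueError("failed_requests must be between 0 and total_requests")

    # The error budget is the share of requests the SLO allows to fail.
    allowed_failures = (1.0 - slo_target) * total_requests
    # Measured availability SLI over the same window.
    availability = 1.0 - failed_requests / total_requests
    # Fraction of the budget already spent; values above 1.0 mean the SLO was missed.
    budget_consumed = failed_requests / allowed_failures

    return {
        "availability": availability,
        "allowed_failures": allowed_failures,
        "budget_consumed": budget_consumed,
        "budget_remaining": 1.0 - budget_consumed,
    }


if __name__ == "__main__":
    # Hypothetical window: 1,000,000 requests, 250 failures, 99.9% target.
    summary = error_budget(0.999, 1_000_000, 250)
    for name, value in summary.items():
        print(f"{name}: {value:.6f}")
```

A latency SLI can be handled the same way by counting requests slower than a chosen threshold as "failed"; the threshold itself would be part of what the brief asks you to propose.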
How it works
From brief to credential, in six steps.
Step 01
Browse challenges aligned to your studies.
Step 02
Accept the one that fits your goals.
Step 03
Work through it with AI Copilot guidance.
Step 04
Submit for structured evaluation.
Step 05
Earn a verified credential.
Step 06
Add it to LinkedIn with one click.
Industry teams behind a decade of practitioner briefs
Hiring from this pool?
Sponsor a challenge and meet candidates through actual work.
Industry teams can shape briefs around the skills they hire for, then evaluate students on rubric-scored deliverables rather than resumes.