- Research, Expert, New
Approximation Algorithm for an SRE On-Call Roster
Formulate the roster as a constrained multi-week assignment problem. Show it's NP-hard via reduction. Design a deterministic constant-factor approximation (likely candidate: an …
- Approximation Algorithms
- Linear Programming
- NP-Completeness
Advanced Algorithms
- Code, Intermediate, New
Implement Progressive Delivery with Flagger for an E-Commerce Backend
Install Flagger (or Argo Rollouts) into the existing Kubernetes + Istio stack. Configure canary analysis using Prometheus metrics: request-success-rate, request-duration p99, an…
- Flagger
- Argo Rollouts
- Canary Deployment
GitOps and Continuous Delivery
- Analysis, Advanced, New
Randomized Load Balancer with the Power of Two Choices
Simulate four placement policies on a 24-hour anonymized connection-establishment trace (around 4.1 billion events): (1) random-1, (2) round-robin, (3) P2C with instantaneous lo…
- Power of Two Choices
- Load Balancing
- Randomized Algorithms
Randomized Algorithms
- Code, Intermediate, New
Build an I/O Benchmarking Harness for an Edge Storage Appliance
Receive the appliance specs (4x 7.68TB Gen4 NVMe, ZFS, Linux kernel 5.15), the 3 target workload profiles (4KB random read at QD32, 1MB sequential write at QD8, mixed 70/30 read…
- I/O Benchmarking
- fio
- System Calls
Computer Systems and Organization
Practice your coursework on real scenarios.
Every challenge is shaped from real industry context, not generic exercises. The work mirrors what your degree prepares you for.
Why Ewance
- Code, Intermediate, New
Observability Injection: Distributed Tracing via Sidecars
Enable Envoy tracing with the OpenTelemetry tracer in Istio MeshConfig. Configure a Tempo backend with a Grafana frontend. Verify W3C tracecontext propagation across all 26 serv…
- Distributed Tracing
- OpenTelemetry
- Envoy
Service Mesh and Microservices Networking
- Code, Advanced, New
Instrument Network Telemetry for an ISP's Backbone
Receive the backbone topology (12 routers across 4 PoPs, mix of Cisco IOS XR + Juniper Junos), the current SNMP-based monitoring stack, and 4 weeks of customer-complaint tickets…
- Network Telemetry
- gNMI
- Kafka
Advanced Computer Networks
- Strategy, Advanced, New
Design a Multi-Region Blue/Green Database Deployment for an EdTech Platform
Design the migration: source PostgreSQL 14 (primary in São Paulo) to target PostgreSQL 16 (primary in Mexico City, replica in Bogotá). Use logical replication for zero-downtime …
- Blue/Green Deployment
- Logical Replication
- PostgreSQL
DevOps and Secure Deployment
- Analysis, Intermediate, New
Define SLOs and Error Budgets for a Real-Time Trading API
Pull 90 days of API latency + error data per endpoint from Prometheus (anonymized exports provided). Propose Service Level Indicators (SLIs) for 3 services × 2 SLI types (availa…
- SLO Definition
- Error Budgets
- SLI Design
Site Reliability Engineering
Browse challenges
Explore role
Product Manager
Ship product that solves real user problems. Combine user research, prototyping, and stakeholder alignment to turn ambiguous briefs into measurable wins. This is the role at the centre of modern software teams.
- Analysis, Advanced, New
Diagnose Modern Transport-Protocol Performance for an OTT Streamer
Receive the current delivery architecture (HTTP/2 origin + CDN), 4 weeks of Conviva-style QoE (quality of experience) metrics, and access to a synthetic-client harness (Linux + …
- QUIC and HTTP/3
- Network Measurement
- Transport Protocols
Advanced Computer Networks
- Presentation, Intermediate, New
Run an Incident-Response Tabletop for a Healthtech On-Call Team
Design 3 tabletop scenarios with realistic timeline injects (every 5-10 minutes, new info arrives). Run the tabletop hybrid (in-person + remote) with the 8 on-call engineers + 2…
- Incident Response
- Tabletop Exercises
- Incident Command System
Site Reliability Engineering
- Design, Advanced, New
Observability for a Microservices Payments Platform
Design the observability architecture: OpenTelemetry traces from 38 services into Tempo, structured logs via Loki, RED (rate, errors, duration) metrics via Prometheus, SLOs defi…
- Observability
- OpenTelemetry
- SLO Design
DevOps and Secure Deployment
- Code, Expert, New
Profile and Tame a P99-Latency Tail for an Ad-Auction Service
Receive the bidder source (Go, around 22,000 lines), production traces (eBPF + flame graphs from 30 minutes of peak traffic), and the host config (NUMA-2 socket, 96 cores, 384GB…
- Performance Optimization
- eBPF
- Go
Performance Engineering of Software Systems
Build a verifiable portfolio.
Submissions become evidence. Reviewers with shipping experience score against a rubric; the result becomes a credential anyone can verify.
- Code, Advanced, New
Canary Deployments via Mesh Traffic Splitting for a Streaming Platform
Set up Flagger with Istio as the traffic provider. Define a canary policy for the playback-API service: 5 percent → 25 percent → 50 percent → 100 percent with 5-minute bake at e…
- Istio
- Flagger
- Canary Deployment
Service Mesh and Microservices Networking
- Analysis, Advanced, New
Debug Latency Tail With Distributed Tracing on a Logistics SaaS
Receive 7 days of anonymized trace data in Tempo, the service map (12 services), and the customer complaint log. Investigate: filter the slowest 1 percent of traces, identify th…
- Distributed Tracing
- Performance Analysis
- Tempo
Software Observability
- Design, Advanced, New
Design SLO-Driven Alerts for a Telco's Subscriber API
Receive a 90-day RED (Rate, Errors, Duration) metrics export for the subscriber API across 6 endpoints and 38 weeks of paging history. Define an SLO per endpoint (e.g., 99.9 per…
- SLO Design
- Alerting
- Prometheus
Software Observability
- Analysis, Intermediate, New
Postmortem and Action-Item Tracking for a 3-Service Outage
Rewrite the existing postmortem using the Google SRE blameless format: timeline (anchored to UTC), what went well, what went poorly, where we got lucky, action items (S.M.A.R.T…
- Postmortems
- Blameless Culture
- Action Item Tracking
Site Reliability Engineering
- Analysis, Advanced, New
Capacity Planning Model for a Black-Friday Traffic Surge
Pull 18 months of per-service request rate + utilization from Prometheus. Forecast BFCM traffic per service using a baseline + multiplicative seasonal model (Prophet or statsmod…
- Capacity Planning
- Forecasting
- Autoscaling
Site Reliability Engineering
- Analysis, Advanced, New
TCP Congestion Control Comparison on a Long-Fat Network
Set up two Linux test hosts in Sydney + Frankfurt cloud regions (or one host pair with tc-netem emulating 280ms RTT). Run iperf3 transfers using CUBIC and BBR at 4 loss rates (0…
- TCP/IP
- Congestion Control
- Performance Testing
Computer Networks
- Strategy, Intermediate, New
Toil Audit + Automation Sprint for a Platform Team
Weeks 1-2: every team member logs every toil instance for 10 working days (timestamp, category, duration). Categorize using the Google SRE toil taxonomy (manual, repetitive, auto…
- Toil Reduction
- Automation
- Reliability Engineering
Site Reliability Engineering
- Code, Advanced, New
Roll Out OpenTelemetry Tracing Across a Microservices Fintech
Receive an anonymized service map (90 services, payment-critical path of 12), a runtime mix (Node.js, Go, Java), and existing logging/metrics setup. Define: an OTel SDK adoption…
- Distributed Tracing
- OpenTelemetry
- Sampling Strategies
Software Observability
How it works
From brief to credential, in six steps.
Step 01
Browse challenges aligned to your studies.
Step 02
Accept the one that fits your goals.
Step 03
Work through it with AI Copilot guidance.
Step 04
Submit for structured evaluation.
Step 05
Earn a verified credential.
Step 06
Add it to LinkedIn with one click.
Industry teams behind a decade of practitioner briefs
Hiring from this pool?
Sponsor a challenge and meet candidates through actual work.
Industry teams can shape briefs around the skills they hire for, then evaluate students on rubric-scored deliverables rather than resumes.
Skills and disciplines shown on this page are derived from the Ewance challenge catalogue. When the median annual salary is available for this role via Adzuna, it will be shown above with the sample size and country.