Reinforcement Learning

If you like applying Reinforcement Learning, every challenge here gives you a chance to practice it on a real industry brief.

Recommended Challenges

· Intermediate only Clear

CodeIntermediateNew
Train a Deep Q-Network for Warehouse Robot Routing
You receive a Gymnasium-compatible warehouse simulator (50x50 grid, 8 dynamic obstacle pedestrians, 20 randomized pick locations) and a baseline A* planner script. Train a DQN a…
- Deep Q Learning
- Reinforcement Learning
- Pytorch
Deep Reinforcement Learning
CodeIntermediateNew
Use Actor-Critic to Auto-Tune a HVAC Control Policy
You receive a Sinergym wrapper around the EnergyPlus model of one floor with 8 thermal zones, weather data for one year, and occupancy schedules. Train a Soft Actor-Critic (SAC,…
- Actor Critic
- Soft Actor Critic
- Continuous Control
Deep Reinforcement Learning
ResearchIntermediateNew
Tune a PPO Policy for an Energy-Storage Trading Bot
You receive 18 months of 15-minute Nordic spot-price data, a battery dynamics model (capacity, round-trip efficiency, degradation curve), and a rule-based baseline that earns ab…
- Policy Gradients
- Ppo
- Reinforcement Learning
Deep Reinforcement Learning
ResearchIntermediateNew
Train a Reinforcement-Learning Locomotion Policy for a Quadruped
You receive a configured Isaac Lab environment for the quadruped, a baseline PPO trainer, and a set of 8 trip-hazard / slip stress scenarios. Train the policy for a budget of ab…
- Reinforcement Learning
- Locomotion
- Domain Randomization
Robotics
Practice your coursework on real scenarios.
Every challenge is shaped from real-world context — not generic exercises. The work mirrors what your degree prepares you for.
Why Ewance
DesignIntermediateNew
Train a Self-Play Agent for a Card-Game Edtech Demo
Implement a small two-player imperfect-information card game (Kuhn poker or a 3-card simplified Hold'em variant). Implement CFR or CFR+ for the game and run self-play for at lea…
- Counterfactual Regret Minimization
- Self Play
- Game Theory
Artificial Intelligence: Principles and Techniques
AnalysisIntermediateNew
Imitation Learning from Human Demos for a Drone Inspection
You receive 6 hours of expert pilot demonstrations (state-action pairs at 20 Hz) recorded in an AirSim wind-farm environment with 3 turbine designs, plus a held-out 4th turbine …
- Imitation Learning
- Behavioral Cloning
- Dagger
Deep Reinforcement Learning

How it works

From brief to credential, in six steps.

Step 01
Browse challenges aligned to your studies.
Step 02
Accept the one that fits your goals.
Step 03
Work through it with AI Copilot guidance.
Step 04
Submit for structured evaluation.
Step 05
Earn a verified credential.
Step 06
Add it to LinkedIn with one click.

Industry teams behind a decade of practitioner briefs

Hiring from this pool?

Sponsor a challenge and meet candidates through actual work.

Industry teams can shape briefs around the skills they hire for, then evaluate students on rubric-scored deliverables — not resumes.

Explore sponsorship