AI & Data
Deep Learning Challenges
Deep Learning challenges put you inside the work of building models that learn from raw data. You'll develop skills in Neural Networks and Feedforward Networks, apply Data Augmentation, and train models in PyTorch or TensorFlow alongside Reinforcement Learning fundamentals.
From there you'll handle the harder edges — Transformer architecture, Attention mechanisms, Custom architecture design, and Distributed training — working with PyTorch Lightning / Hugging Face Trainer, JAX research patterns, and Ablation study design the way research teams actually do. Each challenge you solve earns a verified credential you can share with recruiters.
Recommended Challenges
· Reinforcement Learning Clear- ResearchIntermediateNew
Tune a PPO Policy for an Energy-Storage Trading Bot
You receive 18 months of 15-minute Nordic spot-price data, a battery dynamics model (capacity, round-trip efficiency, degradation curve), and a rule-based baseline that earns ab…
- Policy Gradients
- Ppo
- Reinforcement Learning
Deep Reinforcement Learning - ResearchSeniorNew
Benchmark Reward-from-Feedback Methods on a Tabletop Pick-Place
You will use a Franka Panda arm in PyBullet on a 4-object pick-and-place task. For each of the three feedback methods, train a reward model and a downstream policy until converg…
- Reinforcement Learning
- Reward Learning
- Preference Comparison
Human-Robot Interaction - DesignIntermediateNew
Train a Self-Play Agent for a Card-Game Edtech Demo
Implement a small two-player imperfect-information card game (Kuhn poker or a 3-card simplified Hold'em variant). Implement CFR or CFR+ for the game and run self-play for at lea…
- Counterfactual Regret Minimization
- Self Play
- Game Theory
Artificial Intelligence: Principles and Techniques - CodeIntermediateNew
Use Actor-Critic to Auto-Tune a HVAC Control Policy
You receive a Sinergym wrapper around the EnergyPlus model of one floor with 8 thermal zones, weather data for one year, and occupancy schedules. Train a Soft Actor-Critic (SAC,…
- Actor Critic
- Soft Actor Critic
- Continuous Control
Deep Reinforcement Learning Practice your coursework on real scenarios.
Every challenge is shaped from real industry context — not generic exercises. The work mirrors what your degree prepares you for.
Why Ewance
- ResearchSeniorNew
Model-Based RL for a Robotic Arm Pick-Place Task
You receive a PyBullet pick-and-place environment (Franka Panda arm, 12 object types, randomized starting poses) and a SAC baseline that hits 85% success after about 1.5 million…
- Model Based Rl
- World Models
- Reinforcement Learning
Deep Reinforcement Learning - DesignSeniorNew
Dynamic Pricing Optimization for a Ride-Hailing Platform
You are a data scientist at CityRide. Using 6 months of historical trip data (pickup/dropoff, time, fare, surge multiplier), weather data, and local events calendar, you must bu…
- Reinforcement Learning
- Optimization
- Simulation
Data Science for Business - CodeSeniorNew
Train a Reinforcement-Learning Policy for Drone Obstacle Avoidance
You receive a custom Gymnasium drone-flight environment (provided), a baseline hand-engineered controller, and a target evaluation suite covering 4 obstacle densities. Train a P…
- Reinforcement Learning
- Ppo
- Robotics Simulation
Advanced Robotics - AnalysisIntermediateNew
Imitation Learning from Human Demos for a Drone Inspection
You receive 6 hours of expert pilot demonstrations (state-action pairs at 20 Hz) recorded in an AirSim wind-farm environment with 3 turbine designs, plus a held-out 4th turbine …
- Imitation Learning
- Behavioral Cloning
- Dagger
Deep Reinforcement Learning - Browse challenges
Explore role
Product Manager
Ship product that solves real user problems. Combine user research, prototyping, and stakeholder alignment to turn ambiguous briefs into measurable wins — the role at the centre of modern software teams.
- CodeBeginnerNew
Compare MDP Solvers for a Smart-Grid Battery Dispatch Pilot
Model home-battery dispatch as a finite MDP: state is (state-of-charge, hour-of-day, current price tier), actions are charge/hold/discharge with realistic efficiency losses, tra…
- Markov Decision Processes
- Value Iteration
- Policy Iteration
Artificial Intelligence: Principles and Techniques - ResearchIntermediateNew
Train a Reinforcement-Learning Locomotion Policy for a Quadruped
You receive a configured Isaac Lab environment for the quadruped, a baseline PPO trainer, and a set of 8 trip-hazard / slip stress scenarios. Train the policy for a budget of ab…
- Reinforcement Learning
- Locomotion
- Domain Randomization
Robotics - CodeIntermediateNew
Train a Deep Q-Network for Warehouse Robot Routing
You receive a Gymnasium-compatible warehouse simulator (50x50 grid, 8 dynamic obstacle pedestrians, 20 randomized pick locations) and a baseline A* planner script. Train a DQN a…
- Deep Q Learning
- Reinforcement Learning
- Pytorch Or Tensorflow
Deep Reinforcement Learning
How it works
From brief to credential, in six steps.
Step 01
Browse challenges aligned to your studies.
Step 02
Accept the one that fits your goals.
Step 03
Work through it with AI Copilot guidance.
Step 04
Submit for structured evaluation.
Step 05
Earn a verified credential.
Step 06
Add it to LinkedIn with one click.
Industry teams behind a decade of practitioner briefs
Hiring from this pool?
Sponsor a challenge and meet candidates through actual work.
Industry teams can shape briefs around the skills they hire for, then evaluate students on rubric-scored deliverables — not resumes.



















































































