Data Sciences Challenges

Explore data science challenges on Ewance to build skills employers expect from analysts and ML engineers. Work through challenges on data cleaning, exploratory analysis, modeling, and storytelling with data.

Analysis | Intermediate | New
Optimize Stop-Loss Policies with Dynamic Programming at a Quant Fund
You receive five years of daily PnL series for 12 momentum strategies plus a small set of state features (rolling vol, drawdown, regime indicator). Calibrate a discrete Markov m…
- Dynamic Programming
- Backward Induction
- State Modeling
Decision Making Under Uncertainty
Analysis | Intermediate | New
Frame an Energy-Storage Dispatch Decision as a Bayesian Decision Problem
You receive 2 years of hourly spot-price data, 2 years of wind generation data, and a manufacturer's battery degradation model. Frame dispatch as a Bayesian decision problem: mo…
- Bayesian Decision Theory
- Price Modeling
- Backtesting
Decision Making Under Uncertainty
Analysis | Intermediate | New
Simulate Hospital Bed Allocation for a Healthtech Decision Support Pilot
You receive 12 months of anonymized admissions and discharges data plus ward layouts (medicine, surgery, ICU, geriatrics) and a small set of clinical transfer rules. Build a dis…
- Discrete Event Simulation
- SimPy
- Policy Comparison
Decision Support Systems and Decision Analysis
Design | Intermediate | New
Score Compliance Risk for an Enterprise AI Rollout Pipeline
You will design a compliance-risk scoring methodology covering 8 attributes (data residency, model provider, retention policy, PII handling, audit trail, encryption, third-party…
- Risk Scoring
- Compliance Modeling
- Decision Support Systems
Decision Support Systems and Decision Analysis
Develop in-demand professional skills.
Each challenge names the skills it strengthens. Over time, your profile fills with the competences a hiring manager would actually look for.
Code | Intermediate | New
Train a VAE for Synthetic Tabular Data at a Healthtech Startup
You receive a synthetic-but-realistic clinical-trial table (around 50,000 patients, 35 columns, mixed continuous and categorical). Train a tabular VAE (or TVAE/CTGAN as alternat…
- VAE
- Tabular Generation
- Synthetic Data
Deep Generative Models
Code | Intermediate | New
Build a GAN-Based Defect Generator for a Hardware Manufacturing Line
You receive around 60,000 good-unit images and around 380 defective-unit images across 4 defect classes. Train a class-conditional GAN (StyleGAN2-ADA or a smaller alternative fo…
- GANs
- Class-Conditional Generation
- Data Augmentation
Deep Generative Models
Research | Intermediate | New
Prototype a Normalizing Flow for Anomaly Scoring in Climate Sensor Data
You receive 12 months of multivariate sensor traces (8 channels per sensor, hourly). Train a Normalizing Flow (Real NVP or a small Neural Spline Flow) on a clean training window…
- Normalizing Flows
- Density Estimation
- Anomaly Detection
Deep Generative Models
Code | Intermediate | New
Design a Visual Search Backend for a Boutique Luxury Marketplace
You receive a catalog of 80,000 luxury items (image + sparse metadata) and a labeled query set of 300 user photos with hand-picked target items. Choose an embedding strategy (CL…
- Visual Search
- Embeddings
- CLIP
Deep Learning for Computer Vision
Explore role
Pricing Strategist
Set the price that captures value without leaving sales on the table. Demand modelling, willingness-to-pay research, and the disciplined experimentation that turns pricing into a competitive advantage.
Research | Intermediate | New
Tune a PPO Policy for an Energy-Storage Trading Bot
You receive 18 months of 15-minute Nordic spot-price data, a battery dynamics model (capacity, round-trip efficiency, degradation curve), and a rule-based baseline that earns ab…
- Policy Gradients
- PPO
- Reinforcement Learning
Deep Reinforcement Learning
Code | Intermediate | New
Use Actor-Critic to Auto-Tune an HVAC Control Policy
You receive a Sinergym wrapper around the EnergyPlus model of one floor with 8 thermal zones, weather data for one year, and occupancy schedules. Train a Soft Actor-Critic (SAC,…
- Actor-Critic
- Soft Actor-Critic
- Continuous Control
Deep Reinforcement Learning
Analysis | Intermediate | New
Imitation Learning from Human Demos for a Drone Inspection
You receive 6 hours of expert pilot demonstrations (state-action pairs at 20 Hz) recorded in an AirSim wind-farm environment with 3 turbine designs, plus a held-out 4th turbine …
- Imitation Learning
- Behavioral Cloning
- DAgger
Deep Reinforcement Learning
Research | Intermediate | New
Hardware-Aware NAS for a Wearable ECG Classifier
You receive a labeled subset of an arrhythmia ECG dataset (about 80,000 10-second windows, 4 classes), a microcontroller latency lookup table (op-level milliseconds) for a Corte…
- Neural Architecture Search
- Hardware-Aware Design
- Edge Inference
Edge ML and On-Device Machine Learning
Get recognized by recruiters and employers.
Credentials are blockchain-anchored via LearnCoin — tamper-evident, portable, link-shareable on LinkedIn and beyond.
Code | Intermediate | New
Solve a Vehicle-Routing Problem with Tabu Search
You receive a week of anonymized daily VRPTW instances (around 800 orders per day, 120 vehicles, hard delivery windows). Implement tabu search with: a route-insertion constructi…
- Tabu Search
- Metaheuristics
- Vehicle Routing
Evolutionary Computation and Metaheuristic Search
Design | Intermediate | New
Counterfactual Explanations for an Insurance Pricing Model
You receive a trained LightGBM regression model (premium in GBP), the feature schema (28 features, 14 mutable from the customer's side), and 500 sample quotes. Use DiCE (Diverse…
- Counterfactual Explanations
- DiCE ML
- Interpretability
Explainable and Interpretable AI
Code | Intermediate | New
LoRA Fine-Tune a 7B LLM for Legal-Clause Extraction
You receive a curated extraction dataset (2,000 train, 500 val, 500 test contracts with span-level labels across 12 clause types) and a fine-tunable 7B base model (e.g., Llama-3…
- LoRA
- Fine-Tuning
- Parameter-Efficient Tuning
Fine-Tuning Large Language Models
Code | Intermediate | New
Instruction-Tune a Small Model for an Edtech Tutor
You receive a 1.5B base model (e.g., SmolLM-1.7B or Qwen-1.8B), permission to use 2 hours of a rented A100, and a curated seed of around 5,000 math-tutoring dialogues. Augment w…
- Instruction Tuning
- Supervised Fine-Tuning
- Dataset Curation
Fine-Tuning Large Language Models
Analysis | Intermediate | New
Catastrophic-Forgetting Audit on a Domain Fine-Tune
You receive the fine-tuned 7B chemistry model and its base, plus a benchmark basket (MMLU subset, GSM8K, IFEval, a small instruction-following set). Run all 4 benchmarks on both…
- Catastrophic Forgetting
- LLM Evaluation
- Fine-Tuning
Fine-Tuning Large Language Models
Design | Intermediate | New
Build an OWL Ontology for a Pharma R&D Knowledge Base
You receive a CSV-form starter knowledge base (around 4,000 compounds, 600 targets, 1,200 assays) and a list of 12 competency questions the scientists currently can't answer wit…
- Ontology Design
- OWL
- Knowledge Representation
Fuzzy Logic, Knowledge Representation, and Symbolic Reasoning
Research | Intermediate | New
Evaluate VAEs vs. Diffusion for Synthetic Tabular-Data Generation
You receive a real labeled dataset (around 18,000 anonymized patient records, 32 features, binary outcome) and the team's existing VAE baseline. Train a tabular diffusion model …
- Tabular Diffusion
- VAE
- Synthetic Data
Generative AI
Code | Intermediate | New
Train a GNN for Fraud-Ring Detection at a Payments Fintech
You receive an anonymized transaction dataset (around 120,000 merchants, around 4 million transactions over 12 months, around 2% labeled fraud) and the team's LightGBM baseline.…
- Graph Neural Networks
- GraphSAGE
- Fraud Detection
Machine Learning on Graphs
Code | Intermediate | New
Link Prediction for a B2B SaaS Account-Expansion Engine
You receive a CSV of around 80,000 accounts (existing customers + prospects) with attributes (industry, size, tech stack, geography) plus 18 months of marketing-touch and conver…
- Link Prediction
- Node Embeddings
- node2vec
Open coursework
Code | Intermediate | New
Design Safe Navigation Behavior for a Hospital Delivery Robot
You receive a dataset of 200 anonymized hospital corridor traces (people positions over time from the robot's LIDAR) plus the current planner's parameters. Design a policy that …
- Human-Aware Navigation
- ROS 2
- Motion Planning
Human-Robot Interaction
Code | Intermediate | New
Teach a Warehouse Cobot from Operator Demonstrations
You receive a simulated UR5e cobot in PyBullet, plus 12 example demonstrations of two kitting sequences. Implement Dynamic Movement Primitives (DMPs — a classic LfD technique th…
- Learning from Demonstration
- Dynamic Movement Primitives
- Human-Robot Interaction
Human-Robot Interaction
Code | Intermediate | New
Map a Climate-Policy Corpus to Linked Open Data
You receive 12,000 policy PDFs and a benchmark of 200 documents with manually linked entities (places, organizations, policies). Build a pipeline that runs NER, candidate-genera…
- Entity Linking
- Linked Open Data
- Wikidata
Knowledge Graphs and Semantic Web

How it works

From brief to credential, in six steps.

Step 01: Browse challenges aligned to your studies.
Step 02: Accept the one that fits your goals.
Step 03: Work through it with AI Copilot guidance.
Step 04: Submit for structured evaluation.
Step 05: Earn a verified credential.
Step 06: Add it to LinkedIn with one click.

Industry teams behind a decade of practitioner briefs

Hiring from this pool?

Sponsor a challenge and meet candidates through actual work.

Industry teams can shape briefs around the skills they hire for, then evaluate students on rubric-scored deliverables — not resumes.
