Data Sciences Challenges

Explore data science challenges on Ewance to build skills employers expect from analysts and ML engineers. Work through challenges on data cleaning, exploratory analysis, modeling, and storytelling with data.

Code · Intermediate · New
Plan Warehouse Pick Routes with a Classical Planner
You receive a stylized warehouse map (aisle graph), 30 sample shifts of pick tasks, and the current heuristic's outputs. Write a PDDL domain + problem generator, solve with at l…
- PDDL Modeling
- State-Space Search
- Classical Planning
Open coursework
Code · Intermediate · New
Hierarchical Plans for an Aerospace Maintenance Crew Scheduler
You receive a synthetic week of 80 work orders with hierarchical decompositions, technician certifications, and shared-tool constraints. Implement an HTN planner (PyHOP or HDDL …
- HTN Planning
- Domain Modeling
- Constraint Handling
Automated Planning
Research · Intermediate · New
Planning Under Uncertainty for a Last-Mile Delivery Fleet
Build a simulator of the 50-block area with stochastic travel times conditioned on weather and time-of-day. Implement value iteration (for a small state space), MCTS (Monte Carl…
- Planning Under Uncertainty
- Markov Decision Processes
- Monte Carlo Tree Search
Open coursework
Code · Intermediate · New
Temporal Planner for a Robotics Mission Operator
You receive 30 days of mission logs with task lists, time windows, and actual durations. Encode the planning problem with temporal PDDL (PDDL 2.1 durative actions) and solve wit…
- Temporal Planning
- PDDL Modeling
- Simulation
Automated Planning
Practice your coursework on real scenarios.
Every challenge is shaped from real-world context — not generic exercises. The work mirrors what your degree prepares you for.
Code · Intermediate · New
Safety-Critical Test Harness for an AV Planner
Use CARLA (open-source AV simulator) and encode 10 representative safety scenarios across 3 categories (cut-in, pedestrian emergence, signalized-intersection right-of-way). Writ…
- Simulation
- Scenario Testing
- Safety Evaluation
Open coursework
Code · Intermediate · New
Multi-Sensor Late-Fusion Prototype for an Indoor AGV
Use the public KITTI dataset (or a similar paired LiDAR+RGB dataset) restricted to static-obstacle classes. Implement a late-fusion baseline: a LiDAR-only detector (PointPillars…
- Sensor Fusion
- 3D Object Detection
- Perception
Open coursework
Code · Intermediate · New
Build a Serverless ETL Pipeline for a Climate-Tech Sensor Fleet
Build the pipeline using managed services only (e.g., S3 + Lambda + EventBridge + Glue, or GCS + Cloud Functions + Cloud Scheduler + BigQuery external tables). Source the data f…
- Serverless Architecture
- ETL Pipelines
- Infrastructure as Code
Cloud Computing for Data and ML
Code · Intermediate · New
Semantic Parser for an Enterprise Analytics Assistant
Define a small typed query language (filter, aggregate, group_by, time_range, metric). Curate or write 200 training examples covering the controlled subset and 50 held-out test …
- Semantic Parsing
- Grammar Design
- Transformer Models
Computational Semantics
Explore role
Marketing Analyst
Plan and measure campaigns that grow the business. Funnel analytics, attribution, segmentation, and the rigorous measurement that lets marketing defend its budget at the leadership table.
Code · Intermediate · New
Distributional Embeddings for a Multilingual Legal Search
Use a public multilingual corpus (e.g., MultiEURLEX or a subset of EUR-Lex) plus a small hand-built test set of around 100 cross-lingual query-passage pairs. Fine-tune (or evalu…
- Distributional Semantics
- Multilingual NLP
- Sentence Embeddings
Computational Semantics
Code · Intermediate · New
Natural Language Inference for an HR-AI Compliance Tool
Use SNLI/MNLI/ANLI as starting data and curate 200 domain-specific HR examples (synthetic or anonymized) for fine-tuning. Fine-tune a small encoder (DeBERTa-v3-base or similar),…
- Natural Language Inference
- Transformer Models
- Fine-Tuning
Open coursework
Code · Intermediate · New
Lambda-Calculus Semantic Parser for a Math-Tutor EdTech
Define a small typed lambda-calculus representation for linear equations and a small set of word-problem templates (rate, age, mixture). Build a parser that maps surface express…
- Semantic Parsing
- Lambda Calculus
- Symbolic Reasoning
Computational Semantics
Code · Intermediate · New
Multi-View Pose Estimation for a Sports-Analytics Startup
Use the publicly released SoccerNet dataset or a synthetic 4-view dataset (you can render with Unity or use a provided one). Implement a 2D pose estimator per view (HRNet or YOLOv8-pose…
- Pose Estimation
- Multi-View Geometry
- 3D Reconstruction
Open coursework
Build a verifiable portfolio.
Submissions become evidence. Reviewers with shipping experience score against a rubric; the result becomes a credential anyone can verify.
Code · Intermediate · New
Defect Detection on PCBs for a Hardware-AI Manufacturer
Use a publicly available PCB defect dataset (e.g., DeepPCB or HRIPCB). Fine-tune a small object detector (YOLOv8n or RT-DETR-small) on the 6 defect classes. Evaluate mean Aver…
- Object Detection
- Transfer Learning
- Model Evaluation
Computer Vision
Code · Intermediate · New
3D Reconstruction of Cultural Artifacts from Photo Sets
Use COLMAP (open-source SfM) + OpenMVS (open-source MVS) on a curated dataset of 5 small artifacts plus a calibration cube. Build a single Python CLI that ingests a folder of im…
- Structure from Motion
- Multi-View Stereo
- 3D Reconstruction
Computer Vision
Design · Intermediate · New
Design a Real-Time Order Pipeline for a Fintech Payments Platform
You receive a synthetic Kafka stream of around 500 transactions per second, a static merchant dimension table (about 80,000 rows), and a daily FX rate snapshot. Design an end-to…
- Streaming Data
- Kafka
- Stream Processing
Data Engineering and Big Data Systems
Code · Intermediate · New
Build a Feature Store Backbone for a Healthtech ML Team
You receive synthetic wearable telemetry (heart rate, accelerometer, sleep stages) for around 5,000 patients across 90 days, plus the existing scattered feature scripts from the…
- Feature Engineering
- Data Modeling
- Python
Data Engineering and Big Data Systems
Code · Intermediate · New
Migrate a Legacy Warehouse to a Lakehouse for an Edtech AI Platform
You receive a Postgres dump of around 50 GB and the current dbt models that produce the student-attempts mart. Land the raw data in object storage (S3 or GCS) as Parquet partiti…
- Lakehouse Architecture
- Delta Lake
- Spark
Data Engineering and Big Data Systems
Code · Intermediate · New
Build a Vector-Search Backend for an Enterprise AI Knowledge Assistant
You receive a corpus of around 20,000 PDFs (mixed scanned and digital) totalling around 30 GB and a labeled retrieval set of 200 queries with human-judged ground-truth passages.…
- RAG
- Vector Search
- Embeddings
Data Engineering and Big Data Systems
Code · Intermediate · New
Build an Anomaly-Detection Pipeline for Pharma Cold-Chain Logistics
You receive 18 months of shipment telemetry (around 60,000 shipments, around 12 million sensor readings) plus a hand-labeled set of 1,200 incidents (mix of true excursions, sens…
- Anomaly Detection
- Feature Engineering
- Time Series
Data Mining and Knowledge Discovery
Code · Intermediate · New
Detect Coordinated Fraud Rings via Link Analysis at a Neobank
You receive 90 days of account, login, and transaction data (around 1.2 million accounts, around 30 million events) plus a labeled set of 80 known fraud rings. Build a multi-rel…
- Graph Analysis
- Community Detection
- Link Analysis
Data Mining and Knowledge Discovery
Code · Intermediate · New
Build a Hybrid Recommender for a Niche Consumer-AI Music App
You receive listening events (around 240 million plays) plus a content embedding per track (audio + curator tags). Build a collaborative filtering model (ALS or implicit-feedbac…
- Recommender Systems
- Collaborative Filtering
- Content-Based Filtering
Data Mining and Knowledge Discovery
Design · Intermediate · New
Visualize Embedding Drift for a RAG Knowledge Assistant
You receive weekly snapshots over 12 weeks of around 50,000 document embeddings each (1024-dim). Design and build a visualization tool that: (a) projects each snapshot to 2D wit…
- Embeddings
- Dimensionality Reduction
- UMAP
Open coursework
Code · Intermediate · New
Plan Inventory Replenishment as an MDP for an E-Commerce AI Startup
You receive 18 months of daily demand for 50 representative SKUs at one warehouse plus lead-time and unit-cost data. For one SKU at a time, formulate an MDP with state = (on-han…
- MDP Modeling
- Value Iteration
- Dynamic Programming
Open coursework
Code · Intermediate · New
Run a Monte Carlo Tree Search Strategy for a Robotics Pick-and-Place Task
You receive a simulator of the pick-and-place task: a bin with 10 randomly placed parts, an action space of which part to pick next, and a reward = parts picked per minute with …
- Monte Carlo Tree Search
- Planning
- Simulation
Decision Making Under Uncertainty

How it works

From brief to credential, in six steps.

Step 01
Browse challenges aligned to your studies.
Step 02
Accept the one that fits your goals.
Step 03
Work through it with AI Copilot guidance.
Step 04
Submit for structured evaluation.
Step 05
Earn a verified credential.
Step 06
Add it to LinkedIn with one click.

Industry teams behind a decade of practitioner briefs

Hiring from this pool?

Sponsor a challenge and meet candidates through actual work.

Industry teams can shape briefs around the skills they hire for, then evaluate students on rubric-scored deliverables — not resumes.

Explore sponsorship