Data Science Challenges
Explore data science challenges on Ewance to build skills employers expect from analysts and ML engineers. Work through challenges on data cleaning, exploratory analysis, modeling, and storytelling with data.
Most Popular
- Design · Intermediate · New
Design a Continuous Eval Pipeline for an Enterprise RAG Product
Design (and partially build) a continuous-eval pipeline for a RAG system: (1) a structured eval set with at least 50 queries grouped by query class; (2) automated scoring (LLM-a…
- Continuous Evaluation
- LLM Evaluation
- RAG Architectures
AI Measurement and Evaluation
- Research · Intermediate · New
Design a Capability Evaluation for an Open-Weights Coding Model
Pick a recent open-weights coding model (e.g., a Qwen, DeepSeek, or Llama variant). Design an evaluation set of around 40 coding tasks across 4 buckets: standard benign coding, …
- Capability Evaluation
- Safety Evaluation
- LLM Evaluation
AI Safety and Alignment
- Code · Intermediate · New
Prototype Constitutional-AI Style Guardrails for an Internal Chatbot
Author a 'constitution' of 15 to 20 principles tailored to internal research use (no IP leakage, no off-label medical claims, no personnel-data fishing, etc.). Implement a criti…
- Constitutional AI
- Alignment Techniques
- LLM Evaluation
AI Safety and Alignment
- Code · Intermediate · New
Ship an MVP RAG Knowledge Assistant for a Climate-Tech Startup
As a 4-person team across a 6-week sprint, ship: (1) an ingestion pipeline for around 4,000 mixed PDFs and markdown files; (2) a vector store with documented chunking strategy; …
- RAG Architectures
- Software Engineering for AI
- Vector Databases
AI Software Engineering · Group Project
Develop in-demand professional skills.
Each challenge names the skills it strengthens. Over time, your profile fills with the competences a hiring manager would actually look for.
Why Ewance
- Code · Intermediate · New
Prototype a Computer-Vision QA Tool for a Robotics Manufacturer
As a 4-person team, build: (1) a labeling pipeline on around 2,000 component images (Label Studio is fine); (2) a transfer-learned classifier or a small segmentation model that …
- Computer Vision
- Transfer Learning
- Model Deployment
AI Software Engineering · Group Project
- Code · Intermediate · New
Reproducible Patient-Cohort Analysis for a Pharma AI Vendor
You receive a written cohort definition (type-2 diabetes patients on metformin for at least 90 days, aged 40-70) and a target output: 12-month HbA1c change distribution plus a K…
- Reproducible Analysis
- Cohort Analysis
- Survival Analysis
Applied Data Analysis and Practical Data Science
- Code · Intermediate · New
Predict Loan Default Risk for a Cross-Border Fintech
You receive 18 months of transactions (around 12M rows) and seller-firmographic data. Define a defensible proxy label for default (e.g., a 60-day chargeback-or-dispute spike com…
- Feature Engineering
- Model Selection
- Model Evaluation
Applied Machine Learning
- Code · Intermediate · New
Forecast Energy Demand for a Nordic Renewable Utility
You receive 5 years of hourly residential-segment demand, hourly weather data (temperature, wind, irradiance), and a calendar of public holidays. Build a probabilistic forecaste…
- Time Series Forecasting
- Probabilistic Modeling
- Feature Engineering
Applied Machine Learning
Browse challenges
Explore role
Strategy Analyst
Frame the business question, model the options, build the recommendation. From market sizing to competitive analysis, this role is where strategy consulting meets in-house decision-making.
- Code · Intermediate · New
Triage Medical-Imaging Annotations with a Small Vision Model
Train a binary normal/abnormal classifier on the public CheXpert or NIH ChestX-ray14 dataset. Use temperature scaling to calibrate the output, then define abstention thresholds …
- CNN Classification
- Transfer Learning
- Calibration
Applied Machine Learning
- Code · Intermediate · New
Plan Warehouse Pick Routes with a Classical Planner
You receive a stylized warehouse map (aisle graph), 30 sample shifts of pick tasks, and the current heuristic's outputs. Write a PDDL domain + problem generator, solve with at l…
- PDDL Modeling
- State Space Search
- Classical Planning
Automated Planning
- Code · Intermediate · New
Hierarchical Plans for an Aerospace Maintenance Crew Scheduler
You receive a synthetic week of 80 work orders with hierarchical decompositions, technician certifications, and shared-tool constraints. Implement an HTN planner (PyHOP or HDDL …
- HTN Planning
- Domain Modeling
- Constraint Handling
Automated Planning
- Research · Intermediate · New
Planning Under Uncertainty for a Last-Mile Delivery Fleet
Build a simulator of the 50-block area with stochastic travel times conditioned on weather and time-of-day. Implement value iteration (for a small state space), MCTS (Monte Carl…
- Planning Under Uncertainty
- Markov Decision Processes
- Monte Carlo Tree Search
Automated Planning
Get recognized by recruiters and employers.
Credentials are blockchain-anchored via LearnCoin — tamper-evident, portable, link-shareable on LinkedIn and beyond.
Why Ewance
- Code · Intermediate · New
Temporal Planner for a Robotics Mission Operator
You receive 30 days of mission logs with task lists, time windows, and actual durations. Encode the planning problem with temporal PDDL (PDDL 2.1 durative actions) and solve wit…
- Temporal Planning
- PDDL Modeling
- Simulation
Automated Planning
- Code · Intermediate · New
Safety-Critical Test Harness for an AV Planner
Use CARLA (open-source AV simulator) and encode 10 representative safety scenarios across 3 categories (cut-in, pedestrian emergence, signalized-intersection right-of-way). Writ…
- Simulation
- Scenario Testing
- Safety Evaluation
AI for Autonomous Vehicles
- Code · Intermediate · New
Multi-Sensor Late-Fusion Prototype for an Indoor AGV
Use the public KITTI dataset (or a similar paired LiDAR+RGB dataset) restricted to static-obstacle classes. Implement a late-fusion baseline: a LiDAR-only detector (PointPillars…
- Sensor Fusion
- Object Detection
- Perception
AI for Autonomous Vehicles
- Code · Intermediate · New
Build a Serverless ETL Pipeline for a Climate-Tech Sensor Fleet
Build the pipeline using managed services only (e.g., S3 + Lambda + EventBridge + Glue, or GCS + Cloud Functions + Cloud Scheduler + BigQuery external tables). Source the data f…
- Serverless Architecture
- ETL Pipelines
- Terraform
Cloud Computing for Data and ML
- Code · Intermediate · New
Semantic Parser for an Enterprise Analytics Assistant
Define a small typed query language (filter, aggregate, group_by, time_range, metric). Curate or write 200 training examples covering the controlled subset and 50 held-out test …
- Semantic Parsing
- Grammar Design
- Transformer Models
Computational Semantics
- Code · Intermediate · New
Distributional Embeddings for a Multilingual Legal Search
Use a public multilingual corpus (e.g., MultiEURLEX or a subset of EUR-Lex) plus a small hand-built test set of around 100 cross-lingual query-passage pairs. Fine-tune (or evalu…
- Distributional Semantics
- Multilingual NLP
- Sentence Embeddings
Computational Semantics
- Code · Intermediate · New
Natural Language Inference for an HR-AI Compliance Tool
Use SNLI/MNLI/ANLI as starting data and curate 200 domain-specific HR examples (synthetic or anonymized) for fine-tuning. Fine-tune a small encoder (DeBERTa-v3-base or similar),…
- Natural Language Inference
- Transformer Models
- Fine-Tuning
Computational Semantics
- Code · Intermediate · New
Lambda-Calculus Semantic Parser for a Math-Tutor EdTech
Define a small typed lambda-calculus representation for linear equations and a small set of word-problem templates (rate, age, mixture). Build a parser that maps surface express…
- Semantic Parsing
- Lambda Calculus
- Symbolic Reasoning
Computational Semantics
- Code · Intermediate · New
Multi-View Pose Estimation for a Sports-Analytics Startup
Use the publicly released SoccerNet dataset or a synthetic 4-view dataset (you can render one with Unity or use a provided one). Implement a 2D pose estimator per view (HRNet or YOLOv8-pose…
- Pose Estimation
- Multi-View Geometry
- 3D Reconstruction
Computer Vision
- Code · Intermediate · New
Defect Detection on PCBs for a Hardware-AI Manufacturer
Use a publicly available PCB defect dataset (e.g., DeepPCB or HRIPCB). Fine-tune a small object detector (YOLOv8n or RT-DETR-small) on the 6 defect classes. Evaluate mean Aver…
- Object Detection
- Transfer Learning
- Model Evaluation
Computer Vision
- Code · Intermediate · New
3D Reconstruction of Cultural Artifacts from Photo Sets
Use COLMAP (open-source SfM) + OpenMVS (open-source MVS) on a curated dataset of 5 small artifacts plus a calibration cube. Build a single Python CLI that ingests a folder of im…
- Structure from Motion
- Multi-View Stereo
- 3D Reconstruction
Computer Vision
- Design · Intermediate · New
Design a Real-Time Order Pipeline for a Fintech Payments Platform
You receive a synthetic Kafka stream of around 500 transactions per second, a static merchant dimension table (about 80,000 rows), and a daily FX rate snapshot. Design an end-to…
- Streaming Data
- Kafka Event Streaming
- Stream Processing
Data Engineering and Big Data Systems
How it works
From brief to credential, in six steps.
Step 01
Browse challenges aligned to your studies.
Step 02
Accept the one that fits your goals.
Step 03
Work through it with AI Copilot guidance.
Step 04
Submit for structured evaluation.
Step 05
Earn a verified credential.
Step 06
Add it to LinkedIn with one click.
Industry teams behind a decade of practitioner briefs
Hiring from this pool?
Sponsor a challenge and meet candidates through actual work.
Industry teams can shape briefs around the skills they hire for, then evaluate students on rubric-scored deliverables — not resumes.