Computer & Information Sciences

Data Science Challenges

Real data-science projects and challenges on Ewance — clean messy datasets, build and evaluate models, and turn raw data into decisions the way a working data scientist does. Solve them to build a portfolio of verified, recruiter-checkable proof you can do the work — not just describe it.

Recommended challenges

CodeBeginnerNew
Optimize Wind-Turbine Layout with a Genetic Algorithm
You receive a wind-speed-and-direction time series for the lease area, the polygon boundary, a minimum inter-turbine spacing constraint, and a Jensen wake model. Implement a rea…
- Genetic Algorithms
- Metaheuristics
- Constraint Handling
Evolutionary Computation and Metaheuristic Search
ResearchIntermediateNew
Tune a PPO Policy for an Energy-Storage Trading Bot
You receive 18 months of 15-minute Nordic spot-price data, a battery dynamics model (capacity, round-trip efficiency, degradation curve), and a rule-based baseline that earns ab…
- Policy Gradients
- Ppo
- Reinforcement Learning
Deep Reinforcement Learning
ResearchIntermediateNew
Design a Capability Evaluation for an Open-Weights Coding Model
Pick a recent open-weights coding model (e.g., a Qwen, DeepSeek, or Llama variant). Design an evaluation set of around 40 coding tasks across 4 buckets: standard benign coding, …
- Capability Evaluation
- Safety Evaluation
- LLM Evaluation
AI Safety and Alignment
AnalysisBeginnerNew
Interpretable-by-Design GAM for an Insurer's Claims Triage
You receive an anonymized claims dataset (around 60,000 claims, target: log reserve), a feature schema (22 features), and an existing LightGBM baseline (held-out R^2 of 0.78). T…
- Generalized Additive Models
- Ebm
- Interpretability
Explainable and Interpretable AI
Practice your coursework on real scenarios.
Every challenge is shaped from real-world context — not generic exercises. The work mirrors what your degree prepares you for.
Why Ewance
CodeIntermediateNew
Build a Forgetting-Curve-Aware Spaced-Repetition Engine
You receive 6 months of practice logs from 8,000 learners (item, timestamp, response correctness, response latency). Fit a learner-personalized forgetting model (logistic per-it…
- Spaced Repetition
- Personalization
- Behavioral Data
AI in Education and Learning Analytics
CodeIntermediateNew
Build a Small Transformer from Scratch and Train It on Code
Implement multi-head self-attention, RMSNorm, rotary positional embeddings, and a causal LM head from scratch — no Hugging Face shortcuts for the model code (you may use Hugging…
- Transformers
- Self Attention
- Pytorch
Neural Networks for NLP
DesignSeniorNew
Multi-Region Failover for an Enterprise RAG Service
Design and prototype: (1) a primary-region deployment of the RAG service (vector DB + LLM inference + retrieval API), (2) a passive secondary region with replicated vector store…
- Multi Region Architecture
- Disaster Recovery
- Infrastructure As Code
Cloud Computing for Data and ML
CodeIntermediateNew
Build an Ensemble Strategy for Marketing-Mix Modelling
You receive 36 months of weekly marketing-spend and outcome data for 8 sample brands. Build a per-brand baseline gradient-boosting MMM model, then build two more base learners (…
- Ensemble Methods
- Stacking
- Time Series Cv
Machine Learning
Explore role
Product Manager
Ship product that solves real user problems. Combine user research, prototyping, and stakeholder alignment to turn ambiguous briefs into measurable wins — the role at the centre of modern software teams.
Browse challenges
AnalysisIntermediateNew
Audit a Sepsis Early-Warning Model for Subgroup Performance
You receive a pre-trained vendor model, the training-data summary, and a held-out hospital-network evaluation set (about 18,000 ICU stays with sepsis labels). Compute AUROC + AU…
- Model Evaluation
- Fairness Metrics
- Model Calibration
Machine Learning for Healthcare and Biomedicine
AnalysisFoundationalNew
Cluster Climate-Tech SMB Customers for a Growth Team
You receive a CSV with company size, industry sub-vertical, country, product features adopted, monthly active users, and lifetime value. Standardize features, decide on a cluste…
- Unsupervised Learning
- Clustering
- Dimensionality Reduction
Machine Learning (Undergraduate)
CodeIntermediateNew
Teach a Warehouse Cobot from Operator Demonstrations
You receive a simulated UR5e cobot in PyBullet, plus 12 example demonstrations of two kitting sequences. Implement Dynamic Movement Primitives (DMPs — a classic LfD technique th…
- Learning From Demonstration
- Dynamic Movement Primitives
- Human Robot Interaction
Human-Robot Interaction
DesignBeginnerNew
Chain-of-Thought for High-School Math Tutoring
You receive 80 practice problems across 4 topics (linear equations, factoring, systems of equations, quadratics), each with the correct answer and an expected age-appropriate ex…
- Chain Of Thought Prompting
- Zero Shot Prompting
- Few Shot Prompting
Prompt Engineering
Build a verifiable portfolio.
Submissions become evidence. Reviewers with shipping experience score against a rubric; the result becomes a credential anyone can verify.
Why Ewance
CodeIntermediateNew
Scale Feature Pipelines for a Hyperscaler Search-Ranking Team
You receive a synthetic-but-realistic 80 GB sample of the ranking events plus the existing Spark pipeline (PySpark) and a Spark UI snapshot from a recent production run. Profile…
- Spark
- Distributed Systems
- Performance Profiling
Machine Learning at Scale
CodeIntermediateNew
Build a Feature Store Backbone for a Healthtech ML Team
You receive synthetic wearable telemetry (heart rate, accelerometer, sleep stages) for around 5,000 patients across 90 days, plus the existing scattered feature scripts from the…
- Feature Engineering
- Data Modeling
- Python
Data Engineering and Big Data Systems
CodeIntermediateNew
Plan Warehouse Pick Routes with a Classical Planner
You receive a stylized warehouse map (aisle graph), 30 sample shifts of pick tasks, and the current heuristic's outputs. Write a PDDL domain + problem generator, solve with at l…
- Pddl Modeling
- State Space Search
- Classical Planning
Automated Planning
ResearchSeniorNew
Pre-Register and Run a Small Neural-Network Ablation Study
You will study how three architectural and regularization choices (depth: 2/4/8 hidden layers; activation: ReLU vs. GELU; weight decay: 0 / 1e-4 / 1e-3) affect a small MLP's tes…
- Neural Networks
- Regularization
- Experiment Design
Machine Learning
DesignIntermediateNew
Build a Feature Store for a Fintech Fraud Team
You will design a feature-store layer covering 12 representative fraud features (account-level, merchant-level, transaction-level), with both batch (Spark) and online (low-laten…
- Feature Stores
- Data Pipelines
- Spark
Machine Learning at Scale
AnalysisIntermediateNew
Imitation Learning from Human Demos for a Drone Inspection
You receive 6 hours of expert pilot demonstrations (state-action pairs at 20 Hz) recorded in an AirSim wind-farm environment with 3 turbine designs, plus a held-out 4th turbine …
- Imitation Learning
- Behavioral Cloning
- Dagger
Deep Reinforcement Learning
ResearchIntermediateNew
Detect Coordinated Inauthentic Behavior on a News-Sharing Network
You receive a 60-day sample of about 6 million posts mentioning a recent election, with account metadata (creation date, posting times, follower graph). Design and prototype a C…
- Network Analysis
- Anomaly Detection
- Near Duplicate Detection
Social Network Analysis and Web Science
CodeIntermediateNew
Fuse LiDAR and Camera for an Autonomous Yard Truck
You receive 6 hours of synced LiDAR + 4-camera ring data from yard operations, with 3D bounding-box labels for pedestrians, forklifts, and containers. Build a late-fusion module…
- Sensor Fusion
- Lidar Perception
- Object Detection
Robot Perception and Autonomy
DesignSeniorNew
Design Eval Suite for a Multimodal Brainstorming Assistant
You receive (1) the assistant's current API, (2) a list of 6 launch user-personas, and (3) the product team's quality target ('beat the previous model on 4 of 6 personas'). Desi…
- LLM Evaluation
- Multimodal Evaluation
- Safety Evaluation
Generative AI
CodeIntermediateNew
Diagnose Equipment Failures with a Bayesian Network
You receive 90 days of sensor logs (vibration, spindle temperature, coolant flow, ambient humidity), the maintenance log of 180 failure events labeled by root cause, and a short…
- Bayesian Networks
- Probabilistic Inference
- Parameter Learning
Probabilistic Graphical Models
AnalysisIntermediateNew
Benchmark Visual SLAM Stacks for an Indoor Delivery Robot
You receive 8 indoor rosbag recordings (about 90 minutes total) captured by the robot's stereo camera + Inertial Measurement Unit (IMU) plus ground-truth trajectories from an ex…
- Visual Slam
- Sensor Fusion
- Trajectory Evaluation
Robot Perception and Autonomy
CodeSeniorNew
Plan Under Uncertainty for a Warehouse Restocking Robot
You receive a discrete-event simulator of a 1,200-shelf warehouse with calibrated optical-scanning error rates and stock-out cost per shelf. Formulate the restocking decision as…
- Planning Under Uncertainty
- Pomdp
- Monte Carlo Planning
Advanced Robotics

How it works

From brief to credential, in six steps.

Step 01
Browse challenges aligned to your studies.
Step 02
Accept the one that fits your goals.
Step 03
Work through it with AI Copilot guidance.
Step 04
Submit for structured evaluation.
Step 05
Earn a verified credential.
Step 06
Add it to LinkedIn with one click.

Related fields

Industry teams behind a decade of practitioner briefs

Hiring from this pool?

Sponsor a challenge and meet candidates through actual work.

Industry teams can shape briefs around the skills they hire for, then evaluate students on rubric-scored deliverables — not resumes.

Explore sponsorship

Data Science Projects & Challenges | Ewance