Data Science Challenges

Real data-science projects and challenges on Ewance — clean messy datasets, build and evaluate models, and turn raw data into decisions the way a working data scientist does. Solve them to build a portfolio of verified, recruiter-checkable proof you can do the work — not just describe it.

Recommended challenges

Analysis | Beginner | New
Audit a Climate-Tech Sensor Dataset for Production Readiness
You receive 18 months of raw sensor readings from 1,200 sensors (about 800M rows), plus a sensor-metadata table (location, firmware version, deployment date). Profile the data f…
- Data Quality Audit
- Data Profiling
- Time Series Analysis
Applied Data Analysis and Practical Data Science
Code | Intermediate | New
Multi-Agent Research Assistant for Biotech Patent Review
You receive 20 historical patent applications with the firm's own prior-art memos as ground truth. Design and build a 3-agent system: (a) Searcher — issues queries to a patent-s…
- LLM Agents
- Multi-Agent Collaboration
- Agent Evaluation
AI Agents and LLM-Based Agents
Analysis | Beginner | New
Interpretable-by-Design GAM for an Insurer's Claims Triage
You receive an anonymized claims dataset (around 60,000 claims, target: log reserve), a feature schema (22 features), and an existing LightGBM baseline (held-out R^2 of 0.78). T…
- Generalized Additive Models
- EBM
- Interpretability
Explainable and Interpretable AI
Code | Beginner | New
Implement a Constraint Solver for a Lisbon Tourism Scheduler
Model the next-week schedule as a CSP: variables are (guide, day, slot) assignments; domains are available routes; constraints encode language requirements, max consecutive tour…
- Constraint Satisfaction
- Backtracking Search
- Python
Introduction to Artificial Intelligence
Practice your coursework on real scenarios.
Every challenge is shaped from real-world context — not generic exercises. The work mirrors what your degree prepares you for.
Why Ewance
Code | Intermediate | New
Constitutional AI Critique Loop for Hallucination Reduction
You receive the meal-planning prompts (60 test cases with dietary constraints), an unrevised baseline (single-pass instruction-tuned model), and an empty nutrition-constraint co…
- Constitutional AI
- Self-Critique
- Alignment Prompting
Machine Learning from Human Preferences (RLHF and Alignment)
Analysis | Intermediate | New
Audit BLEU vs. COMET on a Multilingual Customer-Support Corpus
You receive 600 source-translation-reference triples covering 6 languages (EN as source; ES/FR/DE/JA/PT-BR/HI as targets), each scored on adequacy and fluency (1-6) by 3 profess…
- MT Evaluation
- Neural MT
- Statistical Analysis
Machine Translation
Code | Foundational | New
Tune a Pick-and-Place Controller for a Cosmetics Co-Packer
You receive 4 hours of logged trajectories from the existing controller (joint positions, target poses, miss/success labels) and read/write access to the controller config (YAML…
- Motion Control
- Trajectory Tuning
- Robot Kinematics
Robotics
Research | Intermediate | New
Planning Under Uncertainty for a Last-Mile Delivery Fleet
Build a simulator of the 50-block area with stochastic travel times conditioned on weather and time-of-day. Implement value iteration (for a small state space), MCTS (Monte Carl…
- Planning Under Uncertainty
- Markov Decision Processes
- Monte Carlo Tree Search
Automated Planning
Code | Intermediate | New
Description-Logic Reasoner for Insurance-Policy Coverage Checks
You receive 50 representative coverage rules in plain English (from the current rule engine) and a sample of 1,000 anonymized claim cases with the current engine's outcomes (cov…
- Description Logics
- OWL
- Reasoning
Fuzzy Logic, Knowledge Representation, and Symbolic Reasoning
Code | Intermediate | New
Reproducible Patient-Cohort Analysis for a Pharma AI Vendor
You receive a written cohort definition (type-2 diabetes patients on metformin for at least 90 days, aged 40-70) and a target output: 12-month HbA1c change distribution plus a K…
- Reproducible Analysis
- Cohort Analysis
- Survival Analysis
Applied Data Analysis and Practical Data Science
Code | Intermediate | New
Train a Sequence Model for Wearable-Telemetry Sleep Staging at a Healthtech
You receive 220 nights of wearable telemetry from 60 subjects with PSG ground-truth labels. Train three sequence models: an LSTM baseline, a 1D-CNN+GRU hybrid, and a small trans…
- Sequence Models
- LSTM
- Transformers
Deep Learning
Code | Intermediate | New
Build an Evaluation Harness for an Internal LLM Assistant
You will design and implement an evaluation harness in Python that runs four test suites: (1) helpfulness (LLM-as-judge with rubric), (2) factual grounding (compare cited source…
- LLM Evaluation
- LLM-as-Judge
- Prompt Injection Testing
Large Language Models
Build a verifiable portfolio.
Submissions become evidence. Reviewers with shipping experience score against a rubric; the result becomes a credential anyone can verify.
Code | Senior | New
Triage Brain-CT Stroke Detector with Calibrated Uncertainty
You receive a curated public head-CT dataset (about 2,800 scans, slice-level labels for hemorrhagic stroke) and a held-out 600-scan hospital cohort. Train a 3D CNN or 2.5D slice…
- Medical Imaging
- Convolutional Neural Networks
- Uncertainty Quantification
Machine Learning for Imaging and Medical Image Analysis
Research | Senior | New
SAT-Based Planner for Smart-Grid Demand Response
Encode the dispatch problem (which customers to curtail by how much, respecting per-customer contractual caps and grid-cell totals) as a SAT or MaxSAT instance. Solve 50 histori…
- SAT-Based Planning
- Constraint Encoding
- Benchmarking
Automated Planning
Analysis | Beginner | New
Audit a Hiring-Screen Classifier for Fairness Across Cohorts
You receive the classifier as a black-box API and a synthetic-but-realistic dataset of 8,000 CVs with imputed demographic proxies (gender, age band, regional cluster) and labele…
- Fairness Evaluation
- Disparate Impact
- Audit Methodology
Trustworthy AI, Robustness, and Safety
Research | Intermediate | New
Train a Physics-Informed Neural Network for Heat Transfer in a Battery Pack
Solve the 2D unsteady heat-conduction equation on a square cell cross-section with a localized source and Dirichlet boundary conditions on the casing. Implement a baseline finit…
- Physics-Informed Neural Networks
- Partial Differential Equations
- PyTorch
AI for Science and Engineering
Code | Intermediate | New
Plan Warehouse Pick Routes with a Classical Planner
You receive a stylized warehouse map (aisle graph), 30 sample shifts of pick tasks, and the current heuristic's outputs. Write a PDDL domain + problem generator, solve with at l…
- PDDL Modeling
- State-Space Search
- Classical Planning
Automated Planning
Code | Intermediate | New
Diagnose Equipment Failures with a Bayesian Network
You receive 90 days of sensor logs (vibration, spindle temperature, coolant flow, ambient humidity), the maintenance log of 180 failure events labeled by root cause, and a short…
- Bayesian Networks
- Probabilistic Inference
- Parameter Learning
Probabilistic Graphical Models
Code | Intermediate | New
Build a Speaker-Diarization Pipeline for a Legal-Tech Startup
You receive 20 hours of de-identified hearing audio with ground-truth speaker labels (4 speaker classes per hearing). Build a speaker-diarization pipeline (pyannote-audio or sim…
- Speaker Diarization
- Speech Recognition
- Pyannote
Speech Recognition and Spoken Language Processing
Analysis | Senior | New
Write a Copyright Risk Memo for a Foundation-Model Lab's Training Set
Cover (1) US fair-use exposure for training on web-scraped text and code, including the current state of pending major lawsuits at the time of writing; (2) the EU TDM exceptions…
- Copyright Law
- Regulatory Analysis
- Risk Mapping
AI Law, Policy, and Regulation
Research | Intermediate | New
Red-Team an Image-Classification Pipeline for a Banking KYC Workflow
You receive the production image classifier as a black-box API plus a labeled validation set of 5,000 ID images. Run untargeted FGSM and PGD attacks (L_inf budget 4/255 and 8/25…
- Adversarial Attacks
- Robust Evaluation
- Red Teaming
Trustworthy AI, Robustness, and Safety
Research | Beginner | New
Run a Human-Preference Study Comparing Two Coding Assistants
Design a blinded paired-comparison study: 12 developer participants, each gets the same 8 realistic coding tasks (refactor, write a function, debug, test), each task is solved b…
- Experiment Design
- Statistical Evaluation
- Human Evaluation
AI Measurement and Evaluation
Analysis | Beginner | New
Explain a Credit-Risk Model with SHAP for a Fintech
You receive a trained XGBoost credit-risk model (binary default prediction), the training feature schema (38 features), and a held-out 10,000-sample test set with labels. Comput…
- SHAP
- Interpretability
- Fairness Analysis
Explainable and Interpretable AI
Design | Beginner | New
Scaling a Sydney D2C Cosmetics Startup's Data Pipeline
You are tasked with designing a cloud-based data pipeline for GlowUp. The pipeline must ingest real-time user events (page views, purchases, returns) from web and mobile apps, p…
- Cloud Computing
- Apache Spark
- NoSQL
Big Data and Cloud Technologies

How it works

From brief to credential, in six steps.

Step 01: Browse challenges aligned to your studies.
Step 02: Accept the one that fits your goals.
Step 03: Work through it with AI Copilot guidance.
Step 04: Submit for structured evaluation.
Step 05: Earn a verified credential.
Step 06: Add it to LinkedIn with one click.

Industry teams behind a decade of practitioner briefs

Hiring from this pool?

Sponsor a challenge and meet candidates through actual work.

Industry teams can shape briefs around the skills they hire for, then evaluate students on rubric-scored deliverables — not resumes.
