Computer & Information Sciences

Data Science Challenges

Real data-science projects and challenges on Ewance — clean messy datasets, build and evaluate models, and turn raw data into decisions the way a working data scientist does. Solve them to build a portfolio of verified, recruiter-checkable proof you can do the work — not just describe it.

Recommended challenges

CodeBeginnerNew
Build a Face-Anonymization Tool for a Civic-Tech Newsroom
Use a pretrained face detector (RetinaFace or YOLOv8-face is fine). Build a Python tool with a Gradio or Streamlit UI that: (1) detects faces in an uploaded photo, (2) shows det…
- Object Detection
- Image Processing
- Opencv
Computer Vision (Undergraduate)
AnalysisBeginnerNew
Optimizing Ad Spend for a D2C Cosmetics Brand
You are a data analyst at Glow & Grow. Using the provided dataset (simulated), perform an exploratory data analysis to understand trends in ad performance. Build a regression mo…
- Data Analysis
- Regression
- Data Visualization
Data Analytics for Business
CodeSeniorNew
Grounded Language for a Robotics Pick-and-Place Demo
Use a tabletop simulator (PyBullet or Isaac Sim, both open) with 5 object types and 5 spatial relations (left of, right of, behind, in front of, on top of). Curate or generate a…
- Grounded Language Understanding
- Semantic Parsing
- Perception
Computational Semantics
CodeIntermediateNew
Fine-Tune a Small Transformer for Legal-Domain EN-DE Translation
You receive a 120,000-segment parallel EN-DE legal corpus and a held-out 1,000-segment test set with reference translations. Fine-tune a small pretrained Transformer (e.g., NLLB…
- Neural Mt
- Transformer
- Fine Tuning
Machine Translation
Develop in-demand professional skills.
Each challenge names the skills it strengthens. Over time, your profile fills with the competences a hiring manager would actually look for.
Why Ewance
StrategyBeginnerNew
Plan a Self-Improving Sales-Research Agent
Build the v0 agent: given a company URL, it gathers 5 fact bullets (recent news, headcount range, tech stack hints, hiring patterns, a recent leadership change) and drafts a 4-l…
- LLM Agents
- Agent Design
- Experimentation
AI Agents and LLM-Based Agents
CodeIntermediateNew
Design a Force-Controlled Polishing Skill for a Watchmaker
You receive simulated polishing trajectories from the manufacturer's robot, force-sensor logs from 20 master-craftsman demonstrations, and a quality-rubric (mirror finish 1-5) f…
- Impedance Control
- Force Control
- Manipulation
Robotics
ResearchIntermediateNew
Reward Shaping for a Quadruped Locomotion Policy
You receive a quadruped locomotion environment (Isaac Lab or pybullet-quadruped), the previous reward function (5 terms), and a budget of 6 training runs. Design 4 reward varian…
- Reward Shaping
- Ppo
- Locomotion
Robot Learning
AnalysisBeginnerNew
Predict Equipment Failure for a Wind-Farm Operator
You receive 18 months of SCADA (Supervisory Control and Data Acquisition — the standard turbine telemetry feed) data sampled every 10 minutes from all 240 turbines, with labeled…
- Classification
- Regularized Regression
- Gradient Boosting
Statistical Machine Learning
Explore role
Pricing Strategist
Set the price that captures value without leaving sales on the table. Demand modelling, willingness-to-pay research, and the disciplined experimentation that turns pricing into a competitive advantage.
Browse challenges
DesignBeginnerNew
Simulation-Based Capacity Planning for a 40-Person SaaS Scale-Up
Your task is to simulate Flowly's customer success operations over the next 12 months. Model the arrival of new enterprise customers, their onboarding and support requests, and …
- Simulation
- Capacity Planning
- Scenario Analysis
Business Analytics
CodeSeniorNew
Cost-Optimize a 24/7 LLM API Cluster
Profile the current usage (24-hour trace, per-team breakdown). Pick a cost-optimization mix from: time-based autoscaling, spot/preemptible instances with graceful drain, smarter…
- LLM Serving
- Autoscaling
- Ray
ML Engineering and Production ML
CodeIntermediateNew
Detect Change Points in a Trading Platform's Latency Telemetry
You receive 90 days of per-millisecond latency telemetry across 12 services, plus an incident log of 14 known regressions and 22 known false-alarm-class events. Implement and tu…
- Change Point Detection
- Anomaly Detection
- Time Series Analysis
Time Series Analysis and Forecasting
CodeSeniorNew
Coordinate a Fleet of Warehouse Robots
Implement a simulated warehouse grid with 80 robots solving a pick-and-deliver workload. Design a decentralized coordination protocol (recommend a contract-net or auction-based …
- Multi Agent Coordination
- Decentralized Algorithms
- Simulation
Multi-Agent Systems
Get recognized by recruiters and employers.
Credentials are blockchain-anchored via LearnCoin — tamper-evident, portable, link-shareable on LinkedIn and beyond.
Why Ewance
DesignFoundationalNew
Stakeholder Workshop on AI Risk for a Public-Sector Pilot
You receive a description of the pilot (24/7 LLM chatbot answering questions on municipal services in Spanish and Catalan). Design a 3-hour workshop for around 25 mixed-stakehol…
- Stakeholder Engagement
- Workshop Design
- Responsible Ai
AI Ethics, Fairness, and Responsible AI
AnalysisIntermediateNew
Sales Forecasting for a SaaS Scale-Up Moving Upmarket
You are given historical MRR data, enterprise pipeline data (deal size, stage, probability), and churn rates. Your task is to build a dynamic Excel model that forecasts MRR usin…
- Excel Modeling
- Vba Programming
- Monte Carlo Simulation
Spreadsheet Modeling and VBA
AnalysisIntermediateNew
Chest-X-Ray Deployment Audit Across Hospital Sites
You receive (1) a vendor-supplied multi-label chest-X-ray classifier, (2) the current single-site held-out evaluation set, (3) a 12,000-image multi-site evaluation set with 14-f…
- Medical Imaging
- Classification
- Model Evaluation
Machine Learning for Imaging and Medical Image Analysis
AnalysisBeginnerNew
Cost-Optimize an Embedding Pipeline for a Customer Support Knowledge Base
You receive: (a) the current pipeline (full re-embed on any article change, OpenAI text-embedding-3-large, 3,072 dims) with one month of cost logs, (b) a sample of 5,000 article…
- Embedding Models
- Cost Optimization
- Change Detection
Vector Databases and Embeddings
StrategyIntermediateNew
Design a Compliance Strategy for an AI Robo-Advisor in the EU
Anchor the work on the published EU AI Regulation risk classification (limited vs. high-risk systems) and the European Securities and Markets Authority guidelines on robo-advice…
- Ai Governance
- Regulatory Analysis
- Product Strategy
AI and Quantitative Finance
CodeBeginnerNew
Plan Safe Paths for a Last-Mile Sidewalk Robot
You receive 4 hours of recorded sidewalk traversals with annotated pedestrian tracks, occupancy grids, and a map of the pilot neighborhood. Implement a sampling-based planner (R…
- Motion Planning
- Sampling Based Planning
- Cost Function Design
Robot Perception and Autonomy
AnalysisBeginnerNew
Audit a Hiring-Screening Model for Demographic Bias
You receive: (a) inference API access to the production model (black-box), (b) a 12,000-resume audit benchmark with self-declared gender and age-band labels (consented, GDPR-com…
- Fairness Metrics
- Bias Auditing
- Model Evaluation
AI Ethics, Fairness, and Responsible AI
ResearchIntermediateNew
Evaluate a Knowledge-Graph-Augmented Recommender
You receive permission to use the public MovieLens 1M dataset plus a derived item-KG (movie -> genre, director, decade) built from Wikidata. Train two recommenders: a matrix-fac…
- Knowledge Graph Embeddings
- Recommender Systems
- Benchmarking
Knowledge Graphs and Semantic Web
AnalysisIntermediateNew
Compare ML Compiler Stacks on a Vision Backbone
Take a frozen ResNet-50 (or similar) in ONNX. Compile and benchmark it via TensorRT on Jetson + GPU, ONNX Runtime on all three, OpenVINO on x86 CPU, and IREE on ARM if time allo…
- Ml Compilers
- Tensorrt
- Onnx
Machine Learning Systems
CodeIntermediateNew
Segment Cells from Microscopy Images for a Pharma-AI Discovery Lab
You receive 3,500 microscopy images with pixel-level cell masks plus a 200-image hold-out set re-annotated by two biologists for inter-annotator agreement. Train a U-Net or SegF…
- Semantic Segmentation
- U Net
- Pytorch
Deep Learning for Computer Vision
CodeBeginnerNew
Build a Product Knowledge Graph for a Fast-Fashion Retailer
You receive 200 sample SKUs across 4 markets (Spain, Germany, Japan, Brazil) as CSVs with country-specific attribute names. Design an OWL ontology with shared classes for Produc…
- Knowledge Graphs
- Owl Ontology
- Rdf
Knowledge Graphs and Semantic Web
CodeIntermediateNew
Train a Reward Model on Customer-Support Preferences
You receive 8,000 labeled preference pairs from real support conversations (each pair is two model responses with a human-chosen winner). Fine-tune a small open-weights base mod…
- Reward Modeling
- Preference Learning
- Bradley Terry Loss
Machine Learning from Human Preferences (RLHF and Alignment)

How it works

From brief to credential, in six steps.

Step 01
Browse challenges aligned to your studies.
Step 02
Accept the one that fits your goals.
Step 03
Work through it with AI Copilot guidance.
Step 04
Submit for structured evaluation.
Step 05
Earn a verified credential.
Step 06
Add it to LinkedIn with one click.

Related fields

Industry teams behind a decade of practitioner briefs

Hiring from this pool?

Sponsor a challenge and meet candidates through actual work.

Industry teams can shape briefs around the skills they hire for, then evaluate students on rubric-scored deliverables — not resumes.

Explore sponsorship