Computer & Information Sciences

Data Science Challenges

Real data-science projects and challenges on Ewance — clean messy datasets, build and evaluate models, and turn raw data into decisions the way a working data scientist does. Solve them to build a portfolio of verified, recruiter-checkable proof you can do the work — not just describe it.

Recommended challenges

ResearchSeniorNew
Plan a Parameter-Efficient Fine-Tuning Strategy for a Big-Tech AI Lab
You will produce (1) a 6-page survey of four PEFT methods (LoRA, adapters, prefix tuning, IA3) with their strengths, weaknesses, and parameter footprints, (2) a one-page decisio…
- Parameter Efficient Fine Tuning
- Transfer Learning
- Fine Tuning
Meta-Learning, Transfer Learning, and Multi-Task Learning
ResearchBeginnerNew
Plan a Field Study for an Autonomous Sidewalk Delivery Robot
You will design a mixed-methods field study spanning two weeks of observation on a fixed route, intercept surveys with ~80 pedestrians, and 8 short interviews with neighborhood …
- Field Study Design
- Human Robot Interaction
- Research Ethics
Human-Robot Interaction
CodeBeginnerNew
Build a Crawler-and-Topic Pipeline for Public-Sector Web Analytics
You will build a polite, robots.txt-respecting crawler that ingests about 30,000 new posts/week across the 80 forums into a normalized dataset. Apply a topic model (BERTopic, wi…
- Web Crawling
- Topic Modeling
- NLP Pipeline
Social Network Analysis and Web Science
StrategyBeginnerNew
Spec a Voice Agent for an Airline's Disruption Support Line
You will produce a 6-page voice-agent product spec covering: (1) supported intents and out-of-scope handling, (2) handoff-to-human criteria, (3) latency and confidence threshold…
- Voice Agent Design
- Intent Design
- Metric Design
Speech Recognition and Spoken Language Processing
Practice your coursework on real scenarios.
Every challenge is shaped from real-world context — not generic exercises. The work mirrors what your degree prepares you for.
Why Ewance
CodeIntermediateNew
Build a Vision-Language Search for an E-commerce Catalog
Pick a vision-language encoder (OpenCLIP, SigLIP, or BLIP-2 image-text variant). Index all 600k product images into a vector database (Qdrant/FAISS). Build a query-time pipeline…
- Vision Language Models
- Clip
- Vector Search
Multimodal Machine Learning
AnalysisBeginnerNew
Evaluate Speech-to-Text Quality for a Contact-Center Analytics Vendor
You receive 200 anonymized call-recording snippets (2-4 minutes each, ~67 per language) with reference transcripts plus a domain glossary of about 600 product terms. Run all thr…
- Speech Recognition
- Sequence Models
- Model Evaluation
Machine Perception
AnalysisBeginnerNew
Build a Restoration Workflow for a Digital Heritage Archive
You receive 50 high-resolution scans of glass plates plus 3 reference 'gold' restorations done by a senior conservator. Design a reproducible workflow combining inpainting for s…
- Image Restoration
- Inpainting
- Tone Mapping
Image Processing and Computational Imaging
ResearchSeniorNew
Solve a POMDP for a Healthtech Diagnostic Pathway
You receive a simplified pathway: 5 possible underlying conditions, 8 possible diagnostic tests each with documented sensitivity and specificity, and an outcome payoff matrix fr…
- Pomdp Modeling
- Belief States
- Approximate Solvers
Decision Making Under Uncertainty
Explore role
Marketing Analyst
Plan and measure campaigns that grow the business. Funnel analytics, attribution, segmentation, and the rigorous measurement that lets marketing defend its budget at the leadership table.
Browse challenges
ResearchIntermediateNew
Audit an Agentic Workflow for Safety Failures
Read the system's existing capability spec + tool-allow-list. Design 50+ adversarial inputs across categories: prompt-injection, tool-confusion, scope-escape (agent does somethi…
- Ai Red Teaming
- Agent Safety
- Prompt Injection
Multi-Agent Systems
CodeIntermediateNew
Use Actor-Critic to Auto-Tune a HVAC Control Policy
You receive a Sinergym wrapper around the EnergyPlus model of one floor with 8 thermal zones, weather data for one year, and occupancy schedules. Train a Soft Actor-Critic (SAC,…
- Actor Critic
- Soft Actor Critic
- Continuous Control
Deep Reinforcement Learning
CodeIntermediateNew
Prototype Constitutional-AI Style Guardrails for an Internal Chatbot
Author a 'constitution' of 15 to 20 principles tailored to internal research use (no IP leakage, no off-label medical claims, no personnel-data fishing, etc.). Implement a criti…
- Constitutional Ai
- Alignment Techniques
- LLM Evaluation
AI Safety and Alignment
DesignIntermediateNew
Prototype an Explainability Panel for a Fintech Credit Assistant
You receive: the model's top-10 SHAP-style feature contributions per customer (a feature-importance technique that breaks an ML prediction into per-input contributions), the cur…
- Explainability Design
- Human Ai Interaction
- Figma Prototyping
Human-Computer Interaction for AI Systems
Build a verifiable portfolio.
Submissions become evidence. Reviewers with shipping experience score against a rubric; the result becomes a credential anyone can verify.
Why Ewance
ResearchIntermediateNew
Neuro-Symbolic Question Answering on an Enterprise Knowledge Graph
You receive a curated Turtle-format knowledge graph (around 2 million triples covering organizational structure, products, projects), 200 labeled question-SPARQL pairs split 140…
- Neuro Symbolic
- Sparql
- Knowledge Graphs
Fuzzy Logic, Knowledge Representation, and Symbolic Reasoning
DesignIntermediateNew
Design Hybrid Search for an E-Commerce Product Catalog
You receive 80,000 anonymized product records (title, description, category, attributes) and a sample of 30,000 search log entries with click-through labels. Embed the catalog w…
- Hybrid Search
- Embedding Models
- Bm25
Vector Databases and Embeddings
ResearchIntermediateNew
Train a NeRF for Real-Estate Virtual Tours
You receive a curated dataset of 3 apartments, each with around 120 input images and known camera poses (already SfM-processed). Train a NeRF variant (Instant-NGP or Nerfacto re…
- Neural Scene Representation
- Nerf
- Pytorch
3D Vision and Multi-View Geometry
AnalysisBeginnerNew
Diagnose Query Failures in an E-Commerce Search Box
You receive 6 months of anonymized query logs (~480 million rows): query string, language hint, results-shown count, top-3 product clicks, and add-to-cart events. Build a notebo…
- Query Log Analysis
- Clustering
- Ir Failure Analysis
Information Retrieval and Search
DesignSeniorNew
Dynamic Pricing Optimization for a Ride-Hailing Platform
You are a data scientist at CityRide. Using 6 months of historical trip data (pickup/dropoff, time, fare, surge multiplier), weather data, and local events calendar, you must bu…
- Reinforcement Learning
- Optimization
- Simulation
Data Science for Business
ResearchSeniorNew
Investigate Why Our Generative Model Memorizes Training Data
Pick a small open-source diffusion model (e.g., a Stable-Diffusion-class community model trained on LAION-subset). Reproduce a published membership-inference + extraction probe …
- Generative Models
- Memorization Analysis
- Differential Privacy
Advanced Deep Learning
AnalysisIntermediateNew
Frame an Energy-Storage Dispatch Decision as a Bayesian Decision Problem
You receive 2 years of hourly spot-price data, 2 years of wind generation data, and a manufacturer's battery degradation model. Frame dispatch as a Bayesian decision problem: mo…
- Bayesian Decision Theory
- Price Modeling
- Back Testing
Decision Making Under Uncertainty
CodeBeginnerNew
Building a Customer Segmentation Tool for a SaaS Scale-up
You are provided with a JSON file containing user data: user_id, total_logins, days_since_last_login, features_used (count), subscription_tier (free/basic/premium). Your task is…
- Python
- Pandas
- Scikit Learn
Programming for Business Applications
CodeIntermediateNew
LLM-Powered FAQ Chatbot for 40-Person SaaS Scale-up
You have access to TaskFlow's internal documentation, help articles, and a sample of 500 support tickets. Your task is to build a retrieval-augmented generation (RAG) pipeline: …
- LLM
- RAG
- Information Retrieval
Text Analytics and Natural Language Processing
CodeIntermediateNew
Description-Logic Reasoner for Insurance-Policy Coverage Checks
You receive 50 representative coverage rules in plain English (from the current rule engine) and a sample of 1,000 anonymized claim cases with the current engine's outcomes (cov…
- Description Logics
- Owl
- Reasoning
Fuzzy Logic, Knowledge Representation, and Symbolic Reasoning
DesignBeginnerNew
Draft a Model Card for a Generative Image Product
You receive the model's training-data summary, evaluation metrics, intended-use statement, and known failure modes from the ML team. Write: (a) a 3-page plain-language model car…
- Model Cards
- Transparency Documentation
- Responsible Ai
AI Ethics, Fairness, and Responsible AI
CodeBeginnerNew
Optimizing Inventory for a Milan D2C Cosmetics Brand
You are provided with 12 months of daily sales data for 10 SKUs, including unit price, cost, lead time, and current inventory. Your task is to develop an Excel-based inventory o…
- Excel Modeling
- Vba Programming
- Demand Forecasting
Spreadsheet Modeling and VBA

How it works

From brief to credential, in six steps.

Step 01
Browse challenges aligned to your studies.
Step 02
Accept the one that fits your goals.
Step 03
Work through it with AI Copilot guidance.
Step 04
Submit for structured evaluation.
Step 05
Earn a verified credential.
Step 06
Add it to LinkedIn with one click.

Related fields

Industry teams behind a decade of practitioner briefs

Hiring from this pool?

Sponsor a challenge and meet candidates through actual work.

Industry teams can shape briefs around the skills they hire for, then evaluate students on rubric-scored deliverables — not resumes.

Explore sponsorship