Computer & Information Sciences
Data Science Challenges
Real data-science projects and challenges on Ewance — clean messy datasets, build and evaluate models, and turn raw data into decisions the way a working data scientist does. Solve them to build a portfolio of verified, recruiter-checkable proof you can do the work — not just describe it.
Recommended challenges
- Code · Beginner · New
Simulated Annealing for Shift Scheduling at a Hospital
You receive 6 months of anonymized shift demand data, the nurse roster (skills, certifications, contracted hours), and the labor-law hard constraints. Encode the schedule as a 7…
- Simulated Annealing
- Metaheuristics
- Constraint Handling
Evolutionary Computation and Metaheuristic Search
- Research · Senior · New
Investigate Why Our Generative Model Memorizes Training Data
Pick a small open-source diffusion model (e.g., a Stable-Diffusion-class community model trained on LAION-subset). Reproduce a published membership-inference + extraction probe …
- Generative Models
- Memorization Analysis
- Differential Privacy
Advanced Deep Learning
- Analysis · Intermediate · New
Compare Stereo Depth Methods for a Drone Inspection Startup
You receive 500 calibrated stereo pairs from a turbine inspection plus sparse LiDAR ground truth on each pair. Implement (or wrap) three depth estimators: OpenCV Semi-Global Mat…
- Stereo Depth Estimation
- Multi-View Geometry
- Model Evaluation
3D Vision and Multi-View Geometry
- Strategy · Beginner · New
Spec a Voice Agent for an Airline's Disruption Support Line
You will produce a 6-page voice-agent product spec covering: (1) supported intents and out-of-scope handling, (2) handoff-to-human criteria, (3) latency and confidence threshold…
- Voice Agent Design
- Intent Design
- Metric Design
Speech Recognition and Spoken Language Processing
Practice your coursework on real scenarios.
Every challenge is shaped from real-world context — not generic exercises. The work mirrors what your degree prepares you for.
Why Ewance
- Analysis · Beginner · New
Community Detection on a Pharma Clinical-Trial Investigator Graph
You receive a pre-fetched dump of around 15,000 trials from a public registry covering oncology over the last 10 years and a mapping of trials to investigator names + institutio…
- Community Detection
- Louvain
- Leiden
Machine Learning on Graphs
- Research · Beginner · New
Run a Human-Preference Study Comparing Two Coding Assistants
Design a blinded paired-comparison study: 12 developer participants, each gets the same 8 realistic coding tasks (refactor, write a function, debug, test), each task is solved b…
- Experimental Design
- Statistical Evaluation
- Human Evaluation
AI Measurement and Evaluation
- Analysis · Intermediate · New
Detect Defects on a Production Line for a Tier-1 Auto Supplier
You receive 12,000 labelled grayscale part images (8,000 good, 4,000 defective across 6 defect types) at 2048x2048. Build a pipeline that does classical preprocessing (illuminat…
- Defect Detection
- CNN Classification
- Image Preprocessing
Image Processing and Computational Imaging
- Code · Intermediate · New
Multi-Agent Research Assistant for Biotech Patent Review
You receive 20 historical patent applications with the firm's own prior-art memos as ground truth. Design and build a 3-agent system: (a) Searcher — issues queries to a patent-s…
- AI Agents
- Multi-Agent Collaboration
- Agent Evaluation
AI Agents and LLM-Based Agents
- Browse challenges
Explore role
Product Manager
Ship product that solves real user problems. Combine user research, prototyping, and stakeholder alignment to turn ambiguous briefs into measurable wins — the role at the centre of modern software teams.
- Strategy · Senior · New
Run a Mock Algorithmic-Discrimination Investigation for a Hiring-Tech Vendor
As a 3-person team, design and execute a 3-week mock inquiry. Produce: (1) the demand letter you imagine the regulator sending (scope, legal basis, data requested); (2) the vend…
- Regulatory Analysis
- Algorithmic Fairness
- AI Governance Frameworks
AI Law, Policy, and Regulation
- Design · Beginner · New
Design an Automated Essay-Feedback System
You receive 20 anonymized middle-school essays scored by 2 human teachers on a 4-dimension rubric (structure, evidence, voice, mechanics). Design an LLM-based feedback system th…
- Automated Assessment
- Rubric Design
- Prompt Patterns
AI in Education and Learning Analytics
- Research · Senior · New
Pretrain a Small Vision Transformer with Self-Supervised Learning
You receive 80,000 unlabeled 224x224 histology tiles plus 4,000 labeled tiles split into train/val/test. Pretrain a ViT-Small using a self-supervised method of your choice (DINO…
- Supervised Learning
- Vision Transformers
- PyTorch or TensorFlow
Advanced Deep Learning
- Design · Beginner · New
Build an Attention-Visualization Tool for Translation Quality Audit
You will load a small open-source EN-FR transformer (e.g., Helsinki-NLP Opus-MT-en-fr), build a Streamlit or Gradio demo that lets the user paste English source, see the French …
- Attention Mechanisms
- Neural MT
- Tool Design
Machine Translation
Build a verifiable portfolio.
Submissions become evidence. Reviewers with shipping experience score against a rubric; the result becomes a credential anyone can verify.
- Code · Senior · New
Train a Manipulation Policy for Bin Picking with Imitation Learning
You receive a dataset of 500 teleop trajectories on the in-distribution part plus a held-out simulation environment with a never-seen part. Train an imitation-learning policy (D…
- Imitation Learning
- Manipulation
- Diffusion Policy
Advanced Robotics
- Analysis · Intermediate · New
Cost-Optimize a Large-Scale Spark Job for an Ad-Tech Platform
You receive the Spark job source (PySpark), the EMR cluster config, and 5 nights of job-history JSON. Profile the job with the Spark UI + EMR metrics, identify the top 3 cost dr…
- Spark Optimization
- Cloud Services
- Cost Engineering
Cloud Computing for Data and ML
- Research · Intermediate · New
Audit a Public LLM Benchmark for Validity Threats
Choose one open LLM benchmark (e.g., MMLU, GPQA, BIG-Bench-Hard, MATH). Read the benchmark paper plus at least three follow-up critiques. Audit (1) data contamination risk again…
- Benchmark Evaluation
- Data Contamination Analysis
- Annotation Methodology
AI Measurement and Evaluation
- Code · Beginner · New
Build a Fairness Evaluation Harness for a Credit-Score Model
Implement a Python module that, given model predictions, ground truth, and group identifiers, computes demographic parity difference, equal-opportunity difference, predictive-pa…
- Algorithmic Fairness
- Statistical Evaluation
- Python or JavaScript
AI Measurement and Evaluation
- Code · Foundational · New
Tune a Pick-and-Place Controller for a Cosmetics Co-Packer
You receive 4 hours of logged trajectories from the existing controller (joint positions, target poses, miss/success labels) and read/write access to the controller config (YAML…
- Motion Control
- Trajectory Tuning
- Robot Kinematics
Robotics
- Code · Intermediate · New
Extract Skills and Roles from Job Postings for a Recruiter Tool
You receive 30,000 anonymized job postings and a labelled 1,000-posting benchmark with (skill, role, seniority) spans. Fine-tune a small token classifier (e.g., DeBERTa-v3-base)…
- Information Extraction
- Token Classification
- ESCO Taxonomy
Linguistic Engineering and Language Technologies
- Code · Intermediate · New
Build a Vector-Search Backend for an Enterprise AI Knowledge Assistant
You receive a corpus of around 20,000 PDFs (mixed scanned and digital) totalling around 30 GB and a labeled retrieval set of 200 queries with human-judged ground-truth passages.…
- RAG Architectures
- Vector Database Basics
- Word Embeddings
Data Engineering and Big Data Systems
- Code · Intermediate · New
Detect Coordinated Fraud Rings via Link Analysis at a Neobank
You receive 90 days of account, login, and transaction data (around 1.2 million accounts, around 30 million events) plus a labeled set of 80 known fraud rings. Build a multi-rel…
- Graph Analysis
- Community Detection
- Link Analysis
Data Mining and Knowledge Discovery
- Code · Beginner · New
Stack Five Models for a Kaggle-Style Forecasting Bake-Off
You receive a pseudonymized dataset of 24 months of daily shipment volumes across about 200 origin-destination lanes plus weather and holiday features. Train 5 base models, use …
- Ensemble Methods
- Time Series Forecasting
- Feature Engineering
Advanced Machine Learning
- Analysis · Intermediate · New
A/B Testing for a 40-Person SaaS Scale-up Moving to Enterprise
You are a data analyst at TaskFlow. You are given the raw A/B test data (visitor logs, conversions, and downstream sales data). Your task is to perform a rigorous analysis: chec…
- A/B Testing
- Statistical Analysis
- Bayesian Methods
Data Science for Business
- Code · Intermediate · New
Run a Monte Carlo Tree Search Strategy for a Robotics Pick-and-Place Task
You receive a simulator of the pick-and-place task: a bin with 10 randomly-placed parts, an action space of which part to pick next, and a reward = parts picked per minute with …
- Monte Carlo Tree Search
- Planning
- Simulation
Decision Making Under Uncertainty
- Research · Intermediate · New
Compare Kernel Methods to Trees on a Genomics Classification Task
You receive a curated benchmark of about 12,000 labeled variants with ~120 numerical + ~40 string features. Fit kernel SVMs (RBF, polynomial, string), random forest, and XGBoost…
- Kernel Methods
- SVM
- Tree Ensembles
Statistical Machine Learning
How it works
From brief to credential, in six steps.
Step 01
Browse challenges aligned to your studies.
Step 02
Accept the one that fits your goals.
Step 03
Work through it with AI Copilot guidance.
Step 04
Submit for structured evaluation.
Step 05
Earn a verified credential.
Step 06
Add it to LinkedIn with one click.
Industry teams behind a decade of practitioner briefs
Hiring from this pool?
Sponsor a challenge and meet candidates through actual work.
Industry teams can shape briefs around the skills they hire for, then evaluate students on rubric-scored deliverables — not resumes.