AI & Data
Statistics & Data Science Methods Challenges
Statistics & Data Science Methods challenges put you inside the work of drawing trustworthy conclusions from data. You'll build Statistics Fundamentals and Statistical Analysis, run Exploratory Data Analysis, Hypothesis Testing, Confidence Intervals, and Linear Regression, and design clean Sampling Methods.
From there you'll handle the harder edges — Bayesian methods, Causal inference, A/B testing with statistical significance, Monte Carlo Simulation, and Uncertainty Quantification — applying Experimental design the way data scientists actually do. Each challenge you solve earns a verified credential you can share with recruiters.
Recommended Challenges
- All
- Data Analysis
- Experimental design
- Simulation
- Exploratory Data Analysis
- Statistical Analysis
- Uncertainty Quantification
- Logistic regression
- Cost Modeling
- Hypothesis Testing
- Monte Carlo Simulation
- A/B testing with statistical significance
- Linear Regression
- Time series basics
- Bayesian methods
- Causal inference
- Sampling Methods
- StrategyBeginnerNew
Pitch a Regulatory Sandbox Application for an Edtech AI Product
Read the EU AI Regulation's regulatory-sandbox provisions. Pick a member-state sandbox program (Spain, Norway-as-EEA, or a German-state pilot are publicly documented options) an…
- Regulatory Analysis
- Ai Governance Frameworks
- Product Strategy
AI Law, Policy, and Regulation - CodeBeginnerNew
Build a Crop-Disease Classifier for a Smallholder Agritech Startup
You receive a curated 22,000-image cassava-disease dataset across 5 classes (4 diseases + healthy) plus a labeled 1,200-image held-out test set. Train a CNN classifier (start wi…
- Cnn Classification
- Cnn Architectures
- Transfer Learning
Deep Learning for Computer Vision - AnalysisBeginnerNew
Mine Association Rules for a Grocery Retailer's Promo Strategy
You receive 6 months of basket-level transaction data (around 22 million baskets, around 18,000 SKUs) plus a category taxonomy. Run association-rule mining (Apriori or FP-Growth…
- Association Rules
- Market Basket Analysis
- Apriori
Data Mining and Knowledge Discovery - AnalysisIntermediateNew
Detect Coordinated Inauthentic Behavior on a National News Platform
Receive an anonymized 90-day export of comments (user_id, article_id, timestamp, like_count, reply_to_id) and basic user metadata (registration date, login pattern bucket). Buil…
- Network Science
- Community Detection
- Graph Analysis
Network Science and Computational Social Science Practice your coursework on real scenarios.
Every challenge is shaped from real industry context — not generic exercises. The work mirrors what your degree prepares you for.
Why Ewance
- ResearchSeniorNew
Long-Context QA Evaluation Benchmark for Legal Memoranda
You receive 25 anonymized legal memoranda (50-90 pages each) and 100 QA pairs whose answers are deliberately spread across the documents (25 in pages 1-20, 25 in pages 20-40, 25…
- Long Context Qa
- Benchmark Design
- Model Evaluation
Question Answering and Conversational Systems - ResearchSeniorNew
Stress-Test Scalable Oversight on a Tool-Using Agent
Design a sandwich-oversight study: pick a task domain where non-expert oversight is plausible but not trivial (e.g., reviewing data-analysis steps, checking small bug fixes, eva…
- Scalable Oversight
- Alignment Research
- Experimental Design
AI Safety and Alignment - AnalysisIntermediateNew
Optimizing Portfolio for a Swiss Fintech Startup
Your task is to design a diversified portfolio of up to 8 ETFs from a provided universe of 10, maximizing the Sharpe ratio. You must present a clear rationale for your asset sel…
- Portfolio Optimization
- Modern Portfolio Theory
- Sharpe Ratio
Investment Analysis and Portfolio Management - ResearchBeginnerNew
Drug-Repurposing Candidate Screen with Embedding Similarity
You receive (1) a list of 15 known therapeutic candidates (SMILES + ChEMBL identifiers) for a single rare disease, (2) a database of about 4,500 marketed drugs (SMILES + ATC cod…
- Molecular Embeddings
- Similarity Search
- Transfer Learning
Machine Learning for Healthcare and Biomedicine - Browse challenges
Explore role
Product Manager
Ship product that solves real user problems. Combine user research, prototyping, and stakeholder alignment to turn ambiguous briefs into measurable wins — the role at the centre of modern software teams.
- AnalysisIntermediateNew
Design a Custom Page-Replacement Policy for a Tier-1 Cloud Provider Simulator
Use the provided simulator (Python harness wrapping a C++ page-cache model) and the team's 3 anonymized workload traces (web-cache, key-value store, batch analytics). Implement …
- Memory Management
- Page Replacement
- Benchmarking
Operating Systems - AnalysisBeginnerNew
Customer Churn Prediction for 40-Person SaaS Scale-Up
You receive a dataset with 500 customers and 10 features (e.g., monthly logins, number of support tickets, contract length, industry). Your task is to perform exploratory analys…
- Logistic Regression
- Classification
- Feature Engineering
Econometrics - DesignIntermediateNew
Design a Real-Time Order Pipeline for a Fintech Payments Platform
You receive a synthetic Kafka stream of around 500 transactions per second, a static merchant dimension table (about 80,000 rows), and a daily FX rate snapshot. Design an end-to…
- Streaming Data
- Kafka Event Streaming
- Stream Processing
Data Engineering and Big Data Systems - ResearchSeniorNew
Quantify Sim-to-Real Gap for a Warehouse Manipulation Policy
You receive a trained pick-and-place policy (PyTorch), the simulation env (Isaac Lab), and access to a real-arm rig (or recorded teleop episodes if hardware is unavailable). Def…
- Sim To Real
- Manipulation
- Experimental Design
Robot Perception and Autonomy Build a verifiable portfolio.
Submissions become evidence. Reviewers with shipping experience score against a rubric; the result becomes a credential anyone can verify.
Why Ewance
- AnalysisBeginnerNew
Build a Reproducible Pricing Analysis for a DTC Skincare Brand
You receive 24 months of order-line data (around 480,000 lines), a Shopify-style customer export, and a discount-code log. Build a Python pipeline that produces: SKU-level price…
- Data Wrangling
- Exploratory Data Analysis
- Cohort Analysis
Applied Data Analysis and Practical Data Science - ResearchSeniorNew
Certify Robustness for a Medical-Imaging Classifier
You receive the classifier (a PyTorch ResNet variant) and a 4,000-image labeled validation slice. Apply randomized smoothing (Cohen et al.) at sigma in {0.25, 0.5, 1.0}. Report …
- Certified Robustness
- Randomized Smoothing
- Formal Verification
Trustworthy AI, Robustness, and Safety - AnalysisIntermediateNew
Cache Coherence Protocol Comparison on a Multicore Simulator
Stand up gem5's Ruby coherence framework with both MESI and MOESI protocols on a 16-core configuration. Run the 6-benchmark suite (provided): producer-consumer queue, false-shar…
- Cache Coherence
- Multicore Architecture
- Simulation
Advanced Computer Architecture - ResearchSeniorNew
Experimental Design for a Fintech App's Savings Nudge
You are a behavioral data scientist at SaveSmart. Design a randomized controlled trial (RCT) to test the effect of a 'future self' nudge on savings behavior. Define treatment an…
- Experimental Design
- Statistical Analysis
- Nudge Theory
Behavioral Economics - ResearchBeginnerNew
Run a Human-Preference Study Comparing Two Coding Assistants
Design a blinded paired-comparison study: 12 developer participants, each gets the same 8 realistic coding tasks (refactor, write a function, debug, test), each task is solved b…
- Experimental Design
- Statistical Evaluation
- Human Evaluation
AI Measurement and Evaluation - ResearchSeniorNew
Open-Vocabulary Segmentation Benchmark for a Robotics R&D Lab
Use a curated 200-image household scene set (publicly-available HM3D renderings or COCO + a handful of household prompts). Benchmark 3 open-vocabulary segmentation models: SAM +…
- Open Vocabulary Segmentation
- Vision Language Models
- Benchmarking
Computer Vision - ResearchSeniorNew
Plan a Parameter-Efficient Fine-Tuning Strategy for a Big-Tech AI Lab
You will produce (1) a 6-page survey of four PEFT methods (LoRA, adapters, prefix tuning, IA3) with their strengths, weaknesses, and parameter footprints, (2) a one-page decisio…
- Parameter Efficient Fine Tuning
- Transfer Learning
- Fine Tuning
Meta-Learning, Transfer Learning, and Multi-Task Learning - ResearchSeniorNew
Reproduce a Mechanistic Interpretability Result on a Small Transformer
Pick a published mechanistic-interpretability paper that operates on a small (under 1 billion parameter) open-source transformer (e.g., GPT-2 small, Pythia 70M). Set up the envi…
- Mechanistic Interpretability
- Transformer Internals
- Pytorch Or Tensorflow
AI Safety and Alignment - ResearchSeniorNew
Train Cooperative Agents with Multi-Agent RL
Pick an open multi-agent environment (PettingZoo's MPE 'simple_spread', Overcooked-AI, or SMAC). Implement or wrap three methods: IPPO (independent PPO per agent), MAPPO (centra…
- Multi Agent Reinforcement Learning
- Ppo
- Pytorch Or Tensorflow
Multi-Agent Systems - ResearchIntermediateNew
Map Knowledge Diffusion in a Global Open-Source Ecosystem
Pull (or use a provided anonymized export of) the last 24 months of commit + PR + issue data for 60 projects in the foundation's portfolio. Build a temporal contributor-project …
- Network Science
- Temporal Networks
- Graph Analysis
Network Science and Computational Social Science - AnalysisBeginnerNew
Build a Public Open-Data Dashboard for Urban Mobility
Pull the city's open-data cyclist-collision dataset (10 years of incidents, geocoded). Define a clear before/after window around the protected-lane rollout, control for traffic-…
- Exploratory Data Analysis
- Data Wrangling
- Geospatial Analysis
Applied Data Analysis and Practical Data Science - AnalysisIntermediateNew
Audit BLEU vs. COMET on a Multilingual Customer-Support Corpus
You receive 600 source-translation-reference triples covering 6 languages (EN as source; ES/FR/DE/JA/PT-BR/HI as targets), each scored on adequacy and fluency (1-6) by 3 profess…
- Mt Evaluation
- Neural Mt
- Statistical Analysis
Machine Translation
How it works
From brief to credential, in six steps.
Step 01
Browse challenges aligned to your studies.
Step 02
Accept the one that fits your goals.
Step 03
Work through it with AI Copilot guidance.
Step 04
Submit for structured evaluation.
Step 05
Earn a verified credential.
Step 06
Add it to LinkedIn with one click.
Industry teams behind a decade of practitioner briefs
Hiring from this pool?
Sponsor a challenge and meet candidates through actual work.
Industry teams can shape briefs around the skills they hire for, then evaluate students on rubric-scored deliverables — not resumes.



















































































