Data Sciences Challenges

Explore data science challenges on Ewance to build skills employers expect from analysts and ML engineers. Work through challenges on data cleaning, exploratory analysis, modeling, and storytelling with data.

Explore Categories

All Business Computer Sciences Data Sciences Health Information Technology Social Sciences

Most Popular

All Foundational Beginner Intermediate Senior

DesignSeniorNew
Design a Lock-Free Concurrent Skip List for a Time-Series Database
Using the baseline implementation and benchmark harness in the provided code module (baseline-skiplist-rs), design and build a lock-free skip list that replaces the mutex-guarde…
- Lock Free
- Concurrent Data Structures
- Rust Programming
Open coursework
CodeSeniorNew
Bulk Synchronous Parallel PageRank for a Web-Graph Lab
Working from the lab situation record (`lab-case-file`), implement 25 iterations of PageRank under a Bulk Synchronous Parallel (BSP) model using either Apache Spark with GraphX …
- Parallel Algorithms
- Bsp
- Graph Algorithms
Open coursework
CodeSeniorNew
Assess Secure Cross-Bank Fraud Computation for a Four-Bank Consortium
Using the de-identified situation in the consortium case file and the two synthetic per-bank datasets (the bank suspicious-account lists and the merchant fraud-marker tallies), …
- Secure Computation
- Mpc
- Cryptography
Open coursework
CodeSeniorNew
Detect Use-After-Free with Pointer Analysis in a C++ Codebase
Build the codebase with compilation database (compile_commands.json). Write a custom Clang static-analyzer checker (or extend an existing one) that flags resource-handle uses af…
- Pointer Analysis
- Static Analysis
- Use After Free
Program Analysis
Develop in-demand professional skills.
Each challenge names the skills it strengthens. Over time, your profile fills with the competences a hiring manager would actually look for.
Why Ewance
DesignSeniorNew
Design a Semantic Configuration Validator Using Abstract Interpretation
Using the configuration schema reference, the three incident post-mortems, and the 200-configuration validation set provided, design an abstract domain that captures relations b…
- Abstract Interpretation
- Static Analysis
- Rust Programming
Open coursework
CodeSeniorNew
HyperLogLog Cardinality for Real-Time Ad-Auction Analytics
Implement HyperLogLog with precision parameter p in {12, 14, 16} (4KB, 16KB, 64KB sketches) and benchmark relative error on a replayed 3-hour production trace (around 13 billion…
- Hyperloglog
- Probabilistic Data Structures
- Streaming Systems
Randomized Algorithms
CodeSeniorNew
PDE Solver for Subsurface Reservoir Flow
Implement MPFA-O discretization for pressure on a tetrahedral mesh with explicit fault transmissibility (Aavatsmark et al. 2002 formulation). Couple to a temperature equation vi…
- Numerical Pdes
- Finite Volume
- Newton Krylov
Scientific Computing and Numerical Methods
ResearchSeniorNew
Benchmark Long-Context Architectures on a Legal-Doc Retrieval Task
You receive a public legal-QA dataset (e.g., LongBench's legal split or similar) filtered to documents over 50,000 tokens. Implement or wrap 3 architectures: a sliding-window Tr…
- Long Context Architectures
- State Space Models
- Transformers
Advanced Deep Learning
Explore role
Strategy Analyst
Frame the business question, model the options, build the recommendation. From market sizing to competitive analysis, this role is where strategy consulting meets in-house decision-making.
Browse challenges
ResearchSeniorNew
Investigate Why Our Generative Model Memorizes Training Data
Pick a small open-source diffusion model (e.g., a Stable-Diffusion-class community model trained on LAION-subset). Reproduce a published membership-inference + extraction probe …
- Generative Models
- Memorization Analysis
- Differential Privacy
Advanced Deep Learning
CodeSeniorNew
Train a Manipulation Policy for Bin Picking with Imitation Learning
You receive a dataset of 500 teleop trajectories on the in-distribution part plus a held-out simulation environment with a never-seen part. Train an imitation-learning policy (D…
- Imitation Learning
- Manipulation
- Diffusion Policy
Advanced Robotics
CodeSeniorNew
Plan Under Uncertainty for a Warehouse Restocking Robot
You receive a discrete-event simulator of a 1,200-shelf warehouse with calibrated optical-scanning error rates and stock-out cost per shelf. Formulate the restocking decision as…
- Planning Under Uncertainty
- Pomdp
- Monte Carlo Planning
Advanced Robotics
ResearchSeniorNew
Validate a Foundation Model for Protein-Ligand Docking Acceleration
Pick 20 publicly available protein-ligand complexes from the PDBbind dataset (or similar public source). Use a published open-source structural foundation model (e.g., a Boltz-s…
- Foundation Model Evaluation
- Structural Biology
- Model Validation
AI for Science and Engineering
Get recognized by recruiters and employers.
Credentials are blockchain-anchored via LearnCoin — tamper-evident, portable, link-shareable on LinkedIn and beyond.
Why Ewance
AnalysisSeniorNew
Write a Copyright Risk Memo for a Foundation-Model Lab's Training Set
Cover (1) US fair-use exposure for training on web-scraped text and code, including the current state of pending major lawsuits at the time of writing; (2) the EU TDM exceptions…
- Copyright Law
- Regulatory Analysis
- Risk Mapping
AI Law, Policy, and Regulation
StrategySeniorNew
Run a Mock Algorithmic-Discrimination Investigation for a Hiring-Tech Vendor
As a 3-person team, design and execute a 3-week mock inquiry. Produce: (1) the demand letter you imagine the regulator sending (scope, legal basis, data requested); (2) the vend…
- Regulatory Analysis
- Algorithmic Fairness
- Ai Governance
AI Law, Policy, and Regulation
ResearchSeniorNew
Reproduce a Mechanistic Interpretability Result on a Small Transformer
Pick a published mechanistic-interpretability paper that operates on a small (under 1 billion parameter) open-source transformer (e.g., GPT-2 small, Pythia 70M). Set up the envi…
- Mechanistic Interpretability
- Transformer Internals
- Pytorch
AI Safety and Alignment
ResearchSeniorNew
Stress-Test Scalable Oversight on a Tool-Using Agent
Design a sandwich-oversight study: pick a task domain where non-expert oversight is plausible but not trivial (e.g., reviewing data-analysis steps, checking small bug fixes, eva…
- Scalable Oversight
- Alignment Research
- Experiment Design
AI Safety and Alignment
CodeSeniorNew
Build an MLOps Platform Slice for a Fintech Risk Team
Across a 5-person team, ship (1) experiment tracking integrated into a sample model training job; (2) a model registry that promotes-by-tag; (3) a training pipeline orchestrated…
- Mlops Design
- Experiment Tracking
- Model Registry
Open coursework
CodeSeniorNew
Stand Up a Data Platform for a Mobility-Data Startup's First ML Model
As a 4-person team, build (1) a streaming ingestion path from a simulated telemetry source (Kafka + Python producer is fine); (2) a batch ETL job into a small warehouse (DuckDB …
- Data Engineering
- Streaming Ingestion
- Feature Store
AI Software Engineering Group Project
ResearchSeniorNew
SAT-Based Planner for Smart-Grid Demand Response
Encode the dispatch problem (which customers to curtail by how much, respecting per-customer contractual caps and grid-cell totals) as a SAT or MaxSAT instance. Solve 50 histori…
- Sat Based Planning
- Constraint Encoding
- Benchmarking
Automated Planning
CodeSeniorNew
Train a 3D Object Detector for Highway Trucking
Use the nuScenes or Waymo Open Dataset (open access) as your training and evaluation source. Fine-tune a strong baseline (e.g., CenterPoint or BEVFusion) and define an evaluatio…
- 3d Object Detection
- Perception
- Pytorch
AI for Autonomous Vehicles
ResearchSeniorNew
Trajectory Prediction Model for Urban Robotaxis
Use the Argoverse 2 motion-forecasting dataset (open access). Train an LSTM baseline + a transformer challenger (e.g., a small Wayformer or HiVT). Evaluate on minADE/minFDE (min…
- Trajectory Prediction
- Transformer Models
- Evaluation
AI for Autonomous Vehicles
DesignSeniorNew
Multi-Region Failover for an Enterprise RAG Service
Design and prototype: (1) a primary-region deployment of the RAG service (vector DB + LLM inference + retrieval API), (2) a passive secondary region with replicated vector store…
- Multi Region Architecture
- Disaster Recovery
- Infrastructure As Code
Cloud Computing for Data and ML
CodeSeniorNew
Grounded Language for a Robotics Pick-and-Place Demo
Use a tabletop simulator (PyBullet or Isaac Sim, both open) with 5 object types and 5 spatial relations (left of, right of, behind, in front of, on top of). Curate or generate a…
- Grounded Language Understanding
- Semantic Parsing
- Perception
Computational Semantics
CodeSeniorNew
Video Action Recognition for a Retail Loss-Prevention Startup
Use a public action-recognition dataset (UCF101 + a small curated retail-action subset; the latter is provided synthetic or you can label 50 short clips). Fine-tune a small back…
- Video Understanding
- Action Recognition
- Transfer Learning
Computer Vision

How it works

From brief to credential, in six steps.

Step 01
Browse challenges aligned to your studies.
Step 02
Accept the one that fits your goals.
Step 03
Work through it with AI Copilot guidance.
Step 04
Submit for structured evaluation.
Step 05
Earn a verified credential.
Step 06
Add it to LinkedIn with one click.

Industry teams behind a decade of practitioner briefs

Hiring from this pool?

Sponsor a challenge and meet candidates through actual work.

Industry teams can shape briefs around the skills they hire for, then evaluate students on rubric-scored deliverables — not resumes.

Explore sponsorship

Data Sciences Challenges | Ewance