Data Sciences Challenges
Explore data science challenges on Ewance to build skills employers expect from analysts and ML engineers. Work through challenges on data cleaning, exploratory analysis, modeling, and storytelling with data.
Most Popular
- CodeSeniorNew
Detect Use-After-Free with Pointer Analysis in a C++ Codebase
Build the codebase with compilation database (compile_commands.json). Write a custom Clang static-analyzer checker (or extend an existing one) that flags resource-handle uses af…
- Pointer Analysis
- Static Analysis
- Use After Free
Program Analysis - CodeSeniorNew
Abstract-Interpretation-Based Configuration Validator for a SaaS Platform
Define an abstract domain capturing relations between configuration fields (e.g. 'if billing.plan = enterprise then features.audit_log = true'). Implement an abstract-interpreta…
- Abstract Interpretation
- Static Analysis
- Rust Programming
Program Analysis - CodeSeniorNew
HyperLogLog Cardinality for Real-Time Ad-Auction Analytics
Implement HyperLogLog with precision parameter p in {12, 14, 16} (4KB, 16KB, 64KB sketches) and benchmark relative error on a replayed 3-hour production trace (around 13 billion…
- Hyperloglog
- Probabilistic Data Structures
- Streaming Systems
Randomized Algorithms - CodeSeniorNew
PDE Solver for Subsurface Reservoir Flow
Implement MPFA-O discretization for pressure on a tetrahedral mesh with explicit fault transmissibility (Aavatsmark et al. 2002 formulation). Couple to a temperature equation vi…
- Numerical Pdes
- Finite Volume
- Newton Krylov
Scientific Computing and Numerical Methods Develop in-demand professional skills.
Each challenge names the skills it strengthens. Over time, your profile fills with the competences a hiring manager would actually look for.
Why Ewance
- ResearchSeniorNew
Benchmark Long-Context Architectures on a Legal-Doc Retrieval Task
You receive a public legal-QA dataset (e.g., LongBench's legal split or similar) filtered to documents over 50,000 tokens. Implement or wrap 3 architectures: a sliding-window Tr…
- Long Context Architectures
- State Space Models
- Hugging Face Transformers
Advanced Deep Learning - ResearchSeniorNew
Investigate Why Our Generative Model Memorizes Training Data
Pick a small open-source diffusion model (e.g., a Stable-Diffusion-class community model trained on LAION-subset). Reproduce a published membership-inference + extraction probe …
- Generative Models
- Memorization Analysis
- Differential Privacy
Advanced Deep Learning - CodeSeniorNew
Train a Manipulation Policy for Bin Picking with Imitation Learning
You receive a dataset of 500 teleop trajectories on the in-distribution part plus a held-out simulation environment with a never-seen part. Train an imitation-learning policy (D…
- Imitation Learning
- Manipulation
- Diffusion Policy
Advanced Robotics - CodeSeniorNew
Plan Under Uncertainty for a Warehouse Restocking Robot
You receive a discrete-event simulator of a 1,200-shelf warehouse with calibrated optical-scanning error rates and stock-out cost per shelf. Formulate the restocking decision as…
- Planning Under Uncertainty
- Pomdp
- Monte Carlo Planning
Advanced Robotics - Browse challenges
Explore role
Strategy Analyst
Frame the business question, model the options, build the recommendation. From market sizing to competitive analysis, this role is where strategy consulting meets in-house decision-making.
- ResearchSeniorNew
Validate a Foundation Model for Protein-Ligand Docking Acceleration
Pick 20 publicly available protein-ligand complexes from the PDBbind dataset (or similar public source). Use a published open-source structural foundation model (e.g., a Boltz-s…
- Foundation Model Evaluation
- Structural Biology
- Model Validation
AI for Science and Engineering - AnalysisSeniorNew
Write a Copyright Risk Memo for a Foundation-Model Lab's Training Set
Cover (1) US fair-use exposure for training on web-scraped text and code, including the current state of pending major lawsuits at the time of writing; (2) the EU TDM exceptions…
- Copyright Law
- Regulatory Analysis
- Risk Mapping
AI Law, Policy, and Regulation - StrategySeniorNew
Run a Mock Algorithmic-Discrimination Investigation for a Hiring-Tech Vendor
As a 3-person team, design and execute a 3-week mock inquiry. Produce: (1) the demand letter you imagine the regulator sending (scope, legal basis, data requested); (2) the vend…
- Regulatory Analysis
- Algorithmic Fairness
- Ai Governance Frameworks
AI Law, Policy, and Regulation - ResearchSeniorNew
Reproduce a Mechanistic Interpretability Result on a Small Transformer
Pick a published mechanistic-interpretability paper that operates on a small (under 1 billion parameter) open-source transformer (e.g., GPT-2 small, Pythia 70M). Set up the envi…
- Mechanistic Interpretability
- Transformer Internals
- Pytorch Or Tensorflow
AI Safety and Alignment Get recognized by recruiters and employers.
Credentials are blockchain-anchored via LearnCoin — tamper-evident, portable, link-shareable on LinkedIn and beyond.
Why Ewance
- ResearchSeniorNew
Stress-Test Scalable Oversight on a Tool-Using Agent
Design a sandwich-oversight study: pick a task domain where non-expert oversight is plausible but not trivial (e.g., reviewing data-analysis steps, checking small bug fixes, eva…
- Scalable Oversight
- Alignment Research
- Experimental Design
AI Safety and Alignment - CodeSeniorNew
Build an MLOps Platform Slice for a Fintech Risk Team
Across a 5-person team, ship (1) experiment tracking integrated into a sample model training job; (2) a model registry that promotes-by-tag; (3) a training pipeline orchestrated…
- Mlops Design
- Experiment Tracking
- Model Registry
AI Software Engineering Group Project - CodeSeniorNew
Stand Up a Data Platform for a Mobility-Data Startup's First ML Model
As a 4-person team, build (1) a streaming ingestion path from a simulated telemetry source (Kafka + Python producer is fine); (2) a batch ETL job into a small warehouse (DuckDB …
- Data Engineering
- Streaming Ingestion
- Feature Store
AI Software Engineering Group Project - ResearchSeniorNew
SAT-Based Planner for Smart-Grid Demand Response
Encode the dispatch problem (which customers to curtail by how much, respecting per-customer contractual caps and grid-cell totals) as a SAT or MaxSAT instance. Solve 50 histori…
- Sat Based Planning
- Constraint Encoding
- Benchmarking
Automated Planning - CodeSeniorNew
Train a 3D Object Detector for Highway Trucking
Use the nuScenes or Waymo Open Dataset (open access) as your training and evaluation source. Fine-tune a strong baseline (e.g., CenterPoint or BEVFusion) and define an evaluatio…
- Object Detection
- Perception
- Pytorch Or Tensorflow
AI for Autonomous Vehicles - ResearchSeniorNew
Trajectory Prediction Model for Urban Robotaxis
Use the Argoverse 2 motion-forecasting dataset (open access). Train an LSTM baseline + a transformer challenger (e.g., a small Wayformer or HiVT). Evaluate on minADE/minFDE (min…
- Trajectory Prediction
- Transformer Models
- Evaluation
AI for Autonomous Vehicles - DesignSeniorNew
Multi-Region Failover for an Enterprise RAG Service
Design and prototype: (1) a primary-region deployment of the RAG service (vector DB + LLM inference + retrieval API), (2) a passive secondary region with replicated vector store…
- Multi Region Architecture
- Disaster Recovery
- Terraform
Cloud Computing for Data and ML - CodeSeniorNew
Grounded Language for a Robotics Pick-and-Place Demo
Use a tabletop simulator (PyBullet or Isaac Sim, both open) with 5 object types and 5 spatial relations (left of, right of, behind, in front of, on top of). Curate or generate a…
- Grounded Language Understanding
- Semantic Parsing
- Perception
Computational Semantics - CodeSeniorNew
Video Action Recognition for a Retail Loss-Prevention Startup
Use a public action-recognition dataset (UCF101 + a small curated retail-action subset; the latter is provided synthetic or you can label 50 short clips). Fine-tune a small back…
- Video Understanding
- Action Recognition
- Transfer Learning
Computer Vision - ResearchSeniorNew
Solve a POMDP for a Healthtech Diagnostic Pathway
You receive a simplified pathway: 5 possible underlying conditions, 8 possible diagnostic tests each with documented sensitivity and specificity, and an outcome payoff matrix fr…
- Pomdp Modeling
- Belief States
- Approximate Solvers
Decision Making Under Uncertainty - ResearchSeniorNew
Implement an Autoregressive Model for Anonymized Voice-Synthesis at a Defense Vendor
You receive a public-domain speech dataset (LibriTTS subset, around 50 speakers) and a fixed evaluation protocol (speaker-identifiability AUC, emotion-preservation MOS proxy, in…
- Autoregressive Models
- Voice Conversion
- Speech Synthesis
Deep Generative Models - ResearchSeniorNew
Concept-Activation Vectors for an Autonomous-Vehicle Perception Audit
You receive a trained semantic-segmentation model (8 classes including pedestrian, vehicle, road, sky), an internal validation set of 2,500 driving frames, and a small concept-i…
- Tcav
- Concept Explanations
- Interpretability
Explainable and Interpretable AI
How it works
From brief to credential, in six steps.
Step 01
Browse challenges aligned to your studies.
Step 02
Accept the one that fits your goals.
Step 03
Work through it with AI Copilot guidance.
Step 04
Submit for structured evaluation.
Step 05
Earn a verified credential.
Step 06
Add it to LinkedIn with one click.
Industry teams behind a decade of practitioner briefs
Hiring from this pool?
Sponsor a challenge and meet candidates through actual work.
Industry teams can shape briefs around the skills they hire for, then evaluate students on rubric-scored deliverables — not resumes.



















































































