Computer & Information Sciences
Data Science Challenges
Real data-science projects and challenges on Ewance — clean messy datasets, build and evaluate models, and turn raw data into decisions the way a working data scientist does. Solve them to build a portfolio of verified, recruiter-checkable proof you can do the work — not just describe it.
Recommended challenges
- StrategyIntermediateNew
Migrating from Relational to NoSQL for a Social Media Startup
You are given a simplified relational schema of SnapVibe's current database (users, posts, comments, likes). Your task is to: (1) analyze the limitations of the relational model…
- Nosql
- Mongodb
- Data Migration
Database Systems - ResearchSeniorNew
Pretrain a Small Vision Transformer with Self-Supervised Learning
You receive 80,000 unlabeled 224x224 histology tiles plus 4,000 labeled tiles split into train/val/test. Pretrain a ViT-Small using a self-supervised method of your choice (DINO…
- Supervised Learning
- Vision Transformers
- Pytorch Or Tensorflow
Advanced Deep Learning - CodeIntermediateNew
Build a Speaker-Diarization Pipeline for a Legal-Tech Startup
You receive 20 hours of de-identified hearing audio with ground-truth speaker labels (4 speaker classes per hearing). Build a speaker-diarization pipeline (pyannote-audio or sim…
- Speaker Diarization
- Speech Recognition
- Pyannote
Speech Recognition and Spoken Language Processing - CodeSeniorNew
Multilingual RAG for a European Customer-Support Knowledge Base
You receive 6,000 documents in 4 languages (mix of FAQs, parts catalogs, repair procedures) plus 120 labeled queries (30 per language) with gold source documents. Build a multil…
- Multilingual RAG
- Cross Lingual Retrieval
- Multilingual Embeddings
Retrieval-Augmented Generation Practice your coursework on real scenarios.
Every challenge is shaped from real industry context — not generic exercises. The work mirrors what your degree prepares you for.
Why Ewance
- CodeIntermediateNew
Multi-View Pose Estimation for a Sports-Analytics Startup
Use the publicly-released SoccerNet or a synthetic 4-view dataset (you can render with Unity or use a provided one). Implement a 2D pose estimator per view (HRNet or YOLOv8-pose…
- Pose Estimation
- Multi View Geometry
- 3d Reconstruction
Computer Vision - CodeBeginnerNew
Optimizing Inventory for a Milan D2C Cosmetics Brand
You are provided with 12 months of daily sales data for 10 SKUs, including unit price, cost, lead time, and current inventory. Your task is to develop an Excel-based inventory o…
- Spreadsheet Modeling
- Vba Programming
- Demand Forecasting
Spreadsheet Modeling and VBA - DesignIntermediateNew
Design a Customer 360 Graph for a Cross-Border Fintech
You receive 500 sample customer records across CRM, payments core, and KYC systems, plus a 50-record entity-resolution benchmark (pairs labelled same/different). Design an OWL o…
- Customer 360
- Entity Resolution
- Owl Ontology
Knowledge Graphs and Semantic Web - CodeIntermediateNew
Build a Serverless ETL Pipeline for a Climate-Tech Sensor Fleet
Build the pipeline using managed services only (e.g., S3 + Lambda + EventBridge + Glue, or GCS + Cloud Functions + Cloud Scheduler + BigQuery external tables). Source the data f…
- Serverless Architecture
- Etl Pipelines
- Terraform
Cloud Computing for Data and ML - Browse challenges
Explore role
Marketing Analyst
Plan and measure campaigns that grow the business. Funnel analytics, attribution, segmentation, and the rigorous measurement that lets marketing defend its budget at the leadership table.
- ResearchIntermediateNew
Sim-to-Real Domain Randomization for a Mobile Robot
You receive an Isaac Sim navigation environment, a baseline trained policy, a 50-episode real-bench test set (recorded sensor streams + ground truth) for offline policy evaluati…
- Domain Randomization
- Sim To Real
- Robot Navigation
Robot Learning - CodeBeginnerNew
Plan Safe Paths for a Last-Mile Sidewalk Robot
You receive 4 hours of recorded sidewalk traversals with annotated pedestrian tracks, occupancy grids, and a map of the pilot neighborhood. Implement a sampling-based planner (R…
- Motion Planning
- Sampling Based Planning
- Cost Function Design
Robot Perception and Autonomy - StrategyBeginnerNew
Scope a Demand-Forecasting Model with Operations Stakeholders
You receive recorded interview transcripts (or summary notes) for the three personas, plus a sample of the historical sales data. Map each stakeholder's pain to candidate ML pro…
- Stakeholder Framing
- Ml Problem Scoping
- Metric Design
Machine Learning in Practice - ResearchIntermediateNew
Evaluate VAEs vs. Diffusion for Synthetic Tabular-Data Generation
You receive a real labeled dataset (around 18,000 anonymized patient records, 32 features, binary outcome) and the team's existing VAE baseline. Train a tabular diffusion model …
- Tabular Diffusion
- Vae
- Synthetic Data
Generative AI Build a verifiable portfolio.
Submissions become evidence. Reviewers with shipping experience score against a rubric; the result becomes a credential anyone can verify.
Why Ewance
- ResearchSeniorNew
Inductive Logic Programming for a Fraud-Rule Discovery Pilot
You receive a labeled fraud dataset (around 25,000 transactions, around 4% positive class), a feature schema (28 features including device, geo, behavioral history), and a basel…
- Inductive Logic Programming
- Symbolic Ai
- Rule Learning
Fuzzy Logic, Knowledge Representation, and Symbolic Reasoning - ResearchSeniorNew
Solve a POMDP for a Healthtech Diagnostic Pathway
You receive a simplified pathway: 5 possible underlying conditions, 8 possible diagnostic tests each with documented sensitivity and specificity, and an outcome payoff matrix fr…
- Pomdp Modeling
- Belief States
- Approximate Solvers
Decision Making Under Uncertainty - DesignIntermediateNew
Design Hybrid Search for an E-Commerce Product Catalog
You receive 80,000 anonymized product records (title, description, category, attributes) and a sample of 30,000 search log entries with click-through labels. Embed the catalog w…
- Hybrid Search
- Embedding Models
- Bm25
Vector Databases and Embeddings - DesignIntermediateNew
Stand Up a Feature Store for a Series-B Fintech
Pick one priority feature group (recommend the 25 transaction-history features used by the fraud model). Define the offline source-of-truth (likely Snowflake or BigQuery), the o…
- Feature Store
- Feature Engineering
- Airflow Dags
ML Engineering and Production ML - CodeSeniorNew
Train a 3D Object Detector for Highway Trucking
Use the nuScenes or Waymo Open Dataset (open access) as your training and evaluation source. Fine-tune a strong baseline (e.g., CenterPoint or BEVFusion) and define an evaluatio…
- Object Detection
- Perception
- Pytorch Or Tensorflow
AI for Autonomous Vehicles - CodeBeginnerNew
Build a Crawler-and-Topic Pipeline for Public-Sector Web Analytics
You will build a polite, robots.txt-respecting crawler that ingests about 30,000 new posts/week across the 80 forums into a normalized dataset. Apply a topic model (BERTopic, wi…
- Web Crawling
- Topic Modeling
- NLP Pipeline
Social Network Analysis and Web Science - ResearchSeniorNew
Quantify Sim-to-Real Gap for a Warehouse Manipulation Policy
You receive a trained pick-and-place policy (PyTorch), the simulation env (Isaac Lab), and access to a real-arm rig (or recorded teleop episodes if hardware is unavailable). Def…
- Sim To Real
- Manipulation
- Experimental Design
Robot Perception and Autonomy - CodeIntermediateNew
Train a Reward Model on Customer-Support Preferences
You receive 8,000 labeled preference pairs from real support conversations (each pair is two model responses with a human-chosen winner). Fine-tune a small open-weights base mod…
- Reward Modeling
- Preference Learning
- Bradley Terry Loss
Machine Learning from Human Preferences (RLHF and Alignment) - DesignIntermediateNew
Design a Lab-Automation Pipeline for a Bangalore Materials Startup
Design (not build) the full closed-loop lab pipeline: data layer (LIMS plus experiment store), model layer (a surrogate plus an acquisition function such as Expected Improvement…
- Systems Architecture
- Active Learning
- Mlops Design
AI for Science and Engineering - CodeBeginnerNew
Image Search for a DTC Furniture Retailer's App
Use a pretrained vision-embedding model (CLIP ViT-B/32 or DINOv2-small). Index a catalog of around 1,500 furniture images. Curate a small evaluation set of around 50 user-style …
- Image Embeddings
- Vision Transformers
- Image Search
Computer Vision (Undergraduate) - DesignBeginnerNew
Design a Model-Monitoring Dashboard for an MLOps Platform
You receive realistic 30-day metric streams for three production models (a classifier, a regressor, and a ranker) including prediction volume, latency p50/p95, accuracy or RMSE …
- Dashboard Design
- Visual Hierarchy
- Chart Selection
Data Visualization - AnalysisBeginnerNew
Refit a Pricing Model for an Insurance Comparison Site
You receive 9 months of quote-impression data (about 14 million events) with about 60 features and click labels. Refit logistic regression with elastic-net regularization plus a…
- Regularized Regression
- Feature Interactions
- Calibration
Statistical Machine Learning
How it works
From brief to credential, in six steps.
Step 01
Browse challenges aligned to your studies.
Step 02
Accept the one that fits your goals.
Step 03
Work through it with AI Copilot guidance.
Step 04
Submit for structured evaluation.
Step 05
Earn a verified credential.
Step 06
Add it to LinkedIn with one click.
Industry teams behind a decade of practitioner briefs
Hiring from this pool?
Sponsor a challenge and meet candidates through actual work.
Industry teams can shape briefs around the skills they hire for, then evaluate students on rubric-scored deliverables — not resumes.



















































































