Computer & Information Sciences
Data Science Challenges
Real data-science projects and challenges on Ewance — clean messy datasets, build and evaluate models, and turn raw data into decisions the way a working data scientist does. Solve them to build a portfolio of verified, recruiter-checkable proof you can do the work — not just describe it.
Recommended challenges
- CodeIntermediateNew
Build a Serverless ETL Pipeline for a Climate-Tech Sensor Fleet
Build the pipeline using managed services only (e.g., S3 + Lambda + EventBridge + Glue, or GCS + Cloud Functions + Cloud Scheduler + BigQuery external tables). Source the data f…
- Serverless Architecture
- Etl Pipelines
- Terraform
Cloud Computing for Data and ML - ResearchBeginnerNew
Curate a Domain Lexicon for a Climate-Tech NLP Stack
You receive 5,000 policy documents and a benchmark of 200 documents with manually tagged domain terms. Curate a lexicon of ~1,500 terms with (1) canonical English form, (2) Swah…
- Lexical Resources
- Named Entity Recognition
- Spacy
Linguistic Engineering and Language Technologies - ResearchSeniorNew
Benchmark Long-Context Architectures on a Legal-Doc Retrieval Task
You receive a public legal-QA dataset (e.g., LongBench's legal split or similar) filtered to documents over 50,000 tokens. Implement or wrap 3 architectures: a sliding-window Tr…
- Long Context Architectures
- State Space Models
- Hugging Face Transformers
Advanced Deep Learning - AnalysisBeginnerNew
Audit Data Quality for a Climate Tech Sensor Network
You receive 30 days of ingested sensor data (around 400 million rows) plus the sensor inventory and known maintenance windows. Define a set of data-quality expectations (null ra…
- Data Quality
- Great Expectations
- Anomaly Detection
Data Engineering and Big Data Systems Practice your coursework on real scenarios.
Every challenge is shaped from real industry context — not generic exercises. The work mirrors what your degree prepares you for.
Why Ewance
- CodeIntermediateNew
Segment Cells from Microscopy Images for a Pharma-AI Discovery Lab
You receive 3,500 microscopy images with pixel-level cell masks plus a 200-image hold-out set re-annotated by two biologists for inter-annotator agreement. Train a U-Net or SegF…
- Semantic Segmentation
- U Net
- Pytorch Or Tensorflow
Deep Learning for Computer Vision - CodeBeginnerNew
Reduce Dimensionality on Sensor Streams for a Mid-Cap Robotics OEM
You receive 120 robot-hours of windowed sensor data (5s windows, 240 channels) with labels for normal vs. one of four fault classes. Implement (1) PCA, (2) kernel PCA with an RB…
- Dimensionality Reduction
- Kernel Methods
- Autoencoders
Machine Learning - CodeBeginnerNew
Quantize a Vision Model for a Smart-Doorbell SoC
You receive a trained FP32 PyTorch person-detector (mAP 0.74 on a 5k validation set) plus a calibration dataset of 500 unlabeled doorbell frames. Convert to ONNX, then apply pos…
- Quantization
- Model Optimization
- Onnx Optimization
Edge ML and On-Device Machine Learning - DesignIntermediateNew
Prototype an Explainability Panel for a Fintech Credit Assistant
You receive: the model's top-10 SHAP-style feature contributions per customer (a feature-importance technique that breaks an ML prediction into per-input contributions), the cur…
- Explainability Design
- Human Ai Interaction
- Figma Prototyping
Human-Computer Interaction for AI Systems - Browse challenges
Explore role
Marketing Analyst
Plan and measure campaigns that grow the business. Funnel analytics, attribution, segmentation, and the rigorous measurement that lets marketing defend its budget at the leadership table.
- CodeIntermediateNew
Design Safe Navigation Behavior for a Hospital Delivery Robot
You receive a dataset of 200 anonymized hospital corridor traces (people positions over time from the robot's LIDAR) plus the current planner's parameters. Design a policy that …
- Human Aware Navigation
- Ros2
- Motion Planning
Human-Robot Interaction - CodeIntermediateNew
Adapt Machine Translation to a Niche Domain
Pick an open MT base (NLLB-200 or a strong open M2M model). Build a parallel corpus of around 8,000 sentence pairs from the company's bilingual safety standards. Fine-tune on th…
- Machine Translation
- Domain Adaptation
- Hugging Face Transformers
Natural Language Processing - CodeIntermediateNew
Actor-Critic for Energy-Storage Dispatch
You receive 3 years of hourly day-ahead price data and a Python simulator that models state of charge, round-trip efficiency, and a 1-day price forecast with documented uncertai…
- Actor Critic
- A2c
- Deep Rl
Reinforcement Learning - CodeIntermediateNew
Build a Domain-Specific Named-Entity Recognizer for Legal Contracts
Start from a strong English NER base (spaCy transformer or LegalBERT). Fine-tune on a provided 1,200-contract labeled dataset for the 9 entity types. Handle long contracts (ofte…
- Named Entity Recognition
- Sequence Labeling
- Domain Adaptation
Natural Language Processing Build a verifiable portfolio.
Submissions become evidence. Reviewers with shipping experience score against a rubric; the result becomes a credential anyone can verify.
Why Ewance
- AnalysisBeginnerNew
Predict Equipment Failure for a Wind-Farm Operator
You receive 18 months of SCADA (Supervisory Control and Data Acquisition — the standard turbine telemetry feed) data sampled every 10 minutes from all 240 turbines, with labeled…
- Classification
- Regularized Regression
- Gradient Boosting
Statistical Machine Learning - AnalysisIntermediateNew
Imitation Learning from Human Demos for a Drone Inspection
You receive 6 hours of expert pilot demonstrations (state-action pairs at 20 Hz) recorded in an AirSim wind-farm environment with 3 turbine designs, plus a held-out 4th turbine …
- Imitation Learning
- Behavioral Cloning
- Dagger
Deep Reinforcement Learning - DesignBeginnerNew
Simulation-Based Capacity Planning for a 40-Person SaaS Scale-Up
Your task is to simulate Flowly's customer success operations over the next 12 months. Model the arrival of new enterprise customers, their onboarding and support requests, and …
- Simulation
- Capacity Planning
- Sensitivity & Scenario Analysis
Business Analytics - ResearchIntermediateNew
Build Saliency-Map Explanations for Dermatology Triage
You receive a trained CNN (ResNet-50 backbone, 7-class lesion classifier) and a 1,000-image held-out test set with dermatologist labels. Implement Integrated Gradients, GradCAM,…
- Saliency Maps
- Integrated Gradients
- Gradcam
Explainable and Interpretable AI - AnalysisBeginnerNew
Mine Association Rules for a Grocery Retailer's Promo Strategy
You receive 6 months of basket-level transaction data (around 22 million baskets, around 18,000 SKUs) plus a category taxonomy. Run association-rule mining (Apriori or FP-Growth…
- Association Rules
- Market Basket Analysis
- Apriori
Data Mining and Knowledge Discovery - AnalysisBeginnerNew
Cost-Optimize an Embedding Pipeline for a Customer Support Knowledge Base
You receive: (a) the current pipeline (full re-embed on any article change, OpenAI text-embedding-3-large, 3,072 dims) with one month of cost logs, (b) a sample of 5,000 article…
- Embedding Models
- Finops & Cost Optimization
- Change Detection
Vector Databases and Embeddings - ResearchIntermediateNew
Audit a Public LLM Benchmark for Validity Threats
Choose one open LLM benchmark (e.g., MMLU, GPQA, BIG-Bench-Hard, MATH). Read the benchmark paper plus at least three follow-up critiques. Audit (1) data contamination risk again…
- Benchmark Evaluation
- Data Contamination Analysis
- Annotation Methodology
AI Measurement and Evaluation - AnalysisBeginnerNew
Interpretable-by-Design GAM for an Insurer's Claims Triage
You receive an anonymized claims dataset (around 60,000 claims, target: log reserve), a feature schema (22 features), and an existing LightGBM baseline (held-out R^2 of 0.78). T…
- Generalized Additive Models
- Ebm
- Interpretability
Explainable and Interpretable AI - DesignBeginnerNew
Privacy-Preserving Crowd-Density Estimator for Transit Stations
Use a public crowd-counting dataset (e.g., ShanghaiTech or JHU-CROWD) to train a small crowd-density estimator (CSRNet or similar). Wrap it in an on-device pipeline (Python is f…
- Crowd Counting
- Scene Understanding
- Privacy By Design
Visual Intelligence and Visual Reasoning - ResearchIntermediateNew
Design a Capability Evaluation for an Open-Weights Coding Model
Pick a recent open-weights coding model (e.g., a Qwen, DeepSeek, or Llama variant). Design an evaluation set of around 40 coding tasks across 4 buckets: standard benign coding, …
- Capability Evaluation
- Safety Evaluation
- LLM Evaluation
AI Safety and Alignment - DesignIntermediateNew
Counterfactual Explanations for an Insurance Pricing Model
You receive a trained LightGBM regression model (premium in GBP), the feature schema (28 features, 14 mutable from the customer's side), and 500 sample quotes. Use DiCE (Diverse…
- Counterfactual Explanations
- Dice Ml
- Interpretability
Explainable and Interpretable AI - AnalysisFoundationalNew
Sentiment Analysis for Tel Aviv D2C Cosmetics Brand
You are provided with a dataset of 10,000 customer reviews (in English) with no labels. Your task is to preprocess the text, develop a sentiment classification model using NLP t…
- Text Preprocessing
- Sentiment Analysis
- Classification
Text Analytics and Natural Language Processing
How it works
From brief to credential, in six steps.
Step 01
Browse challenges aligned to your studies.
Step 02
Accept the one that fits your goals.
Step 03
Work through it with AI Copilot guidance.
Step 04
Submit for structured evaluation.
Step 05
Earn a verified credential.
Step 06
Add it to LinkedIn with one click.
Industry teams behind a decade of practitioner briefs
Hiring from this pool?
Sponsor a challenge and meet candidates through actual work.
Industry teams can shape briefs around the skills they hire for, then evaluate students on rubric-scored deliverables — not resumes.



















































































