Computer & Information Sciences
Data Science Challenges
Real data-science projects and challenges on Ewance — clean messy datasets, build and evaluate models, and turn raw data into decisions the way a working data scientist does. Solve them to build a portfolio of verified, recruiter-checkable proof you can do the work — not just describe it.
Recommended challenges
- CodeBeginnerNew
Calibrate a Demand Forecast with Bayesian Confidence Intervals
You receive 24 months of weekly demand for 600 SKUs plus the existing XGBoost point predictions. Fit a Bayesian conformal-prediction layer (or, alternatively, a Gaussian-Process…
- Bayesian Inference
- Uncertainty Quantification
- Conformal Prediction
Probabilistic Machine Learning - CodeBeginnerNew
Build a Real-Time Operations Wall Display for a Logistics AI Startup
You receive a websocket feed of operational events (around 20 events per second) plus a small KPI definition list (throughput per zone, late-truck count, exception queue depth, …
- Realtime Visualization
- Glanceability
- D3
Data Visualization - CodeIntermediateNew
Triage Medical-Imaging Annotations with a Small Vision Model
Train a binary normal/abnormal classifier on the public CheXpert or NIH ChestX-ray14 dataset. Use temperature scaling to calibrate the output, then define abstention thresholds …
- Cnn Classification
- Transfer Learning
- Calibration
Applied Machine Learning - CodeBeginnerNew
Behavior Cloning for a Pick-and-Place Manipulator
You receive 200 human teleoperated demonstrations (state + action trajectories) of picking 8 small electronic components from a tray and placing them at marked locations in a ro…
- Behavior Cloning
- Imitation Learning
- Manipulation
Robot Learning Develop in-demand professional skills.
Each challenge names the skills it strengthens. Over time, your profile fills with the competences a hiring manager would actually look for.
Why Ewance
- AnalysisBeginnerNew
Explain a Credit-Risk Model with SHAP for a Fintech
You receive a trained XGBoost credit-risk model (binary default prediction), the training feature schema (38 features), and a held-out 10,000-sample test set with labels. Comput…
- Shap
- Interpretability
- Fairness Analysis
Explainable and Interpretable AI - StrategyBeginnerNew
Spec a Voice Agent for an Airline's Disruption Support Line
You will produce a 6-page voice-agent product spec covering: (1) supported intents and out-of-scope handling, (2) handoff-to-human criteria, (3) latency and confidence threshold…
- Voice Agent Design
- Intent Design
- Metric Design
Speech Recognition and Spoken Language Processing - CodeIntermediateNew
Build a Multilingual Customer-Email Classifier
You receive 28,000 labeled emails (skewed toward English and Mandarin). Try at least two approaches: (1) a fine-tuned multilingual transformer (XLM-RoBERTa or mDeBERTa) and (2) …
- Text Classification
- Multilingual NLP
- Hugging Face Transformers
Natural Language Processing - ResearchSeniorNew
Diffusion-Policy Imitation for Bimanual Cooking Tasks
You receive 300 teleoperated demonstrations of a bimanual pour-and-stir task in a Robomimic-style simulator, deliberately including 2 valid solution modes per task (left-pour-ri…
- Diffusion Policies
- Imitation Learning
- Multimodal Action Distributions
Robot Learning - Browse challenges
Explore role
Pricing Strategist
Set the price that captures value without leaving sales on the table. Demand modelling, willingness-to-pay research, and the disciplined experimentation that turns pricing into a competitive advantage.
- AnalysisBeginnerNew
Diagnose Query Failures in an E-Commerce Search Box
You receive 6 months of anonymized query logs (~480 million rows): query string, language hint, results-shown count, top-3 product clicks, and add-to-cart events. Build a notebo…
- Query Log Analysis
- Clustering
- Ir Failure Analysis
Information Retrieval and Search - ResearchSeniorNew
Benchmark Conformal Prediction for a Healthcare Risk-Score
You receive a labeled dataset of about 25,000 patient encounters with the current risk-score's predictions and ground-truth 1-year outcomes. Implement and compare split conforma…
- Conformal Prediction
- Uncertainty Quantification
- Calibration
Statistical Machine Learning - AnalysisBeginnerNew
Build a Reproducible Pricing Analysis for a DTC Skincare Brand
You receive 24 months of order-line data (around 480,000 lines), a Shopify-style customer export, and a discount-code log. Build a Python pipeline that produces: SKU-level price…
- Data Wrangling
- Exploratory Data Analysis
- Cohort Analysis
Applied Data Analysis and Practical Data Science - DesignIntermediateNew
Design a Real-Time Order Pipeline for a Fintech Payments Platform
You receive a synthetic Kafka stream of around 500 transactions per second, a static merchant dimension table (about 80,000 rows), and a daily FX rate snapshot. Design an end-to…
- Streaming Data
- Kafka Event Streaming
- Stream Processing
Data Engineering and Big Data Systems Get recognized by recruiters and employers.
Credentials are blockchain-anchored via LearnCoin — tamper-evident, portable, link-shareable on LinkedIn and beyond.
Why Ewance
- AnalysisIntermediateNew
Benchmark NPUs for an Autonomous Forklift Vision Stack
You receive ONNX exports of the 3 production models, a labeled validation set of 2,000 forklift-camera frames, and developer-kit access to three NPU candidates (anonymized as NP…
- Edge Inference
- Npu Benchmarking
- Onnx Optimization
Edge ML and On-Device Machine Learning - ResearchSeniorNew
Benchmark Reward-from-Feedback Methods on a Tabletop Pick-Place
You will use a Franka Panda arm in PyBullet on a 4-object pick-and-place task. For each of the three feedback methods, train a reward model and a downstream policy until converg…
- Reinforcement Learning
- Reward Learning
- Preference Comparison
Human-Robot Interaction - DesignSeniorNew
Multi-Region Failover for an Enterprise RAG Service
Design and prototype: (1) a primary-region deployment of the RAG service (vector DB + LLM inference + retrieval API), (2) a passive secondary region with replicated vector store…
- Multi Region Architecture
- Disaster Recovery
- Terraform
Cloud Computing for Data and ML - ResearchSeniorNew
Benchmark Long-Context Architectures on a Legal-Doc Retrieval Task
You receive a public legal-QA dataset (e.g., LongBench's legal split or similar) filtered to documents over 50,000 tokens. Implement or wrap 3 architectures: a sliding-window Tr…
- Long Context Architectures
- State Space Models
- Hugging Face Transformers
Advanced Deep Learning - ResearchIntermediateNew
Neuro-Symbolic Question Answering on an Enterprise Knowledge Graph
You receive a curated Turtle-format knowledge graph (around 2 million triples covering organizational structure, products, projects), 200 labeled question-SPARQL pairs split 140…
- Neuro Symbolic
- Sparql
- Knowledge Graphs
Fuzzy Logic, Knowledge Representation, and Symbolic Reasoning - AnalysisBeginnerNew
Map Creator Communities for a Short-Form Video Platform
You receive a 90-day sample of about 4 million creator-creator interactions (duets, mentions, audience overlap) and creator metadata (region, language, content tag). Build a cre…
- Network Analysis
- Community Detection
- Graph Visualization
Social Network Analysis and Web Science - CodeBeginnerNew
Hybrid Search RAG for a HR-Policy Assistant
You receive 1,800 pages of policy documents (Markdown) and 150 labeled question-answer pairs with the gold source policy IDs. Build a hybrid retrieval pipeline: BM25 + dense emb…
- Hybrid Search
- Bm25
- Dense Retrieval
Retrieval-Augmented Generation - ResearchSeniorNew
Design a Distributed-Training Strategy for a Mid-Sized LLM
You will write a 5-page design memo that picks a parallelism strategy for fine-tuning a 13B model on 32 H100 GPUs, with a tokens-per-second estimate, a memory-per-GPU calculatio…
- Distributed Training
- Parallelism Strategies
- LLM Training
Machine Learning at Scale - AnalysisIntermediateNew
Draft GDPR + AI Act Data Provisions for a Training-Data Vendor
Anchor the work on (1) GDPR Articles 28 (processor obligations) and 32 (security), (2) the EU AI Regulation's data-governance article for high-risk systems, and (3) the EDPB's p…
- Data Protection Law
- Contract Redlining
- Regulatory Analysis
AI Law, Policy, and Regulation - CodeIntermediateNew
Restore Smartphone Low-Light Photos for a Consumer AI App
You receive 200 paired low-light / well-lit phone photos plus 1,000 unpaired low-light photos. Build a pipeline that combines a learned denoiser (e.g. a small DnCNN-style model …
- Image Restoration
- Denoising
- Tone Mapping
Image Processing and Computational Imaging - CodeIntermediateNew
Build a Federated Learning Prototype Across Two Hospitals
Simulate two sites with non-IID data splits (one site skews older, the other younger). Implement FedAvg using Flower (or PySyft). Run for at least 50 communication rounds; repor…
- Federated Learning
- Fedavg
- Secure Aggregation
Privacy-Preserving Machine Learning - CodeIntermediateNew
Description-Logic Reasoner for Insurance-Policy Coverage Checks
You receive 50 representative coverage rules in plain English (from the current rule engine) and a sample of 1,000 anonymized claim cases with the current engine's outcomes (cov…
- Description Logics
- Owl
- Reasoning
Fuzzy Logic, Knowledge Representation, and Symbolic Reasoning
How it works
From brief to credential, in six steps.
Step 01
Browse challenges aligned to your studies.
Step 02
Accept the one that fits your goals.
Step 03
Work through it with AI Copilot guidance.
Step 04
Submit for structured evaluation.
Step 05
Earn a verified credential.
Step 06
Add it to LinkedIn with one click.
Industry teams behind a decade of practitioner briefs
Hiring from this pool?
Sponsor a challenge and meet candidates through actual work.
Industry teams can shape briefs around the skills they hire for, then evaluate students on rubric-scored deliverables — not resumes.



















































































