Computer & Information Sciences
Data Science Challenges
Real data-science projects and challenges on Ewance — clean messy datasets, build and evaluate models, and turn raw data into decisions the way a working data scientist does. Solve them to build a portfolio of verified, recruiter-checkable proof you can do the work — not just describe it.
Recommended challenges
- ResearchSeniorNew
Audit a Production Model for Membership Inference Attacks
Use a black-box membership inference attack (e.g., the LiRA or shadow-model attack). You have query access to a sandboxed copy of the model + the original training data labels f…
- Membership Inference
- Privacy Attacks
- Model Evaluation
Privacy-Preserving Machine Learning - CodeIntermediateNew
Multi-Sensor Late-Fusion Prototype for an Indoor AGV
Use the public KITTI dataset (or a similar paired LiDAR+RGB dataset) restricted to static-obstacle classes. Implement a late-fusion baseline: a LiDAR-only detector (PointPillars…
- Sensor Fusion
- Object Detection
- Perception
AI for Autonomous Vehicles - CodeIntermediateNew
Detect Coordinated Fraud Rings via Link Analysis at a Neobank
You receive 90 days of account, login, and transaction data (around 1.2 million accounts, around 30 million events) plus a labeled set of 80 known fraud rings. Build a multi-rel…
- Graph Analysis
- Community Detection
- Link Analysis
Data Mining and Knowledge Discovery - CodeIntermediateNew
Plan Inventory Replenishment as an MDP for an E-Commerce AI Startup
You receive 18 months of daily demand for 50 representative SKUs at one warehouse plus lead-time and unit-cost data. For one SKU at a time, formulate an MDP with state = (on-han…
- Mdp Modeling
- Value Iteration
- Dynamic Programming
Decision Making Under Uncertainty Develop in-demand professional skills.
Each challenge names the skills it strengthens. Over time, your profile fills with the competences a hiring manager would actually look for.
Why Ewance
- CodeBeginnerNew
Semantic Segmentation for a Solar-Panel Inspection Drone
Use a publicly-available solar-panel dataset (or the PV-Defect-Detection dataset). Fine-tune a small U-Net or SegFormer-tiny on panel/no-panel pixel-level segmentation. Evaluate…
- Semantic Segmentation
- Cnn Classification
- Transfer Learning
Computer Vision (Undergraduate) - CodeIntermediateNew
Forecast Intraday FX Volatility for a London Liquidity Desk
You receive 18 months of tick-level mid-quote data for six FX pairs plus a calendar of scheduled macro events. Resample to 1-minute bars, engineer realized-volatility features, …
- Time Series Forecasting
- Feature Engineering
- Model Validation
AI and Quantitative Finance - AnalysisIntermediateNew
Cut Latency and Cost on a High-Volume Summarization Service
You receive 30 days of anonymized request logs (prompt token counts, completion token counts, latencies, models used). Profile the cost and latency distribution, then design and…
- Finops & Cost Optimization
- Latency Optimization
- Prompt Compression
LLM Application Development - AnalysisBeginnerNew
Cost-Model a Foundation-Model API Migration
You receive: 90 days of API logs (request volume, token distributions), the customer's golden eval set of 200 prompts, the incumbent and new pricing schedules, and quality ratin…
- Cost Modeling
- Ai Workforce Strategy
- Model Evaluation
AI for Business and AI Product Management - Browse challenges
Explore role
Pricing Strategist
Set the price that captures value without leaving sales on the table. Demand modelling, willingness-to-pay research, and the disciplined experimentation that turns pricing into a competitive advantage.
- CodeBeginnerNew
Train a Word-Alignment Model for Low-Resource Catalan-Aranese
You receive a 35,000-sentence Catalan-Aranese parallel corpus plus a 1,200-pair manually annotated word-alignment test set. Train (1) a classic statistical alignment baseline (e…
- Alignment
- Neural Mt
- Low Resource Mt
Machine Translation - ResearchIntermediateNew
Benchmark Graph-Embedding Methods on a Climate-Network Dataset
You receive a 200M-edge sample of the knowledge graph and a labeled entity-similarity test set (5,000 pairs with relevance labels). Benchmark three methods: a shallow embedding …
- Graph Embeddings
- Neural Networks
- Scalable Ml
Machine Learning at Scale - CodeBeginnerNew
Structured-Output Prompts for Invoice Extraction
You receive 300 real invoice transcripts (already OCR-ed) labeled with 14 target fields, plus the current production prompt and its 12 percent failure log. Design a new prompt u…
- Structured Output
- Json Schema
- Few Shot Prompting
Prompt Engineering - CodeIntermediateNew
Hierarchical Plans for an Aerospace Maintenance Crew Scheduler
You receive a synthetic week of 80 work orders with hierarchical decompositions, technician certifications, and shared-tool constraints. Implement an HTN planner (PyHOP or HDDL …
- Htn Planning
- Domain Modeling
- Constraint Handling
Automated Planning Get recognized by recruiters and employers.
Credentials are blockchain-anchored via LearnCoin — tamper-evident, portable, link-shareable on LinkedIn and beyond.
Why Ewance
- CodeIntermediateNew
Fine-Tune a Small Transformer for Legal-Domain EN-DE Translation
You receive a 120,000-segment parallel EN-DE legal corpus and a held-out 1,000-segment test set with reference translations. Fine-tune a small pretrained Transformer (e.g., NLLB…
- Neural Mt
- Hugging Face Transformers
- Fine Tuning
Machine Translation - ResearchIntermediateNew
Multi-Task Learning for a Healthtech Triage Model
You receive 40,000 anonymized de-identified intake-form records with two labels: urgency tier (4 classes) and routed sub-specialty (12 classes). Train (1) two independent classi…
- Multi Task Learning
- Transfer Learning
- Hugging Face Transformers
Meta-Learning, Transfer Learning, and Multi-Task Learning - CodeBeginnerNew
Build a Real-Time Operations Wall Display for a Logistics AI Startup
You receive a websocket feed of operational events (around 20 events per second) plus a small KPI definition list (throughput per zone, late-truck count, exception queue depth, …
- Realtime Visualization
- Glanceability
- D3
Data Visualization - DesignIntermediateNew
Build a Feature Store for a Fintech Fraud Team
You will design a feature-store layer covering 12 representative fraud features (account-level, merchant-level, transaction-level), with both batch (Spark) and online (low-laten…
- Feature Stores
- Data Pipelines
- Apache Spark
Machine Learning at Scale - ResearchBeginnerNew
Run a Human-Preference Study Comparing Two Coding Assistants
Design a blinded paired-comparison study: 12 developer participants, each gets the same 8 realistic coding tasks (refactor, write a function, debug, test), each task is solved b…
- Experimental Design
- Statistical Evaluation
- Human Evaluation
AI Measurement and Evaluation - CodeIntermediateNew
Build an Audio-Visual Speaker Diarization Pipeline
Build the pipeline: face detection + active-speaker detection on video, voice-activity detection + speaker embeddings on audio, then a fusion step that ties tracks to detected f…
- Audio Visual Fusion
- Speaker Diarization
- Active Speaker Detection
Multimodal Machine Learning - ResearchIntermediateNew
QLoRA Fine-Tune for a Customer-Support Domain Assistant
You receive 8,000 anonymized support ticket pairs (question -> agent response), the company's product documentation (around 600 pages), and a strong RAG baseline already running…
- Qlora
- Fine Tuning
- RAG Architectures
Fine-Tuning Large Language Models - CodeIntermediateNew
Design an SAT-Based Verifier for an Autonomous-Vehicle Test Lab
Model a simplified four-way intersection: agent positions, lights, and discrete time steps. Define 5 safety properties in propositional logic (e.g., 'no two agents in the inters…
- Sat Solving
- Logical Inference
- Formal Verification
Artificial Intelligence: Principles and Techniques - CodeIntermediateNew
Build a Hybrid Recommender for a Niche Consumer-AI Music App
You receive listening events (around 240 million plays) plus a content embedding per track (audio + curator tags). Build a collaborative filtering model (ALS or implicit-feedbac…
- Recommender Systems
- Collaborative Filtering
- Content Based Filtering
Data Mining and Knowledge Discovery - CodeIntermediateNew
Agentic RAG with Context-Window Budgeting
You receive a synthetic dataset of 60 founder-style queries paired with 'workspaces' (each up to 500 documents across 3 source types), plus gold-standard answers and citation li…
- Agentic RAG
- Context Window Management
- Iterative Retrieval
Retrieval-Augmented Generation - ResearchSeniorNew
Long-Context QA Evaluation Benchmark for Legal Memoranda
You receive 25 anonymized legal memoranda (50-90 pages each) and 100 QA pairs whose answers are deliberately spread across the documents (25 in pages 1-20, 25 in pages 20-40, 25…
- Long Context Qa
- Benchmark Design
- Model Evaluation
Question Answering and Conversational Systems - DesignSeniorNew
Multi-Region Failover for an Enterprise RAG Service
Design and prototype: (1) a primary-region deployment of the RAG service (vector DB + LLM inference + retrieval API), (2) a passive secondary region with replicated vector store…
- Multi Region Architecture
- Disaster Recovery
- Terraform
Cloud Computing for Data and ML
How it works
From brief to credential, in six steps.
Step 01
Browse challenges aligned to your studies.
Step 02
Accept the one that fits your goals.
Step 03
Work through it with AI Copilot guidance.
Step 04
Submit for structured evaluation.
Step 05
Earn a verified credential.
Step 06
Add it to LinkedIn with one click.
Industry teams behind a decade of practitioner briefs
Hiring from this pool?
Sponsor a challenge and meet candidates through actual work.
Industry teams can shape briefs around the skills they hire for, then evaluate students on rubric-scored deliverables — not resumes.



















































































