Computer & Information Sciences
Data Science Challenges
Real data-science projects and challenges on Ewance — clean messy datasets, build and evaluate models, and turn raw data into decisions the way a working data scientist does. Solve them to build a portfolio of verified, recruiter-checkable proof you can do the work — not just describe it.
Recommended challenges
- AnalysisBeginnerNew
Optimize Hyperparameters with Bayesian Optimization on a Tight Budget
You receive a B2B-SaaS churn dataset (about 12,000 customer-month rows, 38 features) and a fixed sweep budget of 40 trials per model family. Implement a Bayesian optimizer (Optu…
- Bayesian Optimization
- Hyperparameter Tuning
- Ensemble Methods
Advanced Machine Learning - AnalysisIntermediateNew
Optimize Stop-Loss Policies with Dynamic Programming at a Quant Fund
You receive five years of daily PnL series for 12 momentum strategies plus a small set of state features (rolling vol, drawdown, regime indicator). Calibrate a discrete Markov m…
- Dynamic Programming
- Backward Induction
- State Modeling
Decision Making Under Uncertainty - DesignIntermediateNew
Co-Design a Trust Layer for an Enterprise RAG Assistant
You will plan and run a 5-day remote co-design study with eight pilot users (a mix of plant operators and middle managers). Sessions 1-2: discover where trust breaks down. Sessi…
- Co Design
- User Research
- Trust And Transparency
Human-Computer Interaction for AI Systems - CodeIntermediateNew
Extractive QA on Clinical Trial Protocols
You receive 500 anonymized protocol PDFs (already OCR-ed to text) and 1,200 labeled question-answer pairs where each answer is an exact text span. Build an extractive QA system:…
- Extractive Qa
- Reading Comprehension
- Model Finetuning
Question Answering and Conversational Systems Develop in-demand professional skills.
Each challenge names the skills it strengthens. Over time, your profile fills with the competences a hiring manager would actually look for.
Why Ewance
- CodeIntermediateNew
Restore Smartphone Low-Light Photos for a Consumer AI App
You receive 200 paired low-light / well-lit phone photos plus 1,000 unpaired low-light photos. Build a pipeline that combines a learned denoiser (e.g. a small DnCNN-style model …
- Image Restoration
- Denoising
- Tone Mapping
Image Processing and Computational Imaging - CodeIntermediateNew
Safety-Critical Test Harness for an AV Planner
Use CARLA (open-source AV simulator) and encode 10 representative safety scenarios across 3 categories (cut-in, pedestrian emergence, signalized-intersection right-of-way). Writ…
- Simulation
- Scenario Testing
- Safety Evaluation
AI for Autonomous Vehicles - ResearchSeniorNew
Investigate Why Our Generative Model Memorizes Training Data
Pick a small open-source diffusion model (e.g., a Stable-Diffusion-class community model trained on LAION-subset). Reproduce a published membership-inference + extraction probe …
- Generative Models
- Memorization Analysis
- Differential Privacy
Advanced Deep Learning - CodeBeginnerNew
Plan Safe Paths for a Last-Mile Sidewalk Robot
You receive 4 hours of recorded sidewalk traversals with annotated pedestrian tracks, occupancy grids, and a map of the pilot neighborhood. Implement a sampling-based planner (R…
- Motion Planning
- Sampling Based Planning
- Cost Function Design
Robot Perception and Autonomy - Browse challenges
Explore role
Pricing Strategist
Set the price that captures value without leaving sales on the table. Demand modelling, willingness-to-pay research, and the disciplined experimentation that turns pricing into a competitive advantage.
- ResearchIntermediateNew
Explore the Bias-Variance Trade-off on a Tabular Healthcare Cohort
You receive a 90,000-patient anonymized de-identified tabular dataset (demographics, labs, claims-derived features) and a binary 12-month-readmission outcome. Pick three model f…
- Bias Variance Tradeoff
- Regularization
- Model Selection
Machine Learning - CodeBeginnerNew
Build a Face-Anonymization Tool for a Civic-Tech Newsroom
Use a pretrained face detector (RetinaFace or YOLOv8-face is fine). Build a Python tool with a Gradio or Streamlit UI that: (1) detects faces in an uploaded photo, (2) shows det…
- Object Detection
- Image Processing
- Opencv
Computer Vision (Undergraduate) - ResearchIntermediateNew
Hardware-Aware NAS for a Wearable ECG Classifier
You receive a labeled subset of an arrhythmia ECG dataset (about 80,000 10-second windows, 4 classes), a microcontroller latency lookup table (op-level milliseconds) for a Corte…
- Neural Architecture Search
- Hardware Aware Design
- Edge Inference
Edge ML and On-Device Machine Learning - CodeIntermediateNew
Train a VAE for Synthetic Tabular Data at a Healthtech Startup
You receive a synthetic-but-realistic clinical-trial table (around 50,000 patients, 35 columns, mixed continuous and categorical). Train a tabular VAE (or TVAE/CTGAN as alternat…
- Vae
- Tabular Generation
- Synthetic Data
Deep Generative Models Get recognized by recruiters and employers.
Credentials are blockchain-anchored via LearnCoin — tamper-evident, portable, link-shareable on LinkedIn and beyond.
Why Ewance
- AnalysisIntermediateNew
Draft GDPR + AI Act Data Provisions for a Training-Data Vendor
Anchor the work on (1) GDPR Articles 28 (processor obligations) and 32 (security), (2) the EU AI Regulation's data-governance article for high-risk systems, and (3) the EDPB's p…
- Data Protection Law
- Contract Redlining
- Regulatory Analysis
AI Law, Policy, and Regulation - DesignSeniorNew
Design Eval Suite for a Multimodal Brainstorming Assistant
You receive (1) the assistant's current API, (2) a list of 6 launch user-personas, and (3) the product team's quality target ('beat the previous model on 4 of 6 personas'). Desi…
- LLM Evaluation
- Multimodal Evaluation
- Safety Evaluation
Generative AI - AnalysisBeginnerNew
Analyze a Learning-Analytics Dataset for At-Risk Detection
You receive an anonymized dataset of LMS engagement features (logins, assignment submissions, forum posts, video-watch time), grade history, and a binary label for end-of-semest…
- Learning Analytics
- Classification
- Fairness Metrics
AI in Education and Learning Analytics - PresentationBeginnerNew
Storytelling Visualization of an Autonomous Vehicle Test Campaign
You receive aggregated test results: 12,000 test runs across dry, wet, and snow conditions, with metrics for disengagement rate, near-miss count, and route-completion percentage…
- Data Storytelling
- Audience Adaptation
- Chart Design
Data Visualization - AnalysisBeginnerNew
Audit a Climate-Tech Sensor Dataset for Production Readiness
You receive 18 months of raw sensor readings from 1,200 sensors (about 800M rows), plus a sensor-metadata table (location, firmware version, deployment date). Profile the data f…
- Data Quality Audit
- Data Profiling
- Time Series Analysis
Applied Data Analysis and Practical Data Science - DesignSeniorNew
Design a Distributed Training Job for a 13B-Parameter Model
Decide whether to use Fully Sharded Data Parallel (FSDP), Tensor Parallelism, Pipeline Parallelism, or a hybrid; justify against the 13B-param + 32-H100 setup. Calculate memory …
- Distributed Training
- Fsdp
- Pytorch Or Tensorflow
Machine Learning Systems - DesignBeginnerNew
Design an Automated Essay-Feedback System
You receive 20 anonymized middle-school essays scored by 2 human teachers on a 4-dimension rubric (structure, evidence, voice, mechanics). Design an LLM-based feedback system th…
- Automated Assessment
- Rubric Design
- Prompt Patterns
AI in Education and Learning Analytics - DesignBeginnerNew
Optimizing Inventory for a São Paulo D2C Cosmetics Brand
You are given a CSV file with raw sales, inventory, and supplier data. Your task is to design an E/R diagram, create the normalized relational schema in 3NF, populate it with sa…
- SQL
- Database Design
- Normalization
Database Systems - CodeIntermediateNew
Migrate a Legacy Warehouse to a Lakehouse for an Edtech AI Platform
You receive a Postgres dump of around 50 GB and the current dbt models that produce the student-attempts mart. Land the raw data in object storage (S3 or GCS) as Parquet partiti…
- Lakehouse Architecture
- Delta Lake
- Apache Spark
Data Engineering and Big Data Systems - AnalysisBeginnerNew
Community Detection on a Pharma Clinical-Trial Investigator Graph
You receive a pre-fetched dump of around 15,000 trials from a public registry covering oncology over the last 10 years and a mapping of trials to investigator names + institutio…
- Community Detection
- Louvain
- Leiden
Machine Learning on Graphs - CodeIntermediateNew
Scene-Graph Generation for Retail Shelf Audits
You receive 1,500 labeled shelf photos (anonymized product crops, bounding boxes, and ~12 relation types). Build a pipeline that, for a new shelf photo, outputs (a) detected pro…
- Scene Graph Generation
- Object Detection
- Relation Prediction
Visual Intelligence and Visual Reasoning - CodeIntermediateNew
Train a Domain-Specific Reranker for a Legal-Tech Search Box
You receive 20,000 (query, document, relevance-label) triples from the firm's contract corpus. Fine-tune a small cross-encoder (e.g., ms-marco-MiniLM-L-6-v2 or BAAI/bge-reranker…
- Cross Encoder Reranker
- Fine Tuning
- Ir Evaluation
Information Retrieval and Search
How it works
From brief to credential, in six steps.
Step 01
Browse challenges aligned to your studies.
Step 02
Accept the one that fits your goals.
Step 03
Work through it with AI Copilot guidance.
Step 04
Submit for structured evaluation.
Step 05
Earn a verified credential.
Step 06
Add it to LinkedIn with one click.
Industry teams behind a decade of practitioner briefs
Hiring from this pool?
Sponsor a challenge and meet candidates through actual work.
Industry teams can shape briefs around the skills they hire for, then evaluate students on rubric-scored deliverables — not resumes.



















































































