Data Sciences Challenges
Explore data science challenges on Ewance to build skills employers expect from analysts and ML engineers. Work through challenges on data cleaning, exploratory analysis, modeling, and storytelling with data.
Most Popular
- CodeBeginnerNew
Reduce Dimensionality on Sensor Streams for a Mid-Cap Robotics OEM
You receive 120 robot-hours of windowed sensor data (5s windows, 240 channels) with labels for normal vs. one of four fault classes. Implement (1) PCA, (2) kernel PCA with an RB…
- Dimensionality Reduction
- Kernel Methods
- Autoencoders
Machine Learning - AnalysisBeginnerNew
Evaluate Speech-to-Text Quality for a Contact-Center Analytics Vendor
You receive 200 anonymized call-recording snippets (2-4 minutes each, ~67 per language) with reference transcripts plus a domain glossary of about 600 product terms. Run all thr…
- Speech Recognition
- Sequence Models
- Model Evaluation
Machine Perception - CodeBeginnerNew
Prototype a Multimodal Visual-Question-Answering Demo
You will use a small open-source vision-language model (e.g., LLaVA-1.5-7B or PaliGemma) and prompt-engineer it for the warehouse-VQA task. Build a Gradio web demo. Construct a …
- Vision Language Models
- Multimodal Perception
- Prompt Patterns
Machine Perception - DesignBeginnerNew
Build an Attention-Visualization Tool for Translation Quality Audit
You will load a small open-source EN-FR transformer (e.g., Helsinki-NLP Opus-MT-en-fr), build a Streamlit or Gradio demo that lets the user paste English source, see the French …
- Attention Mechanisms
- Neural Mt
- Tool Design
Machine Translation Develop in-demand professional skills.
Each challenge names the skills it strengthens. Over time, your profile fills with the competences a hiring manager would actually look for.
Why Ewance
- CodeBeginnerNew
Train a Word-Alignment Model for Low-Resource Catalan-Aranese
You receive a 35,000-sentence Catalan-Aranese parallel corpus plus a 1,200-pair manually annotated word-alignment test set. Train (1) a classic statistical alignment baseline (e…
- Alignment
- Neural Mt
- Low Resource Mt
Machine Translation - ResearchBeginnerNew
Drug-Repurposing Candidate Screen with Embedding Similarity
You receive (1) a list of 15 known therapeutic candidates (SMILES + ChEMBL identifiers) for a single rare disease, (2) a database of about 4,500 marketed drugs (SMILES + ATC cod…
- Molecular Embeddings
- Similarity Search
- Transfer Learning
Machine Learning for Healthcare and Biomedicine - CodeBeginnerNew
Ship a Churn-Prediction Mini-Project End to End
You receive a 12-month anonymized dataset of subscriber events (logins, lesson completions, payment history, support tickets) for around 200,000 users. Define churn precisely (n…
- Feature Engineering
- Model Evaluation
- Gradient Boosting
AI/ML Practicum and Hands-on Lab - CodeBeginnerNew
Team Practicum: Build a Crop-Disease Classifier with a Field Partner
You receive a labeled dataset of about 8,000 phone photos plus around 1,200 unlabeled photos from a held-out county. Audit and clean the labels (expect 5-10% noise), train a Mob…
- Transfer Learning
- Pytorch Or Tensorflow
- Model Evaluation
AI/ML Practicum and Hands-on Lab - Browse challenges
Explore role
Strategy Analyst
Frame the business question, model the options, build the recommendation. From market sizing to competitive analysis, this role is where strategy consulting meets in-house decision-making.
- AnalysisBeginnerNew
Build a Topic-Modeling Pipeline for Citizen Feedback
Take the 60,000 comments (anonymized). Build a BERTopic pipeline with multilingual sentence embeddings (Catalan + Spanish + occasional English). Tune number-of-topics via topic-…
- Topic Modeling
- Bertopic
- Multilingual NLP
Natural Language Processing - AnalysisBeginnerNew
Approximate Inference for a Topic Model on Customer Tickets
You receive 180,000 tickets (subject + body) spanning the last 18 months. Preprocess into a bag-of-words representation with sensible stopwords and bigrams. Fit a 20-topic LDA v…
- Variational Inference
- Latent Dirichlet Allocation
- Approximate Inference
Probabilistic Graphical Models - CodeBeginnerNew
Markov Random Field for Image Segmentation in Crop Monitoring
You receive 60 Sentinel-2 image tiles (10-meter resolution) over 12 vineyards, each tile with per-pixel disease labels from agronomist field walks. Take the consultancy's existi…
- Markov Random Fields
- Graph Cuts
- Image Segmentation
Probabilistic Graphical Models - CodeBeginnerNew
Calibrate a Demand Forecast with Bayesian Confidence Intervals
You receive 24 months of weekly demand for 600 SKUs plus the existing XGBoost point predictions. Fit a Bayesian conformal-prediction layer (or, alternatively, a Gaussian-Process…
- Bayesian Inference
- Uncertainty Quantification
- Conformal Prediction
Probabilistic Machine Learning Get recognized by recruiters and employers.
Credentials are blockchain-anchored via LearnCoin — tamper-evident, portable, link-shareable on LinkedIn and beyond.
Why Ewance
- CodeBeginnerNew
Structured-Output Prompts for Invoice Extraction
You receive 300 real invoice transcripts (already OCR-ed) labeled with 14 target fields, plus the current production prompt and its 12 percent failure log. Design a new prompt u…
- Structured Output
- Json Schema
- Few Shot Prompting
Prompt Engineering - CodeBeginnerNew
Open-Domain QA over Product Documentation
You receive a snapshot of the documentation (Markdown) and 120 real support questions with the URLs of pages containing the answer. Build an open-domain QA pipeline: chunk the d…
- Open Domain Qa
- Passage Retrieval
- Reading Comprehension
Question Answering and Conversational Systems - DesignBeginnerNew
Conversational UI for a Personal-Finance Assistant
You will work from 4 scripted scenarios: 'how much did I spend on coffee last month', 'why did my rent payment fail', 'help me set up an emergency fund', and an out-of-scope 'is…
- Conversational Ui
- Dialogue Design
- Trust Design
Question Answering and Conversational Systems - CodeBeginnerNew
Tabular Q-Learning for Warehouse Slotting
You receive a Python discrete-event simulator with state encoded as a 12-dimensional categorical vector (around 8,000 reachable states) and 6 possible slotting actions, plus 2 y…
- Tabular Rl
- Q Learning
- Epsilon Greedy
Reinforcement Learning - CodeBeginnerNew
Hybrid Search RAG for a HR-Policy Assistant
You receive 1,800 pages of policy documents (Markdown) and 150 labeled question-answer pairs with the gold source policy IDs. Build a hybrid retrieval pipeline: BM25 + dense emb…
- Hybrid Search
- Bm25
- Dense Retrieval
Retrieval-Augmented Generation - AnalysisBeginnerNew
Chunking Strategy Bake-Off for Financial Filings
You receive 40 anonymized 10-K filings and 100 labeled questions split into 50 narrative (e.g., 'what is the company's main risk factor?') and 50 numerical (e.g., 'what was oper…
- Document Chunking
- Semantic Chunking
- Layout Aware Chunking
Retrieval-Augmented Generation - CodeBeginnerNew
Plan Safe Paths for a Last-Mile Sidewalk Robot
You receive 4 hours of recorded sidewalk traversals with annotated pedestrian tracks, occupancy grids, and a map of the pilot neighborhood. Implement a sampling-based planner (R…
- Motion Planning
- Sampling Based Planning
- Cost Function Design
Robot Perception and Autonomy - DesignBeginnerNew
Detect Sensor Drift for a Field Inspection Robot Fleet
You receive 60 days of telemetry from 12 robots, including IMU readings, camera exposure stats, and the inspection-quality scores produced downstream. Define drift signals (roll…
- Anomaly Detection
- Change Point Detection
- Sensor Fusion
Robot Perception and Autonomy - AnalysisBeginnerNew
Audit Safety Stops for a Cafe-Service Robot Pilot
You receive 30 days of logs covering 240 near-miss events (close approach to a human, low-battery emergency, network loss). For each event, classify whether the safety stop trig…
- Safety Analysis
- Incident Review
- Failure Mode Analysis
Robotics - AnalysisBeginnerNew
Right-Size a Real-Time Recommendation Serving Cluster
You receive 7 days of request-level telemetry (timestamp, latency, error code, pod) plus the existing Horizontal Pod Autoscaler (HPA) and node-group configs. Analyze traffic pat…
- Model Serving
- Kubernetes Orchestration
- Autoscaling
Machine Learning at Scale - AnalysisBeginnerNew
Map Creator Communities for a Short-Form Video Platform
You receive a 90-day sample of about 4 million creator-creator interactions (duets, mentions, audience overlap) and creator metadata (region, language, content tag). Build a cre…
- Network Analysis
- Community Detection
- Graph Visualization
Social Network Analysis and Web Science - AnalysisBeginnerNew
Model Diffusion of a Hashtag Across a Music-Discovery Platform
You receive 30 days of hashtag-usage data (about 2.4 million events) with account metadata and the follow graph between active hashtag users. Fit an information-diffusion model …
- Diffusion Models
- Network Analysis
- Causal Attribution
Social Network Analysis and Web Science
How it works
From brief to credential, in six steps.
Step 01
Browse challenges aligned to your studies.
Step 02
Accept the one that fits your goals.
Step 03
Work through it with AI Copilot guidance.
Step 04
Submit for structured evaluation.
Step 05
Earn a verified credential.
Step 06
Add it to LinkedIn with one click.
Industry teams behind a decade of practitioner briefs
Hiring from this pool?
Sponsor a challenge and meet candidates through actual work.
Industry teams can shape briefs around the skills they hire for, then evaluate students on rubric-scored deliverables — not resumes.



















































































