Data Science
Data Scientist
Somewhere between a researcher and a software engineer sits the data scientist, the person who answers questions a company didn't quite know how to ask. The work blends statistics, programming, and a stubborn refusal to accept easy explanations.
One week you might design an experiment to measure whether a new feature actually causes more usage, the next you might build a forecasting model that shapes next year's hiring plan. Students who enjoyed both math class and writing essays tend to thrive.
Tools like Python and PyMC become daily companions, but the deeper skill is asking the right question. Excellence looks like a recommendation that holds up six months later because the underlying causal reasoning was sound.
Problem framing, experimental design, and defending methodology to non-technical audiences are exactly the parts of the role that AI augments but doesn't replace.
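To make the "design an experiment" idea concrete, here is a minimal sketch of how you might check whether a new feature is associated with more usage, assuming users were randomly assigned to a control group and a treated group. The function name `permutation_test`, the sample numbers, and the metric (weekly sessions per user) are all hypothetical and chosen only for illustration; the causal reading depends on the random assignment, not on the code.

```python
import random
from statistics import mean


def permutation_test(control, treated, n_permutations=10_000, seed=0):
    """Two-sided permutation test on the difference in group means.

    Assumes users were randomly assigned to the two groups, so that
    shuffling the group labels mimics "the feature had no effect".
    """
    rng = random.Random(seed)
    observed = mean(treated) - mean(control)
    pooled = list(control) + list(treated)
    n_control = len(control)

    extreme = 0
    for _ in range(n_permutations):
        rng.shuffle(pooled)  # reassign group labels at random
        diff = mean(pooled[n_control:]) - mean(pooled[:n_control])
        if abs(diff) >= abs(observed):
            extreme += 1

    # Add-one correction keeps the estimated p-value strictly positive.
    p_value = (extreme + 1) / (n_permutations + 1)
    return observed, p_value


# Hypothetical data: weekly sessions per user in each group.
control = [3, 5, 4, 6, 2, 5, 4, 3, 5, 4]
treated = [6, 7, 5, 8, 6, 7, 5, 6, 9, 7]

effect, p = permutation_test(control, treated)
print(f"estimated lift: {effect:.2f} sessions per user, p = {p:.4f}")
```

A small p-value here only says the gap is unlikely under random relabeling. Whether the lift matters to the business, and whether it will hold up six months later, is still the framing question the role is really about.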
US$134,855 median salary in the United States¹
- Analysis · Beginner · New
Diagnose Churn Drivers for a B2B SaaS Workflow Tool
You receive three CSV exports: 18 months of weekly product-usage events for about 1,800 accounts, the full support-ticket history, and account firmographics (industry, size, pla…
- Exploratory Data Analysis
- Data Wrangling
- Feature Engineering
Applied Data Analysis and Practical Data Science
- Analysis · Beginner · New
Spectral-Analyze Wearable Sleep Data for a Healthtech Pilot
You receive 30 nights of wearable data for each of 25 volunteers, with polysomnography-derived ground-truth stages (Wake / NREM / REM). Engineer spectral features (delta, theta, alpha, …
- Spectral Analysis
- Feature Engineering
- Wavelet Analysis
Time Series Analysis and Forecasting
- Analysis · Beginner · New
Mine Association Rules for a Grocery Retailer's Promo Strategy
You receive 6 months of basket-level transaction data (around 22 million baskets, around 18,000 SKUs) plus a category taxonomy. Run association-rule mining (Apriori or FP-Growth…
- Association Rules
- Market Basket Analysis
- Apriori
Data Mining and Knowledge Discovery
- Analysis · Beginner · New
Forecast Daily Demand for an Apparel Supply-Chain Team
You receive 24 months of daily sales for 500 SKUs across 200 stores, plus calendar features (holidays, promotions, weather codes). Forecast 14 days out per SKU-store. Benchmark …
- Time Series Forecasting
- SARIMA
- Gradient Boosting
Time Series Analysis and Forecasting
Practice your coursework on real scenarios.
Every challenge is shaped from real-world context — not generic exercises. The work mirrors what your degree prepares you for.
Why Ewance
- Analysis · Beginner · New
Analyze a Learning-Analytics Dataset for At-Risk Detection
You receive an anonymized dataset of LMS engagement features (logins, assignment submissions, forum posts, video-watch time), grade history, and a binary label for end-of-semest…
- Learning Analytics
- Classification
- Fairness Metrics
AI in Education and Learning Analytics
- Analysis · Beginner · New
Build a Topic-Modeling Pipeline for Citizen Feedback
Take the 60,000 comments (anonymized). Build a BERTopic pipeline with multilingual sentence embeddings (Catalan + Spanish + occasional English). Tune number-of-topics via topic-…
- Topic Modeling
- BERTopic
- Multilingual NLP
Natural Language Processing
- Analysis · Beginner · New
Detect Fraudulent Refund Requests for a Mid-Market Marketplace
You receive a labeled dataset with buyer history, seller history, shipping carrier, refund reason text, and outcome label (legit / fraud). Train and evaluate at least two classi…
- Classification
- Model Calibration
- Imbalanced Classification
Machine Learning (Undergraduate)
- Analysis · Beginner · New
Model Diffusion of a Hashtag Across a Music-Discovery Platform
You receive 30 days of hashtag-usage data (about 2.4 million events) with account metadata and the follow graph between active hashtag users. Fit an information-diffusion model …
- Diffusion Models
- Network Analysis
- Causal Attribution
Social Network Analysis and Web Science
- Browse challenges
Explore role
Product Manager
Ship product that solves real user problems. Combine user research, prototyping, and stakeholder alignment to turn ambiguous briefs into measurable wins — the role at the centre of modern software teams.
- Code · Beginner · New
Build a Credit-Card Fraud Detector for a Singapore Neobank
You receive 9 months of anonymized authorization data (around 8 million transactions, around 0.4 percent fraud) plus current rule outcomes. Split temporally and train at least t…
- Classification Modeling
- Class Imbalance
- Model Calibration
AI and Quantitative Finance
- Analysis · Beginner · New
Explain a Credit-Risk Model with SHAP for a Fintech
You receive a trained XGBoost credit-risk model (binary default prediction), the training feature schema (38 features), and a held-out 10,000-sample test set with labels. Comput…
- SHAP
- Interpretability
- Fairness Analysis
Explainable and Interpretable AI
- Analysis · Beginner · New
Customer Lifetime Value Model for SaaS Scale-up
You are provided with a dataset of 5,000 customers with features: acquisition channel, monthly spend, tenure, number of support tickets, product usage metrics, and churn flag. Y…
- Customer Lifetime Value
- Predictive Modeling
- Customer Segmentation
Marketing Analytics
- Analysis · Beginner · New
Optimizing Ad Spend for a D2C Cosmetics Brand
You are a data analyst at Glow & Grow. Using the provided dataset (simulated), perform an exploratory data analysis to understand trends in ad performance. Build a regression mo…
- Data Analysis
- Regression
- Data Visualization
Data Analytics for Business
Build a verifiable portfolio.
Submissions become evidence. Reviewers with shipping experience score against a rubric; the result becomes a credential anyone can verify.
- Analysis · Beginner · New
Cluster a Telco's Subscriber Base for a Pricing Refresh
You receive 12 months of anonymized subscriber-level data: monthly minutes, SMS, mobile data, top-up frequency, top-up amount, churn flag, and tenure. Clean and feature-engineer…
- Clustering
- Feature Engineering
- Exploratory Data Analysis
Data Mining and Knowledge Discovery
- Analysis · Beginner · New
Interpretable-by-Design GAM for an Insurer's Claims Triage
You receive an anonymized claims dataset (around 60,000 claims, target: log reserve), a feature schema (22 features), and an existing LightGBM baseline (held-out R^2 of 0.78). T…
- Generalized Additive Models
- EBM
- Interpretability
Explainable and Interpretable AI
- Analysis · Beginner · New
Diagnose Query Failures in an E-Commerce Search Box
You receive 6 months of anonymized query logs (~480 million rows): query string, language hint, results-shown count, top-3 product clicks, and add-to-cart events. Build a notebo…
- Query Log Analysis
- Clustering
- IR Failure Analysis
Information Retrieval and Search
- Analysis · Beginner · New
Refit a Pricing Model for an Insurance Comparison Site
You receive 9 months of quote-impression data (about 14 million events) with about 60 features and click labels. Refit logistic regression with elastic-net regularization plus a…
- Regularized Regression
- Feature Interactions
- Calibration
Statistical Machine Learning
- Code · Beginner · New
Predict Subscription Churn for an EdTech Platform
You receive a CSV with about 18,000 student-month rows: features include login frequency, session length, quiz scores, parent app opens, and plan tier. The target is whether the…
- Supervised Learning
- Logistic Regression
- Gradient Boosting
Machine Learning (Undergraduate)
- Analysis · Beginner · New
Build a Reproducible Pricing Analysis for a DTC Skincare Brand
You receive 24 months of order-line data (around 480,000 lines), a Shopify-style customer export, and a discount-code log. Build a Python pipeline that produces: SKU-level price…
- Data Wrangling
- Exploratory Data Analysis
- Cohort Analysis
Applied Data Analysis and Practical Data Science
- Analysis · Beginner · New
Community Detection on a Pharma Clinical-Trial Investigator Graph
You receive a pre-fetched dump of around 15,000 trials from a public registry covering oncology over the last 10 years and a mapping of trials to investigator names + institutio…
- Community Detection
- Louvain
- Leiden
Machine Learning on Graphs
- Analysis · Beginner · New
Predict Equipment Failure for a Wind-Farm Operator
You receive 18 months of SCADA (Supervisory Control and Data Acquisition — the standard turbine telemetry feed) data sampled every 10 minutes from all 240 turbines, with labeled…
- Classification
- Regularized Regression
- Gradient Boosting
Statistical Machine Learning
- Code · Beginner · New
Predict Catalyst Properties for a Green-Hydrogen Pharma Spinout
Use an open catalyst dataset (e.g., Open Catalyst Project subset, or a Materials Project pull) where each candidate has descriptors and a target activity property. Train a tabul…
- Tabular Modeling
- Uncertainty Quantification
- Feature Engineering
AI for Science and Engineering
- Analysis · Beginner · New
Spectral Clustering for an Urban-Mobility Operator's Network
You receive 6 months of anonymized O-D trip data (around 4 million trips, around 8,000 virtual stations), the current 9 hand-drawn zones, and the operations team's KPIs (rebalan…
- Spectral Methods
- Spectral Clustering
- Graph Laplacian
Machine Learning on Graphs
- Code · Beginner · New
Build a Fairness Evaluation Harness for a Credit-Score Model
Implement a Python module that, given model predictions, ground truth, and group identifiers, computes demographic parity difference, equal-opportunity difference, predictive-pa…
- Algorithmic Fairness
- Statistical Evaluation
- Python or JavaScript
AI Measurement and Evaluation
- Analysis · Beginner · New
Churn Prediction for a Stockholm D2C Cosmetics Brand
You are a data science consultant hired by NordicGlow. Using the provided dataset (synthetic but realistic), you must preprocess the data, engineer features from transaction, cl…
- Data Preprocessing
- Feature Engineering
- Classification
Data Science for Business
- Analysis · Beginner · New
Build a Public Open-Data Dashboard for Urban Mobility
Pull the city's open-data cyclist-collision dataset (10 years of incidents, geocoded). Define a clear before/after window around the protected-lane rollout, control for traffic-…
- Exploratory Data Analysis
- Data Wrangling
- Geospatial Analysis
Applied Data Analysis and Practical Data Science
- Code · Beginner · New
Simulated Annealing for Shift Scheduling at a Hospital
You receive 6 months of anonymized shift demand data, the nurse roster (skills, certifications, contracted hours), and the labor-law hard constraints. Encode the schedule as a 7…
- Simulated Annealing
- Metaheuristics
- Constraint Handling
Evolutionary Computation and Metaheuristic Search
- Code · Beginner · New
Calibrate a Demand Forecast with Bayesian Confidence Intervals
You receive 24 months of weekly demand for 600 SKUs plus the existing XGBoost point predictions. Fit a Bayesian conformal-prediction layer (or, alternatively, a Gaussian-Process…
- Bayesian Inference
- Uncertainty Quantification
- Conformal Prediction
Probabilistic Machine Learning
- Code · Beginner · New
Stack Five Models for a Kaggle-Style Forecasting Bake-Off
You receive a pseudonymized dataset of 24 months of daily shipment volumes across about 200 origin-destination lanes plus weather and holiday features. Train 5 base models, use …
- Ensemble Methods
- Time Series Forecasting
- Feature Engineering
Advanced Machine Learning
- Analysis · Beginner · New
Map Creator Communities for a Short-Form Video Platform
You receive a 90-day sample of about 4 million creator-creator interactions (duets, mentions, audience overlap) and creator metadata (region, language, content tag). Build a cre…
- Network Analysis
- Community Detection
- Graph Visualization
Social Network Analysis and Web Science
How it works
From brief to credential, in six steps.
Step 01
Browse challenges aligned to your studies.
Step 02
Accept the one that fits your goals.
Step 03
Work through it with AI Copilot guidance.
Step 04
Submit for structured evaluation.
Step 05
Earn a verified credential.
Step 06
Add it to LinkedIn with one click.
Related roles you may want to explore
Browse all roles →
Industry teams behind a decade of practitioner briefs
Hiring from this pool?
Sponsor a challenge and meet candidates through actual work.
Industry teams can shape briefs around the skills they hire for, then evaluate students on rubric-scored deliverables — not resumes.
¹ Median annual salary computed from the salary histogram of 50 current Adzuna job postings for "Data Scientist" in the United States, fetched May 2026. Skills come from the disciplines tagged on live Ewance challenges for this role.
Portrait: Photo by Jake Nackos on Unsplash.