- ResearchExpertNew
Benchmark Long-Context Architectures on a Legal-Doc Retrieval Task
You receive a public legal-QA dataset (e.g., LongBench's legal split or similar) filtered to documents over 50,000 tokens. Implement or wrap 3 architectures: a sliding-window Tr…
- Long Context Architectures
- State Space Models
- Transformers
Advanced Deep Learning - ResearchAdvancedNew
Evaluate VAEs vs. Diffusion for Synthetic Tabular-Data Generation
You receive a real labeled dataset (around 18,000 anonymized patient records, 32 features, binary outcome) and the team's existing VAE baseline. Train a tabular diffusion model …
- Tabular Diffusion
- Vae
- Synthetic Data
Generative AI - CodeAdvancedNew
Train a VAE for Synthetic Tabular Data at a Healthtech Startup
You receive a synthetic-but-realistic clinical-trial table (around 50,000 patients, 35 columns, mixed continuous and categorical). Train a tabular VAE (or TVAE/CTGAN as alternat…
- Vae
- Tabular Generation
- Synthetic Data
Deep Generative Models - ResearchExpertNew
Embodied Visual Reasoning for a Warehouse Pick Assistant
Use an embodied simulator (Habitat 3.0 or Isaac Sim — pick one and justify) to render 300 cluttered-bin scenarios with a target item label. For each scenario, build two reasonin…
- Embodied Vision
- Vision Language Models
- Visual Reasoning
Visual Intelligence and Visual Reasoning Practice your coursework on real scenarios.
Every challenge is shaped from real industry context — not generic exercises. The work mirrors what your degree prepares you for.
Why Ewance
- ResearchExpertNew
Validate a Foundation Model for Protein-Ligand Docking Acceleration
Pick 20 publicly available protein-ligand complexes from the PDBbind dataset (or similar public source). Use a published open-source structural foundation model (e.g., a Boltz-s…
- Foundation Model Evaluation
- Structural Biology
- Model Validation
AI for Science and Engineering - ResearchExpertNew
Investigate Scaling Trends on a Small Open Benchmark
You will train 4 transformer language models (10M, 50M, 200M, 600M parameters) on a public pretraining corpus (e.g., a small subset of FineWeb or OpenWebText) under identical op…
- Scaling Laws
- Transformer Pretraining
- Compute Optimal Training
Large Language Models - ResearchExpertNew
Probabilistic Numerics for an ODE-Constrained Battery Model
You receive 12 months of charge/discharge cycle data for 50 battery packs from a delivery-van fleet, plus the existing single-particle ODE degradation model (Python). Use a prob…
- Probabilistic Numerics
- Bayesian Inference
- Ode Modeling
Probabilistic Machine Learning - ResearchExpertNew
Compare RNN vs Transformer for Long-Sequence Modeling
Pick a public trajectory dataset (e.g., Argoverse 2, Waymo Open, or ETH-UCY). Implement three models with comparable parameter counts (around 5M each): an LSTM baseline, a vanil…
- Transformers
- Rnn
- State Space Models
Neural Networks for NLP - Browse challenges
Explore role
Product Manager
Ship product that solves real user problems. Combine user research, prototyping, and stakeholder alignment to turn ambiguous briefs into measurable wins — the role at the centre of modern software teams.
- ResearchExpertNew
Train Cooperative Agents with Multi-Agent RL
Pick an open multi-agent environment (PettingZoo's MPE 'simple_spread', Overcooked-AI, or SMAC). Implement or wrap three methods: IPPO (independent PPO per agent), MAPPO (centra…
- Multi Agent Reinforcement Learning
- Ppo
- Pytorch
Multi-Agent Systems - ResearchExpertNew
Certify Robustness for a Medical-Imaging Classifier
You receive the classifier (a PyTorch ResNet variant) and a 4,000-image labeled validation slice. Apply randomized smoothing (Cohen et al.) at sigma in {0.25, 0.5, 1.0}. Report …
- Certified Robustness
- Randomized Smoothing
- Formal Verification
Trustworthy AI, Robustness, and Safety - ResearchAdvancedNew
Quantify Distribution Shift for a Climate-Risk Model
You receive the model artifact (a gradient boosted regressor predicting expected annual loss per property), 2010-2020 training data, and a 2021-2024 holdout. Quantify covariate …
- Distribution Shift
- Covariate Shift
- Concept Drift
Trustworthy AI, Robustness, and Safety - ResearchExpertNew
Plan a Parameter-Efficient Fine-Tuning Strategy for a Big-Tech AI Lab
You will produce (1) a 6-page survey of four PEFT methods (LoRA, adapters, prefix tuning, IA3) with their strengths, weaknesses, and parameter footprints, (2) a one-page decisio…
- Parameter Efficient Fine Tuning
- Transfer Learning
- Fine Tuning
Meta-Learning, Transfer Learning, and Multi-Task Learning Build a verifiable portfolio.
Submissions become evidence. Reviewers with shipping experience score against a rubric; the result becomes a credential anyone can verify.
Why Ewance
- CodeExpertNew
Run a Backpropagation Bug-Hunt on an Open-Source RL Implementation
You receive the offending fork (around 4,000 lines of PyTorch) and three known-failure seeds. Reproduce the NaN failure deterministically, instrument the forward and backward pa…
- Backpropagation
- Pytorch
- Debugging
Deep Learning - ResearchExpertNew
Benchmark Conformal Prediction for a Healthcare Risk-Score
You receive a labeled dataset of about 25,000 patient encounters with the current risk-score's predictions and ground-truth 1-year outcomes. Implement and compare split conforma…
- Conformal Prediction
- Uncertainty Quantification
- Calibration
Statistical Machine Learning - ResearchExpertNew
Benchmark Reward-from-Feedback Methods on a Tabletop Pick-Place
You will use a Franka Panda arm in PyBullet on a 4-object pick-and-place task. For each of the three feedback methods, train a reward model and a downstream policy until converg…
- Reinforcement Learning
- Reward Learning
- Preference Comparison
Human-Robot Interaction - ResearchExpertNew
Graph Transformer Research Probe for a Drug-Target Predictor
You receive a public drug-target interaction dataset (around 50,000 drug-target pairs with labels and molecular graphs), a strong GIN baseline, and a starter GraphGPS implementa…
- Graph Transformers
- Graph Neural Networks
- Message Passing
Machine Learning on Graphs - ResearchExpertNew
Implement an Autoregressive Model for Anonymized Voice-Synthesis at a Defense Vendor
You receive a public-domain speech dataset (LibriTTS subset, around 50 speakers) and a fixed evaluation protocol (speaker-identifiability AUC, emotion-preservation MOS proxy, in…
- Autoregressive Models
- Voice Conversion
- Speech Synthesis
Deep Generative Models - ResearchExpertNew
Model-Based RL for a Robotic Arm Pick-Place Task
You receive a PyBullet pick-and-place environment (Franka Panda arm, 12 object types, randomized starting poses) and a SAC baseline that hits 85% success after about 1.5 million…
- Model Based Rl
- World Models
- Reinforcement Learning
Deep Reinforcement Learning - ResearchExpertNew
Pre-Register and Run a Small Neural-Network Ablation Study
You will study how three architectural and regularization choices (depth: 2/4/8 hidden layers; activation: ReLU vs. GELU; weight decay: 0 / 1e-4 / 1e-3) affect a small MLP's tes…
- Neural Networks
- Regularization
- Experiment Design
Machine Learning
How it works
From brief to credential, in six steps.
Step 01
Browse challenges aligned to your studies.
Step 02
Accept the one that fits your goals.
Step 03
Work through it with AI Copilot guidance.
Step 04
Submit for structured evaluation.
Step 05
Earn a verified credential.
Step 06
Add it to LinkedIn with one click.
Related roles you may want to explore
Browse all roles →Industry teams behind a decade of practitioner briefs
Hiring from this pool?
Sponsor a challenge and meet candidates through actual work.
Industry teams can shape briefs around the skills they hire for, then evaluate students on rubric-scored deliverables — not resumes.
Skills and disciplines shown on this page are derived from the Ewance challenge catalogue. When the median annual salary is available for this role via Adzuna, it will be shown above with the sample size and country.



















































































