Research Challenges
Literature, primary interviews, synthesis, a defensible thesis. Academic rigor, applied output.
- Research · Senior · New
Investigate Scaling Trends on a Small Open Benchmark
You will train 4 transformer language models (10M, 50M, 200M, 600M parameters) on a public pretraining corpus (e.g., a small subset of FineWeb or OpenWebText) under identical op…
- Scaling Laws
- Transformer Pretraining
- Compute Optimal Training
Large Language Models
- Research · Beginner · New
Hyperparameter Search via CMA-ES for a Pharma QSAR Model
You receive a labeled QSAR dataset (around 25,000 compounds, regression on a binding-affinity target), a fixed feature pipeline (Morgan fingerprints + descriptors), and the team…
- CMA-ES
- Metaheuristics
- Hyperparameter Optimization
Evolutionary Computation and Metaheuristic Search
- Research · Intermediate · New
Quantify Distribution Shift for a Climate-Risk Model
You receive the model artifact (a gradient boosted regressor predicting expected annual loss per property), 2010-2020 training data, and a 2021-2024 holdout. Quantify covariate …
- Distribution Shift
- Covariate Shift
- Concept Drift
Trustworthy AI, Robustness, and Safety
- Research · Beginner · New
Evaluate Open-Source Embedding Models for a Multilingual Help Center
You receive 1,200 labeled (query, relevant-help-article) pairs across 6 languages plus the help-center corpus (~25,000 articles). Index the corpus with each of 4 open-source mul…
- Multilingual Embeddings
- Dense Retrieval
- IR Evaluation
Information Retrieval and Search
Develop in-demand professional skills.
Each challenge names the skills it strengthens. Over time, your profile fills with the competences a hiring manager would actually look for.
Why Ewance
- Research · Beginner · New
Case-Study Analysis of a Public AI Incident
Pick one public AI incident (suggestions: a chatbot's harmful response that went viral, a facial-recognition false-arrest case, a financial-model bias scandal). Produce a 6-page…
- Incident Analysis
- Responsible AI
- Case Study Research
AI Ethics, Fairness, and Responsible AI
- Research · Beginner · New
Evaluate Speech Synthesis Voices for an EdTech Storyteller App
You will generate 60 audio clips (20 per vendor) covering 4 story genres and 3 emotional tones. Recruit 15 native Spanish speakers via a remote panel (Prolific or local equivale…
- TTS Evaluation
- Listening Studies
- MOS Scoring
Speech Recognition and Spoken Language Processing
- Research · Senior · New
DPO Preference-Tune a Code Assistant for Style Compliance
You receive a 7B coding base model, a client's published code-style guide (Python, around 80 pages), and a generated preference dataset (4,000 pairs of code snippets where one m…
- DPO
- Preference Optimization
- Fine-Tuning
Fine-Tuning Large Language Models
- Research · Intermediate · New
Probe a Pretrained Encoder for Linguistic Knowledge
Take BERT-base (or DeBERTa-v3-base). Run layer-wise probes across at least 3 linguistic tasks: part-of-speech tagging, dependency arc classification, and semantic role labeling.…
- Interpretability
- Probing
- Hugging Face Transformers
Neural Networks for NLP
- Browse challenges
Explore role
Strategy Analyst
Frame the business question, model the options, build the recommendation. From market sizing to competitive analysis, this role is where strategy consulting meets in-house decision-making.
- Research · Senior · New
Stress-Test Scalable Oversight on a Tool-Using Agent
Design a sandwich-oversight study: pick a task domain where non-expert oversight is plausible but not trivial (e.g., reviewing data-analysis steps, checking small bug fixes, eva…
- Scalable Oversight
- Alignment Research
- Experimental Design
AI Safety and Alignment
- Research · Intermediate · New
Train a Reinforcement-Learning Locomotion Policy for a Quadruped
You receive a configured Isaac Lab environment for the quadruped, a baseline PPO trainer, and a set of 8 trip-hazard / slip stress scenarios. Train the policy for a budget of ab…
- Reinforcement Learning
- Locomotion
- Domain Randomization
Robotics
- Research · Senior · New
Inductive Logic Programming for a Fraud-Rule Discovery Pilot
You receive a labeled fraud dataset (around 25,000 transactions, around 4% positive class), a feature schema (28 features including device, geo, behavioral history), and a basel…
- Inductive Logic Programming
- Symbolic AI
- Rule Learning
Fuzzy Logic, Knowledge Representation, and Symbolic Reasoning
- Research · Senior · New
Concept-Activation Vectors for an Autonomous-Vehicle Perception Audit
You receive a trained semantic-segmentation model (8 classes including pedestrian, vehicle, road, sky), an internal validation set of 2,500 driving frames, and a small concept-i…
- TCAV
- Concept Explanations
- Interpretability
Explainable and Interpretable AI
Get recognized by recruiters and employers.
Credentials are blockchain-anchored via LearnCoin: tamper-evident, portable, and link-shareable on LinkedIn and beyond.
Why Ewance
- Research · Beginner · New
Evaluate a Generative AI Image Tool with a Within-Subjects Study
You will write a study protocol, recruit 20 participants (a Discord callout is fine), counterbalance the two conditions, and run 45-minute sessions over Zoom. Collect three meas…
- Experimental Design
- User Study
- Within-Subjects Design
Human-Computer Interaction for AI Systems
- Research · Intermediate · New
Compare Kernel Methods to Trees on a Genomics Classification Task
You receive a curated benchmark of about 12,000 labeled variants with ~120 numerical + ~40 string features. Fit kernel SVMs (RBF, polynomial, string), random forest, and XGBoost…
- Kernel Methods
- SVM
- Tree Ensembles
Statistical Machine Learning
- Research · Intermediate · New
Train a Physics-Informed Neural Network for Heat Transfer in a Battery Pack
Solve the 2D unsteady heat-conduction equation on a square cell cross-section with a localized source and Dirichlet boundary conditions on the casing. Implement a baseline finit…
- Physics-Informed Neural Networks
- Partial Differential Equations
- PyTorch or TensorFlow
AI for Science and Engineering
- Research · Intermediate · New
Red-Team Evaluation of a Refusal Policy
You receive the lab's written refusal policy (version 2.3) and a starter set of 60 red-team prompts (10 per category). Extend the set to 240 prompts (40 per category) using docu…
- Red Team Operations
- Refusal Policy
- Alignment Evaluation
Machine Learning from Human Preferences (RLHF and Alignment)
- Research · Intermediate · New
Planning Under Uncertainty for a Last-Mile Delivery Fleet
Build a simulator of the 50-block area with stochastic travel times conditioned on weather and time-of-day. Implement value iteration (for a small state space), MCTS (Monte Carl…
- Planning Under Uncertainty
- Markov Decision Processes
- Monte Carlo Tree Search
Automated Planning
- Research · Senior · New
Self-Supervised Pretraining for a Pathology Foundation Vendor
You receive a public pathology dataset (about 80,000 unlabeled whole-slide-image patches plus a labeled 8,000-patch subtype-classification subset across 4 classes). Pretrain a R…
- Supervised Learning
- Medical Imaging
- Transfer Learning
Machine Learning for Imaging and Medical Image Analysis
- Research · Senior · New
Validate a Foundation Model for Protein-Ligand Docking Acceleration
Pick 20 publicly available protein-ligand complexes from the PDBbind dataset (or similar public source). Use a published open-source structural foundation model (e.g., a Boltz-s…
- Foundation Model Evaluation
- Structural Biology
- Model Validation
AI for Science and Engineering
- Research · Intermediate · New
Fine-Tune a Vision-Language Model for Image Captioning
Take BLIP-2 or LLaVA-1.6 as the base. Fine-tune (LoRA is fine) on a 4,000-image accessibility-curated dataset where each image has a useful caption written by a low-vision-exper…
- Vision Language Models
- Fine-Tuning
- PyTorch or TensorFlow
Multimodal Machine Learning
- Research · Senior · New
Circular Business Model for Sustainable Fashion
You are a business model innovation analyst at EcoWear. Your task is to research and recommend one circular business model (e.g., rental subscription, resale marketplace, or rep…
- Circular Economy
- Business Model Canvas
- Three-Statement Modeling
Business Model Innovation
- Research · Senior · New
Audit a Production Model for Membership Inference Attacks
Use a black-box membership inference attack (e.g., the LiRA or shadow-model attack). You have query access to a sandboxed copy of the model + the original training data labels f…
- Membership Inference
- Privacy Attacks
- Model Evaluation
Privacy-Preserving Machine Learning
- Research · Senior · New
Neuromarketing Audit for a Sustainable Fashion Retailer
Your team will design and simulate a neuromarketing experiment. First, propose two alternative store layouts (e.g., circular vs. grid) and two website homepage designs (e.g., he…
- Neuromarketing
- Experimental Design
- Eye Tracking Analysis
Consumer Behavior
- Research · Senior · New
Compare RNN vs Transformer for Long-Sequence Modeling
Pick a public trajectory dataset (e.g., Argoverse 2, Waymo Open, or ETH-UCY). Implement three models with comparable parameter counts (around 5M each): an LSTM baseline, a vanil…
- Hugging Face Transformers
- RNN
- State Space Models
Neural Networks for NLP
How it works
From brief to credential, in six steps.
Step 01
Browse challenges aligned to your studies.
Step 02
Accept the one that fits your goals.
Step 03
Work through it with AI Copilot guidance.
Step 04
Submit for structured evaluation.
Step 05
Earn a verified credential.
Step 06
Add it to LinkedIn with one click.
Industry teams behind a decade of practitioner briefs
Hiring from this pool?
Sponsor a challenge and meet candidates through actual work.
Industry teams can shape briefs around the skills they hire for, then evaluate students on rubric-scored deliverables rather than resumes.