AI Engineering
Machine Learning Engineer
A model that works on a laptop and a model that works for millions of users are two very different artifacts, and machine learning engineers live in the gap between them. The role exists to take research-grade ML and turn it into reliable production systems, which means caring about latency, retraining pipelines, and what happens when the data distribution shifts at three in the morning.
Students grow into this through hands-on work with PyTorch or TensorFlow plus enough software engineering discipline to run real CI/CD. Tools like AWS SageMaker become part of the workflow.
Strong ML engineers can talk shop with data scientists on one side and platform engineers on the other, and that bilingual quality is often what gets them hired.
- Research · Senior · New
Inductive Logic Programming for a Fraud-Rule Discovery Pilot
You receive a labeled fraud dataset (around 25,000 transactions, around 4% positive class), a feature schema (28 features including device, geo, behavioral history), and a basel…
- Inductive Logic Programming
- Symbolic AI
- Rule Learning
Fuzzy Logic, Knowledge Representation, and Symbolic Reasoning
- Code · Intermediate · New
Build a Vision-Language Search for an E-commerce Catalog
Pick a vision-language encoder (OpenCLIP, SigLIP, or BLIP-2 image-text variant). Index all 600k product images into a vector database (Qdrant/FAISS). Build a query-time pipeline…
- Vision-Language Models
- CLIP
- Vector Database Basics
Multimodal Machine Learning
- Research · Senior · New
Open-Vocabulary Segmentation Benchmark for a Robotics R&D Lab
Use a curated 200-image household scene set (publicly available HM3D renderings or COCO + a handful of household prompts). Benchmark 3 open-vocabulary segmentation models: SAM +…
- Open-Vocabulary Segmentation
- Vision-Language Models
- Benchmarking
Computer Vision
- Research · Intermediate · New
Lab Project: Compare Three Architectures on Your Own Mini-Benchmark
Scope the problem yourself (suggested examples: sentiment classification on a niche domain, tabular anomaly detection, time-series forecasting on a public dataset). Define the t…
- Experimental Design
- A/B Testing with Statistical Significance
- PyTorch or TensorFlow
AI/ML Practicum and Hands-on Lab
Practice your coursework on real scenarios.
Every challenge is shaped from real-world context, not generic exercises. The work mirrors what your degree prepares you for.
- Research · Intermediate · New
Kernel Methods vs. Deep Learning on a Tiny-Data Drug-Discovery Task
You receive (or download) 3 public ADMET datasets from MoleculeNet (e.g., BBBP, Lipophilicity, FreeSolv). For each, train both: (a) a Gaussian process with a Tanimoto kernel ove…
- Kernel Methods
- Gaussian Processes
- Neural Networks
Advanced Machine Learning
- Code · Intermediate · New
Few-Shot Defect Classifier for a Fast-Onboarding Industrial AI Vendor
You receive a multi-customer defect dataset (8 historical customers, 4-6 defect classes each). Treat 6 customers as the meta-training set and 2 as the held-out 'new customer' sc…
- Meta-Learning
- Few-Shot Learning
- Prototypical Networks
Meta-Learning, Transfer Learning, and Multi-Task Learning
- Code · Intermediate · New
Train a Deep Q-Network for Warehouse Robot Routing
You receive a Gymnasium-compatible warehouse simulator (50x50 grid, 8 dynamic obstacle pedestrians, 20 randomized pick locations) and a baseline A* planner script. Train a DQN a…
- Deep Q-Learning
- Reinforcement Learning
- PyTorch or TensorFlow
Deep Reinforcement Learning
- Code · Intermediate · New
Diagnose Equipment Failures with a Bayesian Network
You receive 90 days of sensor logs (vibration, spindle temperature, coolant flow, ambient humidity), the maintenance log of 180 failure events labeled by root cause, and a short…
- Bayesian Networks
- Probabilistic Inference
- Parameter Learning
Probabilistic Graphical Models
- Code · Intermediate · New
Triage Medical-Imaging Annotations with a Small Vision Model
Train a binary normal/abnormal classifier on the public CheXpert or NIH ChestX-ray14 dataset. Use temperature scaling to calibrate the output, then define abstention thresholds …
- CNN Classification
- Transfer Learning
- Calibration
Applied Machine Learning
- Research · Senior · New
Solve a POMDP for a Healthtech Diagnostic Pathway
You receive a simplified pathway: 5 possible underlying conditions, 8 possible diagnostic tests each with documented sensitivity and specificity, and an outcome payoff matrix fr…
- POMDP Modeling
- Belief States
- Approximate Solvers
Decision Making Under Uncertainty
- Code · Beginner · New
Tune a Recommender for an EU Streaming Music App
Use the public Last.fm-360k or similar dataset (anonymized listening histories) as a stand-in. Implement a baseline matrix-factorization recommender, then a hybrid that adds tra…
- Recommender Systems
- Feature Engineering
- Model Evaluation
Applied Machine Learning
- Code · Beginner · New
Forecast Hourly Energy Demand for a Microgrid Operator
You receive 24 months of hourly demand, weather (temperature, irradiance), and calendar data for the community. Build a probabilistic forecaster (e.g., quantile regression with …
- Probabilistic Forecasting
- Quantile Regression
- Deep Forecasting
Time Series Analysis and Forecasting
Build a verifiable portfolio.
Submissions become evidence. Reviewers with shipping experience score against a rubric; the result becomes a credential anyone can verify.
- Code · Senior · New
Train a Reinforcement-Learning Policy for Drone Obstacle Avoidance
You receive a custom Gymnasium drone-flight environment (provided), a baseline hand-engineered controller, and a target evaluation suite covering 4 obstacle densities. Train a P…
- Reinforcement Learning
- PPO
- Robotics Simulation
Advanced Robotics
- Research · Senior · New
Design a Distributed-Training Strategy for a Mid-Sized LLM
You will write a 5-page design memo that picks a parallelism strategy for fine-tuning a 13B model on 32 H100 GPUs, with a tokens-per-second estimate, a memory-per-GPU calculatio…
- Distributed Training
- Parallelism Strategies
- LLM Training
Machine Learning at Scale
- Code · Beginner · New
Build an MLP Baseline for Credit-Default Risk at a Fintech
You receive 18 months of anonymized credit-decision data (around 600,000 applications, 80 features) with a 90-day default label. Train an MLP with regularization (dropout, weigh…
- MLP
- Regularization
- Tabular Deep Learning
Deep Learning
- Research · Intermediate · New
Evaluate a Knowledge-Graph-Augmented Recommender
You receive permission to use the public MovieLens 1M dataset plus a derived item-KG (movie -> genre, director, decade) built from Wikidata. Train two recommenders: a matrix-fac…
- Knowledge Graph Embeddings
- Recommender Systems
- Benchmarking
Knowledge Graphs and Semantic Web
- Code · Intermediate · New
Defect Detection on PCBs for a Hardware-AI Manufacturer
Use the publicly-available PCB defect dataset (e.g., DeepPCB or HRIPCB). Fine-tune a small object detector (YOLOv8n or RT-DETR-small) on the 6 defect classes. Evaluate mean Aver…
- Object Detection
- Transfer Learning
- Model Evaluation
Computer Vision
- Code · Intermediate · New
Design a Visual Search Backend for a Boutique Luxury Marketplace
You receive a catalog of 80,000 luxury items (image + sparse metadata) and a labeled query set of 300 user photos with hand-picked target items. Choose an embedding strategy (CL…
- Visual Search
- Word Embeddings
- CLIP
Deep Learning for Computer Vision
- Research · Senior · New
Reproduce a Mechanistic Interpretability Result on a Small Transformer
Pick a published mechanistic-interpretability paper that operates on a small (under 1 billion parameter) open-source transformer (e.g., GPT-2 small, Pythia 70M). Set up the envi…
- Mechanistic Interpretability
- Transformer Internals
- PyTorch or TensorFlow
AI Safety and Alignment
- Code · Beginner · New
Ship a Churn-Prediction Mini-Project End to End
You receive a 12-month anonymized dataset of subscriber events (logins, lesson completions, payment history, support tickets) for around 200,000 users. Define churn precisely (n…
- Feature Engineering
- Model Evaluation
- Gradient Boosting
AI/ML Practicum and Hands-on Lab
- Code · Senior · New
Video Action Recognition for a Retail Loss-Prevention Startup
Use a public action-recognition dataset (UCF101 + a small curated retail-action subset; the latter is provided as synthetic data, or you can label 50 short clips). Fine-tune a small back…
- Video Understanding
- Action Recognition
- Transfer Learning
Computer Vision
- Design · Senior · New
Design a Distributed Training Job for a 13B-Parameter Model
Decide whether to use Fully Sharded Data Parallel (FSDP), Tensor Parallelism, Pipeline Parallelism, or a hybrid; justify against the 13B-param + 32-H100 setup. Calculate memory …
- Distributed Training
- FSDP
- PyTorch or TensorFlow
Machine Learning Systems
- Analysis · Intermediate · New
Imitation Learning from Human Demos for a Drone Inspection
You receive 6 hours of expert pilot demonstrations (state-action pairs at 20 Hz) recorded in an AirSim wind-farm environment with 3 turbine designs, plus a held-out 4th turbine …
- Imitation Learning
- Behavioral Cloning
- DAgger
Deep Reinforcement Learning
- Code · Intermediate · New
Fine-Tune a Diffusion Model for a Sustainable-Fashion Mood-Board Tool
You receive around 1,200 curated images of sustainable garments tagged with silhouette and material. Choose a base diffusion model (Stable Diffusion 1.5/2.1 or SDXL) and apply L…
- Diffusion Models
- Fine-Tuning
- AI Image Generation
Deep Generative Models
- Code · Beginner · New
Team Practicum: Build a Crop-Disease Classifier with a Field Partner
You receive a labeled dataset of about 8,000 phone photos plus around 1,200 unlabeled photos from a held-out county. Audit and clean the labels (expect 5-10% noise), train a Mob…
- Transfer Learning
- PyTorch or TensorFlow
- Model Evaluation
AI/ML Practicum and Hands-on Lab
- Research · Intermediate · New
Planning Under Uncertainty for a Last-Mile Delivery Fleet
Build a simulator of the 50-block area with stochastic travel times conditioned on weather and time-of-day. Implement value iteration (for a small state space), MCTS (Monte Carl…
- Planning Under Uncertainty
- Markov Decision Processes
- Monte Carlo Tree Search
Automated Planning
- Research · Intermediate · New
Sim-to-Real Domain Randomization for a Mobile Robot
You receive an Isaac Sim navigation environment, a baseline trained policy, a 50-episode real-bench test set (recorded sensor streams + ground truth) for offline policy evaluati…
- Domain Randomization
- Sim-to-Real
- Robot Navigation
Robot Learning
- Research · Intermediate · New
Train a Reinforcement-Learning Locomotion Policy for a Quadruped
You receive a configured Isaac Lab environment for the quadruped, a baseline PPO trainer, and a set of 8 trip-hazard / slip stress scenarios. Train the policy for a budget of ab…
- Reinforcement Learning
- Locomotion
- Domain Randomization
Robotics
- Research · Senior · New
Curriculum RL for a Simulated Drone Inspection Task
You receive a PyBullet-based wind-turbine inspection simulator with parameterizable wind, blade orientation, and sensor noise. Design a 3-stage curriculum: (1) hover near a stat…
- PPO
- Curriculum Learning
- Deep RL
Reinforcement Learning
- Research · Intermediate · New
Train a NeRF for Real-Estate Virtual Tours
You receive a curated dataset of 3 apartments, each with around 120 input images and known camera poses (already SfM-processed). Train a NeRF variant (Instant-NGP or Nerfacto re…
- Neural Scene Representation
- NeRF
- PyTorch or TensorFlow
3D Vision and Multi-View Geometry
- Research · Intermediate · New
Benchmark Graph-Embedding Methods on a Climate-Network Dataset
You receive a 200M-edge sample of the knowledge graph and a labeled entity-similarity test set (5,000 pairs with relevance labels). Benchmark three methods: a shallow embedding …
- Graph Embeddings
- Neural Networks
- Scalable ML
Machine Learning at Scale
- Research · Intermediate · New
Explore the Bias-Variance Trade-off on a Tabular Healthcare Cohort
You receive a 90,000-patient de-identified tabular dataset (demographics, labs, claims-derived features) and a binary 12-month-readmission outcome. Pick three model f…
- Bias-Variance Trade-off
- Regularization
- Model Selection
Machine Learning
- Research · Senior · New
Self-Supervised Pretraining for a Pathology Foundation Vendor
You receive a public pathology dataset (about 80,000 unlabeled whole-slide-image patches plus a labeled 8,000-patch subtype-classification subset across 4 classes). Pretrain a R…
- Supervised Learning
- Medical Imaging
- Transfer Learning
Machine Learning for Imaging and Medical Image Analysis
- Research · Intermediate · New
Reward Shaping for a Quadruped Locomotion Policy
You receive a quadruped locomotion environment (Isaac Lab or pybullet-quadruped), the previous reward function (5 terms), and a budget of 6 training runs. Design 4 reward varian…
- Reward Shaping
- PPO
- Locomotion
Robot Learning
- Research · Senior · New
Price American Options with a Deep Hedging Notebook
Simulate price paths for a single underlying (geometric Brownian motion is fine as a baseline; bonus for stochastic volatility). Implement Longstaff-Schwartz Monte Carlo as the …
- Deep Learning
- Stochastic Modeling
- Derivatives Pricing
AI and Quantitative Finance
- Code · Intermediate · New
DPO Fine-Tune for a Domain-Specific Writing Assistant
You receive a base instruction-tuned model checkpoint plus 2,500 preference pairs from editorial reviews (each pair: two grant-application paragraphs, the editor-preferred winne…
- DPO
- Preference Learning
- Model Fine-Tuning
Machine Learning from Human Preferences (RLHF and Alignment)
- Research · Senior · New
Pretrain a Small Vision Transformer with Self-Supervised Learning
You receive 80,000 unlabeled 224x224 histology tiles plus 4,000 labeled tiles split into train/val/test. Pretrain a ViT-Small using a self-supervised method of your choice (DINO…
- Supervised Learning
- Vision Transformers
- PyTorch or TensorFlow
Advanced Deep Learning
- Code · Senior · New
PPO Alignment Loop with a Pretrained Reward Model
You receive a small open-weights base model (around 7B), a previously trained reward model, and 5,000 prompts (no responses) for PPO rollouts. Run PPO with TRL's PPOTrainer for …
- RLHF
- PPO
- Reward Hacking
Machine Learning from Human Preferences (RLHF and Alignment)
- Design · Intermediate · New
Train a Self-Play Agent for a Card-Game Edtech Demo
Implement a small two-player imperfect-information card game (Kuhn poker or a 3-card simplified Hold'em variant). Implement CFR or CFR+ for the game and run self-play for at lea…
- Counterfactual Regret Minimization
- Self-Play
- Game Theory
Artificial Intelligence: Principles and Techniques
- Research · Intermediate · New
Build a Generalization-Bound Tutorial for an MLE Onboarding Track
You will produce a Jupyter-notebook tutorial covering (1) sample-complexity intuition, (2) VC-dimension with worked examples for halfspaces and decision stumps, (3) Rademacher c…
- Statistical Learning Theory
- VC Dimension
- Rademacher Complexity
Statistical Machine Learning
- Code · Intermediate · New
Train a Multimodal Classifier for Medical Triage
Pick a fusion architecture (early fusion via cross-attention, late fusion via score combination, or a unified multimodal encoder like FLAVA/CoCa). Train on the 14,000 pairs with…
- Multimodal Fusion
- Cross-Attention
- PyTorch or TensorFlow
Multimodal Machine Learning
- Code · Intermediate · New
Fine-Tune a Diffusion Model for an E-commerce Product Studio
You receive 1,200 curated product + lifestyle images across 6 product categories, a brand-style guide, and the company's current studio cost per image (around EUR 18). Fine-tune…
- Diffusion Models
- Stable Diffusion
- DreamBooth
Generative AI
- Code · Intermediate · New
Build a GAN-Based Defect Generator for a Hardware Manufacturing Line
You receive around 60,000 good-unit images and around 380 defective-unit images across 4 defect classes. Train a class-conditional GAN (StyleGAN2-ADA or a smaller alternative fo…
- GANs
- Class-Conditional Generation
- Data Augmentation
Deep Generative Models
- Code · Senior · New
Coordinate a Fleet of Warehouse Robots
Implement a simulated warehouse grid with 80 robots solving a pick-and-deliver workload. Design a decentralized coordination protocol (recommend a contract-net or auction-based …
- Multi-Agent Coordination
- Decentralized Algorithms
- Simulation
Multi-Agent Systems
- Code · Senior · New
Triage Brain-CT Stroke Detector with Calibrated Uncertainty
You receive a curated public head-CT dataset (about 2,800 scans, slice-level labels for hemorrhagic stroke) and a held-out 600-scan hospital cohort. Train a 3D CNN or 2.5D slice…
- Medical Imaging
- Neural Networks
- Uncertainty Quantification
Machine Learning for Imaging and Medical Image Analysis
- Code · Senior · New
Train a Manipulation Policy for Bin Picking with Imitation Learning
You receive a dataset of 500 teleop trajectories on the in-distribution part plus a held-out simulation environment with a never-seen part. Train an imitation-learning policy (D…
- Imitation Learning
- Manipulation
- Diffusion Policy
Advanced Robotics
- Code · Intermediate · New
Build a Federated Learning Prototype Across Two Hospitals
Simulate two sites with non-IID data splits (one site skews older, the other younger). Implement FedAvg using Flower (or PySyft). Run for at least 50 communication rounds; repor…
- Federated Learning
- FedAvg
- Secure Aggregation
Privacy-Preserving Machine Learning
- Code · Intermediate · New
Build a Small Transformer from Scratch and Train It on Code
Implement multi-head self-attention, RMSNorm, rotary positional embeddings, and a causal LM head from scratch — no Hugging Face shortcuts for the model code (you may use Hugging…
- Hugging Face Transformers
- Self-Attention
- PyTorch or TensorFlow
Neural Networks for NLP
- Code · Beginner · New
Behavior Cloning for a Pick-and-Place Manipulator
You receive 200 human teleoperated demonstrations (state + action trajectories) of picking 8 small electronic components from a tray and placing them at marked locations in a ro…
- Behavior Cloning
- Imitation Learning
- Manipulation
Robot Learning
- Code · Senior · New
Profile and Cut Inference Cost on a Recommender at Scale
You receive (1) a frozen ONNX export of the production model, (2) a sample request trace of 24 hours at 1% sampling, and (3) a single A100-class GPU sandbox. Profile with NVIDIA…
- GPU Profiling
- Model Quantization
- Inference Optimization
Machine Learning Systems
- Research · Intermediate · New
Multi-Task Learning for a Healthtech Triage Model
You receive 40,000 de-identified intake-form records with two labels: urgency tier (4 classes) and routed sub-specialty (12 classes). Train (1) two independent classi…
- Multi-Task Learning
- Transfer Learning
- Hugging Face Transformers
Meta-Learning, Transfer Learning, and Multi-Task Learning
- Code · Intermediate · New
Prune and Distill a Speech Model for a Hearable
You receive a trained 280 KB CNN keyword spotter (10 keywords + silence + unknown) with 96.1% top-1 accuracy on the Google Speech Commands test set. Apply structured pruning (ch…
- Pruning
- Knowledge Distillation
- Model Compression
Edge ML and On-Device Machine Learning
- Analysis · Intermediate · New
Compare Kernel SVMs and Gradient Boosting on Imbalanced Tabular Data
You receive a 220,000-row anonymized loan-default dataset with mixed numeric and categorical features and a ~6% positive class. Train and evaluate (1) an RBF-kernel SVM with pro…
- Kernel Methods
- Gradient Boosting
- Model Selection
Machine Learning
- Research · Intermediate · New
Probe a Pretrained Encoder for Linguistic Knowledge
Take BERT-base (or DeBERTa-v3-base). Run layer-wise probes across at least 3 linguistic tasks: part-of-speech tagging, dependency arc classification, and semantic role labeling.…
- Interpretability
- Probing
- Hugging Face Transformers
Neural Networks for NLP
- Research · Senior · New
Quantify Sim-to-Real Gap for a Warehouse Manipulation Policy
You receive a trained pick-and-place policy (PyTorch), the simulation env (Isaac Lab), and access to a real-arm rig (or recorded teleop episodes if hardware is unavailable). Def…
- Sim-to-Real
- Manipulation
- Experimental Design
Robot Perception and Autonomy
- Analysis · Intermediate · New
Transfer-Learning Backbone Bake-Off for Retail Product Tagging
You receive 80,000 retail product images tagged with multiple labels from a 250-tag taxonomy. Use each of the three pretrained backbones via two transfer strategies: (1) linear …
- Transfer Learning
- Fine-Tuning
- Supervised Learning
Meta-Learning, Transfer Learning, and Multi-Task Learning
- Code · Intermediate · New
Variational Autoencoder for Synthetic Tabular Banking Data
You receive a 500K-row anonymized transaction dataset with 25 columns (mixed numerical + categorical). Train a VAE (TabVAE or a small custom model) with appropriate likelihoods …
- Variational Inference
- Deep Generative Models
- Synthetic Data
Probabilistic Machine Learning
- Research · Senior · New
Train a Small Diffusion Model for Synthetic Defect Generation
You receive 2,000 labeled defect images and 18,000 clean weld images. Train a small class-conditional latent diffusion model on the defect images (Hugging Face diffusers is fine…
- Generative Perception
- Diffusion Models
- Data Augmentation
Machine Perception
- Research · Senior · New
SAT-Based Planner for Smart-Grid Demand Response
Encode the dispatch problem (which customers to curtail by how much, respecting per-customer contractual caps and grid-cell totals) as a SAT or MaxSAT instance. Solve 50 histori…
- SAT-Based Planning
- Constraint Encoding
- Benchmarking
Automated Planning
- Code · Intermediate · New
Train a Reward Model on Customer-Support Preferences
You receive 8,000 labeled preference pairs from real support conversations (each pair is two model responses with a human-chosen winner). Fine-tune a small open-weights base mod…
- Reward Modeling
- Preference Learning
- Bradley-Terry Loss
Machine Learning from Human Preferences (RLHF and Alignment)
How it works
From brief to credential, in six steps.
Step 01
Browse challenges aligned to your studies.
Step 02
Accept the one that fits your goals.
Step 03
Work through it with AI Copilot guidance.
Step 04
Submit for structured evaluation.
Step 05
Earn a verified credential.
Step 06
Add it to LinkedIn with one click.
Related roles you may want to explore
AI Engineering
AI Engineer
Between a promising research paper and a feature people actually use sits a long, unglamorous bridge — and AI engineers build it. The job is taking models that work in notebooks and shaping them into systems that hold up under real traffic, real costs, and real users with messy questions. Good work here looks like a retrieval pipeline that gets answers right ninety-something percent of the time, with evaluation harnesses catching regressions before they ship. Students grow into this role by treating Python and PyTorch as instruments rather than checkboxes, then learning how to reason about latency, evaluation, and cost together. If you enjoy stitching ideas into running software, this path will feel like home.
AI Engineering
Computer Vision Engineer
Teaching a machine to see is harder than it sounds and more interesting than it looks. Computer vision engineers shape the systems that read documents, navigate self-driving cars, screen medical images, and answer questions about photos. The role mixes the math of multi-view geometry with the engineering grind of getting models small and fast enough to run where they're needed — sometimes on a phone, sometimes on a robot. Good work here looks like a pipeline that holds up in real lighting, real motion, and real failure modes. Students grow into this path by getting hands-on with OpenCV and PyTorch early, then learning the harder craft of optimizing models without quietly destroying their accuracy.
AI Engineering
MLOps Engineer
Models in production fail in stranger ways than models in notebooks ever could. The MLOps engineer is the person who anticipates those failures and builds the scaffolding that makes machine learning survive contact with real users. Think feature stores that stay consistent between training and serving, deployment pipelines through MLflow that make rollbacks boring, and observability that catches drift before stakeholders notice. The work sits at the intersection of platform engineering and data science, and rewards people who like building tools other engineers will rely on. A student grows into this role by getting comfortable with Kubernetes early and developing taste for what a healthy ML system actually looks like under load.
AI Engineering
NLP Engineer
Language is messy. People misspell, contradict themselves, ask the same thing five different ways, and expect a machine to understand. NLP engineers build the systems that try. The role spans classical text processing in spaCy, modern retrieval-augmented architectures stitched together with LangChain, and the constant judgment calls about when to fine-tune, when to prompt, and when to fall back to rules. It rewards people who love both linguistics and systems thinking. Students grow into it through small projects — a question-answering bot over their notes, a classifier for their inbox — that surface the real failure modes of language models. Good NLP engineers obsess over evaluation as much as architecture.
AI Engineering
Prompt Engineer
Writing instructions for a model is a strange new craft. The words you choose, their order, the examples you include — all shape what a multi-billion-parameter system actually does next. Prompt engineers treat this as a real engineering discipline: versioning prompts in tools like PromptLayer, running evaluations across thousands of test cases, optimizing for cost and latency in production, and collaborating with domain experts to encode their judgment in text. The role is new enough that students often help define it on the job. Growing into it means building intuition for how models fail, when to fine-tune instead, and how to write specs precise enough to ship. Good prompt engineers measure everything and trust vibes only as a starting point.
Industry teams behind a decade of practitioner briefs
Hiring from this pool?
Sponsor a challenge and meet candidates through actual work.
Industry teams can shape briefs around the skills they hire for, then evaluate students on rubric-scored deliverables — not resumes.
Skills and disciplines shown on this page are derived from the Ewance challenge catalogue. When the median annual salary is available for this role via Adzuna, it will be shown above with the sample size and country.
Portrait: Photo by Yevgeniy KHVAN on Unsplash.