AI Engineering
AI Engineer
Between a promising research paper and a feature people actually use sits a long, unglamorous bridge — and AI engineers build it. The job is taking models that work in notebooks and shaping them into systems that hold up under real traffic, real costs, and real users with messy questions.
Good work here looks like a retrieval pipeline that gets answers right ninety-something percent of the time, with evaluation harnesses catching regressions before they ship. Students grow into this role by treating Python and PyTorch as instruments rather than checkboxes, then learning how to reason about latency, evaluation, and cost together.
If you enjoy stitching ideas into running software, this path will feel like home.
- CodeIntermediateNew
Multi-Agent Research Assistant for Biotech Patent Review
You receive 20 historical patent applications with the firm's own prior-art memos as ground truth. Design and build a 3-agent system: (a) Searcher — issues queries to a patent-s…
- Ai Agents
- Multi Agent Collaboration
- Agent Evaluation
AI Agents and LLM-Based Agents - CodeIntermediateNew
Localize a Mobile Robot with Particle-Filter SLAM
You receive 4 ROS bags from real customer plants, each containing 2D LiDAR scans, wheel odometry, and ground-truth poses (from a motion-capture cell used only for evaluation). I…
- State Estimation
- Particle Filter
- Slam
Advanced Robotics - CodeIntermediateNew
Build a Cross-Lingual Retrieval-Augmented QA System
Index around 5,000 internal-knowledge docs across the three languages using a multilingual embedding model (e.g., multilingual-e5 or BGE-M3). Build the retrieval-then-answer pip…
- RAG Architectures
- Cross Lingual Retrieval
- Multilingual Embeddings
Neural Networks for NLP - CodeIntermediateNew
Build a Vector-Search Backend for an Enterprise AI Knowledge Assistant
You receive a corpus of around 20,000 PDFs (mixed scanned and digital) totalling around 30 GB and a labeled retrieval set of 200 queries with human-judged ground-truth passages.…
- RAG Architectures
- Vector Database Basics
- Word Embeddings
Data Engineering and Big Data Systems Practice your coursework on real scenarios.
Every challenge is shaped from real-world context — not generic exercises. The work mirrors what your degree prepares you for.
Why Ewance
- CodeIntermediateNew
Hierarchical Plans for an Aerospace Maintenance Crew Scheduler
You receive a synthetic week of 80 work orders with hierarchical decompositions, technician certifications, and shared-tool constraints. Implement an HTN planner (PyHOP or HDDL …
- Htn Planning
- Domain Modeling
- Constraint Handling
Automated Planning - CodeIntermediateNew
Build an Internal-Tools Agent for a Mid-Cap Enterprise
You receive OpenAPI specs for 4 mock internal APIs and 30 reference question-answer pairs spanning easy lookups and multi-tool chains. Build the agent using an LLM tool-use fram…
- Ai Agents
- Tool Use
- Agent Evaluation
AI Agents and LLM-Based Agents - CodeIntermediateNew
Build a Tool-Calling Agent for an Internal Reporting Bot
You will implement the agent in either LangChain or LlamaIndex (your choice; defend it in the readme). Wire 4 tools: (1) read-only SQL on a sample warehouse, (2) a mocked BI met…
- Agent Orchestration
- Tool Calling
- Langchain Or Llamaindex
LLM Application Development - CodeIntermediateNew
Design a Force-Controlled Polishing Skill for a Watchmaker
You receive simulated polishing trajectories from the manufacturer's robot, force-sensor logs from 20 master-craftsman demonstrations, and a quality-rubric (mirror finish 1-5) f…
- Impedance Control
- Force Control
- Manipulation
Robotics - Browse challenges
Explore role
Product Manager
Ship product that solves real user problems. Combine user research, prototyping, and stakeholder alignment to turn ambiguous briefs into measurable wins — the role at the centre of modern software teams.
- CodeIntermediateNew
Build a Multimodal Generation Pipeline for a Tourism Operator
You receive 40 sample 30-second videos shot by tour guides, the operator's brand voice doc, and SEO keyword lists for EN/PT/ES. Build a pipeline that (1) extracts a representati…
- Multimodal Generation
- Vision Language Models
- LLM Inference
Generative AI - DesignIntermediateNew
Visualize Embedding Drift for a RAG Knowledge Assistant
You receive weekly snapshots over 12 weeks of around 50,000 document embeddings each (1024-dim). Design and build a visualization tool that: (a) projects each snapshot to 2D wit…
- Word Embeddings
- Dimensionality Reduction
- Umap
Data Visualization - CodeIntermediateNew
Finetune a Diffusion Model for Sustainable-Fashion Mockups
You receive 1,200 product photos with paired captions and the brand's style guide. Fine-tune a Stable-Diffusion-class base model with LoRA (Low-Rank Adaptation, a parameter-effi…
- Diffusion Models
- Lora Finetuning
- Pytorch Or Tensorflow
Advanced Deep Learning - CodeIntermediateNew
Agentic RAG with Context-Window Budgeting
You receive a synthetic dataset of 60 founder-style queries paired with 'workspaces' (each up to 500 documents across 3 source types), plus gold-standard answers and citation li…
- Agentic RAG
- Context Window Management
- Iterative Retrieval
Retrieval-Augmented Generation Build a verifiable portfolio.
Submissions become evidence. Reviewers with shipping experience score against a rubric; the result becomes a credential anyone can verify.
Why Ewance
- CodeIntermediateNew
Wire a Knowledge Graph into a Pharma RAG Assistant
You receive: 100 internal benchmark questions with reference answers; a 50,000-document anonymized RAG index; a curated drug-target-disease KG (~80,000 triples) loaded into a tr…
- Kg Grounded RAG
- Sparql
- Entity Linking
Knowledge Graphs and Semantic Web - CodeIntermediateNew
ReAct Agent for Legal-Research Tool-Use
You receive 30 research questions with paralegal-written gold answers and citation lists, plus stubbed implementations of the 4 tools (you do not need to build retrieval — just …
- React Prompting
- Tool Use
- Agent Design
Prompt Engineering - AnalysisIntermediateNew
Evaluate an Agent Suite on the SWE-Bench-Style Coding Benchmark
You receive a sandboxed set of 50 small repo-modification tasks (test-passing as the success signal). Run 3 open-source agent frameworks (e.g., OpenHands, SWE-agent, and Aider) …
- Ai Agents
- Agent Evaluation
- Benchmarking
AI Agents and LLM-Based Agents - CodeIntermediateNew
Ship a Streaming RAG Endpoint with Caching and Fallbacks
You will build a FastAPI service exposing one POST /chat endpoint that (1) streams tokens via Server-Sent Events, (2) caches identical (system_prompt, query, retrieved_context) …
- LLM API Integration
- Streaming
- Response Caching
LLM Application Development - CodeIntermediateNew
Implement Model Predictive Control for a Delivery Robot
You receive a kinematic bicycle model of the robot, a published track layout, and 30 minutes of recorded waypoint trajectories. Implement a nonlinear MPC controller using acados…
- Model Predictive Control
- Optimal Control
- Robotics Simulation
Advanced Robotics - CodeIntermediateNew
Temporal Planner for a Robotics Mission Operator
You receive 30 days of mission logs with task lists, time windows, and actual durations. Encode the planning problem with temporal PDDL (PDDL 2.1 durative actions) and solve wit…
- Temporal Planning
- Pddl Modeling
- Simulation
Automated Planning - AnalysisIntermediateNew
Cut Latency and Cost on a High-Volume Summarization Service
You receive 30 days of anonymized request logs (prompt token counts, completion token counts, latencies, models used). Profile the cost and latency distribution, then design and…
- Finops & Cost Optimization
- Latency Optimization
- Prompt Compression
LLM Application Development - CodeIntermediateNew
Build a LangGraph Multi-Agent Researcher
Design the four-agent topology with explicit message contracts. Implement each agent as a separate LLM call with role-specific system prompts, tool access (web search for resear…
- Multi Agent Orchestration
- Langgraph Or Crewai Workflows
- Tool Use
Multi-Agent Systems - CodeIntermediateNew
Reason over a Climate Policy Knowledge Graph for an EU Think Tank
Design a knowledge graph schema covering regulations, member states, sectors, transposition dates, and source-document citations. Ingest a curated dataset of around 200 nodes th…
- Knowledge Graphs
- Knowledge Representation
- Rule Based Reasoning
Artificial Intelligence: Principles and Techniques - CodeIntermediateNew
Design an SAT-Based Verifier for an Autonomous-Vehicle Test Lab
Model a simplified four-way intersection: agent positions, lights, and discrete time steps. Define 5 safety properties in propositional logic (e.g., 'no two agents in the inters…
- Sat Solving
- Logical Inference
- Formal Verification
Artificial Intelligence: Principles and Techniques - CodeIntermediateNew
Plan Warehouse Pick Routes with a Classical Planner
You receive a stylized warehouse map (aisle graph), 30 sample shifts of pick tasks, and the current heuristic's outputs. Write a PDDL domain + problem generator, solve with at l…
- Pddl Modeling
- State Space Search
- Classical Planning
Automated Planning - CodeIntermediateNew
Design Safe Navigation Behavior for a Hospital Delivery Robot
You receive a dataset of 200 anonymized hospital corridor traces (people positions over time from the robot's LIDAR) plus the current planner's parameters. Design a policy that …
- Human Aware Navigation
- Ros2
- Motion Planning
Human-Robot Interaction - ResearchIntermediateNew
QLoRA Fine-Tune for a Customer-Support Domain Assistant
You receive 8,000 anonymized support ticket pairs (question -> agent response), the company's product documentation (around 600 pages), and a strong RAG baseline already running…
- Qlora
- Fine Tuning
- RAG Architectures
Fine-Tuning Large Language Models - ResearchIntermediateNew
Neuro-Symbolic Question Answering on an Enterprise Knowledge Graph
You receive a curated Turtle-format knowledge graph (around 2 million triples covering organizational structure, products, projects), 200 labeled question-SPARQL pairs split 140…
- Neuro Symbolic
- Sparql
- Knowledge Graphs
Fuzzy Logic, Knowledge Representation, and Symbolic Reasoning - CodeIntermediateNew
Ship an MVP RAG Knowledge Assistant for a Climate-Tech Startup
As a 4-person team across a 6-week sprint, ship: (1) an ingestion pipeline for around 4,000 mixed PDFs and markdown files; (2) a vector store with documented chunking strategy; …
- RAG Architectures
- Software Engineering For Ai
- Vector Databases
AI Software Engineering Group Project
How it works
From brief to credential, in six steps.
Step 01
Browse challenges aligned to your studies.
Step 02
Accept the one that fits your goals.
Step 03
Work through it with AI Copilot guidance.
Step 04
Submit for structured evaluation.
Step 05
Earn a verified credential.
Step 06
Add it to LinkedIn with one click.
Related roles you may want to explore
Browse all roles →AI Engineering
Computer Vision Engineer
Teaching a machine to see is harder than it sounds and more interesting than it looks. Computer vision engineers shape the systems that read documents, navigate self-driving cars, screen medical images, and answer questions about photos. The role mixes the math of multi-view geometry with the engineering grind of getting models small and fast enough to run where they're needed — sometimes on a phone, sometimes on a robot. Good work here looks like a pipeline that holds up in real lighting, real motion, and real failure modes. Students grow into this path by getting hands-on with OpenCV and PyTorch early, then learning the harder craft of optimizing models without quietly destroying their accuracy.
AI Engineering
Machine Learning Engineer
A model that works on a laptop and a model that works for millions of users are two very different artifacts, and machine learning engineers live in the gap between them. The role exists to take research-grade ML and turn it into reliable production systems, which means caring about latency, retraining pipelines, and what happens when the data distribution shifts at three in the morning. Students grow into this through hands-on work with PyTorch or TensorFlow plus enough software engineering discipline to run real CI/CD. Tools like AWS SageMaker become part of the workflow. Strong ML engineers can talk shop with data scientists on one side and platform engineers on the other, and that bilingual quality is often what gets them hired.
AI Engineering
MLOps Engineer
Models in production fail in stranger ways than models in notebooks ever could. The MLOps engineer is the person who anticipates those failures and builds the scaffolding that makes machine learning survive contact with real users. Think feature stores that stay consistent between training and serving, deployment pipelines through MLflow that make rollbacks boring, and observability that catches drift before stakeholders notice. The work sits at the intersection of platform engineering and data science, and rewards people who like building tools other engineers will rely on. A student grows into this role by getting comfortable with Kubernetes early and developing taste for what a healthy ML system actually looks like under load.
AI Engineering
NLP Engineer
Language is messy. People misspell, contradict themselves, ask the same thing five different ways, and expect a machine to understand. NLP engineers build the systems that try. The role spans classical text processing in spaCy, modern retrieval-augmented architectures stitched together with LangChain, and the constant judgment calls about when to fine-tune, when to prompt, and when to fall back to rules. It rewards people who love both linguistics and systems thinking. Students grow into it through small projects — a question-answering bot over their notes, a classifier for their inbox — that surface the real failure modes of language models. Good NLP engineers obsess over evaluation as much as architecture.
AI Engineering
Prompt Engineer
Writing instructions for a model is a strange new craft. The words you choose, their order, the examples you include — all shape what a multi-billion-parameter system actually does next. Prompt engineers treat this as a real engineering discipline: versioning prompts in tools like PromptLayer, running evaluations across thousands of test cases, optimizing for cost and latency in production, and collaborating with domain experts to encode their judgment in text. The role is new enough that students often help define it on the job. Growing into it means building intuition for how models fail, when to fine-tune instead, and how to write specs precise enough to ship. Good prompt engineers measure everything and trust vibes only as a starting point.
Industry teams behind a decade of practitioner briefs
Hiring from this pool?
Sponsor a challenge and meet candidates through actual work.
Industry teams can shape briefs around the skills they hire for, then evaluate students on rubric-scored deliverables — not resumes.
Skills and disciplines shown on this page are derived from the Ewance challenge catalogue. When the median annual salary is available for this role via Adzuna, it will be shown above with the sample size and country.
Portrait: Photo by Ludovic Migneault on Unsplash.



















































































