AI Engineering
NLP Engineer
Language is messy. People misspell, contradict themselves, ask the same thing five different ways, and expect a machine to understand. NLP engineers build the systems that try.
The role spans classical text processing in spaCy, modern retrieval-augmented architectures stitched together with LangChain, and the constant judgment calls about when to fine-tune, when to prompt, and when to fall back to rules. It rewards people who love both linguistics and systems thinking. Students grow into it through small projects — a question-answering bot over their notes, a classifier for their inbox — that surface the real failure modes of language models.
Good NLP engineers obsess over evaluation as much as architecture.
- CodeIntermediateNew
Parse and Structure Clinical Discharge Summaries
Combine traditional NLP (section segmentation, sentence parsing) with LLM extraction (small open model + structured-output enforcement). Build the pipeline so every extracted fi…
- Structured Extraction
- Clinical NLP
- Parsing
Natural Language Processing - CodeIntermediateNew
Semantic Parser for an Enterprise Analytics Assistant
Define a small typed query language (filter, aggregate, group_by, time_range, metric). Curate or write 200 training examples covering the controlled subset and 50 held-out test …
- Semantic Parsing
- Grammar Design
- Transformer Models
Computational Semantics - CodeIntermediateNew
Instruction-Tune a Small Model for an Edtech Tutor
You receive a 1.5B base model (e.g., SmolLM-1.7B or Qwen-1.8B), permission to use 2 hours of a rented A100, and a curated seed of around 5,000 math-tutoring dialogues. Augment w…
- Instruction Tuning
- Fine Tuning
- Dataset Curation
Fine-Tuning Large Language Models - CodeIntermediateNew
Build a Multilingual Customer-Email Classifier
You receive 28,000 labeled emails (skewed toward English and Mandarin). Try at least two approaches: (1) a fine-tuned multilingual transformer (XLM-RoBERTa or mDeBERTa) and (2) …
- Text Classification
- Multilingual NLP
- Hugging Face Transformers
Natural Language Processing Practice your coursework on real scenarios.
Every challenge is shaped from real-world context — not generic exercises. The work mirrors what your degree prepares you for.
Why Ewance
- CodeIntermediateNew
Distributional Embeddings for a Multilingual Legal Search
Use a public multilingual corpus (e.g., MultiEURLEX or a subset of EUR-Lex) plus a small hand-built test set of around 100 cross-lingual query-passage pairs. Fine-tune (or evalu…
- Distributional Semantics
- Multilingual NLP
- Sentence Embeddings
Computational Semantics - CodeIntermediateNew
Adapt Machine Translation to a Niche Domain
Pick an open MT base (NLLB-200 or a strong open M2M model). Build a parallel corpus of around 8,000 sentence pairs from the company's bilingual safety standards. Fine-tune on th…
- Machine Translation
- Domain Adaptation
- Hugging Face Transformers
Natural Language Processing - CodeIntermediateNew
Domain-Adapt an NLP Pipeline from News to Customer-Support Tickets
You receive 30,000 anonymized customer-support tickets (PT-BR + ES) plus the news-trained NER and intent models. Apply continued pretraining of a multilingual encoder (e.g., XLM…
- Transfer Learning
- Domain Adaptation
- Continued Pretraining
Meta-Learning, Transfer Learning, and Multi-Task Learning - CodeIntermediateNew
Fine-Tune a Small Transformer for Legal-Domain EN-DE Translation
You receive a 120,000-segment parallel EN-DE legal corpus and a held-out 1,000-segment test set with reference translations. Fine-tune a small pretrained Transformer (e.g., NLLB…
- Neural Mt
- Hugging Face Transformers
- Fine Tuning
Machine Translation - Browse challenges
Explore role
Product Manager
Ship product that solves real user problems. Combine user research, prototyping, and stakeholder alignment to turn ambiguous briefs into measurable wins — the role at the centre of modern software teams.
- CodeIntermediateNew
Lambda-Calculus Semantic Parser for a Math-Tutor EdTech
Define a small typed lambda-calculus representation for linear equations and a small set of word-problem templates (rate, age, mixture). Build a parser that maps surface express…
- Semantic Parsing
- Lambda Calculus
- Symbolic Reasoning
Computational Semantics - CodeIntermediateNew
Fine-Tune a Sequence-to-Sequence Model for Code-Doc Generation
Take a small base model (CodeT5+ or a distilled CodeLlama-Instruct). Build the dataset by mining around 8,000 high-quality function-docstring pairs from permissively-licensed Py…
- Seq2seq
- Hugging Face Transformers
- Fine Tuning
Neural Networks for NLP - CodeIntermediateNew
Fine-Tune ASR for a Healthcare Voice-Note Startup
You receive about 40 hours of de-identified clinician voice notes paired with corrected transcripts plus a medical-terminology lexicon (about 8,000 drug + procedure terms). Fine…
- Asr
- Speech Recognition
- Domain Adaptation
Speech Recognition and Spoken Language Processing - CodeIntermediateNew
Natural Language Inference for an HR-AI Compliance Tool
Use SNLI/MNLI/ANLI as starting data and curate 200 domain-specific HR examples (synthetic or anonymized) for fine-tuning. Fine-tune a small encoder (DeBERTa-v3-base or similar),…
- Natural Language Inference
- Transformer Models
- Fine Tuning
Computational Semantics Build a verifiable portfolio.
Submissions become evidence. Reviewers with shipping experience score against a rubric; the result becomes a credential anyone can verify.
Why Ewance
- CodeIntermediateNew
Multi-Turn Dialogue Manager for a Banking Assistant
You receive a transcript dataset of 200 conversations (human-tagged with intent, slot values, and required outcome), a list of 8 supported intents, and tool stubs for 3 backend …
- Dialogue Management
- Intent Classification
- Slot Filling
Question Answering and Conversational Systems - CodeIntermediateNew
Fine-Tune a Transformer for Customer-Support Triage at an Enterprise AI Vendor
You receive 240,000 labeled support tickets across 14 queues, with English, Bahasa Indonesia, and Tagalog. Fine-tune a multilingual transformer encoder (XLM-RoBERTa-base is a st…
- Hugging Face Transformers
- Fine Tuning
- Multilingual NLP
Deep Learning - CodeIntermediateNew
Extract Skills and Roles from Job Postings for a Recruiter Tool
You receive 30,000 anonymized job postings and a labelled 1,000-posting benchmark with (skill, role, seniority) spans. Fine-tune a small token classifier (e.g., DeBERTa-v3-base)…
- Information Extraction
- Token Classification
- Esco Taxonomy
Linguistic Engineering and Language Technologies - CodeIntermediateNew
Build a Speaker-Diarization Pipeline for a Legal-Tech Startup
You receive 20 hours of de-identified hearing audio with ground-truth speaker labels (4 speaker classes per hearing). Build a speaker-diarization pipeline (pyannote-audio or sim…
- Speaker Diarization
- Speech Recognition
- Pyannote
Speech Recognition and Spoken Language Processing - CodeIntermediateNew
Extractive QA on Clinical Trial Protocols
You receive 500 anonymized protocol PDFs (already OCR-ed to text) and 1,200 labeled question-answer pairs where each answer is an exact text span. Build an extractive QA system:…
- Extractive Qa
- Reading Comprehension
- Model Finetuning
Question Answering and Conversational Systems - CodeIntermediateNew
Build a Domain-Specific Named-Entity Recognizer for Legal Contracts
Start from a strong English NER base (spaCy transformer or LegalBERT). Fine-tune on a provided 1,200-contract labeled dataset for the 9 entity types. Handle long contracts (ofte…
- Named Entity Recognition
- Sequence Labeling
- Domain Adaptation
Natural Language Processing - CodeIntermediateNew
Extract Structured Lease Terms for a Commercial Real-Estate Platform
You receive 500 anonymized lease PDFs and a labelled gold set of 150 leases with the 14 fields filled in. Build a pipeline that does (1) layout-aware PDF parsing (Unstructured, …
- Information Extraction
- Pdf Parsing
- Named Entity Recognition
Linguistic Engineering and Language Technologies - CodeIntermediateNew
Train a Domain-Specific Reranker for a Legal-Tech Search Box
You receive 20,000 (query, document, relevance-label) triples from the firm's contract corpus. Fine-tune a small cross-encoder (e.g., ms-marco-MiniLM-L-6-v2 or BAAI/bge-reranker…
- Cross Encoder Reranker
- Fine Tuning
- Ir Evaluation
Information Retrieval and Search
How it works
From brief to credential, in six steps.
Step 01
Browse challenges aligned to your studies.
Step 02
Accept the one that fits your goals.
Step 03
Work through it with AI Copilot guidance.
Step 04
Submit for structured evaluation.
Step 05
Earn a verified credential.
Step 06
Add it to LinkedIn with one click.
Related roles you may want to explore
Browse all roles →AI Engineering
AI Engineer
Between a promising research paper and a feature people actually use sits a long, unglamorous bridge — and AI engineers build it. The job is taking models that work in notebooks and shaping them into systems that hold up under real traffic, real costs, and real users with messy questions. Good work here looks like a retrieval pipeline that gets answers right ninety-something percent of the time, with evaluation harnesses catching regressions before they ship. Students grow into this role by treating Python and PyTorch as instruments rather than checkboxes, then learning how to reason about latency, evaluation, and cost together. If you enjoy stitching ideas into running software, this path will feel like home.
AI Engineering
Computer Vision Engineer
Teaching a machine to see is harder than it sounds and more interesting than it looks. Computer vision engineers shape the systems that read documents, navigate self-driving cars, screen medical images, and answer questions about photos. The role mixes the math of multi-view geometry with the engineering grind of getting models small and fast enough to run where they're needed — sometimes on a phone, sometimes on a robot. Good work here looks like a pipeline that holds up in real lighting, real motion, and real failure modes. Students grow into this path by getting hands-on with OpenCV and PyTorch early, then learning the harder craft of optimizing models without quietly destroying their accuracy.
AI Engineering
Machine Learning Engineer
A model that works on a laptop and a model that works for millions of users are two very different artifacts, and machine learning engineers live in the gap between them. The role exists to take research-grade ML and turn it into reliable production systems, which means caring about latency, retraining pipelines, and what happens when the data distribution shifts at three in the morning. Students grow into this through hands-on work with PyTorch or TensorFlow plus enough software engineering discipline to run real CI/CD. Tools like AWS SageMaker become part of the workflow. Strong ML engineers can talk shop with data scientists on one side and platform engineers on the other, and that bilingual quality is often what gets them hired.
AI Engineering
MLOps Engineer
Models in production fail in stranger ways than models in notebooks ever could. The MLOps engineer is the person who anticipates those failures and builds the scaffolding that makes machine learning survive contact with real users. Think feature stores that stay consistent between training and serving, deployment pipelines through MLflow that make rollbacks boring, and observability that catches drift before stakeholders notice. The work sits at the intersection of platform engineering and data science, and rewards people who like building tools other engineers will rely on. A student grows into this role by getting comfortable with Kubernetes early and developing taste for what a healthy ML system actually looks like under load.
AI Engineering
Prompt Engineer
Writing instructions for a model is a strange new craft. The words you choose, their order, the examples you include — all shape what a multi-billion-parameter system actually does next. Prompt engineers treat this as a real engineering discipline: versioning prompts in tools like PromptLayer, running evaluations across thousands of test cases, optimizing for cost and latency in production, and collaborating with domain experts to encode their judgment in text. The role is new enough that students often help define it on the job. Growing into it means building intuition for how models fail, when to fine-tune instead, and how to write specs precise enough to ship. Good prompt engineers measure everything and trust vibes only as a starting point.
Industry teams behind a decade of practitioner briefs
Hiring from this pool?
Sponsor a challenge and meet candidates through actual work.
Industry teams can shape briefs around the skills they hire for, then evaluate students on rubric-scored deliverables — not resumes.
Skills and disciplines shown on this page are derived from the Ewance challenge catalogue. When the median annual salary is available for this role via Adzuna, it will be shown above with the sample size and country.
Portrait: Photo by Beatriz Cattel on Unsplash.



















































































