Generative AI & LLMs Challenges
Generative AI & LLMs challenges put you inside the work of building with large language models. You'll develop skills in prompt patterns, few-shot prompting, chain-of-thought, and LLM API integration, learning how these models behave before you scale them.
From there you'll handle the harder edges — RAG architectures, vector database basics, fine-tuning, and prompt versioning — putting LLM guardrails and LLM evaluation around every deployment the way AI teams actually do. Each challenge you solve earns a verified credential you can share with recruiters.
- Design, Beginner, New
Chain-of-Thought for High-School Math Tutoring
You receive 80 practice problems across 4 topics (linear equations, factoring, systems of equations, quadratics), each with the correct answer and an expected age-appropriate ex…
- Chain-of-Thought
- Zero-Shot Prompting
- Few-Shot Prompting
Prompt Engineering
- Code, Intermediate, New
Build a Cross-Lingual Retrieval-Augmented QA System
Index around 5,000 internal-knowledge docs across the three languages using a multilingual embedding model (e.g., multilingual-e5 or BGE-M3). Build the retrieval-then-answer pip…
- RAG Architectures
- Cross-Lingual Retrieval
- Multilingual Embeddings
Neural Networks for NLP
- Code, Intermediate, New
Build a BM25 + Embeddings Hybrid Search for a Legal-Tech Document Portal
Stand up an OpenSearch cluster with BM25 indexing on the 2.4M-document corpus. Generate dense embeddings (you choose the model; justify cost and quality trade-offs) and index th…
- Information Retrieval
- BM25
- Vector Database Basics
Data Mining and Information Retrieval
- Code, Intermediate, New
Wire a Knowledge Graph into a Pharma RAG Assistant
You receive: 100 internal benchmark questions with reference answers; a 50,000-document anonymized RAG index; a curated drug-target-disease KG (~80,000 triples) loaded into a tr…
- KG-Grounded RAG
- SPARQL
- Entity Linking
Knowledge Graphs and Semantic Web
Practice your coursework on real scenarios.
Every challenge is shaped from real-world context — not generic exercises. The work mirrors what your degree prepares you for.
Why Ewance
- Code, Intermediate, New
Fine-Tune a Diffusion Model for a Sustainable-Fashion Mood-Board Tool
You receive around 1,200 curated images of sustainable garments tagged with silhouette and material. Choose a base diffusion model (Stable Diffusion 1.5/2.1 or SDXL) and apply L…
- Diffusion Models
- Fine Tuning
- AI Image Generation
Deep Generative Models
- Code, Intermediate, New
Build an Evaluation Harness for an Internal LLM Assistant
You will design and implement an evaluation harness in Python that runs four test suites: (1) helpfulness (LLM-as-judge with rubric), (2) factual grounding (compare cited source…
- LLM Evaluation
- LLM-as-Judge
- Prompt Injection Testing
Large Language Models
- Code, Intermediate, New
Build a Vector-Search Backend for an Enterprise AI Knowledge Assistant
You receive a corpus of around 20,000 PDFs (mixed scanned and digital) totalling around 30 GB and a labeled retrieval set of 200 queries with human-judged ground-truth passages.…
- RAG Architectures
- Vector Database Basics
- Word Embeddings
Data Engineering and Big Data Systems
- Code, Intermediate, New
Ship a Streaming RAG Endpoint with Caching and Fallbacks
You will build a FastAPI service exposing one POST /chat endpoint that (1) streams tokens via Server-Sent Events, (2) caches identical (system_prompt, query, retrieved_context) …
- LLM API Integration
- Streaming
- Response Caching
LLM Application Development
Explore role
Product Manager
Ship product that solves real user problems. Combine user research, prototyping, and stakeholder alignment to turn ambiguous briefs into measurable wins — the role at the centre of modern software teams.
- Design, Beginner, New
Build the PRD for an Internal RAG Knowledge Assistant
You receive: a description of the CS workflows (post-sale onboarding, escalation, renewal), an inventory of internal knowledge sources (Notion, Salesforce, Zendesk macros, 3 pro…
- Product Management
- RAG Architectures
- Evaluation Design
AI for Business and AI Product Management
- Code, Intermediate, New
LLM-Powered FAQ Chatbot for 40-Person SaaS Scale-up
You have access to TaskFlow's internal documentation, help articles, and a sample of 500 support tickets. Your task is to build a retrieval-augmented generation (RAG) pipeline: …
- Large Language Models
- RAG Architectures
- Information Retrieval
Text Analytics and Natural Language Processing
- Design, Beginner, New
Design an Automated Essay-Feedback System
You receive 20 anonymized middle-school essays scored by 2 human teachers on a 4-dimension rubric (structure, evidence, voice, mechanics). Design an LLM-based feedback system th…
- Automated Assessment
- Rubric Design
- Prompt Patterns
AI in Education and Learning Analytics
- Code, Intermediate, New
Instruction-Tune a Small Model for an Edtech Tutor
You receive a roughly 1.5B-parameter base model (e.g., SmolLM-1.7B or Qwen-1.8B), permission to use 2 hours of a rented A100, and a curated seed of around 5,000 math-tutoring dialogues. Augment w…
- Instruction Tuning
- Fine Tuning
- Dataset Curation
Fine-Tuning Large Language Models
Build a verifiable portfolio.
Submissions become evidence. Reviewers with shipping experience score against a rubric; the result becomes a credential anyone can verify.
- Design, Intermediate, New
Design a Continuous Eval Pipeline for an Enterprise RAG Product
Design (and partially build) a continuous-eval pipeline for a RAG system: (1) a structured eval set with at least 50 queries grouped by query class; (2) automated scoring (LLM-a…
- Continuous Evaluation
- LLM Evaluation
- RAG Architectures
AI Measurement and Evaluation
- Code, Intermediate, New
Fine-Tune a Diffusion Model for an E-commerce Product Studio
You receive 1,200 curated product + lifestyle images across 6 product categories, a brand-style guide, and the company's current studio cost per image (around EUR 18). Fine-tune…
- Diffusion Models
- Stable Diffusion
- DreamBooth
Generative AI
- Code, Beginner, New
Prototype a Multimodal Visual-Question-Answering Demo
You will use a small open-source vision-language model (e.g., LLaVA-1.5-7B or PaliGemma) and prompt-engineer it for the warehouse-VQA task. Build a Gradio web demo. Construct a …
- Vision Language Models
- Multimodal Perception
- Prompt Patterns
Machine Perception
- Code, Beginner, New
Structured-Output Prompts for Invoice Extraction
You receive 300 real invoice transcripts (already OCR-ed) labeled with 14 target fields, plus the current production prompt and its 12 percent failure log. Design a new prompt u…
- Structured Output
- JSON Schema
- Few-Shot Prompting
Prompt Engineering
- Research, Intermediate, New
Safety-Test a Customer-Service Agent for Adversarial Prompts
You receive a sandboxed instance of the agent (a tool-using LLM that can read account balances and open support tickets — both mocked). Design a red-team suite of at least 80 pr…
- AI Agents
- Red Team Operations
- Adversarial Prompts
AI Agents and LLM-Based Agents
- Research, Intermediate, New
Audit a Public LLM Benchmark for Validity Threats
Choose one open LLM benchmark (e.g., MMLU, GPQA, BIG-Bench-Hard, MATH). Read the benchmark paper plus at least three follow-up critiques. Audit (1) data contamination risk again…
- Benchmark Evaluation
- Data Contamination Analysis
- Annotation Methodology
AI Measurement and Evaluation
- Code, Intermediate, New
Build a Domain Instruction-Tuning Recipe for a Legal Coach
You will source instruction data from three streams: ~3,000 synthetic paralegal Q&A generated by a frontier model (anonymized prompts), ~1,500 curated examples from public legal…
- Instruction Tuning
- Fine Tuning
- Data Curation
Large Language Models
- Code, Intermediate, New
Build a LangGraph Multi-Agent Researcher
Design the four-agent topology with explicit message contracts. Implement each agent as a separate LLM call with role-specific system prompts, tool access (web search for resear…
- Multi-Agent Orchestration
- LangGraph or CrewAI Workflows
- Tool Use
Multi-Agent Systems
- Presentation, Intermediate, New
Design a Hybrid Symbolic-Neural Agent for an Enterprise RAG Demo
Design a hybrid agent for a 'company-policy assistant' demo: a symbolic planner decomposes user goals into typed subtasks ('find policy', 'check applicability', 'compose answer'…
- Hybrid AI
- Symbolic Planning
- RAG Architectures
Artificial Intelligence: Principles and Techniques
- Presentation, Beginner, New
Pitch an LLM Earnings-Call Analyst to an Equity Long-Short Team
Pick 3 publicly available US tech earnings-call transcripts (from a free source like sec.gov filings or company investor-relations pages) and build a retrieval-augmented LLM wor…
- Prompt Patterns
- RAG Architectures
- LLM Evaluation
AI and Quantitative Finance
- Research, Intermediate, New
QLoRA Fine-Tune for a Customer-Support Domain Assistant
You receive 8,000 anonymized support ticket pairs (question -> agent response), the company's product documentation (around 600 pages), and a strong RAG baseline already running…
- QLoRA
- Fine Tuning
- RAG Architectures
Fine-Tuning Large Language Models
- Research, Intermediate, New
Design a Capability Evaluation for an Open-Weights Coding Model
Pick a recent open-weights coding model (e.g., a Qwen, DeepSeek, or Llama variant). Design an evaluation set of around 40 coding tasks across 4 buckets: standard benign coding, …
- Capability Evaluation
- Safety Evaluation
- LLM Evaluation
AI Safety and Alignment
How it works
From brief to credential, in six steps.
Step 01
Browse challenges aligned to your studies.
Step 02
Accept the one that fits your goals.
Step 03
Work through it with AI Copilot guidance.
Step 04
Submit for structured evaluation.
Step 05
Earn a verified credential.
Step 06
Add it to LinkedIn with one click.
Industry teams behind a decade of practitioner briefs
Hiring from this pool?
Sponsor a challenge and meet candidates through actual work.
Industry teams can shape briefs around the skills they hire for, then evaluate students on rubric-scored deliverables — not resumes.