AI & Data
Generative AI & LLMs Challenges
Generative AI & LLMs challenges put you inside the work of building with large language models. You'll develop skills in prompt patterns, few-shot prompting, chain-of-thought, and LLM API integration, learning how these models behave before you scale them.
From there you'll handle the harder edges — RAG architectures, vector database basics, fine-tuning, and prompt versioning — putting LLM guardrails and LLM evaluation around every deployment the way AI teams actually do. Each challenge you solve earns a verified credential you can share with recruiters.
- AnalysisIntermediateNew
Transfer-Learning Backbone Bake-Off for Retail Product Tagging
You receive 80,000 retail product images tagged with multiple labels from a 250-tag taxonomy. Use each of the three pretrained backbones via two transfer strategies: (1) linear …
- Transfer Learning
- Fine Tuning
- Supervised Learning
Meta-Learning, Transfer Learning, and Multi-Task Learning - ResearchIntermediateNew
Neuro-Symbolic Question Answering on an Enterprise Knowledge Graph
You receive a curated Turtle-format knowledge graph (around 2 million triples covering organizational structure, products, projects), 200 labeled question-SPARQL pairs split 140…
- Neuro Symbolic
- Sparql
- Knowledge Graphs
Fuzzy Logic, Knowledge Representation, and Symbolic Reasoning - CodeIntermediateNew
Fine-Tune a Diffusion Model for a Sustainable-Fashion Mood-Board Tool
You receive around 1,200 curated images of sustainable garments tagged with silhouette and material. Choose a base diffusion model (Stable Diffusion 1.5/2.1 or SDXL) and apply L…
- Diffusion Models
- Fine Tuning
- Ai Image Generation
Deep Generative Models - ResearchBeginnerNew
Run a Human-Preference Study Comparing Two Coding Assistants
Design a blinded paired-comparison study: 12 developer participants, each gets the same 8 realistic coding tasks (refactor, write a function, debug, test), each task is solved b…
- Experimental Design
- Statistical Evaluation
- Human Evaluation
AI Measurement and Evaluation Practice your coursework on real scenarios.
Every challenge is shaped from real industry context — not generic exercises. The work mirrors what your degree prepares you for.
Why Ewance
- CodeIntermediateNew
Build a Cross-Lingual Retrieval-Augmented QA System
Index around 5,000 internal-knowledge docs across the three languages using a multilingual embedding model (e.g., multilingual-e5 or BGE-M3). Build the retrieval-then-answer pip…
- RAG Architectures
- Cross Lingual Retrieval
- Multilingual Embeddings
Neural Networks for NLP - ResearchSeniorNew
Multi-Tenant Vector Isolation for a B2B Knowledge Assistant
Build a small proof-of-concept in your chosen vector store (Pinecone or Qdrant — pick one and justify) that supports 10 simulated tenants with 1,000 vectors each. Implement the …
- Multi Tenant Isolation
- Vector Databases
- Threat Modeling
Vector Databases and Embeddings - DesignBeginnerNew
Generative AI Content Strategy for a Sustainable Fashion Brand
You must first define EcoWeave's brand voice by analyzing their existing content (provided). Then, design a set of prompts and a workflow (e.g., using ChatGPT or a no-code AI to…
- Prompt Patterns
- Content Strategy
- Brand Voice
Data-Driven Prototyping with AI - ResearchIntermediateNew
QLoRA Fine-Tune for a Customer-Support Domain Assistant
You receive 8,000 anonymized support ticket pairs (question -> agent response), the company's product documentation (around 600 pages), and a strong RAG baseline already running…
- Qlora
- Fine Tuning
- RAG Architectures
Fine-Tuning Large Language Models - Browse challenges
Explore role
Marketing Analyst
Plan and measure campaigns that grow the business. Funnel analytics, attribution, segmentation, and the rigorous measurement that lets marketing defend its budget at the leadership table.
- DesignIntermediateNew
Spec Trust-and-Safety Eval Harness for an LLM-Powered Customer-Support Bot
You will spec a 6-page evaluation harness covering: (1) jailbreak test set (about 200 prompts across 6 attack families), (2) PII-leakage probes (about 100 synthetic-customer pro…
- LLM Evaluation
- Red Team Operations
- Pii Detection
Trustworthy AI, Robustness, and Safety - CodeIntermediateNew
Multi-Agent Research Assistant for Biotech Patent Review
You receive 20 historical patent applications with the firm's own prior-art memos as ground truth. Design and build a 3-agent system: (a) Searcher — issues queries to a patent-s…
- Ai Agents
- Multi Agent Collaboration
- Agent Evaluation
AI Agents and LLM-Based Agents - CodeIntermediateNew
Natural Language Inference for an HR-AI Compliance Tool
Use SNLI/MNLI/ANLI as starting data and curate 200 domain-specific HR examples (synthetic or anonymized) for fine-tuning. Fine-tune a small encoder (DeBERTa-v3-base or similar),…
- Natural Language Inference
- Transformer Models
- Fine Tuning
Computational Semantics - CodeBeginnerNew
Prototype a Multimodal Visual-Question-Answering Demo
You will use a small open-source vision-language model (e.g., LLaVA-1.5-7B or PaliGemma) and prompt-engineer it for the warehouse-VQA task. Build a Gradio web demo. Construct a …
- Vision Language Models
- Multimodal Perception
- Prompt Patterns
Machine Perception Build a verifiable portfolio.
Submissions become evidence. Reviewers with shipping experience score against a rubric; the result becomes a credential anyone can verify.
Why Ewance
- ResearchIntermediateNew
Fine-Tune a Vision-Language Model for Image Captioning
Take BLIP-2 or LLaVA-1.6 as the base. Fine-tune (LoRA is fine) on a 4,000-image accessibility-curated dataset where each image has a useful caption written by a low-vision-exper…
- Vision Language Models
- Fine Tuning
- Pytorch Or Tensorflow
Multimodal Machine Learning - PresentationBeginnerNew
Pitch an LLM Earnings-Call Analyst to an Equity Long-Short Team
Pick 3 publicly available US tech earnings-call transcripts (from a free source like sec.gov filings or company investor-relations pages) and build a retrieval-augmented LLM wor…
- Prompt Patterns
- RAG Architectures
- LLM Evaluation
AI and Quantitative Finance - CodeIntermediateNew
LLM-Powered FAQ Chatbot for 40-Person SaaS Scale-up
You have access to TaskFlow's internal documentation, help articles, and a sample of 500 support tickets. Your task is to build a retrieval-augmented generation (RAG) pipeline: …
- Large Language Models
- RAG Architectures
- Information Retrieval
Text Analytics and Natural Language Processing - CodeIntermediateNew
Train a Domain-Specific Reranker for a Legal-Tech Search Box
You receive 20,000 (query, document, relevance-label) triples from the firm's contract corpus. Fine-tune a small cross-encoder (e.g., ms-marco-MiniLM-L-6-v2 or BAAI/bge-reranker…
- Cross Encoder Reranker
- Fine Tuning
- Ir Evaluation
Information Retrieval and Search - PresentationIntermediateNew
Design a Hybrid Symbolic-Neural Agent for an Enterprise RAG Demo
Design a hybrid agent for a 'company-policy assistant' demo: a symbolic planner decomposes user goals into typed subtasks ('find policy', 'check applicability', 'compose answer'…
- Hybrid Ai
- Symbolic Planning
- RAG Architectures
Artificial Intelligence: Principles and Techniques - CodeIntermediateNew
Design Prompt Versioning and Observability for a Coding Assistant
You will (1) design a prompt-registry data model (versions, owners, environments, change log) and implement it in Postgres + a small Python SDK, (2) instrument the assistant to …
- Prompt Versioning
- Observability
- Pii Scrubbing
LLM Application Development
How it works
From brief to credential, in six steps.
Step 01
Browse challenges aligned to your studies.
Step 02
Accept the one that fits your goals.
Step 03
Work through it with AI Copilot guidance.
Step 04
Submit for structured evaluation.
Step 05
Earn a verified credential.
Step 06
Add it to LinkedIn with one click.
Industry teams behind a decade of practitioner briefs
Hiring from this pool?
Sponsor a challenge and meet candidates through actual work.
Industry teams can shape briefs around the skills they hire for, then evaluate students on rubric-scored deliverables — not resumes.



















































































