AI & Data
NLP Challenges
NLP challenges put you inside the work of teaching machines to read and make sense of language. You'll develop skills in Natural Language Processing fundamentals, Text Tokenization and Word Embeddings, and tasks like Named Entity Recognition and Sequence labeling using NLTK.
From there you'll handle the harder edges — Encoder fine-tuning (BERT family) with Hugging Face Transformers, Custom tokenization, Relation extraction, Information Retrieval, and Multilingual NLP — building Knowledge Representation the way real NLP teams do. Each challenge you solve earns a verified credential you can share with recruiters.
- CodeIntermediateNew
Train a Domain-Specific Reranker for a Legal-Tech Search Box
You receive 20,000 (query, document, relevance-label) triples from the firm's contract corpus. Fine-tune a small cross-encoder (e.g., ms-marco-MiniLM-L-6-v2 or BAAI/bge-reranker…
- Cross Encoder Reranker
- Fine Tuning
- Ir Evaluation
Information Retrieval and Search - CodeIntermediateNew
Train a Sequence Model for Wearable-Telemetry Sleep Staging at a Healthtech
You receive 220 nights of wearable telemetry from 60 subjects with PSG ground-truth labels. Train three sequence models: an LSTM baseline, a 1D-CNN+GRU hybrid, and a small trans…
- Sequence Models
- Lstm
- Hugging Face Transformers
Deep Learning - CodeIntermediateNew
Adapt Machine Translation to a Niche Domain
Pick an open MT base (NLLB-200 or a strong open M2M model). Build a parallel corpus of around 8,000 sentence pairs from the company's bilingual safety standards. Fine-tune on th…
- Machine Translation
- Domain Adaptation
- Hugging Face Transformers
Natural Language Processing - ResearchIntermediateNew
QLoRA Fine-Tune for a Customer-Support Domain Assistant
You receive 8,000 anonymized support ticket pairs (question -> agent response), the company's product documentation (around 600 pages), and a strong RAG baseline already running…
- Qlora
- Fine Tuning
- RAG Architectures
Fine-Tuning Large Language Models Practice your coursework on real scenarios.
Every challenge is shaped from real-world context — not generic exercises. The work mirrors what your degree prepares you for.
Why Ewance
- CodeIntermediateNew
Build a Small Transformer from Scratch and Train It on Code
Implement multi-head self-attention, RMSNorm, rotary positional embeddings, and a causal LM head from scratch — no Hugging Face shortcuts for the model code (you may use Hugging…
- Hugging Face Transformers
- Self Attention
- Pytorch Or Tensorflow
Neural Networks for NLP - CodeIntermediateNew
Instruction-Tune a Small Model for an Edtech Tutor
You receive a 1.5B base model (e.g., SmolLM-1.7B or Qwen-1.8B), permission to use 2 hours of a rented A100, and a curated seed of around 5,000 math-tutoring dialogues. Augment w…
- Instruction Tuning
- Fine Tuning
- Dataset Curation
Fine-Tuning Large Language Models - CodeIntermediateNew
Design an SAT-Based Verifier for an Autonomous-Vehicle Test Lab
Model a simplified four-way intersection: agent positions, lights, and discrete time steps. Define 5 safety properties in propositional logic (e.g., 'no two agents in the inters…
- Sat Solving
- Logical Inference
- Formal Verification
Artificial Intelligence: Principles and Techniques - CodeIntermediateNew
Design a Visual Search Backend for a Boutique Luxury Marketplace
You receive a catalog of 80,000 luxury items (image + sparse metadata) and a labeled query set of 300 user photos with hand-picked target items. Choose an embedding strategy (CL…
- Visual Search
- Word Embeddings
- Clip
Deep Learning for Computer Vision - Browse challenges
Explore role
Product Manager
Ship product that solves real user problems. Combine user research, prototyping, and stakeholder alignment to turn ambiguous briefs into measurable wins — the role at the centre of modern software teams.
- ResearchBeginnerNew
Survey Information-Retrieval Research for an AdTech Platform's Roadmap
Build a reading list of 30-40 papers spanning SIGIR, RecSys, KDD, WSDM, and arXiv from 2023-2025 across (a) dense retrieval architectures, (b) learning-to-rank with click feedba…
- Information Retrieval
- Learning To Rank
- Research Synthesis
Data Mining and Information Retrieval - CodeBeginnerNew
Knowledge-Graph Recommender for a Niche Online Bookstore
Model the catalog as a knowledge graph (nodes: books, authors, genres, themes, eras, awards; edges: wrote, in-genre, has-theme, won, similar-to). Use Neo4j or a simple Python in…
- Knowledge Representation
- Knowledge Graphs
- Python Or Javascript
Introduction to Artificial Intelligence (CS Elective) - AnalysisIntermediateNew
Catastrophic-Forgetting Audit on a Domain Fine-Tune
You receive the fine-tuned 7B chemistry model and its base, plus a benchmark basket (MMLU subset, GSM8K, IFEval, a small instruction-following set). Run all 4 benchmarks on both…
- Catastrophic Forgetting
- LLM Evaluation
- Fine Tuning
Fine-Tuning Large Language Models - CodeIntermediateNew
Build a Domain-Specific Named-Entity Recognizer for Legal Contracts
Start from a strong English NER base (spaCy transformer or LegalBERT). Fine-tune on a provided 1,200-contract labeled dataset for the 9 entity types. Handle long contracts (ofte…
- Named Entity Recognition
- Sequence Labeling
- Domain Adaptation
Natural Language Processing Build a verifiable portfolio.
Submissions become evidence. Reviewers with shipping experience score against a rubric; the result becomes a credential anyone can verify.
Why Ewance
- CodeIntermediateNew
Build a Vector-Search Backend for an Enterprise AI Knowledge Assistant
You receive a corpus of around 20,000 PDFs (mixed scanned and digital) totalling around 30 GB and a labeled retrieval set of 200 queries with human-judged ground-truth passages.…
- RAG Architectures
- Vector Database Basics
- Word Embeddings
Data Engineering and Big Data Systems - CodeIntermediateNew
Fine-Tune a Transformer for Customer-Support Triage at an Enterprise AI Vendor
You receive 240,000 labeled support tickets across 14 queues, with English, Bahasa Indonesia, and Tagalog. Fine-tune a multilingual transformer encoder (XLM-RoBERTa-base is a st…
- Hugging Face Transformers
- Fine Tuning
- Multilingual NLP
Deep Learning - CodeFoundationalNew
Rule-Based Intent Classifier for a Customer-Support Triage Bot
Build a rule-based classifier in Python that runs ordered rules (regex + keyword + simple heuristics) against ticket subject + body. Use a hierarchical rule structure (high-prec…
- Knowledge Representation
- Rule Based Systems
- Python Or Javascript
Introduction to Artificial Intelligence (CS Elective) - ResearchIntermediateNew
Multi-Task Learning for a Healthtech Triage Model
You receive 40,000 anonymized de-identified intake-form records with two labels: urgency tier (4 classes) and routed sub-specialty (12 classes). Train (1) two independent classi…
- Multi Task Learning
- Transfer Learning
- Hugging Face Transformers
Meta-Learning, Transfer Learning, and Multi-Task Learning - CodeIntermediateNew
Description-Logic Reasoner for Insurance-Policy Coverage Checks
You receive 50 representative coverage rules in plain English (from the current rule engine) and a sample of 1,000 anonymized claim cases with the current engine's outcomes (cov…
- Description Logics
- Owl
- Reasoning
Fuzzy Logic, Knowledge Representation, and Symbolic Reasoning - ResearchSeniorNew
Plan a Parameter-Efficient Fine-Tuning Strategy for a Big-Tech AI Lab
You will produce (1) a 6-page survey of four PEFT methods (LoRA, adapters, prefix tuning, IA3) with their strengths, weaknesses, and parameter footprints, (2) a one-page decisio…
- Parameter Efficient Fine Tuning
- Transfer Learning
- Fine Tuning
Meta-Learning, Transfer Learning, and Multi-Task Learning - ResearchBeginnerNew
Curate a Domain Lexicon for a Climate-Tech NLP Stack
You receive 5,000 policy documents and a benchmark of 200 documents with manually tagged domain terms. Curate a lexicon of ~1,500 terms with (1) canonical English form, (2) Swah…
- Lexical Resources
- Named Entity Recognition
- Spacy
Linguistic Engineering and Language Technologies - CodeIntermediateNew
Extract Structured Lease Terms for a Commercial Real-Estate Platform
You receive 500 anonymized lease PDFs and a labelled gold set of 150 leases with the 14 fields filled in. Build a pipeline that does (1) layout-aware PDF parsing (Unstructured, …
- Information Extraction
- Pdf Parsing
- Named Entity Recognition
Linguistic Engineering and Language Technologies - CodeIntermediateNew
Build a Sequence Model for Sign-Language Word Recognition
You receive about 12,000 short (1-3s) webcam clips covering a 50-word vocabulary, with body+hand pose features pre-extracted (e.g., MediaPipe Holistic landmarks per frame). Buil…
- Sequence Models
- Hugging Face Transformers
- Pose Estimation
Machine Perception - DesignIntermediateNew
Build an OWL Ontology for a Pharma R&D Knowledge Base
You receive a CSV-form starter knowledge base (around 4,000 compounds, 600 targets, 1,200 assays) and a list of 12 competency questions the scientists currently can't answer wit…
- Ontology Design
- Owl
- Knowledge Representation
Fuzzy Logic, Knowledge Representation, and Symbolic Reasoning - DesignBeginnerNew
Build an Attention-Visualization Tool for Translation Quality Audit
You will load a small open-source EN-FR transformer (e.g., Helsinki-NLP Opus-MT-en-fr), build a Streamlit or Gradio demo that lets the user paste English source, see the French …
- Attention Mechanisms
- Neural Mt
- Tool Design
Machine Translation - CodeIntermediateNew
Build a 30-Day Readmission Risk Model on De-Identified EHR Data
You receive a curated MIMIC-style de-identified EHR cohort (about 28,000 admissions, demographics, comorbidities, labs, prior-admission counts) with 30-day readmission labels. T…
- Ehr Modeling
- Risk Stratification
- Model Calibration
Machine Learning for Healthcare and Biomedicine
How it works
From brief to credential, in six steps.
Step 01
Browse challenges aligned to your studies.
Step 02
Accept the one that fits your goals.
Step 03
Work through it with AI Copilot guidance.
Step 04
Submit for structured evaluation.
Step 05
Earn a verified credential.
Step 06
Add it to LinkedIn with one click.
Industry teams behind a decade of practitioner briefs
Hiring from this pool?
Sponsor a challenge and meet candidates through actual work.
Industry teams can shape briefs around the skills they hire for, then evaluate students on rubric-scored deliverables — not resumes.



















































































