Benchmark Design
If you like applying Benchmark Design, every challenge here gives you a chance to practice it on a real industry brief.
- ResearchExpertNew
Long-Context QA Evaluation Benchmark for Legal Memoranda
You receive 25 anonymized legal memoranda (50-90 pages each) and 100 QA pairs whose answers are deliberately spread across the documents (25 in pages 1-20, 25 in pages 20-40, 25…
- Long Context Qa
- Benchmark Design
- Model Evaluation
Question Answering and Conversational Systems - ResearchExpertNew
Investigate Scaling Trends on a Small Open Benchmark
You will train 4 transformer language models (10M, 50M, 200M, 600M parameters) on a public pretraining corpus (e.g., a small subset of FineWeb or OpenWebText) under identical op…
- Scaling Laws
- Transformer Pretraining
- Compute Optimal Training
Large Language Models
How it works
From brief to credential, in six steps.
Step 01
Browse challenges aligned to your studies.
Step 02
Accept the one that fits your goals.
Step 03
Work through it with AI Copilot guidance.
Step 04
Submit for structured evaluation.
Step 05
Earn a verified credential.
Step 06
Add it to LinkedIn with one click.
Industry teams behind a decade of practitioner briefs
Hiring from this pool?
Sponsor a challenge and meet candidates through actual work.
Industry teams can shape briefs around the skills they hire for, then evaluate students on rubric-scored deliverables — not resumes.



















































































