Prototype a Multimodal Visual-Question-Answering Demo

FreeVerified credential2 weeksIntermediate

Start free

Start this challenge

Overview

What this challenge is about.

Build a Gradio VQA demo for warehouse tasks, report accuracy, and deliver a client deck. Earn a verifiable certificate.

CredentialBlockchain-anchored

ShareableLinkedIn-ready

LanguageEnglish

PaceSelf-paced

The Brief

What you'll do, and what you'll demonstrate.

Build a working warehouse-VQA demo on a small vision-language model and quantify accuracy per question type.

Earning criteria — what you'll demonstrate

Apply a small open-source vision-language model to a domain VQA task
Prompt-engineer for grounded multimodal reasoning
Construct a balanced VQA evaluation set across question types
Present a working multimodal demo to a mixed client audience

Program Fit

Where this fits in your program.

Sharpens the same skills your degree expects you to demonstrate.

Machine Perception

Master · Ai Ml

Fit score: 1

Skills

Skills you'll demonstrate.

Each one shows up on your verified credential.

Careers

Roles this prepares you for.

Real titles. Real skill bridges. Pick the one closest to your trajectory.

Career paths this builds toward

Canonical roles

AI Engineer
AI Engineering

AI Engineer

Shipping a working multimodal demo for a real client meeting is the AI-engineer's signature deliverable at consultancies and AI-forward product teams.

This challenge sharpens

vision-language-models
demo-development
prompt-engineering

Prompt Engineer

Designing and iterating prompts for grounded multimodal reasoning is exactly the prompt-engineer skill set hiring managers screen for.

This challenge sharpens

prompt-engineering
vision-language-models
evaluation

Applied AI Scientist

Constructing a balanced VQA evaluation set and reporting per-question-type accuracy is the applied-AI-scientist's craft when shipping new capabilities.

This challenge sharpens

multimodal-perception
evaluation
vision-language-models

One more thing

You can put a credential on your CV by Friday.

Start this challenge