import HfIcon from "./HfIcon";
import GlassContainer from "./GlassContainer";
import GlassButton from "./GlassButton";
import { GLASS_EFFECTS } from "../constants";
import type React from "react";

interface WelcomeScreenProps {
  onStart: () => void;
  onSelectCamera?: () => void;
  onSelectDisplay?: () => void;
  onSelectFile?: (file: File) => void;
  isSourceReady?: boolean;
}

export default function WelcomeScreen({
  onStart,
  onSelectCamera,
  onSelectDisplay,
  onSelectFile,
  isSourceReady,
}: WelcomeScreenProps) {
  const handleFileChange = (e: React.ChangeEvent<HTMLInputElement>) => {
    const file = e.target.files?.[0];
    if (file && onSelectFile) onSelectFile(file);
    // Reset the input so selecting the same file again re-triggers the change event
    e.currentTarget.value = "";
  };

  return (
    <div className="flex flex-col items-center gap-4">
      {/* Main Title Card */}
      <GlassContainer className="rounded-3xl p-6 text-center">
        <h1 className="text-2xl font-semibold">FastVLM WebGPU</h1>
        <p>
          Real-time video captioning powered by{" "}
          <strong>FastVLM-0.5B</strong>
        </p>
      </GlassContainer>

      {/* Source Selection Card */}
      <GlassContainer className="rounded-3xl p-6">
        <div className="flex flex-wrap justify-center gap-3">
          <GlassButton
            onClick={(e) => { e.preventDefault(); onSelectCamera?.(); }}
            className="px-4 py-3 rounded-2xl"
            aria-label="Use Camera"
          >
            Use Camera
          </GlassButton>
          <GlassButton
            onClick={(e) => { e.preventDefault(); onSelectDisplay?.(); }}
            className="px-4 py-3 rounded-2xl"
            aria-label="Share Tab or Screen"
          >
            Share Tab/Screen
          </GlassButton>
          <GlassButton
            onClick={(e) => { e.preventDefault(); document.getElementById("video-file-input")?.click(); }}
            className="px-4 py-3 rounded-2xl"
            aria-label="Upload Video"
          >
            Upload Video
          </GlassButton>
          {/* Hidden input backing the "Upload Video" button */}
          <input id="video-file-input" type="file" accept="video/*" className="hidden" onChange={handleFileChange} />
        </div>
        <GlassButton onClick={onStart} disabled={!isSourceReady} className="mt-4 px-4 py-3 rounded-2xl">
          Start Live Captioning
        </GlassButton>
      </GlassContainer>

      {/* How It Works Card */}
      <GlassContainer className="rounded-3xl p-6">
        <h2 className="font-semibold">How it works:</h2>
        <ol className="mt-3 space-y-3">
          <li className="flex gap-3">
            <span>1</span>
            <p>
              You are about to load{" "}
              <strong>FastVLM-0.5B</strong>, a powerful multimodal model optimized for in-browser inference.
            </p>
          </li>
          <li className="flex gap-3">
            <span>2</span>
            <p>
              Everything runs entirely in your browser with <HfIcon className="inline-block" /> Transformers.js{" "}
              and ONNX Runtime Web, meaning no data is sent to a server. It can even run offline!
            </p>
          </li>
          <li className="flex gap-3">
            <span>3</span>
            <p>Get started by clicking the button below.</p>
          </li>
        </ol>
        <p className="mt-4 text-sm">AI model will load when you click start</p>
      </GlassContainer>
    </div>
  );
}
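
/*
 * Usage sketch (illustrative, not part of this component): a hypothetical parent that owns the
 * shared <video> element and wires WelcomeScreen's callbacks to the browser media APIs. The
 * names `App`, `videoRef`, `started`, and the import path are assumptions for the example only.
 *
 * import { useRef, useState } from "react";
 * import WelcomeScreen from "./WelcomeScreen";
 *
 * function App() {
 *   const videoRef = useRef<HTMLVideoElement>(null);
 *   const [isSourceReady, setIsSourceReady] = useState(false);
 *   const [started, setStarted] = useState(false); // switches from welcome screen to captioning view
 *
 *   // Attach a camera or screen-share stream to the shared <video> element.
 *   const attachStream = (stream: MediaStream) => {
 *     if (!videoRef.current) return;
 *     videoRef.current.srcObject = stream;
 *     void videoRef.current.play();
 *     setIsSourceReady(true);
 *   };
 *
 *   return (
 *     <>
 *       <video ref={videoRef} muted playsInline />
 *       {!started && (
 *         <WelcomeScreen
 *           onStart={() => setStarted(true)}
 *           onSelectCamera={() => navigator.mediaDevices.getUserMedia({ video: true }).then(attachStream)}
 *           onSelectDisplay={() => navigator.mediaDevices.getDisplayMedia({ video: true }).then(attachStream)}
 *           onSelectFile={(file) => {
 *             if (!videoRef.current) return;
 *             videoRef.current.srcObject = null;
 *             videoRef.current.src = URL.createObjectURL(file); // play the uploaded video file
 *             void videoRef.current.play();
 *             setIsSourceReady(true);
 *           }}
 *           isSourceReady={isSourceReady}
 *         />
 *       )}
 *     </>
 *   );
 * }
 */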
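
/*
 * "How it works" in code: a minimal sketch of lazily loading and running the model with
 * Transformers.js on the WebGPU backend of ONNX Runtime Web, matching the copy above
 * ("AI model will load when you click start"). The checkpoint id, the AutoModelForVision2Seq
 * class choice, the dtype, and the prompt are assumptions for illustration only; the app's
 * real loading and captioning logic lives elsewhere.
 *
 * import { AutoProcessor, AutoModelForVision2Seq, RawImage } from "@huggingface/transformers";
 *
 * const MODEL_ID = "onnx-community/FastVLM-0.5B-ONNX"; // assumed checkpoint id
 *
 * async function captionFrame(frameUrl: string): Promise<string> {
 *   // Load on demand; weights are downloaded (and cached by the browser) on first use.
 *   const processor = await AutoProcessor.from_pretrained(MODEL_ID);
 *   const model = await AutoModelForVision2Seq.from_pretrained(MODEL_ID, {
 *     device: "webgpu", // runs fully client-side; no data is sent to a server
 *     dtype: "q4",      // quantized weights keep the download small
 *   });
 *
 *   // Build a chat-style prompt around a single video frame and generate a caption.
 *   const image = await RawImage.fromURL(frameUrl);
 *   const messages = [
 *     { role: "user", content: [{ type: "image" }, { type: "text", text: "Describe this frame." }] },
 *   ];
 *   const prompt = processor.apply_chat_template(messages, { add_generation_prompt: true });
 *   const inputs = await processor(prompt, image);
 *   const output = await model.generate({ ...inputs, max_new_tokens: 64 });
 *   // Note: the decoded string includes the prompt; trim it for display if needed.
 *   return processor.batch_decode(output, { skip_special_tokens: true })[0];
 * }
 */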