Generate speech from text using a reference audio sample
Real-time video captioning powered by FastVLM
Display a static web page