Image-Text-to-Text
Transformers
Safetensors
llava_qwen
conversational