
qwen3-4b-thinking-2507 - GGUF

This model was fine-tuned and then converted to GGUF format using Unsloth.

Example usage:

  • For text-only LLMs: llama-cli -hf repo_id/model_name -p "why is the sky blue?"
  • For multimodal models: llama-mtmd-cli -m model_name.gguf --mmproj mmproj_file.gguf
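
As a concrete local sketch (assuming llama.cpp is installed with the llama-cli binary on the PATH, and the Q4_K_M file listed in the next section has already been downloaded to the current directory):

  # Run the 4-bit quantization interactively with the sample prompt above.
  llama-cli -m qwen3-4b-thinking-2507.Q4_K_M.gguf -p "why is the sky blue?"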

Available model files:

  • qwen3-4b-thinking-2507.Q5_K_M.gguf
  • qwen3-4b-thinking-2507.Q8_0.gguf
  • qwen3-4b-thinking-2507.Q4_K_M.gguf
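
A single quantization can be fetched without cloning the whole repository, for example with the Hugging Face CLI (a sketch only: it assumes the huggingface_hub CLI is installed, and <repo_id> is a placeholder for this repository's actual id on the Hub):

  # Download only the Q5_K_M file into the current directory.
  # <repo_id> is a placeholder; replace it with this repository's id.
  huggingface-cli download <repo_id> qwen3-4b-thinking-2507.Q5_K_M.gguf --local-dir .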

Ollama

An Ollama Modelfile is included for easy deployment.
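
A minimal usage sketch (assuming Ollama is installed and the Modelfile plus the GGUF file it references have been downloaded to the same directory; the local tag qwen3-4b-thinking is an arbitrary choice, not defined by the repository):

  # Register the model with Ollama using the included Modelfile.
  ollama create qwen3-4b-thinking -f ./Modelfile

  # Start an interactive chat session with the newly registered model.
  ollama run qwen3-4b-thinking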

Model details

  • Format: GGUF
  • Model size: 4B parameters
  • Architecture: qwen3
  • Quantizations: 4-bit, 5-bit, 8-bit