GGUF format will make your great work accessible to more users!
The mainline llama.cpp PR is here: https://github.com/ggml-org/llama.cpp/pull/16831