---
title: Llama Hqq 1 Bit
emoji: π
colorFrom: green
colorTo: pink
sdk: gradio
sdk_version: 4.24.0
app_file: app.py
license: llama2
train: false
inference: false
pipeline_tag: text-generation
---

Demo of the HQQ 1-bit quantized (binary-weight) Llama2-7B-chat model, which uses a low-rank adapter to improve performance (an approach referred to as HQQ+).

Running this demo requires a GPU.

Model: https://huggingface.co/mobiuslabsgmbh/Llama-2-7b-chat-hf_1bitgs8_hqq
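
To give a rough feel for what "binary weights plus a low-rank adapter" means, here is a minimal NumPy sketch. It is not the HQQ or HQQ+ implementation: it assumes plain round-to-nearest quantization with a per-group scale and offset, a group size of 8 (guessed from the `gs8` in the model name), and a truncated SVD of the quantization error as a stand-in for the adapter, which in HQQ+ is trained rather than computed this way. All function names are hypothetical.

```python
import numpy as np

def quantize_1bit(W, group_size=8):
    """Round-to-nearest 1-bit quantization with a per-group scale and offset.

    Illustrative only: HQQ itself optimizes the quantization parameters
    instead of using plain min/max.
    """
    groups = W.reshape(-1, group_size)
    w_min = groups.min(axis=1, keepdims=True)
    w_max = groups.max(axis=1, keepdims=True)
    # Guard against constant groups to avoid division by zero.
    scale = np.where(w_max > w_min, w_max - w_min, 1.0)
    bits = np.round((groups - w_min) / scale).astype(np.uint8)  # each entry is 0 or 1
    return bits, scale, w_min

def dequantize(bits, scale, w_min, shape):
    """Map the stored bits back to approximate float weights."""
    return (bits * scale + w_min).reshape(shape)

def low_rank_correction(residual, rank):
    """Best rank-`rank` approximation of the quantization error via SVD.

    Assumption: stands in for a trained low-rank adapter.
    """
    U, S, Vt = np.linalg.svd(residual, full_matrices=False)
    A = U[:, :rank] * S[:rank]   # shape (rows, rank)
    B = Vt[:rank]                # shape (rank, cols)
    return A, B

rng = np.random.default_rng(0)
W = rng.standard_normal((64, 64)).astype(np.float32)  # toy weight matrix

bits, scale, w_min = quantize_1bit(W, group_size=8)
W_q = dequantize(bits, scale, w_min, W.shape)
A, B = low_rank_correction(W - W_q, rank=8)

err_q = np.linalg.norm(W - W_q)               # error with binary weights only
err_qa = np.linalg.norm(W - (W_q + A @ B))    # error with the low-rank term added
print(f"quantized only: {err_q:.3f}, with low-rank correction: {err_qa:.3f}")
```

In this toy setting the low-rank term reduces the reconstruction error of the binary weights; the actual model may behave differently, since its adapter is trained on data rather than fitted to the weight error.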