| Put downloaded Hugging Face model repo files in the `models` folder so the dev server can serve them locally. | |
| For Gemma (example): | |
| 1. Create directory: `models/google/gemma-2b/resolve/main/` | |
| 2. Download the files and place them in that directory under the same names they have on Hugging Face, e.g.: | |
| - config.json | |
| - tokenizer.json | |
| - tokenizer_config.json | |
| - pytorch_model.bin (or converted files) | |
| - any other weights/tokenizer files | |
| The app expects to fetch assets at paths like: | |
| http://127.0.0.1:8812/models/google/gemma-2b/resolve/main/config.json | |
| After placing the files, reload the app and select the local Gemma entry from the slash menu. If the files are present, the loader should be able to read the config and proceed; if any are missing, you'll get 404s for those files. | |
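
Before reloading the app, it can help to check the folder layout from the command line. The sketch below is not part of the app: the helper names (`local_model_dir`, `missing_files`, `asset_url`) and the `REQUIRED_FILES` list are hypothetical and only mirror the example files and URL pattern described above. Weight files are left out of the check because their names vary by model and conversion.

```python
from pathlib import Path

# Assumption: these are the small files the loader reads first, taken from
# the example list above. Add weight files here if you know their names.
REQUIRED_FILES = ["config.json", "tokenizer.json", "tokenizer_config.json"]


def local_model_dir(root, repo_id, revision="main"):
    """Return the directory that mirrors the HF layout: <root>/<repo_id>/resolve/<revision>."""
    return Path(root) / repo_id / "resolve" / revision


def missing_files(root, repo_id, required=REQUIRED_FILES):
    """List the required file names that are not present in the local model directory."""
    model_dir = local_model_dir(root, repo_id)
    return [name for name in required if not (model_dir / name).is_file()]


def asset_url(base_url, repo_id, filename, revision="main"):
    """Build the URL the app would request for a given file (pattern taken from the example above)."""
    return f"{base_url.rstrip('/')}/models/{repo_id}/resolve/{revision}/{filename}"


if __name__ == "__main__":
    repo = "google/gemma-2b"
    missing = missing_files("models", repo)
    if not missing:
        print("All required files found in", local_model_dir("models", repo))
    for name in missing:
        # Each missing file corresponds to a request that would return 404.
        print("missing:", asset_url("http://127.0.0.1:8812", repo, name))
```

Run it from the directory that contains `models`. Any URL it prints is one the dev server would answer with a 404 until the file is added.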