localm / LOCAL_MODELS_README.md
mihailik's picture
It actually works (in Firefox)
480516b

Put downloaded Hugging Face model repo files in the models folder so the dev server can serve them locally.

For Gemma (example):

  1. Create directory: models/google/gemma-2b/resolve/main/
  2. Download and place files into that directory as they would appear on HF, e.g.:
    • config.json
    • tokenizer.json
    • tokenizer_config.json
    • pytorch_model.bin (or converted files)
    • any other weights/tokenizer files

The app expects to fetch assets at paths like:

http://127.0.0.1:8812/models/google/gemma-2b/resolve/main/config.json

After placing files, reload the app and select the local Gemma entry from the slash menu. If files are present, the loader should be able to read config and proceed; if not present you'll get 404s for the missing files.