Upload model.safetensors with huggingface_hub
#12 by Narsil - opened
No description provided.
So, to make sure I understand: this new conversion removes the duplicated embeddings?
Yes!
It works by just copying the PT weights, instead of using AutoModelForCausalLM.from_pretrained("gpt2"); model.save_pretrained(..), which I think is what you might have used. save_pretrained with the head saves the weights with an extra "transformer." prefix on their names, and saves the head too.
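To make the difference in layout concrete, here is a minimal sketch using a plain dict in place of a real checkpoint. It is not the conversion used in this PR (that one copies the PT weights directly); it only illustrates what the extra "transformer." prefix and the saved head look like. The helper name to_base_layout is hypothetical, the strings stand in for tensors, and the key names are assumed to follow GPT-2's usual naming.

```python
def to_base_layout(state_dict, prefix="transformer.", head_key="lm_head.weight"):
    """Return a copy with the LM head dropped and the prefix stripped.

    Hypothetical helper for illustration only, not part of any library.
    """
    out = {}
    for name, tensor in state_dict.items():
        if name == head_key:
            # The head is tied to the token embeddings, so saving it
            # stores the same weights a second time.
            continue
        if name.startswith(prefix):
            # Remove the extra nesting level added by the head wrapper.
            name = name[len(prefix):]
        out[name] = tensor
    return out


# Toy stand-in for a checkpoint saved from the model with the head.
# Key names are an assumption based on GPT-2's naming.
with_head = {
    "transformer.wte.weight": "token embeddings",
    "transformer.wpe.weight": "position embeddings",
    "transformer.ln_f.weight": "final layer norm",
    "lm_head.weight": "token embeddings",  # duplicate of wte
}

base = to_base_layout(with_head)
print(sorted(base))
```

In this toy layout the duplicated embedding entry disappears and the remaining names lose the prefix, which is roughly the shape you would expect from a file that copies the original weights as-is.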
makes sense, thanks! merging
julien-c changed pull request status to merged