Was the training done with FP8 or BF16?

#14
opened by mindkrypted

As the title asks: if the training was done in BF16, could we expect a release of those weights? Quantizing from the higher-precision checkpoint should give better results.

Thanks,

MiniMax org

M2 was trained with FP8.
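One way to check this yourself is to read the dtypes recorded in a checkpoint's `.safetensors` header: the format is an 8-byte little-endian length followed by a JSON table mapping tensor names to `dtype`, `shape`, and `data_offsets`. The sketch below is an assumption-laden illustration, not MiniMax's tooling; it builds a tiny fake checkpoint (hypothetical names `w1`/`scale`, file `demo.safetensors`) purely so the header parser has something to run against.

```python
import json
import struct
from collections import Counter

def dtype_histogram(path):
    """Count tensor dtypes by parsing a .safetensors header
    (8-byte little-endian u64 header length, then a JSON table)."""
    with open(path, "rb") as f:
        (header_len,) = struct.unpack("<Q", f.read(8))
        header = json.loads(f.read(header_len))
    # "__metadata__" is an optional non-tensor entry in the header.
    return Counter(v["dtype"] for k, v in header.items() if k != "__metadata__")

def write_minimal(path, entries):
    """Write a minimal, zero-filled safetensors file for demonstration.
    `entries` maps tensor name -> (dtype string, shape)."""
    byte_sizes = {"F8_E4M3": 1, "BF16": 2, "F32": 4}
    header, offset = {}, 0
    for name, (dtype, shape) in entries.items():
        n = byte_sizes[dtype]
        for dim in shape:
            n *= dim
        header[name] = {"dtype": dtype, "shape": shape,
                        "data_offsets": [offset, offset + n]}
        offset += n
    blob = json.dumps(header).encode("utf-8")
    with open(path, "wb") as f:
        f.write(struct.pack("<Q", len(blob)) + blob + b"\x00" * offset)

# Hypothetical FP8 checkpoint with an FP32 scale tensor:
write_minimal("demo.safetensors",
              {"w1": ("F8_E4M3", [4, 4]), "scale": ("F32", [4])})
print(dtype_histogram("demo.safetensors"))
```

Run against a real download, `dtype_histogram` would show whether the released weights are stored as `F8_E4M3` or `BF16` without loading any tensors into memory.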
