Qwen3-8B-NVFP4
NVFP4-quantized version of Qwen/Qwen3-8B produced with llmcompressor.
Notes
- Quantization scheme: NVFP4 (linear layers,
lm_headexcluded) - Calibration samples: 512
- Max sequence length during calibration: 2048
- Downloads last month
- 19
NVFP4-quantized version of Qwen/Qwen3-8B produced with llmcompressor.
lm_head excluded)