naver-hyperclovax/HyperCLOVAX-SEED-Text-Instruct-0.5B Text Generation β’ 0.6B β’ Updated Jul 21 β’ 3.99k β’ 74
naver-hyperclovax/HyperCLOVAX-SEED-Vision-Instruct-3B Text Generation β’ 4B β’ Updated Sep 16 β’ 66.5k β’ 211
naver-hyperclovax/HyperCLOVAX-SEED-Text-Instruct-1.5B Text Generation β’ 2B β’ Updated Oct 2 β’ 3.84k β’ 142
nvidia/Llama-3.1-Nemotron-Nano-8B-v1 Text Generation β’ 8B β’ Updated 18 days ago β’ 16.7k β’ β’ 210
naver-hyperclovax/HyperCLOVAX-SEED-Think-14B Text Generation β’ 15B β’ Updated Aug 27 β’ 4.09k β’ 91
naver-hyperclovax/HyperCLOVAX-SEED-Think-14B Text Generation β’ 15B β’ Updated Aug 27 β’ 4.09k β’ 91
view article Article DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge By NormalUhr β’ Feb 7 β’ 243
Running 3.4k 3.4k The Ultra-Scale Playbook π The ultimate guide to training LLM on large GPU Clusters