CohenQu/Qwen3-4B-Instruct-POPE-MIX-first_guide-no_guide-v3-0.26 • 4B • Updated about 18 hours ago • 510
CohenQu/sft_Qwen3-1.7B_Continue_vs_Terminate.05.00_orchard • Text Generation • 2B • Updated Jul 29 • 13
CohenQu/sft_Qwen3-1.7B_Continue_vs_Terminate.05.01_orchard • Text Generation • 2B • Updated Jul 29 • 16
CohenQu/Joint-Train-deepscalar_RL_hard_500_verl_0.35_0.001_0.001_32_32_20k_4_0713 • 2B • Updated Jul 14 • 3
CohenQu/Joint-Train-deepscalar_RL_hard_500_verl_0.35_0.001_0.001_32_32_20k_4_0710 • 2B • Updated Jul 12 • 4
CohenQu/Joint-Train-deepscalar_RL_hard_500_verl_0.35_0.001_0.001_32_32_20k_4_new • 2B • Updated Jun 28 • 4