arxiv:2501.08328
Richard Zhuang
RZ412
AI & ML interests
LLM Routing, LLM + Games, Post-Training, Agents
Recent Activity
Updated a model about 16 hours ago: RZ412/Qwen2.5-3B-Instruct-OT3-8K-QwQ-R1-RM-50-50-SS-42-AS-42
Published a model 2 days ago: DCAgent/swebench-sync-group-size-8-dev-test
Published a model 2 days ago: RZ412/Qwen2.5-3B-Instruct-OT3-8K-QwQ-R1-RM-50-50-SS-42-AS-42