Spiral RL

community

https://github.com/spiral-rl/spiral

spiral-rl

AI & ML interests

None defined yet.

Recent Activity

Benjamin-eecs authored a paper about 15 hours ago

SPICE: Self-Play In Corpus Environments Improves Reasoning

simonycl authored a paper 16 days ago

Verbalized Sampling: How to Mitigate Mode Collapse and Unlock LLM Diversity

Benjamin-eecs authored a paper 18 days ago

BigCodeArena: Unveiling More Reliable Human Preferences in Code Generation via Execution

View all activity

Collections 1

models 2

spiral-rl/Spiral-Qwen3-4B

Text Generation • 4B • Updated Jul 5 • 23 • 4

spiral-rl/Spiral-DeepSeek-R1-Distill-Qwen-7B

Text Generation • 8B • Updated Jul 5 • 5 • 2

datasets 1

spiral-rl/Spiral-Kuhn-Poker-Qwen3-32B-SFT

Viewer • Updated Jul 5 • 25.5k • 20