title: README emoji: π colorFrom: blue colorTo: indigo sdk: static pinned: false
Researching scalable (RL) methods on language models.