67f4075 3d64e78 1524bde
1
2
3
4
5
6
7
8
9
10
11
12
--- title: README emoji: π colorFrom: blue colorTo: indigo sdk: static pinned: false --- Researching scalable (RL) methods on language models.