| license: mit | |
| tags: | |
| - text-game | |
| - world-model | |
| - rlvr | |
| datasets: | |
| - thuml/bytesized32-world-model-cot | |
| base_model: | |
| - thuml/bytesized32-world-model-sft | |
| See https://github.com/thuml/RLVR-World for examples for using this model. | |
| ## Citation | |
| ``` | |
| @article{wu2025rlvr, | |
| title={RLVR-World: Training World Models with Reinforcement Learning}, | |
| author={Jialong Wu and Shaofeng Yin and Ningya Feng and Mingsheng Long}, | |
| journal={arXiv preprint arXiv:2505.13934}, | |
| year={2025}, | |
| } |