thuml
/

bytesized32-world-model-rlvr-binary-reward

Model card Files Files and versions

bytesized32-world-model-rlvr-binary-reward / README.md

manchery's picture

Update README.md

501a03c verified 6 months ago

|

history blame contribute delete

486 Bytes

	---
	license: mit
	tags:
	- text-game
	- world-model
	- rlvr
	datasets:
	- thuml/bytesized32-world-model-cot
	base_model:
	- thuml/bytesized32-world-model-sft
	---
	See https://github.com/thuml/RLVR-World for examples for using this model.

	## Citation

	```
	@article{wu2025rlvr,
	title={RLVR-World: Training World Models with Reinforcement Learning},
	author={Jialong Wu and Shaofeng Yin and Ningya Feng and Mingsheng Long},
	journal={arXiv preprint arXiv:2505.13934},
	year={2025},
	}