This repository contains the model presented in UFT: Unifying Supervised and Reinforcement Fine-Tuning.
Code: https://github.com/liumy2010/UFT
## References
* [UFT: Unifying Supervised and Reinforcement Fine-Tuning](https://arxiv.org/abs/2505.16984)
## Base model
* Qwen/Qwen2.5-0.5B