arxiv:2507.04632
Yun Qu
yunqu
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 2 months ago
Can Prompt Difficulty be Online Predicted for Accelerating RL Finetuning
of Reasoning Models?
authored
a paper
about 2 months ago
Latent Reward: LLM-Empowered Credit Assignment in Episodic Reinforcement
Learning
authored
a paper
about 2 months ago
LLM-Empowered State Representation for Reinforcement Learning
Organizations
None yet