arxiv:2504.20406
Paiheng Xu
paiheng
Recent Activity
upvoted a paper 24 days ago
Agent Learning via Early Experience
upvoted a paper 2 months ago
LLaVA-Critic-R1: Your Critic Model is Secretly a Strong Policy Model
upvoted a paper 2 months ago
Self-Rewarding Vision-Language Model via Reasoning Decomposition