arxiv:2501.11425
Zhiheng Xi
WooooDyy
AI & ML interests
None yet
Recent Activity
commented on
a paper
about 23 hours ago
Critique-RL: Training Language Models for Critiquing through Two-Stage
Reinforcement Learning
commented on
a paper
1 day ago
Critique-RL: Training Language Models for Critiquing through Two-Stage
Reinforcement Learning