Shaobai Jiang
shaobaij
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 12 hours ago
Supervised Reinforcement Learning: From Expert Trajectories to Step-wise
Reasoning
upvoted
a
paper
2 days ago
ImpossibleBench: Measuring LLMs' Propensity of Exploiting Test Cases
upvoted
a
paper
3 days ago
ReasonIF: Large Reasoning Models Fail to Follow Instructions During
Reasoning
Organizations
None yet