Yanxiao Zhao's picture

4 15 8

Yanxiao Zhao

sdpkjc

·

https://sdpkjc.me

AI & ML interests

Reinforcement Learning

Recent Activity

updated a dataset about 23 hours ago

TheFactoryX/edition_0001_Rowan-hellaswag-readymade

published a dataset about 23 hours ago

TheFactoryX/edition_0001_Rowan-hellaswag-readymade

updated a dataset about 23 hours ago

TheFactoryX/edition_0000_fancyzhx-ag_news-readymade

View all activity

Organizations

authored 3 papers about 2 months ago

ComputerRL: Scaling End-to-End Online Reinforcement Learning for Computer Use Agents

Paper • 2508.14040 • Published Aug 19 • 3

SATQuest: A Verifier for Logical Reasoning Evaluation and Reinforcement Fine-Tuning of LLMs

Paper • 2509.00930 • Published Aug 31 • 4

CAMEL: Continuous Action Masking Enabled by Large Language Models for Reinforcement Learning

Paper • 2502.11896 • Published Feb 17

authored 2 papers over 1 year ago

Open RL Benchmark: Comprehensive Tracked Experiments for Reinforcement Learning

Paper • 2402.03046 • Published Feb 5, 2024 • 7

Snapshot Reinforcement Learning: Leveraging Prior Trajectories for Efficiency

Paper • 2403.00673 • Published Mar 1, 2024 • 1