Yang Su

yang-su2000

https://alicellm.github.io/

AI & ML interests

Long-Horizon RL Agent Alignment

Recent Activity

liked a dataset 23 days ago

Agent-Ark/Toucan-1.5M

new activity 6 months ago

Qwen/Qwen3-32B:The correct way of fine-tuning on multi-turn trajectories

new activity 6 months ago

Qwen/Qwen3-235B-A22B:Qwen3 not Using Tools in Complex Prompts Unlike QwQ-32B

Collections 1

Papers 1

arxiv:2412.15115

Models 0

None public yet

Datasets 0

None public yet