Yang Su
yang-su2000
AI & ML interests
Long-Horizon RL Agent Alignment
Recent Activity
liked a dataset 23 days ago: Agent-Ark/Toucan-1.5M
new activity 6 months ago: Qwen/Qwen3-32B: The correct way of fine-tuning on multi-turn trajectories
new activity 6 months ago: Qwen/Qwen3-235B-A22B: Qwen3 not Using Tools in Complex Prompts Unlike QwQ-32B

