arxiv:2509.23352
							
fuxiaolong
fxlong
AI & ML interests
None yet
Recent Activity
liked a model 14 days ago
Qwen/Qwen3-VL-8B-Instruct
authored a paper 15 days ago
Dynamic-TreeRPO: Breaking the Independent Trajectory Bottleneck with Structured Sampling