Linfeng Song
freesunshine0316
		ยท
				AI & ML interests
Researcher @Tencent  AI Lab working on reasoning and RLAIF with LLM, especially search + RL. Working on NLP since 2010.
		Recent Activity
						upvoted 
								a
								paper
							
						11 days ago
						
					
						
						
						Every Question Has Its Own Value: Reinforcement Learning with Explicit
  Human Values
						
						upvoted 
								a
								paper
							
						about 1 month ago
						
					
						
						
						VOGUE: Guiding Exploration with Visual Uncertainty Improves Multimodal
  Reasoning
						
						upvoted 
								a
								paper
							
						about 1 month ago
						
					
						
						
						CLUE: Non-parametric Verification from Experience via Hidden-State
  Clustering