2 10 2

Yuqian Fu

Yuqian-Fu

AI & ML interests

None yet

Recent Activity

upvoted a paper about 1 month ago

MCPMark: A Benchmark for Stress-Testing Realistic and Comprehensive MCP Use

updated a collection about 1 month ago

SRFT

updated a collection about 1 month ago

SRFT

View all activity

Organizations

None yet

upvoted a paper about 1 month ago

MCPMark: A Benchmark for Stress-Testing Realistic and Comprehensive MCP Use

Paper • 2509.24002 • Published Sep 28 • 170

updated a collection about 1 month ago

SRFT

Collection

5 items • Updated Sep 28

published 2 models about 1 month ago

Yuqian-Fu/SRFT-Qwen2.5-Math-1.5B

2B • Updated Jul 24 • 2

Yuqian-Fu/SRFT-Qwen2.5-7B-Instruct

8B • Updated Jul 24 • 2

upvoted a paper about 2 months ago

The Illusion of Diminishing Returns: Measuring Long Horizon Execution in LLMs

Paper • 2509.09677 • Published Sep 11 • 34

authored a paper about 2 months ago

Harnessing Uncertainty: Entropy-Modulated Policy Gradients for Long-Horizon LLM Agents

Paper • 2509.09265 • Published Sep 11 • 45

upvoted 3 papers about 2 months ago

Harnessing Uncertainty: Entropy-Modulated Policy Gradients for Long-Horizon LLM Agents

Paper • 2509.09265 • Published Sep 11 • 45

DeepResearch Arena: The First Exam of LLMs' Research Abilities via Seminar-Grounded Tasks

Paper • 2509.01396 • Published Sep 1 • 56

Inverse IFEval: Can LLMs Unlearn Stubborn Training Conventions to Follow Real Instructions?

Paper • 2509.04292 • Published Sep 4 • 57

upvoted a paper 2 months ago

SimpleTIR: End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning

Paper • 2509.02479 • Published Sep 2 • 83

upvoted a paper 3 months ago

WideSearch: Benchmarking Agentic Broad Info-Seeking

Paper • 2508.07999 • Published Aug 11 • 109

liked a Space 3 months ago

3.4k

The Ultra-Scale Playbook

🌌

The ultimate guide to training LLM on large GPU Clusters

upvoted a paper 3 months ago

SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning

Paper • 2506.24119 • Published Jun 30 • 50

updated 3 models 3 months ago

upvoted a paper 4 months ago

A Survey of Context Engineering for Large Language Models

Paper • 2507.13334 • Published Jul 17 • 258

liked a dataset 4 months ago

open-thoughts/OpenThoughts-114k

Viewer • Updated Aug 31 • 228k • 49.9k • 770

authored a paper 4 months ago

RLAE: Reinforcement Learning-Assisted Ensemble for LLMs

Paper • 2506.00439 • Published May 31 • 1

Yuqian Fu

AI & ML interests

Recent Activity

Organizations

Yuqian-Fu's activity

The Ultra-Scale Playbook