weixun's picture

9

weixun

weixun

AI & ML interests

None yet

Recent Activity

upvoted a paper 17 days ago

Attention Illuminates LLM Reasoning: The Preplan-and-Anchor Rhythm Enables Fine-Grained Policy Optimization

upvoted a paper 19 days ago

Part II: ROLL Flash -- Accelerating RLVR and Agentic Training with Asynchrony

upvoted a paper about 1 month ago

GEM: A Gym for Agentic LLMs

View all activity

Organizations

None yet

upvoted a paper 17 days ago

Attention Illuminates LLM Reasoning: The Preplan-and-Anchor Rhythm Enables Fine-Grained Policy Optimization

Paper • 2510.13554 • Published 17 days ago • 55

upvoted a paper 19 days ago

Part II: ROLL Flash -- Accelerating RLVR and Agentic Training with Asynchrony

Paper • 2510.11345 • Published 19 days ago • 15

upvoted a paper about 1 month ago

GEM: A Gym for Agentic LLMs

Paper • 2510.01051 • Published Oct 1 • 87

upvoted a paper 2 months ago

Understanding Tool-Integrated Reasoning

Paper • 2508.19201 • Published Aug 26 • 32

upvoted a paper 5 months ago

Reinforcement Learning Optimization for Large-Scale Learning: An Efficient and User-Friendly Scaling Library

Paper • 2506.06122 • Published Jun 6 • 7

upvoted a paper 8 months ago

Can Large Language Models Detect Errors in Long Chain-of-Thought Reasoning?

Paper • 2502.19361 • Published Feb 26 • 28

upvoted a paper 10 months ago

ProgCo: Program Helps Self-Correction of Large Language Models

Paper • 2501.01264 • Published Jan 2 • 26

upvoted a paper 12 months ago

Chinese SimpleQA: A Chinese Factuality Evaluation for Large Language Models

Paper • 2411.07140 • Published Nov 11, 2024 • 35

authored a paper over 1 year ago

OpenRLHF: An Easy-to-use, Scalable and High-performance RLHF Framework

Paper • 2405.11143 • Published May 20, 2024 • 41

upvoted a paper over 1 year ago

OpenRLHF: An Easy-to-use, Scalable and High-performance RLHF Framework

Paper • 2405.11143 • Published May 20, 2024 • 41