arxiv:2410.02115
Juntao Li
ljtsuda
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
2 days ago
FAPO: Flawed-Aware Policy Optimization for Efficient and Reliable
Reasoning
upvoted
a
paper
10 months ago
Test-time Computing: from System-1 Thinking to System-2 Thinking
upvoted
a
paper
about 1 year ago
Unleashing Reasoning Capability of LLMs via Scalable Question Synthesis
from Scratch
Organizations
None yet