Glyph: Scaling Context Windows via Visual-Text Compression • Paper • 2510.17800 • Published 13 days ago • 64
AgentRL: Scaling Agentic Reinforcement Learning with a Multi-Turn, Multi-Task Framework • Paper • 2510.04206 • Published 28 days ago • 2
ComputerRL: Scaling End-to-End Online Reinforcement Learning for Computer Use Agents • Paper • 2508.14040 • Published Aug 19 • 3
SPaR: Self-Play with Tree-Search Refinement to Improve Instruction-Following in Large Language Models • Paper • 2412.11605 • Published Dec 16, 2024 • 18
ImageReward: Learning and Evaluating Human Preferences for Text-to-Image Generation • Paper • 2304.05977 • Published Apr 12, 2023 • 3
Black-Box Prompt Optimization: Aligning Large Language Models without Model Training • Paper • 2311.04155 • Published Nov 7, 2023 • 1