Pico-Banana-400K: A Large-Scale Dataset for Text-Guided Image Editing Paper • 2510.19808 • Published 6 days ago • 24
Insights into DeepSeek-V3: Scaling Challenges and Reflections on Hardware for AI Architectures Paper • 2505.09343 • Published May 14 • 71
QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning for LLMs Paper • 2510.11696 • Published 15 days ago • 168
Advancing Medical Representation Learning Through High-Quality Data Paper • 2503.14377 • Published Mar 18 • 3
OmniRetarget: Interaction-Preserving Data Generation for Humanoid Whole-Body Loco-Manipulation and Scene Interaction Paper • 2509.26633 • Published 28 days ago • 5
Watch and Learn: Learning to Use Computers from Online Videos Paper • 2510.04673 • Published 22 days ago • 10
DC-VideoGen: Efficient Video Generation with Deep Compression Video Autoencoder Paper • 2509.25182 • Published 29 days ago • 36
BroRL: Scaling Reinforcement Learning via Broadened Exploration Paper • 2510.01180 • Published 27 days ago • 17
ChronoEdit: Towards Temporal Reasoning for Image Editing and World Simulation Paper • 2510.04290 • Published 23 days ago • 10
Less is More: Recursive Reasoning with Tiny Networks Paper • 2510.04871 • Published 22 days ago • 453
TOUCAN: Synthesizing 1.5M Tool-Agentic Data from Real-World MCP Environments Paper • 2510.01179 • Published 27 days ago • 24
StockBench: Can LLM Agents Trade Stocks Profitably In Real-world Markets? Paper • 2510.02209 • Published 26 days ago • 51