Less is More: Recursive Reasoning with Tiny Networks Paper • 2510.04871 • Published 22 days ago • 452
Cache-to-Cache: Direct Semantic Communication Between Large Language Models Paper • 2510.03215 • Published 25 days ago • 93
When Thoughts Meet Facts: Reusable Reasoning for Long-Context LMs Paper • 2510.07499 • Published 20 days ago • 46
StreamingVLM: Real-Time Understanding for Infinite Video Streams Paper • 2510.09608 • Published 18 days ago • 49
LiteStage: Latency-aware Layer Skipping for Multi-stage Reasoning Paper • 2510.14211 • Published 13 days ago • 6
Every Attention Matters: An Efficient Hybrid Architecture for Long-Context Reasoning Paper • 2510.19338 • Published 6 days ago • 97
LightMem: Lightweight and Efficient Memory-Augmented Generation Paper • 2510.18866 • Published 7 days ago • 105
Glyph: Scaling Context Windows via Visual-Text Compression Paper • 2510.17800 • Published 8 days ago • 61