LimRank: Less is More for Reasoning-Intensive Information Reranking Paper • 2510.23544 • Published 2 days ago • 8
ARC-Encoder: learning compressed text representations for large language models Paper • 2510.20535 • Published 6 days ago • 5
Reasoning with Sampling: Your Base Model is Smarter Than You Think Paper • 2510.14901 • Published 13 days ago • 41
Directional Reasoning Injection for Fine-Tuning MLLMs Paper • 2510.15050 • Published 13 days ago • 10
LoongRL: Reinforcement Learning for Advanced Reasoning over Long Contexts Paper • 2510.19363 • Published 7 days ago • 57
Every Attention Matters: An Efficient Hybrid Architecture for Long-Context Reasoning Paper • 2510.19338 • Published 7 days ago • 99
BAPO: Stabilizing Off-Policy Reinforcement Learning for LLMs via Balanced Policy Optimization with Adaptive Clipping Paper • 2510.18927 • Published 8 days ago • 79
ssToken: Self-modulated and Semantic-aware Token Selection for LLM Fine-tuning Paper • 2510.18250 • Published 8 days ago • 12
Every Step Evolves: Scaling Reinforcement Learning for Trillion-Scale Thinking Model Paper • 2510.18855 • Published 8 days ago • 60
Balanced Multi-Task Attention for Satellite Image Classification: A Systematic Approach to Achieving 97.23% Accuracy on EuroSAT Without Pre-Training Paper • 2510.15527 • Published 12 days ago • 2
DLER: Doing Length pEnalty Right - Incentivizing More Intelligence per Token via Reinforcement Learning Paper • 2510.15110 • Published 13 days ago • 15
MorphoBench: A Benchmark with Difficulty Adaptive to Model Reasoning Paper • 2510.14265 • Published 13 days ago • 19
Attention Is All You Need for KV Cache in Diffusion LLMs Paper • 2510.14973 • Published 13 days ago • 36
LaSeR: Reinforcement Learning with Last-Token Self-Rewarding Paper • 2510.14943 • Published 13 days ago • 37
WithAnyone: Towards Controllable and ID Consistent Image Generation Paper • 2510.14975 • Published 13 days ago • 79