LLM-Powered Hierarchical Language Agent for Real-time Human-AI Coordination Paper • 2312.15224 • Published Dec 23, 2023
Learning from Suboptimal Data in Continuous Control via Auto-Regressive Soft Q-Network Paper • 2502.00288 • Published Feb 1
What Can RL Bring to VLA Generalization? An Empirical Study Paper • 2505.19789 • Published May 26 • 1