Agent-RLVR: Training Software Engineering Agents via Guidance and Environment Rewards • Paper 2506.11425 • Published Jun 13