The Alignment Waltz: Jointly Training Agents to Collaborate for Safety Paper • 2510.08240 • Published 19 days ago • 41
Large Reasoning Models Learn Better Alignment from Flawed Thinking Paper • 2510.00938 • Published 27 days ago • 57