TreeBoN: Enhancing Inference-Time Alignment with Speculative Tree-Search and Best-of-N Sampling • Paper • 2410.16033 • Published Oct 18, 2024
A Common Pitfall of Margin-based Language Model Alignment: Gradient Entanglement • Paper • 2410.13828 • Published Oct 17, 2024
LLM-RankFusion: Mitigating Intrinsic Inconsistency in LLM-based Ranking • Paper • 2406.00231 • Published May 31, 2024
AutoDefense: Multi-Agent LLM Defense against Jailbreak Attacks • Paper • 2403.04783 • Published Mar 2, 2024