Exposing the Achilles' Heel: Evaluating LLMs Ability to Handle Mistakes in Mathematical Reasoning Paper • 2406.10834 • Published Jun 16, 2024
PromptWizard: Task-Aware Prompt Optimization Framework Paper • 2405.18369 • Published May 28, 2024
Self-Evolved Preference Optimization for Enhancing Mathematical Reasoning in Small Language Models Paper • 2503.04813 • Published Mar 4
Agentic Reasoning and Tool Integration for LLMs via Reinforcement Learning Paper • 2505.01441 • Published Apr 28