Singh

joykirat

AI & ML interests

None yet

Recent Activity

updated a model about 1 month ago

joykirat/DeepSeek-R1-Distill-Qwen-7B-TRAAC

published a model about 1 month ago

joykirat/DeepSeek-R1-Distill-Qwen-7B-TRAAC

updated a model about 1 month ago

joykirat/Qwen3-4B-TRAAC

Organizations

None yet

authored 4 papers 6 months ago

Exposing the Achilles' Heel: Evaluating LLMs Ability to Handle Mistakes in Mathematical Reasoning

Paper • 2406.10834 • Published Jun 16, 2024

PromptWizard: Task-Aware Prompt Optimization Framework

Paper • 2405.18369 • Published May 28, 2024 • 1

Self-Evolved Preference Optimization for Enhancing Mathematical Reasoning in Small Language Models

Paper • 2503.04813 • Published Mar 4 • 1

Agentic Reasoning and Tool Integration for LLMs via Reinforcement Learning

Paper • 2505.01441 • Published Apr 28 • 39