MoSLD: An Extremely Parameter-Efficient Mixture-of-Shared LoRAs for Multi-Task Learning Paper • 2412.08946 • Published Dec 12, 2024
AgentRefine: Enhancing Agent Generalization through Refinement Tuning Paper • 2501.01702 • Published Jan 3
On the Perception Bottleneck of VLMs for Chart Understanding Paper • 2503.18435 • Published Mar 24 • 1
Pitfalls of Rule- and Model-based Verifiers -- A Case Study on Mathematical Reasoning Paper • 2505.22203 • Published May 28 • 6
CareBot: A Pioneering Full-Process Open-Source Medical Language Model Paper • 2412.15236 • Published Dec 12, 2024 • 1
The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic, and Long-Horizon Task Execution Paper • 2510.25726 • Published 5 days ago • 42
SimpleRL-Zoo: Investigating and Taming Zero Reinforcement Learning for Open Base Models in the Wild Paper • 2503.18892 • Published Mar 24 • 31
B-STaR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners Paper • 2412.17256 • Published Dec 23, 2024 • 47
What Makes Good Data for Alignment? A Comprehensive Study of Automatic Data Selection in Instruction Tuning Paper • 2312.15685 • Published Dec 25, 2023 • 16
DolphCoder: Echo-Locating Code Large Language Models with Diverse and Multi-Objective Instruction Tuning Paper • 2402.09136 • Published Feb 14, 2024 • 1
CS-Bench: A Comprehensive Benchmark for Large Language Models towards Computer Science Mastery Paper • 2406.08587 • Published Jun 12, 2024 • 16
Aqulia-Med LLM: Pioneering Full-Process Open-Source Medical Language Models Paper • 2406.12182 • Published Jun 18, 2024
Automatic Instruction Evolving for Large Language Models Paper • 2406.00770 • Published Jun 2, 2024 • 2
FutureTOD: Teaching Future Knowledge to Pre-trained Language Model for Task-Oriented Dialogue Paper • 2306.10315 • Published Jun 17, 2023 • 1
Semi-Supervised Knowledge-Grounded Pre-training for Task-Oriented Dialog Systems Paper • 2210.08873 • Published Oct 17, 2022 • 1