Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
gabrielbo 's Collections
SmartRouter
SPaRK-RL

SPaRK-RL

updated Jun 17

combines reinforcement learning (RL) and large language models (LLMs) to improve exploration using diverse tool generation during inference

Upvote
1

  • gabrielbo/explore-rl-hotpota-trajectories

    Updated May 9 • 3

  • gabrielbo/swirl-trajectories-mmlu-pro

    Viewer • Updated May 20 • 24.8k • 19 • 2

  • gabrielbo/spark-model-QLoRA

    Text Generation • Updated May 24 • 1
Upvote
1
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs