Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
lomahony 's Collections
Pythia-hh-all-sft-dpo
pythia-helpful-1epoch
pythia-helpful-epoch2
Pythia-helpful 3 epochs

pythia-helpful-1epoch

updated Mar 12, 2024

Pythia-2.8b supervised finetuned and DPO finetuned with the helpful subset of Anthropic-hh-rlhf dataset for 1 epoch.

Upvote
-

  • lomahony/pythia-410m-helpful-dpo

    Text Generation • Updated May 14, 2024 • 5

  • lomahony/pythia-2.8b-helpful-sft

    Text Generation • 3B • Updated May 14, 2024 • 6

  • lomahony/pythia-160m-helpful-sft

    Text Generation • 0.2B • Updated Nov 13, 2024 • 6

  • lomahony/pythia-70m-helpful-sft

    Text Generation • 70.4M • Updated Jan 20 • 10

  • lomahony/pythia-1.4b-helpful-sft

    Text Generation • 1B • Updated May 21 • 4

  • lomahony/pythia-1b-helpful-sft

    Text Generation • 1B • Updated Nov 26, 2024 • 7

  • lomahony/pythia-410m-helpful-sft

    Text Generation • 0.4B • Updated Jan 20 • 6

  • lomahony/pythia-2.8b-helpful-dpo

    Text Generation • Updated May 14, 2024 • 7

  • lomahony/pythia-1.4b-helpful-dpo

    Text Generation • Updated May 14, 2024 • 5

  • lomahony/pythia-160m-helpful-dpo

    Text Generation • Updated May 14, 2024 • 4

  • lomahony/pythia-70m-helpful-dpo

    Text Generation • Updated May 14, 2024 • 3

  • lomahony/pythia-1b-helpful-dpo

    Text Generation • Updated May 14, 2024 • 3
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs