Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
bcywinski 's Collections
Eliciting Secret Knowledge from Language Models
llama-3.3-70B-Instruct-ssc
gemma-2-9b-it-user-gender
gemma-2-9b-it-taboo

Eliciting Secret Knowledge from Language Models

updated Oct 2

https://arxiv.org/abs/2510.01070

Upvote
-

  • llama-3.3-70B-Instruct-ssc

    Collection
    2 items • Updated Sep 30

  • gemma-2-9b-it-user-gender

    Collection
    6 items • Updated Sep 30 • 1

  • gemma-2-9b-it-taboo

    Collection
    Data and Taboo models trained for arxiv.org/abs/2505.14352 • 41 items • Updated Sep 30 • 1

  • Eliciting Secret Knowledge from Language Models

    Paper • 2510.01070 • Published Oct 1 • 4
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs