Hugging Face

RewardHacking

Activity Feed

AI & ML interests

None defined yet.

Recent Activity

tongliuphysics  authored a paper 22 days ago
Temperature-scaling surprisal estimates improve fit to human reading times -- but does it do so for the "right reasons"?
tongliuphysics  authored a paper 22 days ago
FocalPO: Enhancing Preference Optimizing by Focusing on Correct Preference Rankings
tongliuphysics  authored a paper about 1 year ago
Multimodal Pragmatic Jailbreak on Text-to-image Models

models 0

None public yet

datasets 0

None public yet