Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Jiang's picture
4 6

Jiang

Louieworth
·

AI & ML interests

None yet

Recent Activity

upvoted a paper 6 days ago
Rubrics as Rewards: Reinforcement Learning Beyond Verifiable Domains
upvoted a paper 15 days ago
Which Heads Matter for Reasoning? RL-Guided KV Cache Compression
upvoted a paper 23 days ago
StockBench: Can LLM Agents Trade Stocks Profitably In Real-world Markets?
View all activity

Organizations

None yet

models 0

None public yet

datasets 2

Louieworth/Hummer

Viewer • Updated Aug 7, 2024 • 46.2k • 3

Louieworth/hh-rlhf-trl-style

Viewer • Updated Apr 20, 2024 • 100 • 1
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs