Lize Pirenne
Inversta
2 followers · 1 following
Pangasius
AI & ML interests
LLMs, RL
Recent Activity
Upvoted a paper 17 days ago: Reinforcement Learning on Pre-Training Data
Upvoted a paper 17 days ago: A Survey of Reinforcement Learning for Large Reasoning Models
Upvoted a paper 17 days ago: Sharing is Caring: Efficient LM Post-Training with Collective RL Experience Sharing
Organizations
None yet
Inversta's datasets (1)
Inversta/rationale-databricks-dolly-cqa
Viewer • Updated Nov 29, 2024 • 1.6k • 19 • 1