Lize Pirenne's picture

257 21

Lize Pirenne

Inversta

·

Pangasius

AI & ML interests

LLMs, RL

Recent Activity

upvoted a paper 17 days ago

Reinforcement Learning on Pre-Training Data

upvoted a paper 17 days ago

A Survey of Reinforcement Learning for Large Reasoning Models

upvoted a paper 17 days ago

Sharing is Caring: Efficient LM Post-Training with Collective RL Experience Sharing

View all activity

Organizations

None yet

Inversta 's datasets 1

Inversta/rationale-databricks-dolly-cqa

Viewer • Updated Nov 29, 2024 • 1.6k • 19 • 1