Massimo Caccia

optimass

https://optimass.github.io/

AI & ML interests

None yet

Recent Activity

liked a model 28 days ago

ServiceNow-AI/Apriel-1.5-15b-Thinker

Image-Text-to-Text • 15B • Updated 25 days ago • 56.7k • 425

upvoted 2 papers about 2 months ago

A Survey of Reinforcement Learning for Large Reasoning Models

Paper • 2509.08827 • Published Sep 10 • 184

The Landscape of Agentic Reinforcement Learning for LLMs: A Survey

Paper • 2509.02547 • Published Sep 2 • 219

commented a paper 3 months ago

How to Train Your LLM Web Agent: A Statistical Diagnosis

Paper • 2507.04103 • Published Jul 5 • 50

upvoted an article 4 months ago

How to Train Your LLM Web Agent: A Statistical Diagnosis

Article • Jul 8 • 14

liked a Space 4 months ago

2.84k

Anycoder

🏢

Generate Gradio app code from descriptions

upvoted a paper 4 months ago

How to Train Your LLM Web Agent: A Statistical Diagnosis

Paper • 2507.04103 • Published Jul 5 • 50

upvoted an article 5 months ago

GRPO for GUI Grounding Done Right

Article • Jun 11 • 34

upvoted an article 6 months ago

PipelineRL

Article • and 3 others • Apr 25 • 38

liked a dataset 9 months ago

ServiceNow-AI/R1-Distill-SFT

Viewer • Updated Feb 8 • 1.85M • 1.09k • 307

updated 3 models 10 months ago

updated a model 11 months ago

web-agent/llama3.3_70b-nnetnav_rft_gpt4o_orm_start_1200

Updated Dec 18, 2024

authored 3 papers 11 months ago

WorkArena++: Towards Compositional Planning and Reasoning-based Common Knowledge Work Tasks

Paper • 2407.05291 • Published Jul 7, 2024 • 2

GitChameleon: Unmasking the Version-Switching Capabilities of Code Generation Models

Paper • 2411.05830 • Published Nov 5, 2024 • 21

The BrowserGym Ecosystem for Web Agent Research

Paper • 2412.05467 • Published Dec 6, 2024 • 22

upvoted a paper 11 months ago

The BrowserGym Ecosystem for Web Agent Research

Paper • 2412.05467 • Published Dec 6, 2024 • 22

upvoted a paper 12 months ago

GitChameleon: Unmasking the Version-Switching Capabilities of Code Generation Models

Paper • 2411.05830 • Published Nov 5, 2024 • 21

commented a paper 12 months ago

GitChameleon: Unmasking the Version-Switching Capabilities of Code Generation Models

Paper • 2411.05830 • Published Nov 5, 2024 • 21
