Konstantin Grotov

konstantgr

AI & ML interests

None yet

Recent Activity

upvoted a paper about 12 hours ago

The Best of N Worlds: Aligning Reinforcement Learning with Best-of-N Sampling via max@k Optimisation

authored a paper 27 days ago

PIPer: On-Device Environment Setup via Online Reinforcement Learning

new activity 28 days ago

JetBrains-Research/PIPer-8B-RL-only:Improve model card: Add paper and code badges, update datasets metadata

Papers 1

arxiv:2509.25455

Models 4

konstantgr/mnp-model-mistralai-Mistral-7B-Instruct-v0.2

Text Generation • 4B • Updated Jul 5, 2024

konstantgr/mnp-model-google-gemma-2b

Text Generation • 2B • Updated Jul 5, 2024 • 1

konstantgr/custom-goldfish-gpt2-experiment-roneneldan-TinyStories-100-seeded_random-0.001

Text Generation • 0.1B • Updated Jul 3, 2024

konstantgr/custom-goldfish-gpt2-experiment-roneneldan-TinyStories-30-hash-avalanche-0.001

Text Generation • 0.1B • Updated Jul 3, 2024

Datasets 2

konstantgr/themisto

Viewer • Updated Apr 15 • 1.45k • 35

konstantgr/method-name-prediction

Viewer • Updated Jul 5, 2024 • 306k • 48