1 21 26

Emmanuel Sugutt

Sugutt

sugutt_
sugutt

AI & ML interests

Reinforcement learning Transformer models

Organizations

models 8

Sugutt/whisper-kalenjin-large

Updated Aug 28

Sugutt/whisper-small-hi

Updated Aug 5

Sugutt/finmap-expense-cat-model

0.1B • Updated Apr 23 • 1

Sugutt/finbert-expense-categorization

Text Classification • 0.1B • Updated Mar 25 • 1

Sugutt/Taxi-V3

Reinforcement Learning • Updated Mar 18

Sugutt/q-FrozenLake-v1-4x4-noSlippery

Reinforcement Learning • Updated Mar 18

Sugutt/ppo-Huggy

Reinforcement Learning • Updated Mar 6 • 8

Sugutt/lunarlander

Reinforcement Learning • Updated Jun 8, 2023 • 1

datasets 0

None public yet

Emmanuel Sugutt

AI & ML interests

Organizations

Collections 3

DuPO: Enabling Reliable LLM Self-Verification via Dual Preference Optimization

MEML-GRPO: Heterogeneous Multi-Expert Mutual Learning for RLVR Advancement

URPO: A Unified Reward & Policy Optimization Framework for Large Language Models

Grove MoE: Towards Efficient and Superior MoE LLMs with Adjugate Experts

MoBE: Mixture-of-Basis-Experts for Compressing MoE-based LLMs

SmallThinker: A Family of Efficient Large Language Models Natively Trained for Local Deployment

MiniCPM4: Ultra-Efficient LLMs on End Devices

DuPO: Enabling Reliable LLM Self-Verification via Dual Preference Optimization

MEML-GRPO: Heterogeneous Multi-Expert Mutual Learning for RLVR Advancement

URPO: A Unified Reward & Policy Optimization Framework for Large Language Models

Grove MoE: Towards Efficient and Superior MoE LLMs with Adjugate Experts

MoBE: Mixture-of-Basis-Experts for Compressing MoE-based LLMs

SmallThinker: A Family of Efficient Large Language Models Natively Trained for Local Deployment

MiniCPM4: Ultra-Efficient LLMs on End Devices

spaces 1

Mistralai Mixtral 8x7B Instruct V0.1

models 8

Sugutt/whisper-kalenjin-large

Sugutt/whisper-small-hi

Sugutt/finmap-expense-cat-model

Sugutt/finbert-expense-categorization

Sugutt/Taxi-V3

Sugutt/q-FrozenLake-v1-4x4-noSlippery

Sugutt/ppo-Huggy

Sugutt/lunarlander

datasets 0

Emmanuel Sugutt

AI & ML interests

Organizations

Collections 3

spaces 1

Mistralai Mixtral 8x7B Instruct V0.1

models 8 Sort: Recently updated

datasets 0

models 8