Lewis Tunstall's picture

Lewis Tunstall PRO

lewtun

·

https://lewtun.github.io/blog/

AI & ML interests

LLMs, LLMs, LLMs

Recent Activity

upvoted a paper 2 days ago

Supervised Reinforcement Learning: From Expert Trajectories to Step-wise Reasoning

updated a Space 2 days ago

HuggingFaceTB/smol-training-playbook

upvoted an article 3 days ago

3+ Years of ML & Society at Hugging Face 🤗🤝🧑‍🤝‍🧑

View all activity

Organizations

upvoted a paper 2 days ago

Supervised Reinforcement Learning: From Expert Trajectories to Step-wise Reasoning

Paper • 2510.25992 • Published 4 days ago • 27

updated a Space 2 days ago

The Smol Training Playbook: The Secrets to Building World-Class LLMs

upvoted an article 3 days ago

Article

3+ Years of ML & Society at Hugging Face 🤗🤝🧑‍🤝‍🧑

By

and 3 others •

4 days ago

• 13

liked a Space 3 days ago

ML & Society at HF

🤗 machine learning and society team website

upvoted an article 3 days ago

Article

huggingface_hub v1.0: Five Years of Building the Foundation of Open Machine Learning

7 days ago

• 51

liked 2 Spaces 3 days ago

Smol Training Playbook - Table of Contents

The Smol Training Playbook: The Secrets to Building World-Class LLMs

published a Space 3 days ago

The Smol Training Playbook: The Secrets to Building World-Class LLMs

upvoted an article 3 days ago

Article

Aligning to What? Rethinking Agent Generalization in MiniMax M2

By

•

3 days ago

• 15

published a dataset 3 days ago

HuggingFaceTB/OpenR1-Math-220k-default-verified

Viewer • Updated 26 days ago • 105k • 325

liked a dataset 3 days ago

neulab/agent-data-collection

Viewer • Updated Sep 9 • 225k • 3.62k • 48

liked a Space 4 days ago

Unlocking On-Policy Distillation for Any Model Family

upvoted a collection 4 days ago

gpt-oss-safeguard

gpt-oss-safeguard-120b and gpt-oss-safeguard-20b are safety reasoning models built-upon gpt-oss • 2 items • Updated 4 days ago • 51

upvoted a paper 4 days ago

Towards Cross-Tokenizer Distillation: the Universal Logit Distillation Loss for LLMs

Paper • 2402.12030 • Published Feb 19, 2024 • 3

upvoted a paper 5 days ago

Llama 2: Open Foundation and Fine-Tuned Chat Models

Paper • 2307.09288 • Published Jul 18, 2023 • 246

liked a dataset 5 days ago

HuggingFaceFW/finewiki

Viewer • Updated 11 days ago • 61.6M • 12.2k • 189

liked a model 5 days ago

OpenFold/OpenFold3

Updated 1 day ago • 23

updated a dataset 5 days ago

HuggingFaceTB/post-training-benchmarks-viewer

Viewer • Updated 5 days ago • 45 • 43

published a dataset 5 days ago

HuggingFaceTB/post-training-benchmarks-viewer

Viewer • Updated 5 days ago • 45 • 43

upvoted a paper 6 days ago

Huxley-Gödel Machine: Human-Level Coding Agent Development by an Approximation of the Optimal Self-Improving Machine

Paper • 2510.21614 • Published 9 days ago • 17