Datasets and models associated with the paper "Large-Scale Data Selection for Instruction Tuning" (https://arxiv.org/abs/2503.01807)
Hamish Ivison
hamishivi
AI & ML interests
NLP :)
Recent Activity
updated
a dataset
about 4 hours ago
hamishivi/olmo_msgs_thinker
published
a dataset
about 4 hours ago
hamishivi/olmo_msgs_thinker
updated
a model
about 5 hours ago
hamishivi/2010_rl_rag_NAR8_testing64_gpt5_under_trained_step300
Organizations
Tulu 2 Llama 3 Update
Llama 3 models trained on the tulu dataset, following https://arxiv.org/abs/2311.10702 (tulu 2) and https://arxiv.org/abs/2406.09279 (tulu 2.5).
Tulu V2 Suite
The set of models associated with the Tulu V2 technical report.
LM Preference Datasets
TESS 2
Models associated with the paper "TESS-2: A Large-Scale, Generalist Diffusion Language Model". Code: https://github.com/hamishivi/tess-2
7b tulu 2.5
a small run at 7b scale with ppo, following the unpacking dpo and ppo paper.
Tulu V1 Suite
The set of models associated with the paper "How Far Can Camels Go? Exploring the State of Instruction Tuning on Open Resources".
Large-Scale Data Selection for Instruction Tuning
Datasets and models associated with the paper "Large-Scale Data Selection for Instruction Tuning" (https://arxiv.org/abs/2503.01807)
TESS 2
Models associated with the paper "TESS-2: A Large-Scale, Generalist Diffusion Language Model". Code: https://github.com/hamishivi/tess-2
Tulu 2 Llama 3 Update
Llama 3 models trained on the tulu dataset, following https://arxiv.org/abs/2311.10702 (tulu 2) and https://arxiv.org/abs/2406.09279 (tulu 2.5).
7b tulu 2.5
a small run at 7b scale with ppo, following the unpacking dpo and ppo paper.
Tulu V2 Suite
The set of models associated with the Tulu V2 technical report.
Tulu V1 Suite
The set of models associated with the paper "How Far Can Camels Go? Exploring the State of Instruction Tuning on Open Resources".
LM Preference Datasets