view article Article huggingface_hub v1.0: Five Years of Building the Foundation of Open Machine Learning 2 days ago • 30
ImpossibleBench Datasets Collection Datasets constructed in ImpossibleBench https://arxiv.org/abs/2510.20270 • 2 items • Updated 5 days ago • 1
Nigeria Energy Sector Collection A collection of datasets across Nigeria's energy sector. • 35 items • Updated 17 days ago • 8
Amon Dîn Collection A collection of datasets from the Nigerian Telecommunications Sector • 34 items • Updated 22 days ago • 1
Tiny Language Model Datasets Collection Collection of Synthetic Datasets that can be used in pretraining of any the Tiny Language Model • 14 items • Updated Sep 21 • 29
view article Article Announcing the Synthetic Online Conversations Dataset (SOC) By marcodsn • Aug 12 • 11
MolmoAct Data Mixture Collection All datasets for the MolmoAct (Multimodal Open Language Model for Action) release. • 4 items • Updated Sep 6 • 15
view article Article Introducing AI Sheets: a tool to work with datasets using open AI models! Aug 8 • 101
view article Article What Open-Source Developers Need to Know about the EU AI Act's Rules for GPAI Models By yjernite and 5 others • Aug 4 • 28
view article Article Introducing Command A Vision: Multimodal AI built for Business By CohereLabs and 3 others • Jul 31 • 63