Exploring smol models (for text, vision and video) and high quality web and synthetic datasets
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model