Scaling Laws Meet Model Architecture: Toward Inference-Efficient LLMs Paper • 2510.18245 • Published 8 days ago • 6
D2E: Scaling Vision-Action Pretraining on Desktop Data for Transfer to Embodied AI Paper • 2510.05684 • Published 22 days ago • 133
Less is More: Recursive Reasoning with Tiny Networks Paper • 2510.04871 • Published 23 days ago • 455
GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection Paper • 2403.03507 • Published Mar 6, 2024 • 189
💧 LFM2 Collection LFM2 is a new generation of hybrid models, designed for on-device deployment. • 21 items • Updated 7 days ago • 114
NeuralOS: Towards Simulating Operating Systems via Neural Generative Models Paper • 2507.08800 • Published Jul 11 • 79
SnowflakeCore G1 Pre-Train Collection The base models of G1. All the Snowflake models are fully pre-train, not fine-tune of a pre-existing model. • 2 items • Updated Sep 2 • 1