zELO: ELO-inspired Training Method for Rerankers and Embedding Models Paper β’ 2509.12541 β’ Published Sep 16 β’ 4
DINOv3 Collection DINOv3: foundation models producing excellent dense features, outperforming SotA w/o fine-tuning - https://arxiv.org/abs/2508.10104 β’ 13 items β’ Updated Aug 21 β’ 367
Qwen3 Collection Qwen's new Qwen3 models. In Unsloth Dynamic 2.0, GGUF, 4-bit and 16-bit Safetensor formats. Includes 128K Context Length variants. β’ 79 items β’ Updated 3 days ago β’ 227
π§ SmolLM3 Collection Smol, multilingual, long-context reasoner β’ 14 items β’ Updated 25 days ago β’ 83
view article Article Accelerating LLM Code Generation Through Mask Store Streamlining By vivien β’ Jan 17 β’ 3
Utilities Collection No crazy stuff, but useful ones for in-between steps β’ 16 items β’ Updated Mar 19 β’ 7
π¦π Useful Tiny Video Converters Collection All spaces made to convert a video (of GIFs) to anything useful in your pipelines β’ 5 items β’ Updated Oct 3, 2024 β’ 7
view article Article Interactive Tools for machine learning, deep learning, and math By Suzana β’ May 26 β’ 47
D-FINE Collection State-of-the-art real-time object detection model with Apache 2.0 licence β’ 15 items β’ Updated May 5 β’ 55
Gemma 3 QAT Collection Quantization Aware Trained (QAT) Gemma 3 checkpoints. The model preserves similar quality as half precision while using 3x less memory β’ 15 items β’ Updated Jul 10 β’ 209
Mem0: Building Production-Ready AI Agents with Scalable Long-Term Memory Paper β’ 2504.19413 β’ Published Apr 28 β’ 28