view article Article Supercharge Edge AI With High‑Accuracy Reasoning Using NVIDIA Nemotron Nano 2 9B By nvidia and 9 others • Aug 18 • 30
view article Article NVIDIA Releases 6 Million Multi-Lingual Reasoning Dataset By nvidia and 4 others • Aug 20 • 18
view article Article Yay! Organizations can now publish blog Articles By huggingface and 3 others • Jan 20 • 52
view article Article From Zero to GPU: A Guide to Building and Scaling Production-Ready CUDA Kernels Aug 18 • 83
view article Article Introducing AutoRound: Intel’s Advanced Quantization for LLMs and VLMs Apr 29 • 40
Tiny dummy models Collection Randomly initialized tiny models for debugging/testing purpose • 133 items • Updated 11 days ago • 6
view article Article Universal Assisted Generation: Faster Decoding with Any Assistant Model Oct 29, 2024 • 59