Speculative Decoding Draft Models Collection Collection of OpenVINO optimized efficient draft models for speculative decoding • 4 items • Updated Sep 16 • 9
ModernBERT Collection Bringing BERT into modernity via both architecture changes and scaling • 3 items • Updated Dec 19, 2024 • 151
view article Article Optimize and deploy models with Optimum-Intel and OpenVINO GenAI Sep 20, 2024 • 24