Shira Guskin

sguskin · shira-g

AI & ML interests

None yet

Recent Activity

upvoted an article 28 days ago

Accelerating Qwen3-8B Agent on Intel® Core™ Ultra with Depth-Pruned Draft Models

published an article 30 days ago

Accelerating Qwen3-8B Agent on Intel® Core™ Ultra with Depth-Pruned Draft Models

updated a model 11 months ago

OpenVINO/Llama-3.1-8B-Instruct-FastDraft-150M-int8-ov

Organizations

upvoted an article 28 days ago

Accelerating Qwen3-8B Agent on Intel® Core™ Ultra with Depth-Pruned Draft Models

Article • 30 days ago • 19

upvoted a collection 11 months ago

Speculative Decoding Draft Models

Collection of OpenVINO optimized efficient draft models for speculative decoding • 4 items • Updated Sep 16 • 9

upvoted a paper about 1 year ago

RAG Foundry: A Framework for Enhancing LLMs for Retrieval Augmented Generation

Paper • 2408.02545 • Published Aug 5, 2024 • 39