AutoRound v0.7 is out! 🚀 This release includes enhanced algorithms for W2A16, NVFP4, and MXFP4, along with support for FP8 models as input. 👉 Check out the full details here: https://github.com/intel/auto-round/releases/tag/v0.7.0
Following integrations with TorchAO, Transformers, and vLLM, AutoRound-quantized models are now officially compatible with SGLang, bringing faster and more flexible deployment to your LLM workflows.
💡 We’ve also enhanced the RTN mode (--iters 0), significantly cutting quantization cost for low-resource users.
⭐ Star our repo and stay tuned for more exciting updates!