MLX Community

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

kernelpool new activity about 16 hours ago

mlx-community/MiniMax-M2-4bit:MiniMax-M2-4bit not supported

kernelpool updated a model about 19 hours ago

mlx-community/Kimi-Linear-48B-A3B-Instruct-4bit

saviochow new activity about 19 hours ago

mlx-community/mlx-my-repo:ERROR:root:Model type minimax not supported.

View all activity

mlx-community 's collections 98

Granite 4.0 Nano Language Models

mlx-community/granite-4.0-h-1b-3bit

Text Generation • 0.2B • Updated 3 days ago • 30
mlx-community/granite-4.0-h-1b-4bit

Text Generation • 0.2B • Updated 3 days ago • 45
mlx-community/granite-4.0-h-1b-5bit

Text Generation • 0.3B • Updated 3 days ago • 10
mlx-community/granite-4.0-h-1b-6bit

Text Generation • 0.3B • Updated 3 days ago • 10

MiniMax-M2

mlx-community/MiniMax-M2-3bit

Text Generation • 229B • Updated 4 days ago • 1.26k • 2
mlx-community/MiniMax-M2-4bit

Text Generation • 229B • Updated 4 days ago • 2.12k • 8
mlx-community/MiniMax-M2-5bit

Text Generation • 229B • Updated 3 days ago • 530 • 2
mlx-community/MiniMax-M2-6bit

Text Generation • 229B • Updated 3 days ago • 738 • 1

Qwen3-VL

mlx-community/Qwen3-VL-4B-Instruct-3bit

Image-Text-to-Text • 0.9B • Updated 17 days ago • 271
mlx-community/Qwen3-VL-4B-Instruct-5bit

Image-Text-to-Text • 1B • Updated 17 days ago • 124
mlx-community/Qwen3-VL-4B-Instruct-6bit

Image-Text-to-Text • 1B • Updated 17 days ago • 101
mlx-community/Qwen3-VL-4B-Instruct-8bit

Image-Text-to-Text • 2B • Updated 17 days ago • 567 • 3

💧LFM2-8B-A1B-MoE

Best in Class MoE, better than Qwen3. Optimised for Smaller devices sub 16 GB (M1/2/3/4) Apple Silicon.

mlx-community/LFM2-8B-A1B-3bit-MLX

Text Generation • 1B • Updated 24 days ago • 780 • 1
mlx-community/LFM2-8B-A1B-8bit-MLX

Text Generation • 8B • Updated 24 days ago • 380 • 2
mlx-community/LFM2-8B-A1B-6bit-MLX

Text Generation • 8B • Updated 24 days ago • 123
mlx-community/LFM2-8B-A1B-4bit

Text Generation • 1B • Updated 24 days ago • 1.77k • 3

Qwen3-Coder-MoE

💻 Significant Performance: among open models on Agentic Coding, Agentic Browser-Use, and other foundational coding tasks, achieving ~Claude Sonnet.

mlx-community/Qwen3-Coder-30B-A3B-Instruct-4bit

Text Generation • 31B • Updated Aug 6 • 630 • 10
mlx-community/Qwen3-Coder-480B-A35B-Instruct-4bit

Text Generation • 480B • Updated Jul 22 • 444 • 18
mlx-community/Qwen3-Coder-30B-A3B-Instruct-8bit

Text Generation • 31B • Updated Jul 31 • 427 • 2
mlx-community/Qwen3-Coder-30B-A3B-Instruct-8bit-DWQ-lr9e8

Text Generation • 31B • Updated Aug 1 • 131 • 1

GLM-4.6

mlx-community/GLM-4.6-4bit

Text Generation • 353B • Updated Sep 30 • 3.63k • 11
mlx-community/GLM-4.6-5bit

Text Generation • 353B • Updated Sep 30 • 1.24k • 3

Qwen3 Next

Alibaba's first hybrid model, designed to cut resources and speed things up.

mlx-community/Qwen3-Next-80B-A3B-Thinking-8bit

Text Generation • 80B • Updated Sep 13 • 164 • 2
mlx-community/Qwen3-Next-80B-A3B-Thinking-6bit

Text Generation • 80B • Updated Sep 13 • 45
mlx-community/Qwen3-Next-80B-A3B-Thinking-5bit

Text Generation • 80B • Updated Sep 13 • 38
mlx-community/Qwen3-Next-80B-A3B-Thinking-4bit

Text Generation • 80B • Updated Sep 13 • 210 • 2

Lille 130M

Very Small smart model created for the mobile

mlx-community/lille-130m-instruct-bf16

Text Generation • 0.1B • Updated Sep 5 • 45
mlx-community/lille-130m-instruct-fp16

Text Generation • 0.1B • Updated Sep 5 • 43 • 1
mlx-community/lille-130m-instruct-8bit

Text Generation • 35.8M • Updated Sep 5 • 7
mlx-community/lille-130m-instruct-6bit

Text Generation • 27.8M • Updated Sep 5 • 5

Apertus

SwissAI's Apertus models that support 1k languages

mlx-community/Apertus-8B-Instruct-2509-bf16

Text Generation • 8B • Updated Sep 3 • 345 • 4
mlx-community/Apertus-8B-Instruct-2509-8bit

Text Generation • 8B • Updated Sep 3 • 117
mlx-community/Apertus-8B-Instruct-2509-6bit

Text Generation • 8B • Updated Sep 3 • 52
mlx-community/Apertus-8B-Instruct-2509-4bit

Text Generation • 1B • Updated Sep 3 • 203 • 1

LFM2-VL

mlx-community/LFM2-VL-450M-4bit

Image-Text-to-Text • 0.1B • Updated Aug 16 • 43 • 1
mlx-community/LFM2-VL-450M-5bit

Image-Text-to-Text • 0.2B • Updated Aug 16 • 5 • 1
mlx-community/LFM2-VL-450M-6bit

Updated Aug 16 • 1
mlx-community/LFM2-VL-450M-8bit

Image-Text-to-Text • 0.2B • Updated Aug 16 • 101 • 10

VisualQuality-R1

Image Quality Assessment

mlx-community/VisualQuality-R1-7B-bf16

Reinforcement Learning • 8B • Updated Aug 6 • 19
mlx-community/VisualQuality-R1-7B-8bit

Reinforcement Learning • Updated Aug 6 • 9
mlx-community/VisualQuality-R1-7B-6bit

Reinforcement Learning • Updated Aug 6 • 8
mlx-community/VisualQuality-R1-7B-4bit

Reinforcement Learning • Updated Aug 6 • 10

olmOCR-0725

mlx-community/olmOCR-7B-0725-bf16

Image-to-Text • 8B • Updated Jul 24 • 22
mlx-community/olmOCR-7B-0725-8bit

Image-to-Text • Updated Jul 25 • 75 • 4
mlx-community/olmOCR-7B-0725-6bit

Image-to-Text • Updated Jul 25 • 21

SmolLM3

mlx-community/SmolLM3-3B-4bit

Text Generation • 0.5B • Updated Jul 8 • 2.7k • 4
mlx-community/SmolLM3-3B-3bit

Text Generation • 0.4B • Updated Jul 8 • 20 • 2
mlx-community/SmolLM3-3B-5bit

Text Generation • 0.6B • Updated Jul 8 • 34
mlx-community/SmolLM3-3B-8bit

Text Generation • 0.9B • Updated Jul 8 • 53 • 6

DiffuCoder-7B

Apple's text based diffusion model

mlx-community/DiffuCoder-7B-cpGRPO-8bit

Text Generation • 8B • Updated Jul 4 • 64 • 6
mlx-community/DiffuCoder-7B-cpGRPO-6bit

Text Generation • 8B • Updated Jul 4 • 13 • 1
mlx-community/DiffuCoder-7B-cpGRPO-4bit

Text Generation • 1B • Updated Jul 4 • 26 • 3

Gemma 3n - Text Only (LM)

Google's Gemma 3n converted to MLX using mlx-lm

mlx-community/gemma-3n-E4B-it-lm-bf16

Text Generation • 7B • Updated Jun 29 • 93 • 4
mlx-community/gemma-3n-E2B-it-lm-bf16

Text Generation • 4B • Updated Jun 29 • 100
mlx-community/gemma-3n-E4B-it-lm-4bit

Text Generation • 1B • Updated Jun 29 • 5.42k • 4
mlx-community/gemma-3n-E2B-it-lm-4bit

Text Generation • 0.7B • Updated Jun 29 • 5.3k • 1

Nanonets OCR

This collection houses Nanonets-OCR-s

mlx-community/Nanonets-OCR-s-bf16

Image-Text-to-Text • 4B • Updated Jun 18 • 228 • 2
mlx-community/Nanonets-OCR-s-8bit

Image-Text-to-Text • 2B • Updated Jul 25 • 41
mlx-community/Nanonets-OCR-s-6bit

Image-Text-to-Text • 1B • Updated Jul 25 • 31
mlx-community/Nanonets-OCR-s-4bit

Image-Text-to-Text • 1B • Updated Jul 25 • 61

Holo1

mlx-community/Holo1-3B-3bit

Image-Text-to-Text • 1B • Updated Jun 3 • 1 • 1
mlx-community/Holo1-3B-4bit

Image-Text-to-Text • 1B • Updated Jun 3 • 1 • 1
mlx-community/Holo1-3B-6bit

Image-Text-to-Text • 1B • Updated Jun 3 • 1
mlx-community/Holo1-3B-8bit

Image-Text-to-Text • 2B • Updated Jun 3

DeepSeek R1 0528

mlx-community/DeepSeek-R1-0528-4bit

Text Generation • 105B • Updated May 29 • 223 • 17
mlx-community/DeepSeek-R1-0528-Qwen3-8B-4bit

Text Generation • 1B • Updated May 30 • 688 • 4
mlx-community/DeepSeek-R1-0528-Qwen3-8B-4bit-DWQ

Text Generation • 1B • Updated May 29 • 151 • 8
mlx-community/DeepSeek-R1-0528-Qwen3-8B-8bit

Text Generation • 2B • Updated May 29 • 44 • 1

Devstral Small 2505

mlx-community/Devstral-Small-2505-3bit

Text Generation • 3B • Updated May 21 • 21 • 1
mlx-community/Devstral-Small-2505-4bit

Text Generation • 4B • Updated May 21 • 23 • 2
mlx-community/Devstral-Small-2505-6bit

Text Generation • Updated May 21 • 12 • 1
mlx-community/Devstral-Small-2505-8bit

Text Generation • Updated May 21 • 22 • 1

OuteTTS-1.0

mlx-community/Llama-OuteTTS-1.0-1B-fp16

Text-to-Speech • 1B • Updated May 19 • 28 • 3
mlx-community/Llama-OuteTTS-1.0-1B-4bit

Text-to-Speech • 0.2B • Updated May 19 • 116 • 1
mlx-community/Llama-OuteTTS-1.0-1B-8bit

Text-to-Speech • 0.4B • Updated May 19 • 15 • 1
mlx-community/Llama-OuteTTS-1.0-1B-6bit

Text-to-Speech • 0.3B • Updated May 19 • 6

Gemma 3 DWQ

Gemma 3 distilled weight quantized (DWQ) models

mlx-community/gemma-3-4b-it-4bit-DWQ

Text Generation • 0.7B • Updated May 14 • 119 • 1
mlx-community/gemma-3-12b-it-4bit-DWQ

Text Generation • 2B • Updated May 18 • 110 • 2
mlx-community/gemma-3-1b-it-4bit-DWQ

Text Generation • 0.2B • Updated May 14 • 65
mlx-community/gemma-3-27b-it-4bit-DWQ

Text Generation • 4B • Updated May 14 • 116 • 3

Josiefied and Abliterated Qwen3

Abliterated, and further fine-tuned to be the most uncensored models available. Now in MLX

mlx-community/Josiefied-Qwen3-30B-A3B-abliterated-v2-bf16

Text Generation • 31B • Updated Jun 19 • 28
mlx-community/Josiefied-Qwen3-30B-A3B-abliterated-v2-8bit

Text Generation • 31B • Updated Jun 19 • 115
mlx-community/Josiefied-Qwen3-30B-A3B-abliterated-v2-6bit

Text Generation • 31B • Updated Jun 19 • 34
mlx-community/Josiefied-Qwen3-30B-A3B-abliterated-v2-4bit

Text Generation • 31B • Updated Jun 19 • 160 • 2

NariLabs Dia-1.5B

mlx-community/Dia-1.6B

Text-to-Speech • 2B • Updated Apr 23 • 90 • 23
mlx-community/Dia-1.6B-6bit

Text-to-Speech • 2B • Updated Apr 24 • 19 • 10
mlx-community/Dia-1.6B-4bit

Text-to-Speech • 2B • Updated Apr 24 • 61 • 12
mlx-community/Dia-1.6B-3bit

Text-to-Speech • 2B • Updated Apr 24 • 5 • 3

PLaMo

mlx-community/plamo-2-8b

Text Generation • 9B • Updated Mar 16 • 2
mlx-community/plamo-2-1b

Text Generation • 1B • Updated Mar 15 • 202 • 4
mlx-community/plamo-2-1b-bf16

Text Generation • 1B • Updated Mar 15 • 44 • 2
mlx-community/plamo-2-8b-4bit

Text Generation • 1B • Updated Mar 15 • 3

Gemma 3 QAT

Quantization Aware Trained (QAT) Gemma 3 checkpoints. The model preserves similar quality as half precision while using 3x less memory.

mlx-community/gemma-3-27b-it-qat-bf16

Image-Text-to-Text • Updated Apr 18 • 123 • 5
mlx-community/gemma-3-27b-it-qat-8bit

Image-Text-to-Text • Updated Apr 19 • 171 • 9
mlx-community/gemma-3-27b-it-qat-6bit

Image-Text-to-Text • Updated Apr 19 • 25
mlx-community/gemma-3-27b-it-qat-4bit

Image-Text-to-Text • Updated Apr 19 • 83.6k • 20

ModernBert

mlx-community/answerdotai-ModernBERT-base-8bit

Fill-Mask • 53M • Updated Apr 2 • 3
mlx-community/answerdotai-ModernBERT-base-4bit

Fill-Mask • 29.5M • Updated Apr 2 • 10
mlx-community/answerdotai-ModernBERT-base-bf16

Fill-Mask • 0.2B • Updated Apr 2 • 18 • 1
mlx-community/answerdotai-ModernBERT-Large-Instruct-4bit

Fill-Mask • 70M • Updated Apr 2 • 4

Qwen QwQ

mlx-community/QwQ-32B-4bit

Text Generation • 5B • Updated Mar 5 • 79 • 37
mlx-community/QwQ-32B-6bit

Text Generation • 7B • Updated Mar 5 • 9 • 3
mlx-community/QwQ-32B-8bit

Text Generation • 9B • Updated Mar 5 • 35 • 14
mlx-community/QwQ-32B-3bit

Text Generation • 4B • Updated Mar 5 • 4

UI-TARS

mlx-community/UI-TARS-7B-SFT-6bit

Image-Text-to-Text • 2B • Updated Mar 3 • 6
mlx-community/UI-TARS-7B-SFT-4bit

Image-Text-to-Text • 2B • Updated Mar 3 • 4
mlx-community/UI-TARS-7B-SFT-8bit

Image-Text-to-Text • 3B • Updated Mar 3 • 5
mlx-community/UI-TARS-7B-SFT-bf16

Image-Text-to-Text • 8B • Updated Mar 3 • 5

Kokoro TTS

Kokoro is an open-weight TTS model with 82 million parameters. Despite its lightweight architecture, it delivers amazing quality.

mlx-community/Kokoro-82M-bf16

Text-to-Speech • Updated Mar 8 • 5.1k • 25
mlx-community/Kokoro-82M-8bit

Text-to-Speech • Updated Mar 8 • 84 • 6
mlx-community/Kokoro-82M-6bit

Text-to-Speech • Updated Mar 8 • 28 • 2
mlx-community/Kokoro-82M-4bit

Text-to-Speech • Updated Mar 8 • 211 • 3

Arcee Virtuoso

mlx-community/Virtuoso-Medium-v2-8bit

Text Generation • 9B • Updated Jan 31 • 2 • 1
mlx-community/Virtuoso-Medium-v2-3bit

Text Generation • 4B • Updated Jan 30
mlx-community/Virtuoso-Medium-v2-4bit

Text Generation • 5B • Updated Jan 30 • 1
mlx-community/Virtuoso-Medium-v2-6bit

Text Generation • 7B • Updated Jan 30

DeepSeek-R1-Distill

mlx-community/deepseek-r1-distill-qwen-1.5b

2B • Updated Feb 26 • 644 • 23
mlx-community/DeepSeek-R1-Distill-Qwen-7B-4bit

1B • Updated Feb 26 • 576 • 17
mlx-community/DeepSeek-R1-Distill-Qwen-7B-8bit

2B • Updated Feb 26 • 122 • 8
mlx-community/DeepSeek-R1-Distill-Llama-8B-4bit

1B • Updated Feb 26 • 287 • 10

Qwen2.5-1M

mlx-community/Qwen2.5-7B-Instruct-1M-4bit

Text Generation • 1B • Updated Jan 26 • 131 • 10
mlx-community/Qwen2.5-7B-Instruct-1M-6bit

Text Generation • 2B • Updated Jan 26 • 1 • 2
mlx-community/Qwen2.5-7B-Instruct-1M-3bit

Text Generation • 1.0B • Updated Jan 26 • 2
mlx-community/Qwen2.5-7B-Instruct-1M-8bit

Text Generation • 2B • Updated Jan 26 • 13 • 3

Jina Reader-LM

Convert HTML content to LLM-friendly Markdown/JSON content

mlx-community/jinaai-ReaderLM-v2

0.2B • Updated Jan 17 • 194 • 22
mlx-community/reader-lm-1.5b

Text Generation • 2B • Updated Jan 18 • 1
mlx-community/reader-lm-0.5b

Text Generation • 0.5B • Updated Jan 18

QVQ-72B-Preview

mlx-community/QVQ-72B-Preview-4bit

Image-Text-to-Text • 11B • Updated Apr 12 • 8 • 7
mlx-community/QVQ-72B-Preview-6bit

Image-Text-to-Text • 16B • Updated Dec 24, 2024 • 2 • 2
mlx-community/QVQ-72B-Preview-3bit

Image-Text-to-Text • 9B • Updated Dec 24, 2024 • 2 • 5
mlx-community/QVQ-72B-Preview-8bit

Image-Text-to-Text • 21B • Updated Dec 24, 2024 • 1 • 3

Josiefied and Abliterated Qwen2.5

The best uncensored models

mlx-community/Josiefied-Qwen2.5-Coder-7B-Instruct-abliterated-v1

Text Generation • 8B • Updated Feb 16 • 12
mlx-community/Josiefied-Qwen2.5-Coder-7B-Instruct-abliterated-v1-8bit

Text Generation • 2B • Updated Feb 16 • 11
mlx-community/Josiefied-Qwen2.5-Coder-7B-Instruct-abliterated-v1-6bit

Text Generation • 2B • Updated Feb 16 • 8
mlx-community/Josiefied-Qwen2.5-Coder-7B-Instruct-abliterated-v1-4bit

Text Generation • 1B • Updated Feb 16 • 31 • 1

EXAONE-3.5

EXAONE 3.5, a collection of instruction-tuned bilingual generative models ranging from 2.4B to 32B parameters, developed by LG AI.

mlx-community/EXAONE-3.5-2.4B-Instruct-4bit

0.4B • Updated Dec 9, 2024 • 2 • 3
mlx-community/EXAONE-3.5-2.4B-Instruct-6bit

0.5B • Updated Dec 9, 2024
mlx-community/EXAONE-3.5-2.4B-Instruct-8bit

0.7B • Updated Dec 9, 2024
mlx-community/EXAONE-3.5-2.4B-Instruct-bf16

2B • Updated Dec 9, 2024

Paligemma 2

mlx-community/paligemma2-3b-ft-docci-448-8bit

Image-Text-to-Text • 0.9B • Updated Dec 5, 2024
mlx-community/paligemma2-3b-ft-docci-448-6bit

Image-Text-to-Text • 0.7B • Updated Dec 5, 2024
mlx-community/paligemma2-3b-ft-docci-448-bf16

Image-Text-to-Text • 3B • Updated Dec 5, 2024 • 8 • 1
mlx-community/paligemma2-10b-ft-docci-448-bf16

Image-Text-to-Text • 10B • Updated Dec 16, 2024 • 25 • 3

Idefics 3 + SmolVLM

mlx-community/SmolVLM-Instruct-4bit

Image-Text-to-Text • 0.5B • Updated Nov 29, 2024 • 228 • 5
mlx-community/SmolVLM-Instruct-6bit

Image-Text-to-Text • 0.6B • Updated Nov 29, 2024 • 8
mlx-community/SmolVLM-Instruct-8bit

Image-Text-to-Text • 0.7B • Updated Nov 29, 2024 • 23 • 9
mlx-community/SmolVLM-Instruct-bf16

Image-Text-to-Text • 2B • Updated Nov 29, 2024 • 13 • 5

Florence-2

mlx-community/Florence-2-base-ft-4bit

Image-Text-to-Text • 48.8M • Updated Nov 21, 2024 • 64 • 1
mlx-community/Florence-2-large-ft-bf16

Image-Text-to-Text • 0.8B • Updated Nov 21, 2024 • 52 • 1
mlx-community/Florence-2-base-ft-bf16

Image-Text-to-Text • 0.3B • Updated Nov 21, 2024 • 10 • 1
mlx-community/Florence-2-base-ft-8bit

Image-Text-to-Text • 81.7M • Updated Nov 21, 2024 • 29 • 1

Qwen2.5-Coder

Code-specific model series based on Qwen2.5

mlx-community/Qwen2.5-Coder-32B-Instruct-8bit

Text Generation • 9B • Updated Nov 11, 2024 • 90 • 11
mlx-community/Qwen2.5-Coder-14B-Instruct-4bit

Text Generation • 2B • Updated Nov 11, 2024 • 165 • 4
mlx-community/Qwen2.5-Coder-14B-Instruct-bf16

Text Generation • 15B • Updated Nov 11, 2024 • 15 • 2
mlx-community/Qwen2.5-Coder-3B-Instruct-8bit

Text Generation • 0.9B • Updated Nov 11, 2024 • 8

Lumimaid

A collection of Neversleep's RP focused Lumimaid LLMs.

mlx-community/Lumimaid-8B-v0.1

2B • Updated Oct 13, 2024 • 1
mlx-community/Lumimaid-70B-v0.1

20B • Updated Oct 13, 2024 • 1
mlx-community/Lumimaid-70B-v0.1-alt

20B • Updated Oct 13, 2024 • 1
mlx-community/Lumimaid-70B-v0.1-OAS

20B • Updated Oct 13, 2024

Qwen1.5

Qwen1.5 is the improved version of Qwen, the large language model series developed by Alibaba Cloud.

mlx-community/Qwen1.5-1.8B-Chat-4bit

Text Generation • 0.5B • Updated Feb 18, 2024 • 9 • 2
mlx-community/Qwen1.5-0.5B-Chat-4bit

Text Generation • 72.6M • Updated Apr 18, 2024 • 3.47k • 4
mlx-community/Qwen1.5-14B-Chat-4bit

Text Generation • 3B • Updated Mar 8, 2024 • 7 • 1
mlx-community/Qwen1.5-7B-Chat-4bit

Text Generation • 2B • Updated Mar 7, 2024 • 7 • 2

Llama 3

mlx-community/Meta-Llama-3-8B-Instruct-4bit

Text Generation • 2B • Updated Apr 19, 2024 • 1.68k • 79
mlx-community/Meta-Llama-3-8B-4bit

Text Generation • 2B • Updated Apr 20, 2024 • 42 • 8
mlx-community/Meta-Llama-Guard-2-8B-4bit

Text Generation • 2B • Updated Apr 19, 2024 • 5
mlx-community/Meta-Llama-3-70B-4bit

Text Generation • 11B • Updated Apr 20, 2024 • 58 • 9

Phi-3

mlx-community/Phi-3-mini-4k-instruct-4bit

Text Generation • 0.6B • Updated Jul 11, 2024 • 669 • 12
mlx-community/Phi-3-mini-128k-instruct-4bit

Text Generation • 0.6B • Updated Jul 11, 2024 • 179 • 12
mlx-community/Phi-3-mini-128k-instruct-8bit

Text Generation • 1B • Updated Jul 11, 2024 • 38 • 10
mlx-community/Phi-3-mini-4k-instruct-8bit

Text Generation • 1B • Updated Jul 4, 2024 • 19 • 2

OpenELM

A family of Open-source Efficient Language Models from Apple.

mlx-community/OpenELM-3B

Text Generation • 3B • Updated Apr 26, 2024 • 7 • 8
mlx-community/OpenELM-1_1B-Instruct-8bit

0.3B • Updated Apr 24, 2024 • 40 • 1
mlx-community/OpenELM-1_1B-Instruct-4bit

0.2B • Updated Apr 24, 2024 • 61 • 2
mlx-community/OpenELM-1_1B-8bit

0.3B • Updated Apr 24, 2024 • 7 • 1

Mistral (Mamba) Codestral

mlx-community/Codestral-22B-v0.1-8bit

6B • Updated May 29, 2024 • 138 • 8
mlx-community/Codestral-22B-v0.1-4bit

3B • Updated May 29, 2024 • 348 • 13
mlx-community/Mamba-Codestral-7B-v0.1

7B • Updated Jan 21 • 60 • 2
mlx-community/Mamba-Codestral-7B-v0.1-8bit

2B • Updated Jan 21 • 62 • 2

Mamba

Mamba is a new LLM architecture that integrates the Structured State Space sequence model to manage lengthy data sequences.

mlx-community/mamba-1.4b-hf-f32

Text Generation • 1B • Updated Sep 21, 2024 • 12
mlx-community/mamba-1.4b-hf-f16

Text Generation • 1B • Updated Sep 21, 2024 • 5 • 1
mlx-community/mamba-790m-hf-f32

Text Generation • 0.8B • Updated Sep 21, 2024 • 3
mlx-community/mamba-790m-hf-f16

Text Generation • 0.8B • Updated Sep 21, 2024

Mistral NeMo

mlx-community/Mistral-Nemo-Base-2407-8bit

3B • Updated Jul 18, 2024 • 26 • 3
mlx-community/Mistral-Nemo-Base-2407-bf16

12B • Updated Jul 18, 2024 • 6 • 1
mlx-community/Mistral-Nemo-Instruct-2407-4bit

2B • Updated Nov 6, 2024 • 687 • 14
mlx-community/Mistral-Nemo-Base-2407-4bit

2B • Updated Jul 18, 2024 • 8 • 1

EnCodec

EnCodec models in MLX

mlx-community/encodec-48khz-float32

16.9M • Updated Sep 16, 2024 • 1 • 2
mlx-community/encodec-24khz-bfloat16

19M • Updated Sep 18, 2024
mlx-community/encodec-32khz-bfloat16

57.9M • Updated Sep 18, 2024 • 1
mlx-community/encodec-32khz-float32

57.9M • Updated Sep 18, 2024 • 19

LLaDA 2.0

mlx-community/LLaDA2.0-mini-preview-4bit

Text Generation • 16B • Updated 13 days ago • 87 • 1
mlx-community/LLaDA2.0-flash-preview-4bit

Text Generation • 103B • Updated 6 days ago • 42 • 2

olmOCR 2

mlx-community/olmOCR-2-7B-1025-bf16

Image-to-Text • 8B • Updated 9 days ago • 304 • 2
mlx-community/olmOCR-2-7B-1025-8bit

Image-to-Text • Updated 7 days ago • 157
mlx-community/olmOCR-2-7B-1025-6bit

Image-to-Text • Updated 7 days ago • 72
mlx-community/olmOCR-2-7B-1025-5bit

Image-to-Text • Updated 7 days ago • 110

Nanonets OCR2

This collection houses Nanonets-OCR2 models

mlx-community/Nanonets-OCR2-3B-bf16

Image-Text-to-Text • 4B • Updated 17 days ago • 302
mlx-community/Nanonets-OCR2-3B-8bit

Image-Text-to-Text • 2B • Updated 17 days ago • 271
mlx-community/Nanonets-OCR2-3B-6bit

Image-Text-to-Text • 1B • Updated 17 days ago • 115
mlx-community/Nanonets-OCR2-3B-4bit

Image-Text-to-Text • 1B • Updated 17 days ago • 328

ServiceNow-Apriel

Apriel-1.5-15b-Thinker is a multimodal reasoning model in ServiceNow’s Apriel SLM series which achieves competitive performance against models 10 time

mlx-community/Apriel-1.5-15b-Thinker-4bit

Text Generation • Updated 28 days ago • 687 • 2
mlx-community/Apriel-1.5-15b-Thinker-5bit

Text Generation • Updated 28 days ago • 119
mlx-community/Apriel-1.5-15b-Thinker-6bit-MLX

Image-Text-to-Text • Updated 28 days ago • 194 • 1
mlx-community/Apriel-1.5-15b-Thinker-3bit-MLX

Image-Text-to-Text • Updated 28 days ago • 80

Granite-4.0 Family

mlx-community/Granite-4.0-H-Tiny-4bit-DWQ

Text Generation • 1B • Updated 28 days ago • 902 • 2
mlx-community/granite-4.0-h-micro-8bit

Text Generation • 0.9B • Updated 29 days ago • 498 • 1
mlx-community/granite-4.0-h-small-4bit

Text Generation • 32B • Updated 29 days ago • 325
mlx-community/granite-4.0-tiny-preview-4bit

Text Generation • 1B • Updated Sep 10 • 61

GLM-4.5-Air

mlx-community/GLM-4.5-Air-mxfp4

Text Generation • Updated Sep 26 • 471 • 2
mlx-community/GLM-4.5-Air-3bit-DWQ-v2

Text Generation • 107B • Updated Aug 13 • 217 • 3
mlx-community/GLM-4.5-Air-8bit

Text Generation • 107B • Updated Jul 29 • 220 • 6
mlx-community/GLM-4.5-Air-4bit

Text Generation • 107B • Updated Jul 28 • 623 • 25

SEA-LION

SEA-LION mlx models by AI Singapore.

mlx-community/Gemma-SEA-LION-v4-27B-IT-mlx-4bit

Text Generation • 27B • Updated Sep 10 • 5 • 1
mlx-community/Llama-SEA-LION-v3.5-8B-R-mlx-4bit

Text Generation • 2B • Updated Sep 10 • 2
mlx-community/Gemma-SEA-LION-v3-9B-IT-mlx-4bit

Text Generation • 9B • Updated Sep 10 • 4
mlx-community/Llama-SEA-LION-v3-8B-IT-mlx-4bit

Text Generation • 2B • Updated Sep 10 • 1

EmbeddingGemma

mlx-community/embeddinggemma-300m-4bit

Sentence Similarity • 48.1M • Updated Sep 4 • 166 • 2
mlx-community/embeddinggemma-300m-5bit

Sentence Similarity • 57.7M • Updated Sep 4 • 43
mlx-community/embeddinggemma-300m-6bit

Sentence Similarity • 67.4M • Updated Sep 4 • 56
mlx-community/embeddinggemma-300m-8bit

Sentence Similarity • 86.6M • Updated Sep 4 • 509 • 2

Swahili Gemma 1B

A fine-tuned Gemma 3 1B instruction model specialized for English-to-Swahili translation and Swahili conversational AI. The model accepts input in bot

mlx-community/swahili-gemma-1b-mlx-fp16

Text Generation • Updated Aug 26 • 21 • 1

Gemma 3-270m

mlx-community/gemma-3-270m-it-4bit

Text Generation • 41.9M • Updated Aug 14 • 268 • 8
mlx-community/gemma-3-270m-it-5bit

Text Generation • 81.8M • Updated Aug 9 • 15
mlx-community/gemma-3-270m-it-6bit

Text Generation • 95.4M • Updated Aug 9 • 13
mlx-community/gemma-3-270m-it-8bit

Text Generation • 0.1B • Updated Aug 9 • 1.07k • 2

GLM 4.5

mlx-community/GLM-4.5-Air-4bit

Text Generation • 107B • Updated Jul 28 • 623 • 25
mlx-community/GLM-4.5-4bit

Text Generation • 353B • Updated Jul 28 • 306 • 16

LFM2

mlx-community/LFM2-350M-4bit

Text Generation • 55.4M • Updated Jul 11 • 72 • 1
mlx-community/LFM2-1.2B-5bit

Text Generation • 0.2B • Updated Jul 12 • 5 • 1
mlx-community/LFM2-1.2B-6bit

Text Generation • 0.3B • Updated Jul 12 • 41 • 3
mlx-community/LFM2-1.2B-8bit

Text Generation • 0.3B • Updated Jul 12 • 60 • 1

ERNIE-4.5

mlx-community/ERNIE-4.5-300B-A47B-PT-4bit

Text Generation • 299B • Updated Jul 4 • 20 • 2
mlx-community/ERNIE-4.5-21B-A3B-PT-bf16

Text Generation • 22B • Updated Jul 4 • 20 • 1
mlx-community/ERNIE-4.5-21B-A3B-PT-8bit

Text Generation • 22B • Updated Jul 4 • 20 • 2
mlx-community/ERNIE-4.5-21B-A3B-PT-6bit

Text Generation • 22B • Updated Jul 4 • 5

JetBrains Mellum

Series of code models by JetBrains

mlx-community/Mellum-4b-base

Text Generation • 4B • Updated Jun 28 • 15 • 1
mlx-community/Mellum-4b-base-4bit

Text Generation • 0.6B • Updated Jun 28 • 32 • 2
mlx-community/Mellum-4b-base-8bit

Text Generation • 1B • Updated Jun 28 • 9
mlx-community/Mellum-4b-sft-python

Text Generation • 4B • Updated Jun 28 • 28 • 1

Gemma 3n

mlx-community/gemma-3n-E2B-it-4bit

Image-Text-to-Text • 2B • Updated Jul 12 • 621 • 8
mlx-community/gemma-3n-E4B-it-4bit

Image-Text-to-Text • Updated Jul 13 • 616 • 5
mlx-community/gemma-3n-E4B-it-5bit

Image-Text-to-Text • Updated Jul 12 • 24
mlx-community/gemma-3n-E2B-it-5bit

Image-Text-to-Text • 2B • Updated Jul 12 • 16

BitNet 1.58

This collection houses BitNet-1.58, Falcon3-1.58 and Falcon-E quants.

mlx-community/bitnet-b1.58-2B-4T

Text Generation • 0.8B • Updated Jun 10 • 51 • 1
mlx-community/bitnet-b1.58-2B-4T-4bit

Text Generation • 0.6B • Updated Jun 10 • 119
mlx-community/bitnet-b1.58-2B-4T-8bit

Text Generation • 0.6B • Updated Jun 10 • 115
mlx-community/bitnet-b1.58-2B-4T-6bit

Text Generation • 0.6B • Updated Jun 10 • 10

Qwen3 DWQ Quants

High-quality 4-bit quants of the Qwen3 model family.

mlx-community/Qwen3-14B-4bit-DWQ-053125

Text Generation • 2B • Updated Jun 2 • 142 • 4
mlx-community/Qwen3-8B-4bit-DWQ-053125

Text Generation • 1B • Updated Jun 1 • 156 • 1
mlx-community/Qwen3-4B-4bit-DWQ-053125

Text Generation • 0.6B • Updated Jun 1 • 112 • 2
mlx-community/Qwen3-1.7B-4bit-DWQ-053125

Text Generation • 0.3B • Updated Jun 1 • 126 • 2

AceReason Nemotron

mlx-community/AceReason-Nemotron-7B-4bit

Text Generation • 1B • Updated May 26 • 61
mlx-community/AceReason-Nemotron-7B-8bit

Text Generation • 2B • Updated May 26
mlx-community/AceReason-Nemotron-7B-bf16

Text Generation • 8B • Updated May 26 • 3
mlx-community/AceReason-Nemotron-14B-4bit

Text Generation • 2B • Updated May 24 • 6

MedGemma

Collection of Gemma 3 variants for performance on medical text and image comprehension to accelerate building healthcare-based AI applications.

mlx-community/medgemma-4b-it-4bit

Image-Text-to-Text • 0.9B • Updated Jun 9 • 55 • 2
mlx-community/medgemma-4b-it-6bit

Image-Text-to-Text • 1B • Updated Jun 9 • 24 • 1
mlx-community/medgemma-4b-it-8bit

Image-Text-to-Text • 1B • Updated Jun 9 • 62 • 1
mlx-community/medgemma-4b-it-bf16

Image-Text-to-Text • 5B • Updated Jun 9 • 47 • 1

SimCLRv1

mlx-community/simclrv1-imagenet1k-resnet50-1x

Image Classification • Updated May 14
mlx-community/simclrv1-imagenet1k-resnet50-2x

Image Classification • Updated May 14
mlx-community/simclrv1-imagenet1k-resnet50-4x

Image Classification • Updated May 14

Parakeet

Nvidia's ASR models, now in MLX!

mlx-community/parakeet-ctc-0.6b

Automatic Speech Recognition • 0.6B • Updated May 10 • 457 • 2
mlx-community/parakeet-rnnt-0.6b

Automatic Speech Recognition • 0.6B • Updated May 10 • 1.09k
mlx-community/parakeet-ctc-1.1b

Automatic Speech Recognition • 1B • Updated May 10 • 15 • 1
mlx-community/parakeet-rnnt-1.1b

Automatic Speech Recognition • 1B • Updated May 10 • 25 • 1

Qwen3

mlx-community/Qwen3-0.6B-4bit

Text Generation • 93.2M • Updated Apr 28 • 5.09k • 5
mlx-community/Qwen3-0.6B-6bit

Text Generation • 0.1B • Updated Apr 28 • 37
mlx-community/Qwen3-0.6B-8bit

Text Generation • 0.2B • Updated May 4 • 967 • 5
mlx-community/Qwen3-0.6B-bf16

Text Generation • 0.6B • Updated Apr 28 • 306 • 4

GLM4

The GLM-4 and Z1 series are powerful open-source language models excelling in reasoning, code, and complex tasks.

mlx-community/GLM-Z1-32B-0414-4bit

Text Generation • 5B • Updated Apr 19 • 142 • 2
mlx-community/GLM-4-32B-0414-4bit

Text Generation • 5B • Updated Apr 21 • 255 • 5
mlx-community/GLM-4-32B-Base-0414-8bit

Text Generation • 9B • Updated Apr 21 • 22
mlx-community/GLM-4-32B-Base-0414-6bit

Text Generation • 7B • Updated Apr 21 • 26

Kimi-VL Thinking

mlx-community/Kimi-VL-A3B-Thinking-4bit

Image-Text-to-Text • Updated Apr 21 • 270 • 7
mlx-community/Kimi-VL-A3B-Thinking-6bit

Image-Text-to-Text • Updated Apr 17 • 20
mlx-community/Kimi-VL-A3B-Thinking-8bit

Image-Text-to-Text • Updated May 26 • 76 • 3

Llama 4

mlx-community/Llama-4-Scout-17B-16E-Instruct-4bit

Image-Text-to-Text • Updated May 3 • 369 • 9
mlx-community/Llama-4-Scout-17B-16E-Instruct-6bit

Image-Text-to-Text • Updated May 3 • 210 • 5
mlx-community/Llama-4-Scout-17B-16E-Instruct-8bit

Image-Text-to-Text • Updated May 3 • 222 • 3
mlx-community/Llama-4-Maverick-17B-16E-Instruct-4bit

Text Generation • 63B • Updated Apr 6 • 283 • 7

Gemma 3

A collection of lightweight, state-of-the-art open models built from the same research and technology that powers the Gemini 2.0 models

mlx-community/gemma-3-4b-it-8bit

Image-Text-to-Text • 2B • Updated Mar 19 • 625 • 5
mlx-community/gemma-3-4b-pt-4bit

Image-Text-to-Text • 1B • Updated Mar 18 • 45 • 3
mlx-community/gemma-3-4b-it-bf16

Image-Text-to-Text • 5B • Updated Mar 20 • 157 • 1
mlx-community/gemma-3-4b-pt-6bit

Image-Text-to-Text • 1B • Updated Mar 18 • 9

OLMoE

mlx-community/OLMoE-1B-7B-0125-Instruct

Text Generation • 7B • Updated Mar 4 • 4
mlx-community/OLMoE-1B-7B-0125-Instruct-8bit

Text Generation • 2B • Updated Mar 4 • 5
mlx-community/OLMoE-1B-7B-0125-Instruct-6bit

Text Generation • 2B • Updated Mar 4
mlx-community/OLMoE-1B-7B-0125-Instruct-4bit

Text Generation • 1B • Updated Mar 4 • 28 • 2

olmOCR

mlx-community/olmOCR-7B-0225-preview-bf16

Image-to-Text • 8B • Updated Mar 3 • 86 • 4
mlx-community/olmOCR-7B-0225-preview-4bit

Image-to-Text • 2B • Updated Mar 3 • 36 • 1
mlx-community/olmOCR-7B-0225-preview-6bit

Image-to-Text • 2B • Updated Mar 3 • 9
mlx-community/olmOCR-7B-0225-preview-8bit

Updated Mar 3

FuseAI

FuseAI is attempting to merge CoT models to achieve newer models that are more than the sum of their parts.

miabchdave/FuseO1-DeepSeekR1-QwQ-SkyT1-32B-Preview-MLX-8Q

Question Answering • 9B • Updated Feb 19 • 6 • 1

Mistral Small

mlx-community/Mistral-Small-24B-Instruct-2501-4bit

4B • Updated Jan 30 • 121 • 14
mlx-community/Mistral-Small-24B-Instruct-2501-3bit

3B • Updated Jan 30 • 22 • 2
mlx-community/Mistral-Small-24B-Instruct-2501-6bit

5B • Updated Jan 30 • 10
mlx-community/Mistral-Small-24B-Instruct-2501-8bit

7B • Updated Jan 30 • 25 • 3

Qwen2.5-VL

mlx-community/Qwen2.5-VL-72B-Instruct-8bit

Image-Text-to-Text • 21B • Updated Feb 25 • 59 • 2
mlx-community/Qwen2.5-VL-72B-Instruct-6bit

Image-Text-to-Text • 16B • Updated Feb 25 • 21 • 1
mlx-community/Qwen2.5-VL-72B-Instruct-4bit

Image-Text-to-Text • 12B • Updated Feb 25 • 228 • 7
mlx-community/Qwen2.5-VL-72B-Instruct-3bit

Image-Text-to-Text • 10B • Updated Feb 25 • 38 • 5

Mamba2

mlx-community/Mamba-Codestral-7B-v0.1

7B • Updated Jan 21 • 60 • 2
mlx-community/Mamba-Codestral-7B-v0.1-8bit

2B • Updated Jan 21 • 62 • 2
mlx-community/Mamba-Codestral-7B-v0.1-4bit

1B • Updated Jan 21 • 60 • 1
mlx-community/mamba2-2.7b-8bit

0.8B • Updated Jan 21 • 1

Helium-1

Kyutai's Helium-1 2B Model, outperforming other state of the art small models.

mlx-community/helium-1-preview-2b-float32

Text Generation • 2B • Updated Jan 18 • 2
mlx-community/helium-1-preview-2b

Text Generation • 2B • Updated Jan 18
mlx-community/helium-1-preview-2b-8bit

Text Generation • 0.6B • Updated Jan 18 • 8 • 1
mlx-community/helium-1-preview-2b-4bit

Text Generation • 0.3B • Updated Jan 18 • 2 • 1

DeepSeek-VL2

mlx-community/deepseek-vl2-6bit

Image-Text-to-Text • 6B • Updated Dec 22, 2024 • 45 • 1
mlx-community/deepseek-vl2-small-4bit

Image-Text-to-Text • 3B • Updated Dec 22, 2024 • 41
mlx-community/deepseek-vl2-4bit

Image-Text-to-Text • 4B • Updated Dec 22, 2024 • 69 • 1
mlx-community/deepseek-vl2-small-6bit

Image-Text-to-Text • 4B • Updated Dec 22, 2024 • 20

Falcon3 Mamba

mlx-community/Falcon3-Mamba-7B-Instruct

Text Generation • 7B • Updated Feb 14 • 3
mlx-community/Falcon3-Mamba-7B-Instruct-8bits

Text Generation • 2B • Updated Feb 14 • 4
mlx-community/Falcon3-Mamba-7B-Instruct-4bits

Text Generation • 1B • Updated Feb 14 • 7

Llama 3.3

mlx-community/Llama-3.3-70B-Instruct-8bit

Text Generation • 20B • Updated Dec 6, 2024 • 373 • 14
mlx-community/Llama-3.3-70B-Instruct-6bit

Text Generation • 15B • Updated Dec 6, 2024 • 69 • 5
mlx-community/Llama-3.3-70B-Instruct-3bit

Text Generation • 9B • Updated Dec 6, 2024 • 106 • 7
mlx-community/Llama-3.3-70B-Instruct-4bit

Text Generation • 11B • Updated Dec 6, 2024 • 1.07k • 30

QwQ-32B-Preview

mlx-community/QwQ-32B-Preview-4bit

Text Generation • 5B • Updated Nov 27, 2024 • 9 • 3
mlx-community/QwQ-32B-Preview-3bit

Text Generation • 4B • Updated Nov 28, 2024 • 10 • 5
mlx-community/QwQ-32B-Preview-6bit

Text Generation • 7B • Updated Nov 27, 2024 • 2 • 4
mlx-community/QwQ-32B-Preview-8bit

Text Generation • 9B • Updated Nov 27, 2024 • 13 • 6

Molmo

mlx-community/Molmo-7B-D-0924-4bit

Image-Text-to-Text • 2B • Updated Dec 27, 2024 • 26 • 2
mlx-community/Molmo-7B-D-0924-8bit

Image-Text-to-Text • 3B • Updated Dec 27, 2024 • 13 • 1
mlx-community/Molmo-7B-D-0924-bf16

Image-Text-to-Text • 8B • Updated Jan 1 • 28 • 1
mlx-community/Molmo-7B-D-0924-8bit-skip-vision

Updated Nov 20, 2024

Falcon-Mamba

Falcon Mamba models compatible with MLX

mlx-community/falcon-mamba-7b-4bit

1B • Updated Nov 15, 2024 • 5
mlx-community/falcon-mamba-7b-4bit-instruct

Text Generation • 1B • Updated Nov 15, 2024 • 6
mlx-community/falcon-mamba-7b-8bit-instruct

Text Generation • 2B • Updated Nov 15, 2024 • 6
mlx-community/falcon-mamba-7b-8bit

2B • Updated Nov 15, 2024 • 3

Ministral

mlx-community/Ministral-8B-Instruct-2410-bf16

8B • Updated Oct 17, 2024 • 29 • 2
mlx-community/Ministral-8B-Instruct-2410-4bit

1B • Updated Oct 17, 2024 • 148 • 9
mlx-community/Ministral-8B-Instruct-2410-8bit

2B • Updated Oct 17, 2024 • 24 • 2

Mistral

mlx-community/Mistral-7B-Instruct-v0.2

Text Generation • Updated Dec 23, 2023 • 209 • 20
mlx-community/Mistral-7B-v0.2-4bit

1B • Updated Mar 25, 2024 • 77 • 8
mlx-community/Hermes-2-Pro-Mistral-7B-4bit

1B • Updated Mar 14, 2024 • 21 • 3
mlx-community/Mixtral-8x7B-Instruct-v0.1

47B • Updated May 7, 2024 • 214 • 23

Code Gemma

Google’s Code-Gemma

mlx-community/codegemma-7b-it-8bit

Text Generation • 3B • Updated Apr 9, 2024 • 117 • 5
mlx-community/codegemma-2b-4bit

Text Generation • 0.8B • Updated Apr 9, 2024 • 4
mlx-community/codegemma-7b-4bit

Text Generation • 2B • Updated Apr 9, 2024 • 9
mlx-community/codegemma-7b-it-4bit

Text Generation • 2B • Updated Apr 9, 2024 • 25 • 1

Qwen2.5

The Qwen 2.5 models are a series of AI models trained on 18 trillion tokens, supporting 29 languages and offering advanced features such as instructio

mlx-community/Qwen2.5-72B-Instruct-bf16

Text Generation • 73B • Updated Sep 19, 2024 • 8
mlx-community/Qwen2.5-72B-Instruct-8bit

Text Generation • 20B • Updated Sep 19, 2024 • 21 • 3
mlx-community/Qwen2.5-72B-Instruct-4bit

Text Generation • 11B • Updated Sep 18, 2024 • 101 • 5
mlx-community/Qwen2.5-32B-Instruct-bf16

Text Generation • 33B • Updated Sep 18, 2024 • 12

Whisper

OpenAI Whisper speech recognition models in MLX format

mlx-community/whisper-large-v3-mlx

Updated Aug 9, 2024 • 8.45k • 61
mlx-community/whisper-tiny-mlx-q4

Updated Mar 9, 2024 • 41 • 2
mlx-community/whisper-large-v2-mlx-fp32

Updated Aug 9, 2024 • 1
mlx-community/whisper-tiny.en-mlx-q4

Updated Mar 9, 2024 • 9

Yi-1.5

mlx-community/Yi-1.5-9B-Chat-4bit

1B • Updated May 13, 2024 • 33 • 2
mlx-community/Yi-1.5-34B-Chat-8bit

10B • Updated May 13, 2024 • 24 • 3
mlx-community/Yi-1.5-9B-8bit

2B • Updated May 13, 2024 • 1 • 1
mlx-community/Yi-1.5-34B-8bit

10B • Updated May 13, 2024 • 1

Google Gemma2

mlx-community/gemma-2-27b-8bit

Text Generation • 8B • Updated Jun 27, 2024 • 5 • 2
mlx-community/gemma-2-27b-it-8bit

Text Generation • 8B • Updated Nov 6, 2024 • 71 • 10
mlx-community/gemma-2-9b-8bit

Text Generation • 3B • Updated Jun 27, 2024 • 71 • 9
mlx-community/gemma-2-27b-4bit

Text Generation • 4B • Updated Jun 27, 2024 • 6

HF SmolLM

A series of smol LLMs: 135M, 360M and 1.7B.

mlx-community/SmolLM-135M-8bit

Text Generation • 37.9M • Updated Jul 16, 2024 • 14
mlx-community/SmolLM-135M-fp16

Text Generation • 0.1B • Updated Jul 16, 2024 • 6 • 1
mlx-community/SmolLM-360M-4bit

Text Generation • 56.6M • Updated Jul 16, 2024 • 26
mlx-community/SmolLM-360M-8bit

Text Generation • 0.1B • Updated Jul 16, 2024 • 16

Llama 3.1

mlx-community/Meta-Llama-3.1-70B-bf16

Text Generation • 71B • Updated Jul 23, 2024 • 18 • 4
mlx-community/Meta-Llama-3.1-70B-Instruct-bf16

Text Generation • 71B • Updated Oct 6, 2024 • 9 • 2
mlx-community/Meta-Llama-3.1-8B-Instruct-bf16

Text Generation • 8B • Updated Oct 19, 2024 • 235 • 3
mlx-community/Meta-Llama-3.1-8B-Instruct-8bit

Text Generation • 2B • Updated Nov 26, 2024 • 318 • 10

Llama 3.2

Meta goes small with Llama3.2, both text only 1B and 3B, and the 11B Vision models.

mlx-community/Llama-3.2-11B-Vision-Instruct-abliterated

Image-Text-to-Text • 11B • Updated Dec 16, 2024 • 667 • 7
mlx-community/Llama-3.2-11B-Vision-Instruct-abliterated-8-bit

Image-Text-to-Text • 3B • Updated Dec 16, 2024 • 74
mlx-community/Llama-3.2-11B-Vision-Instruct-abliterated-4-bit

Image-Text-to-Text • 2B • Updated Dec 16, 2024 • 91 • 1
mlx-community/Llama-3.2-11B-Vision-Instruct-8bit

Image-to-Text • 3B • Updated Oct 18, 2024 • 581 • 10

Granite 4.0 Nano Language Models

mlx-community/granite-4.0-h-1b-3bit

Text Generation • 0.2B • Updated 3 days ago • 30
mlx-community/granite-4.0-h-1b-4bit

Text Generation • 0.2B • Updated 3 days ago • 45
mlx-community/granite-4.0-h-1b-5bit

Text Generation • 0.3B • Updated 3 days ago • 10
mlx-community/granite-4.0-h-1b-6bit

Text Generation • 0.3B • Updated 3 days ago • 10

LLaDA 2.0

mlx-community/LLaDA2.0-mini-preview-4bit

Text Generation • 16B • Updated 13 days ago • 87 • 1
mlx-community/LLaDA2.0-flash-preview-4bit

Text Generation • 103B • Updated 6 days ago • 42 • 2

MiniMax-M2

mlx-community/MiniMax-M2-3bit

Text Generation • 229B • Updated 4 days ago • 1.26k • 2
mlx-community/MiniMax-M2-4bit

Text Generation • 229B • Updated 4 days ago • 2.12k • 8
mlx-community/MiniMax-M2-5bit

Text Generation • 229B • Updated 3 days ago • 530 • 2
mlx-community/MiniMax-M2-6bit

Text Generation • 229B • Updated 3 days ago • 738 • 1

olmOCR 2

mlx-community/olmOCR-2-7B-1025-bf16

Image-to-Text • 8B • Updated 9 days ago • 304 • 2
mlx-community/olmOCR-2-7B-1025-8bit

Image-to-Text • Updated 7 days ago • 157
mlx-community/olmOCR-2-7B-1025-6bit

Image-to-Text • Updated 7 days ago • 72
mlx-community/olmOCR-2-7B-1025-5bit

Image-to-Text • Updated 7 days ago • 110

Qwen3-VL

mlx-community/Qwen3-VL-4B-Instruct-3bit

Image-Text-to-Text • 0.9B • Updated 17 days ago • 271
mlx-community/Qwen3-VL-4B-Instruct-5bit

Image-Text-to-Text • 1B • Updated 17 days ago • 124
mlx-community/Qwen3-VL-4B-Instruct-6bit

Image-Text-to-Text • 1B • Updated 17 days ago • 101
mlx-community/Qwen3-VL-4B-Instruct-8bit

Image-Text-to-Text • 2B • Updated 17 days ago • 567 • 3

Nanonets OCR2

This collection houses Nanonets-OCR2 models

mlx-community/Nanonets-OCR2-3B-bf16

Image-Text-to-Text • 4B • Updated 17 days ago • 302
mlx-community/Nanonets-OCR2-3B-8bit

Image-Text-to-Text • 2B • Updated 17 days ago • 271
mlx-community/Nanonets-OCR2-3B-6bit

Image-Text-to-Text • 1B • Updated 17 days ago • 115
mlx-community/Nanonets-OCR2-3B-4bit

Image-Text-to-Text • 1B • Updated 17 days ago • 328

💧LFM2-8B-A1B-MoE

Best in Class MoE, better than Qwen3. Optimised for Smaller devices sub 16 GB (M1/2/3/4) Apple Silicon.

mlx-community/LFM2-8B-A1B-3bit-MLX

Text Generation • 1B • Updated 24 days ago • 780 • 1
mlx-community/LFM2-8B-A1B-8bit-MLX

Text Generation • 8B • Updated 24 days ago • 380 • 2
mlx-community/LFM2-8B-A1B-6bit-MLX

Text Generation • 8B • Updated 24 days ago • 123
mlx-community/LFM2-8B-A1B-4bit

Text Generation • 1B • Updated 24 days ago • 1.77k • 3

ServiceNow-Apriel

Apriel-1.5-15b-Thinker is a multimodal reasoning model in ServiceNow’s Apriel SLM series which achieves competitive performance against models 10 time

mlx-community/Apriel-1.5-15b-Thinker-4bit

Text Generation • Updated 28 days ago • 687 • 2
mlx-community/Apriel-1.5-15b-Thinker-5bit

Text Generation • Updated 28 days ago • 119
mlx-community/Apriel-1.5-15b-Thinker-6bit-MLX

Image-Text-to-Text • Updated 28 days ago • 194 • 1
mlx-community/Apriel-1.5-15b-Thinker-3bit-MLX

Image-Text-to-Text • Updated 28 days ago • 80

Qwen3-Coder-MoE

💻 Significant Performance: among open models on Agentic Coding, Agentic Browser-Use, and other foundational coding tasks, achieving ~Claude Sonnet.

mlx-community/Qwen3-Coder-30B-A3B-Instruct-4bit

Text Generation • 31B • Updated Aug 6 • 630 • 10
mlx-community/Qwen3-Coder-480B-A35B-Instruct-4bit

Text Generation • 480B • Updated Jul 22 • 444 • 18
mlx-community/Qwen3-Coder-30B-A3B-Instruct-8bit

Text Generation • 31B • Updated Jul 31 • 427 • 2
mlx-community/Qwen3-Coder-30B-A3B-Instruct-8bit-DWQ-lr9e8

Text Generation • 31B • Updated Aug 1 • 131 • 1

Granite-4.0 Family

mlx-community/Granite-4.0-H-Tiny-4bit-DWQ

Text Generation • 1B • Updated 28 days ago • 902 • 2
mlx-community/granite-4.0-h-micro-8bit

Text Generation • 0.9B • Updated 29 days ago • 498 • 1
mlx-community/granite-4.0-h-small-4bit

Text Generation • 32B • Updated 29 days ago • 325
mlx-community/granite-4.0-tiny-preview-4bit

Text Generation • 1B • Updated Sep 10 • 61

GLM-4.6

mlx-community/GLM-4.6-4bit

Text Generation • 353B • Updated Sep 30 • 3.63k • 11
mlx-community/GLM-4.6-5bit

Text Generation • 353B • Updated Sep 30 • 1.24k • 3

GLM-4.5-Air

mlx-community/GLM-4.5-Air-mxfp4

Text Generation • Updated Sep 26 • 471 • 2
mlx-community/GLM-4.5-Air-3bit-DWQ-v2

Text Generation • 107B • Updated Aug 13 • 217 • 3
mlx-community/GLM-4.5-Air-8bit

Text Generation • 107B • Updated Jul 29 • 220 • 6
mlx-community/GLM-4.5-Air-4bit

Text Generation • 107B • Updated Jul 28 • 623 • 25

Qwen3 Next

Alibaba's first hybrid model, designed to cut resources and speed things up.

mlx-community/Qwen3-Next-80B-A3B-Thinking-8bit

Text Generation • 80B • Updated Sep 13 • 164 • 2
mlx-community/Qwen3-Next-80B-A3B-Thinking-6bit

Text Generation • 80B • Updated Sep 13 • 45
mlx-community/Qwen3-Next-80B-A3B-Thinking-5bit

Text Generation • 80B • Updated Sep 13 • 38
mlx-community/Qwen3-Next-80B-A3B-Thinking-4bit

Text Generation • 80B • Updated Sep 13 • 210 • 2

SEA-LION

SEA-LION mlx models by AI Singapore.

mlx-community/Gemma-SEA-LION-v4-27B-IT-mlx-4bit

Text Generation • 27B • Updated Sep 10 • 5 • 1
mlx-community/Llama-SEA-LION-v3.5-8B-R-mlx-4bit

Text Generation • 2B • Updated Sep 10 • 2
mlx-community/Gemma-SEA-LION-v3-9B-IT-mlx-4bit

Text Generation • 9B • Updated Sep 10 • 4
mlx-community/Llama-SEA-LION-v3-8B-IT-mlx-4bit

Text Generation • 2B • Updated Sep 10 • 1

Lille 130M

Very Small smart model created for the mobile

mlx-community/lille-130m-instruct-bf16

Text Generation • 0.1B • Updated Sep 5 • 45
mlx-community/lille-130m-instruct-fp16

Text Generation • 0.1B • Updated Sep 5 • 43 • 1
mlx-community/lille-130m-instruct-8bit

Text Generation • 35.8M • Updated Sep 5 • 7
mlx-community/lille-130m-instruct-6bit

Text Generation • 27.8M • Updated Sep 5 • 5

EmbeddingGemma

mlx-community/embeddinggemma-300m-4bit

Sentence Similarity • 48.1M • Updated Sep 4 • 166 • 2
mlx-community/embeddinggemma-300m-5bit

Sentence Similarity • 57.7M • Updated Sep 4 • 43
mlx-community/embeddinggemma-300m-6bit

Sentence Similarity • 67.4M • Updated Sep 4 • 56
mlx-community/embeddinggemma-300m-8bit

Sentence Similarity • 86.6M • Updated Sep 4 • 509 • 2

Apertus

SwissAI's Apertus models that support 1k languages

mlx-community/Apertus-8B-Instruct-2509-bf16

Text Generation • 8B • Updated Sep 3 • 345 • 4
mlx-community/Apertus-8B-Instruct-2509-8bit

Text Generation • 8B • Updated Sep 3 • 117
mlx-community/Apertus-8B-Instruct-2509-6bit

Text Generation • 8B • Updated Sep 3 • 52
mlx-community/Apertus-8B-Instruct-2509-4bit

Text Generation • 1B • Updated Sep 3 • 203 • 1

Swahili Gemma 1B

A fine-tuned Gemma 3 1B instruction model specialized for English-to-Swahili translation and Swahili conversational AI. The model accepts input in bot

mlx-community/swahili-gemma-1b-mlx-fp16

Text Generation • Updated Aug 26 • 21 • 1

LFM2-VL

mlx-community/LFM2-VL-450M-4bit

Image-Text-to-Text • 0.1B • Updated Aug 16 • 43 • 1
mlx-community/LFM2-VL-450M-5bit

Image-Text-to-Text • 0.2B • Updated Aug 16 • 5 • 1
mlx-community/LFM2-VL-450M-6bit

Updated Aug 16 • 1
mlx-community/LFM2-VL-450M-8bit

Image-Text-to-Text • 0.2B • Updated Aug 16 • 101 • 10

Gemma 3-270m

mlx-community/gemma-3-270m-it-4bit

Text Generation • 41.9M • Updated Aug 14 • 268 • 8
mlx-community/gemma-3-270m-it-5bit

Text Generation • 81.8M • Updated Aug 9 • 15
mlx-community/gemma-3-270m-it-6bit

Text Generation • 95.4M • Updated Aug 9 • 13
mlx-community/gemma-3-270m-it-8bit

Text Generation • 0.1B • Updated Aug 9 • 1.07k • 2

VisualQuality-R1

Image Quality Assessment

mlx-community/VisualQuality-R1-7B-bf16

Reinforcement Learning • 8B • Updated Aug 6 • 19
mlx-community/VisualQuality-R1-7B-8bit

Reinforcement Learning • Updated Aug 6 • 9
mlx-community/VisualQuality-R1-7B-6bit

Reinforcement Learning • Updated Aug 6 • 8
mlx-community/VisualQuality-R1-7B-4bit

Reinforcement Learning • Updated Aug 6 • 10

GLM 4.5

mlx-community/GLM-4.5-Air-4bit

Text Generation • 107B • Updated Jul 28 • 623 • 25
mlx-community/GLM-4.5-4bit

Text Generation • 353B • Updated Jul 28 • 306 • 16

olmOCR-0725

mlx-community/olmOCR-7B-0725-bf16

Image-to-Text • 8B • Updated Jul 24 • 22
mlx-community/olmOCR-7B-0725-8bit

Image-to-Text • Updated Jul 25 • 75 • 4
mlx-community/olmOCR-7B-0725-6bit

Image-to-Text • Updated Jul 25 • 21

LFM2

mlx-community/LFM2-350M-4bit

Text Generation • 55.4M • Updated Jul 11 • 72 • 1
mlx-community/LFM2-1.2B-5bit

Text Generation • 0.2B • Updated Jul 12 • 5 • 1
mlx-community/LFM2-1.2B-6bit

Text Generation • 0.3B • Updated Jul 12 • 41 • 3
mlx-community/LFM2-1.2B-8bit

Text Generation • 0.3B • Updated Jul 12 • 60 • 1

SmolLM3

mlx-community/SmolLM3-3B-4bit

Text Generation • 0.5B • Updated Jul 8 • 2.7k • 4
mlx-community/SmolLM3-3B-3bit

Text Generation • 0.4B • Updated Jul 8 • 20 • 2
mlx-community/SmolLM3-3B-5bit

Text Generation • 0.6B • Updated Jul 8 • 34
mlx-community/SmolLM3-3B-8bit

Text Generation • 0.9B • Updated Jul 8 • 53 • 6

ERNIE-4.5

mlx-community/ERNIE-4.5-300B-A47B-PT-4bit

Text Generation • 299B • Updated Jul 4 • 20 • 2
mlx-community/ERNIE-4.5-21B-A3B-PT-bf16

Text Generation • 22B • Updated Jul 4 • 20 • 1
mlx-community/ERNIE-4.5-21B-A3B-PT-8bit

Text Generation • 22B • Updated Jul 4 • 20 • 2
mlx-community/ERNIE-4.5-21B-A3B-PT-6bit

Text Generation • 22B • Updated Jul 4 • 5

DiffuCoder-7B

Apple's text based diffusion model

mlx-community/DiffuCoder-7B-cpGRPO-8bit

Text Generation • 8B • Updated Jul 4 • 64 • 6
mlx-community/DiffuCoder-7B-cpGRPO-6bit

Text Generation • 8B • Updated Jul 4 • 13 • 1
mlx-community/DiffuCoder-7B-cpGRPO-4bit

Text Generation • 1B • Updated Jul 4 • 26 • 3

JetBrains Mellum

Series of code models by JetBrains

mlx-community/Mellum-4b-base

Text Generation • 4B • Updated Jun 28 • 15 • 1
mlx-community/Mellum-4b-base-4bit

Text Generation • 0.6B • Updated Jun 28 • 32 • 2
mlx-community/Mellum-4b-base-8bit

Text Generation • 1B • Updated Jun 28 • 9
mlx-community/Mellum-4b-sft-python

Text Generation • 4B • Updated Jun 28 • 28 • 1

Gemma 3n - Text Only (LM)

Google's Gemma 3n converted to MLX using mlx-lm

mlx-community/gemma-3n-E4B-it-lm-bf16

Text Generation • 7B • Updated Jun 29 • 93 • 4
mlx-community/gemma-3n-E2B-it-lm-bf16

Text Generation • 4B • Updated Jun 29 • 100
mlx-community/gemma-3n-E4B-it-lm-4bit

Text Generation • 1B • Updated Jun 29 • 5.42k • 4
mlx-community/gemma-3n-E2B-it-lm-4bit

Text Generation • 0.7B • Updated Jun 29 • 5.3k • 1

Gemma 3n

mlx-community/gemma-3n-E2B-it-4bit

Image-Text-to-Text • 2B • Updated Jul 12 • 621 • 8
mlx-community/gemma-3n-E4B-it-4bit

Image-Text-to-Text • Updated Jul 13 • 616 • 5
mlx-community/gemma-3n-E4B-it-5bit

Image-Text-to-Text • Updated Jul 12 • 24
mlx-community/gemma-3n-E2B-it-5bit

Image-Text-to-Text • 2B • Updated Jul 12 • 16

Nanonets OCR

This collection houses Nanonets-OCR-s

mlx-community/Nanonets-OCR-s-bf16

Image-Text-to-Text • 4B • Updated Jun 18 • 228 • 2
mlx-community/Nanonets-OCR-s-8bit

Image-Text-to-Text • 2B • Updated Jul 25 • 41
mlx-community/Nanonets-OCR-s-6bit

Image-Text-to-Text • 1B • Updated Jul 25 • 31
mlx-community/Nanonets-OCR-s-4bit

Image-Text-to-Text • 1B • Updated Jul 25 • 61

BitNet 1.58

This collection houses BitNet-1.58, Falcon3-1.58 and Falcon-E quants.

mlx-community/bitnet-b1.58-2B-4T

Text Generation • 0.8B • Updated Jun 10 • 51 • 1
mlx-community/bitnet-b1.58-2B-4T-4bit

Text Generation • 0.6B • Updated Jun 10 • 119
mlx-community/bitnet-b1.58-2B-4T-8bit

Text Generation • 0.6B • Updated Jun 10 • 115
mlx-community/bitnet-b1.58-2B-4T-6bit

Text Generation • 0.6B • Updated Jun 10 • 10

Holo1

mlx-community/Holo1-3B-3bit

Image-Text-to-Text • 1B • Updated Jun 3 • 1 • 1
mlx-community/Holo1-3B-4bit

Image-Text-to-Text • 1B • Updated Jun 3 • 1 • 1
mlx-community/Holo1-3B-6bit

Image-Text-to-Text • 1B • Updated Jun 3 • 1
mlx-community/Holo1-3B-8bit

Image-Text-to-Text • 2B • Updated Jun 3

Qwen3 DWQ Quants

High-quality 4-bit quants of the Qwen3 model family.

mlx-community/Qwen3-14B-4bit-DWQ-053125

Text Generation • 2B • Updated Jun 2 • 142 • 4
mlx-community/Qwen3-8B-4bit-DWQ-053125

Text Generation • 1B • Updated Jun 1 • 156 • 1
mlx-community/Qwen3-4B-4bit-DWQ-053125

Text Generation • 0.6B • Updated Jun 1 • 112 • 2
mlx-community/Qwen3-1.7B-4bit-DWQ-053125

Text Generation • 0.3B • Updated Jun 1 • 126 • 2

DeepSeek R1 0528

mlx-community/DeepSeek-R1-0528-4bit

Text Generation • 105B • Updated May 29 • 223 • 17
mlx-community/DeepSeek-R1-0528-Qwen3-8B-4bit

Text Generation • 1B • Updated May 30 • 688 • 4
mlx-community/DeepSeek-R1-0528-Qwen3-8B-4bit-DWQ

Text Generation • 1B • Updated May 29 • 151 • 8
mlx-community/DeepSeek-R1-0528-Qwen3-8B-8bit

Text Generation • 2B • Updated May 29 • 44 • 1

AceReason Nemotron

mlx-community/AceReason-Nemotron-7B-4bit

Text Generation • 1B • Updated May 26 • 61
mlx-community/AceReason-Nemotron-7B-8bit

Text Generation • 2B • Updated May 26
mlx-community/AceReason-Nemotron-7B-bf16

Text Generation • 8B • Updated May 26 • 3
mlx-community/AceReason-Nemotron-14B-4bit

Text Generation • 2B • Updated May 24 • 6

Devstral Small 2505

mlx-community/Devstral-Small-2505-3bit

Text Generation • 3B • Updated May 21 • 21 • 1
mlx-community/Devstral-Small-2505-4bit

Text Generation • 4B • Updated May 21 • 23 • 2
mlx-community/Devstral-Small-2505-6bit

Text Generation • Updated May 21 • 12 • 1
mlx-community/Devstral-Small-2505-8bit

Text Generation • Updated May 21 • 22 • 1

MedGemma

Collection of Gemma 3 variants for performance on medical text and image comprehension to accelerate building healthcare-based AI applications.

mlx-community/medgemma-4b-it-4bit

Image-Text-to-Text • 0.9B • Updated Jun 9 • 55 • 2
mlx-community/medgemma-4b-it-6bit

Image-Text-to-Text • 1B • Updated Jun 9 • 24 • 1
mlx-community/medgemma-4b-it-8bit

Image-Text-to-Text • 1B • Updated Jun 9 • 62 • 1
mlx-community/medgemma-4b-it-bf16

Image-Text-to-Text • 5B • Updated Jun 9 • 47 • 1

OuteTTS-1.0

mlx-community/Llama-OuteTTS-1.0-1B-fp16

Text-to-Speech • 1B • Updated May 19 • 28 • 3
mlx-community/Llama-OuteTTS-1.0-1B-4bit

Text-to-Speech • 0.2B • Updated May 19 • 116 • 1
mlx-community/Llama-OuteTTS-1.0-1B-8bit

Text-to-Speech • 0.4B • Updated May 19 • 15 • 1
mlx-community/Llama-OuteTTS-1.0-1B-6bit

Text-to-Speech • 0.3B • Updated May 19 • 6

SimCLRv1

mlx-community/simclrv1-imagenet1k-resnet50-1x

Image Classification • Updated May 14
mlx-community/simclrv1-imagenet1k-resnet50-2x

Image Classification • Updated May 14
mlx-community/simclrv1-imagenet1k-resnet50-4x

Image Classification • Updated May 14

Gemma 3 DWQ

Gemma 3 distilled weight quantized (DWQ) models

mlx-community/gemma-3-4b-it-4bit-DWQ

Text Generation • 0.7B • Updated May 14 • 119 • 1
mlx-community/gemma-3-12b-it-4bit-DWQ

Text Generation • 2B • Updated May 18 • 110 • 2
mlx-community/gemma-3-1b-it-4bit-DWQ

Text Generation • 0.2B • Updated May 14 • 65
mlx-community/gemma-3-27b-it-4bit-DWQ

Text Generation • 4B • Updated May 14 • 116 • 3

Parakeet

Nvidia's ASR models, now in MLX!

mlx-community/parakeet-ctc-0.6b

Automatic Speech Recognition • 0.6B • Updated May 10 • 457 • 2
mlx-community/parakeet-rnnt-0.6b

Automatic Speech Recognition • 0.6B • Updated May 10 • 1.09k
mlx-community/parakeet-ctc-1.1b

Automatic Speech Recognition • 1B • Updated May 10 • 15 • 1
mlx-community/parakeet-rnnt-1.1b

Automatic Speech Recognition • 1B • Updated May 10 • 25 • 1

Josiefied and Abliterated Qwen3

Abliterated, and further fine-tuned to be the most uncensored models available. Now in MLX

mlx-community/Josiefied-Qwen3-30B-A3B-abliterated-v2-bf16

Text Generation • 31B • Updated Jun 19 • 28
mlx-community/Josiefied-Qwen3-30B-A3B-abliterated-v2-8bit

Text Generation • 31B • Updated Jun 19 • 115
mlx-community/Josiefied-Qwen3-30B-A3B-abliterated-v2-6bit

Text Generation • 31B • Updated Jun 19 • 34
mlx-community/Josiefied-Qwen3-30B-A3B-abliterated-v2-4bit

Text Generation • 31B • Updated Jun 19 • 160 • 2

Qwen3

mlx-community/Qwen3-0.6B-4bit

Text Generation • 93.2M • Updated Apr 28 • 5.09k • 5
mlx-community/Qwen3-0.6B-6bit

Text Generation • 0.1B • Updated Apr 28 • 37
mlx-community/Qwen3-0.6B-8bit

Text Generation • 0.2B • Updated May 4 • 967 • 5
mlx-community/Qwen3-0.6B-bf16

Text Generation • 0.6B • Updated Apr 28 • 306 • 4

NariLabs Dia-1.5B

mlx-community/Dia-1.6B

Text-to-Speech • 2B • Updated Apr 23 • 90 • 23
mlx-community/Dia-1.6B-6bit

Text-to-Speech • 2B • Updated Apr 24 • 19 • 10
mlx-community/Dia-1.6B-4bit

Text-to-Speech • 2B • Updated Apr 24 • 61 • 12
mlx-community/Dia-1.6B-3bit

Text-to-Speech • 2B • Updated Apr 24 • 5 • 3

GLM4

The GLM-4 and Z1 series are powerful open-source language models excelling in reasoning, code, and complex tasks.

mlx-community/GLM-Z1-32B-0414-4bit

Text Generation • 5B • Updated Apr 19 • 142 • 2
mlx-community/GLM-4-32B-0414-4bit

Text Generation • 5B • Updated Apr 21 • 255 • 5
mlx-community/GLM-4-32B-Base-0414-8bit

Text Generation • 9B • Updated Apr 21 • 22
mlx-community/GLM-4-32B-Base-0414-6bit

Text Generation • 7B • Updated Apr 21 • 26

PLaMo

mlx-community/plamo-2-8b

Text Generation • 9B • Updated Mar 16 • 2
mlx-community/plamo-2-1b

Text Generation • 1B • Updated Mar 15 • 202 • 4
mlx-community/plamo-2-1b-bf16

Text Generation • 1B • Updated Mar 15 • 44 • 2
mlx-community/plamo-2-8b-4bit

Text Generation • 1B • Updated Mar 15 • 3

Kimi-VL Thinking

mlx-community/Kimi-VL-A3B-Thinking-4bit

Image-Text-to-Text • Updated Apr 21 • 270 • 7
mlx-community/Kimi-VL-A3B-Thinking-6bit

Image-Text-to-Text • Updated Apr 17 • 20
mlx-community/Kimi-VL-A3B-Thinking-8bit

Image-Text-to-Text • Updated May 26 • 76 • 3

Gemma 3 QAT

Quantization Aware Trained (QAT) Gemma 3 checkpoints. The model preserves similar quality as half precision while using 3x less memory.

mlx-community/gemma-3-27b-it-qat-bf16

Image-Text-to-Text • Updated Apr 18 • 123 • 5
mlx-community/gemma-3-27b-it-qat-8bit

Image-Text-to-Text • Updated Apr 19 • 171 • 9
mlx-community/gemma-3-27b-it-qat-6bit

Image-Text-to-Text • Updated Apr 19 • 25
mlx-community/gemma-3-27b-it-qat-4bit

Image-Text-to-Text • Updated Apr 19 • 83.6k • 20

Llama 4

mlx-community/Llama-4-Scout-17B-16E-Instruct-4bit

Image-Text-to-Text • Updated May 3 • 369 • 9
mlx-community/Llama-4-Scout-17B-16E-Instruct-6bit

Image-Text-to-Text • Updated May 3 • 210 • 5
mlx-community/Llama-4-Scout-17B-16E-Instruct-8bit

Image-Text-to-Text • Updated May 3 • 222 • 3
mlx-community/Llama-4-Maverick-17B-16E-Instruct-4bit

Text Generation • 63B • Updated Apr 6 • 283 • 7

ModernBert

mlx-community/answerdotai-ModernBERT-base-8bit

Fill-Mask • 53M • Updated Apr 2 • 3
mlx-community/answerdotai-ModernBERT-base-4bit

Fill-Mask • 29.5M • Updated Apr 2 • 10
mlx-community/answerdotai-ModernBERT-base-bf16

Fill-Mask • 0.2B • Updated Apr 2 • 18 • 1
mlx-community/answerdotai-ModernBERT-Large-Instruct-4bit

Fill-Mask • 70M • Updated Apr 2 • 4

Gemma 3

A collection of lightweight, state-of-the-art open models built from the same research and technology that powers the Gemini 2.0 models

mlx-community/gemma-3-4b-it-8bit

Image-Text-to-Text • 2B • Updated Mar 19 • 625 • 5
mlx-community/gemma-3-4b-pt-4bit

Image-Text-to-Text • 1B • Updated Mar 18 • 45 • 3
mlx-community/gemma-3-4b-it-bf16

Image-Text-to-Text • 5B • Updated Mar 20 • 157 • 1
mlx-community/gemma-3-4b-pt-6bit

Image-Text-to-Text • 1B • Updated Mar 18 • 9

Qwen QwQ

mlx-community/QwQ-32B-4bit

Text Generation • 5B • Updated Mar 5 • 79 • 37
mlx-community/QwQ-32B-6bit

Text Generation • 7B • Updated Mar 5 • 9 • 3
mlx-community/QwQ-32B-8bit

Text Generation • 9B • Updated Mar 5 • 35 • 14
mlx-community/QwQ-32B-3bit

Text Generation • 4B • Updated Mar 5 • 4

OLMoE

mlx-community/OLMoE-1B-7B-0125-Instruct

Text Generation • 7B • Updated Mar 4 • 4
mlx-community/OLMoE-1B-7B-0125-Instruct-8bit

Text Generation • 2B • Updated Mar 4 • 5
mlx-community/OLMoE-1B-7B-0125-Instruct-6bit

Text Generation • 2B • Updated Mar 4
mlx-community/OLMoE-1B-7B-0125-Instruct-4bit

Text Generation • 1B • Updated Mar 4 • 28 • 2

UI-TARS

mlx-community/UI-TARS-7B-SFT-6bit

Image-Text-to-Text • 2B • Updated Mar 3 • 6
mlx-community/UI-TARS-7B-SFT-4bit

Image-Text-to-Text • 2B • Updated Mar 3 • 4
mlx-community/UI-TARS-7B-SFT-8bit

Image-Text-to-Text • 3B • Updated Mar 3 • 5
mlx-community/UI-TARS-7B-SFT-bf16

Image-Text-to-Text • 8B • Updated Mar 3 • 5

olmOCR

mlx-community/olmOCR-7B-0225-preview-bf16

Image-to-Text • 8B • Updated Mar 3 • 86 • 4
mlx-community/olmOCR-7B-0225-preview-4bit

Image-to-Text • 2B • Updated Mar 3 • 36 • 1
mlx-community/olmOCR-7B-0225-preview-6bit

Image-to-Text • 2B • Updated Mar 3 • 9
mlx-community/olmOCR-7B-0225-preview-8bit

Updated Mar 3

Kokoro TTS

Kokoro is an open-weight TTS model with 82 million parameters. Despite its lightweight architecture, it delivers amazing quality.

mlx-community/Kokoro-82M-bf16

Text-to-Speech • Updated Mar 8 • 5.1k • 25
mlx-community/Kokoro-82M-8bit

Text-to-Speech • Updated Mar 8 • 84 • 6
mlx-community/Kokoro-82M-6bit

Text-to-Speech • Updated Mar 8 • 28 • 2
mlx-community/Kokoro-82M-4bit

Text-to-Speech • Updated Mar 8 • 211 • 3

FuseAI

FuseAI is attempting to merge CoT models to achieve newer models that are more than the sum of their parts.

miabchdave/FuseO1-DeepSeekR1-QwQ-SkyT1-32B-Preview-MLX-8Q

Question Answering • 9B • Updated Feb 19 • 6 • 1

Arcee Virtuoso

mlx-community/Virtuoso-Medium-v2-8bit

Text Generation • 9B • Updated Jan 31 • 2 • 1
mlx-community/Virtuoso-Medium-v2-3bit

Text Generation • 4B • Updated Jan 30
mlx-community/Virtuoso-Medium-v2-4bit

Text Generation • 5B • Updated Jan 30 • 1
mlx-community/Virtuoso-Medium-v2-6bit

Text Generation • 7B • Updated Jan 30

Mistral Small

mlx-community/Mistral-Small-24B-Instruct-2501-4bit

4B • Updated Jan 30 • 121 • 14
mlx-community/Mistral-Small-24B-Instruct-2501-3bit

3B • Updated Jan 30 • 22 • 2
mlx-community/Mistral-Small-24B-Instruct-2501-6bit

5B • Updated Jan 30 • 10
mlx-community/Mistral-Small-24B-Instruct-2501-8bit

7B • Updated Jan 30 • 25 • 3

DeepSeek-R1-Distill

mlx-community/deepseek-r1-distill-qwen-1.5b

2B • Updated Feb 26 • 644 • 23
mlx-community/DeepSeek-R1-Distill-Qwen-7B-4bit

1B • Updated Feb 26 • 576 • 17
mlx-community/DeepSeek-R1-Distill-Qwen-7B-8bit

2B • Updated Feb 26 • 122 • 8
mlx-community/DeepSeek-R1-Distill-Llama-8B-4bit

1B • Updated Feb 26 • 287 • 10

Qwen2.5-VL

mlx-community/Qwen2.5-VL-72B-Instruct-8bit

Image-Text-to-Text • 21B • Updated Feb 25 • 59 • 2
mlx-community/Qwen2.5-VL-72B-Instruct-6bit

Image-Text-to-Text • 16B • Updated Feb 25 • 21 • 1
mlx-community/Qwen2.5-VL-72B-Instruct-4bit

Image-Text-to-Text • 12B • Updated Feb 25 • 228 • 7
mlx-community/Qwen2.5-VL-72B-Instruct-3bit

Image-Text-to-Text • 10B • Updated Feb 25 • 38 • 5

Qwen2.5-1M

mlx-community/Qwen2.5-7B-Instruct-1M-4bit

Text Generation • 1B • Updated Jan 26 • 131 • 10
mlx-community/Qwen2.5-7B-Instruct-1M-6bit

Text Generation • 2B • Updated Jan 26 • 1 • 2
mlx-community/Qwen2.5-7B-Instruct-1M-3bit

Text Generation • 1.0B • Updated Jan 26 • 2
mlx-community/Qwen2.5-7B-Instruct-1M-8bit

Text Generation • 2B • Updated Jan 26 • 13 • 3

Mamba2

mlx-community/Mamba-Codestral-7B-v0.1

7B • Updated Jan 21 • 60 • 2
mlx-community/Mamba-Codestral-7B-v0.1-8bit

2B • Updated Jan 21 • 62 • 2
mlx-community/Mamba-Codestral-7B-v0.1-4bit

1B • Updated Jan 21 • 60 • 1
mlx-community/mamba2-2.7b-8bit

0.8B • Updated Jan 21 • 1

Jina Reader-LM

Convert HTML content to LLM-friendly Markdown/JSON content

mlx-community/jinaai-ReaderLM-v2

0.2B • Updated Jan 17 • 194 • 22
mlx-community/reader-lm-1.5b

Text Generation • 2B • Updated Jan 18 • 1
mlx-community/reader-lm-0.5b

Text Generation • 0.5B • Updated Jan 18

Helium-1

Kyutai's Helium-1 2B Model, outperforming other state of the art small models.

mlx-community/helium-1-preview-2b-float32

Text Generation • 2B • Updated Jan 18 • 2
mlx-community/helium-1-preview-2b

Text Generation • 2B • Updated Jan 18
mlx-community/helium-1-preview-2b-8bit

Text Generation • 0.6B • Updated Jan 18 • 8 • 1
mlx-community/helium-1-preview-2b-4bit

Text Generation • 0.3B • Updated Jan 18 • 2 • 1

QVQ-72B-Preview

mlx-community/QVQ-72B-Preview-4bit

Image-Text-to-Text • 11B • Updated Apr 12 • 8 • 7
mlx-community/QVQ-72B-Preview-6bit

Image-Text-to-Text • 16B • Updated Dec 24, 2024 • 2 • 2
mlx-community/QVQ-72B-Preview-3bit

Image-Text-to-Text • 9B • Updated Dec 24, 2024 • 2 • 5
mlx-community/QVQ-72B-Preview-8bit

Image-Text-to-Text • 21B • Updated Dec 24, 2024 • 1 • 3

DeepSeek-VL2

mlx-community/deepseek-vl2-6bit

Image-Text-to-Text • 6B • Updated Dec 22, 2024 • 45 • 1
mlx-community/deepseek-vl2-small-4bit

Image-Text-to-Text • 3B • Updated Dec 22, 2024 • 41
mlx-community/deepseek-vl2-4bit

Image-Text-to-Text • 4B • Updated Dec 22, 2024 • 69 • 1
mlx-community/deepseek-vl2-small-6bit

Image-Text-to-Text • 4B • Updated Dec 22, 2024 • 20

Josiefied and Abliterated Qwen2.5

The best uncensored models

mlx-community/Josiefied-Qwen2.5-Coder-7B-Instruct-abliterated-v1

Text Generation • 8B • Updated Feb 16 • 12
mlx-community/Josiefied-Qwen2.5-Coder-7B-Instruct-abliterated-v1-8bit

Text Generation • 2B • Updated Feb 16 • 11
mlx-community/Josiefied-Qwen2.5-Coder-7B-Instruct-abliterated-v1-6bit

Text Generation • 2B • Updated Feb 16 • 8
mlx-community/Josiefied-Qwen2.5-Coder-7B-Instruct-abliterated-v1-4bit

Text Generation • 1B • Updated Feb 16 • 31 • 1

Falcon3 Mamba

mlx-community/Falcon3-Mamba-7B-Instruct

Text Generation • 7B • Updated Feb 14 • 3
mlx-community/Falcon3-Mamba-7B-Instruct-8bits

Text Generation • 2B • Updated Feb 14 • 4
mlx-community/Falcon3-Mamba-7B-Instruct-4bits

Text Generation • 1B • Updated Feb 14 • 7

EXAONE-3.5

EXAONE 3.5, a collection of instruction-tuned bilingual generative models ranging from 2.4B to 32B parameters, developed by LG AI.

mlx-community/EXAONE-3.5-2.4B-Instruct-4bit

0.4B • Updated Dec 9, 2024 • 2 • 3
mlx-community/EXAONE-3.5-2.4B-Instruct-6bit

0.5B • Updated Dec 9, 2024
mlx-community/EXAONE-3.5-2.4B-Instruct-8bit

0.7B • Updated Dec 9, 2024
mlx-community/EXAONE-3.5-2.4B-Instruct-bf16

2B • Updated Dec 9, 2024

Llama 3.3

mlx-community/Llama-3.3-70B-Instruct-8bit

Text Generation • 20B • Updated Dec 6, 2024 • 373 • 14
mlx-community/Llama-3.3-70B-Instruct-6bit

Text Generation • 15B • Updated Dec 6, 2024 • 69 • 5
mlx-community/Llama-3.3-70B-Instruct-3bit

Text Generation • 9B • Updated Dec 6, 2024 • 106 • 7
mlx-community/Llama-3.3-70B-Instruct-4bit

Text Generation • 11B • Updated Dec 6, 2024 • 1.07k • 30

Paligemma 2

mlx-community/paligemma2-3b-ft-docci-448-8bit

Image-Text-to-Text • 0.9B • Updated Dec 5, 2024
mlx-community/paligemma2-3b-ft-docci-448-6bit

Image-Text-to-Text • 0.7B • Updated Dec 5, 2024
mlx-community/paligemma2-3b-ft-docci-448-bf16

Image-Text-to-Text • 3B • Updated Dec 5, 2024 • 8 • 1
mlx-community/paligemma2-10b-ft-docci-448-bf16

Image-Text-to-Text • 10B • Updated Dec 16, 2024 • 25 • 3

QwQ-32B-Preview

mlx-community/QwQ-32B-Preview-4bit

Text Generation • 5B • Updated Nov 27, 2024 • 9 • 3
mlx-community/QwQ-32B-Preview-3bit

Text Generation • 4B • Updated Nov 28, 2024 • 10 • 5
mlx-community/QwQ-32B-Preview-6bit

Text Generation • 7B • Updated Nov 27, 2024 • 2 • 4
mlx-community/QwQ-32B-Preview-8bit

Text Generation • 9B • Updated Nov 27, 2024 • 13 • 6

Idefics 3 + SmolVLM

mlx-community/SmolVLM-Instruct-4bit

Image-Text-to-Text • 0.5B • Updated Nov 29, 2024 • 228 • 5
mlx-community/SmolVLM-Instruct-6bit

Image-Text-to-Text • 0.6B • Updated Nov 29, 2024 • 8
mlx-community/SmolVLM-Instruct-8bit

Image-Text-to-Text • 0.7B • Updated Nov 29, 2024 • 23 • 9
mlx-community/SmolVLM-Instruct-bf16

Image-Text-to-Text • 2B • Updated Nov 29, 2024 • 13 • 5

Molmo

mlx-community/Molmo-7B-D-0924-4bit

Image-Text-to-Text • 2B • Updated Dec 27, 2024 • 26 • 2
mlx-community/Molmo-7B-D-0924-8bit

Image-Text-to-Text • 3B • Updated Dec 27, 2024 • 13 • 1
mlx-community/Molmo-7B-D-0924-bf16

Image-Text-to-Text • 8B • Updated Jan 1 • 28 • 1
mlx-community/Molmo-7B-D-0924-8bit-skip-vision

Updated Nov 20, 2024

Florence-2

mlx-community/Florence-2-base-ft-4bit

Image-Text-to-Text • 48.8M • Updated Nov 21, 2024 • 64 • 1
mlx-community/Florence-2-large-ft-bf16

Image-Text-to-Text • 0.8B • Updated Nov 21, 2024 • 52 • 1
mlx-community/Florence-2-base-ft-bf16

Image-Text-to-Text • 0.3B • Updated Nov 21, 2024 • 10 • 1
mlx-community/Florence-2-base-ft-8bit

Image-Text-to-Text • 81.7M • Updated Nov 21, 2024 • 29 • 1

Falcon-Mamba

Falcon Mamba models compatible with MLX

mlx-community/falcon-mamba-7b-4bit

1B • Updated Nov 15, 2024 • 5
mlx-community/falcon-mamba-7b-4bit-instruct

Text Generation • 1B • Updated Nov 15, 2024 • 6
mlx-community/falcon-mamba-7b-8bit-instruct

Text Generation • 2B • Updated Nov 15, 2024 • 6
mlx-community/falcon-mamba-7b-8bit

2B • Updated Nov 15, 2024 • 3

Qwen2.5-Coder

Code-specific model series based on Qwen2.5

mlx-community/Qwen2.5-Coder-32B-Instruct-8bit

Text Generation • 9B • Updated Nov 11, 2024 • 90 • 11
mlx-community/Qwen2.5-Coder-14B-Instruct-4bit

Text Generation • 2B • Updated Nov 11, 2024 • 165 • 4
mlx-community/Qwen2.5-Coder-14B-Instruct-bf16

Text Generation • 15B • Updated Nov 11, 2024 • 15 • 2
mlx-community/Qwen2.5-Coder-3B-Instruct-8bit

Text Generation • 0.9B • Updated Nov 11, 2024 • 8

Ministral

mlx-community/Ministral-8B-Instruct-2410-bf16

8B • Updated Oct 17, 2024 • 29 • 2
mlx-community/Ministral-8B-Instruct-2410-4bit

1B • Updated Oct 17, 2024 • 148 • 9
mlx-community/Ministral-8B-Instruct-2410-8bit

2B • Updated Oct 17, 2024 • 24 • 2

Lumimaid

A collection of Neversleep's RP focused Lumimaid LLMs.

mlx-community/Lumimaid-8B-v0.1

2B • Updated Oct 13, 2024 • 1
mlx-community/Lumimaid-70B-v0.1

20B • Updated Oct 13, 2024 • 1
mlx-community/Lumimaid-70B-v0.1-alt

20B • Updated Oct 13, 2024 • 1
mlx-community/Lumimaid-70B-v0.1-OAS

20B • Updated Oct 13, 2024

Mistral

mlx-community/Mistral-7B-Instruct-v0.2

Text Generation • Updated Dec 23, 2023 • 209 • 20
mlx-community/Mistral-7B-v0.2-4bit

1B • Updated Mar 25, 2024 • 77 • 8
mlx-community/Hermes-2-Pro-Mistral-7B-4bit

1B • Updated Mar 14, 2024 • 21 • 3
mlx-community/Mixtral-8x7B-Instruct-v0.1

47B • Updated May 7, 2024 • 214 • 23

Qwen1.5

Qwen1.5 is the improved version of Qwen, the large language model series developed by Alibaba Cloud.

mlx-community/Qwen1.5-1.8B-Chat-4bit

Text Generation • 0.5B • Updated Feb 18, 2024 • 9 • 2
mlx-community/Qwen1.5-0.5B-Chat-4bit

Text Generation • 72.6M • Updated Apr 18, 2024 • 3.47k • 4
mlx-community/Qwen1.5-14B-Chat-4bit

Text Generation • 3B • Updated Mar 8, 2024 • 7 • 1
mlx-community/Qwen1.5-7B-Chat-4bit

Text Generation • 2B • Updated Mar 7, 2024 • 7 • 2

Code Gemma

Google’s Code-Gemma

mlx-community/codegemma-7b-it-8bit

Text Generation • 3B • Updated Apr 9, 2024 • 117 • 5
mlx-community/codegemma-2b-4bit

Text Generation • 0.8B • Updated Apr 9, 2024 • 4
mlx-community/codegemma-7b-4bit

Text Generation • 2B • Updated Apr 9, 2024 • 9
mlx-community/codegemma-7b-it-4bit

Text Generation • 2B • Updated Apr 9, 2024 • 25 • 1

Llama 3

mlx-community/Meta-Llama-3-8B-Instruct-4bit

Text Generation • 2B • Updated Apr 19, 2024 • 1.68k • 79
mlx-community/Meta-Llama-3-8B-4bit

Text Generation • 2B • Updated Apr 20, 2024 • 42 • 8
mlx-community/Meta-Llama-Guard-2-8B-4bit

Text Generation • 2B • Updated Apr 19, 2024 • 5
mlx-community/Meta-Llama-3-70B-4bit

Text Generation • 11B • Updated Apr 20, 2024 • 58 • 9

Qwen2.5

The Qwen 2.5 models are a series of AI models trained on 18 trillion tokens, supporting 29 languages and offering advanced features such as instructio

mlx-community/Qwen2.5-72B-Instruct-bf16

Text Generation • 73B • Updated Sep 19, 2024 • 8
mlx-community/Qwen2.5-72B-Instruct-8bit

Text Generation • 20B • Updated Sep 19, 2024 • 21 • 3
mlx-community/Qwen2.5-72B-Instruct-4bit

Text Generation • 11B • Updated Sep 18, 2024 • 101 • 5
mlx-community/Qwen2.5-32B-Instruct-bf16

Text Generation • 33B • Updated Sep 18, 2024 • 12

Phi-3

mlx-community/Phi-3-mini-4k-instruct-4bit

Text Generation • 0.6B • Updated Jul 11, 2024 • 669 • 12
mlx-community/Phi-3-mini-128k-instruct-4bit

Text Generation • 0.6B • Updated Jul 11, 2024 • 179 • 12
mlx-community/Phi-3-mini-128k-instruct-8bit

Text Generation • 1B • Updated Jul 11, 2024 • 38 • 10
mlx-community/Phi-3-mini-4k-instruct-8bit

Text Generation • 1B • Updated Jul 4, 2024 • 19 • 2

Whisper

OpenAI Whisper speech recognition models in MLX format

mlx-community/whisper-large-v3-mlx

Updated Aug 9, 2024 • 8.45k • 61
mlx-community/whisper-tiny-mlx-q4

Updated Mar 9, 2024 • 41 • 2
mlx-community/whisper-large-v2-mlx-fp32

Updated Aug 9, 2024 • 1
mlx-community/whisper-tiny.en-mlx-q4

Updated Mar 9, 2024 • 9

OpenELM

A family of Open-source Efficient Language Models from Apple.

mlx-community/OpenELM-3B

Text Generation • 3B • Updated Apr 26, 2024 • 7 • 8
mlx-community/OpenELM-1_1B-Instruct-8bit

0.3B • Updated Apr 24, 2024 • 40 • 1
mlx-community/OpenELM-1_1B-Instruct-4bit

0.2B • Updated Apr 24, 2024 • 61 • 2
mlx-community/OpenELM-1_1B-8bit

0.3B • Updated Apr 24, 2024 • 7 • 1

Yi-1.5

mlx-community/Yi-1.5-9B-Chat-4bit

1B • Updated May 13, 2024 • 33 • 2
mlx-community/Yi-1.5-34B-Chat-8bit

10B • Updated May 13, 2024 • 24 • 3
mlx-community/Yi-1.5-9B-8bit

2B • Updated May 13, 2024 • 1 • 1
mlx-community/Yi-1.5-34B-8bit

10B • Updated May 13, 2024 • 1

Mistral (Mamba) Codestral

mlx-community/Codestral-22B-v0.1-8bit

6B • Updated May 29, 2024 • 138 • 8
mlx-community/Codestral-22B-v0.1-4bit

3B • Updated May 29, 2024 • 348 • 13
mlx-community/Mamba-Codestral-7B-v0.1

7B • Updated Jan 21 • 60 • 2
mlx-community/Mamba-Codestral-7B-v0.1-8bit

2B • Updated Jan 21 • 62 • 2

Google Gemma2

mlx-community/gemma-2-27b-8bit

Text Generation • 8B • Updated Jun 27, 2024 • 5 • 2
mlx-community/gemma-2-27b-it-8bit

Text Generation • 8B • Updated Nov 6, 2024 • 71 • 10
mlx-community/gemma-2-9b-8bit

Text Generation • 3B • Updated Jun 27, 2024 • 71 • 9
mlx-community/gemma-2-27b-4bit

Text Generation • 4B • Updated Jun 27, 2024 • 6

Mamba

Mamba is a new LLM architecture that integrates the Structured State Space sequence model to manage lengthy data sequences.

mlx-community/mamba-1.4b-hf-f32

Text Generation • 1B • Updated Sep 21, 2024 • 12
mlx-community/mamba-1.4b-hf-f16

Text Generation • 1B • Updated Sep 21, 2024 • 5 • 1
mlx-community/mamba-790m-hf-f32

Text Generation • 0.8B • Updated Sep 21, 2024 • 3
mlx-community/mamba-790m-hf-f16

Text Generation • 0.8B • Updated Sep 21, 2024

HF SmolLM

A series of smol LLMs: 135M, 360M and 1.7B.

mlx-community/SmolLM-135M-8bit

Text Generation • 37.9M • Updated Jul 16, 2024 • 14
mlx-community/SmolLM-135M-fp16

Text Generation • 0.1B • Updated Jul 16, 2024 • 6 • 1
mlx-community/SmolLM-360M-4bit

Text Generation • 56.6M • Updated Jul 16, 2024 • 26
mlx-community/SmolLM-360M-8bit

Text Generation • 0.1B • Updated Jul 16, 2024 • 16

Mistral NeMo

mlx-community/Mistral-Nemo-Base-2407-8bit

3B • Updated Jul 18, 2024 • 26 • 3
mlx-community/Mistral-Nemo-Base-2407-bf16

12B • Updated Jul 18, 2024 • 6 • 1
mlx-community/Mistral-Nemo-Instruct-2407-4bit

2B • Updated Nov 6, 2024 • 687 • 14
mlx-community/Mistral-Nemo-Base-2407-4bit

2B • Updated Jul 18, 2024 • 8 • 1

Llama 3.1

mlx-community/Meta-Llama-3.1-70B-bf16

Text Generation • 71B • Updated Jul 23, 2024 • 18 • 4
mlx-community/Meta-Llama-3.1-70B-Instruct-bf16

Text Generation • 71B • Updated Oct 6, 2024 • 9 • 2
mlx-community/Meta-Llama-3.1-8B-Instruct-bf16

Text Generation • 8B • Updated Oct 19, 2024 • 235 • 3
mlx-community/Meta-Llama-3.1-8B-Instruct-8bit

Text Generation • 2B • Updated Nov 26, 2024 • 318 • 10

EnCodec

EnCodec models in MLX

mlx-community/encodec-48khz-float32

16.9M • Updated Sep 16, 2024 • 1 • 2
mlx-community/encodec-24khz-bfloat16

19M • Updated Sep 18, 2024
mlx-community/encodec-32khz-bfloat16

57.9M • Updated Sep 18, 2024 • 1
mlx-community/encodec-32khz-float32

57.9M • Updated Sep 18, 2024 • 19

Llama 3.2

Meta goes small with Llama3.2, both text only 1B and 3B, and the 11B Vision models.

mlx-community/Llama-3.2-11B-Vision-Instruct-abliterated

Image-Text-to-Text • 11B • Updated Dec 16, 2024 • 667 • 7
mlx-community/Llama-3.2-11B-Vision-Instruct-abliterated-8-bit

Image-Text-to-Text • 3B • Updated Dec 16, 2024 • 74
mlx-community/Llama-3.2-11B-Vision-Instruct-abliterated-4-bit

Image-Text-to-Text • 2B • Updated Dec 16, 2024 • 91 • 1
mlx-community/Llama-3.2-11B-Vision-Instruct-8bit

Image-to-Text • 3B • Updated Oct 18, 2024 • 581 • 10

AI & ML interests

Recent Activity

Team members 3,288

mlx-community 's collections 98