Available Memory
64GB
Unified Memory
Memory Bandwidth
273GB/s
Basic1000 GB/s max
Why Bandwidth Matters

LLM inference is memory-bound. Higher bandwidth directly translates to faster token generation, making it more important than raw compute power.

Top Picks for Your Hardware

All Compatible Models

Llama 3.2 1B

Llama1B
FP16very high
2.3 GB64 GB
Fast81.9 t/s
General
ollama run llama3.2:1b

Granite 3 MoE 1B

Granite1B
FP16very high
2.3 GB64 GB
Fast81.9 t/s
GeneralCoding
ollama run granite3-moe:1b

Qwen 2.5 0.5B

Qwen0.5B
FP16very high
1.2 GB64 GB
Fast164 t/s
General
ollama run qwen2.5:0.5b

SmolLM2 360M

SmolLM0.36B
FP16very high
0.8 GB64 GB
Fast228 t/s
General
ollama run smollm2:360m

Gemma 2 2B

Gemma2B
FP16very high
5.0 GB64 GB
Fast40.9 t/s
General
ollama run gemma2:2b

EXAONE 3.5 2.4B

EXAONE2.4B
FP16very high
5.0 GB64 GB
Fast34.1 t/s
GeneralCoding
ollama run exaone3.5:2.4b

SmolLM2 135M

SmolLM0.135B
FP16very high
0.3 GB64 GB
Fast607 t/s
General
ollama run smollm2:135m

Granite 3 Dense 2B

Granite2B
FP16very high
4.2 GB64 GB
Fast40.9 t/s
GeneralCoding
ollama run granite3-dense:2b

SmolLM2 1.7B

SmolLM1.7B
FP16very high
3.5 GB64 GB
Fast48.2 t/s
GeneralCoding
ollama run smollm2:1.7b

Qwen 2.5 3B

Qwen3B
FP16very high
7.0 GB64 GB
Good27.3 t/s
GeneralCoding
ollama run qwen2.5:3b

Llama 3.2 3B

Llama3B
FP16very high
6.5 GB64 GB
Good27.3 t/s
GeneralCoding
ollama run llama3.2:3b

StarCoder2 3B

StarCoder3B
FP16very high
6.5 GB64 GB
Good27.3 t/s
Coding
ollama run starcoder2:3b

Kimi K1.5 A3B

Kimi3B
FP16very high
6.5 GB64 GB
Good27.3 t/s
GeneralReasoningMath
ollama run kimi-k1.5:a3b

Granite 3 MoE 3B

Granite3B
FP16very high
6.5 GB64 GB
Good27.3 t/s
GeneralCoding
ollama run granite3-moe:3b

Phi-3 Mini (3.8B)

Phi3.8B
FP16very high
7.8 GB64 GB
Good21.6 t/s
GeneralCodingReasoning
ollama run phi3:mini

GLM Edge 4B

GLM4B
FP16very high
8.2 GB64 GB
Good20.5 t/s
GeneralCoding
ollama run glm-edge:4b

Codestral 22B

Mistral22B
FP16very high
44.0 GB64 GB
Very Slow3.7 t/s
Coding
ollama run codestral:22b

Qwen 2.5 14B

Qwen14B
FP16very high
29.0 GB64 GB
Slow5.9 t/s
GeneralCodingReasoningMath
ollama run qwen2.5:14b

InternLM 2.5 20B

InternLM20B
FP16very high
40.0 GB64 GB
Very Slow4.1 t/s
GeneralCodingReasoningMath
ollama run internlm2:20b

Sailor2 20B

Sailor20B
FP16very high
40.0 GB64 GB
Very Slow4.1 t/s
GeneralCodingReasoning
ollama run sailor2:20b

Code Llama 13B

Llama13B
FP16very high
26.0 GB64 GB
Slow6.3 t/s
Coding
ollama run codellama:13b

Orca 2 13B

Orca13B
FP16very high
26.0 GB64 GB
Slow6.3 t/s
GeneralReasoning
ollama run orca2:13b