LLM Hardware Checker: What Local AI Models Can You Run?

Available Memory

64GB

Unified Memory

Memory Bandwidth

273GB/s

Basic1000 GB/s max

Why Bandwidth Matters

LLM inference is memory bound. Higher bandwidth directly translates to faster token generation, making it more important than raw compute power.

Top Picks for Your Hardware

Qwen 2.5 1.5B

Qwen1.5B

Top Pick

FP16•very high

3.5 GB64 GB

Fast•54.6 t/s

GeneralCoding

ollama run qwen2.5:1.5b

StableLM 2 1.6B

StableLM1.6B

Top Pick

FP16•very high

3.4 GB64 GB

Fast•51.2 t/s

GeneralCoding

ollama run stablelm2:1.6b

GLM Edge 1.5B

GLM1.5B

Top Pick

FP16•very high

3.2 GB64 GB

Fast•54.6 t/s

General

ollama run glm-edge:1.5b

All Compatible Models

Gemma 3 1B

Gemma1B

FP16•very high

2.5 GB64 GB

Fast•81.9 t/s

General

ollama run gemma3:1b

Llama 3.2 1B

Llama1B

FP16•very high

2.3 GB64 GB

Fast•81.9 t/s

General

ollama run llama3.2:1b

Granite 3 MoE 1B

Granite1B

FP16•very high

2.3 GB64 GB

Fast•81.9 t/s

GeneralCoding

ollama run granite3-moe:1b

Qwen 3 0.6B

Qwen0.6B

FP16•very high

1.4 GB64 GB

Fast•137 t/s

General

ollama run qwen3:0.6b

Qwen 2.5 0.5B

Qwen0.5B

FP16•very high

1.2 GB64 GB

Fast•164 t/s

General

ollama run qwen2.5:0.5b

SmolLM2 360M

SmolLM0.36B

FP16•very high

0.8 GB64 GB

Fast•228 t/s

General

ollama run smollm2:360m

Gemma 2 2B

Gemma2B

FP16•very high

5.0 GB64 GB

Fast•40.9 t/s

General

ollama run gemma2:2b

EXAONE 3.5 2.4B

EXAONE2.4B

FP16•very high

5.0 GB64 GB

Fast•34.1 t/s

GeneralCoding

ollama run exaone3.5:2.4b

SmolLM2 135M

SmolLM0.135B

FP16•very high

0.3 GB64 GB

Fast•607 t/s

General

ollama run smollm2:135m

Granite 3 Dense 2B

Granite2B

FP16•very high

4.2 GB64 GB

Fast•40.9 t/s

GeneralCoding

ollama run granite3-dense:2b

Gemma 4 E2B

Gemma2B

FP16•very high

4.0 GB64 GB

Fast•40.9 t/s

General

ollama run gemma4:e2b

Qwen 3 1.7B

Qwen1.7B

FP16•very high

3.6 GB64 GB

Fast•48.2 t/s

GeneralCoding

ollama run qwen3:1.7b

SmolLM2 1.7B

SmolLM1.7B

FP16•very high

3.5 GB64 GB

Fast•48.2 t/s

GeneralCoding

ollama run smollm2:1.7b

Qwen 2.5 3B

Qwen3B

FP16•very high

7.0 GB64 GB

Good•27.3 t/s

GeneralCoding

ollama run qwen2.5:3b

Mistral 3 3B

Mistral3B

FP16•very high

7.0 GB64 GB

Good•27.3 t/s

GeneralCoding

ollama run mistral-3:3b

Llama 3.2 3B

Llama3B

FP16•very high

6.5 GB64 GB

Good•27.3 t/s

GeneralCoding

ollama run llama3.2:3b

StarCoder2 3B

StarCoder3B

FP16•very high

6.5 GB64 GB

Good•27.3 t/s

Coding

ollama run starcoder2:3b

Kimi K1.5 A3B

Kimi3B

FP16•very high

6.5 GB64 GB

Good•27.3 t/s

GeneralReasoningMath

ollama run kimi-k1.5:a3b

Granite 3 MoE 3B

Granite3B

FP16•very high

6.5 GB64 GB

Good•27.3 t/s

GeneralCoding

ollama run granite3-moe:3b

Granite 4.1 3B

Granite3B

FP16•very high

6.5 GB64 GB

Good•27.3 t/s

GeneralCoding

ollama run granite4.1:3b

Phi-3 Mini (3.8B)

Phi3.8B

FP16•very high

7.8 GB64 GB

Good•21.6 t/s

GeneralCodingReasoning

ollama run phi3:mini

Qwen 3 4B

Qwen4B

FP16•very high

8.5 GB64 GB

Good•20.5 t/s

GeneralCodingReasoning

ollama run qwen3:4b