FitLLM

Can I run this LLM? — fit by model × hardware


Computed with the open FitLLM engine — accurate per-layer KV-cache modeling, not a naive estimate. Updated 2026-06.
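To illustrate how a per-layer estimate can differ from a naive one, here is a minimal sketch. It is not the FitLLM engine's code: `LayerSpec`, `kv_cache_bytes`, `naive_kv_cache_bytes` and the example model are hypothetical names and values chosen for illustration. It assumes a layer with a sliding attention window caches at most `sliding_window` tokens, while a full-attention layer caches the whole context.

```python
from dataclasses import dataclass
from typing import Optional, Sequence


@dataclass(frozen=True)
class LayerSpec:
    """Hypothetical per-layer attention description (not a FitLLM type)."""
    num_kv_heads: int
    head_dim: int
    sliding_window: Optional[int] = None  # None means full attention


def kv_cache_bytes(layers: Sequence[LayerSpec], context_len: int,
                   bytes_per_elem: int = 2) -> int:
    """Sum the KV cache layer by layer, capping sliding-window layers."""
    total = 0
    for layer in layers:
        if layer.sliding_window is None:
            cached_tokens = context_len
        else:
            cached_tokens = min(context_len, layer.sliding_window)
        # Factor 2 covers the separate key and value tensors.
        total += (2 * layer.num_kv_heads * layer.head_dim
                  * cached_tokens * bytes_per_elem)
    return total


def naive_kv_cache_bytes(layers: Sequence[LayerSpec], context_len: int,
                         bytes_per_elem: int = 2) -> int:
    """Naive estimate: treat every layer as full attention."""
    return sum(2 * layer.num_kv_heads * layer.head_dim
               * context_len * bytes_per_elem for layer in layers)


# Made-up 4-layer model: alternating sliding-window and full-attention layers.
example_layers = [
    LayerSpec(num_kv_heads=8, head_dim=128, sliding_window=1024),
    LayerSpec(num_kv_heads=8, head_dim=128),
    LayerSpec(num_kv_heads=8, head_dim=128, sliding_window=1024),
    LayerSpec(num_kv_heads=8, head_dim=128),
]

per_layer = kv_cache_bytes(example_layers, context_len=8192)
naive = naive_kv_cache_bytes(example_layers, context_len=8192)
print(per_layer / 2**20, "MiB per-layer vs", naive / 2**20, "MiB naive")
```

With these made-up values at an 8192-token context and 16-bit cache entries, the per-layer sum comes to 72 MiB against 128 MiB for the naive estimate. The gap depends on how many layers use a window shorter than the context.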



All numbers are computed by the open-source fitllm-engine (MIT) from official model config.json values, so you can reproduce or audit them yourself. They are estimates; real usage varies with the runtime (llama.cpp / MLX / Ollama), the driver, and the display.