Free
Runs in your browser
LLM VRAM Calculator
How much GPU memory do you need to run or fine-tune a model? Pick the size, precision, and what you're doing — get a VRAM estimate and a GPU that fits. Everything runs in your browser.
Estimated VRAM needed
—
GB
Estimate only — actual usage varies with batch size, sequence length, framework, and kernels.
Related guides
- LoRA & QLoRA fine-tuning guide — how the modes above actually differ
- LLM inference engines compared — once it fits, serve it efficiently
- Self-hosting LLMs vs cloud APIs — is buying the GPU worth it?