GPU Memory / Model Fit

Check whether a given model size fits in GPU memory. VRAM requirements depend on parameter count, precision (FP16, INT8, INT4), and batch size.

Inputs

Parameters: model size in billions (e.g. 7, 70, 405)
Precision: FP16, INT8, or INT4; lower precision = less memory, slight quality trade-off
Available VRAM: total GPU memory in GB
Inference overhead: the KV cache and context need extra memory beyond the weights; add ~20%
Batch size: higher batch = more throughput, more memory
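
As a minimal sketch of the weight-memory rule of thumb behind these inputs (the table of bytes per parameter follows the standard sizes for each precision; the function name is illustrative):

```python
# Approximate bytes per weight for each precision; INT4 packs two weights per byte.
BYTES_PER_PARAM = {"FP16": 2.0, "INT8": 1.0, "INT4": 0.5}

def model_memory_gb(params_billion: float, precision: str = "FP16") -> float:
    """Weight memory in GB (1 GB = 1e9 bytes): parameters x bytes per parameter."""
    return params_billion * BYTES_PER_PARAM[precision]

# Example: a 70B model in FP16 needs ~140 GB for its weights alone.
print(model_memory_gb(70, "FP16"))  # 140.0
```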

Calculation

Metric                 Value
Model Memory (GB)      -
Total Required (GB)    -
Fits in GPU?           -
Headroom (GB)          -
Max batch (est.)       -
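
A minimal end-to-end sketch of how these metrics could be derived, assuming a flat ~20% inference overhead for the KV cache and context. The fit_report function and its per-item batch heuristic are hypothetical, not the calculator's exact formulas:

```python
BYTES_PER_PARAM = {"FP16": 2.0, "INT8": 1.0, "INT4": 0.5}

def fit_report(params_billion: float, precision: str, vram_gb: float,
               overhead: float = 0.20) -> dict:
    """Fill in the metrics above, modeling KV cache/context as ~20% overhead."""
    model_gb = params_billion * BYTES_PER_PARAM[precision]  # Model Memory (GB)
    total_gb = model_gb * (1 + overhead)                    # Total Required (GB)
    fits = total_gb <= vram_gb                              # Fits in GPU?
    headroom_gb = vram_gb - total_gb                        # Headroom (GB)
    # Illustrative batch heuristic (an assumption, not the tool's formula):
    # each extra batch item costs roughly one more overhead slice of memory.
    per_item_gb = overhead * model_gb
    max_batch = 1 + int(headroom_gb / per_item_gb) if fits else 0
    return {
        "Model Memory (GB)": round(model_gb, 1),
        "Total Required (GB)": round(total_gb, 1),
        "Fits in GPU?": fits,
        "Headroom (GB)": round(headroom_gb, 1),
        "Max batch (est.)": max_batch,
    }

# Example: a 7B model in INT4 on a 24 GB GPU fits with room for batching.
print(fit_report(7, "INT4", 24))
```

With 7B parameters at INT4, the weights take ~3.5 GB and the total with overhead is ~4.2 GB, leaving ~19.8 GB of headroom on a 24 GB card.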
