To run Gemma 3 12B locally at Q4_K_M quantization, you need at least 12.4 GB of GPU VRAM.
Best used-market value for 24 GB VRAM. Solid for 30B-class models.
Fastest 24 GB consumer GPU. Excellent for daily local inference.
Pro workstation card with ECC memory. Maximum headroom at 24 GB.
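As a rough illustration of what "headroom" means here, the sketch below subtracts the 12.4 GB requirement quoted above from a card's nominal VRAM. The function names and the list of capacities are hypothetical, chosen only for this example. It also assumes the whole nominal capacity is available to the model, which is optimistic because the driver and desktop usually reserve some memory.

```python
# Figure quoted above for Gemma 3 12B at Q4_K_M quantization.
REQUIRED_VRAM_GB = 12.4


def headroom_gb(card_vram_gb: float, required_gb: float = REQUIRED_VRAM_GB) -> float:
    """VRAM left after loading the model; a negative value means it does not fit."""
    return round(card_vram_gb - required_gb, 1)


def fits(card_vram_gb: float, required_gb: float = REQUIRED_VRAM_GB) -> bool:
    """True if the card's nominal VRAM meets the stated minimum."""
    return card_vram_gb >= required_gb


# Nominal capacities only (assumption: all of it is usable by the model).
for vram in (8, 12, 16, 24):
    status = "fits" if fits(vram) else "does not fit"
    print(f"{vram} GB card: {status}, headroom {headroom_gb(vram)} GB")
```

Under these assumptions a 24 GB card leaves about 11.6 GB free for longer context or other workloads, while a 12 GB card falls just short of the stated minimum.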
Use the interactive calculator to compare Gemma 3 12B across all available formats.