VRAM

Infrastructure · Intermediate

Definition

Video RAM (VRAM) is the high-bandwidth memory on a GPU that stores model weights, activations, and intermediate computations during training and inference. VRAM is often the primary bottleneck in LLM deployment: a 70B-parameter model needs roughly 140 GB of VRAM in FP16 for its weights alone (2 bytes per parameter), which exceeds the memory of a single GPU and forces the model to be split across multiple high-end GPUs.
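
The 140 GB figure follows from simple arithmetic: parameter count times bytes per parameter. The sketch below reproduces that estimate in Python. It is a rough lower bound, not a sizing tool: it counts weights only and ignores activations, KV cache, and framework overhead. The function names and the 80 GB per-GPU capacity are assumptions chosen for illustration; they do not come from any specific library or vendor spec sheet.

```python
import math

# Bytes needed to store one parameter at each precision.
BYTES_PER_PARAM = {"fp32": 4, "fp16": 2, "int8": 1}


def weight_vram_gb(num_params: float, dtype: str = "fp16") -> float:
    """Estimate VRAM (in GB, 1 GB = 1e9 bytes) needed for model weights only."""
    return num_params * BYTES_PER_PARAM[dtype] / 1e9


def min_gpus_for_weights(num_params: float, gpu_vram_gb: float,
                         dtype: str = "fp16") -> int:
    """Smallest number of GPUs whose combined VRAM can hold the weights.

    Assumes weights split evenly across GPUs and ignores all other memory use,
    so real deployments usually need more headroom than this suggests.
    """
    return math.ceil(weight_vram_gb(num_params, dtype) / gpu_vram_gb)


if __name__ == "__main__":
    params = 70e9          # 70B-parameter model
    gpu_capacity_gb = 80   # hypothetical per-GPU VRAM, for illustration
    print(f"Weights in FP16: {weight_vram_gb(params):.0f} GB")
    print(f"Minimum GPUs at {gpu_capacity_gb} GB each: "
          f"{min_gpus_for_weights(params, gpu_capacity_gb)}")
```

Under these assumptions, a 70B model in FP16 comes to 140 GB of weights and needs at least two 80 GB GPUs before any memory is budgeted for activations or serving overhead.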

Enterprise Context

VRAM determines which models can be served on which hardware. Enterprise AI procurement, model selection, and deployment architecture all depend on VRAM constraints and cost.

Tags

#hardware #memory #GPU
Maxx Stacks Editorial
Reviewed by enterprise AI practitioners