nvidia
GPU
48GB VRAM
workstation
NVIDIA L40S
Ada-gen datacenter card. 48GB GDDR6 — popular at cloud GPU rentals as a budget H100 alternative.
Released 2023
Overview
Ada-gen datacenter card. 48GB GDDR6 — popular at cloud GPU rentals as a budget H100 alternative.
Specs
| VRAM | 48 GB |
| Power draw | 350 W |
| Released | 2023 |
| MSRP | $8500 |
| Backends | CUDA |
Models that fit
Open-weight models small enough to run on NVIDIA L40S with usable context.
Frequently asked
What models can NVIDIA L40S run?
With 48GB VRAM, the NVIDIA L40S runs 70B models in 4-bit quantization, plus everything smaller. See the model list below for tested combinations.
Does NVIDIA L40S support CUDA?
Yes — NVIDIA L40S is an NVIDIA card with full CUDA support, the most mature local-AI backend. llama.cpp, Ollama, vLLM, and ExLlamaV2 all run natively.
Reviewed by RunLocalAI Editorial. See our editorial policy for how we research and verify hardware specifications.