NVIDIA GeForce RTX 4080
Original 4080. 16GB GDDR6X. Still capable for 14B Q4 work; 32B at Q4 typically exceeds 16GB and needs partial CPU offload.
Released 2022 · ~$1099 street
Overview
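The original RTX 4080 pairs 16 GB of GDDR6X memory with a 320 W power draw. It remains a capable card for 14B models at 4-bit quantization. 32B models at Q4 generally do not fit in 16 GB and need partial offload to system RAM, which slows generation.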
Specs
| Spec | Value |
| --- | --- |
| VRAM | 16 GB |
| Power draw | 320 W |
| Released | 2022 |
| MSRP | $1199 |
| Backends | CUDA, Vulkan |
Models that fit
Open-weight models small enough to run on the NVIDIA GeForce RTX 4080 with usable context.
Frequently asked
What models can NVIDIA GeForce RTX 4080 run?
With 16GB VRAM, the NVIDIA GeForce RTX 4080 runs models up to about 14B at 4-bit quantization, or 7B models at higher-precision quantizations such as 8-bit. See the "Models that fit" section for tested combinations.
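The sizing rule behind this answer can be sketched as rough arithmetic. The snippet below is an illustrative estimate only, not a measurement: the function names are hypothetical, the 4.5 bits per weight figure is an assumed effective size for a typical 4-bit quantization, and the 2 GiB overhead for KV cache and runtime buffers is an assumption that grows with context length.

```python
def estimate_weight_gib(params_billion: float, bits_per_weight: float) -> float:
    """Approximate memory needed for the model weights alone, in GiB."""
    total_bytes = params_billion * 1e9 * bits_per_weight / 8
    return total_bytes / 2**30


def fits(params_billion: float, bits_per_weight: float,
         vram_gib: float = 16.0, overhead_gib: float = 2.0) -> bool:
    """True if weights plus an assumed fixed overhead fit in VRAM.

    overhead_gib is a placeholder for KV cache and runtime buffers;
    real usage depends on context length and backend.
    """
    return estimate_weight_gib(params_billion, bits_per_weight) + overhead_gib <= vram_gib


if __name__ == "__main__":
    # (parameters in billions, assumed effective bits per weight)
    for params, bits in [(7, 8.0), (14, 4.5), (32, 4.5)]:
        weights = estimate_weight_gib(params, bits)
        print(f"{params}B @ {bits} bits: ~{weights:.1f} GiB weights, fits={fits(params, bits)}")
```

Under these assumptions a 14B model at 4-bit leaves several GiB free for context, while a 32B model at 4-bit exceeds 16 GiB on weights alone.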
Does NVIDIA GeForce RTX 4080 support CUDA?
Yes. The NVIDIA GeForce RTX 4080 has full CUDA support, the most mature local-AI backend. llama.cpp, Ollama, vLLM, and ExLlamaV2 all run on it natively.
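Before installing any of these backends, it can help to confirm that the NVIDIA driver actually sees the card. The sketch below is one hedged way to do that: it shells out to `nvidia-smi`, which ships with the NVIDIA driver, and parses its CSV output. The helper names `parse_gpu_lines` and `list_gpus` are hypothetical, and the check only shows driver visibility, which is a prerequisite for CUDA rather than proof that a given backend is configured correctly.

```python
import shutil
import subprocess

# Ask the driver for each GPU's name and total memory (MiB), as plain CSV.
QUERY = [
    "nvidia-smi",
    "--query-gpu=name,memory.total",
    "--format=csv,noheader,nounits",
]


def parse_gpu_lines(text: str) -> list[tuple[str, int]]:
    """Parse lines like 'NVIDIA GeForce RTX 4080, 16376' into (name, MiB) pairs."""
    gpus = []
    for line in text.strip().splitlines():
        name, mem = line.rsplit(",", 1)
        gpus.append((name.strip(), int(mem.strip())))
    return gpus


def list_gpus() -> list[tuple[str, int]]:
    """Return visible NVIDIA GPUs, or an empty list if none can be queried."""
    if shutil.which("nvidia-smi") is None:
        return []
    result = subprocess.run(QUERY, capture_output=True, text=True, check=False)
    if result.returncode != 0:
        return []
    return parse_gpu_lines(result.stdout)
```

Calling `list_gpus()` on a machine with a working driver returns one entry per card. An empty list usually points to a missing or broken driver install rather than a backend problem.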
How much does NVIDIA GeForce RTX 4080 cost?
Current street price for NVIDIA GeForce RTX 4080 is around $1099 (MSRP $1199). Prices vary by region and supply.
Reviewed by RunLocalAI Editorial. See our editorial policy for how we research and verify hardware specifications.