NVIDIA GeForce RTX 5090
Blackwell flagship. 32GB GDDR7 on a 512-bit bus delivers ~1.79 TB/s memory bandwidth — the new top of consumer hardware for local LLM inference. Comfortably loads 70B Q4 with room for context.
Overview
Blackwell flagship. 32GB GDDR7 on a 512-bit bus delivers ~1.79 TB/s memory bandwidth — the new top of consumer hardware for local LLM inference. Comfortably loads 70B Q4 with room for context.
Some links above are affiliate links. We may earn a commission at no extra cost to you. How we make money.
Specs
| VRAM | 32 GB |
| Power draw | 575 W |
| Released | 2025 |
| MSRP | $1999 |
| Backends | CUDA Vulkan |
Models that fit
Open-weight models small enough to run on NVIDIA GeForce RTX 5090 with usable context.
Hardware worth comparing
Same VRAM tier and the one step above and below — so you can frame the buying decision against real options.
Frequently asked
What models can NVIDIA GeForce RTX 5090 run?
Does NVIDIA GeForce RTX 5090 support CUDA?
How much does NVIDIA GeForce RTX 5090 cost?
Reviewed by RunLocalAI Editorial. See our editorial policy for how we research and verify hardware specifications.