Build: NVIDIA GB200 NVL72 + — + 32 GB RAM (windows)
Full-VRAM resident, with room for context. No compromises.
ollama run gemma3:1bollama run llama3.2:1bollama run gemma4:e2bollama run llama3.2:3bollama run phi3.5:3.8bollama run gemma4:e4bollama run qwen3:4bollama run gemma3:4bollama run mistral:7bollama run codegemma:7bNeed more memory than you have. Shown for orientation.
Even with CPU offload, needs more memory than your VRAM (13824 GB) + 60% of system RAM (19 GB) combined.
Even with CPU offload, needs more memory than your VRAM (13824 GB) + 60% of system RAM (19 GB) combined.
Even with CPU offload, needs more memory than your VRAM (13824 GB) + 60% of system RAM (19 GB) combined.
Want a specific benchmark we don't have? Email support@runlocalai.co and we'll prioritize it.