NVIDIA GeForce GTX 1660 Super for local AI

This card is for the operator who needs a cheap inference runner for small models and already has a CUDA stack in place. The 6 GB VRAM ceiling and lack of Tensor cores mean it's strictly a budget workhorse, not a platform for serious local AI.

On 7B Q4 models, the 336 GB/s bandwidth delivers roughly 30-45 tok/s — usable for chat and code completion, but not for real-time streaming. 13B Q4 models (9 GB) won't fit at all, and even 8B Q4 models (5.5 GB) leave almost no room for context.

What breaks: anything above 6 GB VRAM. No Tensor cores means no FP8 or INT4 acceleration, so the card relies entirely on CUDA cores for compute. Software stack is limited to CUDA-only runtimes; no ROCm or Vulkan support out of the box.

Pass on this card if the workload includes 13B+ models, long-context inference, or any training/fine-tuning. The 6 GB ceiling is a hard wall, and the lack of Tensor cores makes it obsolete for modern quantization formats.

At $150 used, it's a fair price for a disposable inference node, but a used RTX 3060 12 GB for $180 is a far better long-term investment.

Frequently asked

What models can NVIDIA GeForce GTX 1660 Super run?

With 6GB VRAM, the NVIDIA GeForce GTX 1660 Super runs 7B models comfortably in Q4 quantization. See the model list below for tested combinations.

Does NVIDIA GeForce GTX 1660 Super support CUDA?

Yes — NVIDIA GeForce GTX 1660 Super is an NVIDIA card with full CUDA support, the most mature local-AI backend. llama.cpp, Ollama, vLLM, and ExLlamaV2 all run natively.

How much does NVIDIA GeForce GTX 1660 Super cost?

Current street price for NVIDIA GeForce GTX 1660 Super is around $150 (MSRP $229). Prices vary by region and supply.

VRAM	6 GB
Power draw	125 W
Released	2019
MSRP	$229
Backends	CUDA Vulkan

NVIDIA GeForce GTX 1660 Super

Our verdict

Overview

Specs

Models that fit

Frequently asked

What models can NVIDIA GeForce GTX 1660 Super run?

Does NVIDIA GeForce GTX 1660 Super support CUDA?

How much does NVIDIA GeForce GTX 1660 Super cost?

Where next?

Hardware worth comparing