RUNLOCALAI · v38
Build / compile failures
Verified by owner

llama.cpp build fails: nvcc not found / CUDA toolkit missing

make: nvcc: No such file or directory
By Fredoline Eruo · Last verified May 8, 2026

Cause

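Building llama.cpp with CUDA enabled (GGML_CUDA=1, or -DGGML_CUDA=ON with CMake) requires the full CUDA toolkit (the nvcc compiler, CUDA headers, and cuBLAS), not just the NVIDIA driver. A working nvidia-smi proves only that the driver is installed; the "CUDA Version" it prints is the highest version the driver supports, not an installed toolkit.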

On Ubuntu, apt install nvidia-driver-XXX does NOT install the toolkit. On Windows, the GeForce Experience driver bundle does NOT include nvcc.
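
To see which half of the stack is missing, a quick check along these lines can help. This is a minimal sketch: check_cuda_stack is a hypothetical helper name, and it only tests whether each command is on PATH, not whether the versions are compatible.

```shell
#!/bin/sh
# check_cuda_stack is a hypothetical helper, not part of llama.cpp or CUDA.
# It reports driver and toolkit presence separately, since they are
# installed separately.
check_cuda_stack() {
  driver="missing"
  toolkit="missing"
  # nvidia-smi ships with the NVIDIA driver
  if command -v nvidia-smi >/dev/null 2>&1; then driver="present"; fi
  # nvcc ships only with the CUDA toolkit
  if command -v nvcc >/dev/null 2>&1; then toolkit="present"; fi
  echo "driver=$driver toolkit=$toolkit"
}

check_cuda_stack
```

A result of driver=present toolkit=missing is the situation this page describes.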

Solution

Linux (Ubuntu/Debian):

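# Install the CUDA toolkit from the distro repo (you get whatever version your release ships):
sudo apt install nvidia-cuda-toolkit
# Or use NVIDIA's repo to pick a specific version (12.4 shown as an example).
# nvidia-smi reports the highest CUDA version your driver supports; choose a toolkit at or below it.
# Replace ubuntu2404 with your release and 12-4 with a version that repo carries: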
wget https://developer.download.nvidia.com/compute/cuda/repos/ubuntu2404/x86_64/cuda-keyring_1.1-1_all.deb
sudo dpkg -i cuda-keyring_1.1-1_all.deb
sudo apt update && sudo apt install cuda-toolkit-12-4
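
Before choosing a version, it may help to read the ceiling your driver reports. The sketch below assumes nvidia-smi prints a header containing "CUDA Version: X.Y"; driver_cuda_version and version_le are hypothetical helper names, and the value of want is only an example. Treat "toolkit no newer than the driver's reported version" as a rule of thumb rather than a hard requirement.

```shell
#!/bin/sh
# Hypothetical helpers for illustration only.

# Read text on stdin and print the first "X.Y" that follows "CUDA Version:".
driver_cuda_version() {
  sed -n 's/.*CUDA Version: *\([0-9][0-9]*\.[0-9][0-9]*\).*/\1/p' | head -n 1
}

# Succeed when version $1 <= version $2 (major.minor only).
version_le() {
  a_major=${1%%.*}; a_minor=${1#*.}
  b_major=${2%%.*}; b_minor=${2#*.}
  [ "$a_major" -lt "$b_major" ] || { [ "$a_major" -eq "$b_major" ] && [ "$a_minor" -le "$b_minor" ]; }
}

want="12.4"   # example: the toolkit version you plan to install
if command -v nvidia-smi >/dev/null 2>&1; then
  max="$(nvidia-smi 2>/dev/null | driver_cuda_version)"
  if [ -z "$max" ]; then
    echo "could not read a CUDA version from nvidia-smi"
  elif version_le "$want" "$max"; then
    echo "toolkit $want is within the driver's reported ceiling ($max)"
  else
    echo "driver reports up to $max; pick an older toolkit or update the driver"
  fi
else
  echo "nvidia-smi not found; install the NVIDIA driver first"
fi
```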

Then ensure nvcc is on PATH (add these lines to ~/.bashrc to make them persist across shells):

export PATH=/usr/local/cuda/bin:$PATH
export LD_LIBRARY_PATH=/usr/local/cuda/lib64${LD_LIBRARY_PATH:+:$LD_LIBRARY_PATH}
nvcc --version  # should print version info
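
If you re-source your shell profile often, an idempotent version of the PATH step avoids piling up duplicate entries. This is a sketch under two assumptions: prepend_path is a hypothetical helper name, and the toolkit lives under /usr/local/cuda (the usual location for NVIDIA's packages; the distro's nvidia-cuda-toolkit package typically puts nvcc in /usr/bin, in which case no PATH change is needed).

```shell
#!/bin/sh
# prepend_path is a hypothetical helper: add a directory to the front of
# PATH only if it is not already listed.
prepend_path() {
  case ":$PATH:" in
    *":$1:"*) ;;               # already present, leave PATH unchanged
    *) PATH="$1:$PATH" ;;
  esac
  export PATH
}

# Assumption: default install prefix; override by setting CUDA_HOME.
CUDA_HOME="${CUDA_HOME:-/usr/local/cuda}"
if [ -x "$CUDA_HOME/bin/nvcc" ]; then
  prepend_path "$CUDA_HOME/bin"
  echo "nvcc found at $CUDA_HOME/bin/nvcc"
else
  echo "nvcc not found under $CUDA_HOME; look for a cuda-* directory in /usr/local"
fi
```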

Windows: Install the CUDA Toolkit from developer.nvidia.com/cuda-downloads (a separate download from the driver). Use Visual Studio's "x64 Native Tools Command Prompt" so cl.exe is on PATH too. Then build (the same two commands work on Linux once nvcc is on PATH):

cmake -B build -DGGML_CUDA=ON
cmake --build build --config Release -j

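Fallback if the toolkit won't install (older distro, locked-down system): build with Vulkan instead. It is cross-vendor and needs no CUDA toolkit, though you do need the Vulkan SDK (headers plus the glslc shader compiler). Recent llama.cpp versions build with CMake only; the Makefile build has been retired:

cmake -B build -DGGML_VULKAN=ON
cmake --build build --config Release -j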
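
The Vulkan route skips the CUDA toolkit but still depends on a few build tools. The sketch below checks for the ones we would expect a Vulkan build to use: missing_tools is a hypothetical helper name, glslc is the shader compiler from the Vulkan SDK, and vulkaninfo is only a runtime sanity check, not a build requirement. Package names for installing them vary by distro.

```shell
#!/bin/sh
# missing_tools is a hypothetical helper: print "ok" if every named command
# is on PATH, otherwise list the ones that are not.
missing_tools() {
  missing=""
  for tool in "$@"; do
    command -v "$tool" >/dev/null 2>&1 || missing="$missing $tool"
  done
  if [ -z "$missing" ]; then
    echo "ok"
  else
    echo "missing:$missing"
  fi
}

# Assumed prerequisites for a Vulkan build of llama.cpp.
missing_tools cmake glslc vulkaninfo
```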

Related errors

  • llama.cpp build fails: nvcc not found
  • llama.cpp CUDA build: unsupported GNU version! gcc versions later than X are not supported
  • exllamav2 ImportError: cannot import name 'ExLlamaV2' / undefined symbol
  • flash-attn install fails on Windows / no precompiled wheel
