RUNLOCALAIv38
→WILL IT RUNBEST GPUCOMPARETROUBLESHOOTSTARTPULSEMODELSHARDWARETOOLSBENCH
RUNLOCALAI

Operator-grade instrument for local-AI hardware intelligence. Hand-written verdicts. Real benchmarks. Reproducible commands.

OP·Fredoline Eruo
DIR
  • Models
  • Hardware
  • Tools
  • Benchmarks
  • Will it run?
GUIDES
  • Best GPU
  • Best laptop
  • Best Mac
  • Best used GPU
  • Best budget GPU
  • Best GPU for Ollama
  • Best GPU for SD
  • AI PC build $2K
  • CUDA vs ROCm
  • 16 vs 24 GB
  • Compare hardware
  • Custom compare
REF
  • Systems
  • Ecosystem maps
  • Pillar guides
  • Methodology
  • Glossary
  • Errors KB
  • Troubleshooting
  • Resources
  • Public API
EDITOR
  • About
  • About the author
  • Changelog
  • Latest
  • Updates
  • Submit benchmark
  • Send feedback
  • Trust
  • Editorial policy
  • How we make money
  • Contact
LEGAL
  • Privacy
  • Terms
  • Sitemap
MAIL · MONTHLY DIGEST
Get monthly local AI changes
Monthly recap. No spam.
DISCLOSURE

Some links on this site are affiliate links (Amazon Associates and other first-class retailers). When you buy through them, we earn a small commission at no extra cost to you. Affiliate links do not influence our verdicts — there are cards we rate highly that we don't have affiliate relationships with, and cards that sell well that we refuse to recommend. Read more →

SYS · ONLINEUPTIME · 100%2026 · operator-owned
RUNLOCALAI · v38
Glossary / Hardware & infrastructure / Unified Memory
Hardware & infrastructure

Unified Memory

Unified memory is a memory architecture where CPU and GPU share the same physical RAM pool, eliminating CPU↔GPU copies. Apple Silicon and AMD Strix Halo (Ryzen AI Max) use this; modern NVIDIA Grace-Hopper and GB10 systems also expose unified memory across chips.

For local AI, unified memory is the reason a 128 GB M3 Ultra Mac can load and run a 120 GB model without dedicated VRAM — something no consumer NVIDIA card can do.

The tradeoff is bandwidth: unified DDR/LPDDR (200–600 GB/s) sits between consumer GDDR6X and HBM. A model that fits on both an M-series Mac and an RTX 4090 will usually run faster on the 4090 due to bandwidth, even though the Mac has more capacity headroom.

Related terms

VRAM (Video RAM)Metal (Apple)HBM (High Bandwidth Memory)

See also

hardware: apple-m4-maxhardware: apple-m3-ultrahardware: nvidia-dgx-spark

Reviewed by Fredoline Eruo. See our editorial policy.

Buyer guides
  • Best Mac for local AI →
  • Best budget Mac →
When it doesn't work
  • MLX out of memory →
  • MPS fallback to CPU →
  • llama.cpp Metal crash →
Compare hardware
  • M4 Max vs RTX 4090 →
  • Mac Studio vs Windows AI PC →
Hardware
  • Apple M4 Max →