UNIT · INTEL · GPU

Entry · Reviewed May 2026

Intel Arc 140V (Lunar Lake iGPU)

The Arc 140V is the integrated GPU in Intel's Lunar Lake processors, built on the Xe2 (Battlemage) architecture. Its LPDDR5x-8533 memory delivers 137 GB/s, which is high for a thin-laptop iGPU on Windows in 2026. Expect roughly 12-18 tok/s on a 7B model at Q4. It pairs with Intel's NPU 4 (48 TOPS) for hybrid inference, which makes it the 'best thin-laptop iGPU for AI' choice in 2026.

Released 2024 · 137 GB/s memory bandwidth

RUNLOCALAI SCORE

115 / 1000 · DD-tier · Estimated

Throughput	32 / 500
VRAM-fit	0 / 200
Ecosystem	80 / 200
Efficiency	52 / 100

Estimated at 11.0 tok/s, extrapolated from the 137 GB/s memory bandwidth. No measured benchmarks yet.
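
As a rough sanity check on a bandwidth-based estimate like this one, token generation on a memory-bound device is often approximated as effective bandwidth divided by the bytes read per token, which is roughly the size of the model weights. The sketch below illustrates that rule of thumb. It is not the formula RunLocalAI uses: the function name, the 4.0 GB weight size assumed for a 7B Q4 model, and the efficiency factors are all hypothetical.

```python
def estimate_decode_tok_s(bandwidth_gb_s: float, model_gb: float, efficiency: float = 1.0) -> float:
    """Rule-of-thumb decode speed for a memory-bandwidth-bound device.

    bandwidth_gb_s: peak memory bandwidth in GB/s (137 for this entry).
    model_gb:       weight bytes read per generated token, in GB (assumed).
    efficiency:     fraction of peak bandwidth actually achieved (assumed).
    """
    if model_gb <= 0:
        raise ValueError("model_gb must be positive")
    if not 0 < efficiency <= 1:
        raise ValueError("efficiency must be in (0, 1]")
    return bandwidth_gb_s * efficiency / model_gb


# Theoretical ceiling with the assumed 4.0 GB of weights.
ceiling = estimate_decode_tok_s(137.0, 4.0)

# The same estimate at a few assumed efficiency levels.
for eff in (0.3, 0.5, 1.0):
    print(f"efficiency={eff:.1f}: {estimate_decode_tok_s(137.0, 4.0, eff):.1f} tok/s")
```

Measured throughput normally lands below the ceiling, which is why the efficiency factor is a separate parameter rather than a constant baked into the function.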

WORKLOAD FIT

Plain-English: doesn't usefully fit modern chat models, and vision models won't fit either.

7B chat	✗ Doesn't fit
14B chat	✗ Doesn't fit
32B chat	✗ Doesn't fit
70B chat	✗ Doesn't fit
Coding agent	✗ Doesn't fit
Vision (≤8B VLM)	✗ Doesn't fit
Long context (32K)	✗ Doesn't fit

✓ Comfortable: fits with headroom
~ Tight: works, no slack
△ Marginal: needs aggressive quant
✗ Doesn't fit usefully

Verdicts are extrapolated from catalog VRAM, bandwidth, and ecosystem flags. For measured numbers, submit your own run with runlocalai-bench --submit.
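
The all-✗ grid is consistent with a catalog entry that lists 0 GB of VRAM: any fit check that compares model size against dedicated VRAM fails when the budget is zero. The sketch below illustrates that kind of check. It is not RunLocalAI's actual verdict logic; the function names, the 4.5 bits-per-weight figure for Q4, the 20% headroom, and the 16 GB shared-memory budget are assumptions chosen for illustration.

```python
def weights_gb(params_billions: float, bits_per_weight: float) -> float:
    """Approximate weight size in GB: parameters times bits per weight, over 8."""
    return params_billions * bits_per_weight / 8.0


def fits(model_gb: float, memory_budget_gb: float, headroom: float = 0.2) -> bool:
    """True if the model plus a fractional headroom fits in the memory budget.

    The headroom stands in for KV cache and runtime overhead (assumed value).
    """
    return model_gb * (1.0 + headroom) <= memory_budget_gb


seven_b_q4 = weights_gb(7, 4.5)  # about 3.9 GB under the assumed quantization

# Against the catalog's 0 GB of dedicated VRAM, nothing can fit.
print("vs 0 GB VRAM:", fits(seven_b_q4, 0.0))

# Against a hypothetical 16 GB shared-memory budget, the same model fits.
print("vs 16 GB shared:", fits(seven_b_q4, 16.0))
```

Because an iGPU draws from system memory, the second call is closer to how such a device behaves in practice, though how much shared memory is actually available depends on the laptop configuration and the driver.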

BLK · SPECS

Specs

VRAM	0 GB dedicated (the iGPU shares system memory)
Power draw	17 W
Released	2024
Backends	Vulkan

Frequently asked

Does Intel Arc 140V (Lunar Lake iGPU) support CUDA?

No. CUDA runs only on NVIDIA GPUs, so the Intel Arc 140V (Lunar Lake iGPU) does not support it. Use Vulkan-compatible tools such as the llama.cpp Vulkan backend, or check Intel's vendor-specific runtimes.
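
For the llama.cpp route, a minimal sketch of assembling a `llama-cli` invocation that offloads layers to the GPU is shown below. It assumes a llama.cpp build compiled with the Vulkan backend (the `GGML_VULKAN` CMake option) is on the PATH. The helper name, the model filename, and the chosen values are hypothetical; only the `-m`, `-ngl`, `-p`, and `-n` flags come from llama.cpp itself.

```python
import shlex


def build_llama_cli_args(model_path: str, prompt: str,
                         gpu_layers: int = 99, max_tokens: int = 128) -> list[str]:
    """Build an argument list for llama-cli (hypothetical helper).

    -m    path to a GGUF model file
    -ngl  number of layers to offload to the GPU backend
    -p    prompt text
    -n    maximum number of tokens to generate
    """
    if gpu_layers < 0 or max_tokens <= 0:
        raise ValueError("gpu_layers must be >= 0 and max_tokens must be > 0")
    return [
        "llama-cli",
        "-m", model_path,
        "-ngl", str(gpu_layers),
        "-p", prompt,
        "-n", str(max_tokens),
    ]


# Hypothetical model file; substitute a GGUF file you have downloaded.
args = build_llama_cli_args("model-7b-q4.gguf", "Hello", gpu_layers=99, max_tokens=64)

# Print the command for inspection; pass `args` to subprocess.run to execute it.
print(shlex.join(args))
```

Setting `-ngl` higher than the model's layer count is a common way to request full offload; lower it if the shared memory budget is tight.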


Reviewed by RunLocalAI Editorial. See our editorial policy for how we research and verify hardware specifications.

OP·Fredoline Eruo
