RUNLOCALAI · v38

Operator-grade instrument for local-AI hardware intelligence. Hand-written verdicts. Real benchmarks. Reproducible commands.

OP · Fredoline Eruo
DISCLOSURE

Some links on this site are affiliate links (Amazon Associates and other first-class retailers). When you buy through them, we earn a small commission at no extra cost to you. Affiliate links do not influence our verdicts — there are cards we rate highly that we don't have affiliate relationships with, and cards that sell well that we refuse to recommend. Read more →


Choose my GPU

Answer nine questions. We rank the GPUs in our catalog by fit for local AI on your stack — top picks, alternates, and what to avoid. Hand-written rationale per card, honest caveats, and a one-click handoff into the custom build engine.

We don’t fake tok/s numbers. Every recommendation cites a model class and a workload-realistic throughput range. Cards over your budget appear last with explicit framing. Rankings come from rule-based scoring, not measured benchmarks.

Tell us about your build

URL updates as you change fields. Showing the balanced default — change any field to refine.

Price vs performance (budget-neutral)
11 cards · 1 skipped (no price)
[Scatter chart: effective price (log scale) vs budget-neutral performance, with your budget marked]
  • NVIDIA H100 PCIe — $25,000 · Avoid
  • NVIDIA RTX 6000 Ada Generation — $6,499 · Avoid
  • NVIDIA L40S — $8,500 · Avoid
  • NVIDIA GeForce RTX 3090 — $899 · Top pick
  • NVIDIA GeForce RTX 3090 Ti — $1,199 · Top pick
  • NVIDIA GeForce RTX 5070 Ti — $849 · Top pick
  • NVIDIA GeForce RTX 4080 — $1,099 · Top pick
  • NVIDIA GeForce RTX 4070 Ti Super — $829 · Top pick
  • NVIDIA GeForce RTX 4080 Super — $1,099 · Top pick
  • NVIDIA GeForce RTX 5060 Ti 16GB — $459 · Top pick
  • NVIDIA GeForce RTX 5080 — $1,199 · Top pick
Top pick (9)
Avoid (3)
Top picks
9 cards matching your stack tightly
Top pick
NVIDIA · 24 GB · ~$899 · Estimated (used-market price)
Operator-grade

NVIDIA GeForce RTX 3090

Top pick for your setup. With your $1,500 budget on Linux for coding agents, the NVIDIA GeForce RTX 3090 ranks here because 24 GB hits the workable band for coding agents — fits at sensible quants without becoming the bottleneck.
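As a sanity check on the 24 GB claim, here is a rough VRAM-sizing sketch; the ~4.85 bits/weight figure for Q4_K_M and the flat 3 GB allowance for KV cache and runtime buffers are our assumptions, not site data:

```python
def quant_vram_gb(params_b: float, bits_per_weight: float, overhead_gb: float = 3.0) -> float:
    """Rough VRAM footprint: weights at the given quant, plus a flat
    allowance for KV cache and runtime buffers (assumed, not measured)."""
    return params_b * bits_per_weight / 8 + overhead_gb

# 32B coder model at Q4_K_M (~4.85 bits/weight effective in llama.cpp)
print(round(quant_vram_gb(32, 4.85), 1))   # ~22.4 GB -> fits in 24 GB, tightly
print(round(quant_vram_gb(32, 16.0), 1))   # ~67.0 GB -> FP16 does not fit
```

At 32K context the KV cache can eat more than the flat allowance, which is why the card lands in the "workable" band rather than a roomy one.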

Sustained 450W+ — minimum 1000W Gold PSU + good airflow. Your power tolerance is moderate (350W ceiling), which this card will exceed under load.
Realistic model class
Qwen 2.5 Coder 32B Q4 + 32K context
Expected throughput
30-60 tok/s on 32B Q4 single-stream; 80-130 tok/s on 13B Q4.
Evidence
live data · editorial + reproduced community
Editorial: 1 benchmark · Reproduced community: 0 · Stale (>18mo): 0 rows
Cohort confidence: Low (1 cohort)
Needs measurement
This recommendation is rule-based, not evidence-backed yet.
  • Only 1 benchmark — below the 5-row threshold for cohort signal.
Help us measure NVIDIA GeForce RTX 3090 →
Measured throughput
top 1 of 1 on file · most recent first
  • ed · llama 3.1 8b instruct · Q4_K_M · 105.0 tok/s · 2026-05
Featured in stacks
  • Dual RTX 3090 workstation stack — 70B-class on $1,800 of used GPUs — Workstation · GPUs (2× 24GB used, the cheapest path to 48 GB total)
  • Quad RTX 3090 workstation stack — the prosumer 100B-class ceiling — Homelab · GPUs (4× 24GB used; the prosumer-ceiling stack)
Show 1 benchmark feeding this card▸
  • ed · #340 · llama-3.1-8b-instruct · Q4_K_M · 105.0 tok/s · 2026-05-13
How we scored this card▸

Each dimension is a 0-100 score. The card's position in the ranking is the weighted sum — but we surface tiers, not raw numbers. Bars are sorted by weight (most-influential first).

  • VRAM × workload · weight 22% · 70 (Good)
  • Budget fit · weight 18% · 95 (Excellent)
  • OS compatibility · weight 16% · 100 (Excellent)
  • Skill match · weight 10% · 95 (Excellent)
  • Power headroom · weight 8% · 80 (Strong)
  • Multi-GPU path · weight 8% · 80 (Strong)
  • Thermal / noise · weight 6% · 95 (Excellent)
  • Gaming alignment · weight 6% · 95 (Excellent)
  • Perf-per-watt · weight 6% · 85 (Strong)

Tier mapping: top ≥ 75 composite · alternate 60-74 · acceptable 40-59 · avoid < 40 or over-budget / incompatible.
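A minimal sketch of the weighted composite and tier mapping described above (dimension keys are our shorthand; the site's exact formula may differ):

```python
# Weights from the scoring bars above; they sum to 1.00.
WEIGHTS = {
    "vram_workload": 0.22, "budget_fit": 0.18, "os_compat": 0.16,
    "skill_match": 0.10, "power_headroom": 0.08, "multi_gpu_path": 0.08,
    "thermal_noise": 0.06, "gaming": 0.06, "perf_per_watt": 0.06,
}

def composite(scores: dict) -> float:
    """Weighted sum of 0-100 dimension scores."""
    return sum(WEIGHTS[k] * scores[k] for k in WEIGHTS)

def tier(score: float, over_budget: bool = False, incompatible: bool = False) -> str:
    """Map a composite score to the published tiers."""
    if over_budget or incompatible or score < 40:
        return "avoid"
    if score >= 75:
        return "top"
    if score >= 60:
        return "alternate"
    return "acceptable"

rtx_3090 = {  # the bar values shown for this card
    "vram_workload": 70, "budget_fit": 95, "os_compat": 100,
    "skill_match": 95, "power_headroom": 80, "multi_gpu_path": 80,
    "thermal_noise": 95, "gaming": 95, "perf_per_watt": 85,
}
print(round(composite(rtx_3090), 1), tier(composite(rtx_3090)))  # 87.3 top
```

A composite of ~87 clears the ≥75 threshold, consistent with the card's "top pick" placement.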

Caveats
  • Used-market only — fan/thermal-pad inspection required; new MSRP from launch is no longer the relevant price.
Try in custom builder → · See model-fit table · Recommended runtime: ollama
Estimated (rule-based scoring) · Help us measure this — submit a benchmark for NVIDIA GeForce RTX 3090
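The throughput bands quoted on these cards can be sanity-checked with the standard first-order estimate for single-stream decode: generation is memory-bound, so tokens per second is roughly memory bandwidth divided by weight size. A sketch; the ~936 GB/s RTX 3090 bandwidth figure and the Q4 weight sizes are our assumptions:

```python
def decode_tok_s_estimate(bandwidth_gb_s: float, weights_gb: float) -> float:
    """First-order single-stream decode rate: each generated token
    streams the full weight set from VRAM once."""
    return bandwidth_gb_s / weights_gb

# RTX 3090: ~936 GB/s GDDR6X bandwidth (spec-sheet figure, our assumption)
print(round(decode_tok_s_estimate(936, 19.4)))  # ~48 tok/s for a 32B Q4 model
print(round(decode_tok_s_estimate(936, 7.9)))   # ~118 tok/s for a 13B Q4 model
```

Both estimates land inside the quoted 30-60 and 80-130 tok/s bands, which is the kind of cross-check the "workload-realistic range" claim invites.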
Top pick
NVIDIA · 24 GB · ~$1,199 · Estimated (used-market price)
Operator-grade

NVIDIA GeForce RTX 3090 Ti

Top pick for your setup. With your $1,500 budget on Linux for coding agents, the NVIDIA GeForce RTX 3090 Ti ranks here because 24 GB hits the workable band for coding agents — fits at sensible quants without becoming the bottleneck.

Sustained 450W+ — minimum 1000W Gold PSU + good airflow. Your power tolerance is moderate (350W ceiling), which this card will exceed under load.
Realistic model class
Qwen 2.5 Coder 32B Q4 + 32K context
Expected throughput
30-60 tok/s on 32B Q4 single-stream; 80-130 tok/s on 13B Q4.
Evidence
live data · editorial + reproduced community
Editorial: 0 benchmarks · Reproduced community: 0 · Stale (>18mo): 0 rows
Cohort confidence: — (none)
Needs measurement
This recommendation is rule-based, not evidence-backed yet.
  • No benchmarks on file for this hardware.
Help us measure NVIDIA GeForce RTX 3090 Ti →
How we scored this card▸

Each dimension is a 0-100 score. The card's position in the ranking is the weighted sum — but we surface tiers, not raw numbers. Bars are sorted by weight (most-influential first).

  • VRAM × workload · weight 22% · 70 (Good)
  • Budget fit · weight 18% · 95 (Excellent)
  • OS compatibility · weight 16% · 100 (Excellent)
  • Skill match · weight 10% · 95 (Excellent)
  • Power headroom · weight 8% · 25 (Weak)
  • Multi-GPU path · weight 8% · 80 (Strong)
  • Thermal / noise · weight 6% · 95 (Excellent)
  • Gaming alignment · weight 6% · 95 (Excellent)
  • Perf-per-watt · weight 6% · 65 (Good)

Tier mapping: top ≥ 75 composite · alternate 60-74 · acceptable 40-59 · avoid < 40 or over-budget / incompatible.

Caveats
  • Sustained ~450W — plan for a 1000W+ PSU and adequate case airflow.
  • Used-market only — fan/thermal-pad inspection required; new MSRP from launch is no longer the relevant price.
Try in custom builder → · See model-fit table · Recommended runtime: ollama
Estimated (rule-based scoring) · Help us measure this — submit a benchmark for NVIDIA GeForce RTX 3090 Ti
Top pick
NVIDIA · 24 GB
Operator-grade

NVIDIA GeForce RTX 5090 Mobile

Top pick for your setup. With your $1,500 budget on Linux for coding agents, the NVIDIA GeForce RTX 5090 Mobile ranks here because 24 GB hits the workable band for coding agents — fits at sensible quants without becoming the bottleneck.

Realistic model class
Qwen 2.5 Coder 32B Q4 + 32K context
Expected throughput
30-60 tok/s on 32B Q4 single-stream; 80-130 tok/s on 13B Q4.
Evidence
live data · editorial + reproduced community
Editorial: 0 benchmarks · Reproduced community: 0 · Stale (>18mo): 0 rows
Cohort confidence: — (none)
Needs measurement
This recommendation is rule-based, not evidence-backed yet.
  • No benchmarks on file for this hardware.
Help us measure NVIDIA GeForce RTX 5090 Mobile →
How we scored this card▸

Each dimension is a 0-100 score. The card's position in the ranking is the weighted sum — but we surface tiers, not raw numbers. Bars are sorted by weight (most-influential first).

  • VRAM × workload · weight 22% · 70 (Good)
  • Budget fit · weight 18% · 50 (Acceptable)
  • OS compatibility · weight 16% · 100 (Excellent)
  • Skill match · weight 10% · 95 (Excellent)
  • Power headroom · weight 8% · 95 (Excellent)
  • Multi-GPU path · weight 8% · 80 (Strong)
  • Thermal / noise · weight 6% · 95 (Excellent)
  • Gaming alignment · weight 6% · 95 (Excellent)
  • Perf-per-watt · weight 6% · 95 (Excellent)

Tier mapping: top ≥ 75 composite · alternate 60-74 · acceptable 40-59 · avoid < 40 or over-budget / incompatible.

Try in custom builder → · See model-fit table · Recommended runtime: ollama
Estimated (rule-based scoring) · Help us measure this — submit a benchmark for NVIDIA GeForce RTX 5090 Mobile
Top pick
NVIDIA · 16 GB · ~$849
Operator-grade

NVIDIA GeForce RTX 5070 Ti

Top pick for your setup. With your $1,500 budget on Linux for coding agents, the NVIDIA GeForce RTX 5070 Ti sits in this tier on a balance of capability, OS compat, power, and budget fit.

Realistic model class
Qwen 2.5 Coder 14B FP16, agents OK
Expected throughput
40-70 tok/s on 7B Q4; 20-35 tok/s on 13B Q4.
Evidence
live data · editorial + reproduced community
Editorial: 0 benchmarks · Reproduced community: 0 · Stale (>18mo): 0 rows
Cohort confidence: — (none)
Needs measurement
This recommendation is rule-based, not evidence-backed yet.
  • No benchmarks on file for this hardware.
Help us measure NVIDIA GeForce RTX 5070 Ti →
How we scored this card▸

Each dimension is a 0-100 score. The card's position in the ranking is the weighted sum — but we surface tiers, not raw numbers. Bars are sorted by weight (most-influential first).

  • VRAM × workload · weight 22% · 33 (Weak)
  • Budget fit · weight 18% · 95 (Excellent)
  • OS compatibility · weight 16% · 100 (Excellent)
  • Skill match · weight 10% · 95 (Excellent)
  • Power headroom · weight 8% · 80 (Strong)
  • Multi-GPU path · weight 8% · 80 (Strong)
  • Thermal / noise · weight 6% · 95 (Excellent)
  • Gaming alignment · weight 6% · 90 (Excellent)
  • Perf-per-watt · weight 6% · 85 (Strong)

Tier mapping: top ≥ 75 composite · alternate 60-74 · acceptable 40-59 · avoid < 40 or over-budget / incompatible.

Caveats
  • 16 GB is below the comfortable VRAM minimum for coding agents — expect quant downgrades or very tight context windows.
Try in custom builder → · See model-fit table · Recommended runtime: ollama
Estimated (rule-based scoring) · Help us measure this — submit a benchmark for NVIDIA GeForce RTX 5070 Ti
Top pick
NVIDIA · 16 GB · ~$1,099
Operator-grade

NVIDIA GeForce RTX 4080

Top pick for your setup. With your $1,500 budget on Linux for coding agents, the NVIDIA GeForce RTX 4080 sits in this tier on a balance of capability, OS compat, power, and budget fit.

Realistic model class
Qwen 2.5 Coder 14B FP16, agents OK
Expected throughput
40-70 tok/s on 7B Q4; 20-35 tok/s on 13B Q4.
Evidence
live data · editorial + reproduced community
Editorial: 0 benchmarks · Reproduced community: 0 · Stale (>18mo): 0 rows
Cohort confidence: — (none)
Needs measurement
This recommendation is rule-based, not evidence-backed yet.
  • No benchmarks on file for this hardware.
Help us measure NVIDIA GeForce RTX 4080 →
How we scored this card▸

Each dimension is a 0-100 score. The card's position in the ranking is the weighted sum — but we surface tiers, not raw numbers. Bars are sorted by weight (most-influential first).

  • VRAM × workload · weight 22% · 33 (Weak)
  • Budget fit · weight 18% · 95 (Excellent)
  • OS compatibility · weight 16% · 100 (Excellent)
  • Skill match · weight 10% · 95 (Excellent)
  • Power headroom · weight 8% · 80 (Strong)
  • Multi-GPU path · weight 8% · 80 (Strong)
  • Thermal / noise · weight 6% · 95 (Excellent)
  • Gaming alignment · weight 6% · 90 (Excellent)
  • Perf-per-watt · weight 6% · 85 (Strong)

Tier mapping: top ≥ 75 composite · alternate 60-74 · acceptable 40-59 · avoid < 40 or over-budget / incompatible.

Caveats
  • 16 GB is below the comfortable VRAM minimum for coding agents — expect quant downgrades or very tight context windows.
Try in custom builder → · See model-fit table · Recommended runtime: ollama
Estimated (rule-based scoring) · Help us measure this — submit a benchmark for NVIDIA GeForce RTX 4080
Top pick
NVIDIA · 16 GB · ~$829
Operator-grade

NVIDIA GeForce RTX 4070 Ti Super

Top pick for your setup. With your $1,500 budget on Linux for coding agents, the NVIDIA GeForce RTX 4070 Ti Super sits in this tier on a balance of capability, OS compat, power, and budget fit.

Realistic model class
Qwen 2.5 Coder 14B FP16, agents OK
Expected throughput
40-70 tok/s on 7B Q4; 20-35 tok/s on 13B Q4.
Evidence
live data · editorial + reproduced community
Editorial: 0 benchmarks · Reproduced community: 0 · Stale (>18mo): 0 rows
Cohort confidence: — (none)
Needs measurement
This recommendation is rule-based, not evidence-backed yet.
  • No benchmarks on file for this hardware.
Help us measure NVIDIA GeForce RTX 4070 Ti Super →
How we scored this card▸

Each dimension is a 0-100 score. The card's position in the ranking is the weighted sum — but we surface tiers, not raw numbers. Bars are sorted by weight (most-influential first).

  • VRAM × workload · weight 22% · 33 (Weak)
  • Budget fit · weight 18% · 95 (Excellent)
  • OS compatibility · weight 16% · 100 (Excellent)
  • Skill match · weight 10% · 95 (Excellent)
  • Power headroom · weight 8% · 80 (Strong)
  • Multi-GPU path · weight 8% · 80 (Strong)
  • Thermal / noise · weight 6% · 95 (Excellent)
  • Gaming alignment · weight 6% · 90 (Excellent)
  • Perf-per-watt · weight 6% · 85 (Strong)

Tier mapping: top ≥ 75 composite · alternate 60-74 · acceptable 40-59 · avoid < 40 or over-budget / incompatible.

Caveats
  • 16 GB is below the comfortable VRAM minimum for coding agents — expect quant downgrades or very tight context windows.
Try in custom builder → · See model-fit table · Recommended runtime: ollama
Estimated (rule-based scoring) · Help us measure this — submit a benchmark for NVIDIA GeForce RTX 4070 Ti Super
Top pick
NVIDIA · 16 GB · ~$1,099
Operator-grade

NVIDIA GeForce RTX 4080 Super

Top pick for your setup. With your $1,500 budget on Linux for coding agents, the NVIDIA GeForce RTX 4080 Super sits in this tier on a balance of capability, OS compat, power, and budget fit.

Realistic model class
Qwen 2.5 Coder 14B FP16, agents OK
Expected throughput
40-70 tok/s on 7B Q4; 20-35 tok/s on 13B Q4.
Evidence
live data · editorial + reproduced community
Editorial: 0 benchmarks · Reproduced community: 0 · Stale (>18mo): 0 rows
Cohort confidence: — (none)
Needs measurement
This recommendation is rule-based, not evidence-backed yet.
  • No benchmarks on file for this hardware.
Help us measure NVIDIA GeForce RTX 4080 Super →
How we scored this card▸

Each dimension is a 0-100 score. The card's position in the ranking is the weighted sum — but we surface tiers, not raw numbers. Bars are sorted by weight (most-influential first).

  • VRAM × workload · weight 22% · 33 (Weak)
  • Budget fit · weight 18% · 95 (Excellent)
  • OS compatibility · weight 16% · 100 (Excellent)
  • Skill match · weight 10% · 95 (Excellent)
  • Power headroom · weight 8% · 80 (Strong)
  • Multi-GPU path · weight 8% · 80 (Strong)
  • Thermal / noise · weight 6% · 95 (Excellent)
  • Gaming alignment · weight 6% · 90 (Excellent)
  • Perf-per-watt · weight 6% · 85 (Strong)

Tier mapping: top ≥ 75 composite · alternate 60-74 · acceptable 40-59 · avoid < 40 or over-budget / incompatible.

Caveats
  • 16 GB is below the comfortable VRAM minimum for coding agents — expect quant downgrades or very tight context windows.
Try in custom builder → · See model-fit table · Recommended runtime: ollama
Estimated (rule-based scoring) · Help us measure this — submit a benchmark for NVIDIA GeForce RTX 4080 Super
Top pick
NVIDIA · 16 GB · ~$459
Operator-grade

NVIDIA GeForce RTX 5060 Ti 16GB

Top pick for your setup. With your $1,500 budget on Linux for coding agents, the NVIDIA GeForce RTX 5060 Ti 16GB sits in this tier on a balance of capability, OS compat, power, and budget fit.

Realistic model class
Qwen 2.5 Coder 14B FP16, agents OK
Expected throughput
40-70 tok/s on 7B Q4; 20-35 tok/s on 13B Q4.
Evidence
live data · editorial + reproduced community
Editorial: 0 benchmarks · Reproduced community: 0 · Stale (>18mo): 0 rows
Cohort confidence: — (none)
Needs measurement
This recommendation is rule-based, not evidence-backed yet.
  • No benchmarks on file for this hardware.
Help us measure NVIDIA GeForce RTX 5060 Ti 16GB →
How we scored this card▸

Each dimension is a 0-100 score. The card's position in the ranking is the weighted sum — but we surface tiers, not raw numbers. Bars are sorted by weight (most-influential first).

  • VRAM × workload · weight 22% · 33 (Weak)
  • Budget fit · weight 18% · 80 (Strong)
  • OS compatibility · weight 16% · 100 (Excellent)
  • Skill match · weight 10% · 95 (Excellent)
  • Power headroom · weight 8% · 95 (Excellent)
  • Multi-GPU path · weight 8% · 80 (Strong)
  • Thermal / noise · weight 6% · 95 (Excellent)
  • Gaming alignment · weight 6% · 70 (Good)
  • Perf-per-watt · weight 6% · 95 (Excellent)

Tier mapping: top ≥ 75 composite · alternate 60-74 · acceptable 40-59 · avoid < 40 or over-budget / incompatible.

Caveats
  • 16 GB is below the comfortable VRAM minimum for coding agents — expect quant downgrades or very tight context windows.
Try in custom builder → · See model-fit table · Recommended runtime: ollama
Estimated (rule-based scoring) · Help us measure this — submit a benchmark for NVIDIA GeForce RTX 5060 Ti 16GB
Top pick
NVIDIA · 16 GB · ~$1,199
Operator-grade

NVIDIA GeForce RTX 5080

Top pick for your setup. With your $1,500 budget on Linux for coding agents, the NVIDIA GeForce RTX 5080 sits in this tier on a balance of capability, OS compat, power, and budget fit.

Realistic model class
Qwen 2.5 Coder 14B FP16, agents OK
Expected throughput
60-100 tok/s on 7B Q4; 30-50 tok/s on 13B Q4.
Evidence
live data · editorial + reproduced community
Editorial: 2 benchmarks · Reproduced community: 0 · Stale (>18mo): 0 rows
Cohort confidence: Low (2 cohorts)
Needs measurement
This recommendation is rule-based, not evidence-backed yet.
  • Only 2 benchmarks — below the 5-row threshold for cohort signal.
Help us measure NVIDIA GeForce RTX 5080 →
Measured throughput
top 2 of 2 on file · most recent first
  • ed · repro · llama 3.1 8b instruct · Q4_K_M · 132.2 tok/s · 2026-05
  • ed · llama 3.1 8b instruct · Q4_K_M · 118.2 tok/s · 2026-05
Show 2 benchmarks feeding this card▸
  • ed · #338 · llama-3.1-8b-instruct · Q4_K_M · 132.2 tok/s · 2026-05-11 · repro
  • ed · #333 · llama-3.1-8b-instruct · Q4_K_M · 118.2 tok/s · 2026-05-05
How we scored this card▸

Each dimension is a 0-100 score. The card's position in the ranking is the weighted sum — but we surface tiers, not raw numbers. Bars are sorted by weight (most-influential first).

  • VRAM × workload · weight 22% · 33 (Weak)
  • Budget fit · weight 18% · 95 (Excellent)
  • OS compatibility · weight 16% · 100 (Excellent)
  • Skill match · weight 10% · 95 (Excellent)
  • Power headroom · weight 8% · 50 (Acceptable)
  • Multi-GPU path · weight 8% · 80 (Strong)
  • Thermal / noise · weight 6% · 95 (Excellent)
  • Gaming alignment · weight 6% · 95 (Excellent)
  • Perf-per-watt · weight 6% · 65 (Good)

Tier mapping: top ≥ 75 composite · alternate 60-74 · acceptable 40-59 · avoid < 40 or over-budget / incompatible.

Caveats
  • 16 GB is below the comfortable VRAM minimum for coding agents — expect quant downgrades or very tight context windows.
Try in custom builder → · See model-fit table · Recommended runtime: ollama
Estimated (rule-based scoring) · Help us measure this — submit a benchmark for NVIDIA GeForce RTX 5080
Why we ruled these out
Over-budget or fundamentally incompatible — listed for the upgrade-path conversation
  • NVIDIA H100 PCIe — Out of budget for this query.
    ~$25,000
  • NVIDIA RTX 6000 Ada Generation — Out of budget for this query.
    ~$6,499
  • NVIDIA L40S — Out of budget for this query.
    ~$8,500

Where to go from here

Stack Builder →

One step further: this card + runtime + 1-3 models + cost rollup + ready-to-paste install script. Eight inputs → full rig.

Custom build engine →

Once you’ve picked a card, model the full build (CPU, RAM, runtime) to see which models fit comfortably.

GPU buying guide 2026 →

The long-form essay version: VRAM tiers, MoE math, NVLink truth, used-market price discipline.

Hardware combinations →

Curated multi-GPU and Apple-cluster setups with effective-VRAM math you can trust.

Scoring methodology →

How the trust layer behind these recommendations actually works — every dimension, every formula, the honest limits.

Cohort coverage report →

Where the intelligence graph has signal — and which model × hardware × quant cohorts are still underpowered.

Reproduce a benchmark →

Help tip a cohort across the 5-row threshold for outlier detection — the most operator-impactful contribution.