1000B parameters
Commercial OK

Kimi K2.6

License: Kimi Open Weights License · Released: Mar 10, 2026 · Context: 2,000,000 tokens

Overview

Moonshot's long-context, agent-oriented MoE. Optimized for stability under tool use and multi-step coding/planning workflows.

Strengths

  • Agent-tuned
  • Stable tool use
  • Long context

Weaknesses

  • Datacenter-class

Quantization variants

Each quantization trades some model quality for a smaller file and a lower VRAM requirement. Q4_K_M is the most popular starting point.

Quantization    File size    VRAM required
Q4_K_M          600.0 GB     700 GB
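
As a rough sanity check on those figures, the sketch below derives the average bits stored per weight and the VRAM left over after the weights are loaded. It uses only the numbers on this page (1000B parameters, 600.0 GB file, 700 GB VRAM) and assumes decimal gigabytes. What the leftover VRAM is budgeted for (KV cache, activations, runtime overhead) is an assumption; the page does not say.

```python
# Figures taken from this page.
PARAMS = 1000e9          # 1000B parameters
FILE_SIZE_GB = 600.0     # Q4_K_M file size
VRAM_GB = 700.0          # Q4_K_M VRAM required

BYTES_PER_GB = 1e9       # assumption: decimal gigabytes


def bits_per_weight(file_size_gb: float, params: float) -> float:
    """Average number of bits stored per parameter."""
    return file_size_gb * BYTES_PER_GB * 8 / params


def headroom_gb(vram_gb: float, file_size_gb: float) -> float:
    """VRAM left over once the weights themselves are loaded."""
    return vram_gb - file_size_gb


print(f"{bits_per_weight(FILE_SIZE_GB, PARAMS):.1f} bits per weight")
print(f"{headroom_gb(VRAM_GB, FILE_SIZE_GB):.0f} GB headroom")
```

Under these assumptions the file works out to a little under 5 bits per weight, with about 100 GB of VRAM beyond the weights.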

Get the model

HuggingFace

Original weights

huggingface.co/moonshotai/Kimi-K2.6

Source repository. It hosts the original weights only, so you need to quantize them yourself.

Hardware that runs this

Cards with enough VRAM for at least one quantization of Kimi K2.6.
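
Because the Q4_K_M requirement is 700 GB, in practice this means combining several cards. The sketch below estimates how many identical cards are needed to reach that total. It assumes VRAM pools perfectly across cards, which real multi-GPU setups only approximate, and the per-card capacities are hypothetical values chosen for illustration, not a list of supported hardware.

```python
import math

REQUIRED_VRAM_GB = 700  # Q4_K_M requirement listed on this page


def cards_needed(vram_per_card_gb: float, required_gb: float = REQUIRED_VRAM_GB) -> int:
    """Smallest number of identical cards whose combined VRAM meets the requirement.

    Assumes memory pools perfectly across cards (no per-card overhead).
    """
    if vram_per_card_gb <= 0:
        raise ValueError("vram_per_card_gb must be positive")
    return math.ceil(required_gb / vram_per_card_gb)


# Hypothetical per-card capacities, for illustration only.
for capacity in (24, 80, 192):
    print(f"{capacity} GB cards: {cards_needed(capacity)}")
```

Treat the result as a lower bound: per-card runtime overhead and uneven sharding usually push the real count higher.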

Compare alternatives

Models worth comparing

Same parameter band, plus what's one tier above and below — so you can decide what actually fits your hardware.

Step up
More capable — bigger memory footprint
No reviewed models in the next tier up yet.

Frequently asked

What's the minimum VRAM to run Kimi K2.6?

700 GB of VRAM is enough to run Kimi K2.6 at the Q4_K_M quantization (file size 600.0 GB). Higher-quality quantizations need more.

Can I use Kimi K2.6 commercially?

Yes — Kimi K2.6 ships under the Kimi Open Weights License, which permits commercial use. Always read the license text before deployment.

What's the context length of Kimi K2.6?

Kimi K2.6 supports a context window of 2,000,000 tokens (2M).

Source: huggingface.co/moonshotai/Kimi-K2.6

Reviewed by RunLocalAI Editorial. See our editorial policy for how we research and verify model claims.