ModelsNorth

North Mini Code

Capabilities
ReasoningTool UseStructured OutputsMultilingualImage InputsSafety ModesCitations
Pricing

For both trial keys and production keys, North Mini Code is free until rate limits are reached. Learn more about rate limits for different models and key types here.

North Mini Code can be used in production through Cohere's Model Vault.

Specifications
    Context Window: 256K tokens
    Max Output Tokens: 64K tokens
API Endpoints
Model ID
north-mini-code-1-0
Chat V2Chat V1Chat Completions

Description

North Mini Code is a 30B total / 3B active parameter Mixture of Experts model trained specifically for agentic coding. It is the first model in Cohere’s North family of code agent models, released under the Apache 2.0 license. Its small active footprint makes it suitable for local deployment while remaining competitive with larger open-source models on software engineering and terminal-based agentic benchmarks.

What can North Mini Code be used for?

North Mini Code is well-suited for:

  • Agentic software engineering: repo-level code changes inside harnesses like SWE-Agent and OpenCode.
  • Terminal-based agents: driving shell tools end-to-end across multi-turn tasks.
  • Local and on-device coding: the 3B active parameter footprint enables low-latency inference on local hardware.
  • Code generation: scientific coding and algorithmic reasoning tasks outside of an agent loop.

It was trained against multiple harnesses, so performance generalizes across agent scaffolds rather than being tuned to a single one.