North Mini Code

Capabilities

ReasoningTool UseStructured OutputsMultilingualImage InputsSafety ModesCitations

Pricing

For both trial keys and production keys, North Mini Code is free until rate limits are reached. Learn more about rate limits for different models and key types here.

North Mini Code can be used in production through Cohere's Model Vault.

Specifications

Context Window: 256K tokens

Max Output Tokens: 64K tokens

API Endpoints

Model ID

north-mini-code-1-0

Chat V2Chat V1Chat Completions

Try in Playground

Description

North Mini Code is a 30B total / 3B active parameter Mixture of Experts model trained specifically for agentic coding. It is the first model in Cohere’s North family of code agent models, released under the Apache 2.0 license. Its small active footprint makes it suitable for local deployment while remaining competitive with larger open-source models on software engineering and terminal-based agentic benchmarks.

What can North Mini Code be used for?

North Mini Code is well-suited for:

Agentic software engineering: repo-level code changes inside harnesses like SWE-Agent and OpenCode.
Terminal-based agents: driving shell tools end-to-end across multi-turn tasks.
Local and on-device coding: the 3B active parameter footprint enables low-latency inference on local hardware.
Code generation: scientific coding and algorithmic reasoning tasks outside of an agent loop.

It was trained against multiple harnesses, so performance generalizes across agent scaffolds rather than being tuned to a single one.