$2.5 / 1M tokens
$10 / 1M tokens
Command A is Cohere’s most performant model to date, excelling at real world enterprise tasks including tool use, retrieval augmented generation (RAG), agents, and multilingual use cases. At 111B parameters, Command A has a context length of 256K and only requires two GPUs (A100s / H100s) to run, while being significantly more efficient at inference time with 150% higher throughput compared to its predecessor, Command R+ 08-2024.
Command A is excellent for:
By default, the model is interactive and optimized for conversation, meaning it is verbose and uses markdown to highlight code. To override this behavior, developers should use a system instruction which asks the model to simply provide the answer and to not use markdown or code block markers. To learn more, consult our documentation on system instructions.