For AI agents: a documentation index is available at the root level at /llms.txt and /llms-full.txt. Append /llms.txt to any URL for a page-level index, or .md for the markdown version of any page.
DASHBOARDPLAYGROUNDDOCSCOMMUNITYLOG IN
Guides and conceptsAPI ReferenceRelease NotesLLMUCookbooks
Guides and conceptsAPI ReferenceRelease NotesLLMUCookbooks
  • Get Started
    • Introduction
    • Installation
    • Creating a client
    • Playground
    • FAQs
  • Models
    • An Overview of Cohere's Models
    • Aya
      • Command A+
      • Command A
      • Command A Reasoning
      • Command A Translate
      • Command A Vision
      • Command R7B
      • Command R+
      • Command R
    • Embed
    • Rerank
  • Text Generation
    • Introduction to Text Generation at Cohere
    • Using the Chat API
    • Reasoning
    • Image Inputs
    • Streaming Responses
    • Predictable Outputs
    • Advanced Generation Parameters
    • Tool Use
    • Tokens and Tokenizers
    • Summarizing Text
    • Safety Modes
  • Embeddings (Vectors, Search, Retrieval)
    • Introduction to Embeddings at Cohere
    • Semantic Search with Embeddings
    • Multimodal Embeddings
    • Batch Embedding Jobs
  • Going to Production
    • API Keys and Rate Limits
    • Going Live
    • Deprecations
    • How Does Cohere's Pricing Work?
  • Integrations
    • Integrating Embedding Models with Other Tools
    • Cohere and LangChain
    • LlamaIndex and Cohere
  • Deployment Options
    • Overview
    • SDK Compatibility
  • Tutorials
    • Cookbooks
    • LLM University
    • Build Things with Cohere!
    • Agentic RAG
    • Cohere on Azure
  • Responsible Use
    • Security
    • Usage Policy
    • Command A Technical Report
    • Command R and Command R+ Model Card
  • Cohere Labs
    • Cohere Labs Acceptable Use Policy
  • More Resources
    • Cohere Toolkit
    • Datasets
    • Improve Cohere Docs
LogoLogodocs
DASHBOARDPLAYGROUNDDOCSCOMMUNITYLOG IN
On this page
  • Description
  • What Can Command A Be Used For?
  • Command A is Chatty
ModelsCommand

Command A

Our largest, most performant model, ideal for building enterprise agents with a low compute footprint.
Was this page helpful?
Edit this page
Previous

Cohere's Command A Reasoning Model

Next
Built with
Capabilities
MultilingualSafety ModesCitationsTool UseStructured OutputsReasoningImage Inputs
Pricing
Input

$2.5 / 1M tokens

Output

$10 / 1M tokens

Specifications
    Context Window: 256,000 tokens
    Max Output Tokens: 8,000 tokens
    Knowledge Cutoff: June 1, 2024
    Hardware: Requires two GPUs to run (A100s / H100s)
API Endpoints
Model ID
command-a-plus-05-2026
Chat V2Chat V1Chat Completions
Try in Playground

Description

Command A is Cohere’s most performant model to date, excelling at real world enterprise tasks including tool use, retrieval augmented generation (RAG), agents, and multilingual use cases. At 111B parameters, Command A has a context length of 256K and only requires two GPUs (A100s / H100s) to run, while being significantly more efficient at inference time with 150% higher throughput compared to its predecessor, Command R+ 08-2024.

What Can Command A Be Used For?

Command A is excellent for:

  • Tool use - With tool use, Command models can be given tools such as search engines, APIs, vector databases, etc., which can expand their baseline functionality. Command A excels at tool use, exhibiting particular strength in using tools in real-world, diverse, and dynamic environments. In addition, Command A is good at avoiding unnecessarily calling tools, which is an important aspect of tool-use in practical applications.
  • Agents - As this is being written, agents are among the most exciting frontiers for large language models. Command A’s multistep tool use capabilities allow it to power fast and capable REACT agents. When set up as an internet-augmented research agent, for example, Command A ably completes tasks that require breaking down complex questions into subgoals, and also performs favorably in domains that utilize complex reasoning and active information seeking.
  • Retrieval augmented generation - Retrieval Augmented Generation (RAG) refers to the practice of ‘grounding’ model outputs in external data sources, which can increase accuracy. Command A is exceptionally good at generating responses in conversational tasks, attending over long inputs, and extracting and manipulating numerical information in financial settings.
  • Multilingual use cases - The model is trained to perform well in 23 languages: English, French, Spanish, Italian, German, Portuguese, Japanese, Korean, Chinese, Arabic, Russian, Polish, Turkish, Vietnamese, Dutch, Czech, Indonesian, Ukrainian, Romanian, Greek, Hindi, Hebrew, Persian. It has been trained to respond in the language of the user, or follow instructions to output a response in a different language. It also excels at performing cross-lingual tasks, such as translation or answering questions about content in other languages.

Command A is Chatty

By default, the model is interactive and optimized for conversation, meaning it is verbose and uses markdown to highlight code. To override this behavior, developers should use a system instruction which asks the model to simply provide the answer and to not use markdown or code block markers. To learn more, consult our documentation on system instructions.