ModelsCommand

The Command R Model

Command R is a large language model optimized for conversational interaction and long context tasks. It targets the “scalable” category of models that balance high performance with strong accuracy, enabling companies to move beyond proof of concept and into production.

Command R boasts high precision on retrieval augmented generation (RAG) and tool use tasks, low latency and high throughput, a long 128,000-token context length, and strong capabilities across 10 key languages.

For information on toxicity, safety, and using this model responsibly check out our Command model card.

Model Details

Model NameDescriptionModalityContext LengthMaximum Output TokensEndpoints
command-r-08-2024command-r-08-2024 is an update of the Command R model, delivered in August 2024.Text128k4kChat
command-r-03-2024Command R is an instruction-following conversational model that performs language tasks at a higher quality, more reliably, and with a longer context than previous models. It can be used for complex workflows like code generation, retrieval augmented generation (RAG), tool use, and agents.Text128k4kChat
command-rcommand-r is an alias for command-r-03-2024, so if you use command-r in the API, that’s the model you’re pointing to.Text128k4kChat
c4ai-aya-expanse-8bAya Expanse is a highly performant 8B multilingual model, designed to rival monolingual performance through innovations in instruction tuning with data arbitrage, preference training, and model merging. Serves 23 languages.Text8k4kChat
c4ai-aya-expanse-32bAya Expanse is a highly performant 32B multilingual model, designed to rival monolingual performance through innovations in instruction tuning with data arbitrage, preference training, and model merging. Serves 23 languages.Text8k4kChat

Command R August 2024 Release

Cohere’s flagship text-generation models, Command R and Command R+, received a substantial update in August 2024. We chose to designate these models with time stamps, so in the API Command R 08-2024 is accesible with command-r-08-2024.

With the release, both models include the following feature improvements:

  • For tool use, Command R and Command R+ have demonstrated improved decision-making around whether or not to use a tool.
  • The updated models are better able to follow instructions included by the user in the preamble.
  • Better structured data analysis for structured data manipulation.
  • Improved robustness to non-semantic prompt changes like white space or new lines.
  • Models will decline unanswerable questions and are now able to execute RAG workflows without citations

command-r-08-2024 delivers around 50% higher throughput and 20% lower latencies as compared to the previous Command R version, while cutting the hardware footprint required to serve the model by half. Read more in the relevant blog post.

What’s more, both these updated models can now operate in one of several safety modes, which gives developers more granular control over how models generate output in a variety of different contexts. Find more in these safety modes docs.

Unique Command R Model Capabilities

Command R has been trained on a massive corpus of diverse texts in multiple languages, and can perform a wide array of text-generation tasks. Moreover, Command R has been trained with a particular focus on excelling in some of the most critical business use-cases.

Multilingual Capabilities

We want Command R to serve as many people, organizations, and markets as possible, so the new Command R is capable of interacting in many languages to a fairly high degree of accuracy.

The model is optimized to perform well in the following languages: English, French, Spanish, Italian, German, Brazilian Portuguese, Japanese, Korean, Simplified Chinese, and Arabic.

Additionally, pre-training data has been included for the following 13 languages: Russian, Polish, Turkish, Vietnamese, Dutch, Czech, Indonesian, Ukrainian, Romanian, Greek, Hindi, Hebrew, Persian.

The model has been trained to respond in the language of the user. Here’s an example:

PYTHON
1import cohere
2co = cohere.ClientV2(api_key="<YOUR API KEY>")
3
4res = co.chat(
5 model="command-r-plus-08-2024",
6 messages=[
7 {
8 "role" : "user",
9 "content" : "Écris une description de produit pour une voiture électrique en 50 à 75 mots"
10 }
11 ]
12)
13
14print(res)

And here’s what the response might look like:

TEXT
Découvrez la voiture électrique qui va révolutionner votre façon de conduire.
Avec son design élégant, cette voiture offre une expérience de conduite unique
avec une accélération puissante et une autonomie impressionnante. Sa
technologie avancée vous garantit une charge rapide et une fiabilité inégalée.
Avec sa conception innovante et durable, cette voiture est parfaite pour les
trajets urbains et les longues distances. Profitez d'une conduite silencieuse
et vivez l'expérience de la voiture électrique!

Command R can not only be used to generate text in several languages but can also perform cross-lingual tasks such as translation or answering questions about content in other languages.

Retrieval Augmented Generation

Command R has been trained with the ability to ground its generations. This means that it can generate responses based on a list of supplied document snippets, and it will include citations in its response indicating the source of the information.

For more information, check out our dedicated guide on retrieval augmented generation.

Tool Use

Command R has been trained with conversational tool use capabilities. This functionality takes a conversation as input (with an optional user-system preamble), along with a list of available tools. The model will then generate a json-formatted list of actions to execute on a subset of those tools. For more information, check out our dedicated tool use guide.