ModelsCommand

Command R7B

Cohere’s Command R7B model

Command R7B is the smallest, fastest, and final model in our R family of enterprise-focused large language models (LLMs). With a context window of 128K and a compact architecture, Command R7B offers state-of-the-art performance across a variety of real-world tasks, and it is especially good at high throughput, latency-sensitive applications like chatbots and code assistants. What’s more, it’s small size also unlocks dramatically cheaper deployment infrastructure—such as consumer GPUs and CPUs—which means it can be used for on-device inference.

Command R7B is available today on the Cohere Platform as well as accessible on HuggingFace, or you can access it in the SDK with command-r7b-12-2024. For more information, check out our dedicated blog post.

Model Details

Model NameDescriptionModalityContext LengthMaximum Output TokensEndpoints
command-r7b-12-2024command-r7b-12-2024 is a small, fast update of the Command R+ model, delivered in December 2024. It excels at RAG, tool use, agents, and similar tasks requiring complex reasoning and multiple steps.Text128k4kChat

What Can Command R7B Be Used For?

Command R7B is excellent for:

  • RAG - Retrieval Augmented Generation (RAG) refers to the practice of ‘grounding’ model outputs in external data sources, which can increase accuracy. Command R7B is exceptionally good at generating responses in conversational tasks, attending over long inputs, and extracting and manipulating numerical information in financial settings.
  • Tool-use - With tool use, Command models can be given tools such as search engines, APIs, vector databases, etc., which can expand their baseline functionality. Command R7B excels at tool use, exhibiting particular strength in using tools in real-world, diverse, and dynamic environments. In addition, Command R7B is good at avoiding unnecessarily calling tools, which is an important aspect of tool-use in practical applications.
  • Agents - As this is being written, agents are among the most exciting frontiers for large language models. Command R7B’s multistep tool use capabilities allow it to power fast and capable REACT agents. When set up as an internet-augmented research agent, for example, Command R7B ably completes tasks that require breaking down complex questions into subgoals, and also performs favorably in domains that utilize complex reasoning and active information seeking.
Built with