Cohere's Command R7B Model

Capabilities

MultilingualSafety ModesCitationsTool UseStructured OutputsReasoningImage Inputs

Pricing

Input

$0.0375 / 1M tokens

Output

$0.15 / 1M tokens

Specifications

Context Window: 128,000 tokens

Max Output Tokens: 4,000 tokens

Knowledge Cutoff: June 1, 2024

API Endpoints

Model ID

command-r7b-12-2024

Chat V2Chat V1Chat Completions

Try in Playground

Description

Command R7B is the smallest and fastest model in our R family of enterprise-focused large language models (LLMs). With a context window of 128K and a compact architecture, Command R7B offers state-of-the-art performance across a variety of real-world tasks, and it is especially good at high throughput, latency-sensitive applications like chatbots and code assistants. What’s more, it’s small size also unlocks dramatically cheaper deployment infrastructure—such as consumer GPUs and CPUs—which means it can be used for on-device inference.

Command R7B is available today on the Cohere Platform as well as accessible on HuggingFace, or you can access it in the SDK with command-r7b-12-2024. For more information, check out our dedicated blog post.

What Can Command R7B Be Used For?

Command R7B is excellent for:

RAG - Retrieval Augmented Generation (RAG) refers to the practice of ‘grounding’ model outputs in external data sources, which can increase accuracy. Command R7B is exceptionally good at generating responses in conversational tasks, attending over long inputs, and extracting and manipulating numerical information in financial settings.
Tool-use - With tool use, Command models can be given tools such as search engines, APIs, vector databases, etc., which can expand their baseline functionality. Command R7B excels at tool use, exhibiting particular strength in using tools in real-world, diverse, and dynamic environments. In addition, Command R7B is good at avoiding unnecessarily calling tools, which is an important aspect of tool-use in practical applications.
Agents - As this is being written, agents are among the most exciting frontiers for large language models. Command R7B’s multistep tool use capabilities allow it to power fast and capable REACT agents. When set up as an internet-augmented research agent, for example, Command R7B ably completes tasks that require breaking down complex questions into subgoals, and also performs favorably in domains that utilize complex reasoning and active information seeking.