Announcing Cohere’s Command A+
We’re pleased to announce the release of Command A+, the last model in the Command A family of models, combining support for vision inputs, reasoning capabilities, translation capabilities, and agentic tasks all within the same model. It is also notably our first Mixture of Experts (MoE) model with 25 billion active parameters ands 218 billion total parameters.
Command A+ (command-a-plus-05-2026) is now available for all Cohere users through our standard API
endpoints. For enterprise customers, private deployment options are
available to ensure maximum security and control over your translation workflows.
For more detailed information about Command A+, including technical specifications and implementation examples, visit our model documentation.
Retirement of Embed v2.0 and Aya Expanse / Vision 8B
Effective April 4, 2026, the following models are no longer available. Requests using these model IDs will fail.
Retired models:
embed-english-v2.0embed-english-light-v2.0embed-multilingual-v2.0c4ai-aya-expanse-8bc4ai-aya-vision-8bWe recommend these replacements:
Embedding tasks
embed-english-v3.0embed-multilingual-v3.0embed-v4.0Chat tasks
command-r7b-12-2024command-a-03-2025command-a-reasoning-08-2025For the full announcement and lifecycle context, see the Deprecations page. For questions or assistance, contact support@cohere.com.
We’re pleased to announce the release of Cohere Transcribe, our first transcription model. Cohere Transcribe specializes in audio-in, text-out, automatic speech recognition (ASR).
cohere-transcribe-03-2026The model is available immediately through Cohere’s Audio Transcriptions API endpoint. You can start transcribing audio using the following example query:
You can access Cohere Transcribe via our API for free, low-setup experimentation subject to rate limits. See the Different Types of API Keys and Rate Limits page for usage details and integration guidance.
For production deployment without rate limits, provision a dedicated Model Vault. This enables low-latency, private cloud inference without having to manage infrastructure. Pricing is calculated per hour-instance, with discounted plans for longer-term commitments. Contact our team to discuss your requirements.
We’re pleased to announce the release of Rerank 4.0 our newest and most performant foundational model for ranking.
rerank-v4.0-pro: Optimized for state-of-the-art quality and complex use-casesrerank-v4.0-fast: Optimized for low latency and high throughput use-casesAs part of our ongoing commitment to delivering advanced AI solutions, we are deprecating the following models, features, and API endpoints:
Deprecated Models:
command-r-03-2024 (and the alias command-r)command-r-plus-04-2024 (and the alias command-r-plus)command-lightcommandsummarize (Refer to the migration guide for alternatives).For command model replacements, we recommend you use command-r-08-2024, command-r-plus-08-2024, or command-a-03-2025 (which is the strongest-performing model across domains) instead.
Retired Fine-Tuning Capabilities:
All fine-tuning options via dashboard and API for models including command-light, command, command-r, classify, and rerank are being retired. Previously fine-tuned models will no longer be accessible.
Deprecated Features and API Endpoints:
/v1/connectors (Managed connectors for RAG)/v1/chat parameters: connectors, search_queries_only/v1/generate (Legacy generative endpoint)/v1/summarize (Legacy summarization endpoint)/v1/classifyFor questions, reach out to support@cohere.com
We’re excited to announce the release of Command A Translate, Cohere’s first machine translation model. It achieves state-of-the-art performance at producing accurate, fluent translations across 23 languages.
The model is available immediately through Cohere’s Chat API endpoint. You can start translating text with simple prompts or integrate it programmatically into your applications.
Command A Translate (command-a-translate-08-2025) is now available for all Cohere users through our standard API endpoints. For enterprise customers, private deployment options are available to ensure maximum security and control over your translation workflows.
For more detailed information about Command A Translate, including technical specifications and implementation examples, visit our model documentation.
We’re excited to announce the release of Command A Reasoning, a hybrid reasoning model designed to excel at complex agentic tasks, in English and 22 other languages. With 111 billion parameters and a 256K context length, this model brings advanced reasoning capabilities to your applications through the familiar Command API interface.
Key Features
Technical Specifications
command-a-reasoning-08-2025Integrating Command A Reasoning is straightforward using the Chat API. Here’s a non-streaming example:
Customization Options
You can enable and disable thinking capabilities using the thinking parameter, and steer the model’s output with a flexible user-controlled thinking budget; for more details on token budgets, advanced configurations, and best practices, refer to our dedicated Reasoning documentation.
We’re excited to announce the release of Command A Vision, Cohere’s first commercial model capable of understanding and interpreting visual data alongside text. This addition to our Command family brings enterprise-grade vision capabilities to your applications with the same familiar Command API interface.
command-a-vision-07-2025Command A Vision excels in enterprise use cases including:
The API structure is identical to our existing Command models, making integration straightforward:
There’s much more to be said about working with images, various limitations, and best practices, which you can find in our dedicated Command A Vision and Image Inputs documents.
Announcing Cutting-Edge Cohere Models on OCI
We are thrilled to announce that the Oracle Cloud Infrastructure (OCI) Generative AI service now supports Cohere Command A, Rerank v3.5, Embed v3.0 multimodal. This marks a major advancement in providing OCI’s customers with enterprise-ready AI solutions.
Command A 03-2025 is the most performant Command model to date, delivering 150% of the throughput of its predecessor on only two GPUs.
Embed v3.0 is a cutting-edge AI search model enhanced with multimodal capabilities, allowing it to generate embeddings from both text and images.
Rerank 3.5, Cohere’s newest AI search foundation model, is engineered to improve the precision of enterprise search and retrieval-augmented generation (RAG) systems across a wide range of data formats (such as lengthy documents, emails, tables, JSON, and code) and in over 100 languages.
Check out Oracle’s announcement and documentation for more details.
We’re thrilled to announce the release of Embed 4, the most recent entrant into the Embed family of enterprise-focused large language models (LLMs).
Embed v4 is Cohere’s most performant search model to date, and supports the following new features:
Embed v4 achieves state of the art in the following areas:
Embed v4 is available today on the Cohere Platform, AWS Sagemaker, and Azure AI Foundry. For more information, check out our dedicated blog post here.