Announcing the Cohere Transcribe model

We’re pleased to announce the release of Cohere Transcribe, our first transcription model. Cohere Transcribe specializes in audio-in, text-out, automatic speech recognition (ASR).

Technical details

  • Model name: cohere-transcribe-03-2026
  • Input: Audio waveform
  • Output: Text
  • Languages covered: English, German, French, Italian, Spanish, Portuguese, Greek, Dutch, Polish, Vietnamese, Chinese, Arabic, Japanese, Korean.
  • License: Apache 2.0
  • API endpoint: Audio Transcriptions API

Getting started

The model is available immediately through Cohere’s Audio Transcriptions API endpoint. You can start transcribing audio using the following example query:

PYTHON
1import cohere
2
3co = cohere.ClientV2()
4
5response = co.audio.transcriptions.create(
6 model="cohere-transcribe-03-2026",
7 language="en",
8 file=open("./sample.wav", "rb"),
9)
10
11print(response)

Availability

You can access Cohere Transcribe via our API for free, low-setup experimentation subject to rate limits. See the Different Types of API Keys and Rate Limits page for usage details and integration guidance.

For production deployment without rate limits, provision a dedicated Model Vault. This enables low-latency, private cloud inference without having to manage infrastructure. Pricing is calculated per hour-instance, with discounted plans for longer-term commitments. Contact our team to discuss your requirements.