For AI agents: a documentation index is available at the root level at /llms.txt and /llms-full.txt. Append /llms.txt to any URL for a page-level index, or .md for the markdown version of any page.
DASHBOARDPLAYGROUNDDOCSCOMMUNITYLOG IN
Guides and conceptsAPI ReferenceRelease NotesLLMUCookbooks
Guides and conceptsAPI ReferenceRelease NotesLLMUCookbooks
  • Get Started
    • Introduction
    • Installation
    • Creating a client
    • Playground
    • FAQs
  • Models
    • An Overview of Cohere's Models
    • Aya
    • Embed
    • Rerank
  • Text Generation
    • Introduction to Text Generation at Cohere
    • Using the Chat API
    • Reasoning
    • Image Inputs
    • Streaming Responses
    • Predictable Outputs
    • Advanced Generation Parameters
    • Tool Use
    • Tokens and Tokenizers
    • Summarizing Text
    • Safety Modes
  • Embeddings (Vectors, Search, Retrieval)
    • Introduction to Embeddings at Cohere
    • Semantic Search with Embeddings
    • Multimodal Embeddings
    • Batch Embedding Jobs
  • Going to Production
    • API Keys and Rate Limits
    • Going Live
    • Deprecations
    • How Does Cohere's Pricing Work?
  • Integrations
    • Integrating Embedding Models with Other Tools
    • Cohere and LangChain
    • LlamaIndex and Cohere
  • Deployment Options
    • Overview
    • SDK Compatibility
  • Tutorials
    • Cookbooks
    • LLM University
    • Build Things with Cohere!
    • Agentic RAG
    • Cohere on Azure
  • Responsible Use
    • Security
    • Usage Policy
    • Command A Technical Report
    • Command R and Command R+ Model Card
  • Cohere Labs
    • Cohere Labs Acceptable Use Policy
  • More Resources
    • Cohere Toolkit
    • Datasets
    • Improve Cohere Docs
LogoLogodocs
DASHBOARDPLAYGROUNDDOCSCOMMUNITYLOG IN
On this page
  • Chat API (per model)
  • Other API Endpoints
Going to Production

Different Types of API Keys and Rate Limits

Was this page helpful?
Edit this page
Previous

Going Live with a Cohere Model

Next
Built with

Cohere offers two kinds of API keys: evaluation keys (free but limited in usage), and production keys (paid and much less limited in usage). You can create a trial or production key on the API keys page. For more details on pricing please see our pricing docs.

Prod keys work like trial keys for newer model variants such as Command A Reasoning. Please contact sales@cohere.com if you intend to use those models in production.
Trial keys (and prod keys on newer Chat model variants) are limited to 1,000 API calls a month.

Chat API (per model)

ModelTrial rate limitProduction rate limit
Command A+20 req / minContact sales@cohere.com
Command A Reasoning20 req / minContact sales@cohere.com
Command A Translate20 req / minContact sales@cohere.com
Command A Vision20 req / minContact sales@cohere.com
Command A20 req / min500 req / min
Command R+20 req / min500 req / min
Command R20 req / min500 req / min
Command R7B20 req / min500 req / min

Other API Endpoints

EndpointTrial rate limitProduction rate limit
Audio Transcriptions5 req / minContact sales@cohere.com
Embed2,000 inputs / min2,000 inputs / min
Embed (Images)5 inputs / min400 inputs / min
EmbedJob5 req / min50 req / min
Rerank10 req / min1,000 req / min
Tokenize100 req / min2,000 req / min
Default (anything not covered above)500 req / min500 req / min

If you have any questions or want to speak about getting a rate limit increase, reach out to support@cohere.com.