Different Types of API Keys and Rate Limits

Cohere offers two kinds of API keys: evaluation (trial) keys, which are free but limited in usage, and production keys, which are paid and have much higher limits. You can create either kind of key on the API keys page. For pricing details, please see our pricing docs.

The table below shows the rate limits for each endpoint, expressed in requests per minute (20/min means 20 requests per minute).

| Endpoint | Trial rate limit | Production rate limit |
| --- | --- | --- |
| Chat | 20/min | 500/min |
| Embed | 100/min | 2,000/min |
| Embed (Images) | 5/min | 400/min |
| Rerank | 10/min | 1,000/min |
| Tokenize | 100/min | 2,000/min |
| Classify | 100/min | 1,000/min |
| EmbedJob | 5/min | 50/min |
| Summarize (legacy) | 5/min | 500/min |
| Generate (legacy) | 5/min | 500/min |
| Default (anything not covered above) | 500/min | 500/min |

In addition, all endpoints are limited to 1,000 calls per month with a trial key.
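
One way to stay under a per-minute limit is to throttle requests on the client side before they are sent. The sketch below is a minimal sliding-window limiter in plain Python; it is not part of the Cohere SDK. The class name `MinuteRateLimiter`, the `TRIAL_LIMITS_PER_MIN` dictionary, and the injectable `clock` and `sleep` parameters are all hypothetical names chosen for illustration. It assumes limits are counted over a rolling 60-second window, which this page does not specify, and it does not track the monthly trial cap.

```python
import time
from collections import deque

# A few trial-key limits copied from the table above (requests per minute).
# The dictionary name and keys are illustrative, not an official mapping.
TRIAL_LIMITS_PER_MIN = {"chat": 20, "embed": 100, "rerank": 10}


class MinuteRateLimiter:
    """Client-side limiter: at most `limit` calls per `window` seconds."""

    def __init__(self, limit, window=60.0, clock=time.monotonic, sleep=time.sleep):
        self.limit = limit
        self.window = window
        self._clock = clock    # injectable so the limiter can be tested without waiting
        self._sleep = sleep
        self._calls = deque()  # timestamps of calls still inside the window

    def acquire(self):
        """Block until a call is allowed, record it, and return the seconds waited."""
        waited = 0.0
        while True:
            now = self._clock()
            # Drop timestamps that have aged out of the window.
            while self._calls and now - self._calls[0] >= self.window:
                self._calls.popleft()
            if len(self._calls) < self.limit:
                self._calls.append(now)
                return waited
            # Wait until the oldest recorded call leaves the window.
            delay = self.window - (now - self._calls[0])
            self._sleep(delay)
            waited += delay
```

To use it, create one limiter per endpoint, for example `chat_limiter = MinuteRateLimiter(TRIAL_LIMITS_PER_MIN["chat"])`, and call `chat_limiter.acquire()` immediately before each Chat request. A client-side limiter like this only reduces the chance of hitting the server-side limit; it does not replace handling the error returned when the limit is exceeded.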

If you have any questions or would like to request a rate limit increase, reach out to support@cohere.com.
