API Keys and Rate Limits
Cohere offers two kinds of API keys: trial keys (with a variety of attendant limitations), and production keys (which have no such limitations).
In this document, we’ll discuss some of the limitations associated with a trial key before turning to the parameters of a production key.
Trial Key Limitations
Trial keys are rate-limited depending on the endpoint you want to use:
- Chat: 20/min
- Cluster: 5 calls/min
- Embed: 5 calls/min
- EmbedJob: 5 calls/min
- Rerank: 10 calls/min
- Generate (legacy): 5 calls/min
- Summarize (legacy): 5 calls/min
Chat and the Coral user interface are limited to a total of 1,000 calls a month with a trial key. All remaining endpoints are limited to a total of 1,000 calls per month with a trial key.
If you’d like to use Cohere endpoints in a production application or require higher throughput from our endpoints for your usage, you can upgrade to a production key.
With a trial key:
- Organizations can still have unlimited trial keys in the free tier.
- There is a defined usage limit on all the development API keys per minute (all keys add up to that rate limit).
- When a developer/org reaches a rate limit, they will receive an error that they have exceeded the limit/minute.
- Playground usage counts toward your trial key rate limit.
- If calls exceed the throttling we throw an error that says “Trial keys are throttled.” Please upgrade your API key or contact us directly on Discord.
- Trial keys are free to use even after you upgrade to a Production key.
Production Key Specifications
Production keys for all endpoints are rate-limited at 1,000 calls per minute and are intended for serving Cohere in a public-facing application and testing purposes. Usage of production keys is metered at price points which can be found on our pricing page.
To get a production key, start by navigating to the API Keys page in your Cohere dashboard. You’ll either need to be the admin of your organization, or ask your organization Admin to complete these steps.
From there, click on Create Production key to finish the process.
The whole process should complete in less than three minutes, and enables you to generate a production key that you can use to serve Cohere APIs in production. If you deploy without completing the go to production workflow, your API key may be temporarily or permanently revoked.