For both trial keys and production keys, Command A Reasoning is free until rate limits are reached. Learn more about rate limits for different models and key types here.
Command A Reasoning can be used in production through Cohere's Model Vault.
Command A Reasoning is Cohere’s first reasoning model to date, excelling at real world enterprise tasks including tool use, retrieval augmented generation (RAG), agents, and multilingual use cases. At 111B parameters, Command A has a context length of 256K. Command-a-Reasoning (CAR) is optimized to run on 4x H100 GPUs for production workloads. For non-production tasks such as tryouts and evaluations, it can also run on 4x A100 GPUs.
Command A is excellent for:
There’s more to be said about token budgets, enabling and disabling the thinking operation, etc., which can be found in our dedicated Reasoning guide.