For AI agents: a documentation index is available at the root level at /llms.txt and /llms-full.txt. Append /llms.txt to any URL for a page-level index, or .md for the markdown version of any page.
DASHBOARDPLAYGROUNDDOCSCOMMUNITYLOG IN
Guides and conceptsAPI ReferenceRelease NotesLLMUCookbooks
Guides and conceptsAPI ReferenceRelease NotesLLMUCookbooks
  • Get Started
    • Introduction
    • Installation
    • Creating a client
    • Playground
    • FAQs
  • Models
    • An Overview of Cohere's Models
    • Aya
      • Command A+
      • Command A
      • Command A Reasoning
      • Command A Translate
      • Command A Vision
      • Command R7B
      • Command R+
      • Command R
    • Embed
    • Rerank
  • Text Generation
    • Introduction to Text Generation at Cohere
    • Using the Chat API
    • Reasoning
    • Image Inputs
    • Streaming Responses
    • Predictable Outputs
    • Advanced Generation Parameters
    • Tool Use
    • Tokens and Tokenizers
    • Summarizing Text
    • Safety Modes
  • Embeddings (Vectors, Search, Retrieval)
    • Introduction to Embeddings at Cohere
    • Semantic Search with Embeddings
    • Multimodal Embeddings
    • Batch Embedding Jobs
  • Going to Production
    • API Keys and Rate Limits
    • Going Live
    • Deprecations
    • How Does Cohere's Pricing Work?
  • Integrations
    • Integrating Embedding Models with Other Tools
    • Cohere and LangChain
    • LlamaIndex and Cohere
  • Deployment Options
    • Overview
    • SDK Compatibility
  • Tutorials
    • Cookbooks
    • LLM University
    • Build Things with Cohere!
    • Agentic RAG
    • Cohere on Azure
  • Responsible Use
    • Security
    • Usage Policy
    • Command A Technical Report
    • Command R and Command R+ Model Card
  • Cohere Labs
    • Cohere Labs Acceptable Use Policy
  • More Resources
    • Cohere Toolkit
    • Datasets
    • Improve Cohere Docs
LogoLogodocs
DASHBOARDPLAYGROUNDDOCSCOMMUNITYLOG IN
On this page
  • Description
  • What Can Command A Vision be Used For?
  • Limitations
ModelsCommand

Cohere's Command A Vision Model

Command A Vision model details and specifications
Was this page helpful?
Edit this page
Previous

Cohere's Command R7B Model

Command R7B model details and specifications
Next
Built with
Capabilities
MultilingualImage InputsSafety ModesCitationsStructured OutputsReasoningTool Use
Pricing

For both trial keys and production keys, Command A Vision is free until rate limits are reached. Learn more about rate limits for different models and key types here.

To use Command A Vision in production, please reach out to sales at sales@cohere.com.

Specifications
    Context Window: 128,000 tokens
    Max Output Tokens: 8,000 tokens
    Knowledge Cutoff: June 1, 2024
API Endpoints
Model ID
command-a-vision-07-2025
Chat V2Chat CompletionsChat V1
Try in Playground

Description

Command A Vision is Cohere’s first multimodal model capable of understanding and interpreting visual data alongside text. With a 128K context length and support for up to 20 images per request, Command Vision excels at enterprise use cases including document analysis, chart interpretation, optical character recognition (OCR), and processing images featuring multiple languages. The model maintains the same API interface as other Command models, making it easy to integrate vision capabilities into existing applications.

What Can Command A Vision be Used For?

Command A Vision is excellent in enterprise use cases such as:

  • Analysis of charts, graphs, and diagrams;
  • Extracting and understanding in-image tables;
  • Document optical character recognition (OCR) and question answering;
  • Natural-language image processing.

Limitations

Be aware that tool use isn’t supported with this model.

Also, it’s important to mention that Command A Vision can accept images as input, but doesn’t generate them.

For more detailed breakdowns of these and other applications, check out our cookbooks. To learn more about how token counts work, the maximum number of images, and so on, check out our dedicated Image Inputs document.