This guide serves as a reference for developers looking to update their code that uses Cohere API v1 in favor of the new v2 standard. It outlines the key differences and necessary changes when migrating from Cohere API v1 to v2 and the various aspects of the API, including chat functionality, RAG (Retrieval-Augmented Generation), and tool use. Each section provides code examples for both v1 and v2, highlighting the structural changes in request formats, response handling, and new features introduced in v2.

PYTHON

1 # ! pip install -U cohere
2 
3 import cohere
4 
5 # instantiating the old client
6 co_v1 = cohere.Client(api_key="<YOUR API KEY>")
7 
8 # instantiating the new client
9 co_v2 = cohere.ClientV2(api_key="<YOUR API KEY>")

General

v2: model is a required field for Embed, Rerank, Classify, and Chat.

Embed

v2: embedding_types is a required field for Embed.

Chat

Messages

Message structure:
- v1: uses separate preamble and message parameters.
- v2: uses a single messages parameter consisting of a list of roles (system, user, assistant, or tool). The system role in v2 replaces the preamble parameter in v1.
Chat history:
- v1: manages the chat history via the chat_history parameter.
- v2: manages the chat history via the messages list.

PYTHON

1 res = co_v1.chat(
2     model="command-a-03-2025",
3     preamble="You respond in concise sentences.",
4     chat_history=[
5         {"role": "user", "message": "Hello"},
6         {
7             "role": "chatbot",
8             "message": "Hi, how can I help you today?",
9         },
10     ],
11     message="I'm joining a new startup called Co1t today. Could you help me write a one-sentence introduction message to my teammates?",
12 )
13 
14 print(res.text)

Excited to join the team at Co1t, where I look forward to contributing my skills and collaborating with everyone to drive innovation and success.

PYTHON

1 res = co_v2.chat(
2     model="command-a-03-2025",
3     messages=[
4         {
5             "role": "system",
6             "content": "You respond in concise sentences.",
7         },
8         {"role": "user", "content": "Hello"},
9         {
10             "role": "assistant",
11             "content": "Hi, how can I help you today?",
12         },
13         {
14             "role": "user",
15             "content": "I'm joining a new startup called Co1t today. Could you help me write a one-sentence introduction message to my teammates.",
16         },
17     ],
18 )
19 
20 print(res.message.content[0].text)

Excited to join the team at Co1t, bringing my passion for innovation and a background in [your expertise] to contribute to the company's success!

Response content

v1: Accessed via text
v2: Accessed via message.content[0].text

PYTHON

1 res = co_v1.chat(model="command-a-03-2025", message="What is 2 + 2")
2 
3 print(res.text)

The answer is 4.

PYTHON

1 res = co_v2.chat(
2     model="command-a-03-2025",
3     messages=[{"role": "user", "content": "What is 2 + 2"}],
4 )
5 
6 print(res.message.content[0].text)

The answer is 4.

Streaming

Events containing content:
- v1: chunk.event_type == "text-generation"
- v2: chunk.type == "content-delta"
Accessing response content:
- v1: chunk.text
- v2: chunk.delta.message.content.text

PYTHON

1 message = "I'm joining a new startup called Co1t today. Could you help me write a one-sentence introduction message to my teammates."
2 
3 res = co_v1.chat_stream(model="command-a-03-2025", message=message)
4 
5 for chunk in res:
6     if chunk.event_type == "text-generation":
7         print(chunk.text, end="")

"Hi, I'm [your name] and I'm thrilled to join the Co1t team today as a [your role], eager to contribute my skills and ideas to help drive innovation and success for our startup!"

PYTHON

1 message = "I'm joining a new startup called Co1t today. Could you help me write a one-sentence introduction message to my teammates."
2 
3 res = co_v2.chat_stream(
4     model="command-a-03-2025",
5     messages=[{"role": "user", "content": message}],
6 )
7 
8 for chunk in res:
9     if chunk:
10         if chunk.type == "content-delta":
11             print(chunk.delta.message.content.text, end="")

"Hi everyone, I'm thrilled to join the Co1t team today and look forward to contributing my skills and ideas to drive innovation and success!"

RAG

Documents

v1: the documents parameter supports a list of objects with multiple fields per document.
v2: the documents parameter supports a few different options for structuring documents:
- List of objects with data object: same as v1 described above, but each document passed as a data object (with an optional id field to be used in citations).
- List of objects with data string (with an optional id field to be used in citations).
- List of strings.

PYTHON

1 # Define the documents
2 documents_v1 = [
3     {
4         "text": "Reimbursing Travel Expenses: Easily manage your travel expenses by submitting them through our finance tool. Approvals are prompt and straightforward."
5     },
6     {
7         "text": "Health and Wellness Benefits: We care about your well-being and offer gym memberships, on-site yoga classes, and comprehensive health insurance."
8     },
9 ]
10 
11 # The user query
12 message = "Are there fitness-related benefits?"
13 
14 # Generate the response
15 res_v1 = co_v1.chat(
16     model="command-a-03-2025",
17     message=message,
18     documents=documents_v1,
19 )
20 
21 print(res_v1.text)

Yes, there are fitness-related benefits. We offer gym memberships, on-site yoga classes, and comprehensive health insurance.

PYTHON

1 # Define the documents
2 documents_v2 = [
3     {
4         "data": {
5             "text": "Reimbursing Travel Expenses: Easily manage your travel expenses by submitting them through our finance tool. Approvals are prompt and straightforward."
6         }
7     },
8     {
9         "data": {
10             "text": "Health and Wellness Benefits: We care about your well-being and offer gym memberships, on-site yoga classes, and comprehensive health insurance."
11         }
12     },
13 ]
14 
15 # The user query
16 message = "Are there fitness-related benefits?"
17 
18 # Generate the response
19 res_v2 = co_v2.chat(
20     model="command-a-03-2025",
21     messages=[{"role": "user", "content": message}],
22     documents=documents_v2,
23 )
24 
25 print(res_v2.message.content[0].text)

Yes, we offer gym memberships, on-site yoga classes, and comprehensive health insurance.

The following is a list of the the different options for structuring documents for RAG in v2.

PYTHON

1 documents_v2 = [
2     # List of objects with data string
3     {
4         "id": "123",
5         "data": "I love penguins. they are fluffy",
6     },
7     # List of objects with data object
8     {
9         "id": "456",
10         "data": {
11             "text": "I love penguins. they are fluffy",
12             "author": "Abdullah",
13             "create_date": "09021989",
14         },
15     },
16     # List of strings
17     "just a string",
18 ]

Citations

Citations access:
- v1: citations
- v2: message.citations
Cited documents access:
- v1: documents
- v2: as part of message.citations, in the sources field

PYTHON

1 # Yes, there are fitness-related benefits. We offer gym memberships, on-site yoga classes, and comprehensive health insurance.
2 
3 print(res_v1.citations)
4 print(res_v1.documents)

[ChatCitation(start=50, end=124, text='gym memberships, on-site yoga classes, and comprehensive health insurance.', document_ids=['doc_1'])]
[{'id': 'doc_1', 'text': 'Health and Wellness Benefits: We care about your well-being and offer gym memberships, on-site yoga classes, and comprehensive health insurance.'}]

PYTHON

1 # Yes, we offer gym memberships, on-site yoga classes, and comprehensive health insurance.
2 
3 print(res_v2.message.citations)

[Citation(start=14, end=88, text='gym memberships, on-site yoga classes, and comprehensive health insurance.', sources=[DocumentSource(type='document', id='doc:1', document={'id': 'doc:1', 'text': 'Health and Wellness Benefits: We care about your well-being and offer gym memberships, on-site yoga classes, and comprehensive health insurance.'})])]

Search query generation

v1: Uses search_queries_only parameter
v2: Supported via tools. We recommend using the v1 API for this functionality in order to leverage the force_single_step feature. Support in v2 will be coming soon.

Connectors

v1: Supported via the connectors parameter
v2: Supported via user-defined tools.

Web search

v1: Supported via the web-search connector in the connectors parameter
v2: Supported via user-defined tools.

Uses the web search connector to search the internet for information relevant to the user’s query.

PYTHON

1 res_v1 = co_v1.chat(
2     message="who won euro 2024",
3     connectors=[{"id": "web-search"}],
4 )
5 
6 print(res_v1.text)

Spain won the UEFA Euro 2024, defeating England 2-1 in the final.

Web search functionality is supported via tools.

PYTHON

1 # Any search engine can be used. This example uses the Tavily API.
2 from tavily import TavilyClient
3 
4 tavily_client = TavilyClient(api_key=os.environ["TAVILY_API_KEY"])
5 
6 
7 # Create a web search function
8 def web_search(queries: list[str]) -> list[dict]:
9 
10     documents = []
11 
12     for query in queries:
13         response = tavily_client.search(query, max_results=2)
14 
15         results = [
16             {
17                 "title": r["title"],
18                 "content": r["content"],
19                 "url": r["url"],
20             }
21             for r in response["results"]
22         ]
23 
24         for idx, result in enumerate(results):
25             document = {"id": str(idx), "data": result}
26             documents.append(document)
27 
28     return documents
29 
30 
31 # Define the web search tool
32 web_search_tool = [
33     {
34         "type": "function",
35         "function": {
36             "name": "web_search",
37             "description": "Returns a list of relevant document snippets for a textual query retrieved from the internet",
38             "parameters": {
39                 "type": "object",
40                 "properties": {
41                     "queries": {
42                         "type": "array",
43                         "items": {"type": "string"},
44                         "description": "a list of queries to search the internet with.",
45                     }
46                 },
47                 "required": ["queries"],
48             },
49         },
50     }
51 ]
52 
53 # The user query
54 query = "who won euro 2024"
55 
56 # Define a system message to optimize search query generation
57 instructions = "Write a search query that will find helpful information for answering the user's question accurately. If you need more than one search query, write a list of search queries. If you decide that a search is very unlikely to find information that would be useful in constructing a response to the user, you should instead directly answer."
58 
59 messages = [
60     {"role": "system", "content": instructions},
61     {"role": "user", "content": query},
62 ]
63 
64 model = "command-a-03-2025"
65 
66 # Generate search queries (if any)
67 response = co_v2.chat(
68     model=model, messages=messages, tools=web_search_tool
69 )
70 
71 search_queries = []
72 
73 while response.message.tool_calls:
74 
75     print("Tool plan:")
76     print(response.message.tool_plan, "\n")
77     print("Tool calls:")
78     for tc in response.message.tool_calls:
79         print(
80             f"Tool name: {tc.function.name} | Parameters: {tc.function.arguments}"
81         )
82     print("=" * 50)
83 
84     messages.append(
85         {
86             "role": "assistant",
87             "tool_calls": response.message.tool_calls,
88             "tool_plan": response.message.tool_plan,
89         }
90     )
91 
92     # Step 3: Get tool results
93     for idx, tc in enumerate(response.message.tool_calls):
94         tool_result = web_search(**json.loads(tc.function.arguments))
95         tool_content = []
96         for data in tool_result:
97             tool_content.append(
98                 {
99                     "type": "document",
100                     "document": {"data": json.dumps(data)},
101                 }
102             )
103             # Optional: add an "id" field in the "document" object, otherwise IDs are auto-generated
104         messages.append(
105             {
106                 "role": "tool",
107                 "tool_call_id": tc.id,
108                 "content": tool_content,
109             }
110         )
111 
112     # Step 4: Generate response and citations
113     response = co_v2.chat(
114         model=model, messages=messages, tools=web_search_tool
115     )
116 
117 print(response.message.content[0].text)

Tool plan:
I will search for 'who won euro 2024' to find out who won the competition. 
Tool calls:
Tool name: web_search | Parameters: {"queries":["who won euro 2024"]}
==================================================
Spain won the 2024 European Championship. They beat England in the final, with substitute Mikel Oyarzabal scoring the winning goal.

Streaming

Event containing content:
- v1: chunk.event_type == "text-generation"
- v2: chunk.type == "content-delta"
Accessing response content:
- v1: chunk.text
- v2: chunk.delta.message.content.text
Events containing citations:
- v1: chunk.event_type == "citation-generation"
- v2: chunk.type == "citation-start"
Accessing citations:
- v1: chunk.citations
- v2: chunk.delta.message.citations

PYTHON

1 message = "Are there fitness-related benefits?"
2 
3 res_v1 = co_v1.chat_stream(
4     model="command-a-03-2025",
5     message=message,
6     documents=documents_v1,
7 )
8 
9 for chunk in res_v1:
10     if chunk.event_type == "text-generation":
11         print(chunk.text, end="")
12     if chunk.event_type == "citation-generation":
13         print(f"\n{chunk.citations}")

Yes, we offer gym memberships, on-site yoga classes, and comprehensive health insurance as part of our health and wellness benefits.
[ChatCitation(start=14, end=87, text='gym memberships, on-site yoga classes, and comprehensive health insurance', document_ids=['doc_1'])]
[ChatCitation(start=103, end=132, text='health and wellness benefits.', document_ids=['doc_1'])]

PYTHON

1 message = "Are there fitness-related benefits?"
2 
3 messages = [{"role": "user", "content": message}]
4 
5 res_v2 = co_v2.chat_stream(
6     model="command-a-03-2025",
7     messages=messages,
8     documents=documents_v2,
9 )
10 
11 for chunk in res_v2:
12     if chunk:
13         if chunk.type == "content-delta":
14             print(chunk.delta.message.content.text, end="")
15         if chunk.type == "citation-start":
16             print(f"\n{chunk.delta.message.citations}")

Yes, we offer gym memberships, on-site yoga classes, and comprehensive health insurance.
start=14 end=88 text='gym memberships, on-site yoga classes, and comprehensive health insurance.' sources=[DocumentSource(type='document', id='doc:1', document={'id': 'doc:1', 'text': 'Health and Wellness Benefits: We care about your well-being and offer gym memberships, on-site yoga classes, and comprehensive health insurance.'})]

Tool use

Tool definition

v1: uses Python types to define tools.
v2: uses JSON schema to define tools.

PYTHON

1 def get_weather(location):
2     return {"temperature": "20C"}
3 
4 
5 functions_map = {"get_weather": get_weather}
6 
7 tools_v1 = [
8     {
9         "name": "get_weather",
10         "description": "Gets the weather of a given location",
11         "parameter_definitions": {
12             "location": {
13                 "description": "The location to get weather, example: San Francisco, CA",
14                 "type": "str",
15                 "required": True,
16             }
17         },
18     },
19 ]

PYTHON

1 def get_weather(location):
2     return [{"temperature": "20C"}]
3     # You can return a list of objects e.g. [{"url": "abc.com", "text": "..."}, {"url": "xyz.com", "text": "..."}]
4 
5 
6 functions_map = {"get_weather": get_weather}
7 
8 tools_v2 = [
9     {
10         "type": "function",
11         "function": {
12             "name": "get_weather",
13             "description": "gets the weather of a given location",
14             "parameters": {
15                 "type": "object",
16                 "properties": {
17                     "location": {
18                         "type": "string",
19                         "description": "the location to get weather, example: San Fransisco, CA",
20                     }
21                 },
22                 "required": ["location"],
23             },
24         },
25     },
26 ]

Tool calling

Response handling
- v1: Tool calls accessed through response.tool_calls
- v2: Tool calls accessed through response.message.tool_calls
Chat history management
- v1: Tool calls stored in the response’s chat_history
- v2: Append the tool call details (tool_calls and tool_plan) to the messages list

PYTHON

1 message = "What's the weather in Toronto?"
2 
3 res_v1 = co_v1.chat(
4     model="command-a-03-2025", message=message, tools=tools_v1
5 )
6 
7 print(res_v1.tool_calls)

[ToolCall(name='get_weather', parameters={'location': 'Toronto'})]

PYTHON

1 messages = [
2     {"role": "user", "content": "What's the weather in Toronto?"}
3 ]
4 
5 res_v2 = co_v2.chat(
6     model="command-a-03-2025", messages=messages, tools=tools_v2
7 )
8 
9 if res_v2.message.tool_calls:
10     messages.append(
11         {
12             "role": "assistant",
13             "tool_calls": res_v2.message.tool_calls,
14             "tool_plan": res_v2.message.tool_plan,
15         }
16     )
17 
18     print(res_v2.message.tool_calls)

[ToolCallV2(id='get_weather_k88p0m8504w5', type='function', function=ToolCallV2Function(name='get_weather', arguments='{"location":"Toronto"}'))]

Tool call ID

v1: Tool calls do not emit tool call IDs
v2: Tool calls emit tool call IDs. This will help the model match tool results to the right tool call.

PYTHON

1 tool_results = [
2     {
3         "call": {
4             "name": "<tool name>",
5             "parameters": {"<param name>": "<param value>"},
6         },
7         "outputs": [{"<key>": "<value>"}],
8     },
9 ]

PYTHON

1 messages = [
2     {
3         "role": "tool",
4         "tool_call_id": "123",
5         "content": [
6             {
7                 "type": "document",
8                 "document": {
9                     "id": "123",
10                     "data": {"<key>": "<value>"},
11                 },
12             }
13         ],
14     }
15 ]

Response generation

Tool execution: Chat history management
- v1: Append call and outputs to the chat history
- v2: Append tool_call_id and tool_content to messages to the chat history
Tool execution: Tool results
- v1: Passed as tool_results parameter
- v2: Incorporated into the messages list as tool responses
User message
- v1: Set as empty ("")
- v2: No action required

PYTHON

1 tool_content_v1 = []
2 if res_v1.tool_calls:
3     for tc in res_v1.tool_calls:
4         tool_call = {"name": tc.name, "parameters": tc.parameters}
5         tool_result = functions_map[tc.name](**tc.parameters)
6         tool_content_v1.append(
7             {"call": tool_call, "outputs": [tool_result]}
8         )
9 
10 res_v1 = co_v1.chat(
11     model="command-a-03-2025",
12     message="",
13     tools=tools_v1,
14     tool_results=tool_content_v1,
15     chat_history=res_v1.chat_history,
16 )
17 
18 print(res_v1.text)

It is currently 20°C in Toronto.

PYTHON

1 if res_v2.message.tool_calls:
2     for tc in res_v2.message.tool_calls:
3         tool_result = functions_map[tc.function.name](
4             **json.loads(tc.function.arguments)
5         )
6         tool_content_v2 = []
7         for data in tool_result:
8             tool_content_v2.append(
9                 {
10                     "type": "document",
11                     "document": {"data": json.dumps(data)},
12                 }
13             )
14             # Optional: add an "id" field in the "document" object, otherwise IDs are auto-generated
15         messages.append(
16             {
17                 "role": "tool",
18                 "tool_call_id": tc.id,
19                 "content": tool_content_v2,
20             }
21         )
22 
23 res_v2 = co_v2.chat(
24     model="command-a-03-2025", messages=messages, tools=tools_v2
25 )
26 
27 print(res_v2.message.content[0].text)

It's 20°C in Toronto.

Citations

Citations access:
- v1: citations
- v2: message.citations
Cited tools access:
- v1: documents
- v2: as part of message.citations, in the sources field

PYTHON

1 print(res_v1.citations)
2 print(res_v1.documents)

[ChatCitation(start=16, end=20, text='20°C', document_ids=['get_weather:0:2:0'])]
[{'id': 'get_weather:0:2:0', 'temperature': '20C', 'tool_name': 'get_weather'}]

PYTHON

1 print(res_v2.message.citations)

[Citation(start=5, end=9, text='20°C', sources=[ToolSource(type='tool', id='get_weather_k88p0m8504w5:0', tool_output={'temperature': '20C'})])]

Streaming

Event containing content:
- v1: chunk.event_type == "text-generation"
- v2: chunk.type == "content-delta"
Accessing response content:
- v1: chunk.text
- v2: chunk.delta.message.content.text
Events containing citations:
- v1: chunk.event_type == "citation-generation"
- v2: chunk.type == "citation-start"
Accessing citations:
- v1: chunk.citations
- v2: chunk.delta.message.citations

PYTHON

1 tool_content_v1 = []
2 if res_v1.tool_calls:
3     for tc in res_v1.tool_calls:
4         tool_call = {"name": tc.name, "parameters": tc.parameters}
5         tool_result = functions_map[tc.name](**tc.parameters)
6         tool_content_v1.append(
7             {"call": tool_call, "outputs": [tool_result]}
8         )
9 
10 res_v1 = co_v1.chat_stream(
11     message="",
12     tools=tools_v1,
13     tool_results=tool_content_v1,
14     chat_history=res_v1.chat_history,
15 )
16 
17 for chunk in res_v1:
18     if chunk.event_type == "text-generation":
19         print(chunk.text, end="")
20     if chunk.event_type == "citation-generation":
21         print(f"\n{chunk.citations}")

It's 20°C in Toronto.
[ChatCitation(start=5, end=9, text='20°C', document_ids=['get_weather:0:2:0', 'get_weather:0:4:0'])]

PYTHON

1 if res_v2.message.tool_calls:
2     for tc in res_v2.message.tool_calls:
3         tool_result = functions_map[tc.function.name](
4             **json.loads(tc.function.arguments)
5         )
6         tool_content_v2 = []
7         for data in tool_result:
8             tool_content_v2.append(
9                 {
10                     "type": "document",
11                     "document": {"data": json.dumps(data)},
12                 }
13             )
14             # Optional: add an "id" field in the "document" object, otherwise IDs are auto-generated
15         messages.append(
16             {
17                 "role": "tool",
18                 "tool_call_id": tc.id,
19                 "content": tool_content_v2,
20             }
21         )
22 
23 res_v2 = co_v2.chat_stream(
24     model="command-a-03-2025", messages=messages, tools=tools_v2
25 )
26 
27 for chunk in res_v2:
28     if chunk:
29         if chunk.type == "content-delta":
30             print(chunk.delta.message.content.text, end="")
31         elif chunk.type == "citation-start":
32             print(f"\n{chunk.delta.message.citations}")

It's 20°C in Toronto.
start=5 end=9 text='20°C' sources=[ToolSource(type='tool', id='get_weather_k88p0m8504w5:0', tool_output={'temperature': '20C'})]

Citation quality (both RAG and tool use)

v1: controlled via citation_quality parameter
v2: controlled via citation_options parameter (with mode as a key)

Unsupported features in v2

The following v1 features are not supported in v2:

General chat
- conversation_id parameter (chat history is now managed by the developer via the messages parameter)
RAG
- search_queries_only parameter
- connectors parameter
- prompt_truncation parameter
Tool use
- force_single_step parameter (all tool calls are now multi-step by default)