In an effort to make our language-model capabilities more widely available, we’ve partnered with a few major platforms to create hosted versions of our offerings.

This document will guide you through enabling development teams to access Cohere’s offerings on Amazon SageMaker.

Prerequisites

In order to successfully subscribe to Cohere’s offerings on Amazon SageMaker, the user will need the following Identity and Access Management (IAM) permissions:

  • AmazonSageMakerFullAccess
  • aws-marketplace:ViewSubscriptions
  • aws-marketplace:Subscribe
  • aws-marketplace:Unsubscribe

These permissions allow a user to manage your organization’s Amazon SageMaker subscriptions. Learn more about managing Amazon’s IAM Permissions here. Contact your AWS administrator if you have questions about account permissions.

You’ll also need to install the AWS Python SDK and some related tooling. Run:

  • pip install cohere-aws (or pip install --upgrade cohere-aws if you want to upgrade to the most recent version of the SDK).

Cohere with Amazon SageMaker Setup

First, navigate to Cohere’s SageMaker Marketplace to view the available product offerings. Select the product offering to which you are interested in subscribing.

Next, explore the tools on the Product Detail page to evaluate how you want to configure your subscription. It contains information related to:

  • Pricing: This section allows you to estimate the cost of running inference on different types of instances.
  • Usage: This section contains the technical details around supported data formats for each model, and offers links to documentation and notebooks that will help developers scope out the effort required to integrate with Cohere’s models.
  • Subscribing: This section will once again present you with both the pricing details and the EULA for final review before you accept the offer. This information is identical to the information on Product Detail page.
  • Configuration: The primary goal of this section is to retrieve the Amazon Resource Name (ARN) for the product you have subscribed to.

Embeddings

You can use this code to invoke Cohere’s embed model on Amazon SageMaker:

PYTHON
1import cohere
2
3co = cohere.SageMakerClient(
4 aws_region="us-east-1",
5 aws_access_key="...",
6 aws_secret_key="...",
7 aws_session_token="...",
8)
9
10# Input parameters for embed. In this example we are embedding hacker news post titles.
11texts = ["Interesting (Non software) books?",
12 "Non-tech books that have helped you grow professionally?",
13 "I sold my company last month for $5m. What do I do with the money?",
14 "How are you getting through (and back from) burning out?",
15 "I made $24k over the last month. Now what?",
16 "What kind of personal financial investment do you do?",
17 "Should I quit the field of software development?"]
18input_type = "clustering"
19truncate = "NONE" # optional
20model_id = "<YOUR ENDPOINT NAME>" # On SageMaker, you create a model name that you'll pass here.
21
22
23# Invoke the model and print the response
24result = co.embed(
25 model=model_id,
26 input_type=input_type,
27 texts=texts,
28 truncate=truncate)
29
30print(result)

Text Generation

You can use this code to invoke Cohere’s Command models on Amazon SageMaker:

PYTHON
1import cohere
2
3co = cohere.SageMakerClient(
4 aws_region="us-east-1",
5 aws_access_key="...",
6 aws_secret_key="...",
7 aws_session_token="...",
8)
9
10# Invoke the model and print the response
11result = co.chat(message="Write a LinkedIn post about starting a career in tech:",
12 model="<YOUR ENDPOINT NAME>") # On SageMaker, you create a model name that you'll pass here.
13
14print(result)

Next Steps

With your selected configuration and Product ARN available, you now have everything you need to integrate with Cohere’s model offerings on SageMaker.

Cohere recommends your next step be to find the appropriate notebook in Cohere’s list of Amazon SageMaker notebooks, and follow the instructions there, or provide the link to Cohere’s SageMaker notebooks to your development team to implement. The notebooks are thorough, developer-centric guides that will enable your team to begin leveraging Cohere’s endpoints in production for live inference.

If you have further questions about subscribing or configuring Cohere’s product offerings on Amazon SageMaker, please contact our team at support+aws@cohere.com.