Cohere Toolkit is a collection of pre-built components enabling developers to quickly build and deploy retrieval augmented generation (RAG) applications. With it, you can cut time-to-launch down from months to weeks, and deploy in as little as a few minutes.
The pre-built components fall into two big categories: front-end and back end.
Here’s an image that shows how these different components work together:
You can get started quickly with toolkit on Google Cloud Run, Microsoft Azure, or locally. Read this for more details, including CLI commands to run after cloning the repo and environment variables that need to be set.
The toolkit can be deployed on single containers, AWS ECS, and GCP. Find out how here.
If you want to configure old retrieval chains or add new ones, you’ll need to work through a few steps. These include installing poetry, setting up your local database, testing, etc. More context is available here.
The toolkit is powerful and flexible. There’s a lot you can do with it, including adding your own model deployment, calling the toolkit’s backend over the API, adding a connector, and much else besides.
Following the links in this document or read the full repository to find everything you need!