Aya Family of Models
Aya is a family of multilingual large language models that are designed to expand the number of languages covered by generative AI. Its 8-billion and 32-billion parameter “Expanse” offerings are optimized to perform well in these 23 languages: Arabic, Chinese (simplified & traditional), Czech, Dutch, English, French, German, Greek, Hebrew, Hebrew, Hindi, Indonesian, Italian, Japanese, Korean, Persian, Polish, Portuguese, Romanian, Russian, Spanish, Turkish, Ukrainian, and Vietnamese.
Model Details
How Can I Get Access to the Aya Models?
If you want to test Aya, you have three options. First (and simplest), you can use the Cohere playground or Hugging Face Space to play around with them and see what they’re capable of.
Second, you can use the Cohere Chat API to work with Aya programmatically. Here’s a very lightweight example of using the Cohere SDK to create a Spanish-language tutor with Aya that tells a story with simple Spanish vocabulary (NOTE: you’ll need an API key to run this code, and if you haven’t installed the Cohere SDK you can do that with pip install cohere
).
And here’s an example output (remember: these models are stochastic, and what you see might look quite different).
Finally, you can directly download the raw models for research purposes because Cohere For AI has released Aya Expanse 8B and Aya Expanse 32B as open-weight models, through HuggingFace. What’s more, the massively multilingual instruction data used for development of these models has been made available for download as well.
Find More
We hope you’ve found this as fascinating as we do! If you want to see more substantial projects you can check out these notebooks (source):