Instantly scale AI and machine learning workloads on GPU on-demand
Deploy your model
July 31, 2024 / July 31, 2024 by [email protected]
In this guide, we’ll take you through the release of the new update from MetaAI. This update saw changes to the existing Llama 3 8B & 70B models, while also releasing a new model with 405B parameters (Llama 3.1 405B). We’ll also deploy a quantized version of the Llama 3.1 8B Instruct model to UbiOps. […]
Read more »
Tagged
Functionality
June 21, 2024 / June 21, 2024 by [email protected]
Multi-model routing is a process of linking multiple AI models together. The routing can either be done in series or in parallel, meaning that you use a router to send prompts to specific models. Example of a simple multi-model route Multi-modal routing can have various sorts of benefits. It enables you to have smaller and […]
Collaborations
June 20, 2024 / June 20, 2024 by [email protected]
How UbiOps and Arize help you stay in control LLMs are all the rage at the moment, and the APIs of closed source models like GPT-4 have made it easier than ever to leverage the power of AI. However, for a lot of regulated industries these closed source models are not an option. Luckily there […]
Deploy your model LLM
April 25, 2024 / April 25, 2024 by [email protected]
What can you get out of this guide? In this guide, we explain how to: To successfully complete this guide, make sure you have: You’ll also need the following files: What is Llama 3 8B? Llama 3 is the most recent model of the Llama series developed by Meta. It comes in two sizes, the […]
Deploy your model UbiOps
April 18, 2024 / April 18, 2024 by [email protected]
What can you get out of this guide? In this guide, we explain how to: To successfully complete this guide, make sure you have: You’ll also need the following files which are available in the appendix: What is Gemma 7B? Gemma is the latest model series released by Google in February 2024. It comes in […]