LLM Archives - UbiOps - AI model serving, orchestration & training

Reducing inference costs for GenAI

Functionality LLM

Reducing inference costs for GenAI

May 28, 2024 / May 28, 2024 by UbiOps

Reducing inference costs for GenAI

Functionality LLM

How to optimize inference speed using batching, vLLM, and UbiOps

May 15, 2024 / May 15, 2024 by UbiOps

In this guide, we will show you how to increase data throughput for LLMs using batching, specifically by utilizing the vLLM library. We will explain some of the techniques it leverages and show why they are useful. We will be looking at the PagedAttention algorithm in particular. Our setup will achieve impressive performance results and […]

deploy LLama3

Deploy your model LLM

Deploy Llama 3 8B in under 15 minutes using UbiOps

April 25, 2024 / April 25, 2024 by UbiOps

What can you get out of this guide? In this guide, we explain how to: To successfully complete this guide, make sure you have: You’ll also need the following files: What is Llama 3 8B? Llama 3 is the most recent model of the Llama series developed by Meta. It comes in two sizes, the […]

Tagged

API deploy hugging face llama3 meta

LLM Technology UbiOps

Top 6 current LLM applications and use cases

February 29, 2024 / February 29, 2024 by UbiOps

We discussed how to classify a Large Language Model (LLM), so let’s talk about the different ways LLMs can be used in the real world. The potential applications of LLMs are countless, and their limits have yet to be crossed. However, this article should give you a general idea of some of the ways LLMs […]