• Documentation
  • Support
  • Go to my Account
Menu
  • Documentation
  • Support
  • Go to my Account
  • Product
  • Solutions

    By industry

    • Public
    • Healthcare
    • Critical Infrastructure
    • Public
    • Healthcare
    • Critical Infrastructure

    By application

    • Generative AI
    • Computer Vision
    • Time Series
    • Generative AI
    • Computer Vision
    • Time Series

    On-demand GPU

    Instantly scale AI and machine learning workloads on GPU on-demand

    Learn more
  • Customers

    Featured customers

    Bayer and UbiOps
    Scaling computer vision workloads across GPUs
    Innovating with AI towards a digitally secure Netherlands
    Process and analyze enormous amount of data (50+ AI and data science apps) with UbiOps
    Personalized medicine with AI for immunotherapy treatment
    Optimization of district heating grids with IoT data
    All customer stories
  • Resources
    • Documentation
    • Video guides
    • Tutorials
    • Technical integrations
    • NVIDIA AI Enterprise
    • Documentation
    • Video guides
    • Tutorials
    • Technical integrations
    • NVIDIA AI Enterprise
    • Blog
    • Whitepapers
    • Webinars, Interviews & Talks
    • Github
    • UbiOps Training
    • Blog
    • Whitepapers
    • Webinars, Interviews & Talks
    • Github
    • UbiOps Training
  • Company
    • About us
    • UbiOps Partners
    • Contact
    • On-demand GPUs
    • About us
    • UbiOps Partners
    • Contact
    • On-demand GPUs
    • UbiOps for Research & Education
    • Slack Community
    • Jobs at UbiOps
    • UbiOps for Research & Education
    • Slack Community
    • Jobs at UbiOps

    Latest news

    Why is Hybrid Cloud Deployment Useful?

    UbiOps Revolutionizes AI Model Inference Using AMD Instinct 

  • Partners
  • Book a demo
  • Login
Contact Us
Try for free
Arize and UbiOps

Collaborations

Managing and monitoring your LLM applications

June 20, 2024 / June 20, 2024 by [email protected]

How UbiOps and Arize help you stay in control LLMs are all the rage at the moment, and the APIs of closed source models like GPT-4 have made it easier than ever to leverage the power of AI. However, for a lot of regulated industries these closed source models are not an option. Luckily there […]

Read more »

Tagged

aideployllmmonitoring

Deploy your model UbiOps

Deploy Mistral 7B v0.3 (Function Calling)

June 7, 2024 / June 7, 2024 by [email protected]

When Mistral released their Mistral 7B v0.2 model it was claimed to be the most powerful 7B Large Language Model (LLM) at that time. Now Mistral has released a new version, called Mistral 7B v0.3. The new version of Mistral 7B builds further on the success of the previous version. The model has an increased […]

Read more »

Reducing inference costs for GenAI

Functionality LLM

Reducing inference costs for GenAI

May 28, 2024 / May 28, 2024 by [email protected]

Reducing inference costs for GenAI

Read more »

Creating a front-end for your Mistral RAG

Functionality Technology

Creating a front-end for your Mistral RAG

May 22, 2024 / May 22, 2024 by [email protected]

In a previous article we showed how you can set up a Retrievel Augmented Generation (RAG) framework for the Mistral-7B-v.02 Instruct LLM using the UbiOps WebApp. In this article we’ll go a step further and create a front-end for that set-up using Streamlit, and we’ll be using the UbiOps Python Client Library to set-up the […]

Read more »

Tagged

llmmistralRAG

Functionality LLM

How to optimize inference speed using batching, vLLM, and UbiOps

May 15, 2024 / May 15, 2024 by [email protected]

In this guide, we will show you how to increase data throughput for LLMs using batching, specifically by utilizing the vLLM library. We will explain some of the techniques it leverages and show why they are useful. We will be looking at the PagedAttention algorithm in particular. Our setup will achieve impressive performance results and […]

Read more »

Functionality

How to benchmark and optimize LLM inference performance (for data scientists)

May 3, 2024 / May 3, 2024 by [email protected]

Introduction Optimizing inference is a machine learning (ML) engineer’s task. In a lot of cases, though, it tends to fall into the hands of data scientists. Whether you’re a data scientist deploying models as a hobby or whether you work in a team that lacks engineers, at some point you will probably have to start […]

Read more »

Tagged

benchmarkllm
deploy LLama3

Deploy your model LLM

Deploy Llama 3 8B in under 15 minutes using UbiOps

April 25, 2024 / April 25, 2024 by [email protected]

What can you get out of this guide? In this guide, we explain how to: To successfully complete this guide, make sure you have: You’ll also need the following files: What is Llama 3 8B? Llama 3 is the most recent model of the Llama series developed by Meta. It comes in two sizes, the […]

Read more »

Tagged

APIdeployhugging facellama3meta
How to build a RAG query engine with LlamaIndex and UbiOps

Technology UbiOps

How to build a RAG query engine with LlamaIndex and UbiOps

April 18, 2024 / April 18, 2024 by [email protected]

Large Language Models (LLMs) are trained on vast datasets with data sourced from the public internet. But these datasets of course do not include specific datapoints regarding your business or use case. Retrieval-Augmented Generation (RAG) addresses this by dynamically incorporating your data as context in a prompt to your LLM. This way there is no […]

Read more »

Tagged

llamaindexRAG
Deploy Gemma 7B in under 15 minutes with UbiOps (1)

Deploy your model UbiOps

Deploy Gemma 7B in under 15 minutes with UbiOps

April 18, 2024 / April 18, 2024 by [email protected]

What can you get out of this guide? In this guide, we explain how to: To successfully complete this guide, make sure you have: You’ll also need the following files which are available in the appendix: What is Gemma 7B? Gemma is the latest model series released by Google in February 2024. It comes in […]

Read more »

Tagged

7bdeploygemma

Product update Uncategorized

New UbiOps features April 2024 

April 9, 2024 / April 9, 2024 by [email protected]

On the 9th of April 2024 we have released new functionality and made improvements to our UbiOps SaaS product. An overview of the changes is given below. Python client library version for this release: 4.4.0 CLI version for this release: 3.4.0 https://youtu.be/3sZdCpmX030  Port forwarding (beta) If you want to run processes in a deployment that […]

Read more »

Fine-tune a model on your own documentation

Functionality Technology

Fine-tune a model on your own documentation

March 28, 2024 / March 28, 2024 by [email protected]

In this article, we will be creating a chatbot which is fine-tuned on custom documentation. We’ll use UbiOps—which is an AI deployment, serving and management platform—to fine-tune and deploy the instruction-tuned Mistral-7B model taken from Hugging Face. We’ll explain some of the methods used to fine-tune models, such as instruction tuning and domain adaptation, but […]

Read more »

Tagged

#AIfinetuneml
What is AI model serving_

Deploy your model Functionality Technology UbiOps

What is model serving?

March 19, 2024 / March 21, 2024 by [email protected]

Model deployment or model serving designates the stage in which a trained model is brought to production and readily usable. A model-serving platform allows you to easily deploy and monitor your models hassle-free. Below is the MLOps dev cycle and how UbiOps can be used within that cycle. How UbiOps fits into the MLOps dev […]

Read more »

Page navigation
  • ‹
  • Page 1
  • Current Page 2
  • Page 3
  • Page 4
  • ›
  • »

Get updates and news from UbiOps

Newsletter

Contact

Headquarters The Hague

Wilhelmina van Pruisenweg 104
2595 AN, The Hague
The Netherlands
+31 70 792 00 91

Amsterdam Office

LAB42, room L2.16, Science Park 900, 1098 XH Amsterdam, the Netherlands

Company

  • Documentation
  • Support
  • Contact Us
  • Go to my Account
  • Documentation
  • Support
  • Contact Us
  • Go to my Account

Follow us

Linkedin Youtube Github Medium

UbiOps is a trademark of
Dutch Analytics B.V. Reg. 66849381

Knowledge Base

  • Tutorials
  • Video Guides
  • Blogs and News
  • Book a Demo
  • Tutorials
  • Video Guides
  • Blogs and News
  • Book a Demo
  • UbiOps Terms and Conditions
  • Privacy Policy
  • Cookie declaration
  • UbiOps Terms and Conditions
  • Privacy Policy
  • Cookie declaration