• Documentation
  • Support
  • Go to my Account
Menu
  • Documentation
  • Support
  • Go to my Account
  • Product
  • Solutions

    By industry

    • Public
    • Healthcare
    • Critical Infrastructure
    • Public
    • Healthcare
    • Critical Infrastructure

    By application

    • Generative AI
    • Computer Vision
    • Time Series
    • Generative AI
    • Computer Vision
    • Time Series

    On-demand GPU

    Instantly scale AI and machine learning workloads on GPU on-demand

    Learn more
  • Customers

    Featured customers

    Bayer and UbiOps
    Scaling computer vision workloads across GPUs
    Innovating with AI towards a digitally secure Netherlands
    Process and analyze enormous amount of data (50+ AI and data science apps) with UbiOps
    Personalized medicine with AI for immunotherapy treatment
    Optimization of district heating grids with IoT data
    All customer stories
  • Resources
    • Documentation
    • Video guides
    • Tutorials
    • Technical integrations
    • NVIDIA AI Enterprise
    • Documentation
    • Video guides
    • Tutorials
    • Technical integrations
    • NVIDIA AI Enterprise
    • Blog
    • Whitepapers
    • Webinars, Interviews & Talks
    • Github
    • UbiOps Training
    • Blog
    • Whitepapers
    • Webinars, Interviews & Talks
    • Github
    • UbiOps Training
  • Company
    • About us
    • UbiOps Partners
    • Contact
    • On-demand GPUs
    • About us
    • UbiOps Partners
    • Contact
    • On-demand GPUs
    • UbiOps for Research & Education
    • Slack Community
    • Jobs at UbiOps
    • UbiOps for Research & Education
    • Slack Community
    • Jobs at UbiOps

    Latest news

    Why is Hybrid Cloud Deployment Useful?

    UbiOps Revolutionizes AI Model Inference Using AMD Instinct 

  • Partners
  • Book a demo
  • Login
Contact Us
Try for free

Tag: llm

Deploy your model

Deploy Llama 3.1 8B Instruct on UbiOps

July 31, 2024 / July 31, 2024 by [email protected]

In this guide, we’ll take you through the release of the new update from MetaAI. This update saw changes to the existing Llama 3 8B & 70B models, while also releasing a new model with 405B parameters (Llama 3.1 405B). We’ll also deploy a quantized version of the  Llama 3.1 8B Instruct model to UbiOps. […]

Read more »

Tagged

deployllama3.1llm
Arize and UbiOps

Collaborations

Managing and monitoring your LLM applications

June 20, 2024 / June 20, 2024 by [email protected]

How UbiOps and Arize help you stay in control LLMs are all the rage at the moment, and the APIs of closed source models like GPT-4 have made it easier than ever to leverage the power of AI. However, for a lot of regulated industries these closed source models are not an option. Luckily there […]

Read more »

Tagged

aideployllmmonitoring
Creating a front-end for your Mistral RAG

Functionality Technology

Creating a front-end for your Mistral RAG

May 22, 2024 / May 22, 2024 by [email protected]

In a previous article we showed how you can set up a Retrievel Augmented Generation (RAG) framework for the Mistral-7B-v.02 Instruct LLM using the UbiOps WebApp. In this article we’ll go a step further and create a front-end for that set-up using Streamlit, and we’ll be using the UbiOps Python Client Library to set-up the […]

Read more »

Tagged

llmmistralRAG

Functionality

How to benchmark and optimize LLM inference performance (for data scientists)

May 3, 2024 / May 3, 2024 by [email protected]

Introduction Optimizing inference is a machine learning (ML) engineer’s task. In a lot of cases, though, it tends to fall into the hands of data scientists. Whether you’re a data scientist deploying models as a hobby or whether you work in a team that lacks engineers, at some point you will probably have to start […]

Read more »

Tagged

benchmarkllm

Deploy your model Technology

OpenAI vs. open-source LLM: Which model is best for your use case?

March 5, 2024 / March 11, 2024 by [email protected]

Introduction In a recent workshop on Large Language Models (LLMs), we asked attendees which LLMs they’re using or considering for their use cases. It turns out everyone was primarily focused on open-source LLMs like Mistral 7B, Llama 2, and Falcon. This left us wondering about a recent report by Reuters on OpenAI’s growth.  The report […]

Read more »

Tagged

llmOpenAIopensourceusecase

Technology UbiOps

Which LLM to choose for your use case? 

January 31, 2024 / February 1, 2024 by [email protected]

Given the number of Large Language Models (LLMs) out there, finding one that meets your specific use case can be a daunting task. The field is evolving rapidly, with new models and fine-tuned versions being released every single week. It follows that any list of LLMs and how they should be applied will be rapidly […]

Read more »

Tagged

aillmmlusecase
Implementing RAG for your LLM (Mistral)

Functionality Technology

Implementing RAG for your LLM (Mistral)

January 30, 2024 / February 20, 2024 by [email protected]

Most of the open-source models available on Huggingface come pre-trained on a large corpus of publicly available data, like WebText. In general, the size of these datasets give large language models (LLMs) an adequate performance for various use cases. For some, more specific, use cases, however, more domain specific knowledge is required for the LLM […]

Read more »

Tagged

llmmistralRAG
Falcon LLM fine - tuning

Functionality

Falcon LLM fine-tuning

December 18, 2023 / December 18, 2023 by [email protected]

In the good old days machine learning models were made from scratch by data scientists. This involved acquiring, and cleaning data before training a model and getting it to production. In recent years, though, the size of models has increased, and thus the training data required to train these new larger models as well. This […]

Read more »

Tagged

falconfine-tunellm
Sidebar

Latest news

  • February 5, 2025
  • October 10, 2024

Follow us

Linkedin Youtube Github Medium

Get updates and news from UbiOps

Newsletter

Contact

Headquarters The Hague

Wilhelmina van Pruisenweg 104
2595 AN, The Hague
The Netherlands
+31 70 792 00 91

Amsterdam Office

LAB42, room L2.16, Science Park 900, 1098 XH Amsterdam, the Netherlands

Company

  • Documentation
  • Support
  • Contact Us
  • Go to my Account
  • Documentation
  • Support
  • Contact Us
  • Go to my Account

Follow us

Linkedin Youtube Github Medium

UbiOps is a trademark of
Dutch Analytics B.V. Reg. 66849381

Knowledge Base

  • Tutorials
  • Video Guides
  • Blogs and News
  • Book a Demo
  • Tutorials
  • Video Guides
  • Blogs and News
  • Book a Demo
  • UbiOps Terms and Conditions
  • Privacy Policy
  • Cookie declaration
  • UbiOps Terms and Conditions
  • Privacy Policy
  • Cookie declaration