Instantly scale AI and machine learning workloads on GPU on-demand
Functionality Technology
September 17, 2024 / September 17, 2024 by [email protected]
In a linkedin post by Fergal Mcgovern in May, he tries to explain why around 83% of enterprise CIOs plan to place some workloads on-premise instead of on-cloud. Let’s briefly explain what we mean when we say on-cloud and on-premise: On-cloud storage is when data is stored in data centers operated by third parties which […]
Read more »
Technology UbiOps
September 4, 2024 / September 4, 2024 by [email protected]
Artificial intelligence (AI) has the potential to significantly improve efficiency in the medical field. However, as the healthcare sector has very sensitive data, organizations and regulators need to make sure that the technology they employ is safe and effective. In healthcare, data is very personal and contains a lot of sensitive data about the patient. […]
August 29, 2024 / August 29, 2024 by [email protected]
Machine learning operations (MLOps) involve a set of techniques and principles aimed at the design, development, deployment, and maintenance of machine learning models for production use. The purpose of MLOps is to establish a clear set of guidelines to simplify the complex process of bringing a model into production. You can also learn more about […]
Tagged
Product update Technology
July 11, 2024 / July 11, 2024 by [email protected]
On the 11th of July 2024 we have released new functionality and made improvements to our UbiOps SaaS product. An overview of the changes is given below. Python client library version for this release: 4.5.0 CLI version for this release: 2.22.0 https://www.youtube.com/watch?v=jiM9JWMakCQ More information about active instances To provide more information on what’s […]
Deploy your model UbiOps
June 7, 2024 / June 7, 2024 by [email protected]
When Mistral released their Mistral 7B v0.2 model it was claimed to be the most powerful 7B Large Language Model (LLM) at that time. Now Mistral has released a new version, called Mistral 7B v0.3. The new version of Mistral 7B builds further on the success of the previous version. The model has an increased […]
May 22, 2024 / May 22, 2024 by [email protected]
In a previous article we showed how you can set up a Retrievel Augmented Generation (RAG) framework for the Mistral-7B-v.02 Instruct LLM using the UbiOps WebApp. In this article we’ll go a step further and create a front-end for that set-up using Streamlit, and we’ll be using the UbiOps Python Client Library to set-up the […]
April 18, 2024 / April 18, 2024 by [email protected]
Large Language Models (LLMs) are trained on vast datasets with data sourced from the public internet. But these datasets of course do not include specific datapoints regarding your business or use case. Retrieval-Augmented Generation (RAG) addresses this by dynamically incorporating your data as context in a prompt to your LLM. This way there is no […]
What can you get out of this guide? In this guide, we explain how to: To successfully complete this guide, make sure you have: You’ll also need the following files which are available in the appendix: What is Gemma 7B? Gemma is the latest model series released by Google in February 2024. It comes in […]
March 28, 2024 / March 28, 2024 by [email protected]
In this article, we will be creating a chatbot which is fine-tuned on custom documentation. We’ll use UbiOps—which is an AI deployment, serving and management platform—to fine-tune and deploy the instruction-tuned Mistral-7B model taken from Hugging Face. We’ll explain some of the methods used to fine-tune models, such as instruction tuning and domain adaptation, but […]
Deploy your model Functionality Technology UbiOps
March 19, 2024 / March 21, 2024 by [email protected]
Model deployment or model serving designates the stage in which a trained model is brought to production and readily usable. A model-serving platform allows you to easily deploy and monitor your models hassle-free. Below is the MLOps dev cycle and how UbiOps can be used within that cycle. How UbiOps fits into the MLOps dev […]
Deploy your model Technology
March 5, 2024 / March 11, 2024 by [email protected]
Introduction In a recent workshop on Large Language Models (LLMs), we asked attendees which LLMs they’re using or considering for their use cases. It turns out everyone was primarily focused on open-source LLMs like Mistral 7B, Llama 2, and Falcon. This left us wondering about a recent report by Reuters on OpenAI’s growth. The report […]
March 4, 2024 / March 4, 2024 by [email protected]
In this article, we will analyze some of the drawbacks of Databricks when it comes to machine learning – specifically when it comes to the deployment stage of a model. Following this, we’ll explain why using UbiOps with Databricks makes sense, and follow up with some of the UbiOps platform’s core strengths. What is Databricks? […]
LLM Technology UbiOps
February 29, 2024 / February 29, 2024 by [email protected]
We discussed how to classify a Large Language Model (LLM), so let’s talk about the different ways LLMs can be used in the real world. The potential applications of LLMs are countless, and their limits have yet to be crossed. However, this article should give you a general idea of some of the ways LLMs […]
February 7, 2024 / February 7, 2024 by [email protected]
The world’s attention shifted when OpenAI made generative AI accessible to the public with ChatGPT. Then, the game changed again when GPT-4 was released – able to browse the internet with Bing and provide even more accurate information, including sources. Undoubtedly, the uprising of GenAI is already significantly impacting our way of working. But how […]
January 31, 2024 / February 1, 2024 by [email protected]
Given the number of Large Language Models (LLMs) out there, finding one that meets your specific use case can be a daunting task. The field is evolving rapidly, with new models and fine-tuned versions being released every single week. It follows that any list of LLMs and how they should be applied will be rapidly […]
January 30, 2024 / February 20, 2024 by [email protected]
Most of the open-source models available on Huggingface come pre-trained on a large corpus of publicly available data, like WebText. In general, the size of these datasets give large language models (LLMs) an adequate performance for various use cases. For some, more specific, use cases, however, more domain specific knowledge is required for the LLM […]
Blog Technology
January 10, 2024 / January 13, 2025 by [email protected]
What can you get out of this guide? In this guide, we explain how you can deploy any version of Falcon that’s available on Huggingface within 15 minutes. For this guide we’ll need to do the following: To successfully complete this guide, make sure you have: You’ll also need the following files: What is Falcon-7b-instruct? […]
Functionality UbiOps
December 28, 2023 / July 3, 2024 by [email protected]
Facilitate a hybrid-cloud strategy and save weeks of work!
Deploy your model Technology UbiOps
November 21, 2023 / January 8, 2024 by [email protected]
In October 2023 MistralAI released Mistral 7B, which is claimed to be the most powerful Large Language Model (LLM) to date for its size. As the name suggests the model has 7,3 billion parameters, and outperforms LLaMa 2 13B and LLaMa 1 34B on several benchmarks: Mistral 7B benchmarked against LLaMa 1 7 LLaMa 2 […]
August 15, 2023 / September 6, 2023 by [email protected]
Introduction A Graphical Processing Unit (GPU) is a processor that is made up of smaller, more specialized cores. Originally designed to accelerate graphical calculations, GPUs were developed to work in parallel processing, which means that they are able to process data simultaneously in order to complete tasks more quickly. In other words, GPUs are able […]
August 4, 2023 / January 22, 2024 by [email protected]
What can you get out of this guide? In this guide, we explain how to deploy LLaMa 2, an open-source Large Language Model (LLM), using UbiOps for easy model hosting and Streamlit for creating a chatbot UI. The guide provides step-by-step instructions for packaging a deployment, loading it into UbiOps, configuring compute on GPUs and […]
July 28, 2023 / August 17, 2023 by [email protected]
Artificial Intelligence (AI) has emerged as a world-changing technology with a wide range of applications across industries. From virtual assistants to autonomous vehicles and advanced data analytics, AI has started to revolutionize the way we live and work. However, most AI algorithms require a very large amount of computational power to process and analyze all the necessary data. This is where […]
June 25, 2023 / July 26, 2023 by [email protected]
One of the big reasons for the increased usage of AI on the web is the availability of open source foundation models. Increasingly, Artificial Intelligence (AI) lies at the heart of online tools and applications. For example, the global chatbot market is expected to reach $1 billion dollars by 2024, because they can save companies […]
Functionality Technology Whitepapers
The IT spending of financial institutions all over the world is steadily increasing, and is expected to reach over $750 billion dollars by 2025. This is partly because of the significant increase in the development and deployment of AI systems. AI-powered systems can process large volumes of data very quickly and at a large scale. […]