Published inUbiOps-techData Privacy and AI in HealthcareArtificial intelligence (AI) has the potential to significantly improve efficiency in the medical field. However, as the healthcare sector…Sep 4Sep 4
Published inUbiOps-techWhat is multi-model routing?Multi-model routing is a process of linking multiple AI models together. The routing can either be done in series or in parallel, meaning…Jun 20Jun 20
Published inUbiOps-techReducing inference costs for GenAIFor users of GenAI models, especially large language models (LLMs), inference costs remain one of the largest costs of using GenAI for…May 28May 28
Published inUbiOps-techHow to optimize inference speed using batching, vLLM, and UbiOpsLet’s learn how to increase data throughput for LLMs using batching, specifically by utilizing the vLLM library.May 16May 16
Published inUbiOps-techDeploy Llama 3 8B in under 15 minutes using UbiOpsWhat is special about the instruct version of Llama 3? Deploy it in 15 minutes!Apr 25Apr 25
Published inUbiOps-techDeploy Gemma 7B in under 15 minutes with UbiOpsWhat can you get out of this guide?Apr 19Apr 19
Published inUbiOps-techWhat is model serving?Model deployment or model serving designates the stage in which a trained model is brought to production and readily usable. A…Mar 19Mar 19
Published inUbiOps-techDeploy Gemma 2B for free using UbiOpsWhat can you get out of this guide?Mar 7Mar 7
Published inUbiOps-techTop 6 current LLM applications and use casesWe discussed how to classify a Large Language Model (LLM), so let’s talk about the different ways LLMs can be used in the real world. The…Feb 12Feb 12
Published inUbiOps-techWhich LLM to choose for your use case?Given the number of Large Language Models (LLMs) out there, finding one that meets your specific use case can be a daunting task. The field…Feb 5Feb 5