Data Privacy and AI in Healthcare (CL in UbiOps-tech, Sep 4): Artificial intelligence (AI) has the potential to significantly improve efficiency in the medical field. However, as the healthcare sector…
What is multi-model routing? (CL in UbiOps-tech, Jun 20): Multi-model routing is a process of linking multiple AI models together. The routing can either be done in series or in parallel, meaning…
Reducing inference costs for GenAI (CL in UbiOps-tech, May 28): For users of GenAI models, especially large language models (LLMs), inference costs remain one of the largest costs of using GenAI for…
How to optimize inference speed using batching, vLLM, and UbiOps (CL in UbiOps-tech, May 16): Let’s learn how to increase data throughput for LLMs using batching, specifically by utilizing the vLLM library.
Deploy Llama 3 8B in under 15 minutes using UbiOps (CL in UbiOps-tech, Apr 25): What is special about the instruct version of Llama 3? Deploy it in 15 minutes!
Deploy Gemma 7B in under 15 minutes with UbiOps (CL in UbiOps-tech, Apr 19): What can you get out of this guide?
What is model serving? (CL in UbiOps-tech, Mar 19): Model deployment or model serving designates the stage in which a trained model is brought to production and readily usable. A…
Top 6 current LLM applications and use cases (CL in UbiOps-tech, Feb 12): We discussed how to classify a Large Language Model (LLM), so let’s talk about the different ways LLMs can be used in the real world. The…
Which LLM to choose for your use case? (CL in UbiOps-tech, Feb 5): Given the number of Large Language Models (LLMs) out there, finding one that meets your specific use case can be a daunting task. The field…