Nikolay Penkov
How to add memory to a chat LLM model
Large Language Models (LLMs) exhibit remarkable capabilities as standalone solutions for various natural language processing tasks. Without…
7 min read · Feb 18, 2024
Nikolay Penkov
RAG basics using a self hosted OpenAI compatible LLM server
Advanced AI language models, such as OpenAI’s ChatGPT and Google’s Gemini, have been at the forefront of driving innovations in various…
9 min read · Feb 8, 2024
Nikolay Penkov
[Part 3] How models like ChatGPT work? — Depicting the Transformer architecture
Alright, here we are, Part 3 of our journey towards understanding the inner workings of LLMs like ChatGPT. To recap, in Part 1 we…
11 min read · Feb 1, 2024
Nikolay Penkov
[Part 2] How models like ChatGPT work? — Depicting the Transformer architecture
What we have covered in Part 1 of this post are the basic building blocks of the Transformer architecture. Once you have coded the basic…
9 min read · Jan 27, 2024
Nikolay Penkov
[Part 1] How models like ChatGPT work? — Depicting the Transformer architecture
If you’ve stumbled upon this page, chances are you’ve already caught wind of the buzz surrounding the revolutionary language model ChatGPT…
10 min read · Jan 26, 2024
Nikolay Penkov
Creating your own dataset for LLM training using Label Studio
In the realm of Language Model Training, the critical task of data labeling takes center stage. To empower the next generation of…
4 min read · Nov 10, 2023
Nikolay Penkov
How to deploy LLama 2 as an AWS Lambda function for scalable serverless inference
AWS Lambda is a powerful serverless computing service, offering a myriad of advantages, such as auto-scaling, cost-effectiveness, and ease…
6 min read · Oct 31, 2023
Nikolay Penkov
How to run Llama 2 locally on CPU + serving it as a Docker container
In today’s digital landscape, large language models are becoming increasingly widespread, revolutionizing the way we interact with…
8 min read · Oct 29, 2023