Luc Nguyen in GoPenAI
Understanding the Basics of Reinforcement Learning
Are you curious about a popular topic in machine learning called Reinforcement Learning from Human Feedback (RLHF)?
May 14

Luc Nguyen in GoPenAI
Deep Dive into Reinforcement Learning from Human Feedback
In a previous post, I covered how to build a Large Language Model (LLM), including training it with online data, using Supervised…
Mar 12

Luc Nguyen in GoPenAI
Constructing Knowledge Graphs: A Guide to Using OpenAI and Pyvis
Knowledge graphs have revolutionized the way we organize and analyze data. By visually depicting entities and their interconnections, they…
Feb 26

Luc Nguyen
Fine-Tuning Llama model for special task
In the last post, I showed a simple example of how to fine-tune the Llama model to answer unique questions not commonly found on the…
Feb 19

Luc Nguyen
Things need to know before fine-tuning LLM models
Although I’ve used the OpenAI API for over a year, fine-tuning with open-source models like Llama-2 posed challenges. Dealing with…
Jan 28

Luc Nguyen
Llama 2 Using Huggingface Part 1
In my last blog post, I discussed the ease of using open-source LLM models like Llama through LMstudio, a simple and fantastic method…
Jan 16

Luc Nguyen
Running Open Source LLM models Locally
In my previous posts, I showed you how to build a chatbot connected to databases using OpenAI. However, the security risks associated with…
Jan 15

Luc Nguyen in GoPenAI
Fine-Tuning OpenAI model for Specialized Tasks
Asking the LLM model to answer uncommon questions poses a significant challenge. I’ve been testing RAG, a tool that guides the LLM model…
Dec 30, 2023

Luc Nguyen
MemGPT: Assessing the Extent of Unlimited Context for LLMs
Three weeks ago, I came across a video discussing MemGPT on a tech channel. I was impressed with the concept and capability of MemGPT in…
Dec 26, 2023