HamzaWhat is RLHF?Reinforcement learning from human feedback (and how OpenAI used it for InstructGPT)Oct 18, 2023Oct 18, 2023
HamzaDifferent Approaches to Improving LLMsWant to make your LLM more efficient? Here’s how you should think about fine-tuning, instruction tuning, and RLHFOct 16, 2023Oct 16, 2023
HamzaUnlock Business Value with OpenAI Model Fine-TuningAn insider’s guide to optimizing costs and performance with GPT-3.5Oct 9, 2023Oct 9, 2023