Antonio VelascoPrompt evaluationA python example on how to evaluate large language models output to account for variations in prompt, hyperparamenters or engines.Aug 9, 2023Aug 9, 2023