ChatGPT to Evaluate Generated Text

How good is ChatGPT for evaluating automatic summarization, story generation, and data-to-text generation?

Benjamin Marie
6 min readMar 16, 2023
Image from Pixabay and modified by the author.

ChatGPT has an impressive ability to perform natural language processing tasks (NLP) with simple instructions.

In a previous article, I presented and discussed the research work by Jiao et al. (2023) who evaluated the ability of ChatGPT to translate. The results are impressive and almost comparable with standard machine translation systems.

But ChatGPT is able to do much more than translating. The challenge is to find for which applications ChatGPT is actually good at or even better than other existing systems.

In this article, I review the work by Wang et al. (2023), who studied the ability of ChatGPT at evaluating natural language generation (NLG):

Is ChatGPT a Good NLG Evaluator? A Preliminary Study

The main objective is to find whether a system like ChatGPT can be used to judge the quality of text generated by NLG

--

--

Benjamin Marie

Ph.D, research scientist in NLP/AI. Exclusive articles and all my AI notebooks on https://newsletter.kaitchup.com/