It is a reward signal only in the limited sense of a “reward for having made a correct prediction”.
Martino Sorbaro

