Andrew Huberman· PhD
It's called temporal difference reinforcement learning. The idea is that in the world these expectations are going through their own trajectory. All right. And that's what dopamine is coding for. Any learning rule should code for the surprising outcome.
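To make the "surprising outcome" idea concrete, here is a minimal sketch of a temporal difference error, assuming the standard textbook TD(0) form: the error at each step is the reward received, plus the discounted expectation for the next step, minus the current expectation. The function names `td_errors` and `td_update`, the discount `gamma`, and the learning rate `alpha` are illustrative assumptions, not anything stated in the conversation.

```python
def td_errors(values, rewards, gamma=1.0):
    """Return the TD error at each step.

    delta_t = r_t + gamma * V(s_{t+1}) - V(s_t)
    values[t] is the current expectation at step t; the expectation
    after the final step is taken to be 0 (end of the episode).
    """
    deltas = []
    for t, reward in enumerate(rewards):
        next_value = values[t + 1] if t + 1 < len(values) else 0.0
        deltas.append(reward + gamma * next_value - values[t])
    return deltas


def td_update(values, rewards, alpha=0.1, gamma=1.0):
    """Nudge each expectation toward its target by a fraction alpha of the error."""
    deltas = td_errors(values, rewards, gamma)
    return [v + alpha * d for v, d in zip(values, deltas)]


# Hypothetical three-step episode: cue, delay, reward.
rewards = [0.0, 0.0, 1.0]

unexpected = td_errors([0.0, 0.0, 0.0], rewards)        # reward not yet predicted
expected = td_errors([1.0, 1.0, 1.0], rewards)          # reward fully predicted
omitted = td_errors([1.0, 1.0, 1.0], [0.0, 0.0, 0.0])   # predicted reward withheld
```

In this sketch an unpredicted reward gives a positive error at the moment of reward, a fully predicted reward gives no error anywhere, and a withheld reward gives a negative error. That is the sense in which the signal tracks surprise rather than reward itself.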