The primary value learned value (PVLV) model is a possible explanation for the reward-predictive firing properties of dopamine (DA) neurons.
[1] It simulates behavioral and neural data on Pavlovian conditioning and the midbrain dopaminergic neurons that fire in proportion to unexpected rewards.
It is an alternative to the temporal-differences (TD) algorithm.
This neuroscience article is a stub.
You can help Wikipedia by expanding it.