Mean Squared Projected Bellman Error
This requires some thought to deconstruct. See page 2 (bottom right) of http://www.machinelearning.org/archive/icml2009/papers/546.pdf for an explanation.
TQ is Bellman applied to Q, so Q-TQ would be something like: how far is Q from satisfying Bellman (which we know it must)?
This would be fine for tabular Q, but under function approximation such a Q function might not be realizable: there might not be a setting of the thetas that yields this. So the idea is to project:
\Pi(v) = argmin_theta || v - V_theta ||^_D
It turns out that under linear function approximation, this has a closed form, which is exactly the \Pi shown in the next paragraph.