I am trying to derive the Bellman equation using the value function and the Q-function.
This is as far as I got in understanding it; I tried writing it out myself in LaTeX:
Why does $V^*$ suddenly appear in the $Q^\pi$ function? Why isn't it $Q^\pi = r + \gamma Q^\pi(s_{t+1}, a)$?
The $Q^*$ equation doesn't make much sense to me either.
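
For comparison, here are the standard textbook forms as I understand them. This is only a sketch under my own assumptions: a deterministic transition from $s_t$ to $s_{t+1}$ with reward $r$, and $a'$ denoting the next action (that symbol is my notation). In these forms, $Q^\pi$ is paired with $V^\pi$ and $Q^*$ with $V^*$:

$$
\begin{aligned}
V^\pi(s) &= \mathbb{E}_{a \sim \pi(\cdot \mid s)}\left[ Q^\pi(s, a) \right] \\
Q^\pi(s_t, a_t) &= r + \gamma V^\pi(s_{t+1}) = r + \gamma \, \mathbb{E}_{a' \sim \pi(\cdot \mid s_{t+1})}\left[ Q^\pi(s_{t+1}, a') \right] \\
V^*(s) &= \max_{a} Q^*(s, a) \\
Q^*(s_t, a_t) &= r + \gamma V^*(s_{t+1}) = r + \gamma \max_{a'} Q^*(s_{t+1}, a')
\end{aligned}
$$

If that is right, the next action is averaged over $\pi$ in the $Q^\pi$ case and maximized over in the $Q^*$ case, which would explain why a value function of the next state shows up instead of a $Q$ with one fixed action $a$.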