I've seen at least two versions from the CS229, wondering if there is a comprehensive resource around this topic
The first version:
$$ B(V)(s) = V'(s) = R(s) + \gamma \max_{a \in A} \sum_{s' \in S} P_{sa}(s') V(s') $$
from problem 5 in http://cs229.stanford.edu/ps/ps4/ps4.pdf. It's called Bellman update operator in the problem description.
The second version:
$$ B(V)(s) = V'(s) = R(s) + \gamma \sum_{s' \in S} P_{s\pi(s)}(s') V(s') $$ from problem 4 in from https://see.stanford.edu/materials/aimlcs229/problemset4.pdf, it's called Bellman operator in the problem description.
Given $a$ is equivalent to $\pi(s)$, the major difference is one has a $\max$ operator which the other doesn't, so which is which, then?