A classic example of "selection bias" involves looking at the performance of professional basketball players. The example goes, among NBA players there is no correlation between height and performance.
Obviously that cannot be generalized to "height has no relationship with being good at basketball for all players", it is just that by the time you are selected to play for the NBA, other features are more important.
What I can't seem to do is to explain this through a DAG. I've come up more or less with this structure:
But it seems clear to me that here fixing the value "Plays for NBA" to TRUE would not make height independent from performance. Does it mean that the structure of the DAG is wrong? Or is there a better way to show this kind of selection bias?