Do-Calculus for Causal Diagram 7.5 from "The Book of Why" (napkin problem)

Question

In "The Book of Why" the below causal diagram is described as the "simplest model" where estimation of the causal effect goes beyond front and back-door adjustment and thus requires do-calculus.

Here, $W$, $X$, $Y$, $Z$ are all observed, and $U_1$ and $U_2$ are Unobserved.

I have played around a bit with the rules, about which I am still shaky, but haven't come up with a solution for $P(Y|do(X))$ that can be estimated from the diagram. Could someone better than me help come up with a solution?

No idea myself, but I will say this: once you've read The Book of Why and Causal Inference in Statistics: A Primer, you might be ready (though I'm finding it EXTREMELY challenging) to tackle the "big daddy" book: Causality: Models, Reasoning, and Inference. That last book has by far the most complete discussion of the do calculus, and I would imagine that if you could master its discussion, you could probably derive the correct expression for $P(Y|\operatorname{do}(X)).$ — Adrian Keister, Mar 19 '21 at 14:15

Carlos Cinelli · Accepted Answer · 2021-03-19T23:43:13.427

I answered this once on twitter, I can reproduce the answer here.

Derivation (graphs licensing each step are provided below).

$$ \begin{align} P(y|do(x)) &= P(y|do(x), do(z)) \qquad &\text{Rule 3: $(Y \perp\!\!\!\perp Z|X)_{G_\overline{XZ}}$}\\ &= P(y |x, do(z)) \qquad &\text{Rule 2: $(Y\perp\!\!\!\perp X)_{G_{\overline{Z}\underline{X}}}$}\\ &= \frac{P(y, x|do(z))}{P(x|do(z))} \qquad &\text{Def. of conditional probability}\\ &= \frac{\sum_{w}P(y, x|z, w)P(w)}{\sum_{w}P(x|z,w)P(w)}\qquad &\text{Backdoor using with $W$: $(\{Y, X\}\perp\!\!\!\perp Z|W)_{G_{\underline{Z}}}$} \end{align} $$

What does each step mean of the derivation means in plain English?

Step 1 simply states that, when $X$ is held fixed (by intervention), manipulating $Z$ has no effect on $Y$;
Step 2 states that, when $Z$ is held fixed (by intervention), there is no confounding between $X$ and $Y$;
Step 3 is just applying the definition of conditional probability; and, finally,
Step 4 notes that adjusting for $W$ is sufficient to identify the causal effect of $Z$ on $X$ and $Y$. This is because $W$ blocks all confounding paths from $Z$ to $X$ and $Y$ (backdoor paths). So we can use vanilla backdoor adjustment here.

Modified Graphs

This is excellent, thank you! – Mir Henglin Mar 20 '21 at 17:04 — Mir Henglin, Mar 20 '21 at 17:04

Do-Calculus for Causal Diagram 7.5 from "The Book of Why" (napkin problem)

1 Answers1