In this paper, I am confused about the classification of selection bias and its connection to Pearl's do-calculus.
Following notation of paper, let $E$ be a binary exposure, D be a binary outcome, $L1$ be a covariate vector, and $S$ be selection. $D^e$ refers to potential outcomes. Our goal is to target the true ATE defined as $P(D^1=1)-P(D^0=1)$
They define Type 1 Selection Bias (SB) as that which does not have internal validity but has external validity i.e.
- $P(D=1|E=1,S=1)-P(D=1|E=0,S=1) \ne P(D^1=1|S=1)-P(D^0=1|S=1)$
- But, $P(D^1=1|S=1)-P(D^0=1|S=1)=P(D^1=1)-P(D^0=1)$
They define Type 2 SB as that which does not have external validity but has internal validity i.e.
- $P(D=1|E=1,S=1)-P(D=1|E=0,S=1) = P(D^1=1|S=1)-P(D^0=1|S=1)$
- But, $P(D^1=1|S=1)-P(D^0=1|S=1) \ne P(D^1=1)-P(D^0=1)$
They provide the following DAG and explain that this is an example of Type 1 SB.
When I do Pearl's rules of do-calculus on the DAG, I'm not getting this. If I were to translate having no SB from above into sufficient action statements, I believe I'd need the following for $e=1,0$:
- $p(d|e,s) = p(d|do(e),s)$
- $p(d|do(e),s) = p(d|do(e))$
The first bullet is an action/observation (rule 2). For this, I would need, $D \perp E | S$ in the DAG after removing all arrows from E. So this appears to hold (since it removes S as a collider).
The second bullet is an insertion/deletion (rule 1). For this, I would need, $D \perp S|E$ in the DAG after removing all arrows into E - in this case, the DAG remains the same (an arrow would be removed if confounding were present). Regardless, this still has a backdoor path from D to L1 to S. So this appears to be violated. Hence, based on these rules, I see it as, at worst, Type 2 SB.
As they say in the paper, conditioning on L1 allows us to cure the selection bias presented by this DAG. But, this seems to address issue 2, since now we have $p(d| do(e),s,l1) = p(d|do(e),l1)$ as the above backdoor path is blocked once I also condition on $L1$. From randomization/ignorability, standardization permits identification.
Intuition-based answers aside, why is this do-calculus approach incorrect?
