Causal Inference

Notes on Causal Inference following the course by Brady Neal.

Skip Moses https://example.com/norajones
2022-05-31

In this post we preview the course on Causal Inference. This post will give a brief overview of the main concepts.

What is causal inference?

Motivating Example

Suppose we have some disease we are tying to treat with treatment A and treatment B. Our only goal is minimizing death. Suppose treatment B is much more scarce than treatment A.

Data at Treatment Level

Treatment Total
A 240/1500 16%
B 105/550 19%

Data at the Condition Level

Treatment Mild Severe Total
A 210/1400 15% 30/100 30% 240/1500 16%
B 5/50 10% 100/500 20% 105/550 19%

\[ \frac{1400}{1500}(0.15) + \frac{100}{1500}(0.30) = 0.16 \]

\[ \frac{50}{550}(0.10) + \frac{500}{550}(0.20) = 0.19 \]

Which treatment should you choose?

Correlation does not imply causation

What does imply causation?

Potential Outcomes

Fundamental problem of causal inference

Work around (Average Treatment Effect ATE)

\[ \mathbb{E}[Y_i(1)] - \mathbb{E}[Y_i(0)] \neq \mathbb{E}[Y\vert T=1] - \mathbb{E}[Y\vert T = 0] \] The left hand side is causal, while the right hand side is causal and confounding.

Observational Sudies

\[ \mathbb{E}[Y(t) \vert W = w] := \mathbb{E}[Y\vert do(T=t), W =w] = \mathbb{E}[Y\vert t,w] \] - This still depends on \(w\), so we take the marginal

\[ \mathbb{E}[Y(t)] := \mathbb{E}[Y\vert do(T=t)] = \mathbb{E}_W\mathbb{E}[Y\vert t, W] \]

Example

Treatment Mild Severe Total
A 210/1400 15% 30/100 30% 240/1500 16%
B 5/50 10% 100/500 20% 105/550 19%

\[ \mathbb{E}[Y\vert do(T=t)] = \mathbb{E}_W\mathbb{E}[Y\vert t, C] = \sum_{c\in C} \mathbb{E}[Y\vert t,c]P(c) \]

\[ \frac{1450}{2050}(0.15) + \frac{600}{2050}(0.30) \approx 0.194 \]

\[ \frac{1450}{2050}(0.10) + \frac{600}{2050}(0.20) \approx 0.129 \]