Table of contents
Review: Instrumental Variable Analysis
- Dilemma: We cannot directly regress Y on X because X is endogenous—there are unobservable confounders U related to both X and Y.
- Instrumental variable: We have a variable Z satisfying two conditions: ①instrumental exogeneity: Cov(Z,e)=0, meaning no confounders affect Z and Z affects Y only through X; ②instrumental relevance: Cov(Z,X)≠0.
- Identification: We have IV estimand β=Cov(Z,Y)/Cov(Z,X) and we can regress X on Z first, then regress Y on the fitted values X_hat (i.e., two-stage least squares).
Principal Stratification
Notations for each individual i:
- - 0 means not encouraged, 1 means encouraged—this is the instrumental var.
- - this are potential treatments; each individual has these two digits.
- 0&1 in index refers to the realization of Z
- Each individual i also has two potential outcomes
- 0&1 in index also refers to the realization of Z (not T!)
- Randomization of encouragement Z means
- But always keep in mind that the treatment is not random: no
Based on and , individuals fall into four groups:
- Compliers:
- Always-takers:
- Never-takers:
- Defiers:
To summarize:
Z_i = 1 | Z_i = 0 | |
---|---|---|
T_i = 1 | compliers & always-takers | defiers & always-takers |
T_i = 0 | defiers & never-takers | compliers & never-takers |
Remarks: We can only observe one potential treatment for each individual, so we cannot identify which group an individual belongs to. Next, we’ll see that the IV estimand equals to the LATE estimand—the average treatment effect among compliers.
ITT Effect Decomposition
Intention-to-treat analysis focuses on the effect of encouragement to treatment, rather than the treatment eventually received—that is, the effect of Z on Y. We have:
In this decomposition, g refers to the four groups. Clearly, ITT effects for always-takers and never-takers are 0 because they always/never take the treatment regardless of Z. We also assume there are no defiers. Hence, we get:
We know Z is exogenous (i.e., Z is independent of potential treatments and outcomes), so:
The ITT effect for compliers is actually the ATE for compliers because for compliers, Z is equivalent to T:
Finally, what’s the proportion of compliers? Because Z is exogenous/randomly assigned, the proportions of compliers, always-takers, and never-takers are the same for Z=1 and Z=0. Hence:
To sum up,
Two hints for the last step:
- Law of total expectation:
Thus, the IV estimand is essentially the average treatment effect for the compliers, named “local average treatment effect (LATE).”