Statisticsmedium

Treatment Effect of D on Y Where D Is Binary and Y Is Continuous

Question

Consider the treatment effect of DD on YY where DD is a binary treatment variable and YY is a continuous outcome. Discuss identification strategies, potential outcomes framework, and estimation methods.

Potential outcomes, identification strategies, and estimation methods for causal inference.

Average Treatment Effect

E[Y(1)−Y(0)]

ATE = E[Y(1) − Y(0)] under the potential outcomes framework

1Potential Outcomes Framework

For each individual ii, define potential outcomes Yi(1)Y_i(1) (outcome if treated) and Yi(0)Y_i(0) (outcome if untreated). The individual treatment effect is τi=Yi(1)Yi(0)\tau_i = Y_i(1) - Y_i(0).

Fundamental Problem of Causal Inference

We can never observe both Yi(1)Y_i(1) and Yi(0)Y_i(0) for the same individual. This is the **fundamental problem** — we must estimate the counterfactual from data on other individuals.
  Potential Outcomes:

  Individual i:   D=1 → Yᵢ(1) (observed)
                  D=0 → Yᵢ(0) (counterfactual)

  τᵢ = Yᵢ(1) − Yᵢ(0) ← never observed directly

2Average Treatment Effect (ATE)

The ATE is τATE=E[Y(1)Y(0)]\tau_{ATE} = E[Y(1) - Y(0)]. Under **random assignment** (D(Y(0),Y(1))D \perp (Y(0), Y(1))): τATE=E[YD=1]E[YD=0]\tau_{ATE} = E[Y|D=1] - E[Y|D=0]

Randomization Solves Everything

With random treatment assignment, the simple difference in means is an unbiased estimator of the ATE. This is why **randomized controlled trials** (RCTs) are the gold standard for causal inference.

3Selection Bias in Observational Data

Without randomization: E[YD=1]E[YD=0]=τATE+E[Y(0)D=1]E[Y(0)D=0]Selection BiasE[Y|D=1] - E[Y|D=0] = \tau_{ATE} + \underbrace{E[Y(0)|D=1] - E[Y(0)|D=0]}_{\text{Selection Bias}}

Conditional Independence (CIA)

D(Y(0),Y(1))XD \perp (Y(0), Y(1)) | X — controlling for covariates XX removes the selection bias. This is also called **unconfoundedness** or the **ignorability** assumption. It requires that all confounders are observed.

4Estimation Methods

MethodKey AssumptionWhen to UseStrength
OLS with controlsCIA: D ⊥ Y(d)|XRich covariate dataSimple, transparent
Propensity Score MatchingCIA + overlapMany covariatesReduces dimension
IPWCIA + overlapFlexible, semiparametricDoubly robust variants
Instrumental VariablesValid instrument ZCIA fails, instrument availableHandles unobserved confounders
RDDContinuity at cutoffTreatment assigned by thresholdQuasi-experimental
Diff-in-DiffParallel trendsPanel data, policy changeControls for fixed effects
ATE (randomized)E[Y|D=1] − E[Y|D=0]
ATE (observational, CIA)E_X[E[Y|D=1,X] − E[Y|D=0,X]]
LATE (IV)Cov(Y,Z)/Cov(D,Z)
GOLD STANDARDRandomized experiment

Quiz

Test your understanding with these questions.

1
What is the fundamental problem of causal inference?
2
Under random assignment, the ATE can be estimated by: