TTE.ac
Course

Introduction to Selection Bias

Slide 1 of 8

What is Selection Bias?

When Our Sample Doesn't Represent the Population

Selection bias occurs when the association observed in the selected sample differs from the association in the target population we wish to make inferences about. Unlike confounding, which arises from common causes, selection bias arises from the process of who gets included in our analysis.

Selection Bias Defined

Selection bias occurs when the probability of being included in the study (or remaining in follow-up) depends on both the exposure and the outcome, or their causes. This creates a spurious association or distorts the true causal relationship.

Sources of Selection

Selection into an analysis can happen at multiple stages:

StageMechanismExample
Study enrollmentEligibility criteria, willingness to participateOnly volunteers in health studies
Loss to follow-upDropout during studySicker patients stop attending visits
Analysis restrictionConditioning on post-treatment variablesAnalyzing only "responders"
SurvivalOnly living individuals observableStudying only heart attack survivors

The Graphical Structure

The classic selection bias structure involves conditioning on a collider:

A → S ← Y

Where S represents selection into the study. If both treatment A and outcome Y (or their causes) affect who gets selected, conditioning on S (by only analyzing selected individuals) opens a biasing path.

Selection vs. Confounding

Confounding arises from common causes of A and Y. Selection bias arises from conditioning on common effects of A and Y. Both create spurious associations, but through different mechanisms.

Use arrow keys to navigate