TEST: Using Permutation Inference to Strengthen Regression Discontinuity Analysis
Using Permutation Inference to Strengthen Regression Discontinuity Analysis
Introduction
Regression Discontinuity (RD) designs are a powerful tool for causal inference, but traditional parametric approaches can be sensitive to model specification. Permutation inference offers a robust alternative that doesn’t rely on distributional assumptions and can provide more credible statistical inference.
What is Permutation Inference?
Permutation inference tests the null hypothesis by randomly reassigning treatment status and comparing the observed effect to the distribution of effects under random assignment. In RD contexts, this typically involves:
- Randomly reassigning the treatment cutoff
- Calculating the treatment effect at each permuted cutoff
- Comparing the true effect to this null distribution
A Simple Example
Let’s consider a scholarship program where students with test scores above 75 receive financial aid.
Sample Data
Here’s what our data might look like:
| Student | Test Score | Received Aid | Outcome |
|---|---|---|---|
| 1 | 73.2 | No | 2.8 |
| 2 | 76.1 | Yes | 3.4 |
| 3 | 74.8 | No | 2.9 |
| 4 | 77.3 | Yes | 3.6 |
| 5 | 72.5 | No | 2.7 |
Note: This is simulated data for illustration purposes.
Implementation Steps
Step 1: Calculate the True Effect
Observed effect = Mean(Outcome | Score ≥ 75) - Mean(Outcome | Score < 75)
= 3.5 - 2.8 = 0.7
Step 2: Generate Permutation Distribution
- Randomly select 1000 alternative cutoff points
- For each cutoff, calculate the treatment effect
- Build a null distribution of these effects
Step 3: Calculate P-value
p-value = Proportion of permuted effects ≥ observed effect
Advantages of This Approach
- Assumption-free: No need to specify functional form
- Robust: Less sensitive to outliers and model misspecification
- Intuitive: Clear interpretation of statistical significance
- Flexible: Works with any test statistic
Key Considerations
Window Selection: Choose permutation range carefully - too narrow misses the point, too wide includes irrelevant comparisons.
The permutation window should typically:
- Exclude the true cutoff region
- Cover a reasonable range of the running variable
- Maintain sufficient sample size for each permutation
Conclusion
Permutation inference provides a valuable complement to traditional RD analysis. While it requires more computational effort, the robust statistical properties and assumption-free framework make it particularly valuable when:
- Sample sizes are modest
- Functional form is uncertain
- Traditional methods show sensitivity to specification
By combining traditional parametric estimates with permutation-based inference, researchers can build more convincing causal arguments.
For more details on implementation, see Cattaneo, Idrobo, and Titiunik (2020) or explore the rdrobust package in R.