2025 · Causal Inference

English Premier League Spending Analysis

Causal Inference Sports Analytics R

English Premier League Spending Analysis

University of Maryland | Jan 2025 – Apr 2025

R Synthetic Control Difference-in-Differences ggplot2

Project Overview

Applied quasi-experimental causal inference methods to examine the relationship between squad investment and team performance in the English Premier League.

Key Contributions

Causal Framework: Applied synthetic control and difference-in-differences (DiD) methods to estimate causal effects of squad investment on league position, controlling for confounding factors
Model Implementation: Constructed synthetic counterfactuals for treated clubs using convex combinations of donor pool teams, validating pre-treatment parallel trends
Statistical Inference: Conducted placebo tests and permutation inference to assess statistical significance, finding significant ROI heterogeneity across club tiers (top-6 vs. mid-table)

Technologies Used

  • Languages: R
  • Methods: Synthetic Control, Difference-in-Differences
  • Visualization: ggplot2
  • Data Sources: TransferMarkt, Premier League

Key Findings

AnalysisResult
Treatment Effect (Top-6)Significant positive
Treatment Effect (Mid-table)Heterogeneous
Pre-treatment Parallel TrendsValidated
Placebo TestsPassed