Run any Skill in Manus with one click

did-analysis

Econometrics skill for Difference-in-Differences (DID) analysis. Activates when the user asks about: "difference in differences", "DID", "DiD", "diff-in-diff", "parallel trends", "treatment group", "control group", "pre-treatment", "post-treatment", "policy evaluation", "natural experiment", "staggered DID", "event study regression", "two-way fixed effects DID", "callaway santanna", "sun and abraham", "双重差分", "倍差法", "平行趋势", "处理组", "对照组", "政策评估", "事件研究", "交错DID", "渐进处理"

Run Skill in Manus

Overview

Install command

npx skills add https://github.com/zhouziyue233/great-econometrics --skill did-analysis

Copy and paste this command into Claude Code to install the skill

Source

zhouziyue233/great-econometrics

Stars4

Forks0

UpdatedApril 3, 2026 at 04:39

File Explorer

2 files

SKILL.md

readonly

More from this repository

same repository

literature-review

zhouziyue233/great-econometrics

Search, summarize, and synthesize economics literature. find research gaps, position your contribution.

2026-04-074

beamer-ppt

zhouziyue233/great-econometrics

Create Beamer-style academic PPTX presentations using python-pptx. Produces publication-quality .pptx files with navy-blue Metropolis theme (16:9, frame title bars, progress bar) for conference talks, job market presentations, and seminar slides. Called by /present command.

2026-04-034

data-pipeline

zhouziyue233/great-econometrics

End-to-end data pipeline for empirical research: fetch economic data from APIs (FRED, World Bank, IMF, BLS, OECD, Yahoo Finance), clean and transform raw data, construct strategy-specific variables, and validate panel structure. Use when asked to fetch data, download data, clean data, merge datasets, prepare analysis-ready data.

2026-04-034

figure

zhouziyue233/great-econometrics

Called by /plot to generate and upgrade econometric figures to top-journal standards.

2026-04-034

iv-estimation

zhouziyue233/great-econometrics

Econometrics skill for instrumental variables and treatment effect estimation. Activates when the user asks about: "instrumental variables", "IV estimation", "2SLS", "two-stage least squares", "endogeneity", "weak instruments", "first stage", "Sargan test", "overidentification", "propensity score matching", "PSM", "average treatment effect", "ATT", "LATE", "local average treatment effect", "endogenous regressor", "instrument validity", "工具变量", "两阶段最小二乘", "内生性", "弱工具变量", "倾向得分匹配", "平均处理效应", "处理效应", "局部平均处理效应"

2026-04-034

ml-causal

zhouziyue233/great-econometrics

Econometrics skill for machine learning methods in causal inference. Activates when the user asks about: "causal forest", "generalized random forest", "GRF", "double machine learning", "DML", "debiased machine learning", "LASSO for variable selection", "post-LASSO", "heterogeneous treatment effects", "CATE", "conditional average treatment effect", "BLP analysis", "CLAN analysis", "causal tree", "honest estimation", "因果森林", "双重机器学习", "异质性处理效应", "条件平均处理效应", "LASSO变量选择", "机器学习因果推断", "去偏机器学习"

2026-04-034

Source

zhouziyue233

zhouziyue233/great-econometrics

View GitHub Repository View Creator Repositories

Install command

Download

Run Skill in Manus

Useful forSOC

Data ScientistsComputer and Mathematical Occupations15-2051L4

name

did-analysis

description

Difference-in-Differences (DID) Skill

This skill guides complete DID analysis: from assumption validation and model specification to staggered treatment designs and event study regressions. Designed for policy evaluation and natural experiment settings.

Core DID Logic

DID compares the change in outcomes for a treatment group before and after treatment to the change for a control group over the same period.

DID Estimator = (Ȳ_treat,post − Ȳ_treat,pre) − (Ȳ_ctrl,post − Ȳ_ctrl,pre)

Key Assumption (Parallel Trends): In the absence of treatment, the treatment group's outcome would have evolved in parallel with the control group.

DID Workflow

Design check: Confirm treatment/control assignment and timing
Parallel trends: Test with pre-treatment event study regression
Baseline regression: 2×2 DID or TWFE regression
Staggered design check: If adoption dates vary, use robust estimators
Robustness: Placebo treatment, alternative control groups, callaway-santanna

Basic 2×2 DID Model

Y_it = β₀ + β₁·Treat_i + β₂·Post_t + β₃·(Treat_i × Post_t) + ε_it

β₃ = DID estimate (ATT)

Code Templates

# Python — 2×2 DID with TWFE
import statsmodels.formula.api as smf

# Simple 2x2
model = smf.ols('y ~ treat + post + treat_post', data=df).fit(cov_type='HC3')

# TWFE with entity and time FE (preferred)
from linearmodels.panel import PanelOLS
df_panel = df.set_index(['entity_id', 'year'])
twfe = PanelOLS(df_panel['y'], df_panel[['treat_post']],
                entity_effects=True, time_effects=True)
result = twfe.fit(cov_type='clustered', cluster_entity=True)
print(result.summary)

# R — TWFE
library(plm); library(lmtest); library(sandwich)
panel_df <- pdata.frame(df, index = c("entity_id", "year"))
twfe <- plm(y ~ treat_post, data = panel_df, model = "within", effect = "twoways")
coeftest(twfe, vcov = vcovHC(twfe, cluster = "group"))

* Stata — TWFE with clustered SE
xtset entity_id year
xtreg y treat_post i.year, fe cluster(entity_id)
* Or equivalently:
reghdfe y treat_post, absorb(entity_id year) cluster(entity_id)

Parallel Trends: Event Study Regression

Replace the single treat_post dummy with relative-time dummies to visualize pre-trends:

* Stata — event study
reghdfe y ib(-1).rel_time, absorb(entity_id year) cluster(entity_id)
coefplot, vertical yline(0) xline(0) ///
    title("Event Study: Pre/Post Treatment Effects") ///
    xlabel(, angle(45))

# R — event study
library(fixest)
es_model <- feols(y ~ i(rel_time, treat, ref = -1) | entity_id + year,
                  data = df, cluster = ~entity_id)
iplot(es_model, xlab = "Periods relative to treatment")

Interpreting the event study plot:

Pre-treatment coefficients ≈ 0 → parallel trends assumption holds
Pre-trend test: joint F-test for all pre-treatment coefficients = 0
Post-treatment coefficients show dynamic treatment effects

Staggered DID

When units adopt treatment at different times, standard TWFE can be biased (Callaway-Sant'Anna, Sun-Abraham).

# R — Callaway-Sant'Anna estimator (csdid)
library(did)
cs_result <- att_gt(yname = "y",
                    gname = "cohort_year",   # year of first treatment (0 if never treated)
                    idname = "entity_id",
                    tname  = "year",
                    xformla = ~x1 + x2,
                    data = df)

# Aggregate to average ATT
aggte(cs_result, type = "simple")   # Overall ATT
aggte(cs_result, type = "dynamic")  # Dynamic effects
ggdid(cs_result)

# R — Sun-Abraham (fixest)
library(fixest)
sa_model <- feols(y ~ sunab(cohort_year, year) | entity_id + year,
                  data = df, cluster = ~entity_id)
iplot(sa_model)

* Stata — Callaway-Sant'Anna (csdid from SSC)
csdid y x1 x2, ivar(entity_id) time(year) gvar(cohort_year)
csdid_plot

Robustness Checks for DID

Placebo treatment dates: assign fake treatment 1–2 periods before actual treatment
Placebo treatment groups: run DID using only control units with a fake treatment
Alternative control groups: restrict to more comparable controls
Continuous treatment intensity: use dose-response DID

Reporting Standards

Report event study plot as Figure (essential for credibility)
State the parallel trends assumption and supporting evidence
Report DID coefficient with clustered SE (cluster at entity level)
Discuss potential violations: anticipation effects, Ashenfelter's dip, spillovers
For staggered designs, always use CS or SA estimators and explain why

See references/did-reference.md for heterogeneous treatment effects, triple-difference models, synthetic control comparison, Borusyak-Jaravel-Spiess imputation estimator, de Chaisemartin-D'Haultfoeuille estimator, and Roth (2022) pre-trends power analysis.

Common Pitfalls

Using TWFE with staggered treatment and heterogeneous effects: Standard TWFE is biased — use Callaway-Sant'Anna, Sun-Abraham, or Borusyak-Jaravel-Spiess
Clustering at the treatment level: Don't cluster at the individual level if treatment varies at the state level — cluster at the state level
Failing to reject pre-trends ≠ parallel trends hold: Low power is common; use Roth (2022) power analysis to assess
Ignoring anticipation effects: If agents anticipate treatment, pre-treatment coefficients may be non-zero even with parallel trends
Not showing the event study plot: Reviewers expect to see pre-trends visually — always include the event study figure