Add Event Study (aka Dynamic Difference in Differences) functionality #584
base: main
Conversation
Check out this pull request on ReviewNB to see visual diffs & provide feedback on Jupyter Notebooks. Powered by ReviewNB
Codecov Report: ❌ Patch coverage is …
Additional details and impacted files:

@@            Coverage Diff             @@
##             main     #584      +/-  ##
==========================================
+ Coverage   93.21%   94.50%    +1.29%
==========================================
  Files          35       37        +2
  Lines        5511     6278      +767
  Branches      358      420       +62
==========================================
+ Hits         5137     5933      +796
+ Misses        246      205       -41
- Partials      128      140       +12

View full report in Codecov by Sentry.
The EventStudy class now requires a patsy-style formula to specify the outcome and fixed effects, removing the separate outcome_col argument. Design matrix construction uses patsy, and event-time dummies are appended. Input validation checks for formula presence, and tests and documentation are updated to reflect the new API and output format.
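For orientation, a minimal usage sketch of the new API (argument names beyond `formula` are assumptions pieced together from this thread, not a verbatim signature):

```python
import causalpy as cp

# Hypothetical sketch: the outcome and fixed effects come from a patsy-style
# formula; EventStudy appends the event-time dummies internally.
result = cp.EventStudy(
    data=df,                          # panel data (unit, time, treatment time)
    formula="y ~ C(unit) + C(time)",  # patsy formula: outcome ~ fixed effects
    treat_time_col="treat_time",      # column name seen elsewhere in this PR
)
```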
@cursor review
Expanded documentation to explain the patsy formula syntax, the role of unit and time fixed effects, and how event-time dummies ($\beta_k$) are automatically constructed by the EventStudy class. Added details on the event window and reference event time parameters for clearer guidance.
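For reference, the kind of specification being described is the standard event-study regression (notation assumed here, not quoted from the docs):

$$
y_{it} = \alpha_i + \lambda_t + \sum_{k \neq -1} \beta_k \, \mathbf{1}[t - T_i = k] + \varepsilon_{it}
$$

where $\alpha_i$ and $\lambda_t$ are the unit and time fixed effects, $T_i$ is unit $i$'s treatment time, and the reference event time (conventionally $k = -1$) is omitted.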
Added a warning in the EventStudy class and documentation that the implementation only supports simultaneous treatment timing and does not support staggered adoption. Introduced a validation to raise a DataException if treated units have different treatment times. Added a corresponding test to ensure staggered adoption raises an error, and updated the notebook to clarify estimator limitations.
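To illustrate the constraint with toy data (column names assumed):

```python
import numpy as np
import pandas as pd

# Simultaneous timing (supported): all treated units switch at t = 10
ok = pd.DataFrame({"unit": ["a", "b", "c"], "treat_time": [10, 10, np.nan]})

# Staggered adoption (rejected): treated units switch at different times,
# which the new validation is described as raising a DataException for
bad = pd.DataFrame({"unit": ["a", "b", "c"], "treat_time": [10, 12, np.nan]})
```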
Enhanced the _bayesian_plot and _ols_plot methods in EventStudy to support configurable figure size and HDI probability. Updated docstrings to document new parameters and improved plot labeling for clarity.
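A plausible call given those parameters (the exact argument names are an assumption):

```python
# Hypothetical usage: configurable figure size, plus an HDI probability for
# the Bayesian backend
fig, ax = result.plot(figsize=(10, 4), hdi_prob=0.94)
```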
Introduces event study support to effect_summary(), including parallel trends check and dynamic effect reporting. Updates event study class to allow HDI probability customization and reporting, and extends documentation with effect summary usage and interpretation.
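Sketch of the described reporting flow (the `hdi_prob` argument name is an assumption; the `table` and `text` attributes are mentioned in the integration tests below):

```python
# Hypothetical usage: dynamic effects plus a parallel-trends check
summary = result.effect_summary(hdi_prob=0.9)
print(summary.text)  # prose interpretation
summary.table        # per-event-time effect estimates
```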
The `generate_event_study_data` function now supports optional time-varying predictors generated as AR(1) processes, controlled by new parameters: `predictor_effects`, `ar_phi`, and `ar_scale`. Added the `generate_ar1_series` utility function. Updated docstrings and examples to reflect these changes. The event study PyMC notebook was updated with additional analysis and improved section headings.
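A minimal sketch of what the AR(1) utility presumably computes (illustrative implementation, not the package's actual code):

```python
import numpy as np

def ar1_series(n: int, phi: float = 0.8, scale: float = 1.0, seed=None):
    """Simulate x_t = phi * x_{t-1} + eps_t, with eps_t ~ Normal(0, scale)."""
    rng = np.random.default_rng(seed)
    x = np.zeros(n)
    for t in range(1, n):
        x[t] = phi * x[t - 1] + rng.normal(0.0, scale)
    return x
```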
Introduces integration tests for the EventStudy.effect_summary method using both PyMC and sklearn models. Tests verify the returned EffectSummary object, its table and text attributes, and key output elements.
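A hedged pytest-style sketch of such a test (the fixture and assertions are assumed, not copied from the PR):

```python
def test_effect_summary_attributes(fitted_event_study):
    # fitted_event_study: hypothetical fixture wrapping a fitted EventStudy
    summary = fitted_event_study.effect_summary()
    assert hasattr(summary, "table")
    assert hasattr(summary, "text")
    assert "parallel trends" in summary.text.lower()
```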
I will take a look in the next few days :)
@cursor review
PR Summary: Adds an Event Studies section with the …
Written by Cursor Bugbot for commit 786a5f8. This will update automatically on new commits.
bugbot run |
# (staggered adoption is not currently supported)
treated_times = self.data.loc[
    ~self.data[self.treat_time_col].isna(), self.treat_time_col
].unique()
Bug: Validation rejects valid data when using documented np.inf for controls
The docstring at line 81 states users can "Use NaN or np.inf for never-treated (control) units," but the staggered adoption validation only uses .isna() to filter out control units. Since np.inf is not detected by .isna(), control units marked with np.inf are included in treated_times. When combined with treated units having finite treatment times, .unique() returns both values (e.g., [10, inf]), causing a false DataException about staggered adoption when the data is actually valid. The same issue affects _compute_event_time() where treated_mask would incorrectly include np.inf control units.
Additional Locations (1)
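One possible fix, sketched (this runs inside the class, so `self` is assumed): treat both NaN and non-finite values as never-treated when building the treated mask.

```python
import numpy as np

# Possible fix sketch: exclude NaN *and* np.inf controls from treated_times
treat_times = self.data[self.treat_time_col]
treated_mask = treat_times.notna() & np.isfinite(treat_times)
treated_times = treat_times[treated_mask].unique()
```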
    }
    rows.append(row)

return pd.DataFrame(rows)
Bug: Unused round_to parameter in get_event_time_summary method
The round_to parameter in get_event_time_summary is documented as controlling "Number of decimals for rounding" but is never actually used in the function body. The returned DataFrame contains unrounded values from float(coeff.mean()), float(coeff.std()), and hdi computations. This means users who pass round_to=3 expecting rounded output will get full precision values instead.
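A minimal fix sketch consistent with that description (assuming the method builds its result from `rows` as shown above):

```python
# Possible fix sketch: actually apply the documented rounding
df = pd.DataFrame(rows)
if round_to is not None:
    df = df.round(round_to)
return df
```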
# Convert patsy output to DataFrames for manipulation
X_df = pd.DataFrame(
    X, columns=X.design_info.column_names, index=self.data.index
)
Bug: Index mismatch if patsy drops rows with NaN
When building the design matrix, dmatrices from patsy may drop rows containing NaN values (its default NA_action='drop' behavior). The code then creates X_df using index=self.data.index, which still has all original rows. If patsy dropped any rows, X will have fewer rows than self.data.index has elements, causing a pandas ValueError about shape mismatch. This affects users who provide data with missing values in the outcome or predictor columns used in the formula.
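One possible remedy, sketched: patsy's `dmatrices` accepts `NA_action="raise"`, which fails loudly on missing values instead of silently dropping rows, keeping the design matrix aligned with `self.data.index`.

```python
from patsy import dmatrices

# Possible fix sketch: raise on NaN rather than drop, so X stays aligned
# with self.data.index (self.formula / self.data are assumed class attributes)
y, X = dmatrices(
    self.formula, self.data, NA_action="raise", return_type="dataframe"
)
```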
Added Event Study to the experiment support table in reporting_statistics.md and updated AGENTS.md to instruct updating the table when adding new experiment types.
This commit adds extensive new tests to test_reporting.py and test_synthetic_data.py, covering error handling, experiment type detection, OLS statistics edge cases, prose and table generation for various models, and all synthetic data generation utilities. These tests improve coverage and robustness for reporting and data simulation functions.
Expanded documentation in both the EventStudy class and the event study PyMC notebook to explain the equivalence between indicator functions and dummy variables. Added details on how dummy variables are constructed for each event time, the omission of the reference period to avoid multicollinearity, and the interpretation of regression coefficients as ATT at each event time.
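An illustrative pandas construction of such dummies (a sketch with assumed column names, not the class's actual code):

```python
import pandas as pd

# Each dummy encodes the indicator 1[t - T_i = k] for one event time k;
# the reference period k = -1 is dropped to avoid multicollinearity.
event_time = (df["time"] - df["treat_time"]).astype("Int64")
dummies = pd.get_dummies(event_time, prefix="event_time")
dummies = dummies.drop(columns=["event_time_-1"], errors="ignore")
```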
This pull request adds support for event study analysis to the CausalPy package. The main changes include introducing the new EventStudy class, updating the package exports to include it, and providing a utility function for generating synthetic panel data suitable for event study and dynamic DiD analyses.

Consider adding the time-relative-to-treatment column in the notebook rather than hiding it in the experiment class.

📚 Documentation preview 📚: https://causalpy--584.org.readthedocs.build/en/584/