iframe-proxy

igerber · 2026-06-06T18:54:38Z

Summary

PR-B of the ImputationDiD methodology validation — the source-validation pass of the Borusyak, Jaravel & Spiess (2024, REStud 91(6)) audit (PR-A #529 added the paper review). Validating against R didimputation uncovered and fixed a ~27% downward bias in the analytical standard errors without covariates (a real correctness bug; point estimates were always correct).

Three code corrections in diff_diff/imputation.py — behavior change: SE / t / p / CI values change without covariates; point estimates unchanged:

Untreated v_it weights (Theorem 3 variance). The covariate-free path used a balanced two-way closed form -(w_i/n0_i + w_t/n0_t - w/N0), wrong for the always-unbalanced Ω₀ in staggered designs → SEs ~27% too small. Replaced with the exact projection -A₀(A₀'A₀)⁻¹A₁'w (the covariate path's method), and kept all unit dummies in the design (the prior drop-first-unit/no-intercept design was one rank short → a further ~1.6% bias). SEs now match didimputation to ~1e-10. A singular Ω₀ routes to a dense-lstsq fallback (SciPy spsolve returns NaN + MatrixRankWarning without raising — promoted to an error so the fallback fires in production).
Auxiliary model (Equation 8): observation-level mean → the paper's unit-clustered Σ_i(Σ_t v)(Σ_t v·τ̂)/Σ_i(Σ_t v)², NaN-safe.
Untreated Step-1 residuals preserve NaN for missing FE (symmetric with the treated path) instead of a silent fillna(0.0).

The multiplier bootstrap resamples the same Theorem-3 influence function, so bootstrap SEs may also shift.

Methodology references

Method: ImputationDiD. Source: Borusyak, Jaravel & Spiess (2024), Revisiting Event-Study Designs: Robust and Efficient Estimation, Review of Economic Studies 91(6), 3253–3285 (DOI). R reference: didimputation v0.5.0.
Deviations (documented in REGISTRY.md ## ImputationDiD): R didimputation implements Equation 8 only at the cohort×event-time partition (= diff-diff's default aux_partition="cohort_horizon"); diff-diff additionally offers coarser cohort/horizon partitions (no R analogue, hand-calc validated). Multiplier bootstrap + survey-design TSL variance are library extensions. Leave-one-out variance (Supp. App. A.9) is not implemented (tracked).

Validation

New tests/test_methodology_imputation.py — paper-equation Verified Components (Theorem 1/2; Theorem 3 / eqs 6-8 + white-box unit-clustered Eq. 8 hand-calc + NaN-co-group edge + singular-Ω₀ dense-fallback regression; Proposition 5 K≥H̄ non-identification; Test 1 / eq 9 + Proposition 9) and TestImputationDiDParityR (overall + per-horizon ATT and SE vs didimputation, no silent skips).
R parity goldens: benchmarks/data/didimputation_golden.json (generator benchmarks/R/generate_didimputation_golden.R).
tests/test_imputation.py: tightened the coarser-partition conservatism test.
Full fast suite: 7585 passed, 0 failed (the SE change breaks nothing downstream). 6 fresh local AI-review rounds → converged clean.
METHODOLOGY_REVIEW.md row → Complete (Verified Components / Corrections Made / Deviations / R Comparison Results).

Security / privacy

Confirm no secrets/PII in this PR: Yes (the source PDF + R install logs are not committed).

🤖 Generated with Claude Code

github-actions · 2026-06-06T18:58:56Z

Overall Assessment

✅ Looks good — no unmitigated P0/P1 findings found.

Executive Summary

No undocumented methodology mismatch found for the ImputationDiD variance changes.
The FE-only path now uses the documented exact projection, and the Eq. 8 auxiliary residual implementation matches the unit-clustered formula in the registry.
R deviation for coarser aux_partition modes is documented, so it is informational only.
Performance cost from recomputing the projection per target is already tracked in TODO.md.
I could not run tests in this environment because pytest and numpy are unavailable.

Methodology

Finding: P3 informational — documented R deviation for auxiliary partitions
Impact: diff-diff exposes aux_partition="cohort" / "horizon" where R didimputation only matches the default cohort × event-time partition. This is documented in the registry, so it is not a defect.
Concrete fix: None required. Optional: keep future parity claims scoped to aux_partition="cohort_horizon".
Location: docs/methodology/REGISTRY.md:L1335-L1336, diff_diff/imputation.py:L1646-L1687

Finding: P3 informational — projection source validation relies partly on R parity
Impact: The registry notes that Supplementary Proposition A3 is not in the reviewed main-article PDF, and validates the projection empirically against R. This is explicit and acceptable under the project’s documented-deviation rules.
Concrete fix: None required. Optional: add a supplementary-material review note when available.
Location: docs/methodology/REGISTRY.md:L1280-L1284, diff_diff/imputation.py:L1334-L1354

Code Quality

No findings.
Severity: N/A
Impact: The modified inference paths use safe_inference() and do not introduce the known inline t-stat/p-value/CI anti-pattern.
Concrete fix: None.

Performance

Finding: P3 informational, tracked in TODO.md — projection refactor increases repeated solve cost
Impact: _compute_v_untreated_with_covariates() now serves FE-only targets too, so overall/event-study/group/bootstrap paths rebuild and solve the same target-invariant normal equations repeatedly. This is already tracked as deferred performance work.
Concrete fix: No PR blocker. Implement the tracked TODO by caching A0, A1, and a factorization of A0'WA0 per fit(), then solving only target-specific RHS vectors.
Location: diff_diff/imputation.py:L1548-L1587, TODO.md:L156-L160

Maintainability

No findings.
Severity: N/A
Impact: The new helper behavior is documented in code and registry, including sparse-solver fallback behavior.
Concrete fix: None.

Tech Debt

Finding: P3 informational, tracked in TODO.md — covariate-path R parity gap remains
Impact: The FE-design correction also affects covariate variance projection, but the new R golden covers the no-covariate staggered panel only. This is explicitly tracked as low-priority deferred validation.
Concrete fix: No PR blocker. Follow the TODO: add a time-varying covariate R golden or a small dense-design hand calculation for the covariate projection.
Location: TODO.md:L169-L174

Security

No findings.
Severity: N/A
Impact: I did not see committed secrets in the changed benchmark/script/docs/test files.
Concrete fix: None.

Documentation/Tests

Finding: P3 informational — local verification not executed
Impact: I could not run the targeted tests because the review environment lacks pytest and numpy, so this review is static.
Concrete fix: Ensure CI runs tests/test_methodology_imputation.py and the existing ImputationDiD suite with project dependencies installed.
Location: tests/test_methodology_imputation.py:L294-L401, tests/test_imputation.py:L813-L842

…-clustered Eq.8 + R parity Source-validation pass of the Borusyak, Jaravel & Spiess (2024, REStud 91(6)) audit (PR-A #529 added the paper review). Three code corrections in diff_diff/imputation.py (behavior = SE values change; point estimates unchanged): 1. Untreated v_it weights (Theorem 3 conservative variance). The covariate-free path used the BALANCED two-way closed form -(w_i/n0_i + w_t/n0_t - w/N0), wrong for the always-unbalanced Omega_0 in staggered designs -> analytical SEs ~27% too small. Replaced with the exact projection -A0 (A0'A0)^-1 A1' w (the covariate path's method), and fixed that design to keep all unit dummies (the prior drop-first-unit/no-intercept design was one rank short -> a further ~1.6% bias). SEs now match R didimputation::did_imputation (observed ~1e-10; tests assert abs=1e-7). A singular Omega_0 routes to a dense-lstsq fallback (SciPy spsolve returns NaN + MatrixRankWarning without raising; promoted to an error so the fallback fires under production filters). Bootstrap SEs (which resample the same Theorem-3 influence function) may also shift. 2. Auxiliary model (Equation 8): observation-level mean sum(v*tau)/sum(v) -> the paper's unit-clustered sum_i(sum_t v)(sum_t v*tau)/sum_i(sum_t v)^2, NaN-safe. 3. Untreated Step-1 residuals preserve NaN for missing FE (symmetric with the treated path) instead of a silent fillna(0.0). Validation: - tests/test_methodology_imputation.py: paper-equation Verified Components (Theorem 1/2; Theorem 3/eqs 6-8 + white-box unit-clustered Eq.8 hand-calc + NaN-co-group edge + singular-Omega0 dense-fallback regression; Proposition 5 K>=H_bar non-ID; Test 1/eq 9 + Proposition 9) and TestImputationDiDParityR (overall + per-horizon ATT and SE vs didimputation, no silent skips). - benchmarks/R/generate_didimputation_golden.R + benchmarks/data/didimputation_* (didimputation v0.5.0 goldens). - tests/test_imputation.py: tightened the coarser-partition conservatism test. - Full fast suite: 7585 passed (the SE change breaks nothing downstream). Docs/tracker: REGISTRY ## ImputationDiD (Eq.8 now exact unit-clustered + a Deviation-from-R note; v_it observation-weights bullet updated to the exact projection); paper review flipped to "implemented"; METHODOLOGY_REVIEW.md row -> Complete (Verified Components / Corrections Made / Deviations / R Comparison Results); CHANGELOG entry; TODO PR-B rows removed + follow-ups tracked (LOO refinement, projection-factorization caching, covariate-path R parity). Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

github-actions · 2026-06-06T20:40:25Z

🔁 AI review rerun (requested by @igerber)

Head SHA: 9ccb59fa5f6f6163a41f04fa13bd57defd7f89d3

Overall Assessment

✅ Looks good — no unmitigated P0/P1 findings.

Executive Summary

Re-review found no unresolved previous P1+ issues and no new unmitigated P0/P1 findings.
The ImputationDiD variance path now matches the documented exact projection contract in REGISTRY.md.
The auxiliary residual path implements BJS Eq. 8’s unit-clustered aggregator and covers NaN/zero-weight edge cases.
R didimputation parity and the coarser aux_partition deviation are documented; the deviation is informational only.
I could not execute tests locally because SciPy is unavailable in this environment.

Methodology

Finding: P3 informational — documented R deviation for auxiliary partitions
Impact: R parity applies to the default cohort × event-time partition; cohort and horizon partitions are library extensions with no R analogue. This is documented with a Note (deviation from R), so it is not a defect.
Concrete fix: None required.
Location: docs/methodology/REGISTRY.md:L1335-L1336, diff_diff/imputation.py:L1646-L1654

Finding: P3 informational — projection validation partly relies on R parity
Impact: The exact untreated projection is documented and R-parity pinned, but Supplementary Proposition A3 is not fully source-reviewed in the checked-in paper review. This is explicit in the registry and compatible with approval.
Concrete fix: None required; optionally add a supplementary-material review note later.
Location: docs/methodology/REGISTRY.md:L1280-L1284, tests/test_methodology_imputation.py:L576-L644

Code Quality

No findings.
Severity: N/A
Impact: The changed variance and Eq. 8 paths are localized and documented in-code; no new inline inference anti-pattern was introduced in the modified hunks.
Concrete fix: None.

Performance

Finding: P3 informational, tracked in TODO.md — repeated projection/factorization cost
Impact: Routing FE-only variance through _compute_v_untreated_with_covariates() increases repeated sparse design construction and solves across overall/event-study/group/bootstrap targets. This is already tracked as deferred performance work.
Concrete fix: No blocker; implement the tracked cache/factorization reuse later.
Location: diff_diff/imputation.py:L1548-L1587, TODO.md:L160

Maintainability

No findings.
Severity: N/A
Impact: The code, registry, paper review, changelog, and methodology tracker describe the same projection and Eq. 8 behavior.
Concrete fix: None.

Tech Debt

Finding: P3 informational, tracked in TODO.md — covariate-path parity gap remains
Impact: The covariate projection shares the corrected design path, but dedicated covariate R parity/hand-calc is deferred and tracked.
Concrete fix: No blocker; add the tracked covariate golden or dense hand-calc later.
Location: TODO.md:L174

Finding: P3 informational, tracked in TODO.md — LOO variance refinement not implemented
Impact: BJS Supplementary Appendix A.9 leave-one-out refinement remains unimplemented, but the asymptotic Theorem 3 variance is implemented and R parity matches the package default.
Concrete fix: None required for this PR.
Location: TODO.md:L93

Security

No findings.
Severity: N/A
Impact: No secrets or private material were apparent in the added R generator, JSON golden, CSV fixture, docs, or tests.
Concrete fix: None.

Documentation/Tests

Finding: P3 informational — local tests not executed
Impact: I could not run the new methodology tests because scipy is not installed in this review environment. Static review only.
Concrete fix: Ensure CI runs tests/test_methodology_imputation.py and the existing ImputationDiD suite with project dependencies installed.
Location: tests/test_methodology_imputation.py:L1-L54

Bump version 3.5.1 -> 3.5.2 across __init__.py, pyproject.toml, rust/Cargo.toml, llms-full.txt, and CITATION.cff (date-released 2026-06-08). Reconcile the CHANGELOG: the Firpo & Possebom (2018) confidence-sets -by-test-inversion feature (PR igerber#527) was filed under [3.5.1] but merged AFTER the v3.5.1 tag was cut, so the tagged v3.5.1 did not actually contain it. Move that entry into the new [3.5.2] section alongside everything else that landed post-tag (CBWSDID balancing igerber#534, SyntheticControl conformal inference igerber#530, the placebo_effects -> variance_effects rename/deprecation igerber#532, and the ImputationDiD validation + SE fixes igerber#533). The Firpo PR-A paper review (igerber#524, docs-only) stays in [3.5.1] since it was in that tag. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

igerber added the ready-for-ci Triggers CI test workflows label Jun 6, 2026

igerber force-pushed the feature/imputation-eq8-methodology branch from 0647207 to 9ccb59f Compare June 6, 2026 20:37

igerber merged commit fbdcbb9 into main Jun 6, 2026
26 checks passed

igerber deleted the feature/imputation-eq8-methodology branch June 6, 2026 23:13

igerber mentioned this pull request Jun 8, 2026

Release v3.5.2: version bump + CHANGELOG reconciliation #536

Merged

Sunbelt Computer Software

PL/B Language Development and Support

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ImputationDiD methodology validation (PR-B): exact FE variance + unit-clustered Eq.8 + R parity#533

ImputationDiD methodology validation (PR-B): exact FE variance + unit-clustered Eq.8 + R parity#533
igerber merged 1 commit into
mainfrom
feature/imputation-eq8-methodology

igerber commented Jun 6, 2026

Uh oh!

github-actions Bot commented Jun 6, 2026

Uh oh!

github-actions Bot commented Jun 6, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Sunbelt Computer Software

PL/B Language Development and Support

Conversation

igerber commented Jun 6, 2026

Summary

Methodology references

Validation

Security / privacy

Uh oh!

github-actions Bot commented Jun 6, 2026

Overall Assessment

Executive Summary

Methodology

Code Quality

Performance

Maintainability

Tech Debt

Security

Documentation/Tests

Uh oh!

github-actions Bot commented Jun 6, 2026

Overall Assessment

Executive Summary

Methodology

Code Quality

Performance

Maintainability

Tech Debt

Security

Documentation/Tests

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant