PyPI - diffindiff - Versions diffs - 2.2.3__tar.gz → 2.2.5__tar.gz - Mend

diffindiff 2.2.3.tar.gz → 2.2.5.tar.gz

This diff shows the contents of publicly available package versions released to one of the supported registries. It is provided for informational purposes only and reflects the changes between package versions as they appear in their public registries.

Files changed (21)

{diffindiff-2.2.3 → diffindiff-2.2.5}/PKG-INFO RENAMED

@@ -1,12 +1,12 @@
 Metadata-Version: 2.1
 Name: diffindiff
-Version: 2.2.3
+Version: 2.2.5
 Summary: diffindiff: Python library for convenient Difference-in-Differences Analyses
 Author: Thomas Wieland
 Author-email: geowieland@googlemail.com
 Description-Content-Type: text/markdown
-# diffindiff: Difference-in-Differences (DiD) Analysis Python Library
+# diffindiff: A Python library for convenient difference-in-differences analyses
 This Python library is designed for performing Difference-in-Differences (DiD) analyses in a convenient way. It allows users to construct datasets, define treatment and control groups, and set treatment periods. DiD model analyses may be conducted with both datasets created by built-in functions and ready-to-use external datasets. Both simultaneous and staggered adoption are supported. The library allows for various extensions, such as two-way fixed effects models, group- or individual-specific effects, post-treatment periods, and triple-difference estimations. Additionally, it includes functions for visualizing results, such as plotting DiD coefficients with confidence intervals and illustrating the temporal evolution of staggered treatments. Furthermore, several functions for rigorous treatment setting and data diagnostics are incorporated.
@@ -16,9 +16,33 @@ This Python library is designed for performing Difference-in-Differences (DiD) a
 Thomas Wieland [ORCID](https://orcid.org/0000-0001-5168-9846) [EMail](mailto:geowieland@googlemail.com)
-## Updates v2.2.3
-- Bugfixes:
-  - Spillover treatment really works now
+## Availability
+- 📦 PyPI: [diffindiff](https://pypi.org/project/diffindiff/)
+- 💻 GitHub Repository: [diffindiff_official](https://github.com/geowieland/diffindiff_official)
+- 📄 DOI (Zenodo): [10.5281/zenodo.18639559](https://doi.org/10.5281/zenodo.18639559)
+## Citation
+If you use this software, please cite:
+Wieland, T. (2026). diffindiff: A Python library for convenient difference-in-differences analyses (Version 2.2.4) [Computer software]. Zenodo. https://doi.org/10.5281/zenodo.18656820
+## Installation
+To install the package, use `pip`:
+```bash
+pip install diffindiff
+```
+To install the package from GitHub with `pip`:
+```bash
+pip install git+https://github.com/geowieland/diffindiff_official.git
+```
 ## Features
@@ -54,25 +78,6 @@ Thomas Wieland [ORCID](https://orcid.org/0000-0001-5168-9846) [EMail](mailto:geo
   - Test for parallel trend assumption
-## Literature
-  - Baker AC, Larcker DF, Wang CCY (2022) How much should we trust staggered difference-in-differences estimates? *Journal of Financial Economics* 144(2): 370-395. [10.1016/j.jfineco.2022.01.004](https://doi.org/10.1016/j.jfineco.2022.01.004)
-  - Card D, Krueger AD (1994) Minimum Wages and Employment: A Case Study of the Fast Food Industry in New Jersey and Pennsylvania. *The American Economic Review* 84(4): 772-793. [JSTOR](https://www.jstor.org/stable/2677856)
-  - de Haas S, Götz G, Heim S (2022) Measuring the effect of COVID‑19‑related night curfews in a bundled intervention within Germany. *Scientific Reports* 12: 19732. [10.1038/s41598-022-24086-9](https://doi.org/10.1038/s41598-022-24086-9)
-  - Goodman-Bacon A (2021) Difference-in-differences with variation in treatment timing. *Journal of Econometrics* 225(2): 254-277. [10.1016/j.jeconom.2021.03.014](https://doi.org/10.1016/j.jeconom.2021.03.014)
-  - Greene WH (2012) *Econometric Analysis*.
-  - Goldfarb A, Tucker C, Wang Y (2022) Conducting Research in Marketing with Quasi-Experiments. *Journal of Marketing* 86(3): 1-19. [10.1177/00222429221082977](https://doi.org/10.1177/00222429221082977)
-  - Isporhing IE, Lipfert M, Pestel N (2021) Does re-opening schools contribute to the spread of SARS-CoV-2? Evidence from staggered summer breaks in Germany. *Journal of Public Economics* 198: 104426. [10.1016/j.jpubeco.2021.104426](https://doi.org/10.1016/j.jpubeco.2021.104426)
-  - Li KT, Luo L, Pattabhiramaiah A (2024) Causal Inference with Quasi-Experimental Data. *IMPACT at JMR* November 13, 2024. [AMA](https://www.ama.org/marketing-news/causal-inference-with-quasi-experimental-data/)
-  - Olden A (2018) What do you buy when no one's watching? The effect of self-service checkouts on the composition of sales in retail. Discussion paper FOR 3/18, Norwegian School of Economics, Norway. [http://hdl.handle.net/11250/2490886](http://hdl.handle.net/11250/2490886)
-  - Olden A, Moen J (2022) The triple difference estimator. *The Econometrics Journal* 25(3): 531-553. [10.1093/ectj/utac010](https://doi.org/10.1093/ectj/utac010)
-  - Strassmann A, Çolak Y, Serra-Burriel M, Nordestgaard BG, Turk A, Afzal S, Puhan MA (2023) Nationwide indoor smoking ban and impact on smoking behaviour and lung function: a two-population natural experiment. *Thorax* 78(2): 144-150. [10.1136/thoraxjnl-2021-218436](https://doi.org/10.1136/thoraxjnl-2021-218436)
-  - Villa JM (2016) diff: Simplifying the estimation of difference-in-differences treatment effects. *The Stata Journal* 16(1): 52-71. [10.1177/1536867X1601600108](https://doi.org/10.1177/1536867X1601600108)
-  - von Bismarck-Osten C, Borusyak K, Schönberg U (2022) The role of schools in transmission of the SARS-CoV-2 virus: quasi-experimental evidence from Germany. *Economic Policy* 37(109): 87–130. [10.1093/epolic/eiac001](https://doi.org/10.1093/epolic/eiac001)
-  - Wieland T (2025) Assessing the effectiveness of non-pharmaceutical interventions in the SARS-CoV-2 pandemic: results of a natural experiment regarding Baden-Württemberg (Germany) and Switzerland in the second infection wave. *Journal of Public Health: From Theory to Practice* 33(11): 2497-2511. [10.1007/s10389-024-02218-x](https://doi.org/10.1007/s10389-024-02218-x)
-  - Wooldridge JM (2012) *Introductory Econometrics. A Modern Approach*.
 ## Examples
 ```python
@@ -143,9 +148,27 @@ curfew_model_withgroups.plot_group_treatment_effects(
 See the /tests directory for usage examples of most of the included functions.
-## Installation
+## Literature
-To install the package, use `pip`:
+  - Baker AC, Larcker DF, Wang CCY (2022) How much should we trust staggered difference-in-differences estimates? *Journal of Financial Economics* 144(2): 370-395. [10.1016/j.jfineco.2022.01.004](https://doi.org/10.1016/j.jfineco.2022.01.004)
+  - Card D, Krueger AD (1994) Minimum Wages and Employment: A Case Study of the Fast Food Industry in New Jersey and Pennsylvania. *The American Economic Review* 84(4): 772-793. [JSTOR](https://www.jstor.org/stable/2677856)
+  - de Haas S, Götz G, Heim S (2022) Measuring the effect of COVID‑19‑related night curfews in a bundled intervention within Germany. *Scientific Reports* 12: 19732. [10.1038/s41598-022-24086-9](https://doi.org/10.1038/s41598-022-24086-9)
+  - Goodman-Bacon A (2021) Difference-in-differences with variation in treatment timing. *Journal of Econometrics* 225(2): 254-277. [10.1016/j.jeconom.2021.03.014](https://doi.org/10.1016/j.jeconom.2021.03.014)
+  - Greene WH (2012) *Econometric Analysis*.
+  - Goldfarb A, Tucker C, Wang Y (2022) Conducting Research in Marketing with Quasi-Experiments. *Journal of Marketing* 86(3): 1-19. [10.1177/00222429221082977](https://doi.org/10.1177/00222429221082977)
+  - Isporhing IE, Lipfert M, Pestel N (2021) Does re-opening schools contribute to the spread of SARS-CoV-2? Evidence from staggered summer breaks in Germany. *Journal of Public Economics* 198: 104426. [10.1016/j.jpubeco.2021.104426](https://doi.org/10.1016/j.jpubeco.2021.104426)
+  - Li KT, Luo L, Pattabhiramaiah A (2024) Causal Inference with Quasi-Experimental Data. *IMPACT at JMR* November 13, 2024. [AMA](https://www.ama.org/marketing-news/causal-inference-with-quasi-experimental-data/)
+  - Olden A (2018) What do you buy when no one's watching? The effect of self-service checkouts on the composition of sales in retail. Discussion paper FOR 3/18, Norwegian School of Economics, Norway. [http://hdl.handle.net/11250/2490886](http://hdl.handle.net/11250/2490886)
+  - Olden A, Moen J (2022) The triple difference estimator. *The Econometrics Journal* 25(3): 531-553. [10.1093/ectj/utac010](https://doi.org/10.1093/ectj/utac010)
+  - Strassmann A, Çolak Y, Serra-Burriel M, Nordestgaard BG, Turk A, Afzal S, Puhan MA (2023) Nationwide indoor smoking ban and impact on smoking behaviour and lung function: a two-population natural experiment. *Thorax* 78(2): 144-150. [10.1136/thoraxjnl-2021-218436](https://doi.org/10.1136/thoraxjnl-2021-218436)
+  - Villa JM (2016) diff: Simplifying the estimation of difference-in-differences treatment effects. *The Stata Journal* 16(1): 52-71. [10.1177/1536867X1601600108](https://doi.org/10.1177/1536867X1601600108)
+  - von Bismarck-Osten C, Borusyak K, Schönberg U (2022) The role of schools in transmission of the SARS-CoV-2 virus: quasi-experimental evidence from Germany. *Economic Policy* 37(109): 87–130. [10.1093/epolic/eiac001](https://doi.org/10.1093/epolic/eiac001)
+  - Wieland T (2025) Assessing the effectiveness of non-pharmaceutical interventions in the SARS-CoV-2 pandemic: results of a natural experiment regarding Baden-Württemberg (Germany) and Switzerland in the second infection wave. *Journal of Public Health: From Theory to Practice* 33(11): 2497-2511. [10.1007/s10389-024-02218-x](https://doi.org/10.1007/s10389-024-02218-x)
+  - Wooldridge JM (2012) *Introductory Econometrics. A Modern Approach*.
-```bash
-pip install diffindiff
+## What's new (v2.2.5)
+- Bugfixes:
+  - Incorrect import
+- Other:
+  - Update README

{diffindiff-2.2.3 → diffindiff-2.2.5}/README.md RENAMED

@@ -1,4 +1,4 @@
-# diffindiff: Difference-in-Differences (DiD) Analysis Python Library
+# diffindiff: A Python library for convenient difference-in-differences analyses
 This Python library is designed for performing Difference-in-Differences (DiD) analyses in a convenient way. It allows users to construct datasets, define treatment and control groups, and set treatment periods. DiD model analyses may be conducted with both datasets created by built-in functions and ready-to-use external datasets. Both simultaneous and staggered adoption are supported. The library allows for various extensions, such as two-way fixed effects models, group- or individual-specific effects, post-treatment periods, and triple-difference estimations. Additionally, it includes functions for visualizing results, such as plotting DiD coefficients with confidence intervals and illustrating the temporal evolution of staggered treatments. Furthermore, several functions for rigorous treatment setting and data diagnostics are incorporated.
@@ -8,9 +8,33 @@ This Python library is designed for performing Difference-in-Differences (DiD) a
 Thomas Wieland [ORCID](https://orcid.org/0000-0001-5168-9846) [EMail](mailto:geowieland@googlemail.com)
-## Updates v2.2.3
-- Bugfixes:
-  - Spillover treatment really works now
+## Availability
+- 📦 PyPI: [diffindiff](https://pypi.org/project/diffindiff/)
+- 💻 GitHub Repository: [diffindiff_official](https://github.com/geowieland/diffindiff_official)
+- 📄 DOI (Zenodo): [10.5281/zenodo.18639559](https://doi.org/10.5281/zenodo.18639559)
+## Citation
+If you use this software, please cite:
+Wieland, T. (2026). diffindiff: A Python library for convenient difference-in-differences analyses (Version 2.2.4) [Computer software]. Zenodo. https://doi.org/10.5281/zenodo.18656820
+## Installation
+To install the package, use `pip`:
+```bash
+pip install diffindiff
+```
+To install the package from GitHub with `pip`:
+```bash
+pip install git+https://github.com/geowieland/diffindiff_official.git
+```
 ## Features
@@ -46,25 +70,6 @@ Thomas Wieland [ORCID](https://orcid.org/0000-0001-5168-9846) [EMail](mailto:geo
   - Test for parallel trend assumption
-## Literature
-  - Baker AC, Larcker DF, Wang CCY (2022) How much should we trust staggered difference-in-differences estimates? *Journal of Financial Economics* 144(2): 370-395. [10.1016/j.jfineco.2022.01.004](https://doi.org/10.1016/j.jfineco.2022.01.004)
-  - Card D, Krueger AD (1994) Minimum Wages and Employment: A Case Study of the Fast Food Industry in New Jersey and Pennsylvania. *The American Economic Review* 84(4): 772-793. [JSTOR](https://www.jstor.org/stable/2677856)
-  - de Haas S, Götz G, Heim S (2022) Measuring the effect of COVID‑19‑related night curfews in a bundled intervention within Germany. *Scientific Reports* 12: 19732. [10.1038/s41598-022-24086-9](https://doi.org/10.1038/s41598-022-24086-9)
-  - Goodman-Bacon A (2021) Difference-in-differences with variation in treatment timing. *Journal of Econometrics* 225(2): 254-277. [10.1016/j.jeconom.2021.03.014](https://doi.org/10.1016/j.jeconom.2021.03.014)
-  - Greene WH (2012) *Econometric Analysis*.
-  - Goldfarb A, Tucker C, Wang Y (2022) Conducting Research in Marketing with Quasi-Experiments. *Journal of Marketing* 86(3): 1-19. [10.1177/00222429221082977](https://doi.org/10.1177/00222429221082977)
-  - Isporhing IE, Lipfert M, Pestel N (2021) Does re-opening schools contribute to the spread of SARS-CoV-2? Evidence from staggered summer breaks in Germany. *Journal of Public Economics* 198: 104426. [10.1016/j.jpubeco.2021.104426](https://doi.org/10.1016/j.jpubeco.2021.104426)
-  - Li KT, Luo L, Pattabhiramaiah A (2024) Causal Inference with Quasi-Experimental Data. *IMPACT at JMR* November 13, 2024. [AMA](https://www.ama.org/marketing-news/causal-inference-with-quasi-experimental-data/)
-  - Olden A (2018) What do you buy when no one's watching? The effect of self-service checkouts on the composition of sales in retail. Discussion paper FOR 3/18, Norwegian School of Economics, Norway. [http://hdl.handle.net/11250/2490886](http://hdl.handle.net/11250/2490886)
-  - Olden A, Moen J (2022) The triple difference estimator. *The Econometrics Journal* 25(3): 531-553. [10.1093/ectj/utac010](https://doi.org/10.1093/ectj/utac010)
-  - Strassmann A, Çolak Y, Serra-Burriel M, Nordestgaard BG, Turk A, Afzal S, Puhan MA (2023) Nationwide indoor smoking ban and impact on smoking behaviour and lung function: a two-population natural experiment. *Thorax* 78(2): 144-150. [10.1136/thoraxjnl-2021-218436](https://doi.org/10.1136/thoraxjnl-2021-218436)
-  - Villa JM (2016) diff: Simplifying the estimation of difference-in-differences treatment effects. *The Stata Journal* 16(1): 52-71. [10.1177/1536867X1601600108](https://doi.org/10.1177/1536867X1601600108)
-  - von Bismarck-Osten C, Borusyak K, Schönberg U (2022) The role of schools in transmission of the SARS-CoV-2 virus: quasi-experimental evidence from Germany. *Economic Policy* 37(109): 87–130. [10.1093/epolic/eiac001](https://doi.org/10.1093/epolic/eiac001)
-  - Wieland T (2025) Assessing the effectiveness of non-pharmaceutical interventions in the SARS-CoV-2 pandemic: results of a natural experiment regarding Baden-Württemberg (Germany) and Switzerland in the second infection wave. *Journal of Public Health: From Theory to Practice* 33(11): 2497-2511. [10.1007/s10389-024-02218-x](https://doi.org/10.1007/s10389-024-02218-x)
-  - Wooldridge JM (2012) *Introductory Econometrics. A Modern Approach*.
 ## Examples
 ```python
@@ -135,9 +140,27 @@ curfew_model_withgroups.plot_group_treatment_effects(
 See the /tests directory for usage examples of most of the included functions.
-## Installation
+## Literature
-To install the package, use `pip`:
+  - Baker AC, Larcker DF, Wang CCY (2022) How much should we trust staggered difference-in-differences estimates? *Journal of Financial Economics* 144(2): 370-395. [10.1016/j.jfineco.2022.01.004](https://doi.org/10.1016/j.jfineco.2022.01.004)
+  - Card D, Krueger AD (1994) Minimum Wages and Employment: A Case Study of the Fast Food Industry in New Jersey and Pennsylvania. *The American Economic Review* 84(4): 772-793. [JSTOR](https://www.jstor.org/stable/2677856)
+  - de Haas S, Götz G, Heim S (2022) Measuring the effect of COVID‑19‑related night curfews in a bundled intervention within Germany. *Scientific Reports* 12: 19732. [10.1038/s41598-022-24086-9](https://doi.org/10.1038/s41598-022-24086-9)
+  - Goodman-Bacon A (2021) Difference-in-differences with variation in treatment timing. *Journal of Econometrics* 225(2): 254-277. [10.1016/j.jeconom.2021.03.014](https://doi.org/10.1016/j.jeconom.2021.03.014)
+  - Greene WH (2012) *Econometric Analysis*.
+  - Goldfarb A, Tucker C, Wang Y (2022) Conducting Research in Marketing with Quasi-Experiments. *Journal of Marketing* 86(3): 1-19. [10.1177/00222429221082977](https://doi.org/10.1177/00222429221082977)
+  - Isporhing IE, Lipfert M, Pestel N (2021) Does re-opening schools contribute to the spread of SARS-CoV-2? Evidence from staggered summer breaks in Germany. *Journal of Public Economics* 198: 104426. [10.1016/j.jpubeco.2021.104426](https://doi.org/10.1016/j.jpubeco.2021.104426)
+  - Li KT, Luo L, Pattabhiramaiah A (2024) Causal Inference with Quasi-Experimental Data. *IMPACT at JMR* November 13, 2024. [AMA](https://www.ama.org/marketing-news/causal-inference-with-quasi-experimental-data/)
+  - Olden A (2018) What do you buy when no one's watching? The effect of self-service checkouts on the composition of sales in retail. Discussion paper FOR 3/18, Norwegian School of Economics, Norway. [http://hdl.handle.net/11250/2490886](http://hdl.handle.net/11250/2490886)
+  - Olden A, Moen J (2022) The triple difference estimator. *The Econometrics Journal* 25(3): 531-553. [10.1093/ectj/utac010](https://doi.org/10.1093/ectj/utac010)
+  - Strassmann A, Çolak Y, Serra-Burriel M, Nordestgaard BG, Turk A, Afzal S, Puhan MA (2023) Nationwide indoor smoking ban and impact on smoking behaviour and lung function: a two-population natural experiment. *Thorax* 78(2): 144-150. [10.1136/thoraxjnl-2021-218436](https://doi.org/10.1136/thoraxjnl-2021-218436)
+  - Villa JM (2016) diff: Simplifying the estimation of difference-in-differences treatment effects. *The Stata Journal* 16(1): 52-71. [10.1177/1536867X1601600108](https://doi.org/10.1177/1536867X1601600108)
+  - von Bismarck-Osten C, Borusyak K, Schönberg U (2022) The role of schools in transmission of the SARS-CoV-2 virus: quasi-experimental evidence from Germany. *Economic Policy* 37(109): 87–130. [10.1093/epolic/eiac001](https://doi.org/10.1093/epolic/eiac001)
+  - Wieland T (2025) Assessing the effectiveness of non-pharmaceutical interventions in the SARS-CoV-2 pandemic: results of a natural experiment regarding Baden-Württemberg (Germany) and Switzerland in the second infection wave. *Journal of Public Health: From Theory to Practice* 33(11): 2497-2511. [10.1007/s10389-024-02218-x](https://doi.org/10.1007/s10389-024-02218-x)
+  - Wooldridge JM (2012) *Introductory Econometrics. A Modern Approach*.
-```bash
-pip install diffindiff
+## What's new (v2.2.5)
+- Bugfixes:
+  - Incorrect import
+- Other:
+  - Update README

{diffindiff-2.2.3 → diffindiff-2.2.5}/diffindiff/didanalysis.py RENAMED

@@ -4,8 +4,8 @@
 # Author:      Thomas Wieland
 #              ORCID: 0000-0001-5168-9846
 #              mail: geowieland@googlemail.com
-# Version:     2.2.1
-# Last update: 2025-12-06 12:26
+# Version:     2.2.2
+# Last update: 2025-12-07 10:27
 # Copyright (c) 2025 Thomas Wieland
 #-----------------------------------------------------------------------
@@ -893,7 +893,7 @@ class DiffModel:
         treatment_diagnostics = model_config["treatment_diagnostics"]
         no_treatments = model_config["no_treatments"]
         outcome_col = model_config["outcome_col"]
-        outcome_col_predicted = outcome_col+"_predicted"
+        outcome_col_predicted = f"{outcome_col}{config.PREDICTED_SUFFIX}"
         if TG_col is None and treatment is None:
             if no_treatments == 1:
@@ -928,8 +928,7 @@ class DiffModel:
         if ("TG" in plot_intervals_groups and "CG" not in plot_intervals_groups) or ("CG" in plot_intervals_groups and "TG" not in plot_intervals_groups):
             lines_labels_required = lines_labels_required+1
         if "TG" in plot_intervals_groups and "CG" in plot_intervals_groups:
-            lines_labels_required = lines_labels_required+2
+            lines_labels_required = lines_labels_required+2
         assert len(lines_col) == lines_col_required, f"Parameter 'lines_col' must be a list with {lines_col_required} entries"
         assert len(lines_style) == lines_style_required, f"Parameter 'lines_style' must be a list with {lines_col_required} entries"
         assert len(lines_labels) == lines_labels_required, f"Parameter 'lines_labels' must be a list with {lines_labels_required} entries"
@@ -1392,6 +1391,13 @@ def did_analysis(
         *treatment_col
         ]
+    data = tools.panel_index(
+        data = data,
+        unit_col = unit_col,
+        time_col = time_col,
+        verbose = verbose
+        )
     treatment_diagnostics_results = helper.treatment_diagnostics(
         data = data,
         unit_col=unit_col,
@@ -1731,6 +1737,7 @@ def did_analysis(
         spillover = helper.create_spillover(
             data=data,
             unit_col=unit_col,
+            time_col=time_col,
             treatment_col=treatment_col,
             spillover_treatment=spillover_treatment,
             spillover_units=spillover_units
@@ -1928,7 +1935,14 @@ def ddd_analysis(
             )
         cols_relevant = cols_relevant + covariates
+    data = tools.panel_index(
+        data = data,
+        unit_col = unit_col,
+        time_col = time_col,
+        verbose = verbose
+        )
     treatment_diagnostics_results = helper.treatment_diagnostics(
         data = data,
         unit_col=unit_col,

{diffindiff-2.2.3 → diffindiff-2.2.5}/diffindiff/didanalysis_helper.py RENAMED

@@ -4,8 +4,8 @@
 # Author:      Thomas Wieland
 #              ORCID: 0000-0001-5168-9846
 #              mail: geowieland@googlemail.com
-# Version:     1.0.4
-# Last update: 2025-12-06 12:26
+# Version:     1.0.5
+# Last update: 2025-12-07 10:27
 # Copyright (c) 2025 Thomas Wieland
 #-----------------------------------------------------------------------
@@ -172,7 +172,9 @@ def create_specific_treatment_effects(
 def create_spillover(
     data: pd.DataFrame,
     unit_col: str,
+    time_col: str,
     treatment_col: list,
+    TT_col: str = None,
     spillover_treatment: list = [],
     spillover_units: list = [],
     verbose: bool = config.VERBOSE
@@ -191,6 +193,19 @@ def create_spillover(
     for treatment in treatment_col:
+        if TT_col is None:
+            TT_col = config.TT_COL
+            data = tools.treatment_time_col(
+                data = data,
+                unit_col = unit_col,
+                time_col = time_col,
+                treatment_col = treatment,
+                create_TT_col = TT_col,
+                verbose = verbose
+                )[0]
         sp_unit_col = f"{config.SPILLOVER_UNIT_PREFIX}{config.DELIMITER}{treatment}"
         sp_treatment_col = f"{config.SPILLOVER_PREFIX}{config.DELIMITER}{treatment}"
@@ -205,7 +220,7 @@ def create_spillover(
             sp_unit_col
             ] = 1
-        data[sp_treatment_col] = data[sp_unit_col]*data[treatment]
+        data[sp_treatment_col] = data[sp_unit_col]*data[TT_col]
     spillover_treatment_vars_join = ' + '.join(spillover_treatment_vars)
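
Note on the `create_spillover` hunks above: the spillover dummy is now built from the treatment-time column (`TT_col`) rather than from the treatment column itself. One plausible reading, assuming spillover units are not themselves treated, is that the old product could never be non-zero for them. The toy panel below is a sketch of that reading only; the column names (`unit`, `t`, `treated`, `sp_unit`, `sp_old`, `sp_new`) are illustrative and are not taken from the library.

```python
import pandas as pd

# Toy panel: unit "A" is treated from period 2 on, "B" is a hypothetical
# spillover unit, "C" is an unaffected control.
df = pd.DataFrame({
    "unit": ["A"] * 3 + ["B"] * 3 + ["C"] * 3,
    "t": [1, 2, 3] * 3,
    "treated": [0, 1, 1, 0, 0, 0, 0, 0, 0],
})

# Treatment-time dummy: 1 in every period in which any unit is treated.
treatment_periods = df.loc[df["treated"] == 1, "t"].unique()
df["TT"] = df["t"].isin(treatment_periods).astype(int)

# Spillover-unit dummy for the assumed spillover unit "B".
df["sp_unit"] = (df["unit"] == "B").astype(int)

# Product as in the removed line: spillover unit x own treatment dummy.
df["sp_old"] = df["sp_unit"] * df["treated"]

# Product as in the added line: spillover unit x treatment-time dummy.
df["sp_new"] = df["sp_unit"] * df["TT"]

print(df[["unit", "t", "sp_old", "sp_new"]])
```

In this toy data `sp_old` is zero in every row, while `sp_new` flags unit "B" in the treatment periods.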

{diffindiff-2.2.3 → diffindiff-2.2.5}/diffindiff/diddata.py RENAMED

@@ -4,8 +4,8 @@
 # Author:      Thomas Wieland
 #              ORCID: 0000-0001-5168-9846
 #              mail: geowieland@googlemail.com
-# Version:     2.1.3
-# Last update: 2025-12-06 10:49
+# Version:     2.1.5
+# Last update: 2025-12-07 10:27
 # Copyright (c) 2025 Thomas Wieland
 #-----------------------------------------------------------------------
@@ -478,7 +478,7 @@ class DiffData:
         ):
         if unit_col is None and time_col is None:
-            raise ValueError("unit_col and/or time_col must be stated")
+            raise ValueError("Parameter 'unit_col' and/or 'time_col' must be stated")
         if verbose:
             if len(variables) > 0:
@@ -488,16 +488,16 @@ class DiffData:
         did_modeldata = self.get_did_modeldata_df()
+        additional_df = tools.panel_index(
+            data=additional_df,
+            unit_col=unit_col,
+            time_col=time_col,
+            verbose=verbose
+            )
         existing_variables = []
-        if unit_col is not None and time_col is not None:
-            additional_df = tools.panel_index(
-                data=additional_df,
-                unit_col=unit_col,
-                time_col=time_col,
-                verbose=verbose
-                )
+        if unit_col is not None and time_col is not None:
             if variables is None:
@@ -659,15 +659,15 @@ class DiffData:
         new_merge = tools.panel_index(
             data=new_merge,
-            unit_col=unit_id_col,
-            time_col=time_col,
+            unit_col=config.UNIT_COL,
+            time_col=config.TIME_COL,
             verbose=verbose
             )
         did_modeldata_old = tools.panel_index(
             data=did_modeldata_old,
-            unit_col=unit_id_col,
-            time_col=time_col,
+            unit_col=config.UNIT_COL,
+            time_col=config.TIME_COL,
             verbose=verbose
             )
@@ -1055,24 +1055,12 @@ def merge_data(
         treatment_data_df,
         how = "cross"
         )
-    if drop_missing or missing_replace_by_zero:
-        modeldata_ismissing = tools.is_missing(
-            data = did_modeldata,
-            drop_missing = drop_missing,
-            missing_replace_by_zero = missing_replace_by_zero,
-            verbose = False
-            )
-        did_modeldata = modeldata_ismissing[2]
     did_modeldata[treatment_name] = did_modeldata[TG_col] * did_modeldata[TT_col]
     if treatment_config["after_treatment_period"]:
         did_modeldata[after_treatment_name] = did_modeldata[TG_col] * did_modeldata[ATT_col]
-    if np.dtype(did_modeldata[config.TIME_COL]) != np.dtype(outcome_data[time_col]):
-        print(f"WARNING: Time columns of treatment data and outcome data differ: {str(np.dtype(did_modeldata[config.TIME_COL]))}, {str(np.dtype(outcome_data[time_col]))}. This might induce an error while building the model dataset.")
     did_modeldata = tools.panel_index(
         data=did_modeldata,
         unit_col=config.UNIT_COL,
@@ -1086,7 +1074,7 @@ def merge_data(
         time_col=time_col,
         verbose=verbose
         )
     if keep_columns:
         outcome_data_short = outcome_data
     else:
@@ -1097,6 +1085,15 @@ def merge_data(
         on=config.UNIT_TIME_COL,
         how="left"
         )
+    if drop_missing or missing_replace_by_zero:
+        modeldata_ismissing = tools.is_missing(
+            data = did_modeldata,
+            drop_missing = drop_missing,
+            missing_replace_by_zero = missing_replace_by_zero,
+            verbose = False
+            )
+        did_modeldata = modeldata_ismissing[2]
     outcome_col_original = outcome_col
     unit_time_col_original = unit_id_col, time_col
@@ -1230,7 +1227,7 @@ def create_counterfactual(
         unit_col = unit_col,
         time_col = time_col,
         treatment_col = treatment_col
-        )
+        )[0]
     units = tools.unique(units_tt[unit_col])
     if not isnotreatment[0]:
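
Note on the `merge_data` hunks above: the missing-value handling moves from before the left merge with the outcome data to after it. A likely motivation is that a left merge can itself introduce missing values, which a check before the merge cannot see. The sketch below uses plain pandas (`isna`, `dropna`) as a stand-in for `tools.is_missing`, whose implementation is not part of this diff; the frames and the column name `unit_time` are made up for illustration.

```python
import pandas as pd

# Treatment grid: every unit-period combination (illustrative values).
treatment_grid = pd.DataFrame({
    "unit_time": ["A_1", "A_2", "B_1", "B_2"],
    "TG": [1, 1, 0, 0],
    "TT": [0, 1, 0, 1],
})

# Outcome data with one unit-period missing ("B_2").
outcome = pd.DataFrame({
    "unit_time": ["A_1", "A_2", "B_1"],
    "y": [10.0, 12.0, 9.0],
})

# Checking before the merge finds nothing to handle.
n_missing_before = int(treatment_grid.isna().sum().sum())

# The left merge keeps all grid rows and fills unmatched outcomes with NaN.
merged = treatment_grid.merge(outcome, on="unit_time", how="left")
n_missing_after = int(merged.isna().sum().sum())

# Handling missing values after the merge catches the new gap.
cleaned = merged.dropna()

print(n_missing_before, n_missing_after, len(cleaned))
```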

{diffindiff-2.2.3 → diffindiff-2.2.5}/diffindiff/didtools.py RENAMED

@@ -4,8 +4,8 @@
 # Author:      Thomas Wieland
 #              ORCID: 0000-0001-5168-9846
 #              mail: geowieland@googlemail.com
-# Version:     2.1.1
-# Last update: 2025-12-06 10:48
+# Version:     2.1.4
+# Last update: 2025-12-07 10:27
 # Copyright (c) 2025 Thomas Wieland
 #-----------------------------------------------------------------------
@@ -54,25 +54,39 @@ def panel_index(
     ):
     to_str = []
-    unit_x_time = True
-    if data[unit_col].dtype != 'object':
-        data[unit_col] = data[unit_col].astype(str)
-        to_str.append(unit_col)
+    if unit_col is not None:
+        if data[unit_col].dtype != 'object':
+            data[unit_col] = data[unit_col].astype(str)
+            to_str.append(unit_col)
+    else:
+        if verbose:
+            print("NOTE: No unit column was stated")
+    if time_col is not None:
+        if data[time_col].dtype != 'object':
+            data[time_col] = data[time_col].astype(str)
+            to_str.append(time_col)
+    else:
+        if verbose:
+            print("NOTE: No time column was stated")
-    if data[time_col].dtype != 'object':
-        data[time_col] = data[time_col].astype(str)
-        to_str.append(time_col)
+    if verbose and len(to_str) > 0:
+        print(f"NOTE: The following columns were converted to str: {', '.join(to_str)}.")
     if config.UNIT_TIME_COL not in data.columns:
-        unit_x_time = False
-        data[config.UNIT_TIME_COL] = data[unit_col]+config.DELIMITER+data[time_col]
+        if unit_col is not None and time_col is not None:
-    if verbose:
-        if len(to_str) > 0:
-            print(f"NOTE: The following columns were converted to str: {', '.join(to_str)}.")
-        if not unit_x_time:
-            print(f"NOTE: The following unit-time-index column was included: {config.UNIT_TIME_COL}.")
+            data[config.UNIT_TIME_COL] = data[unit_col]+config.DELIMITER+data[time_col]
+            if verbose:
+                print(f"NOTE: The following unit-time-index column was created: {config.UNIT_TIME_COL}.")
+        else:
+            if verbose:
+                print("No unit-time-index column was created.")
     return data
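
Note on the `panel_index` hunk above: the function now tolerates `unit_col` or `time_col` being `None` and only builds the unit-time index when both are given. The following is a minimal stand-alone sketch of that control flow, not the library's function. `UNIT_TIME_COL` and `DELIMITER` are hypothetical stand-ins for `config.UNIT_TIME_COL` and `config.DELIMITER`, whose actual values do not appear in this diff, and the sketch works on a copy, whereas the code in the diff assigns to `data` directly.

```python
import pandas as pd

# Hypothetical stand-ins for config.UNIT_TIME_COL and config.DELIMITER.
UNIT_TIME_COL = "unit_time"
DELIMITER = "_"


def panel_index_sketch(data, unit_col=None, time_col=None):
    """Illustrative re-implementation of the None-tolerant control flow."""
    data = data.copy()  # assumption: avoid mutating the caller's frame

    # Convert each stated column to str; skip columns that were not stated.
    for col in (unit_col, time_col):
        if col is not None and data[col].dtype != "object":
            data[col] = data[col].astype(str)

    # Build the unit-time index only when both columns are available.
    if UNIT_TIME_COL not in data.columns:
        if unit_col is not None and time_col is not None:
            data[UNIT_TIME_COL] = data[unit_col] + DELIMITER + data[time_col]

    return data


df = pd.DataFrame({
    "unit": [1, 1, 2, 2],
    "year": [2020, 2021, 2020, 2021],
    "y": [1.0, 2.0, 3.0, 4.0],
})

indexed = panel_index_sketch(df, unit_col="unit", time_col="year")
only_unit = panel_index_sketch(df, unit_col="unit")

print(indexed)
print(only_unit.columns.tolist())
```

With only `unit_col` stated, the unit column is still converted to strings, but no index column is added.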
@@ -170,8 +184,8 @@ def is_binary(
     if verbose:
         print("OK")
-    if not binary:
-        print(f"NOTE: treatment column '{treatment_col}' is not binary. Likely treatment format is: {treatment_format}.")
+        if not binary:
+            print(f"NOTE: treatment column '{treatment_col}' is not binary. Likely treatment format is: {treatment_format}.")
     return [
         binary,
@@ -268,8 +282,8 @@ def is_simultaneous(
     if verbose:
         print("OK")
-    if not simultaneous and data_isnotreatment[0]:
-        print(f"NOTE: treatment '{treatment_col}' is not simultaneous.")
+        if not simultaneous and data_isnotreatment[0]:
+            print(f"NOTE: treatment '{treatment_col}' is not simultaneous.")
     if simultaneous and not data_isnotreatment[0]:
         print(f"WARNING: treatment '{treatment_col}' is simultaneous and does not include a {config.NO_TREATMENT_CG_DESCRIPTION}")
@@ -303,8 +317,8 @@ def is_notreatment(
     if verbose:
         print("OK")
-    if not no_treatment:
-        print(f"NOTE: treatment '{treatment_col}' does not include a {config.NO_TREATMENT_CG_DESCRIPTION}.")
+        if not no_treatment:
+            print(f"NOTE: treatment '{treatment_col}' does not include a {config.NO_TREATMENT_CG_DESCRIPTION}.")
     return [
         no_treatment,
@@ -342,8 +356,8 @@ def treatment_group_col(
     if verbose:
         print("OK")
-    if create_TG_col_exists:
-        print(f"NOTE: Column {create_TG_col} already exists. Saved treatment group in column {config.TG_COL}{config.DELIMITER}{treatment_col}.")
+        if create_TG_col_exists:
+            print(f"NOTE: Column {create_TG_col} already exists. Saved treatment group in column {config.TG_COL}{config.DELIMITER}{treatment_col}.")
     return [
         data,
@@ -351,6 +365,32 @@ def treatment_group_col(
         create_TG_col
         ]
+def treatment_time_col(
+    data: pd.DataFrame,
+    unit_col: str,
+    time_col: str,
+    treatment_col: str,
+    create_TT_col: str = "TT",
+    verbose: bool = config.VERBOSE
+    ):
+    tt = treatment_times(
+        data = data,
+        unit_col = unit_col,
+        time_col = time_col,
+        treatment_col = treatment_col,
+        verbose = verbose
+        )[1]
+    data[create_TT_col] = 0
+    data.loc[data[time_col].isin(tt), create_TT_col] = 1
+    return [
+        data,
+        tt,
+        create_TT_col
+    ]
 def untreated_units(
     data: pd.DataFrame,
     unit_col: str,
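For orientation, the new `treatment_time_col` helper in the hunk above comes down to an `isin` mask over the time column. The following is a minimal standalone sketch of that step only; the toy panel, its column names (`unit`, `t`, `treated`) and the sorted `tt` list are illustrative assumptions, not taken from the package.

```python
import pandas as pd

# Toy panel: two units observed over four periods (illustrative data).
panel = pd.DataFrame({
    "unit": ["A"] * 4 + ["B"] * 4,
    "t": [1, 2, 3, 4] * 2,
    "treated": [0, 0, 1, 1, 0, 0, 0, 0],
})

# Time points at which at least one unit is treated.
tt = sorted(int(x) for x in panel.loc[panel["treated"] == 1, "t"].unique())

# Flag every row whose time point is one of those treatment times.
panel["TT"] = 0
panel.loc[panel["t"].isin(tt), "TT"] = 1
```

Note that the flag marks periods, not units: the never-treated unit `B` also gets `TT = 1` in periods 3 and 4. The helper as shown in the diff writes the new column into the DataFrame it receives, so callers who need the input untouched could pass `data.copy()`.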
@@ -460,10 +500,11 @@ def is_prepost(
     if verbose:
         print("OK")
-    if prepost:
-        print("NOTE: Panel data is pre-post.")
-    else:
-        print("NOTE: Panel data is multi-period panel data.")
+    if verbose:
+        if prepost:
+            print("NOTE: Panel data is pre-post.")
+        else:
+            print("NOTE: Panel data is multi-period panel data.")
     return prepost
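Most hunks in this file follow the same pattern: `NOTE:` messages that used to print unconditionally now sit under `if verbose:`. A small sketch of the resulting behaviour, using a hypothetical stand-in function (`check_prepost` is not part of the package, and treating "two periods" as pre-post is an assumption made only for this example):

```python
import contextlib
import io


def check_prepost(n_periods: int, verbose: bool = True) -> bool:
    """Hypothetical stand-in for a diagnostic such as is_prepost."""
    prepost = n_periods == 2  # assumption: pre-post means exactly two periods
    if verbose:
        print("OK")
        if prepost:
            print("NOTE: Panel data is pre-post.")
        else:
            print("NOTE: Panel data is multi-period panel data.")
    return prepost


def captured_output(n_periods: int, verbose: bool) -> str:
    """Run the check and return everything it wrote to stdout."""
    buffer = io.StringIO()
    with contextlib.redirect_stdout(buffer):
        check_prepost(n_periods, verbose=verbose)
    return buffer.getvalue()


quiet = captured_output(2, verbose=False)  # nothing is printed
loud = captured_output(3, verbose=True)    # status line plus NOTE
```

With `verbose=False` the function stays silent but still returns its result, which is what makes the diagnostics usable inside loops or pipelines without cluttering the console.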
@@ -502,8 +543,8 @@ def is_multiple_treatment_period(
     if verbose:
         print("OK")
-    if units_multiple > 0:
-        print(f"NOTE: There are {units_multiple} observational units with multiple treatment periods with respect to treatment '{treatment_col}'.")
+        if units_multiple > 0:
+            print(f"NOTE: There are {units_multiple} observational units with multiple treatment periods with respect to treatment '{treatment_col}'.")
     return [
         multiple_treatment_period,
@@ -591,12 +632,22 @@ def treatment_times(
             unit_col,
             time_col,
             treatment_col
-            ]
+            ],
+        verbose=verbose
         )
+    is_multiple_treatment_period(
+        data = data,
+        unit_col = unit_col,
+        treatment_col = treatment_col,
+        verbose = verbose
+        )[0]
     if verbose:
         print(f"Identifying treatment times for treatment '{treatment_col}'", end = " ... ")
+    tt = list(unique(data.loc[data[treatment_col] == 1, time_col]))
     units = unique(data[unit_col])
     units_tt = pd.DataFrame(columns = [unit_col, "treatment_min", "treatment_max"])
@@ -628,7 +679,10 @@ def treatment_times(
     if verbose:
         print("OK")
-    return units_tt
+    return [
+        units_tt,
+        tt
+    ]
 def model_wrapper(
     y,
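The hunk above also changes what `treatment_times` returns: a list `[units_tt, tt]` instead of the bare `units_tt` DataFrame. Code written against 2.2.3 that uses the result directly as a DataFrame would therefore need to take element 0. The sketch below uses a hypothetical stand-in (`treatment_times_sketch` is not the package's function); it only mimics the new return shape, and the per-unit min/max aggregation is an assumption about what `treatment_min` and `treatment_max` hold.

```python
import pandas as pd


def treatment_times_sketch(data, unit_col, time_col, treatment_col):
    """Hypothetical stand-in mimicking the new [units_tt, tt] return shape."""
    treated = data.loc[data[treatment_col] == 1]
    # First and last treated period per unit (assumed meaning of the columns).
    units_tt = (
        treated.groupby(unit_col)[time_col]
        .agg(treatment_min="min", treatment_max="max")
        .reset_index()
    )
    # All distinct time points with any treatment.
    tt = sorted(int(x) for x in treated[time_col].unique())
    return [units_tt, tt]


panel = pd.DataFrame({
    "unit": ["A", "A", "A", "B", "B", "B"],
    "t": [1, 2, 3, 1, 2, 3],
    "treated": [0, 1, 1, 0, 0, 1],
})

# New-style call site: unpack both elements ...
units_tt, tt = treatment_times_sketch(panel, "unit", "t", "treated")
# ... or index explicitly, as treatment_time_col does with [1] in the diff.
tt_only = treatment_times_sketch(panel, "unit", "t", "treated")[1]
```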
@@ -833,8 +887,6 @@ def fit_metrics(
         RSQ_ADJ = (1-(1-RSQ)*((observations-1)/(observations-indep_vars_no-1)))
     else:
-        print("NOTE: As no number of independent vars was stated, no Adj. R-Squared is calculated.")
         RSQ_ADJ = np.nan
@@ -854,8 +906,13 @@ def fit_metrics(
     if verbose:
         print("OK")
-    if len(obs_exp_clean) < len(observed) or len(obs_exp_clean) < len(expected):
-        print("NOTE: Vectors 'observed' and/or 'expected' contain NaNs which were dropped.")
+    if verbose:
+        if np.isnan(RSQ_ADJ):
+            print("NOTE: As no number of independent vars was stated, no Adj. R-Squared is calculated.")
+        if len(obs_exp_clean) < len(observed) or len(obs_exp_clean) < len(expected):
+            print("NOTE: Vectors 'observed' and/or 'expected' contain NaNs which were dropped.")
     modelfit_results = [
         model_residuals,
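The Adj. R-Squared note in `fit_metrics` depends on detecting a NaN value. Under IEEE 754, NaN compares unequal to everything, including itself, so an equality test against `np.nan` never fires; a guard of this kind needs `np.isnan` (or `math.isnan`). A short illustration, with `rsq_adj` as a placeholder variable:

```python
import math

import numpy as np

rsq_adj = np.nan  # placeholder for a metric that could not be computed

equal_check = rsq_adj == np.nan    # False: NaN never equals NaN
isnan_check = np.isnan(rsq_adj)    # True
math_check = math.isnan(rsq_adj)   # True, standard-library alternative
```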

{diffindiff-2.2.3 → diffindiff-2.2.5}/diffindiff.egg-info/PKG-INFO RENAMED

@@ -1,12 +1,12 @@
 Metadata-Version: 2.1
 Name: diffindiff
-Version: 2.2.3
+Version: 2.2.5
 Summary: diffindiff: Python library for convenient Difference-in-Differences Analyses
 Author: Thomas Wieland
 Author-email: geowieland@googlemail.com
 Description-Content-Type: text/markdown
-# diffindiff: Difference-in-Differences (DiD) Analysis Python Library
+# diffindiff: A Python library for convenient difference-in-differences analyses
 This Python library is designed for performing Difference-in-Differences (DiD) analyses in a convenient way. It allows users to construct datasets, define treatment and control groups, and set treatment periods. DiD model analyses may be conducted with both datasets created by built-in functions and ready-to-use external datasets. Both simultaneous and staggered adoption are supported. The library allows for various extensions, such as two-way fixed effects models, group- or individual-specific effects, post-treatment periods, and triple-difference estimations. Additionally, it includes functions for visualizing results, such as plotting DiD coefficients with confidence intervals and illustrating the temporal evolution of staggered treatments. Furthermore, several functions for rigorous treatment setting and data diagnostics are incorporated.
@@ -16,9 +16,33 @@ This Python library is designed for performing Difference-in-Differences (DiD) a
 Thomas Wieland [ORCID](https://orcid.org/0000-0001-5168-9846) [EMail](mailto:geowieland@googlemail.com)
-## Updates v2.2.3
-- Bugfixes:
-  - Spillover treatment really works now
+## Availability
+- 📦 PyPI: [diffindiff](https://pypi.org/project/diffindiff/)
+- 💻 GitHub Repository: [diffindiff_official](https://github.com/geowieland/diffindiff_official)
+- 📄 DOI (Zenodo): [10.5281/zenodo.18639559](https://doi.org/10.5281/zenodo.18639559)
+## Citation
+If you use this software, please cite:
+Wieland, T. (2026). diffindiff: A Python library for convenient difference-in-differences analyses (Version 2.2.4) [Computer software]. Zenodo. https://doi.org/10.5281/zenodo.18656820
+## Installation
+To install the package, use `pip`:
+```bash
+pip install diffindiff
+```
+To install the package from GitHub with `pip`:
+```bash
+pip install git+https://github.com/geowieland/diffindiff_official.git
+```
 ## Features
@@ -54,25 +78,6 @@ Thomas Wieland [ORCID](https://orcid.org/0000-0001-5168-9846) [EMail](mailto:geo
   - Test for parallel trend assumption
-## Literature
-  - Baker AC, Larcker DF, Wang CCY (2022) How much should we trust staggered difference-in-differences estimates? *Journal of Financial Economics* 144(2): 370-395. [10.1016/j.jfineco.2022.01.004](https://doi.org/10.1016/j.jfineco.2022.01.004)
-  - Card D, Krueger AB (1994) Minimum Wages and Employment: A Case Study of the Fast Food Industry in New Jersey and Pennsylvania. *The American Economic Review* 84(4): 772-793. [JSTOR](https://www.jstor.org/stable/2677856)
-  - de Haas S, Götz G, Heim S (2022) Measuring the effect of COVID‑19‑related night curfews in a bundled intervention within Germany. *Scientific Reports* 12: 19732. [10.1038/s41598-022-24086-9](https://doi.org/10.1038/s41598-022-24086-9)
-  - Goodman-Bacon A (2021) Difference-in-differences with variation in treatment timing. *Journal of Econometrics* 225(2): 254-277. [10.1016/j.jeconom.2021.03.014](https://doi.org/10.1016/j.jeconom.2021.03.014)
-  - Greene WH (2012) *Econometric Analysis*.
-  - Goldfarb A, Tucker C, Wang Y (2022) Conducting Research in Marketing with Quasi-Experiments. *Journal of Marketing* 86(3): 1-19. [10.1177/00222429221082977](https://doi.org/10.1177/00222429221082977)
-  - Isphording IE, Lipfert M, Pestel N (2021) Does re-opening schools contribute to the spread of SARS-CoV-2? Evidence from staggered summer breaks in Germany. *Journal of Public Economics* 198: 104426. [10.1016/j.jpubeco.2021.104426](https://doi.org/10.1016/j.jpubeco.2021.104426)
-  - Li KT, Luo L, Pattabhiramaiah A (2024) Causal Inference with Quasi-Experimental Data. *IMPACT at JMR* November 13, 2024. [AMA](https://www.ama.org/marketing-news/causal-inference-with-quasi-experimental-data/)
-  - Olden A (2018) What do you buy when no one's watching? The effect of self-service checkouts on the composition of sales in retail. Discussion paper FOR 3/18, Norwegian School of Economics, Norway. [http://hdl.handle.net/11250/2490886](http://hdl.handle.net/11250/2490886)
-  - Olden A, Moen J (2022) The triple difference estimator. *The Econometrics Journal* 25(3): 531-553. [10.1093/ectj/utac010](https://doi.org/10.1093/ectj/utac010)
-  - Strassmann A, Çolak Y, Serra-Burriel M, Nordestgaard BG, Turk A, Afzal S, Puhan MA (2023) Nationwide indoor smoking ban and impact on smoking behaviour and lung function: a two-population natural experiment. *Thorax* 78(2): 144-150. [10.1136/thoraxjnl-2021-218436](https://doi.org/10.1136/thoraxjnl-2021-218436)
-  - Villa JM (2016) diff: Simplifying the estimation of difference-in-differences treatment effects. *The Stata Journal* 16(1): 52-71. [10.1177/1536867X1601600108](https://doi.org/10.1177/1536867X1601600108)
-  - von Bismarck-Osten C, Borusyak K, Schönberg U (2022) The role of schools in transmission of the SARS-CoV-2 virus: quasi-experimental evidence from Germany. *Economic Policy* 37(109): 87–130. [10.1093/epolic/eiac001](https://doi.org/10.1093/epolic/eiac001)
-  - Wieland T (2025) Assessing the effectiveness of non-pharmaceutical interventions in the SARS-CoV-2 pandemic: results of a natural experiment regarding Baden-Württemberg (Germany) and Switzerland in the second infection wave. *Journal of Public Health: From Theory to Practice* 33(11): 2497-2511. [10.1007/s10389-024-02218-x](https://doi.org/10.1007/s10389-024-02218-x)
-  - Wooldridge JM (2012) *Introductory Econometrics. A Modern Approach*.
 ## Examples
 ```python
@@ -143,9 +148,27 @@ curfew_model_withgroups.plot_group_treatment_effects(
 See the /tests directory for usage examples of most of the included functions.
-## Installation
+## Literature
-To install the package, use `pip`:
+  - Baker AC, Larcker DF, Wang CCY (2022) How much should we trust staggered difference-in-differences estimates? *Journal of Financial Economics* 144(2): 370-395. [10.1016/j.jfineco.2022.01.004](https://doi.org/10.1016/j.jfineco.2022.01.004)
+  - Card D, Krueger AB (1994) Minimum Wages and Employment: A Case Study of the Fast Food Industry in New Jersey and Pennsylvania. *The American Economic Review* 84(4): 772-793. [JSTOR](https://www.jstor.org/stable/2677856)
+  - de Haas S, Götz G, Heim S (2022) Measuring the effect of COVID‑19‑related night curfews in a bundled intervention within Germany. *Scientific Reports* 12: 19732. [10.1038/s41598-022-24086-9](https://doi.org/10.1038/s41598-022-24086-9)
+  - Goodman-Bacon A (2021) Difference-in-differences with variation in treatment timing. *Journal of Econometrics* 225(2): 254-277. [10.1016/j.jeconom.2021.03.014](https://doi.org/10.1016/j.jeconom.2021.03.014)
+  - Greene WH (2012) *Econometric Analysis*.
+  - Goldfarb A, Tucker C, Wang Y (2022) Conducting Research in Marketing with Quasi-Experiments. *Journal of Marketing* 86(3): 1-19. [10.1177/00222429221082977](https://doi.org/10.1177/00222429221082977)
+  - Isphording IE, Lipfert M, Pestel N (2021) Does re-opening schools contribute to the spread of SARS-CoV-2? Evidence from staggered summer breaks in Germany. *Journal of Public Economics* 198: 104426. [10.1016/j.jpubeco.2021.104426](https://doi.org/10.1016/j.jpubeco.2021.104426)
+  - Li KT, Luo L, Pattabhiramaiah A (2024) Causal Inference with Quasi-Experimental Data. *IMPACT at JMR* November 13, 2024. [AMA](https://www.ama.org/marketing-news/causal-inference-with-quasi-experimental-data/)
+  - Olden A (2018) What do you buy when no one's watching? The effect of self-service checkouts on the composition of sales in retail. Discussion paper FOR 3/18, Norwegian School of Economics, Norway. [http://hdl.handle.net/11250/2490886](http://hdl.handle.net/11250/2490886)
+  - Olden A, Moen J (2022) The triple difference estimator. *The Econometrics Journal* 25(3): 531-553. [10.1093/ectj/utac010](https://doi.org/10.1093/ectj/utac010)
+  - Strassmann A, Çolak Y, Serra-Burriel M, Nordestgaard BG, Turk A, Afzal S, Puhan MA (2023) Nationwide indoor smoking ban and impact on smoking behaviour and lung function: a two-population natural experiment. *Thorax* 78(2): 144-150. [10.1136/thoraxjnl-2021-218436](https://doi.org/10.1136/thoraxjnl-2021-218436)
+  - Villa JM (2016) diff: Simplifying the estimation of difference-in-differences treatment effects. *The Stata Journal* 16(1): 52-71. [10.1177/1536867X1601600108](https://doi.org/10.1177/1536867X1601600108)
+  - von Bismarck-Osten C, Borusyak K, Schönberg U (2022) The role of schools in transmission of the SARS-CoV-2 virus: quasi-experimental evidence from Germany. *Economic Policy* 37(109): 87–130. [10.1093/epolic/eiac001](https://doi.org/10.1093/epolic/eiac001)
+  - Wieland T (2025) Assessing the effectiveness of non-pharmaceutical interventions in the SARS-CoV-2 pandemic: results of a natural experiment regarding Baden-Württemberg (Germany) and Switzerland in the second infection wave. *Journal of Public Health: From Theory to Practice* 33(11): 2497-2511. [10.1007/s10389-024-02218-x](https://doi.org/10.1007/s10389-024-02218-x)
+  - Wooldridge JM (2012) *Introductory Econometrics. A Modern Approach*.
-```bash
-pip install diffindiff
+## What's new (v2.2.5)
+- Bugfixes:
+  - Incorrect import
+- Other:
+  - Update README

{diffindiff-2.2.3 → diffindiff-2.2.5}/setup.py RENAMED

@@ -1,6 +1,5 @@
 from setuptools import setup, find_packages
 import os
-import diffindiff.config as config
 def read_README():
     with open(os.path.join(os.path.dirname(__file__), 'README.md'), encoding='utf-8') as f:
@@ -8,7 +7,7 @@ def read_README():
 setup(
     name='diffindiff',
-    version='2.2.3',
+    version='2.2.5',
     description='diffindiff: Python library for convenient Difference-in-Differences Analyses',
     packages=find_packages(include=["diffindiff", "diffindiff.tests"]),
     include_package_data=True,