Testing for underpowered literatures (Job Market Paper)
Scheduled to be presented at the BITSS Annual Meeting, March 2024
How many experimental studies would have reached different conclusions had they been run on larger samples? I show how to estimate the expected number of statistically significant results that a set of experiments would have reported had their sample sizes all been counterfactually increased by a chosen factor. The estimator is consistent and asymptotically normal. Unlike existing methods, my approach requires no assumptions about the distribution of true intervention effects beyond continuity, and it adjusts for publication bias in the reported t-scores. An application to randomized controlled trials (RCTs) published in top economics journals finds that doubling every experiment's sample size would increase the power of two-sided t-tests by only 7.8 percentage points on average. This increase is small and comparable to that found for systematic replication projects in laboratory psychology, where earlier studies enabled accurate ex-ante power calculations; both are smaller than the corresponding increase for non-RCTs. This comparison suggests that RCTs are, on average, relatively insensitive to sample size increases. The policy implication is that grant-makers should generally fund more experiments rather than fewer, larger ones.
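The mechanics behind the counterfactual can be illustrated with a minimal sketch. Because a t-score's noncentrality grows with the square root of the sample size, scaling every experiment's n by a factor f scales the true t-score by sqrt(f). The function below (a hypothetical name, not the paper's estimator, which additionally handles the unknown distribution of true effects and publication bias) computes the power of a two-sided test for a known true t-score under such a rescaling:

```python
import math

def _phi(x):
    # Standard normal CDF via the error function.
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))

def counterfactual_power(t_true, factor, crit=1.96):
    """Power of a two-sided z-test at the 5% level when the sample size
    is multiplied by `factor`. The true t-score (noncentrality) scales
    with sqrt(n), so it is multiplied by sqrt(factor)."""
    nc = t_true * math.sqrt(factor)
    # P(reject) = P(Z > crit - nc) + P(Z < -crit - nc)
    return (1.0 - _phi(crit - nc)) + _phi(-crit - nc)
```

For example, a study whose true t-score sits exactly at the critical value has power of roughly one half, and doubling its sample size (`factor=2`) raises that power; averaging such gains over a literature is the quantity the paper estimates.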
We study the problem of estimating the average causal effect of treating every member of a population, rather than none, using an experiment that treats only some. This is the policy-relevant estimand when deciding whether to scale up an intervention based on the results of an RCT, for example, but it differs from the usual average treatment effect in the presence of spillovers. We derive the optimal rate of convergence to the average global effect over all estimators that are linear in the outcomes and all cluster-randomized designs, and we provide estimators and experimental designs that achieve this rate. We also provide an optimized weighting approach that minimizes mean squared error when a linearity assumption holds while remaining consistent and rate-optimal when it does not, as well as methods for inference. Arxiv
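To fix ideas on the estimand, a minimal baseline (not the paper's rate-optimal estimator, and with a hypothetical function name) is the difference in means under cluster randomization: if spillovers stay within clusters, units in a fully treated cluster experience "everyone treated" and units in a control cluster experience "no one treated," so contrasting the two groups targets the average global effect.

```python
def cluster_diff_in_means(outcomes, cluster_ids, treated_clusters):
    """Baseline estimate of the average global effect under cluster
    randomization: mean outcome across units in fully treated clusters
    minus mean outcome across units in control clusters. Valid as a
    simple benchmark when spillovers are contained within clusters."""
    treated = [y for y, g in zip(outcomes, cluster_ids) if g in treated_clusters]
    control = [y for y, g in zip(outcomes, cluster_ids) if g not in treated_clusters]
    return sum(treated) / len(treated) - sum(control) / len(control)

# Toy usage: two clusters, cluster "a" assigned to treatment.
estimate = cluster_diff_in_means(
    outcomes=[1.0, 1.2, 0.1, -0.1],
    cluster_ids=["a", "a", "b", "b"],
    treated_clusters={"a"},
)
```

The paper's contribution is to go beyond this benchmark: characterizing the best achievable convergence rate over all linear estimators and cluster-randomized designs, and constructing estimators and designs that attain it.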
Social Effects, Spillovers, and Scale-up of Teacher Training in Uganda: an RCT (with Vesall Nourani, Moustafa El-Kashlan, and Sara Tamayo)
While nearly half of Ugandan schoolchildren enter secondary school, fewer than 10% complete it. Low teaching quality may be a factor. We study the effects and spillovers of training secondary school teachers in rural Uganda with an RCT. Teachers were randomly assigned to an innovative training program run by Kimanya-Ngeyo beginning in November 2021, and training is ongoing in waves. Our design allows us to study teacher-to-teacher spillovers over time: half of treated schools were randomly assigned to treat teachers in "cliques," in which the treated teachers know each other well, and the other half to treat teachers in "anti-cliques," in which the treated teachers do not know each other well. AEA Registration here.