Miha Gazvoda (@mihagazvoda)

If your frequentist analysis gives the same result as a Bayesian analysis, that means nothing for either paradigm. No frequentist is bothered that Bayes "works" - they use Bayesian calculations all the time. No Bayesian is bothered that a freq estimator agrees - it's interpretation that is doubted.

04.06.2025 06:30 👍 58 🔁 9 💬 6 📌 0

While these insights likely won’t differ drastically across businesses, performing such analysis with your own experiments can help clarify the (implicit) trade-offs you're making.

mihagazvoda.com/files/nhst-r...

6/6

06.05.2025 19:40 👍 0 🔁 0 💬 0 📌 0

To address these, a useful alternative I see is applying a shrinkage estimator based on past experiments (and experiment-specific details), combined with explicit cost-benefit analysis. As @statmodeling.bsky.social puts it, let’s make decisions in real-world units: dollars, customers.

5/6

06.05.2025 19:40 👍 0 🔁 0 💬 1 📌 0

* Sacrifices business impact by excessively focusing on minimizing losses at the expense of gains.
* Misleads learnings due to exaggerated significant effect estimates, poor coverage, and low replication rates—though the estimated direction typically remains correct.

4/6

06.05.2025 19:40 👍 0 🔁 0 💬 1 📌 0

Adopting this approach reveals that, under realistic assumptions, NHST with conventional error rates often:

3/6

06.05.2025 19:40 👍 0 🔁 0 💬 1 📌 0

I highlight how NHST’s effectiveness is often explained using an unrealistic scenario—pretending effects are strictly either “null” or “true.” Instead, I suggest performing a meta-analysis of past experiments to estimate the distribution of unobserved true effects.

2/6

06.05.2025 19:40 👍 1 🔁 0 💬 1 📌 0

I’ve written a personal-view article, “A Reality Check on Null Hypothesis Significance Testing in A/B Testing", soon appearing in Eppo’s Outperform magazine (print).

1/6

06.05.2025 19:40 👍 3 🔁 1 💬 1 📌 0

Reminder that the blog is here too!

@statmodeling.bsky.social

Please repost 🙏

11.11.2024 14:57 👍 10 🔁 9 💬 0 📌 0

I like this paper about CI interpretation: pure.rug.nl/ws/portalfil...

14.01.2025 18:55 👍 1 🔁 0 💬 0 📌 0

Why we (usually) don't have to worry about multiple comparisons Applied researchers often find themselves making statistical inferences in settings that would seem to require multiple comparisons adjustments. We challenge the Type I error paradigm that underlies t...

Since it is inefficient to attempt to educate every reviewer individually, I am yeeting into your feed this clear paper from Gelman, Hill, and Yajima on how Bayesians can do even better than correcting for multiple comparisons. arxiv.org/abs/0907.2478

26.11.2024 09:11 👍 298 🔁 66 💬 17 📌 5

Gelman, Hill, and Vehtari in Regression and Other Stories suggest fitting a regression with the same variables as were used for matching/IPTW.

28.11.2024 21:18 👍 3 🔁 0 💬 1 📌 0

Miha Gazvoda

Latest posts by Miha Gazvoda @mihagazvoda