Miha Gazvoda's Avatar

Miha Gazvoda

@mihagazvoda

data scientist @ http://booking.com | blog @ http://mihagazvoda.com | outdoor sports | priors my own

31
Followers
71
Following
8
Posts
21.11.2024
Joined
Posts Following

Latest posts by Miha Gazvoda @mihagazvoda

If your frequentist analysis gives the same result as a Bayesian analysis, that means nothing for either paradigm. No frequentist is bothered that Bayes "works" - they use Bayesian calculations all the time. No Bayesian is bothered that a freq estimator agrees - it's interpretation that is doubted.

04.06.2025 06:30 πŸ‘ 58 πŸ” 9 πŸ’¬ 6 πŸ“Œ 0

While these insights likely won’t differ drastically across businesses, performing such analysis with your own experiments can help clarify the (implicit) trade-offs you're making.

mihagazvoda.com/files/nhst-r...

6/6

06.05.2025 19:40 πŸ‘ 0 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

To address these, a useful alternative I see is applying a shrinkage estimator based on past experiments (and experiment-specific details), combined with explicit cost-benefit analysis. As @statmodeling.bsky.social puts it, let’s make decisions in real-world units: dollars, customers.

5/6

06.05.2025 19:40 πŸ‘ 0 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

* Sacrifices business impact by excessively focusing on minimizing losses at the expense of gains.
* Misleads learnings due to exaggerated significant effect estimates, poor coverage, and low replication ratesβ€”though the estimated direction typically remains correct.

4/6

06.05.2025 19:40 πŸ‘ 0 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

Adopting this approach reveals that, under realistic assumptions, NHST with conventional error rates often:

3/6

06.05.2025 19:40 πŸ‘ 0 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

I highlight how NHST’s effectiveness is often explained using an unrealistic scenarioβ€”pretending effects are strictly either β€œnull” or β€œtrue.” Instead, I suggest performing a meta-analysis of past experiments to estimate the distribution of unobserved true effects.

2/6

06.05.2025 19:40 πŸ‘ 1 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

I’ve written a personal-view article, β€œA Reality Check on Null Hypothesis Significance Testing in A/B Testing", soon appearing in Eppo’s Outperform magazine (print).

1/6

06.05.2025 19:40 πŸ‘ 3 πŸ” 1 πŸ’¬ 1 πŸ“Œ 0

Reminder that the blog is here too!

@statmodeling.bsky.social

Please repost πŸ™

11.11.2024 14:57 πŸ‘ 10 πŸ” 9 πŸ’¬ 0 πŸ“Œ 0

I like this paper about CI interpretation: pure.rug.nl/ws/portalfil...

14.01.2025 18:55 πŸ‘ 1 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0
Preview
Why we (usually) don't have to worry about multiple comparisons Applied researchers often find themselves making statistical inferences in settings that would seem to require multiple comparisons adjustments. We challenge the Type I error paradigm that underlies t...

Since it is inefficient to attempt to educate every reviewer individually, I am yeeting into your feed this clear paper from Gelman, Hill, and Yajima on how Bayesians can do even better than correcting for multiple comparisons. arxiv.org/abs/0907.2478

26.11.2024 09:11 πŸ‘ 298 πŸ” 66 πŸ’¬ 17 πŸ“Œ 5

Gelman, Hill, and Vehtari in Regression and Other Stories suggest fitting a regression with the same variables as were used for matching/IPTW.

28.11.2024 21:18 πŸ‘ 3 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0