Bob C-J and Geoff Cumming's Avatar

Bob C-J and Geoff Cumming

@thenewstats

Open science, estimation statistics, and random thoughts from Bob Calin-Jageman and Geoff Cumming. https://thenewstatistics.com/itns/

336
Followers
194
Following
911
Posts
09.11.2023
Joined
Posts Following

Latest posts by Bob C-J and Geoff Cumming @thenewstats

Post image

doi.org/10.1111/jep....
Statistical reform spreads around the world and across disciplines: Beyond the p Value Dichotomy: Alternatives for Statistical Inferenceβ€”A Critical Review

09.03.2026 09:29 πŸ‘ 2 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0
Post image

thenewstatistics.com/itns/2026/03...
We want the best research methods–of course–but should also consider philosophy, ethics, global heating and health.

09.03.2026 07:45 πŸ‘ 0 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

If you have an idea for a piece for this series, let me know!

26.02.2026 01:29 πŸ‘ 1 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0
Improving Your Neuroscience | eNeuro

Check out the entire collection of Improving Your Neuroscience here: www.eneuro.org/collection/i...

Great papers for you and your trainees on confirmation bias, dealing with non-normal data, teaching rigor, using simulations to plan research and more.

26.02.2026 01:29 πŸ‘ 0 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0
Preview
Experimental Designs for Preclinical Neuroscience Experiments: Part Iβ€”Design Basics Rigorous, statistically grounded experimental design is central to ethical and effective animal research. Foundational principles for statistically based Design of Experiments (DOE) were established o...

The eNeuro "Improving Your Neuroscience" series continues with a two-part primer on experimental design and blocking from Penelope Reynolds. These new guides will help you up your design game. #stats #neuroscience

www.eneuro.org/content/13/2...

www.eneuro.org/content/13/2...

26.02.2026 01:29 πŸ‘ 0 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

3 of the researchers were participants... I know I wouldn't have been able to resist, either! Suit me up with my custom-printed barn-owl ears, please!

12.02.2026 18:41 πŸ‘ 1 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

Why did they do this? To find out if the human auditory system is flexible enough to learn to localize sounds vertically, the way barn owls do. The answer: partly! But honestly, I'm just excited that they *did* this.

12.02.2026 18:41 πŸ‘ 1 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0
Figure from a research paper showing custom-printed barn-owl molds fitted to a human research participant.  Don't worry if you can't see them -- they are not nearly as cool as the phrase 'barn-owl ears' would have you expecting.

Figure from a research paper showing custom-printed barn-owl molds fitted to a human research participant. Don't worry if you can't see them -- they are not nearly as cool as the phrase 'barn-owl ears' would have you expecting.

Scientists created prosthetic barn-owl ears and got human participants to wear them for 3-59 days.

Yes, people walked around with barn-owl ears for up to 2 months *for science*.

This is the type of news I need today to restore a bit of my faith in humanity.

www.biorxiv.org/content/10.6...

12.02.2026 18:41 πŸ‘ 2 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

Because traditional NHST doesn't allow the skeptic to win, we've maligned the need to do so: uninteresting, just mistakes, can't prove a negative, etc. But this is just rationalizing our way into accepting the limitations of the tool. Fruitful science must be able to rule in and rule out effects!

12.02.2026 18:33 πŸ‘ 0 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

"near 0" findings have been just essential for science as "not 0" findings. Pasteur, for example, showed that quantity of bacteria formed under sterile conditions is near 0, disproving the theory of spontaneous generation.

12.02.2026 18:33 πŸ‘ 1 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

So, just tests with results that we like?

I think you are confusing "effect is near 0" with uninteresting. Not true! Near 0 is interesting (and vital to publish) when a leading theory predicted an effect. It's interesting (and vital) to producing parsimonious models.

12.02.2026 18:33 πŸ‘ 1 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

Personally, I think we need all valid studies published. What we need (I think) is to think more critically about quality/validityβ€” it is not about p value obtained or even ci widthβ€” it is evaluated from construct validity, positive and negative controls, design features, overlapping evidence, etc.

12.02.2026 14:44 πŸ‘ 0 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

Can you clarify this a bit? Meaning, you don’t think wide cis (unclear) should be published but narrow ones (clear) should be? Or something else?

12.02.2026 14:40 πŸ‘ 0 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

A valid testing regime has the possible outcomes: yes, no, unclear. It makes no sense for a scientific literature to have only β€œyes” results. We support this regime with jaundiced ideas about null results that on reflection are poor epistemology.

12.02.2026 13:32 πŸ‘ 0 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

I really don’t understand the jaundiced view of validly estimating no effect (β€œnull results”). Don’t detectives work by ruling out suspects? Isn’t the greater part of learning weakening unhelpful associations? Don’t we accept Popper’s view that science advances through falsification?

12.02.2026 13:26 πŸ‘ 0 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

You run a study because you believe z is an important factor, your results estimate is is negligible.. that is new and important knowledge. Theory A and Theory B make contrasting predictions (large effect, no effect), data estimating a negligible difference helps arbitrate between these.

12.02.2026 13:20 πŸ‘ 0 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

Judgement of quality/validity should not be based on if we like the results. Crap methods/ideas can diminish effects leading to p > .05 or inflate effects producing p < .05. Many studies do belong in the file drawer, but that p value generated is not a valid guide.

12.02.2026 13:14 πŸ‘ 0 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

Sterling, 1959 updated. We still aren’t sharing science in a scientific manner. With 65 years of knowing the problem, how are we still struggling to address it?

11.02.2026 17:45 πŸ‘ 3 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

I haven’t! Seems like a cool idea, though likely a good bit harder than p, df, and test stat correspondence.. but would be cool!

29.01.2026 15:34 πŸ‘ 1 πŸ” 1 πŸ’¬ 0 πŸ“Œ 0
GitHub - ACCLAB/dabestr: Data Analysis with Bootstrap Estimation in R Data Analysis with Bootstrap Estimation in R. Contribute to ACCLAB/dabestr development by creating an account on GitHub.

For Python: github.com/ACCLAB/dabestr

For R: github.com/ACCLAB/DABES...

29.01.2026 15:08 πŸ‘ 0 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0
Getting over ANOVA: Estimation graphics for multi-group comparisons Data analysis in experimental science mainly relies on null-hypothesis significance testing, despite its well-known limitations. A powerful alternative is estimation statistics, which focuses on effect-size quantification. However, current estimation tools struggle with the complex, multi-group comparisons common in biological research. Here we introduce DABEST 2.0, an estimation framework for complex experimental designs, including shared-control, repeated-measures, two-way factorial experiments, and meta-analysis of replicates. ### Competing Interest Statement The authors have declared no competing interest.

So excited to see DaBest 2.0 is out: get bootstrapped estimation statistics for simple through complex designs, all with beautiful visualization, available in R and Python.

Check it out!

Pre-print describing new features for complex designs: www.biorxiv.org/content/10.6...

#stats

29.01.2026 15:08 πŸ‘ 4 πŸ” 1 πŸ’¬ 2 πŸ“Œ 0
Preview
The Challenges and Solutions of In‐Person Big‐Team Science Big-Team Science (BTS) offers a powerful framework for advancing psychological research through large-scale collaboration, yet the unique challenges of conducting in-person BTS studies remain under-e....

Looking forward to reading this retrospective on the promise and perils of organizing an RRR with in-person data collected.

From: @aggieerin.bsky.social , @psforscher.bsky.social , and others.

#stats

doi.org/10.1111/spc3...

26.01.2026 01:42 πŸ‘ 3 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0
Preview
RegCheck RegCheck is an AI tool to compare preregistrations with papers instantly.

Comparing registrations to published papers is essential to research integrity - and almost no one does it routinely because it's slow, messy, and time-demanding.

RegCheck was built to help make this process easier.

Today, we launch RegCheck V2.

🧡

regcheck.app

22.01.2026 11:05 πŸ‘ 174 πŸ” 90 πŸ’¬ 8 πŸ“Œ 6
Preview
esci: Estimation Statistics with Confidence Intervals A collection of functions and 'jamovi' module for the estimation approach to inferential statistics, the approach which emphasizes effect sizes, interval estimates, and meta-analysis. Nearly all funct...

esci 1.0.9 is now on CRAN.

No big changes, just a couple of bug fixes and some compatibility changes for statpsych 1.9. Still a great easy way to get effect sizes and fair tests for many common designs.

cran.r-project.org/web/packages...

#stats

16.01.2026 23:58 πŸ‘ 5 πŸ” 2 πŸ’¬ 0 πŸ“Œ 0
Post image

thenewstatistics.com/itns/2026/01...
A statistics textbook for the AI era

16.01.2026 22:44 πŸ‘ 1 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0
Post image

thenewstatistics.com/itns/2026/01...

16.01.2026 22:40 πŸ‘ 2 πŸ” 0 πŸ’¬ 0 πŸ“Œ 1
Post image

The Faculty for Undergraduate Teaching Workshop is back this summer, July16-19, right outside of Chicago. A great conference with great people. Registration and abstract submission are open… it is going to be great!

#neuroscience

www.funfaculty.org/conference_2...

15.01.2026 21:50 πŸ‘ 1 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

Proud to have played a small part in this as a data-collection site. And so proud of @luis-a-gomez.bsky.social , who organized this project at our university as an undegrad and has launched to great success in a clinical psych PhD program at Purdue.

14.01.2026 14:31 πŸ‘ 1 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0
The stereotype-threat effect among
women was virtually null and not significant (N = 1275, g = 0.04, SE = 0.08, 95% CI [-0.11; 0.20]), and considerably smaller than the original study (N = 45, d = -0.82; 95% CI [-1.45; -0.18]).

The stereotype-threat effect among women was virtually null and not significant (N = 1275, g = 0.04, SE = 0.08, 95% CI [-0.11; 0.20]), and considerably smaller than the original study (N = 45, d = -0.82; 95% CI [-1.45; -0.18]).

Another threat to stereotype threat?

An RRR (in press, AMPPS) shows the ~0 impact of stereotype threat on female math performance.

Fantastic, arduous work from leads Andrea Stoevenbelt, Paulette C. Flore, and Jelte Wicherts (and DU alum @luis-a-gomez.bsky.social) #stats

osf.io/preprints/ps...

14.01.2026 14:31 πŸ‘ 2 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

And, of course, each barrier raised to junk will have some false positives or harms, so the more content-based protections added, the marginally worse off the good-faith scientists.

Maybe we need some type of reputational scoring system for individuals, institutions, and journals.

14.01.2026 14:13 πŸ‘ 0 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0