Ben Bolker's Avatar

Ben Bolker

@bbolker

Ecology, evolution, epidemiology, statistics (mixed models). McMaster University, Hamilton, Ontario https://math.mcmaster.ca/bolker

2,247
Followers
596
Following
809
Posts
07.10.2023
Joined
Posts Following

Latest posts by Ben Bolker @bbolker

πŸŽ‰πŸŽ‰πŸŽ‰ Long-awaited: lme4 2.0-1 now handles some simple structured covariance matrices: diagonal ['true' - not just semantic splitting of `(x+y|f)` into `(x|f)` + `(y|f)`], compound symmetric, AR1 (no GPs/factor-analytic/reduced-rank/etc. yet ...)

05.03.2026 21:55 πŸ‘ 12 πŸ” 3 πŸ’¬ 0 πŸ“Œ 0
BMB peeves

it's on my "peeves" list too: bbolker.github.io/bbmisc/peeve...

05.03.2026 21:51 πŸ‘ 5 πŸ” 1 πŸ’¬ 1 πŸ“Œ 0
couverture du livre introduction Γ  la statistique bayΓ©sienne avec R

couverture du livre introduction Γ  la statistique bayΓ©sienne avec R

πŸ“˜ Nouveau livre !

Introduction Γ  la statistique bayΓ©sienne avec le logiciel R

Découvrez pas à pas la statistique bayésienne et sa mise en œuvre dans R : théorème de Bayes, MCMC, priors, régression, GLM/GLMM et comparaison/validation de modèles.

Aux Γ©ditions Quae πŸ™

πŸ“· CrΓ©dit photo : Yann Raulet

05.03.2026 15:32 πŸ‘ 45 πŸ” 16 πŸ’¬ 2 πŸ“Œ 2

What's your Erd"os-Bacon-Sabbath number? (Is it finite?)

05.03.2026 12:33 πŸ‘ 2 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0
MCMC Interactive Gallery

Maybe not "statistical concept", but this is fun for MCMC algorithms chi-feng.github.io/mcmc-demo/ap... (references at chi-feng.github.io/mcmc-demo/)

05.03.2026 00:04 πŸ‘ 11 πŸ” 3 πŸ’¬ 1 πŸ“Œ 0
Preview
MixedModels.jl A Julia package for fitting (statistical) mixed-effects models

A quick plug for juliapackages.com/p/mixedmodels (by Doug Bates and phillipalday.com -- similar capabilities to lme4 but **much** faster

04.03.2026 23:32 πŸ‘ 14 πŸ” 5 πŸ’¬ 1 πŸ“Œ 0
Preview
What happens if the explanatory and response variables are sorted independently before regression? Suppose we have data set $(X_i,Y_i)$ with $n$ points. We want to perform a linear regression, but first we sort the $X_i$ values and the $Y_i$ values independently of each other, forming data set $...

For those who haven't yet seen this classic: stats.stackexchange.com/q/185507/2126 (someone whose manager was insisting that they do this ...)

04.03.2026 13:46 πŸ‘ 33 πŸ” 5 πŸ’¬ 1 πŸ“Œ 4
The Bayesian results imply much higher risk of early collapse than maximum likelihood methods. This difference is due to large probabilities of early collapse for certain parameter values that are plausible in light of the data. Because of simplifying assumptions, these results are not directly applicable to assessment. Nevertheless they imply that maximum likelihood and similar methods based upon point parameter estimates will grossly underestimate the risk of early collapse.

The Bayesian results imply much higher risk of early collapse than maximum likelihood methods. This difference is due to large probabilities of early collapse for certain parameter values that are plausible in light of the data. Because of simplifying assumptions, these results are not directly applicable to assessment. Nevertheless they imply that maximum likelihood and similar methods based upon point parameter estimates will grossly underestimate the risk of early collapse.

This is decision-theoretic rather than stats only, but: Ludwig, Donald. β€œUncertainty and the Assessment of Extinction Probabilities.” Ecological Applications 6, no. 4 (1996): 1067–76. doi.org/10.2307/2269....

03.03.2026 15:52 πŸ‘ 1 πŸ” 1 πŸ’¬ 0 πŸ“Œ 0

My thoughts on this aren't fully cooked, but: this presupposes that non-null results are more interesting than null results (which is mostly true given the way most scientists set up hypotheses but doesn't have to be). Is there a bias-interestingness tradeoff dial we could adjust?

02.03.2026 20:28 πŸ‘ 1 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

complete separation?

02.03.2026 14:33 πŸ‘ 1 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

It's good you asked, since I got to go down a rabbit hole/didn't know about him before.

02.03.2026 00:05 πŸ‘ 2 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0
Preview
Ernest Everett Just - Wikipedia

I think your AI overlords are bullshitting/hallucinating. See the other part of thread on en.wikipedia.org/wiki/Ernest_... (who sounds like a true badass BTW ...) [off-topic: is there something I can read to update my mental model of BSky threading, which I don't get at all?]

02.03.2026 00:04 πŸ‘ 2 πŸ” 1 πŸ’¬ 2 πŸ“Œ 0

Sorry, missed alt-text on the second screenshot: "As Just remarked in the symposium this morning, he is interested more in the back than in the bristles on the back and more in eyes than in eye color"

01.03.2026 20:49 πŸ‘ 2 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0
"Embryologist E. E. Just complained that genetics and selection could explain why populations of flies had more or fewer bristles on their backs, but it couldn't explain how a fly constructed its back in the first place (Harrison 1937: 372; Gilbert et al 1996: 361)

"Embryologist E. E. Just complained that genetics and selection could explain why populations of flies had more or fewer bristles on their backs, but it couldn't explain how a fly constructed its back in the first place (Harrison 1937: 372; Gilbert et al 1996: 361)

Post image

Quote is attributed from Amundson R (2005) The Changing role of the embryo in evolutionary thought: Roots of Evo-Devo.
Cambridge University Press, Cambridge (Google books screenshot β†’ Harrison, Ross G. 1937. β€œEmbryology and Its Relations.” Science 85 (2207): 369–74 (JSTOR screenshot)

01.03.2026 20:48 πŸ‘ 4 πŸ” 1 πŸ’¬ 2 πŸ“Œ 0

Think this is what you want? philarchive.org/archive/LIAT... quotes: "I am interested more in the fly’s back than the bristles on its back, and more in its eye than its eye color (E.E. Just)" . Paper has more details, plus see: en.wikipedia.org/wiki/Ernest_...

01.03.2026 20:40 πŸ‘ 3 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

I get the idea ... I thought the people I tagged might have an idea about the source (both fly people, although I think more on the evo of morphology side than strictly evo-devo ... [my characterization, not theirs])

01.03.2026 19:37 πŸ‘ 2 πŸ” 0 πŸ’¬ 4 πŸ“Œ 0

@idworkin.bsky.social ? @thelonglab.bsky.social ?

01.03.2026 19:14 πŸ‘ 1 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0
Thinking inside the box Dirk Eddelbuettel, R, C++, Rcpp

"newbies" check (pkgs w/o prior releases on CRAN) have a particularly fussy human-administered set of checks (e.g. do all functions have explicitly documented return values?) Also, things like spell-check false positives (can be fixed via dirk.eddelbuettel.com/blog/2017/08... )

01.03.2026 17:54 πŸ‘ 1 πŸ” 1 πŸ’¬ 1 πŸ“Œ 0
Canada’s Fentanyl Czar - Privy Council Office - Canada.ca

don't forget the πŸ‡¨πŸ‡¦ fentanyl czar! www.canada.ca/en/privy-cou... (not really Canada's idea ...)

01.03.2026 00:01 πŸ‘ 6 πŸ” 0 πŸ’¬ 2 πŸ“Œ 0

... in the bottom of a locked file cabinet in a disused lavatory with a sign on the door saying "Beware of Leopard" ...

28.02.2026 00:24 πŸ‘ 4 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0
results of `apropos("R.?[Vv]ersion", ignore.case=FALSE)`: c("getRversion", "R.version", "R.Version", "R.version.string")

results of `apropos("R.?[Vv]ersion", ignore.case=FALSE)`: c("getRversion", "R.version", "R.Version", "R.version.string")

argh.

28.02.2026 00:18 πŸ‘ 4 πŸ” 1 πŸ’¬ 1 πŸ“Œ 0
Preview
Request: function to compare lockfiles Β· Issue #1736 Β· rstudio/renv I would be interested in having a function that would be able to take two lockfiles and present a report of packages that are added, removed, upgraded, and downgraded. I believe that the functional...

While we're here, the other tool I'd love (if the universe would magically grant it to me) would be a lockfile/sessionInfo "diff" utility ... (see github.com/rstudio/renv... )

27.02.2026 00:00 πŸ‘ 2 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

Fun that the papal bulls were alphabetized by author, *all under "P"* (e.g. "Pope Benedict XVI" not "Benedict XVI"). My admittedly ancient Chicago Manual of Style (13th ed., 1982) says only (rule 18.76) "Monarchs & popes should be listed accorded to their 'official', not personal, names" ...

26.02.2026 23:28 πŸ‘ 3 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

I agree that it doesn't (that I know of) exist. It would be tempting to write it oneself, or possibly to vibe-code it: using `renv` lockfile format as a target seems sensible (since then you get the "now install all this stuff" functionality for free from `renv`)

26.02.2026 20:01 πŸ‘ 0 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0
Preview
projection/prediction of NSERC Discovery award dates projection/prediction of NSERC Discovery award dates - nserc_dates.R

niche (pronounced "neesh" πŸ‡¨πŸ‡¦) question: can anyone tell me the dates when NSERC Discovery Grant results were announced (by university research offices to applicants) in 2024 and 2025? I want to update my predictive model gist.github.com/bbolker/ce64... for this year ...

26.02.2026 00:37 πŸ‘ 3 πŸ” 0 πŸ’¬ 2 πŸ“Œ 0

be careful, we're sliding toward the leading-comma debate ...

25.02.2026 19:09 πŸ‘ 2 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0
Circle 4
Over-Vectorizing
We skirted past Plutus, the fierce wolf with a swollen face, down into the fourth Circle. Here we found the lustful.

It is a good thing to want to vectorize when there is no effective way to do so. It is a bad thing to attempt it anyway. A common reflex is to use a function in the apply family. This is not vectorization, it is loop-hiding. The apply function has a for loop in its definition. The lapply function buries the loop, but execution times tend to be roughly equal to an explicit for loop. (Confusion over this is understandable, as there
is a significant difference in execution speed with at least some versions of S+.) Table 4.1 summarizes the uses of the apply family of functions.
Base your decision of using an apply function on Uwe’s Maxim (page 20).
The issue is of human time rather than silicon chip time. Human time can be wasted by taking longer to write the code, and (often much more importantly) by taking more time to understand subsequently what it does

Circle 4 Over-Vectorizing We skirted past Plutus, the fierce wolf with a swollen face, down into the fourth Circle. Here we found the lustful. It is a good thing to want to vectorize when there is no effective way to do so. It is a bad thing to attempt it anyway. A common reflex is to use a function in the apply family. This is not vectorization, it is loop-hiding. The apply function has a for loop in its definition. The lapply function buries the loop, but execution times tend to be roughly equal to an explicit for loop. (Confusion over this is understandable, as there is a significant difference in execution speed with at least some versions of S+.) Table 4.1 summarizes the uses of the apply family of functions. Base your decision of using an apply function on Uwe’s Maxim (page 20). The issue is of human time rather than silicon chip time. Human time can be wasted by taking longer to write the code, and (often much more importantly) by taking more time to understand subsequently what it does

www.burns-stat.com/pages/Tutor/...

25.02.2026 13:50 πŸ‘ 15 πŸ” 1 πŸ’¬ 0 πŸ“Œ 0

I guess in a pinch I could download the zenodo repo and re-build the paper, but that seems like a hassle (and maybe there's a reason the preprint disappeared?)

24.02.2026 20:38 πŸ‘ 1 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0
Code and data for: Composable probabilistic models can lower barriers to rigorous infectious disease modelling Clearance version of the draft

I'm getting 404 errors for the paper/paper site links, can't find anything on github.com/EpiAware . I can still get to the Zenodo site zenodo.org/doi/10.5281/... and the talk www.youtube.com/watch?v=FQYO... but would like to be able to share the pre-pre-print if it's still available?

24.02.2026 20:36 πŸ‘ 1 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

But why would it matter if AT&T got angry? The only AT&T code in COPYRIGHTS is some stuff in stats/src/loess[cf].[cf] ... John Chambers's (only AT&T/Bell Labs R-core member?) contributions were presumably independent of his employment there ...

24.02.2026 20:00 πŸ‘ 0 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0