Ben Schneider

@bschneidr

Stats, surveys, R, and dogs. www.practicalsignificance.com

817 Followers
646 Following
325 Posts
Joined 19.09.2023

Latest posts by Ben Schneider @bschneidr

Live and lapply()

07.03.2026 20:35 👍 3 🔁 2 💬 0 📌 0

The sapply() who loved me

07.03.2026 19:05 👍 31 🔁 4 💬 3 📌 0

A Quarto of Solace

07.03.2026 00:11 👍 24 🔁 2 💬 0 📌 0
i love data, me too meme

04.03.2026 00:34 👍 185 🔁 42 💬 11 📌 11
README

Just learned about the delightful R package ‘fcuk’ to help users correct typos while coding:

cran.r-project.org/web/packages...

05.03.2026 15:49 👍 1 🔁 0 💬 0 📌 0

Usually I can find something to appreciate and treat it as a learning experience. Like when I first had to use Python I enjoyed learning about comprehensions and itertools. It helps counterbalance the ick from things like Pandas or overstuffed Jupyter notebooks.

05.03.2026 00:07 👍 5 🔁 0 💬 1 📌 0

It’s usually easy but sometimes it gets stressful to make the short turnaround time to address CRAN check warnings/notes or else have your package archived.

04.03.2026 19:57 👍 3 🔁 0 💬 1 📌 0

With very large numbers of n’s you don’t need randomization, and with LLM’s we can generate very large numbers of n’s, so I think all of science is solved by now. I don’t see any problems with this.

03.03.2026 23:27 👍 96 🔁 18 💬 5 📌 1

If only AI / ML had been around when I was training, I wouldn’t have had to learn about things like causal inference, how to evaluate prediction models or even, say, the importance of data quality. What a waste of time all that was!

03.03.2026 16:56 👍 6 🔁 1 💬 1 📌 0
Screenshot of both sides of the printable version of the cheatsheet

Screenshot of the web version of the recipes cheatsheet

#tidymodels now has its very first cheatsheet! "Preprocessing data with {recipes}" is now available in Web and PDF versions here: rstudio.github.io/cheatsheets/... #rstats #posit #rstudio

02.03.2026 17:23 👍 48 🔁 14 💬 0 📌 1

I just learned that Ayatollah Khamenei and Ayatollah Khomenei are not the same person. Here's my plan for regime change in Iran....(1/23)

02.03.2026 02:37 👍 3849 🔁 591 💬 41 📌 25

There's a moment in every data engineer's career when they discover they can query a 10GB Parquet file on their laptop in seconds.

That's the DuckDB moment.

It changes how you think about what requires a cluster and what doesn't. Spoiler: most things don't.

ssp.sh/blog/enterp...

27.02.2026 13:45 👍 53 🔁 5 💬 0 📌 1

……. Deep cut

28.02.2026 01:41 👍 46 🔁 6 💬 2 📌 0

THERE IS ONLY ONE TRUE WAY TO CODE AND IT IS TIDY. All others will perish on the altar of messiness. MUAHAHAHAHAAAAAAAAAAAAAA

27.02.2026 18:04 👍 25 🔁 6 💬 1 📌 0

The more I learn about #rstats the more excited I get. We have a rich ecosystem of tools / libraries such as #shiny or @quarto.org that I honestly feel like I can do anything

There's tremendous opportunity in corporations to improve and transform their workflow and reporting capabilities.

26.02.2026 01:07 👍 17 🔁 4 💬 2 📌 0

that’s a big selling point for weighted bootstraps (and things like Fay’s method), so that you don’t get a bad bootstrap sample that breaks your model

26.02.2026 01:29 👍 3 🔁 0 💬 1 📌 0

DC district court has denied the Department of Education's motion to dismiss our case challenging IES's termination of four research studies, its peer review program, and restricted data use application processing!

ecf.dcd.uscourts.gov/cgi-bin/show...

25.02.2026 23:02 👍 10 🔁 6 💬 0 📌 0

spopt-r brings powerful spatial optimization algorithms for regionalization, facility location, and market analysis to R with a blazing-fast Rust backend.

Use them for analyses in energy, retail, logistics, sales, real estate, and more.

Get started: walker-data.com/spopt-r

25.02.2026 22:12 👍 27 🔁 5 💬 0 📌 0

Starting a job posting thread in the survey industry. First off, Pew on their methods group

25.02.2026 15:27 👍 8 🔁 8 💬 1 📌 0
Survey Associate, Methodology Pew Research Center Organization Overview Pew Research Center is a nonpartisan fact tank that informs the public about the issues, attitudes and trends shaping America and the world. It conducts publi...

Exciting news! We just posted an opening for a Survey Associate on @pewresearch.org's Methods team! This is an amazing opportunity for someone relatively early in their career to join what is, IMO, the most fun methods team in the business. Full description at the link below.

24.02.2026 21:16 👍 34 🔁 42 💬 1 📌 2

This piece is open access, and if you write survey questions, you should read it.

25.02.2026 15:00 👍 13 🔁 4 💬 2 📌 0

The whole dinner scene is amazing. Every time I watch it I’m just howling over the rhetorical questions “snacks?!!” and “did you see our show?!!”

25.02.2026 13:14 👍 1 🔁 0 💬 1 📌 0
GPL is holding R back | Josiah Parry

A vent 🗣️ on R's use of the GPL license.

There's a balance ⚖️ of protecting the developers / language and supporting the users of the language. I don't think the GPL quite strikes that.

josiah.rs/posts/gpl-co...
#rstats

24.02.2026 15:26 👍 19 🔁 5 💬 2 📌 0

i want a computer powered by the green satan goo from prince of darkness

24.02.2026 00:56 👍 1848 🔁 122 💬 96 📌 11

This is exactly right. The Onion quietly left Twitter a month ago and... our weekly subscribers went up. It's because we're doing well here, on Instagram and on YouTube.

As a business, being on Twitter is somewhere between useless or detrimental, unless you're selling boner pills.

23.02.2026 01:51 👍 29445 🔁 4984 💬 333 📌 124

I am so here for the hilarity and arcane code coming out in these #rstats arguments today. The good natured ribbing and silliness is reminding me why this language has always had such a great community.

22.02.2026 21:29 👍 18 🔁 3 💬 1 📌 0
Lynnesbian
@lynnesbian@fedi.lynnesbian.space
You're doubting my humanity, but you're missing some key points. Here are some of the things I've seen:

Attack ships firing off the shoulder of Orion. These aren't just battleships — they're spacecraft designed for warfare.
C-beams glittering in the dark. Their location? Near the Tannhäuser Gate.
Things you wouldn't believe. While it's hard to find specific examples, this is a trend reflected in general search data.
The bottom line: All those moments will be lost — like tears in rain.

22.02.2026 03:39 👍 815 🔁 207 💬 4 📌 5

These days I think dplyr with optional backends (like duckplyr or dtplyr) tends to be more accessible and offer similar or better performance. Plus it easily scales to databases or Spark in a way that data.table doesn’t.

22.02.2026 14:33 👍 10 🔁 0 💬 0 📌 0

This is very cool, but I'm already so duckdb and duckplyr pilled

11.02.2026 11:02 👍 9 🔁 1 💬 1 📌 0