Joe Powers's Avatar

Joe Powers

@joepowers16

Behavioral Data Scientist interested in experimentation, causal inference, and workflow.

85
Followers
332
Following
35
Posts
10.10.2023
Joined
Posts Following

Latest posts by Joe Powers @joepowers16

Preview
how to set truncated t-distribution prior in bambi %pip install bambi==0.15.0 %pip install numpy==1.26.4 import bambi as bmb import polars as pl from polars import col, lit import pandas as pd import numpy as np import arviz as az np.random.see...

I have been struggling to set a truncated prior in bambi with a non-zero central tendency. I have posted a reprex with data at this stackoverflow link if anyone can offer support. #bambi stackoverflow.com/questions/79...

28.02.2025 15:54 πŸ‘ 1 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

Each sim could have 100k rows with 5 columns. From each model fit I would save a row with 10 model stats. Simulation volume varies but typically 4000 to 10000. Python and R are the languages.

26.11.2024 18:06 πŸ‘ 0 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

There are scenarios I would tackle that involved fitting models to orders of magnitude more datasets, but I'm running into the limits of my mental model on how simulation studies effectively scale, and that prompted the initial post.

26.11.2024 18:04 πŸ‘ 0 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

I simulate 10s of thousands of datasets using likelihood functions from numpy or base R, and then I fit competing models to each dataset and save key model stats. I use the sampling distribution of model stats to compare the business impact of adopting different models for decision making.

26.11.2024 18:03 πŸ‘ 0 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

For an aspiring simulation study developer, what resources would you recommend to get your head around designing for scalable simulation studies and effectively using PySpark/Databricks and/or AWS Sagemaker to scale large sim studies (cc: @jdlong.cerebralmastication.com, @jordannafa.bsky.social)

26.11.2024 15:22 πŸ‘ 0 πŸ” 0 πŸ’¬ 2 πŸ“Œ 0

Neat approach to using Bluesky to thread blog comments

26.11.2024 14:02 πŸ‘ 18 πŸ” 2 πŸ’¬ 1 πŸ“Œ 0

@phdemetri.bsky.social great post! You set a parameter for true_probability for A & B in simulating future samples. So you're computing the Pr(change your mind | a 3% real effect). Why not base the future simulations and Pr(change your mind) upon a prior distribution rather than a parameter?

12.11.2024 16:35 πŸ‘ 0 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

yes

11.11.2024 01:11 πŸ‘ 0 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

Thank you both, I will check these out!

11.11.2024 01:01 πŸ‘ 1 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

@jordannafa.bsky.social or @solomonkurz.bsky.social can you provide any leads?

10.11.2024 18:32 πŸ‘ 0 πŸ” 0 πŸ’¬ 2 πŸ“Œ 0

I've been churning a lot about how to use expected value of information as a stopping criteria for AB tests. Most of the literature assumes there is some cost associated with sampling, but in the context of an AB test, the only cost is the opportunity cost of delaying rollout of the best variant.

10.11.2024 18:31 πŸ‘ 2 πŸ” 0 πŸ’¬ 2 πŸ“Œ 0

# create a .Rprofile in your project root directory & add this:
.First <- function() {
r_files <- list.files("R", pattern = "\\.R$", full.names = TRUE)

# Source each file
for (file in r_files) {
source(file)
}

cat("All scripts in", "R", "have been executed.\n")
}

03.11.2024 17:06 πŸ‘ 1 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

Just learned that you can source an R/ directory in your project by default every time you open the project, and have all of those project specific functions available by default! #rstats

03.11.2024 17:05 πŸ‘ 0 πŸ” 0 πŸ’¬ 2 πŸ“Œ 0

I’m curious if anything came out of this line of inquiry

17.10.2024 03:46 πŸ‘ 0 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

I’ll be at MIT CODE this week if any of you would like to meet in person.

17.10.2024 03:44 πŸ‘ 0 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

Is there an R or Python package with API to historic weather data? I am trying to access records of daily temp highs & lows in San Diego microclimates, so I'm hoping to gather records from as local stations as possible rather than averages for all of san diego. #rstats #Python #climatescience

15.10.2024 15:20 πŸ‘ 2 πŸ” 0 πŸ’¬ 2 πŸ“Œ 0

take the model matrix and mean-center of all predictors. (even the dummy-coded factors). here is that what that looks like for `mpg ~ wt + factor(cyl)`. brms fits the regression model on centered data and so the prior for the intercept is on the mean when everything is mean centered

05.09.2024 20:21 πŸ‘ 13 πŸ” 3 πŸ’¬ 1 πŸ“Œ 2

serious question: how do you coach people to become strategic thinkers?

or maybe even more basic, how do you teach them to categorize and process info to recognize patterns?

01.05.2024 18:08 πŸ‘ 2 πŸ” 2 πŸ’¬ 4 πŸ“Œ 0

Both books have this theme about the futility of non-strategic action. Perhaps cementing that belief is step one.

02.05.2024 02:26 πŸ‘ 1 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

I tried to pursue this question during my PhD. It’s a tough nut for a variety of reasons, but I really enjoyed Richard Rumelt’s Good Strategy Bad Strategy and Ericcson’s Peak. Strategy is hard work and most people avoid it. Expertise is pattern recognition and that takes a lot of time and feedback.

02.05.2024 02:25 πŸ‘ 2 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

Have others seen stats that try to express type M and type D errors simultaneously?

03.02.2024 17:27 πŸ‘ 0 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

Defining practical accuracy as % of posterior that is directionally accurate but less than a 50% overestimate was pure shooting from the hip. For small effects e.g., 0.25% lift, this would be rather narrow (0>theta>0.375%) and for large effects e.g., 6% lift, the range would be broader (0>theta>9%).

03.02.2024 17:14 πŸ‘ 0 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0
Post image Post image

The benefits of informed priors for decision making. First plot demonstrates shrinkage from an informed prior in a growing sample. Second plot tries to quantify accuracy gains for decision making. I'm thinking about expressing "practical accuracy" as defined in the subtitle. Feedback very welcome.

03.02.2024 17:10 πŸ‘ 0 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

Appreciate everyone’s feedback!

26.01.2024 15:05 πŸ‘ 0 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

Sharing to my future self Ben Bolker’s amazing syntax table for lme4 and brms models bbolker.github.io/mixedmodels-...

26.01.2024 14:45 πŸ‘ 0 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0
Post image

I will stop optimizing this now

25.01.2024 17:11 πŸ‘ 2 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

Great post and great paper. The baseball example is really illuminating because it shows how partially pooled estimates from early in the season predict end-of-season batting average better than raw estimates from early season.

25.01.2024 16:40 πŸ‘ 3 πŸ” 0 πŸ’¬ 2 πŸ“Œ 0
Post image

Trying to convey the value of partial pooling for colleagues who chase noise-prone subgroup analyses. Imagine I know two people who have also spent a lot of time on this. Have either of you found visualizations that intuitively convey the phenomena? @solomonkurz.bsky.social & @jordannafa.bsky.social

25.01.2024 15:01 πŸ‘ 3 πŸ” 1 πŸ’¬ 4 πŸ“Œ 0

@gkountourides.bsky.social I remembered your question about better collaboration through git. I have been reading raps-with-r.dev and think you would find it relevant.

14.12.2023 01:11 πŸ‘ 1 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0