strim
strim
Did I hear "sports betting"? @stat-ron.bsky.social
I still embrace Rmd
@statsinthewild.bsky.social
Here's a full draft of the upcoming second edition of my "Data Visualization: A Practical Introduction": socviz.co
who dis @statsinthewild.bsky.social
Now live: presentation videos. Find all recorded talks on our Youtube. Link below.
Recorded presentations from the 2026 ASI Summit are now live on YouTube. To see all of the incredible talks from this year's Summit, visit the link below:
www.youtube.com/playlist?lis...
Registration for the 2026 #SMTDataChallenge is OPEN! We want to see what you would put "On the Big Screen!" We want to see how you can tell stories like SMT through a metric, visualization, or interactive fan experience that could be displayed on a MiLB scoreboard!
@statsinthewild.bsky.social still does df[1,]
I do this all the time too (also to avoid scientific notation)
📣 Competition Launch Alert! Our 12th annual March ML Mania competition is here!
🎯 Forecast the outcomes of the 2026 NCAA basketball tournaments by predicting the probabilities of every possible matchup
💰 $50,000 Prize Pool
⏰ Final Submission: March 19th, 2026
www.kaggle.com/competitions...
Looking again for clear eyes and full hearts at the Dallas Cowboys. Would you be a good fit or know someone who is? Please help us amplify our search by sharing this message. #DallasCowboys #CowboysNation #NFL #workinsports #SportsAnalytics #sportsjobs
2026 Strategic Football Fellow(s)
is.gd/UTuHb6
dplyr 1.2.0 is out now and we are SO excited!
- `filter_out()` for dropping rows
- `recode_values()`, `replace_values()`, and `replace_when()` that join `case_when()` as a complete family of recoding/replacing tools
These are huge quality of life wins for #rstats!
tidyverse.org/blog/2026/02...
Congrats to the finalists and honorable mentions for the 2026 #BigDataBowl Analytics Track
Side-by-side comparison of two multi-panel bubble charts faceted by world region. The left column shows the default facet labels placed above each panel ("Africa", "Americas", "Asia", "Europe", "Oceania"). The right column shows the same charts, but the facet labels are moved inside each panel at the top-left using a negative margin. In the center, there is a title reading "Want to place your facet labels inside each panel?" with an arrow pointing right, followed by a short ggplot2 theme code snippet demonstrating how to move strip text inside the panel.
I ignored the strip.clip argument in #ggplot2 for way too long 😲
Combined with a small negative margin tweak, you can place facet labels inside each panel. A tiny trick that makes small multiples feel so much cleaner.
🔵 no manual coordinates
🔵 inherits theme styling
🔵 scales nicely when resizing
The folks at Carnegie Mellon have identified a new factor that potentially influences the success of an offense -- unpredictability in the timing between pre-snap motion and the start of the play. (And the kings in this regard are Mahomes and Brady.)
www.nbcsports.com/nfl/profootb...
The Hudl Performance Insights 2025 Research Papers are available to read now.
Nine winners used Hudl Statsbomb Event + 360 & Hudl Physical Data to build their findings and present them live on the Research Stage at the event.
Download them here 🔽 www.hudl.com/en_gb/hudlpr...
The 6th Connecticut Sports Analytics Symposium was held April 11–12 at Yale University, drawing about 150 registrants for keynotes, a data challenge, workshops, and poster sessions. Read a recap: magazine.amstat.org/...
A multilevel model with heterogeneous variances for snap timing in the National Football League
Nguyen and Yurko snap into the passing lanes
doi.org/10.1093/jrss...
Cre?
An illustration of a curling stone. The 2026 CSAS Data Challenge will feature mixed doubles curling power play optimization.
The Connecticut Sports Analytics Symposium is inviting student teams to participate in the 2026 CSAS Data Challenge. Registration is due by December 1 with submissions accepted until January 15. Finalists will be notified in February and invited to present at CSAS. stattrak.amstat.org/...
To be effective, data science agents need to be able to read plots reliably. @sara-altman.bsky.social and I wrote about some concerning findings on LLMs' ability to interpret plots when the content contradicts their expectations on the @posit.co blog.
posit.co/blog/introdu...
Do you teach #rstats? Do your students complain about how lame and old-fashioned dplyr is? Don't worry: I have the solution for you: github.com/hadley/genzp....
genzplyr is dplyr, but bussin fr fr no cap.
www.kaggle.com/code/tindata...
My paper (coauthored w/ @stat-ron.bsky.social) on modeling variability in QB snap timing using #BigDataBowl data is published in JRSSA.
academic.oup.com/jrsssa/advan...
I have 3 streams left and that's insane. I've streamed longer than I've been in baseball and longer than I've been a neuroscientist. Playing guitar may be the only thing I've done longer than stream, to far less success.
See y'all over the next 3 days!
disappointed that the substack name is not hai davai
NFL Big Data Bowl dropped this past week and while I'm not entering this year, I did want to have some fun with tracking data and other football side projects so I'm launching a Substack as a public forum to do so. Plan to do some BDB tutorials. Check it out!
cincysam6.substack.com/p/welcome-fr...