Timothy Keyes (@timothykeyes)

😭🤣🤪

17.09.2025 20:45 👍 0 🔁 0 💬 0 📌 0

I’m not sure what face I’m making in that screenshot (lol) but I’m glad you enjoyed! (Funny faces notwithstanding)

17.09.2025 19:46 👍 1 🔁 0 💬 1 📌 0

A chat interface showing a conversation starting with a user saying "Are there any talks about evals" and a response "Yes, Simon Couch just gave one, and he hopes you enjoyed it." There's a teddy bear mascot wearing a teal shirt and cap in the bottom right corner of the chat window.

A picture from the side of a conference session room, with myself standing in front.

Just wrapped up a talk at #positconf2025 about LLM evaluation with R! Such a joy to hang out with #rstats folks in person and hear about what others are working on.

Resources and slides: github.com/simonpcouch/...

17.09.2025 19:08 👍 25 🔁 3 💬 1 📌 0

my most recent hot take: after a year working inside my hospital’s IT system, i’m increasingly convinced that clinical informatics should have been a residency (like pathology or radiology), not a fellowship.

13.05.2025 04:10 👍 5 🔁 0 💬 0 📌 0

idk what it means that so many people I know to be thoughtful, compassionate, and brilliant physicians have told me that - while they love taking care of patients - the growing frustrations and exhaustion they experience in clinical life make the whole thing feel unsustainable in the long-term

23.03.2025 23:50 👍 3 🔁 0 💬 0 📌 0

as i prepare to go back to med school, i am haunted daily by the number of my med school classmates (who are now residents, fellows, or attendings) who have reached out to me to learn about my side-career in health tech because they want to explore alternatives to a full-time clinical career

23.03.2025 23:50 👍 1 🔁 0 💬 1 📌 0

how are you such a legend - this is amazing!

20.03.2025 23:12 👍 5 🔁 0 💬 0 📌 0

all this talk about AI and i’m still sitting here thinking about the warm, familiar embrace of OLS

20.03.2025 03:39 👍 6 🔁 0 💬 0 📌 0

today i got an email in which a statistician cited a paper from over 200 years ago and all i could think is how that’s the kind of power computer scientists wish they had

20.03.2025 03:35 👍 24 🔁 2 💬 4 📌 0

I feel like every paper I read about LLMs in health care takes for granted that the encouraging, but not necessarily transformative, results we’ve seen so far are going to scale upward as LLMs improve. But are we really safe to assume that LLMs will keep getting better? I’m not so sure.

12.03.2025 05:51 👍 2 🔁 0 💬 1 📌 0

anyone who made me learn the brachial plexus will now be forced to understand gradients, sorry i don’t make the rules

12.03.2025 02:48 👍 0 🔁 0 💬 0 📌 0

Meme of a still from a Dr. Phil interview. An unamused girl is pictured with a banner reading “aneska says violence works for her”

forced my physician collaborators to listen to me talk about partial derivatives today

12.03.2025 02:46 👍 3 🔁 0 💬 1 📌 0

I can already tell that this paper is going triple-platinum in the Keyes household. Thank you for sharing!

08.03.2025 01:53 👍 0 🔁 0 💬 1 📌 0

Red teaming ChatGPT in medicine to yield real-world insights on model behavior - npj Digital Medicine npj Digital Medicine - Red teaming ChatGPT in medicine to yield real-world insights on model behavior

It’s finally out! We brought a multidisciplinary team of physicians, computer scientists, and engineers to red team LLMs for healthcare uses. And we have shared the dataset! www.nature.com/articles/s41...

07.03.2025 18:46 👍 44 🔁 11 💬 2 📌 4

disclaimer: I haven’t finished medical school yet (i will someday!) so this is mostly stolen valor, but even having a small amount of clinical training has imo helped me so much to understand small details that my technical colleagues don’t really notice or care about

01.03.2025 02:37 👍 2 🔁 0 💬 0 📌 0

i’m working on a few really cool applications of LLMs in health right now (in the real world, in a deployment environment) and my most valuable asset by far in this work isn’t being the most technical person on the team (i’m not); it’s knowing just enough about medicine to ask the right questions

01.03.2025 02:37 👍 4 🔁 0 💬 1 📌 0

AI Grand Rounds Episode 27 From Clinical Notes to GPT-4: Dr. Emily Alsentzer on Natural Language Processing in Medicine

Dr. @emilyalsentzer.bsky.social, a Stanford faculty member and expert in clinical #AI, discusses the evolution of natural language processing, the challenges of AI in clinical settings, and what the future holds for open-source medical AI. Full episode: nejm.ai/4gOGeSo

#MedSky #MLSky

19.02.2025 14:22 👍 17 🔁 6 💬 1 📌 0

"What I find hard to reconcile is, on the one hand, we want to not fall behind on AI writ large. And on the other hand, the very people we need to ensure that agility are being let go.”—Nigam Shah @stanfordhai.bsky.social

17.02.2025 01:13 👍 146 🔁 24 💬 9 📌 0

i have to respect how non-technical folks use spreadsheets - they give you these beautiful murals with all kinds of colors and spacing and intricate patterns. i must say it really breaks my heart to immediately flatten it all with read_csv

18.02.2025 01:34 👍 2 🔁 0 💬 0 📌 0

(i appreciate the use of benchmarks and think they’re really valuable, but benchmarks must be drawn from some population of possible observations, right? so there will still be sampling error in any metrics computed using a benchmark, even if all models use the same benchmark)

02.02.2025 23:18 👍 1 🔁 0 💬 0 📌 0

i’ve been trying to understand why reporting some estimate of confidence intervals around performance metrics (or null hypothesis significance testing) is not more common in the machine learning/AI literature. i think this is changing, but there’s still a lot of weird (or missing) statistics ime

02.02.2025 23:18 👍 2 🔁 0 💬 1 📌 0

i think the vast ambiguity in the term “data scientist” causes a lot of headaches. am i an engineer? am i a computer scientist? am i a statistician? who knows!!!

31.01.2025 20:35 👍 1 🔁 0 💬 1 📌 0

started my day writing a few (simple) statistical proofs for a data science project (to justify simplifying a calculation from something complicated to something simpler and equivalent) and wow it really is nice to dust off the old PhD and put it to use every once in a while

31.01.2025 20:30 👍 2 🔁 0 💬 0 📌 0

This was such a wonderful read - thanks for sharing it @emilyriederer.bsky.social! I’ve been using poetry for package development recently, but now I’m eager to try uv!🤞 (And seaborn.objects might be a nice alternative to plotnine too!)

27.01.2025 05:51 👍 1 🔁 0 💬 0 📌 0

This is really cool! Congrats and thanks for sharing!

03.01.2025 20:44 👍 2 🔁 0 💬 0 📌 0

i rounded on stanford’s palliative medicine service today, and it was such a powerful reminder that end-of-life care clinicians are some of the most kind and empathetic people in health care. truly in awe of their ability to bear witness to (and guide people through) such difficult moments

23.12.2024 23:59 👍 1 🔁 0 💬 0 📌 0

Thanks for sharing! I’m excited to bring this to our group for journal club.

07.12.2024 05:07 👍 2 🔁 0 💬 1 📌 0

when someone tells you that they’ve solved an important problem using ai, that’s very exciting! but you should not take their word for it. if they don’t provide very transparent tools for monitoring/evaluating their ai product, it’s pretty safe to assume even *they* don’t know if it actually works

06.12.2024 01:27 👍 7 🔁 0 💬 0 📌 0

i am not an ai doomer by any stretch, but building ai systems (big or small) that solve useful problems in health care is really hard! validation experiments are really hard (and sometimes really expensive) to do! very few ai products work exactly as advertised, and many don’t work at all!

06.12.2024 01:27 👍 5 🔁 0 💬 1 📌 0

Timothy Keyes

Latest posts by Timothy Keyes @timothykeyes