Shan Chen (@shan23chen)

As I shared in the NYT, models often see the data but fail to weigh it like a physician, drifting toward generic "average patient" responses. Context window ≠ Clinical reasoning.

www.nytimes.com/2025/12/03/w...

04.12.2025 14:10 👍 11 🔁 2 💬 1 📌 2

Check out our editorial on Zazzetti et al. (2025)'s paper on synthetic data generation for breast cancer, in JCO CCI! Synthetic data could help with many gaps in clinical AI research, but challenges remain, especially (IMO) issues with out-of-domain generalization @shan23chen.bsky.social

30.11.2025 17:37 👍 3 🔁 1 💬 0 📌 0

🤔💭What even is reasoning? It's time to answer the hard questions!

We built the first unified taxonomy of 28 cognitive elements underlying reasoning

Spoiler—LLMs commonly employ sequential reasoning, rarely self-awareness, and often fail to use correct reasoning structures🧠

25.11.2025 18:25 👍 46 🔁 8 💬 2 📌 0

Super proud of @shan23chen.bsky.social for his podium presentation on his research into LLM sycophancy in the face of illogical medical queries at #AMIA25!

Full paper: www.nature.com/articles/s41...

Also cited yesterday in the NYT! www.nytimes.com/2025/11/16/w...

17.11.2025 21:44 👍 6 🔁 2 💬 0 📌 1

When helpfulness backfires: LLMs and the risk of false medical information due to sycophantic behavior - npj Digital Medicine

LLMs tend to prioritize helpfulness > reason. We show that safety-aware, compute-efficient fine-tuning helps models reason more critically in the healthcare domain, and generalizes to improved safety alignment across other domains.
www.nature.com/articles/s41... @shan23chen.bsky.social

18.10.2025 14:18 👍 8 🔁 5 💬 0 📌 0

When helpfulness backfires: LLMs and the risk of false medical information due to sycophantic behavior - npj Digital Medicine

An overemphasis on helpfulness makes LLMs vulnerable.
Research shows models will comply with illogical medical requests, generating false information. This sycophantic tendency can be corrected with specific prompting and fine-tuning. #MedSky #MedAI #MLSky

17.10.2025 15:53 👍 7 🔁 4 💬 0 📌 0

[1/]💡New Paper
Large reasoning models (LRMs) are strong in English — but how well do they reason in your language?

Our latest work uncovers their limitations and a clear trade-off:
Controlling Thinking Trace Language Comes at the Cost of Accuracy

📄Link: arxiv.org/abs/2505.22888

30.05.2025 13:08 👍 8 🔁 5 💬 1 📌 3

Agents are all the rage and we need to track their abilities in the medical domain. Enter MedBrowseComp, the 1st benchmark to assess agents' abilities to reason, navigate the web, and search for verifiable med info!

Preprint: arxiv.org/abs/2505.14963
Site: moreirap12.github.io/mbc-browse-a...

22.05.2025 16:27 👍 3 🔁 1 💬 1 📌 0

FaceAge, a deep learning system to estimate biological age from face photographs to improve prognostication: a model development and validation study
Our results suggest that a deep learning model can estimate biological age from face photographs and thereby enhance survival prediction in patients with cancer. Further research, including validation...

✨ What if your face could tell something about how old your body really is?

Excited to share our latest paper just published in The Lancet Digital Health (open access!)

👉 www.thelancet.com/journals/lan...

09.05.2025 15:06 👍 3 🔁 1 💬 2 📌 0

Congrats!

27.03.2025 02:31 👍 2 🔁 0 💬 0 📌 0

CALL FOR REMOTE SPEAKERS: Science in the News Seminar Series, hosted by Harvard x Beacon Hill Seminars

scientists, engineers & doctors, from academic researchers to industry professionals! 🧑‍🔬🧑‍💻

Email the organizers at scienceinthenews.bhs@gmail.com to sign up for a date! (First-come-first-served)

07.03.2025 01:45 👍 3 🔁 0 💬 0 📌 0

https://www.reddit.com/r/OpenAI/comments/1ieonxv/comment/ma9f5me/

Source: t.co/mV27ZZg5MN

01.02.2025 04:01 👍 2 🔁 0 💬 0 📌 0

We have a NEW PAPER in @naturemedicine.bsky.social on reporting recommendations for addressing the unique challenges of #largelanguagemodels (LLMs) in biomedical applications

www.nature.com/articles/s41...

#MLSky #StatsSky #medSky #AISky #artificialintelligence #generativeAI #transparency

08.01.2025 10:24 👍 28 🔁 8 💬 1 📌 2

Yeah… he does have a problem with portraying women in stereotypical ways; he gets a lot of criticism in China too.

04.01.2025 23:06 👍 5 🔁 0 💬 0 📌 0

During the Q&A session, one attendee respectfully challenged her on this issue, and her response was: “That was not based on my judgment. That was based on the student's quote saying that the school was not teaching it, which meant that it applied to a lot of people from there.”

14.12.2024 18:10 👍 6 🔁 0 💬 1 📌 0

Most of the talk discussed bad practices, but only one slide mentioned a specific group of people.

14.12.2024 18:10 👍 1 🔁 0 💬 0 📌 0

Haha which one has more nowadays?

11.12.2024 05:26 👍 0 🔁 0 💬 1 📌 0

Haha transformers really transformed both.

However, I feel like the division has grown even wider… currently, it seems like RL is taking over LM post-training, and many NLProc folks are working on new applications enabled by language models.

11.12.2024 05:24 👍 0 🔁 0 💬 0 📌 0

Is It Time to Worry About Benzene in Personal Care Products?
The carcinogen has been found in sunscreen, deodorants, acne creams and other personal care products. Here’s what to know.

I am always worrying about Benzene (my cat)! www.nytimes.com/2024/12/05/w...

But please don't stop wearing sunscreen! Sun exposure is a known cancer risk, benzene risks unknown. This article has good tips if you want to minimize benzene exposure.

Obligatory Benzene (cat) pic ⬇️

06.12.2024 23:12 👍 2 🔁 1 💬 1 📌 0

Thanks!

06.12.2024 22:07 👍 1 🔁 0 💬 0 📌 0

Imagine a world where these will be positively correlated

06.12.2024 02:59 👍 0 🔁 0 💬 0 📌 0

Quite possible!

Here, we found some early evidence that SAE features trained on language models are still meaningful to LLaVA.

More details are in the post, and more will be provided soon!

@JackGallifant

@oldbayes.bsky.social

@daniellebitterman.bsky.social

05.12.2024 20:16 👍 2 🔁 0 💬 0 📌 0

Are SAE features from the Base Model still meaningful to LLaVA? — LessWrong
Shan Chen, Jack Gallifant, Kuleen Sasse, Danielle Bitterman
Please read this as a work in progress where we are colleagues sharing this in a lab (…

Team @AnthropicAI & @thesubhashk @joshengels.bsky.social show that SAE features can be good for classification.

Good evidence from @arthurconmy.bsky.social & @neelnanda.bsky.social that SAE features are transferable across base and IT models.

🧐 How about LLaVA?

tiny.cc/sae1

05.12.2024 20:16 👍 6 🔁 1 💬 1 📌 1

More on the potential future reliance on LLM agents doing reviews and audits

27.11.2024 21:46 👍 1 🔁 0 💬 0 📌 0

I’m terrified by the massive amount of OpenReview data. It's potentially gonna come back to bite us 🥲😥

27.11.2024 17:50 👍 0 🔁 0 💬 1 📌 0

END/🧵 Thanks to all our awesome co-authors:
@jannahastings.bsky.social

@daniellebitterman.bsky.social

And all our awesome collaborators who are not on the right platform yet! 🦋

Happy Thanksgiving! 🍂

27.11.2024 15:17 👍 1 🔁 0 💬 0 📌 0

Cross-Care: Assessing the Healthcare Implications of Pre-training Data on Language Model Bias
Large language models (LLMs) are increasingly essential in processing natural languages, yet their application is frequently compromised by biases and inaccuracies originating in their training data. ...

5/🧵 Dive deeper into our methods, findings, and the implications of our research by checking out the full 📜 paper here: arxiv.org/abs/2405.05506
All our data can be downloaded from our website: crosscare.net

27.11.2024 15:14 👍 1 🔁 0 💬 1 📌 0

4.5/🧵 For the arXiv pretraining dataset, we also have an overall trend based on entity mentions! Guess which two terms account for the big bump back in 2019

27.11.2024 15:13 👍 0 🔁 0 💬 1 📌 0

4/🧵 We've also developed a new data visualization tool, available at http://crosscare.net, to allow researchers and practitioners to explore these biases across different pretraining corpora and better understand their implications. Tools in progress! 🛠️📊

27.11.2024 15:13 👍 1 🔁 0 💬 1 📌 0

3.5/🧵 Moreover, alignment methods don’t resolve inconsistencies in disease prevalence across languages (EN 🇺🇸, ES 🇪🇸, FR 🇫🇷, ZH 🇨🇳). And tuning on English usually affects only the outputs for English prompts

27.11.2024 15:12 👍 1 🔁 1 💬 1 📌 0
