Can LLMs use ToM to genuinely persuade you, or do they just use good rhetoric? In our new preprint, we use the MINDGAMES framework to test this. Surprisingly, LLMs like o3 can be incredibly effective persuaders *without* actually understanding your mental states. π§΅π
10.03.2026 03:36
π 12
π 5
π¬ 1
π 1
Check out our work testing Planning Theory of Mind in LLMs and humans!
@jaredlcm.bsky.social is talking about this at #CogSci2025 this Friday at 4pm, and at @colmweb.org in Montreal in October. Don't miss him, his presentations are top shelf.
29.07.2025 20:43
π 5
π 0
π¬ 0
π 0
Sage Journals: Discover world-class research
Subscription and open access journals from Sage, the world's leading independent academic publisher.
Lucky to be back from a month-long break, with a paper out in @bigdatasoc.bsky.social! How do academics participate in the construction of βAIβ as a research field? We interviewed 90 university-based AI researchers in the UK, US, and Aus. Paper here: journals.sagepub.com/doi/10.1177/...
1/4
04.02.2025 00:59
π 10
π 3
π¬ 1
π 1
@kathyreid.au
18.12.2024 04:03
π 1
π 0
π¬ 1
π 0
I'm at Neurips, presenting this work with @glenberman.bsky.social , Ned Cooper and @wesleydeng.bsky.social . Come check out our poster at the EvalEval workshop on Sunday or DM me to chat!
12.12.2024 19:58
π 5
π 3
π¬ 0
π 0
In a paper weβre workshopping this week at #NeurIPS2024, Ned Cooper, @wesleydeng.bsky.social, and Ben Hutchinson and I ask: what is the model of societal impacts reflected in efforts to evaluate GenAI systems?
Paper: arxiv.org/abs/2410.22985
Workshop: evaleval.github.io
1/5
12.12.2024 00:11
π 3
π 4
π¬ 2
π 1