How do AI chatbots see us?
BKC Spring Speaker Series Event
When you talk with a chatbot, what does it “think” about you? Recent work in AI interpretability, based on high-dimensional geometry, is beginning to provide some intrig...
Join us tomorrow as @wattenberg.bsky.social and I talk about how instrumenting AI chatbots with real-time dashboards can help reveal social cognition capabilities -- something that can be both useful and problematic.
This talk is open to the public.
cyber.harvard.edu/events/how-d...
04.03.2025 18:26
GitHub - ARBORproject/arborproject.github.io
Excited to announce ARBOR: a radically open project on AI interpretability for reasoning models.
github.com/ARBORproject...
Join us in collectively analyzing and interpreting how reasoning works!
20.02.2025 19:55