🚨New paper:
Current reports on AI audits/evals often omit crucial details, and there are huge disparities between the thoroughness of different reports. Even technically rigorous evals can offer little useful insight if reported selectively or obscurely.
Audit cards can help.
21.04.2025 17:10
👍 2
🔁 2
💬 1
📌 0
Some researchers are rethinking how to measure AI intelligence
Current popular benchmarks are often inadequate or too easy to game, experts say.
A recent Stanford paper reveals that many popular AI benchmarks are fundamentally flawed: They can be outdated, easily gamed, or inaccurate. Stanford HAI Graduate Fellow
@ankareuel.bsky.social talks about how researchers are rethinking AI benchmarks: www.emergingtechbrew.com/stories/2025...
25.03.2025 21:26
👍 9
🔁 3
💬 1
📌 0
Hey Kabir! A lot of it is applicable for different types of evals, especially when it comes to reporting considerations. Would you mind sharing more infos here or via DM on the hackathon? Sounds like this would be a cool opportunity to extend the BetterBench work!
28.01.2025 00:45
👍 3
🔁 0
💬 1
📌 0
Submitting a benchmark to
ICML? Check out our NeurIPS Spotlight paper BetterBench! We outline best practices for benchmark design, implementation & reporting to help shift community norms. Be part of the change! 🙌
+ Add your benchmark to our database for visibility: betterbench.stanford.edu
27.01.2025 22:02
👍 12
🔁 3
💬 1
📌 0
This is such a hard one :D And I think it extends beyond being patient with the students but also being patient with yourself knowing that you won't get everything perfect the first time around (or ever 🥲)
05.01.2025 17:45
👍 6
🔁 0
💬 1
📌 0
🔄 Sharing is caring! Help us reach as wide of an audience as possible by spreading the word. Your support is key in crafting an insightful, community-driven chapter and help key researchers in the field get their work promoted! Thank you! 🙏#StanfordHAI #AIIndex x/
05.01.2025 17:42
👍 5
🔁 2
💬 0
📌 0
The AI Index is an initiative by @stanfordhai.bsky.social. The annual report showcases AI research to enable decision-makers to advance AI responsibly. Previous versions have been cited 300+ times; it's been featured in top media outlets like the @nytimes.com & the @financialtimes.com. 4/
05.01.2025 17:42
👍 5
🔁 0
💬 1
📌 0
Our chapter will cover fairness & non-discrimination, transparency, explainability, data governance & privacy, security, societal impact, and more. Plus, a special subchapter on responsible AI agents! 🤖 3/
05.01.2025 17:42
👍 0
🔁 0
💬 1
📌 0
📢 Excited to share: I'm again leading the efforts for the Responsible AI chapter for Stanford's 2025 AI Index, curated by @stanfordhai.bsky.social. As last year, we're asking you to submit your favorite papers on the topic for consideration (including your own!) 🧵 1/
05.01.2025 17:42
👍 13
🔁 8
💬 1
📌 0
This is all awesome advice, thank you so much for sharing! This is an in-person course but we’ll make all lectures publicly available.
04.01.2025 04:30
👍 1
🔁 0
💬 0
📌 0
I‘m teaching my first own course starting next week (Intro to AI Governance at Stanford). Super proud but also nervous 🥹 Any advice from more seasoned instructors? 😬 #AcademicTwitter #AcademicChatter #TeachingTips #AcademicAdvice
04.01.2025 03:14
👍 11
🔁 0
💬 2
📌 0
The regular reminder of my starter packs full of amazing folks / accounts to follow. I am trying to keep them up to date but let me know if I missed you.
24.12.2024 08:28
👍 5
🔁 1
💬 0
📌 0
Thank you, Stefanie! ❤️
19.12.2024 18:46
👍 0
🔁 0
💬 0
📌 0
In our latest brief, Stanford scholars present a novel assessment framework for evaluating the quality of AI benchmarks and share best practices for minimum quality assurance. @ankareuel.bsky.social @chansmi.bsky.social @mlamparth.bsky.social hai.stanford.edu/what-makes-g...
11.12.2024 18:08
👍 11
🔁 4
💬 0
📌 0
Looking forward to your talk! :)
09.12.2024 20:36
👍 0
🔁 0
💬 0
📌 0
Thanks a ton, Federico! :)
07.12.2024 19:37
👍 1
🔁 0
💬 0
📌 0
Thanks so much, Lorena!
07.12.2024 19:37
👍 1
🔁 0
💬 0
📌 0
Thanks so much, Daniel!
07.12.2024 19:37
👍 1
🔁 0
💬 0
📌 0
Thanks a lot, Stephan 😊
07.12.2024 06:42
👍 1
🔁 0
💬 0
📌 0
Thank you Karen 🦋
07.12.2024 02:02
👍 1
🔁 0
💬 0
📌 0
Thanks so much! And yes, very much looking forward to the weekend 😁🫶
06.12.2024 23:50
👍 1
🔁 0
💬 0
📌 0
Thanks a lot!
06.12.2024 23:40
👍 1
🔁 0
💬 1
📌 0
In the same boat as @mlamparth.bsky.social, would appreciate if you could add me, too, please! Thanks so much 😊
06.12.2024 22:47
👍 2
🔁 0
💬 2
📌 0
In the same boat as @mlamparth.bsky.social, would appreciate if you could add me, too, please! Thanks so much 😊
06.12.2024 22:47
👍 2
🔁 0
💬 1
📌 0
In the same boat as @mlamparth.bsky.social, would appreciate if you could add me, too! Thanks so much 😊
06.12.2024 22:46
👍 2
🔁 0
💬 1
📌 0
In the same boat as @mlamparth.bsky.social, would appreciate if you could add me, too! Thanks so much 😊
06.12.2024 22:46
👍 2
🔁 0
💬 0
📌 0
Would appreciate if you could add me to the Responsible AI and the Security starter packs, similar to @mlamparth.bsky.social, I’m moving here from X 😊
06.12.2024 22:45
👍 2
🔁 0
💬 1
📌 0
In the same boat as @mlamparth.bsky.social, would appreciate if you could add me, too! Thanks so much 😊
06.12.2024 22:42
👍 2
🔁 0
💬 1
📌 0