Timo Kaufmann

@timokauf

PhD Student at LMU Munich. Focus on RL, reward learning and learning from human preferences.

7 Followers
10 Following
9 Posts
Joined 26.03.2025

Latest posts by Timo Kaufmann @timokauf

Joint work with Yannick Metz, Daniel Keim, and Eyke Hüllermeier.

04.12.2025 21:06 👍 0 🔁 0 💬 0 📌 0

Benefits: improved reward model generalization, better data efficiency, and stronger policies. Looking forward to seeing you at the poster!

Paper and more: timokaufmann.com/responserank/

04.12.2025 21:05 👍 0 🔁 0 💬 1 📌 0

The key insight is that these signals only need to be locally valid and relative (e.g., within one annotator's comparisons). No need to model the exact relationship to strength. Just rank which comparisons are stronger.

04.12.2025 21:04 👍 0 🔁 0 💬 1 📌 0

The core idea: Not all preferences are equal. ResponseRank learns preference strength from implicit signals in your data, like inter-annotator agreement, stated confidence, or response times.

04.12.2025 21:04 👍 1 🔁 0 💬 1 📌 0

Presenting ResponseRank at #NeurIPS2025! Come by poster #405 at 4:30pm today if you're in San Diego 👋

04.12.2025 21:03 👍 0 🔁 0 💬 1 📌 0

Just noticed the key deadlines for #ICLR2026 are out! PSA for everyone else who's been waiting.

Full paper: Sept 24 AoE.

27.06.2025 09:12 👍 0 🔁 0 💬 0 📌 0

A bit late to post on Bluesky, but I had a great time at our poster session. Very cool to see so much interest in ICAI :)

28.04.2025 09:19 👍 1 🔁 0 💬 0 📌 0

🕵🏻💬 Introducing Feedback Forensics: a new tool to investigate pairwise preference data.

Feedback data is notoriously difficult to interpret and has many known issues – our app aims to help!

Try it at app.feedbackforensics.com

Three example use-cases 👇🧵

17.03.2025 18:12 👍 7 🔁 2 💬 1 📌 0

Currently visiting @arduin.io in Cambridge. I didn't realize it's this beautiful!

Do I know anyone here that I haven't met up with yet?

26.03.2025 09:00 👍 2 🔁 0 💬 0 📌 0

👋 I'll start cross-posting from twitter for now.

26.03.2025 08:58 👍 3 🔁 0 💬 0 📌 0