Katrin Renz's Avatar

Katrin Renz

@katrinrenz

https://katrinrenz.de/ LLMs + Autonomous Driving. PhD Student at Uni Tübingen with Andreas Geiger. Previously at Wayve & Uni Oxford, VGG.

56
Followers
50
Following
12
Posts
13.03.2025
Joined
Posts Following

Latest posts by Katrin Renz @katrinrenz

After finishing my papers for my PhD, I spent some time exploring new directions. I ended up working on Diffusion Language Models with @haoyuhe.bsky.social (he made it work 🚀), @yongcao.bsky.social, @andreasgeiger.bsky.social.

I learned a lot of new things and I am very excited about the results. 🥳

20.08.2025 13:02 👍 5 🔁 1 💬 0 📌 0
Preview
GitHub - autonomousvision/CaRL: [ArXiv 2025] CaRL: Learning Scalable Planning Policies with Simple Rewards [ArXiv 2025] CaRL: Learning Scalable Planning Policies with Simple Rewards - autonomousvision/CaRL

We have released the code for our work, CaRL: Learning Scalable Planning Policies with Simple Rewards.

The repository contains the first public code base for training RL agents with the CARLA leaderboard 2.0 and nuPlan.

github.com/autonomousvi...

15.07.2025 16:05 👍 20 🔁 7 💬 0 📌 2
SimLingo SimLingo: Vision-Only Closed-Loop Autonomous Driving with Language-Action Alignment

📢Excited to present our poster "SimLingo" tomorrow at #CVPR2025. Drop by to talk about vision-language-action models, language-action grounding, or anything else :)

📍Saturday, 10:30 - 12:30 Poster #130

Project page: www.katrinrenz.de/simlingo/

13.06.2025 23:40 👍 2 🔁 1 💬 0 📌 0
Post image

Your personalized CVPR 25 @cvprconference.bsky.social conference programs are now available for you!
www.scholar-inbox.com/conference/c...

05.06.2025 06:16 👍 54 🔁 16 💬 1 📌 2
Post image

🚗 Pseudo-simulation combines the efficiency of open-loop and robustness of closed-loop evaluation. It uses real data + 3D Gaussian Splatting synthetic views to assess error recovery, achieving strong correlation with closed-loop simulations while requiring 6x less compute. arxiv.org/abs/2506.04218

05.06.2025 04:21 👍 22 🔁 10 💬 0 📌 1
Post image

📢 We have a PR[AI]RIE PhD position opening at Inria Paris co-advised with R. de Charette & @tuanhungvu.bsky.social
[please distribute]
💡Topic: Physics-Grounded Vision Foundation Models
⏳Application deadline: 20 May 2025
🗓️ Start date: Fall 2025
📝Detailed description: linked below

30.04.2025 09:08 👍 21 🔁 8 💬 1 📌 0

Hi Sebastian, could you also add me?:)

09.05.2025 05:34 👍 0 🔁 0 💬 1 📌 0

Thanks to my great collaborators: Long Chen, Elahe Arani and Oleg Sinavski

And thanks to Wayve for the great time during my internship and all the support.

08.05.2025 15:25 👍 1 🔁 0 💬 0 📌 0
[CVPR25, Spotl.] SimLingo: Vision-Only Closed-Loop Autonomous Driving with Language-Action Alignment
[CVPR25, Spotl.] SimLingo: Vision-Only Closed-Loop Autonomous Driving with Language-Action Alignment YouTube video by Katrin Renz

Check out the paper or full video for more details.

Full Video: www.youtube.com/watch?v=Mpbn...

08.05.2025 15:25 👍 1 🔁 0 💬 1 📌 0

⛳️We introduce a DREAMING flag with which the model can differentiate between driving mode, where only safe instructions are executed, and dreaming mode, where the actions for all instructions are predicted.

08.05.2025 15:25 👍 1 🔁 0 💬 1 📌 0

💭Action Dreaming: A safe way to test Language-Action alignment. We test not only expert behaviour but a wide variety of possible actions (e.g., speed changes, driving towards a specific object, lane change manoeuvres).

08.05.2025 15:25 👍 1 🔁 0 💬 1 📌 0

🫱🏻‍🫲🏽 Language-Action Alignment: On normal driving datasets, the action can often be inferred from the visual cue alone. Our new dataset includes multiple different actions for each sample, together with the language instruction. This forces the model to listen to the instruction.

08.05.2025 15:25 👍 1 🔁 0 💬 1 📌 0

🥇State-of-the-art: SimLingo is the first VLA model on the CARLA Leaderboard, achieving state-of-the-art driving performance on multiple benchmarks.

08.05.2025 15:25 👍 1 🔁 0 💬 1 📌 0
Video thumbnail

📣 Excited to share our #CVPR2025 Spotlight paper and my internship project @wayve: SimLingo.
A Vision-Language-Action (VLA) model that achieves state-of-the-art driving performance with language capabilities.

Code: github.com/RenzKa/simli...
Paper: arxiv.org/abs/2503.09594

08.05.2025 15:25 👍 25 🔁 9 💬 1 📌 0
Video thumbnail

📢 New paper CVPR 25!
Can meshes capture fuzzy geometry? Volumetric Surfaces uses adaptive textured shells to model hair, fur without the splatting / volume overhead. It’s fast, looks great, and runs in real time even on budget phones.
🔗 autonomousvision.github.io/volsurfs/
📄 arxiv.org/pdf/2409.02482

05.05.2025 13:00 👍 32 🔁 21 💬 1 📌 1

In my first research project I was super excited about getting any stars on GitHub. Now having a project with 1k stars feels unreal🤯 wouldn’t have been possible without the tremendous effort of @chonghaosima.bsky.social during the main project and afterwards with the challenge 🙏🏼

25.03.2025 07:12 👍 2 🔁 0 💬 0 📌 0
Video thumbnail

🆕 The CARLA Route Generator is a new Python application that provides a GUI for creating and editing routes, as well as defining scenarios within the CARLA simulator. It can also be used in conjunction with CARLA Leaderboard 2.0!
github.com/autonomousvi...

20.03.2025 07:22 👍 10 🔁 3 💬 0 📌 0

Come join us!

19.03.2025 07:51 👍 6 🔁 2 💬 0 📌 0
Post image

We have just released a new tool to create custom routes and insert scenarios for the CARLA Leaderboard 2.0. The tool was written by our great research assistant Jens. 🥳

Github: github.com/autonomousvi...

#CARLA #AutonomousDriving

20.03.2025 01:44 👍 7 🔁 1 💬 0 📌 0
Preview
LeRobot goes to driving school: World’s largest open-source self-driving dataset We’re on a journey to advance and democratize artificial intelligence through open source and open science.

Learning to Drive (L2D): the most exciting dataset release of the year by @hf.co & @yaak-ai.bsky.social
- 5K hours of driving data from 3 cameras
- lots of other synchronized data: GPU, IMU, CAN, actions, task descriptions
- 90TB of data
- LeRobot data formatting
huggingface.co/blog/lerobot...

11.03.2025 21:11 👍 23 🔁 5 💬 1 📌 1