Regularized self-play RL in grounded simulation effectively adapts driving policies to completely new cities. 🗽 -> 🗼
Really enjoyed collaborating on this work, led by Zilin and Saeed! Check out Zilin's post below for a great summary
🧵: x.com/nirhso/statu...
Paper: arxiv.org/abs/2602.15891
20.02.2026 20:09
Excited to share REPPO, a new on-policy RL agent!
TL;DR: Replace PPO with REPPO for fewer hyperparameter headaches and more robust training.
REPPO, led by @cvoelcker.bsky.social, will be presented at ICLR 2026. How does it work? 🧵
13.02.2026 19:28
Congratulations Andreas!
04.02.2026 13:20
haha, thought it was weird that the integers were defined as float and thought it was about cache line optimizations.
13.01.2026 23:08
Tübingen AI Research Building, where the Cluster of Excellence "Machine Learning" is based.
📢 We're hiring: W3-Professorship in Machine Learning in Physics @unituebingen.bsky.social! What we're looking for: Established research profile in a core area of #physics (condensed matter, quantum or theoretical particle physics), strong track record in research questions related to #ML and/or #AI.
15.12.2025 09:23
is faster?
13.01.2026 07:01
PufferDrive 2.0 release
YouTube video by Daphne Cornelisse
What if you could train agents on a decade of driving experience in under an hour, on a single GPU?
Excited to share PufferDrive 2.0: A fast, friendly driving simulator with RL training via PufferLib at 300K steps/sec
youtu.be/LfQ324R-cbE?...
30.12.2025 16:12
Our new E2E driving method, TransFuser v6, is out on ArXiv.
It outperforms all other methods on CARLA by a wide margin, 95 DS on Bench2Drive!
We show that minimizing the asymmetry between data annotator and policy is key for strong IL results.
Code, models, and paper:
ln2697.github.io/lead/
27.12.2025 01:42
The Future of Focused Research Organizations:
Working with Convergent on the NSF Tech Labs Initiative
This article is from people who have thought about FROs for years and have experience with what works and what doesn't.
I have always appreciated the restraint in defining the niche of FROs in the broader ecosystem; it comes out clearly in this piece.
www.essentialtechnology.blog/p/the-future...
17.12.2025 13:48
Unfortunately it appears much of the academic community has reconstituted itself on LinkedIn
15.12.2025 01:45
I am so happy and excited that this project got funded!
11.12.2025 18:52
true, you could try to collect a dataset with good coverage by running online RL first and then do offline RL in future iterations to save sim compute
09.12.2025 20:00
This is not bringing back offline RL (but online RL). The purpose of closed-loop training here is to gather data in OOD states with the model.
Stitching doesn't work if your base dataset doesn't cover the state space well, which is the case in autonomous driving.
09.12.2025 19:41
08.12.2025 15:05
Tired Europe: Let's do tons of AI regulations
Wired Europe: Let's do tons of AI open source
#aiPULSE2025
05.12.2025 20:00
This essay, roughly on dual use, has been haunting me for a while now:
dl.acm.org/doi/pdf/10.1...
03.12.2025 08:06
Excited to be at #Neurips2025 this week to present our paper "Monoculture or Multiplicity: Which is it?", joint work with Moritz Hardt.
Paper #1000: openreview.net/pdf?id=DO5Lt...
Wed, Dec 3, 2025 • 4:30 PM – 7:30 PM
Feel free to come by and reach out!
A short 🧵.
02.12.2025 15:55
Attending #Neurips2025? Get your personalized Scholar Inbox conference program now to easily navigate the poster sessions and find what you are looking for:
www.scholar-inbox.com/conference/n...
02.12.2025 06:37
Scholar Inbox for NeurIPS is live now.
01.12.2025 19:44
Preprint site arXiv is banning computer-science reviews: here's why
The repository is taking steps to tackle a surge in low quality, AI-generated content.
www.nature.com/articles/d41...
ArXiv banned surveys due to AI slop spam.
Now we need to wait for them to be peer-reviewed.
Bad development, we need to find better solutions to AI slop than banning unreviewed papers.
Getting a survey reviewed at a good journal can take over a year. :(
01.12.2025 14:36
Quick reminder about the EPFL PhD program deadline (EDIC) on Dec 15.
27.11.2025 10:14
No, this work focuses on IL.
I would personally be interested in whether RL models have similar failures, but it is much harder to do this type of analysis when the model predicts actions rather than waypoints. (You can't do it offline anymore.)
26.11.2025 11:18
Introducing TMLR Beyond PDF!
This is a new, HTML-based submission format for TMLR that supports interactive figures and videos, along with the usual LaTeX and images.
Thanks to TMLR Editors in Chief: Hugo Larochelle, @gautamkamath.com, Naila Murray, Nihar B. Shah, and Laurent Charlin!
25.11.2025 16:11
Congratulations to @cworthy.org on their announcement today!
Learn more about this wonderful FRO here: www.youtube.com/watch?v=DA-e...
24.11.2025 19:23
Apply - Interfolio
Come be our colleague in the robotics and embodied intelligence center at NYU!
Professor in Robotics / Embodied AI (Open Rank)
apply.interfolio.com/176977
Faculty Fellow in Robotics / Embodied AI
apply.interfolio.com/177077
18.11.2025 20:01