Also not excited for the first occurrence of this on OpenReview
Prime example (found on the other platform) of why we should be careful with reward specification / alignment / guardrails / <enter your favorite AI safety topic here>.
How much of this is human guided, and how much is just optimizing the "get PR merged" reward?
github.com/matplotlib/m...
Take a look at our new paper!
We improve sample efficiency and performance in off-policy RL by prioritizing experience with the semantic knowledge of a pre-trained VLM, and not even a very large one!
Glad for the opportunity to work with @eladsharony.bsky.social and @tomjur.bsky.social !
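The thread above describes the idea only at a high level, so here is a minimal sketch of what "prioritizing experience with a VLM" could look like: a replay buffer whose sampling weights come from a semantic relevance score assigned at insertion time. Everything below is an assumption for illustration — `vlm_relevance` and `VLMPrioritizedReplay` are hypothetical stand-ins, not the paper's actual method or API.

```python
import random

def vlm_relevance(observation):
    # Hypothetical stand-in: a real implementation would score the
    # observation's task relevance with a (small) pre-trained VLM.
    return 1.0 + float(observation)

class VLMPrioritizedReplay:
    """Replay buffer that samples transitions proportionally to a
    semantic priority assigned once, when the transition is stored."""

    def __init__(self):
        self.transitions = []
        self.priorities = []

    def add(self, transition, observation):
        self.transitions.append(transition)
        self.priorities.append(vlm_relevance(observation))

    def sample(self, k):
        # Higher-priority (more semantically relevant) transitions are
        # drawn more often, in the spirit of prioritized experience replay.
        return random.choices(self.transitions, weights=self.priorities, k=k)

# Toy usage: three transitions with increasing "relevance".
buffer = VLMPrioritizedReplay()
for i, t in enumerate(["t0", "t1", "t2"]):
    buffer.add(t, observation=i)
batch = buffer.sample(4)
```

Scoring once at insertion (rather than re-querying the VLM at every sample) keeps the VLM cost amortized, which is one plausible reason a small VLM suffices.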
Are there any good robotics and/or RL podcasts still running in 2025?
I used to enjoy The Robot Brains by Pieter Abbeel and TalkRL by @robinchauhan.bsky.social, but I'm open to different styles too!
I find this idea really neat - VLMs are great at describing scenes, but LLMs are better reasoners, so let's use text as an interim representation.
Kind of reminiscent of the bitter lesson, only on a more "local" scale
arxiv.org/abs/2503.15108
Check out our new #ICLR2025 paper: EC-Diffuser leverages a novel Transformer-based diffusion denoiser to learn goal-conditioned multi-object manipulation policies from pixels!
Paper: www.arxiv.org/abs/2412.18907
Project page: sites.google.com/view/ec-diff...
Code: github.com/carl-qi/EC-D...
Also probably an issue of salience bias - you hear about virtually every plane crash and a lot of shootings, but road fatalities rarely make the news.
If you're interested in our take on addressing inverse RL in large state spaces, come meet @filippo_lazzati and @alberto_metelli at poster session 5 at #NeurIPS2024 today (paper: arxiv.org/abs/2406.03812).
That speaks to the lack of good, standardized benchmarks for RL, more than anything else.
(Disclaimer: haven't read the papers yet)
I agree completely. I just think the challenge will remain policy and public perception regarding public transit, same as it is today - just amplified by the effort that's been put into the technology by car manufacturers.
This is actually something that worries me - how can we ensure that all the progress in autonomous driving doesn't just put a lot more single-person cars on the road? And how do we convince people that this isn't an alternative to transit, and that we need to keep investing in it?
Want to learn / teach RL?
Check out new book draft:
Reinforcement Learning - Foundations
sites.google.com/view/rlfound...
W/ Shie Mannor & Yishay Mansour
This is a rigorous first course in RL, based on our teaching at TAU CS and Technion ECE.
Been thinking about building a replacement for the arXiv daily email for a while, this looks like it might save me the trouble :)
Just out of curiosity: what's the action space here?
Let's use the real data to improve the simulators and get better massive, procedurally generated data!
Some papers really feel like a glimpse into the future!
This one also serves as a powerful reminder that a lot of what we're focused on in the AI + robotics space is constrained by the hardware we have.
arxiv.org/abs/2411.11192