Jesse Farebrother's Avatar

Jesse Farebrother

@brosa.ca

Ph.D. Student studying AI & decision making at Mila / McGill University. Currently at FAIR @ Meta. Previously Google DeepMind & Google Brain. https://brosa.ca

660
Followers
140
Following
14
Posts
17.11.2024
Joined
Posts Following

Latest posts by Jesse Farebrother @brosa.ca

Preview
GitHub - JesseFarebro/xtils: A collection of utilities for machine learning experiments. A collection of utilities for machine learning experiments. - JesseFarebro/xtils

Sadly the debugger is missing many nice-to-have features. I wrote a much improved debugger for Jax you can install here: github.com/JesseFarebro... which will probably give you a much better experience πŸ™‚

06.05.2025 00:25 πŸ‘ 8 πŸ” 1 πŸ’¬ 1 πŸ“Œ 0
Video thumbnail

3) At the World Models workshop, I'll be giving an oral on a new approach to learning a generative model of successor states through flow matching / diffusion.

πŸ“Peridot 201 & 206
πŸ“…Mon 28 Apr 5 PM - 5:30 PM

Check out the paper on arXiv: arxiv.org/abs/2503.09817 with a full thread coming soon πŸ™‚.

22.04.2025 21:26 πŸ‘ 1 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0
Non-Adversarial Inverse Reinforcement Learning via Successor Feature Matching Non-Adversarial Inverse Reinforcement Learning via Successor Feature Matching

2) Arnav & I will be around presenting our work on successor feature matching at:

πŸ“Hall 3 + Hall 2B #572
πŸ“…Sat 26 Apr 10 AM β€” 12:30 PM

Check out the website and paper: arnavkj1995.github.io/SFM/

22.04.2025 21:26 πŸ‘ 2 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0
Preview
Meta Motivo A first-of-its-kind behavioral foundation model to control a virtual physics-based humanoid agent for a wide range of whole-body tasks.

1) Many of the team members will be around the poster for Meta Motivo at:

πŸ“Hall 3 + Hall 2B #555
πŸ“… Thu 24 Apr 10 AM β€” 12:30 PM

Don't forget to check out the demo for yourself: metamotivo.metademolab.com and the paper now on arXiv: arxiv.org/abs/2504.11054

22.04.2025 21:26 πŸ‘ 0 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0
Video thumbnail

Excited to be in Singapore πŸ‡ΈπŸ‡¬ for #ICLR25! We’ll present 1) Meta Motivo, a first-of-its-kind model enabling zero-shot humanoid control for any reward, goal, or motion; 2) imitation via successor feature matching; 3) flow matching for generative TD learning of future experience.

22.04.2025 21:26 πŸ‘ 8 πŸ” 1 πŸ’¬ 1 πŸ“Œ 0

As an undergraduate student, taking Rich's course at @ualberta.bsky.social was a defining moment in my academic journey. His work and teachings have shaped the paths of countless researchers, including my own. Congrats, Rich & Andy!

05.03.2025 17:43 πŸ‘ 7 πŸ” 1 πŸ’¬ 0 πŸ“Œ 0
Preview
Stop Regressing: Training Value Functions via Classification for... Value functions are an essential component in deep reinforcement learning (RL), that are typically trained via mean squared error regression to match bootstrapped target values. However, scaling...

And for categorical representations, @brosa.ca openreview.net/forum?id=dVp... is pretty much canon at this point!

31.01.2025 06:16 πŸ‘ 7 πŸ” 1 πŸ’¬ 2 πŸ“Œ 0
Preview
Foundations of Multivariate Distributional Reinforcement Learning In reinforcement learning (RL), the consideration of multivariate reward signals has led to fundamental advancements in multi-objective decision-making, transfer learning, and representation learning....

This paper by @harwiltz.bsky.social and colleagues is one of the high points of #NeurIPS2024 for me so far:
arxiv.org/abs/2409.00328

12.12.2024 21:40 πŸ‘ 16 πŸ” 3 πŸ’¬ 2 πŸ“Œ 1

Come by West Ballroom A-D #6404 4:30 PM–7:30 PM tonight (Thursday) to talk to @jessefarebro.bsky.social and me about CALE!

12.12.2024 18:32 πŸ‘ 9 πŸ” 2 πŸ’¬ 0 πŸ“Œ 1

West Ballroom A-D #6704
πŸ“…12 Dec 11:00 AM β€” 1:00 PM

bsky.app/profile/harw...

09.12.2024 15:59 πŸ‘ 0 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

West Ballroom A-D #6404
πŸ“…12 Dec 4:30 PM β€” 7:30 PM

bsky.app/profile/pcas...

09.12.2024 15:59 πŸ‘ 0 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

Landed in Vancouver for #NeurIPS. We’ll be presenting a couple of papers with @pcastr.bsky.social and @harwiltz.bsky.social πŸ§΅πŸ‘‡ and stay tuned later in the week for an exciting announcement!

09.12.2024 15:59 πŸ‘ 6 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0
Video thumbnail

How can you 0-shot transfer predictions of long-term performance across reward functions *and* risk-sensitive utilities?

We can do this via Distributional Successor Features. Our recent work introduces the 1st tractable & provably convergent algos for learning DSFs.

#NeurIPS2024 #6704
12 Dec, 11-2

09.12.2024 15:30 πŸ‘ 16 πŸ” 4 πŸ’¬ 3 πŸ“Œ 2
Post image

Distributional SFs: enable 0-shot generalization of return *distribution* functions across a finite-dimensional reward function class

"Foundations of Multivariate Distributional Reinforcement Learning"

#NeurIPS2024 #6704
12 Dec 11am-2pm
neurips.cc/virtual/2024...

Wiltzer Farebrother Rowland

08.12.2024 22:43 πŸ‘ 5 πŸ” 2 πŸ’¬ 0 πŸ“Œ 0
The Atari 2600 console and an image of some of the games used in the ALE

The Atari 2600 console and an image of some of the games used in the ALE

πŸ“’ In Defense Of Atari πŸ“’

New blog post in which I argue why the ALE is still a valuable resource for RL research!

psc-g.github.io/posts/resear...

04.12.2024 00:07 πŸ‘ 41 πŸ” 8 πŸ’¬ 1 πŸ“Œ 2

πŸ‘‹

28.11.2024 06:29 πŸ‘ 0 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0
Preview
Stop Regressing: Training Value Functions via Classification for... Value functions are an essential component in deep reinforcement learning (RL), that are typically trained via mean squared error regression to match bootstrapped target values. However, scaling...

This is how our work on classification vs. regression started, it was folk knowledge in many circles but I had overestimated the extent to which the wider community knew this. openreview.net/forum?id=dVp...

27.11.2024 13:38 πŸ‘ 20 πŸ” 2 πŸ’¬ 1 πŸ“Œ 0

Jax RL work as in writing envs in Jax? If so, each has their place, eg, look at Mujoco in envpool vs MJX, there’s a clear tradeoff point as you increase the number of environments.

24.11.2024 10:21 πŸ‘ 0 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

πŸ‘‹

24.11.2024 09:49 πŸ‘ 1 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0
Picture of Trinity College Dublin's campanile in the front square of the college. Writing overlayed to indicate that the RLDM conference will take place from the 11th to the 14th of June, 2025. Logos for Trinity College and the RLDM conference at the bottom of the image.

Picture of Trinity College Dublin's campanile in the front square of the college. Writing overlayed to indicate that the RLDM conference will take place from the 11th to the 14th of June, 2025. Logos for Trinity College and the RLDM conference at the bottom of the image.

Come join us at Trinity College Dublin for the 6th Multidisciplinary Conference on Reinforcement Learning and Decision Making (RLDM2025)

Abstract deadline January 15: rldm.org/submit

15.11.2024 10:53 πŸ‘ 72 πŸ” 38 πŸ’¬ 1 πŸ“Œ 2

πŸ‘‹ can I be added as well?

24.11.2024 09:44 πŸ‘ 0 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0