Marlos C. Machado (@marloscmachado)

Government of Canada launches new initiative to recruit world-leading researchers - Canada.ca
Canada will invest $1.7 billion to attract top global talent

"Canada Impact+ Research Chairs program—a new $1 billion investment that will provide Canadian institutions the opportunity to recruit top-tier international researchers with expertise in key areas ..."

www.canada.ca/en/innovatio...

10.12.2025 00:24 👍 2 🔁 0 💬 0 📌 0

Thrilled to start 2026 as faculty in Psych & CS
@ualberta.bsky.social + Amii.ca Fellow! 🥳 Recruiting students to develop theories of cognition in natural & artificial systems 🤖💭🧠. Find me at #NeurIPS2025 workshops (speaking coginterp.github.io/neurips2025 & organising @dataonbrainmind.bsky.social)

06.12.2025 19:26 👍 103 🔁 27 💬 4 📌 1

The Computing Science Dept. at the University of Alberta has multiple faculty job openings. Please share this broadly. We have a great environment!

- CS Theory: tinyurl.com/zrh9mk69
- Network/Cyber Security: tinyurl.com/renxazzy
- Robotics/CV/Graphics: tinyurl.com/ypcsfbff

27.11.2025 18:00 👍 9 🔁 2 💬 0 📌 0

The Department of Computing Science at the University of Alberta has an opening for another tenure-track faculty position in robotics. Please spread the word.

I can attest to how awesome our department and @amiithinks.bsky.social are!

(Official job posting coming soon.)

20.11.2025 14:54 👍 3 🔁 0 💬 0 📌 0

Ratatouille (2007)

07.10.2025 21:58 👍 6 🔁 0 💬 0 📌 0

This paper has now been accepted @neuripsconf.bsky.social !

Huge congratulations, Hon Tik (Rick) Tse and Siddarth Chandrasekar.

18.09.2025 22:04 👍 8 🔁 3 💬 0 📌 0

2/2: “Conquerors live in dread of the day when they are shown to be, not superior, but simply lucky.”

― N.K. Jemisin, The Stone Sky

27.08.2025 02:20 👍 2 🔁 0 💬 0 📌 0

1/2: “But there are none so frightened, or so strange in their fear, as conquerors. They conjure phantoms endlessly, terrified that their victims will someday do back what was done to them—even if, in truth, their victims couldn’t care less about such pettiness and have moved on.”

27.08.2025 02:20 👍 2 🔁 1 💬 1 📌 0

RLC 2025 - Outstanding Paper Awards

Excited to announce the RLC best paper awards! Like last year, we wanted to highlight the many excellent ways you can do research.
rl-conference.cc/RLC2025Award...

07.08.2025 20:30 👍 10 🔁 6 💬 0 📌 1

* RLC Journal to Conference Track:*
(Originally published at TMLR)

- Deep RL track (Thu): AGaLiTe: Approximate Gated Linear Transformers for Online Reinforcement Learning by S. Pramanik

04.08.2025 15:49 👍 1 🔁 0 💬 0 📌 0

* RLC Full Papers:*
(These are great papers!)

- Deep RL track (Thu): Deep Reinforcement Learning with Gradient Eligibility Traces by E. Elelimy
- Foundations track (Fri): An Analysis of Action-Value Temporal-Difference Methods That Learn State Values by B. Daley and P. Nagarajan

04.08.2025 15:49 👍 1 🔁 0 💬 1 📌 0

* RLC Workshop Papers (2/2):*
Inductive Biases in RL
sites.google.com/view/ibrl-wo...

- A Study of Value-Aware Eigenoptions by H. Kotamreddy

04.08.2025 15:49 👍 0 🔁 0 💬 1 📌 0

Workshop on Reinforcement Learning Beyond Rewards: Ingredients for Developing Generalist Agents

* RLC Workshop Papers (1/2):*
RL Beyond Rewards
rlbrew2-workshop.github.io

- Tue 11:59 (spotlight talk): Towards An Option Basis To Optimize All Rewards by S. Chandrasekar
- The World Is Bigger: A Computationally-Embedded Perspective on the Big World Hypothesis by A. Lewandowski

04.08.2025 15:49 👍 0 🔁 0 💬 1 📌 0

Here's what our group will be presenting at RLC'25.

* Invited Talks at Workshops:*
Tue 10:00: The Causal RL Workshop sites.google.com/uci.edu/crlw...
Tue 14:30: Inductive Biases in RL (IBRL) Workshop
sites.google.com/view/ibrl-wo...
Tue 15:00: Panel Discussion at IBRL Workshop

04.08.2025 15:49 👍 0 🔁 0 💬 1 📌 0

RLC starts tomorrow here in Edmonton. I couldn't be more excited! It has a fantastic roster of speakers, great papers, and workshops. And this time, it is in Edmonton 😁

@rl-conference.bsky.social is my favourite conference, and no, it is not because I am one of its organizers this year.

04.08.2025 15:27 👍 12 🔁 3 💬 0 📌 0

This was a great long-term effort from @martinklissarov.bsky.social, Akhil Bagaria, and @ray-luo.bsky.social, and it led to a great overview of the ideas behind leveraging temporal abstractions in AI.

If anything, I think this is a very useful resource for anyone interested in this field!

27.06.2025 20:57 👍 6 🔁 1 💬 0 📌 0

To align better with workshop acceptance dates, RLC is extending its early registration deadline to June 23rd!

30.05.2025 15:02 👍 8 🔁 3 💬 0 📌 1

9/9: I genuinely think AgarCL might unlock new research avenues in CRL, including loss of plasticity, exploration, representation learning, and more. I do hope you consider using it.

Repo: github.com/machado-rese...
Website: agarcl.github.io
Preprint: arxiv.org/abs/2505.18347

27.05.2025 03:48 👍 6 🔁 0 💬 0 📌 0

8/9: Well, if you are still interested, you should probably consider reading the paper, but it is interesting to see that most of the agents we considered were able to reach human-level performance only in the most benign settings. And we did use a lot of computing here!

27.05.2025 03:48 👍 4 🔁 1 💬 1 📌 0

7/9: Through mini-games, we tried to quantify and isolate some of the challenges AgarCL poses, including partial observability, non-stationarity, exploration, hyperparameter tuning, and the non-episodic nature of the environment (so easy to forget!). Where do our agents "break"?

27.05.2025 03:48 👍 1 🔁 0 💬 1 📌 0

6/9: Importantly, this is a challenge problem that forces us to deal with many problems we often avoid, such as hyperparameter sweeps and exploration in CRL.

It is perhaps no surprise that the classic algorithms we considered couldn't really make much progress in the full game.

27.05.2025 03:48 👍 2 🔁 0 💬 1 📌 0

5/9: Over time, even the agent's observation will change, as the camera needs to zoom out to accommodate more agents; not to mention that there are other agents in the environment. I'm very excited about AgarCL because I think it allows us to ask questions we couldn't before.

27.05.2025 03:48 👍 2 🔁 0 💬 1 📌 0

4/9: AgarCL is an adaptation of agar.io, a game with simple mechanics that lead to complex interactions. It's non-episodic, and a key aspect is that the agent dynamics change as it accumulates mass: It becomes slower, gains new affordances, sheds more mass, etc.

27.05.2025 03:48 👍 3 🔁 0 💬 1 📌 1

3/9: AgarCL is our attempt at an environment with the complexity of a "big world" but in a smooth way, where the "laws of physics" don't change. It has complex dynamics, partial observability, non-stationarity, pixel-based observations, and a hybrid action space.

27.05.2025 03:48 👍 3 🔁 0 💬 1 📌 0

2/9: CRL is often motivated by the idea that the world is bigger than the agent, requiring tracking. We usually simulate this with non-stationarity by cycling through classic episodic problems. I've written papers like this, but it feels too artificial.

arxiv.org/abs/2303.07507

27.05.2025 03:48 👍 3 🔁 0 💬 1 📌 0

📢 I'm very excited to release AgarCL, a new evaluation platform for research in continual reinforcement learning‼️

Repo: github.com/machado-rese...
Website: agarcl.github.io
Preprint: arxiv.org/abs/2505.18347

Details below 👇

27.05.2025 03:48 👍 29 🔁 8 💬 1 📌 0

This is great, thanks for sharing! We will read your paper carefully.

25.05.2025 17:58 👍 1 🔁 0 💬 0 📌 0

Reward-Aware Proto-Representations in Reinforcement Learning
In recent years, the successor representation (SR) has attracted increasing attention in reinforcement learning (RL), and it has been used to address some of its key challenges, such as exploration, c...

7/7: We just scratched the surface here, but I think this could be the beginning of something interesting that might be relevant to research questions ranging from safety in RL all the way to cognitive science.

Again, here's the preprint by Tse et al.: arxiv.org/abs/2505.16217

24.05.2025 15:23 👍 9 🔁 0 💬 0 📌 0

6/7: We also show that, when compared to the SR, the DR gives rise to qualitatively different behavior in all sorts of tasks, such as reward shaping, exploration, & option discovery. Similar to what we did w/ STOMP, sometimes there's value in being aware of the reward function 😁

24.05.2025 15:23 👍 4 🔁 0 💬 1 📌 0

5/7: We lay some of the theoretical foundation underlying the DR: we establish general TD learning and dynamic programming updates, connect the DR to the SR, and extend the DR to the function approximation (FA) setting, similar to how SFs do it for the SR.

24.05.2025 15:23 👍 4 🔁 0 💬 1 📌 0
