Reading group TODAY 3pm GMT (UK) time π
Lukas SchΓ€fer (Microsoft Research) will be joining us to present predictive inverse dynamics models (PIDMs) as an alternative to behaviour cloning in offline imitation learning π‘
Paper: arxiv.org/abs/2601.21718
Join here: edinburgh-rl.github.io/reading-group
05.03.2026 10:47
π 3
π 0
π¬ 1
π 0
Reading group TODAY at 3pm UK time (GMT)!
Ruaridh Mon-Williams will be joining us to present some interesting recent work on emergent partner modelling π‘
Paper: arxiv.org/abs/2505.17323
Join here: edinburgh-rl.github.io/reading-group
19.02.2026 12:13
π 6
π 0
π¬ 0
π 0
Reading group TOMORROW 3-4pm UK!
Joe Marino (Google DeepMind) will present SIMA 2, a generalist embodied agent designed to operate across a wide range of 3D virtual worlds π
Join here: edinburgh-rl.github.io/reading-group
21.01.2026 18:41
π 8
π 4
π¬ 0
π 0
Reading group TODAY 2pm BST!
"Despite years of research in offline reinforcement learning, the field has failed to deliver major breakthroughs..."
Matthew Jackson and Jarek Liesen (Oxford) will present Unifloral - unified implementations and evaluations for offline RL: arxiv.org/abs/2504.11453
09.01.2026 11:53
π 3
π 1
π¬ 0
π 0
* 12:00 GMT+0, still in summer mode :D
20.11.2025 11:20
π 1
π 0
π¬ 0
π 0
Reading group today at 12:00 BST!
We are continuing our NeurIPS series. Felix Chalumeau from InstaDeep will present his oral paper on how inference strategies can boost performance in MARL. π
Papers:
arxiv.org/abs/2505.21236
Meeting:
bit.ly/488sD6n
20.11.2025 10:40
π 3
π 0
π¬ 1
π 0
Reading group today at 2pm BST!
We are starting our NeurIPS series with Sable and Oryx, sequence models for scalable multi-agent coordination from the RL Research Team at InstaDeep. π
Papers:
- Sable: bit.ly/3Lme7jH
- Oryx: bit.ly/47GJb4T
Meeting:
- bit.ly/3JoEbtU
06.11.2025 10:43
π 3
π 1
π¬ 0
π 0
RLJ Β· Learning to Explore in Diverse Reward Settings via Temporal-Difference-Error Maximization
Reinforcement Learning Journal (RLJ)
RL reading group in less than 2 hours at 15:00 BST π₯πΊ
Speaker: Sebastian Griesbach
Title: Learning to Explore in Diverse Reward Settings via Temporal-Difference-Error Maximization πΊοΈ (RLJ)
Paper: rlj.cs.umass.edu/2025/papers/...
Meeting link: teams.microsoft.com/l/meetup-joi...
01.10.2025 12:38
π 4
π 0
π¬ 0
π 0
π’ RL reading group TODAY @ 15:00 BST (in 2 hours!) π’
Speakers: Olya Mastikhina and Dhruv Sreenivas (University of Montreal & Mila - Quebec AI Institute)
Title: Optimistic critics can empower small actorsπ¦Έ
Details: edinburgh-rl.github.io/reading-group
19.09.2025 11:55
π 2
π 2
π¬ 0
π 0
UoE RL Reading Group
University of Edinburgh Reinforcement Learning Reading Group
π’ RL reading group Thursday @ 16:00 BST π’
Speaker: Alex Lewandowski
Title: The World Is Bigger: A Computationally-Embedded Perspective on the Big World Hypothesis π
Details: edinburgh-rl.github.io/reading-group
03.09.2025 11:32
π 6
π 3
π¬ 0
π 0
UoE RL Reading Group
University of Edinburgh Reinforcement Learning Reading Group
RL reading group TODAY @ 15:00 BST π₯
Speaker: Cam Allen (Postdoc, UC Berkeley)
Title: The Agent Must Choose the Problem Model
Details: edinburgh-rl.github.io/reading-group
24.07.2025 05:39
π 3
π 1
π¬ 0
π 0
We are super excited to kick things off again with Mattie Fellows (Postdoc @ FLAIR in Oxford) today 15:00 BST!
Paper: Simplifying Deep Temporal Difference Learning
Check out our website for full info edinburgh-rl.github.io/reading-grou...
10.07.2025 10:31
π 4
π 0
π¬ 0
π 2
We regularly host guest speakers, so please get in touch if you're interested in presenting your work π€β€οΈβπ₯
10.07.2025 10:30
π 1
π 0
π¬ 0
π 0
π€ RL & Agents Reading Group
Please add your details to join the RL & Agents reading group.
π Sign up: forms.gle/anHVSi97d6F6...
π³οΈ Propose papers: github.com/edinburgh-rl...
πΊ Past recordings: www.youtube.com/@RL_And_Agen...
10.07.2025 10:29
π 2
π 1
π¬ 0
π 0
Hello world! This is the RL & Agents Reading Group
We organise regular meetings to discuss recent papers in Reinforcement Learning (RL), Multi-Agent RL and related areas (open-ended learning, LLM agents, robotics, etc).
Meetings take place online and are open to everyone π
10.07.2025 10:29
π 37
π 12
π¬ 1
π 3