RLVG Sessions @ RLC 2025 - YouTube
Recorded sessions of the Reinforcement Learning and Video Games (RLVG) workshop at the Reinforcement Learning Conference (RLC), 2025. Recorded on August 5, 2025
Missed our workshop at @rl-conference.bsky.social?
Watch all keynote presentations and the panel discussion on YouTube:
youtube.com/playlist?lis...
Big thanks to our awesome speakers: @togelius.bsky.social, @jmac-ai.bsky.social, Roberta Raileanu, Michael Bowling and @pcastr.bsky.social!
25.09.2025 09:53
Exciting news - early bird registration is now open for #RLDM2025!
Register now: forms.gle/QZS1GkZhYGRF...
Register now to save €100 on your ticket. Early bird prices are only available until 1st April.
11.02.2025 14:56
Doing some slides on reinforcement learning, and passing through k-armed bandits along the way.
The phrase "Optimism under uncertainty is a simple, effective, and in some cases optimal way to minimize regret" hits different these days.
06.02.2025 23:42
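For readers who have not met the phrase before, here is a minimal sketch of "optimism under uncertainty" on a k-armed bandit, using the UCB1 rule. This is an illustration, not taken from the slides mentioned in the post: the function name `ucb1`, the Bernoulli arms, and the defaults for `steps`, `c`, and `seed` are all assumptions.

```python
import math
import random


def ucb1(arm_means, steps=2000, c=2.0, seed=0):
    """Run UCB1 on Bernoulli arms (hypothetical setup); return pull counts and regret."""
    rng = random.Random(seed)
    k = len(arm_means)
    counts = [0] * k      # number of pulls per arm
    values = [0.0] * k    # running mean reward per arm
    best = max(arm_means)
    regret = 0.0
    for t in range(1, steps + 1):
        if t <= k:
            arm = t - 1  # pull every arm once so each count is nonzero
        else:
            # Optimism: score each arm by its estimate plus an uncertainty bonus
            # that shrinks as the arm is pulled more often.
            arm = max(
                range(k),
                key=lambda a: values[a] + math.sqrt(c * math.log(t) / counts[a]),
            )
        reward = 1.0 if rng.random() < arm_means[arm] else 0.0
        counts[arm] += 1
        values[arm] += (reward - values[arm]) / counts[arm]  # incremental mean
        regret += best - arm_means[arm]  # expected regret of this pull
    return counts, regret
```

Calling `ucb1([0.2, 0.5, 0.8])` returns the per-arm pull counts and the accumulated expected regret; rarely tried arms keep a large bonus, so they are revisited until the estimates rule them out.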
If you like Reinforcement Learning, I created a feed that tracks recent popular posts on the topic:
#RL #RLHF #ReinforcementLearning
06.02.2025 21:00
I used the DeepSeek Reinforcement Learning GRPO algorithm to train a triangle creature. The edges are "muscles": red=extend, blue=contract; vertices are "mass pumps": red=grow, blue=shrink. Reward = DeltaX - |DeltaY|. Full video incoming: Subscribe at www.youtube.com/@MihaiNicaMath
01.02.2025 22:05
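The reward in the post above, Reward = DeltaX - |DeltaY|, can be written out as a small sketch. The group-relative normalization shown next to it is the commonly described GRPO advantage (each rollout's reward minus the group mean, divided by the group standard deviation). The function names, the `eps` term, and the use of the population standard deviation are assumptions; the post does not show the author's actual training code.

```python
from statistics import mean, pstdev


def locomotion_reward(dx, dy):
    """Reward = DeltaX - |DeltaY|: forward progress minus sideways drift."""
    return dx - abs(dy)


def group_relative_advantages(rewards, eps=1e-8):
    """GRPO-style advantages: normalize rewards within one group of rollouts.

    eps (an assumption) avoids division by zero when all rewards are equal.
    """
    mu = mean(rewards)
    sigma = pstdev(rewards)
    return [(r - mu) / (sigma + eps) for r in rewards]
```

For example, a rollout that moves 3 units forward and drifts 1 unit sideways gets `locomotion_reward(3.0, -1.0) == 2.0`, and within a group the rollouts with above-average reward receive positive advantages.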
A gazelle calf struggles to its feet minutes after being born. Half an hour later it is running at 20 miles per hour.
1/ I was looking through Sutton and Barto's "Reinforcement Learning" book and it had this remarkable claim, so I Googled it. Can you guess what the response was? [little 🧵]
29.12.2024 15:30
A New Social
We build bridges, not walls.
Excited to announce that I'm teaming up with @quillmatiq.com on a new non-profit for the open social web, across protocols, with Bridgy Fed as its first main project! Introducing A New Social.
17.12.2024 17:28
Christmas came early
Our new textbook on multi-agent reinforcement learning with @mitpress.bsky.social is out NOW!
The book is available from MIT Press (mitpress.mit.edu/978026204937...) or your bookstore nearby.
Why you should be interested, what you get, and more, all in a 🧵
17.12.2024 14:32
Data Science Environment Away day going very well indeed - after a morning of presentations a well deserved lunch break!
16.12.2024 13:06
We did it! 1st use of VR in teaching at @lancasteruni.bsky.social, taking our students to an underwater world. Includes data collected from our lovely students to allow us to assess its effectiveness. Thanks to The Hydrous for providing the underwater coral reef experience FREE thehydro.us
02.12.2024 16:17
Be great to link up with any Lancaster University people on this site @bsky.app
03.12.2024 15:53
Likewise!
17.12.2024 13:57
Reinforcement Learning: An Overview
This manuscript gives a big-picture, up-to-date overview of the field of (deep) reinforcement learning and sequential decision making, covering value-based RL, policy-gradient methods, model-based met...
An updated intro to reinforcement learning by Kevin Murphy: arxiv.org/abs/2412.05265! Like their books, it covers a lot and is quite up to date with modern approaches. It also is pretty unique in coverage, I don't think a lot of this is synthesized anywhere else yet
09.12.2024 14:27
Interviewing Finbarr Timbers on the "We are So Back" Era of Reinforcement Learning
Listen now | Interconnects interview #11. An overview on the past, present, and future of RL.
This is one I've wanted to do for a while: ask why RL has been continually underestimated in the last 2 years.
https://buff.ly/3Vqrbqj
05.12.2024 15:40
Reinforcement Learning
Check out my article on Reinforcement Learning for the Open Encyclopedia of Cognitive Science! oecs.mit.edu/pub/k2ek981x...
@oecs-bot.bsky.social
04.12.2024 18:57
Just starting to build out my home office to have a nice space to push through the end of my PhD. What's your number one item to have in it? Can be anything from plants to monitors!
🧪
#phdlife
#academia
04.12.2024 10:03
The deep goal of bluesky is to decentralize the social internet so that every individual controls their experience of it rather than having it be controlled by 5 random billionaires. Everyone thinks they signed up for a demuskified twitter...we actually signed up for an exciting and bizarre experiment.
03.12.2024 16:05
How come I didn't know about this BeNeRL seminar series? It focuses on practical RL and seems really great!
www.benerl.org/seminar-seri...
I would have loved to hear Benjamin Eysenbach, Chris Lu and Edward Hu... Next one is on December 19th.
03.12.2024 16:11
Want to learn / teach RL?
Check out new book draft:
Reinforcement Learning - Foundations
sites.google.com/view/rlfound...
W/ Shie Mannor & Yishay Mansour
This is a rigorous first course in RL, based on our teaching at TAU CS and Technion ECE.
25.11.2024 12:08
Streaming Deep Reinforcement Learning Finally Works
Natural intelligence processes experience as a continuous stream, sensing, acting, and learning moment-by-moment in real time. Streaming learning, the modus operandi of classic reinforcement learning ...
Streaming Deep Reinforcement Learning Finally Works, by M. Elsayed, G. Vasan, A. R. Mahmood, is one of those papers I wish I had written.
This paper seems to allow us to do RL with NNs as it should have always been done. Everyone should read it!
arxiv.org/abs/2410.14606
27.11.2024 23:09
Adding my love letter to arxiv.org/pdf/2304.01315
Empirical Design in Reinforcement Learning, by Andrew Patterson, Samuel Neumann, Martha White, Adam White
JMLR 25 (2024) 1-63
#ReinforcementLearning
These aren't the heroes we deserve, but they are the heroes we need.
23.11.2024 13:40
Does everyone in your community agree on some folk knowledge that isn't published anywhere? Put it in a paper! It's a pretty valuable contribution.
26.11.2024 22:31
Learn Git Branching
An interactive Git visualization tool to educate and challenge!
On one of the first projects I supervised in my PhD, a student repeatedly ignored suggestions to commit and then accidentally deleted the project at the end of the semester. Please use git! There are even "fun" games you can use to learn it:
learngitbranching.js.org
27.11.2024 15:40