Theodore Tollet's Avatar

Theodore Tollet

@ttollet

Passionate about #ReinforcementLearning, aiming to improve people’s lives through technology. πŸ“š Researching RL for structured action spaces. πŸ” Reposts β‰  endorsements Any/All PhD Candidate Lancaster University

22
Followers
324
Following
2
Posts
24.11.2024
Joined
Posts Following

Latest posts by Theodore Tollet @ttollet

Preview
RLVG Sessions @ RLC 2025 - YouTube Recorded sessions of the Reinforcement Learning and Video Games (RLVG) workshop at the Reinforcement Learning Conference (RLC), 2025. Recorded on August 5, 2025

Missed our workshop at @rl-conference.bsky.social?
Watch all keynote presentations and the panel discussion on YouTube:

youtube.com/playlist?lis...

Big thanks to our awesome speakers: @togelius.bsky.social, @jmac-ai.bsky.social, Roberta Raileanu, Michael Bowling and @pcastr.bsky.social!

25.09.2025 09:53 πŸ‘ 14 πŸ” 4 πŸ’¬ 0 πŸ“Œ 3
Post image

Exciting news - early bird registration is now open for #RLDM2025!

πŸ”— Register now: forms.gle/QZS1GkZhYGRF...

Register now to save €100 on your ticket. Early bird prices are only available until 1st April.

11.02.2025 14:56 πŸ‘ 16 πŸ” 15 πŸ’¬ 2 πŸ“Œ 2

Doing some slides on reinforcement learning, and passing through k-armed bandits along the way.

The phrase "Optimism under uncertainty is a simple, effective, and in some cases optimal way to minimize regret" hits different these days.

06.02.2025 23:42 πŸ‘ 5 πŸ” 2 πŸ’¬ 0 πŸ“Œ 0

If you like Reinforcement Learning, I created a feed that tracks recent popular posts on the topic:
#RL #RLHF #ReinforcementLearning

06.02.2025 21:00 πŸ‘ 2 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0
Video thumbnail

I used the DeepSeek Reinforcement Learning GRPO algorithm to train a triangle creature. The edges are "muscles": red=extend, blue=contract; vertices are "mass pumps": red=grow, blue=shrink. Reward = DeltaX - |DeltaY|. Full video incoming: Subscribe at www.youtube.com/@MihaiNicaMath

01.02.2025 22:05 πŸ‘ 24 πŸ” 3 πŸ’¬ 2 πŸ“Œ 1
Preview
Bluesky’s science takeover: 70% of Nature poll respondents use platform Roughly 6,000 readers answered our poll, with many declaring that Bluesky was nicer, kinder and less antagonistic to science than X.

On the migration of the scientific community to Bluesky:
www.nature.com/articles/d41... πŸ§ͺ

27.01.2025 20:19 πŸ‘ 6636 πŸ” 1235 πŸ’¬ 98 πŸ“Œ 75
A gazelle calf struggles to its feet minutes after being born. Half an hour later it is running at 20 miles per hour.

A gazelle calf struggles to its feet minutes after being born. Half an hour later it is running at 20 miles per hour.

1/ I was looking through Sutton and Barto's "Reinforcement Learning" book and it had this remarkable claim, so I Googled it. Can you guess what the response was? [little 🧡] ↡

29.12.2024 15:30 πŸ‘ 15 πŸ” 2 πŸ’¬ 2 πŸ“Œ 0
Preview
A New Social We build bridges, not walls.

Excited to announce that I’m teaming up with @quillmatiq.com on a new non-profit for the open social web, across protocols, with Bridgy Fed as its first main project! Introducing A New Social.

17.12.2024 17:28 πŸ‘ 1154 πŸ” 136 πŸ’¬ 42 πŸ“Œ 12

πŸ“šπŸ“’Christmas came early πŸŽ…
Our new textbook on multi-agent reinforcement learning with @mitpress.bsky.social is out NOW!

The book is available from MIT Press (mitpress.mit.edu/978026204937...) or your bookstore nearby.

Why you should be interested, what you get, and more all in a πŸ§΅πŸ‘‡

17.12.2024 14:32 πŸ‘ 32 πŸ” 9 πŸ’¬ 1 πŸ“Œ 1
Post image

Data Science Environment Away day going very well indeed - after a morning of presentations a well deserved lunch break!

16.12.2024 13:06 πŸ‘ 3 πŸ” 1 πŸ’¬ 0 πŸ“Œ 1
Post image

We did it! 1st use of VR in teaching at @lancasteruni.bsky.social taking our students to an underwater world πŸ˜ƒ includes data collected from our lovely students to allow us to assess its effectiveness. Thanks to The Hydrous for providing the underwater coral reef experience FREE thehydro.us

02.12.2024 16:17 πŸ‘ 22 πŸ” 4 πŸ’¬ 0 πŸ“Œ 0

Be great to link up with any Lancaster University people on this site @bsky.app

03.12.2024 15:53 πŸ‘ 2 πŸ” 2 πŸ’¬ 1 πŸ“Œ 0

Likewise!

17.12.2024 13:57 πŸ‘ 0 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0
Preview
Reinforcement Learning: An Overview This manuscript gives a big-picture, up-to-date overview of the field of (deep) reinforcement learning and sequential decision making, covering value-based RL, policy-gradient methods, model-based met...

An updated intro to reinforcement learning by Kevin Murphy: arxiv.org/abs/2412.05265! Like their books, it covers a lot and is quite up to date with modern approaches. It also is pretty unique in coverage, I don't think a lot of this is synthesized anywhere else yet

09.12.2024 14:27 πŸ‘ 271 πŸ” 73 πŸ’¬ 9 πŸ“Œ 5
Preview
Interviewing Finbarr Timbers on the "We are So Back" Era of Reinforcement Learning Listen now | Interconnects interview #11. An overview on the past, present, and future of RL.

This is one I've wanted to do for a while: ask why RL has been continually underestimated in the last 2 years.

Interviewing Finbarr Timbers on the "We are So Back" Era of Reinforcement Learning
Interconnects interview #11. An overview on the past, present, and future of RL.
https://buff.ly/3Vqrbqj

05.12.2024 15:40 πŸ‘ 58 πŸ” 8 πŸ’¬ 2 πŸ“Œ 2
Preview
Reinforcement Learning

Check out my article on Reinforcement Learning for the Open Encyclopedia of Cognitive Science! oecs.mit.edu/pub/k2ek981x...
@oecs-bot.bsky.social

04.12.2024 18:57 πŸ‘ 24 πŸ” 14 πŸ’¬ 0 πŸ“Œ 0

Just starting to build out my home office to have a nice space to push through the end of my PhD. What's your number one item to have in it? Can be anything from plants to monitors!
🌊πŸ§ͺ
#phdlife
#academia

04.12.2024 10:03 πŸ‘ 6 πŸ” 2 πŸ’¬ 6 πŸ“Œ 0

The deep goal of bluesky is to decentralize the social internet so that every individual controls their experience of it rather than having it be controlled by 5 random billionaires. Everyone thinks they signed up for a demuskified twitter...we actually signed an exciting and bizarre experiment.

03.12.2024 16:05 πŸ‘ 57590 πŸ” 6378 πŸ’¬ 1286 πŸ“Œ 456

How come I didn't know about this BeNeRL seminar series? It focuses on practical RL and seems really great!

www.benerl.org/seminar-seri...

I would have loved to hear Benjamin Eysenbach, Chris Lu and Edward Hu... Next one is on December 19th.

03.12.2024 16:11 πŸ‘ 9 πŸ” 2 πŸ’¬ 0 πŸ“Œ 0
Preview
MIT researchers develop an efficient way to train more reliable AI agents MIT researchers developed an efficient approach for training more reliable reinforcement learning models, focusing on complex tasks that involve variability. This could enable the leverage of reinforc...

Reinforcement learning (RL) remains surprisingly brittle to contextual variations in tasks. Our new method (NeurIPS 2024) for solving contextual RL problems achieves 5-50x better sample efficiency on standard & traffic benchmarks. Featured by MIT news! news.mit.edu/2024/mit-res...

25.11.2024 19:35 πŸ‘ 14 πŸ” 3 πŸ’¬ 1 πŸ“Œ 1
Post image Post image Post image

Want to learn / teach RL? 

Check out new book draft:
Reinforcement Learning - Foundations
sites.google.com/view/rlfound...
W/ Shie Mannor & Yishay Mansour
This is a rigorous first course in RL, based on our teaching at TAU CS and Technion ECE.

25.11.2024 12:08 πŸ‘ 154 πŸ” 34 πŸ’¬ 4 πŸ“Œ 4
Preview
Streaming Deep Reinforcement Learning Finally Works Natural intelligence processes experience as a continuous stream, sensing, acting, and learning moment-by-moment in real time. Streaming learning, the modus operandi of classic reinforcement learning ...

Streaming Deep Reinforcement Learning Finally Works, by
M. Elsayed, G. Vasan, A. R. Mahmood, is one of those papers I wish I had written πŸ˜…

This paper seems to allow us to do RL with NNs as it should have always been done. Everyone should read it!

arxiv.org/abs/2410.14606

27.11.2024 23:09 πŸ‘ 92 πŸ” 20 πŸ’¬ 2 πŸ“Œ 0

Adding my love letter to

arxiv.org/pdf/2304.01315

Empirical Design in Reinforcement Learning
by
Andrew Patterson, Samuel Neumann, Martha White, Adam White

JMLR 25 (2024) 1-63
#ReinforcementLearning

These aren’t the heroes we deserve, but they are the heroes we need.

23.11.2024 13:40 πŸ‘ 211 πŸ” 47 πŸ’¬ 7 πŸ“Œ 6

Does everyone in your community agree on some folk knowledge that isn’t published anywhere? Put it in a paper! It’s a pretty valuable contribution

26.11.2024 22:31 πŸ‘ 202 πŸ” 26 πŸ’¬ 24 πŸ“Œ 10
Preview
Learn Git Branching An interactive Git visualization tool to educate and challenge!

On one of the first projects I supervised in my PhD, a student repeatedly ignored suggestions to commit and then accidentally deleted the project at the end of the semester. Please use git! There are even "fun" games you can use to learn it:
learngitbranching.js.org

27.11.2024 15:40 πŸ‘ 60 πŸ” 6 πŸ’¬ 5 πŸ“Œ 0