Check our work on max entropy RL! We introduce an off-policy method to maximize the entropy of the future state-action visitation distribution, leading to policies that explore effectively and achieve high performance 🎯
Link 📑 arxiv.org/abs/2412.06655
#RL #MaxEntRL #Exploration
13.12.2024 09:22
👍 8
🔁 4
💬 0
📌 0