Symmetry and Structure in Deep Reinforcement Learning
- 14:00 19th April 2022 ( Trinity Term 2022 )Lecture Theatre B
In this talk, I will discuss our work on symmetry and structure in reinforcement learning. In particular, I will discuss MDP Homomorphic Networks, a class of networks that ties transformations of observations to transformations of decisions. Such symmetries are ubiquitous in deep reinforcement learning, but often ignored in current approaches. Enforcing this prior knowledge into policy and value networks allows us to reduce the size of the solution space, a necessity in problems with large numbers of possible observations. I will showcase the benefits of our approach on agents in virtual environments. Building on the foundations of MDP Homomorphic Networks, I will also discuss our ongoing work on symmetries among multiple agents. This forms a basis for my vision for reinforcement learning for complex virtual environments, as well as for problems with intractable search spaces.
Register here.