A Regret Minimization Approach to Multi-Agent Control and RL

Workshop

Multi-Agent Reinforcement Learning and Bandit Learning

Speaker(s)

Elad Hazan (Princeton University and Google Research)

Location

Calvin Lab Auditorium

Date

Tuesday, May 3, 2022

Time

10:15 – 11 a.m. PT

Abstract

We'll start by describing nonstochastic control, a new paradigm in reinforcement learning, and how it relates to existing frameworks, and we'll survey efficient gradient-based methods for regret minimization in this model. We then describe recent work on multi-agent learning based on regret minimization methods that reach an equilibrium. We'll conclude with remaining challenges and potential directions for further research.
