Abstract

AI work tends to focus on how to optimize a specified reward function, but rewards that consistently lead to the desired behavior are not so easy to specify. Rather than optimizing a specified reward, which is already hard, robots have the much harder job of optimizing the intended reward. While the specified reward does not carry as much information as we make our robots pretend it does, the good news is that humans constantly leak information about what the robot should actually optimize. In this talk, we will explore how to read the right amount of information from different types of human behavior -- and even the lack thereof.
