Mathematics of Online Decision Making
Program: Theory of Reinforcement Learning
Date: Monday, Oct. 26 – Friday, Oct. 30, 2020
Schedule
All talks are listed in Pacific Time (PDT).
Monday, Oct. 26, 2020
8:50 – 9 a.m.
Opening Remarks
9 – 9:30 a.m.
Online Multiserver Convex Chasing and Optimization
Yuval Rabani (Hebrew University of Jerusalem)
Video
9:30 – 10 a.m.
Multi-Task Optimal Experiment Design
Steffen Grunewalder (Lancaster University)
Video
10 – 10:30 a.m.
Selfish Robustness and Equilibria in Multi-Player Bandits
Vianney Perchet (ENSAE & Criteo AI Lab)
Video
10:30 – 11 a.m.
Discussion
11 – 11:30 a.m.
Break
11:30 a.m. – 12 p.m.
Pure Exploration Problems
Wouter Koolen (Centrum Wiskunde & Informatica)
Video
12 – 12:30 p.m.
Gradient Descent-Ascent Provably Converges to Strict Local Minmax Equilibria with a Finite Timescale Separation
Lillian Ratliff (University of Washington)
Video
12:30 – 1 p.m.
Learning Outcomes in Queueing Systems
Eva Tardos (Cornell)
Video
1 – 1:30 p.m.
Discussion
Tuesday, Oct. 27, 2020
9 – 9:30 a.m.
Pandora's Box with Correlations: Learning and Approximation
Shuchi Chawla (University of Wisconsin, Madison)
Video
9:30 – 10 a.m.
Regret Minimization for Stochastic Shortest Paths
Yishay Mansour (Tel Aviv University)
Video
10 – 10:30 a.m.
Robust Algorithms for Secretaries and Bandits
Anupam Gupta
Video
10:30 – 11 a.m.
Discussion
11 – 11:30 a.m.
Break
11:30 a.m. – 12 p.m.
The Non-Stochastic Control Framework
Naman Agarwal (Google)
Video
12 – 12:30 p.m.
Competitive Algorithms for Online Control
Yisong Yue (Caltech)
Video
12:30 – 1 p.m.
Discussion
Wednesday, Oct. 28, 2020
9 – 9:30 a.m.
A Unifying View of Optimism in Episodic Reinforcement Learning
Ciara Pike-Burke (Imperial College London)
Video
9:30 – 10 a.m.
On the Complexity of Learning Good Policies With and Without Rewards
Emilie Kaufmann (CNRS & University of Lille)
Video
10 – 10:30 a.m.
Model-Based Reinforcement Learning with Value-Targeted Regression
Mengdi Wang (Princeton University)
Video
10:30 – 11 a.m.
Discussion
11 a.m. – 12 p.m.
Gather.town
Thursday, Oct. 29, 2020
9 – 9:30 a.m.
A Generalization Bound for Online Variational Inference
Pierre Alquier (Riken AIP)
Video
9:30 – 10 a.m.
Beating the Curse of Dimensionality in High-Dimensional Optimal Stopping
David Goldberg (Cornell ORIE)
Video
10 – 10:30 a.m.
Online Learning via Offline Greedy Algorithms: Applications in Market Design and Optimization
Negin Golrezaei (MIT)
Video
10:30 – 11 a.m.
Discussion
11 – 11:30 a.m.
Break
11:30 a.m. – 12 p.m.
Country-Scale Bandit Implementation for Targeted COVID-19 Testing
Hamsa Bastani (Wharton School of the University of Pennsylvania)
Video
12 – 12:30 p.m.
Multi-Player Multi-Armed Bandit: Can We Still Collaborate at Homes Without "Zoom"?
Yuanzhi Li (Carnegie Mellon University)
Video
12:30 – 1 p.m.
Multiplayer Bandit Learning - From Competition to Cooperation
Simina Branzei (Purdue University)
Video
1 – 1:30 p.m.
Discussion
Friday, Oct. 30, 2020
9 – 9:30 a.m.
Representation Learning and Exploration in Reinforcement Learning
Akshay Krishnamurthy (Microsoft Research)
Video
9:30 – 10 a.m.
Corruption Robust Exploration in Episodic Reinforcement Learning
Aleksandrs Slivkins (Microsoft Research NYC)
Video
10 – 10:30 a.m.
On the Global Convergence and Approximation Benefits of Policy Gradient Methods
Daniel Russo (Columbia University)
Video
10:30 – 11 a.m.
Discussion
11 – 11:30 a.m.
Break
11:30 a.m. – 12 p.m.
An Alternative Softmax Operator for Reinforcement Learning
Michael Littman (Brown University)
Video
12 – 12:30 p.m.
What Are the Statistical Limits of Offline Reinforcement Learning With Function Approximation?
Sham Kakade (University of Washington & Microsoft Research)
Video
12:30 – 1 p.m.
Discussion