Playlist: 25 videos
Mathematics of Online Decision Making
0:29:56
Akshay Krishnamurthy (Microsoft Research)
https://simons.berkeley.edu/talks/representation-learning-and-exploration-reinforcement-learning
Mathematics of Online Decision Making
Visit talk page
https://simons.berkeley.edu/talks/representation-learning-and-exploration-reinforcement-learning
Mathematics of Online Decision Making
0:28:34
Aleksandrs Slivkins (Microsoft Research NYC)
https://simons.berkeley.edu/talks/corruption-robust-exploration-episodic-reinforcement-learning
Mathematics of Online Decision Making
Visit talk page
https://simons.berkeley.edu/talks/corruption-robust-exploration-episodic-reinforcement-learning
Mathematics of Online Decision Making
0:34:40
Daniel Russo (Columbia University)
https://simons.berkeley.edu/talks/global-convergence-and-approximation-benefits-policy-gradient-methods
Mathematics of Online Decision Making
Visit talk page
https://simons.berkeley.edu/talks/global-convergence-and-approximation-benefits-policy-gradient-methods
Mathematics of Online Decision Making
0:30:42
Michael Littman (Brown University)
An Alternative Softmax Operator for Reinforcement Learning
Mathematics of Online Decision Making
Visit talk page
An Alternative Softmax Operator for Reinforcement Learning
Mathematics of Online Decision Making
0:55:8
Sham Kakade (University of Washington & Microsoft Research)
https://simons.berkeley.edu/talks/lower-bounds-batch-reinforcement-learning
Mathematics of Online Decision Making
Visit talk page
https://simons.berkeley.edu/talks/lower-bounds-batch-reinforcement-learning
Mathematics of Online Decision Making