Abstract

I will present the first polynomial-time method with poly(n)sqrt(T)-regret for bandit convex optimization.

Video Recording