Communication Efficient Distributed Optimization for Machine Learning

Workshop

Big Data Reunion Workshop

Speaker(s)

Martin Jaggi (Ecole Polytechnique Fédérale de Lausanne)

Location

Calvin Lab Auditorium

Date

Tuesday, Dec. 16, 2014

Time

11:10 – 11:35 a.m. PT

Abstract

Communication remains the most significant bottleneck in the performance of distributed optimization algorithms for large-scale machine learning. We propose a communication-efficient framework, COCOA, that uses local computation in a primal-dual setting to dramatically reduce the amount of necessary communication. We provide a strong convergence rate analysis for this class of algorithms, as well as experiments on real-world distributed datasets with implementations in Spark. In our experiments, we find that as compared to state-of-the-art mini-batch versions of SGD and SDCA algorithms, COCOA converges to the same .001-accurate solution quality on average 25× as quickly.

This is joint work with Virginia Smith, Martin Takáč, Jonathan Terhorst, Sanjay Krishnan, Thomas Hofmann, Michael I. Jordan

Attachment

Communication Efficient Distributed Optimization for Machine Learning (slides)