This program will bring together researchers working on algorithmic, mathematical and statistical aspects of modern Data Science, with the aim of identifying a set of core techniques and principles that form a foundation for the subject. While the foundations of Data Science lie at the intersection between computer science, statistics and applied mathematics, each of those disciplines in turn developed in response to particular long-standing problems. Building a foundation for modern Data Science requires rethinking not only how those three research areas interact with data, implementations and applications, but also how each of the areas interacts with the others. For example, differing applications in computer science and scientific computing have led to different formalizations of appropriate models, questions to consider, computational environments (such as single machine vs distributed data centers vs supercomputers), and so on. Similarly, business, internet and social media applications tend to have certain design requirements and to generate certain types of questions, and these tend to be very different from those that arise in scientific and medical applications. As well as these differences, there are also many similarities between these areas. Developing the theoretical foundations of Data Science requires paying appropriate attention to the questions and issues of domain scientists who generate and use the data, and to the computational environments and platforms supporting this work.
Long-Term Participants [tentative list, including organizers]:
Ery Arias-Castro (UC San Diego), Laura Balzano (University of Michigan), Peter Bartlett (UC Berkeley), Shai Ben-David (University of Waterloo), Vladamir Braverman (Johns Hopkins University), Amit Chakrabarti (Dartmouth College), Kenneth Clarkson (IBM Research), Artur Czumaj (University of Warwick), Anirban Dasgupta (IIT Gandhinagar), Sanjoy Dasgupta (UC San Diego), Ilias Diakonikolas (University of Southern California), Maryam Fazel (University of Washington), Anupam Gupta (Carnegie Mellon University), Mohammad Taghi Hajiaghayi (University of Maryland), Adel Javanmard (University of Southern California), T. S. Jayram (IBM Almaden), Brendan Juba (Washington University in St. Louis), Ravindran Kannan (Microsoft Research India), Michael Kapralov (EPFL), Robi Krauthgamer (Weizmann Institute), Gabor Lugosi (Pompeu Fabra University, Barcelona), Michael Mahoney (International Computer Science Institute and UC Berkeley), Dustin Mixon (Ohio State University), Andrea Montanari (Stanford University), Sayan Mukherjee (Duke University), Boaz Nadler (Weizmann Institute), Deanna Needell (UCLA), Rasmus Pagh (IT University of Copenhagen), Jeff Phillips (University of Utah), Eric Price (University of Texas at Austin), Fred Roosta (University of Queensland), Barna Saha (UMass Amherst), Sujay Sanghavi (University of Texas at Austin), Michael Saunders (Stanford University), Madeleine Udell (Cornell University), Santosh Vempala (Georgia Tech), Bei Wang (University of Utah), Rachel Ward (University of Texas, Austin), David Woodruff (IBM Research).
sympa [at] lists [dot] simons [dot] berkeley [dot] edu (body: subscribe%20datascience2018annoucements%40lists.simons.berkeley.edu) (Click here to subscribe to our announcements email list for this program).
Those interested in participating in this program should send an email to the organizers at this datascience2018 [at] lists [dot] simons [dot] berkeley [dot] edu (at this address.)
Program image by Luisa Lee