Playlist: 21 videos

Societal Considerations and Applications

Remote video URL
0:41:56
Lydia Liu (Cornell University)
https://simons.berkeley.edu/talks/what-really-matters-fairness-machine-learning-delayed-impact-and-other-desiderata
Societal Considerations and Applications

From education to lending, consequential decisions in society increasingly rely on data-driven algorithms. Yet the long-term impact of algorithmic decision making is largely ill-understood, and there exist serious challenges to ensuring equitable benefits, in theory and practice. While the subject of algorithmic fairness has received much attention, algorithmic fairness criteria have significant limitations as tools for promoting equitable benefits. In this talk, we review various fairness desiderata in machine learning and when they may be in conflict. We then introduce the notion of delayed impact---the welfare impact of decision-making algorithms on populations after decision outcomes are observed, motivated, for example, by the change in average credit scores after a new loan approval algorithm is applied. We demonstrate that several statistical criteria for fair machine learning, if applied as a constraint to decision-making, can result in harm to the welfare of a disadvantaged population. We end by considering future directions for fairness in machine learning that evince a holistic and interdisciplinary approach.
Visit talk page
Remote video URL
0:46:46
Jacob Steinhardt (UC Berkeley)
https://simons.berkeley.edu/node/22932
Societal Considerations and Applications

Digital recommender systems such as Spotify and Netflix affect not only consumer behavior but also producer incentives: producers seek to supply content that will be recommended by the system. But what content will be produced? To understand this, we model users and content as D-dimensional vectors, and assume the system recommends the content that has the highest dot product with each user. In contrast to traditional economic models, here the producer decision space is high-dimensional and the user base is heterogeneous. This gives rise to new qualitative phenomena at equilibrium: the formation of genres,and the possibility of positive profit at equilibrium. We characterize these phenomena in terms of the geometry of the users and the structure of producer costs. At a conceptual level, our work serves as a starting point to investigate how recommender systems shape supply-side
competition between producers.

Joint work with Meena Jagadeesan and Nikhil Garg
Visit talk page
Remote video URL
0:52:32
Hamsa Bastani (Wharton School, University of Pennsylvania)
https://simons.berkeley.edu/talks/decision-aware-learning-global-health-supply-chains
Societal Considerations and Applications

The combination of machine learning (for prediction) and optimization (for decision-making) is increasingly used in practice. However, a key challenge is the need to align the loss function used to train the machine learning model with the decision loss associated with the downstream optimization problem. Traditional solutions have limited flexibility in the model architecture and/or scale poorly to large datasets. We propose a "light-touch" decision-aware learning heuristic that uses a novel Taylor expansion of the optimal decision loss to derive the machine learning loss. Importantly, our approach only requires a simple re-weighting of the training data, allowing it to flexibly and scalably be incorporated into complex modern data science pipelines, yet producing sizable efficiency gains. We apply our framework to optimize the distribution of essential medicines in collaboration with policymakers at the Sierra Leone National Medical Supplies Agency; highly uncertain demand and limited budgets currently result in excessive unmet demand. We leverage random forests with meta-learning to learn complex cross-correlations across facilities, and apply our decision-aware learning approach to align the prediction loss with the objective of minimizing unmet demand. Out-of-sample results demonstrate that our end-to-end approach significantly reduces unmet demand across 1000+ health facilities throughout Sierra Leone. Joint work with O. Bastani, T.-H. Chung and V. Rostami.
Visit talk page
Remote video URL
1:4:41
Jann Spiess (Stanford University)
https://simons.berkeley.edu/talks/unpacking-black-box-regulating-algorithmic-decisions
Societal Considerations and Applications

We show how to optimally regulate prediction algorithms in a world where (a) high-stakes decisions such as lending, medical testing or hiring are made by a complex `black-box' prediction functions, (b) there is an incentive conflict between the agent who designs the prediction function and a principal who oversees the use of the algorithm, and (c) the principal is limited in how much she can learn about the agent's black-box model. We show that limiting agents to prediction functions that are simple enough to be fully transparent is inefficient as long as the bias induced by misalignment between principal's and agent's preferences is small relative to the uncertainty about the true state of the world. Algorithmic audits can improve welfare, but the gains depend on the design of the audit tools. Tools that focus on minimizing overall information loss, the focus of many post-hoc explainer tools, will generally be inefficient since they focus on explaining the average behavior of the prediction function rather than those aspects that are most indicative of a misaligned choice. Targeted tools that focus on the source of incentive misalignment, e.g., excess false positives or racial disparities, can provide first-best solutions. We provide empirical support for our theoretical findings using an application in consumer lending.
Visit talk page
Remote video URL
1:0:15
Kimon Drakopoulos (University of Southern California)
https://simons.berkeley.edu/node/22914
Societal Considerations and Applications

Throughout the coronavirus disease 2019 (COVID-19) pandemic, countries have relied on a variety of ad hoc border control protocols to allow for non-essential travel while safeguarding public health, from quarantining all travellers to restricting entry from select nations on the basis of population-level epidemiological metrics such as cases, deaths or testing positivity rates1,2. Here we report the design and performance of a reinforcement learning system, nicknamed Eva. In the summer of 2020, Eva was deployed across all Greek borders to limit the influx of asymptomatic travellers infected with severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), and to inform border policies through real-time estimates of COVID-19 prevalence. In contrast to country-wide protocols, Eva allocated Greece’s limited testing resources on the basis of incoming travellers’ demographic information and testing results from previous travellers. By comparing Eva’s performance against modelled counterfactual scenarios, we show that Eva identified 1.85 times as many asymptomatic, infected travellers as random surveillance testing, with up to 2–4 times as many during peak travel, and 1.25–1.45 times as many asymptomatic, infected travellers as testing policies that utilize only epidemiological metrics. We demonstrate that this latter benefit arises, at least partially, because population-level epidemiological metrics had limited predictive value for the actual prevalence of SARS-CoV-2 among asymptomatic travellers and exhibited strong country-specific idiosyncrasies in the summer of 2020. Our results raise serious concerns on the effectiveness of country-agnostic internationally proposed border control policies3 that are based on population-level epidemiological metrics. Instead, our work represents a successful example of the potential of reinforcement learning and real-time data for safeguarding public health.
Visit talk page
Remote video URL
0:53:35
Vahideh Manshadi (Yale University)
https://simons.berkeley.edu/node/22916
Societal Considerations and Applications

In this talk, I describe our collaboration with VolunteerMatch (VM), the largest nationwide platform that connects nonprofits with volunteers. Through our work with VM, we have identified a key feature shared by many matching platforms (including Etsy, DonorsChoose, and VM): the supply side (e.g., nonprofits on the VM platform) not only relies on the platform’s internal recommendation algorithm to draw traffic but also utilizes other channels, such as social media, to attract external visitors. Such visitors arrive via direct links to their intended options, thus bypassing the platform’s recommendation algorithm. For example, of the 1.3 million monthly visitors to the VM platform, approximately 30% are external traffic directed to VM as a result of off-platform outreach activities, such as when nonprofits publicize volunteering opportunities on LinkedIn or Facebook. This motivated us to introduce the problem of online matching with multi-channel traffic, a variant of a canonical online matching problem. Taking a competitive analysis approach, we first demonstrate the shortcomings of a commonly-used algorithm that is optimal in the absence of external traffic. Then, we propose a new algorithm that achieves a near-optimal competitive ratio in certain regimes. Beyond theoretical guarantees, we demonstrate our algorithm’s practical effectiveness in simulations based on VM data. Time permitting, I will also report on implementing an improved recommendation algorithm on the VM platform and present data from our ensuing experimentation. (Joint work with Scott Rodilitz, Daniela Saban, and Akshaya Suresh)
Visit talk page