About

As the landscape of artificial intelligence evolves, ensuring the safety and alignment of superintelligent large language models (LLMs) is paramount. This workshop will delve into the theoretical foundations of LLM safety, including topics such as the Bayesian view of LLM safety, the reinforcement learning (RL) view of safety, and other theoretical perspectives.

The flavor of this workshop is futuristic, focusing on how to ensure that a superintelligent LLM or AI system remains safe and aligned with humans. The workshop is a joint effort of the Simons Institute and IVADO.

Key Topics:

  • Bayesian Approaches to LLM Safety
  • Reinforcement Learning Perspectives on Safety
  • Theoretical Frameworks for Ensuring AI Alignment
  • Case Studies and Practical Implications
  • Future Directions in LLM Safety Research

Chairs/Organizers