Skip to main content

Utility navigation

  • Calendar
  • Contact
  • Login
  • MAKE A GIFT
Berkeley University of California
Home Home

Main navigation

  • Home
  • Programs & Events
    • Research Programs
    • Workshops & Symposia
    • Public Lectures
    • Research Pods
    • Internal Program Activities
    • Algorithms, Society, and the Law
  • People
    • Scientific Leadership
    • Staff
    • Current Long-Term Visitors
    • Research Fellows
    • Postdoctoral Researchers
    • Scientific Advisory Board
    • Governance Board
    • Industry Advisory Council
    • Affiliated Faculty
    • Science Communicators in Residence
    • Law and Society Fellows
  • Participate
    • Apply to Participate
    • Plan Your Visit
    • Location & Directions
    • Postdoctoral Research Fellowships
    • Law and Society Fellowships
    • Science Communicator in Residence Program
    • Circles
    • Breakthroughs Workshops and Goldwasser Exploratory Workshops
  • Support
    • Annual Fund
    • Funders
    • Industrial Partnerships
  • News & Videos
    • News
    • Videos
  • About
Image
Large Language Models and Transformers: Part 2 (SPRING)

Safety-Guaranteed LLMs

Program
Special Year on Large Language Models and Transformers, Part 2
Location

Calvin Lab auditorium

Date
Monday, Apr. 14 – Friday, Apr. 18, 2025
Back to calendar

Breadcrumb

  1. Home
  2. Workshop & Symposia
  3. Videos

Secondary tabs

  • The Workshop
  • Schedule
  • Videos
Remote video URL
Workshops

Simulating Counterfactual Training

Visit talk page
Remote video URL
Workshops

Controlling Untrusted AIs With Monitors

Visit talk page
Remote video URL
Workshops

Can We Get Asymptotic Safety Guarantees Based On Scalable Oversight?

Visit talk page
Remote video URL
Workshops

Amortised Inference Meets Llms: Algorithms And Implications For Faithful Knowledge Extraction

Visit talk page
Remote video URL
Richard M. Karp Distinguished Lectures

Panel Discussion

Visit talk page
Remote video URL
Workshops

Robustness of jailbreaking across aligned LLMs, reasoning models and agents

Visit talk page
Remote video URL
Workshops

Adversarial Robustness of LLMs' Safety Alignment

Visit talk page
Remote video URL
Workshops

Antidistillation Sampling

Visit talk page
Remote video URL
Workshops

Causal Representation Learning: A Natural Fit for Mechanistic Interpretability

Visit talk page
Remote video URL
Workshops

Out Of Distribution, Out Of Control? Understanding Safety Challenges In AI

Visit talk page

Pagination

  • Current page 1
  • Page 2
  • Next page Next
Home
The Simons Institute for the Theory of Computing is the world's leading venue for collaborative research in theoretical computer science.

Footer

  • Programs & Events
  • About
  • Participate
  • Workshops & Symposia
  • Contact Us
  • Calendar
  • Accessibility

Footer social media

  • Twitter
  • Facebook
  • Youtube
© 2013–2025 Simons Institute for the Theory of Computing. All Rights Reserved.
link to homepage

Main navigation

  • Home
  • Programs & Events
    • Research Programs
    • Workshops & Symposia
    • Public Lectures
    • Research Pods
    • Internal Program Activities
    • Algorithms, Society, and the Law
  • People
    • Scientific Leadership
    • Staff
    • Current Long-Term Visitors
    • Research Fellows
    • Postdoctoral Researchers
    • Scientific Advisory Board
    • Governance Board
    • Industry Advisory Council
    • Affiliated Faculty
    • Science Communicators in Residence
    • Law and Society Fellows
  • Participate
    • Apply to Participate
    • Plan Your Visit
    • Location & Directions
    • Postdoctoral Research Fellowships
    • Law and Society Fellowships
    • Science Communicator in Residence Program
    • Circles
    • Breakthroughs Workshops and Goldwasser Exploratory Workshops
  • Support
    • Annual Fund
    • Funders
    • Industrial Partnerships
  • News & Videos
    • News
    • Videos
  • About

Utility navigation

  • Calendar
  • Contact
  • Login
  • MAKE A GIFT
link to homepage