Skip to main content

Utility navigation

  • Calendar
  • Contact
  • Login
  • MAKE A GIFT
Berkeley University of California
Home Home

Main navigation

  • Programs & Events
    • Research Programs
    • Workshops & Symposia
    • Public Lectures
    • Research Pods
    • Internal Program Activities
    • Algorithms, Society, and the Law
  • Participate
    • Apply to Participate
    • Propose a Program
    • Postdoctoral Research Fellowships
    • Law and Society Fellowships
    • Science Communicator in Residence Program
    • Circles
    • Breakthroughs Workshops and Goldwasser Exploratory Workshops
  • People
    • Scientific Leadership
    • Staff
    • Current Long-Term Visitors
    • Research Fellows
    • Postdoctoral Researchers
    • Scientific Advisory Board
    • Governance Board
    • Affiliated Faculty
    • Science Communicators in Residence
    • Law and Society Fellows
    • Chancellor's Professors
  • News & Videos
    • News
    • Videos
  • Support for the Institute
    • Annual Fund
    • All Funders
    • Institutional Partnerships
  • For Visitors
    • Visitor Guide
    • Plan Your Visit
    • Location & Directions
    • Accessibility
    • Building Access
    • IT Guide
  • About

Results 2321 - 2330 of 23900

Workshop Talk
|
Apr. 3, 2025

Controllable and Creative Natural Language Generation

Recent advances in large language models (LLMs) have achieved remarkable results across a wide range of natural language processing (NLP) applications. As LLMs grow increasingly capable, the need to control their generation outcome becomes more pressing,...

Workshop Talk
|
Apr. 3, 2025

On Knowledge Separation and Latent Diffusion for Text

In this talk, I address two pressing shortcomings of large language models (LLMs):
In the first part, I argue that it is useful to distinguish between common knowledge—widely known information—and tail knowledge, which is highly specific and typically looked up rather than remembered. I introduce a simple pre-training strategy that separates tail from common knowledge and encourages the model to refrain from memorizing tail knowledge. Instead, the model learns to retrieve such information from an external database during inference, which is constructed during pre-training.
In the second part of the talk, I turn to the issue of controllability. I demonstrate that LLMs can be controlled effectively when guided by latent diffusion models. I explain the inner workings of diffusion models and how they can be adapted to generate latent states for autoregressive decoders, enabling more precise and reliable control over LLM outputs.

Video
|
Apr. 3, 2025
Panel Discussion: The Power of Unentangled Quantum Proofs With Non-negative Amplitudes | Quantum Colloquium
Video
|
Apr. 3, 2025
Panel Discussion: Optimization by Decoded Quantum Interferometry | Quantum Colloquium
Video
|
Apr. 3, 2025
Panel Discussion: Logical Quantum Processor Based On Reconfigurable Atom Arrays | Quantum Colloquium
Video
|
Apr. 3, 2025
Panel Discussion: Logical Quantum Processor Based On Reconfigurable Atom Arrays | Quantum Colloquium
Workshop Talk
|
Apr. 2, 2025

The Move Toward AGI: Why Large Language Models Surprised Almost Everyone, and What’s Coming Next | Theoretically Speaking

The advent of large language models (LLMs) such as ChatGPT changed forever the public perception of artificial intelligence. Our panel of experts will discuss why LLMs proved to be so surprising, even to researchers in the field, and why the explosion of increasingly powerful models has inevitably led to whispers about artificial general intelligence (AGI). We'll examine whether LLMs are sufficient to get us to AGI and, if not, what the missing ingredients might be.

Click here to visit event webpage.

_______________________

Theoretically Speaking is a lecture series highlighting exciting advances in theoretical computer science for a broad general audience. Events are free and open to the public, with first-come, first-served seating. No special background is assumed. Registration is required. This lecture will be viewable afterward on this page and on our YouTube channel, following captioning.

Light refreshments will be provided before the talk, starting at 4:30 p.m.

The Simons Institute regularly captures photos and video of activity around the Institute for use in publications and promotional materials. 

If you require special accommodation, please contact our access coordinator at simonsevents@berkeley.edu with as much advance notice as possible.

Workshop Talk
|
Apr. 2, 2025

Talk by

Abstract not available.

Workshop Talk
|
Apr. 2, 2025

On cognitive maps, LLMs, world models, and understanding

No abstract available.

Workshop Talk
|
Apr. 2, 2025

DeepSeek-R1 Thoughtology: <Thinking> about LLM Reasoning

Large Reasoning Models like DeepSeek-R1 mark a fundamental shift in how LLMs approach complex problems. Instead of directly producing an answer for a given input, DeepSeek-R1 creates detailed multi-step reasoning chains, seemingly “thinking” about a problem before providing an answer. This reasoning process is publicly available to the user, creating endless opportunities for studying the reasoning behaviour of the model and opening up the field of Thoughtology. Starting from a taxonomy of DeepSeek-R1’s basic building blocks of reasoning, our analyses on DeepSeek-R1 investigate the impact and controllability of thought length, management of long or confusing contexts, cultural and safety concerns, and the status of DeepSeek-R1 vis-à-vis cognitive phenomena, such as human-like language processing and world modelling. Our findings paint a nuanced picture. Notably, we show DeepSeek-R1 has a ‘sweet spot’ of reasoning, where extra inference time can impair model performance. Furthermore, we find a tendency for DeepSeek-R1 to persistently ruminate on previously explored problem formulations, obstructing further exploration. I will also present, VinePPO, an effective RL algorithm to improve reasoning abilities.

Pagination

  • Previous page Previous
  • Page 231
  • Page 232
  • Current page 233
  • Page 234
  • Page 235
  • Next page Next
Home
The Simons Institute for the Theory of Computing is the world's leading venue for collaborative research in theoretical computer science.

Footer

  • Programs & Events
  • Participate
  • Workshops & Symposia
  • Contact Us
  • Calendar
  • Accessibility

Footer social media

  • Twitter
  • Facebook
  • Youtube
© 2013–2026 Simons Institute for the Theory of Computing. All Rights Reserved.
link to homepage

Main navigation

  • Programs & Events
    • Research Programs
    • Workshops & Symposia
    • Public Lectures
    • Research Pods
    • Internal Program Activities
    • Algorithms, Society, and the Law
  • Participate
    • Apply to Participate
    • Propose a Program
    • Postdoctoral Research Fellowships
    • Law and Society Fellowships
    • Science Communicator in Residence Program
    • Circles
    • Breakthroughs Workshops and Goldwasser Exploratory Workshops
  • People
    • Scientific Leadership
    • Staff
    • Current Long-Term Visitors
    • Research Fellows
    • Postdoctoral Researchers
    • Scientific Advisory Board
    • Governance Board
    • Affiliated Faculty
    • Science Communicators in Residence
    • Law and Society Fellows
    • Chancellor's Professors
  • News & Videos
    • News
    • Videos
  • Support for the Institute
    • Annual Fund
    • All Funders
    • Institutional Partnerships
  • For Visitors
    • Visitor Guide
    • Plan Your Visit
    • Location & Directions
    • Accessibility
    • Building Access
    • IT Guide
  • About

Utility navigation

  • Calendar
  • Contact
  • Login
  • MAKE A GIFT
link to homepage