Abstract In this talk, I will introduce the transformer, a ubiquitous building block of language models. I will also pontificate on why the architecture is the way it is, and what exactly it is trying to accomplish. Attachment File LLM24-BC Slides - Daniel Hsu and Ankur Moitra 1.pdf File LLM24-BC Slides - Daniel Hsu Cited Papers.pdf