data:image/s3,"s3://crabby-images/85c88/85c88eb653f5b011f4c3e941f699689b06492b83" alt="Large Language Models and Transformers: Part 1 (FALL)"
Abstract
Prompt injection attacks are a significant threat to the security of LLM-integrated applications. These attacks exploit the lack of a clear separation between instructions/prompts and user data. I will introduce the notion of structured queries, a general approach to tackle this problem by explicitly separating prompt and data and training LLMs to respect this separation. I will describe how to adjust standard instruction tuning to respect this separation, and show the resulting models provide significant improvements in robustness against prompt injection.