Safety Neurons in Large Language Models
Abstract
Large language models (LLMs) achieve state-of-the-art performance across a wide range of tasks, but their widespread deployment raises urgent concerns around security, privacy, and misuse. Building on recent...