Dissecting Machine Unlearning for Large Language Models: Selective Pruning Method
The author introduces a machine unlearning method called selective pruning specifically designed for Large Language Models, focusing on removing neurons based on their importance to specific capabilities. This approach offers a data-efficient method to identify and eliminate neurons enabling specific behaviors.