Generating Meaningful Counterfactuals to Interactively Analyze and Understand Large Language Models
The core message of this paper is to propose a novel algorithm for generating meaningful and grammatically correct textual counterfactuals, and an interactive visualization tool called LLM Analyzer to help users understand the behaviors of large language models (LLMs) by analyzing these counterfactuals.