Enhancing Personalized Text Generation with Neural Bandits and White-box Language Models
A novel online method that employs neural bandit algorithms to dynamically optimize soft instruction embeddings based on user feedback, enhancing the personalization of open-ended text generation by white-box large language models.