Towards Stable and Robust Prompt Tuning for Few-shot Learning via Input Separation
StablePT is a novel language model architecture that processes textual information and soft prompts separately while preserving interaction between them, which helps stabilize model performance across different initializations of hard and soft prompts.
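To make the idea of input separation concrete, here is a minimal sketch in plain Python. It is not the StablePT implementation. It assumes, purely for illustration, that the interaction between the two streams is a single scaled dot-product cross-attention step in which soft prompts read from the text representations. The summary above does not specify the mechanism, and the names `cross_attend` and `separated_forward` are hypothetical.

```python
import math
import random


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def dot(u, v):
    """Dot product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def cross_attend(queries, keys_values):
    """Each query attends over keys_values (scaled dot-product, no learned projections)."""
    dim = len(keys_values[0])
    outputs = []
    for q in queries:
        weights = softmax([dot(q, k) / math.sqrt(dim) for k in keys_values])
        mixed = [sum(w * kv[i] for w, kv in zip(weights, keys_values))
                 for i in range(dim)]
        outputs.append(mixed)
    return outputs


def separated_forward(text_states, soft_prompts):
    """Keep text and soft prompts in separate streams.

    Assumption: the only interaction is that soft prompts read from the
    text states via cross-attention; the text stream is left untouched.
    """
    attended = cross_attend(soft_prompts, text_states)
    # Residual connection keeps each soft prompt's own signal.
    updated_prompts = [[p + a for p, a in zip(prompt, att)]
                       for prompt, att in zip(soft_prompts, attended)]
    return text_states, updated_prompts


rng = random.Random(0)
# Stand-in for encoder outputs of the text (with any hard prompt): 5 tokens, dim 4.
text_states = [[rng.gauss(0, 1) for _ in range(4)] for _ in range(5)]
# Stand-in for trainable soft prompt vectors: 2 prompts, dim 4.
soft_prompts = [[rng.gauss(0, 0.02) for _ in range(4)] for _ in range(2)]

text_out, prompt_out = separated_forward(text_states, soft_prompts)
```

In this sketch the text stream is returned unchanged, so whatever values the soft prompts start with cannot perturb the text representation directly. That is one plausible reading of why separating the inputs could reduce sensitivity to prompt initialization.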