Scalable Method for Instruction Following Language Model
The author presents a scalable method, instruction backtranslation, to improve language models' ability to follow instructions by leveraging unlabeled data and self-training. This approach outperforms existing models on the Alpaca leaderboard without relying on distillation data.