Iterated learning, inspired by cultural transmission theory, can improve the compositionality of large vision-language models.