Visual grounding can enhance word learning efficiency in neural language models, especially in low-data regimes.