Leveraging Large Language Models to Efficiently Generate Multilingual Training Data for Dense Retrieval
Synthetic training data generation using large language models can effectively substitute for expensive human-labeled data in improving multilingual dense retrieval models.