核心概念
Amharic LLaMA and LLaVA aim to enhance language models for low resource languages like Amharic through data augmentation and multimodal capabilities.
统计
"Amqa: Amharic question answering dataset, 2023."
"An amharic news text classification dataset, 2021."
"Seamlessm4t: Massively multilingual multimodal machine translation, 2023."
"Blip: Bootstrapping language-image pre-training for unified vision-language understanding and generation, 2022."
"LoRA: Low-rank adaptation of large language models, 2022."
引用
"Large Language Models (LLMs) excel at natural language processing tasks."
"LLMs struggle with low-resource languages like Amharic due to limited training data."
"Data augmentation through machine translation to create diverse Amharic tokens."