PALO is a large multilingual multimodal model designed to bridge the gap between vision and language tasks across ten major languages, offering inclusive and high-performing capabilities.
PALO introduces a Large Multilingual Multimodal Model to bridge the gap in vision-language tasks across ten major languages, emphasizing inclusivity and performance improvements.