LaVy: A Pioneering Vietnamese Multimodal Large Language Model for Advancing Visual-Linguistic Understanding
LaVy is a state-of-the-art Vietnamese Multimodal Large Language Model (MLLM). It aims to close the gap between Vietnamese Large Language Models (LLMs) and MLLMs, supporting complex reasoning and language understanding in tasks that combine visual and textual information.