Analyzing Multimodal Assistant with Small Language Models
The author explores the design aspects of Multimodal Small Language Models (MSLMs) and introduces Mipha, an efficient multimodal assistant that outperforms large models without additional training data.