Enhancing Medical Multimodal Capabilities through PubMedVision: A Large-Scale, High-Quality Dataset for Injecting Visual Knowledge into Language Models
Constructing PubMedVision, a large-scale, high-quality medical multimodal dataset, to significantly boost the multimodal capabilities of language models in medical applications.