Precise Text-Based Reasoning About Vector Graphics Through Primal Visual Description
VDLM, a text-based visual reasoning framework, leverages Scalable Vector Graphics (SVG) and an intermediate Primal Visual Description (PVD) representation to enable precise perception and reasoning about vector graphics, outperforming state-of-the-art large multimodal models.