Efficient Multimodal Generative Framework for Extracting Implicit Product Attribute Values
EIVEN, a data and parameter-efficient multimodal generative framework, leverages the rich inherent knowledge of pre-trained language models and vision encoders to effectively extract implicit product attribute values while requiring less labeled data. It also introduces a novel Learning-by-Comparison technique to reduce model confusion among similar attribute values.