Training A Small Emotional Vision Language Model for Visual Art Comprehension
Stats
RTX 2080 Tiでトレーニングおよび評価可能であり、非常に強力なパフォーマンスを発揮します。
Quotes
"The proposed model can be trained and evaluated on a single RTX 2080 Ti while exhibiting very strong performance."
"Our model is very competitive compared with LLaVA-FT, having higher accuracy and efficiency."