下载 Linnk AI
•
研究助手
>
登录
洞察
-
Multimodal Autonomous Agents
OmniACT: A Dataset and Benchmark for Multimodal Generalist Autonomous Agents
Virtual agents can automate computer tasks, but current models struggle with visual understanding.
1