The I-PHYRE framework challenges agents to demonstrate intuitive physical reasoning, multi-step planning, and in-situ intervention. It addresses the gap in evaluating agents' abilities to interact with dynamic events. The framework consists of four game splits designed to scrutinize learning and generalization of essential principles of interactive physical reasoning. Existing works have limitations in exploring physical reasoning due to constraints like passive observation or single-round interventions. I-PHYRE aims to bridge these gaps by emphasizing intuitive physical reasoning, multi-step interventions, and in-situ interactions. The framework includes 40 distinct games categorized into basic, noisy, compositional, and multi-ball splits for training and generalization assessment.
toiselle kielelle
lähdeaineistosta
arxiv.org
Tärkeimmät oivallukset
by Shiqian Li,K... klo arxiv.org 03-26-2024
https://arxiv.org/pdf/2312.03009.pdfSyvällisempiä Kysymyksiä