The I-PHYRE framework challenges agents to demonstrate intuitive physical reasoning, multi-step planning, and in-situ intervention. It addresses the gap in evaluating agents' abilities to interact with dynamic events. The framework consists of four game splits designed to scrutinize learning and generalization of essential principles of interactive physical reasoning. Existing works have limitations in exploring physical reasoning due to constraints like passive observation or single-round interventions. I-PHYRE aims to bridge these gaps by emphasizing intuitive physical reasoning, multi-step interventions, and in-situ interactions. The framework includes 40 distinct games categorized into basic, noisy, compositional, and multi-ball splits for training and generalization assessment.
Naar een andere taal
vanuit de broninhoud
arxiv.org
Belangrijkste Inzichten Gedestilleerd Uit
by Shiqian Li,K... om arxiv.org 03-26-2024
https://arxiv.org/pdf/2312.03009.pdfDiepere vragen