The paper studies the problem of learning action models with full observability, following the learning by search paradigm. It develops a theory for action model learning based on version spaces, which interprets the task as a search for hypotheses consistent with the learning examples.
The key contributions are:
The paper establishes a precise mapping between version spaces and action model learning, deriving update rules that exploit the structure of the hypothesis space.
It shows how to manipulate the version space representation to extract sound (safe) and complete action model formulations, proving that both converge to the true model given enough demonstrations.
The sound model generates plans that are guaranteed to work with the true model, while the complete model ensures the existence of a plan if one exists for the true model.
Experiments demonstrate the complementarity of the sound and complete models, with their relative performance depending on the characteristics of the domain and the distribution of positive and negative demonstrations.
The paper provides a theoretical first-principles investigation of action model learning, unifying previous work on safe models and introducing a new perspective on complete models.
To Another Language
from source content
arxiv.org
Key Insights Distilled From
by Diego Aineto... at arxiv.org 04-16-2024
https://arxiv.org/pdf/2404.09631.pdfDeeper Inquiries