核心概念
A game-theoretic decision-making strategy is proposed for an autonomous ego robot to block a racing opponent that aims to overtake it, by considering the opponent's level of reasoning and potential for strategy changes.
摘要
The paper presents a game-theoretic approach for an autonomous ego robot to block a racing opponent that aims to overtake it. The key aspects are:
-
Trajectory Planning:
- The trajectories for both the ego robot and opponent are generated as fifth-order polynomials, with parameters determined based on the initial state and target points.
- A set of trajectory candidates is constructed for each agent, considering different acceleration and lateral target values.
-
Interaction Modeling using Level-K Framework:
- The level-K framework is used to model the strategies of the agents, where a level-K agent makes optimal decisions while assuming the other agents are level-(K-1).
- The ego robot's reward function is designed as the negative of the opponent's reward, forming a zero-sum game.
- The opponent's level is estimated online based on its past behavior, and the ego robot selects a trajectory one level higher than the estimated opponent's level.
-
Adaptive Trajectory Mixing Approach:
- To account for potential changes in the opponent's strategy, the ego robot mixes its best trajectory with a fail-safe trajectory that is the best response to the opponent's least likely trajectory.
- The mixing proportion is determined by a level change potential, which is updated based on whether the opponent's estimated level changes.
The proposed approach is evaluated through simulations and human-in-the-loop experiments, demonstrating superior performance in blocking the opponent's overtaking attempts compared to the conventional level-K framework.
統計資料
The maximum speed limit for the opponent is 0.61 m/s, while the maximum speed limit for the ego robot is 0.6 m/s.