核心概念
A multi-brain collaborative control system that integrates blind and perceptive policies to enable quadruped robots to maintain stable locomotion even when the perceptual system is impaired or observational data is incomplete.
摘要
The researchers propose a Multi-Brain Collaborative Control (MBC) system that combines the concepts of Multi-Agent Reinforcement Learning (MARL) and introduces collaboration between a Blind Policy and a Perceptive Policy for quadruped robot locomotion.
The Blind Policy relies on proprioceptive information and preset algorithms, suitable for known and structured environments but lacking adaptability in complex or unknown environments. The Perceptive Policy uses visual sensors to obtain detailed environmental information, allowing the robot to adapt to complex terrains, but its effectiveness is limited under occluded conditions or when perception fails.
The MBC system aims to address these challenges by enabling the Blind Policy and Perceptive Policy to collaborate and make decisions collectively. This approach enhances the robot's decision-making and adaptability in complex environments, particularly when the perceptual system is impaired or observational data is incomplete.
The key highlights of the research include:
- A novel Multi-Brain Game Collaboration System (MBC) that uses MARL to enable independent and collaborative optimization of the Blind and Perceptive Policies.
- Successful implementation of "Perception Hot Swap" - the ability to maintain robust locomotion when the external perception system suddenly fails.
- Enhanced mobility in complex environments through the non-zero-sum game between the Blind and Perceptive Policies.
The researchers conducted extensive simulations and real-world experiments to validate the effectiveness of their approach, demonstrating the quadruped robot's ability to navigate challenging terrains, including stairs, gaps, and pillars, even when the perception system is impaired.
统计
The robot was able to achieve a 99.3% success rate in crossing a 0.35m gap, a 97.6% success rate in climbing a 0.30m pit, and an 86.7% success rate in navigating a pillar obstacle with a size of 0.4m and a distance of 1.6m.
When the perception system suddenly failed, the robot maintained a 97% success rate in climbing stairs, a 100% success rate in descending stairs, and a 90% success rate in traversing discrete obstacles, with a mean X-displacement (MXD) of 19.97 and 17.04, respectively.
引用
"Unlike the Blind Policy, the Perceptive Policy is not as robust under these conditions."
"Addressing the challenges mentioned, this study integrates Multi-Agent Reinforcement Learning (MARL) to propose the concept of MBC:Multi-Brain Game Collaboration controller."
"This approach enhances decision-making and adaptability in complex environments."