Leveraging Equal Preferences to Enhance Feedback Efficiency in Preference-Based Reinforcement Learning
Simultaneous learning from both equal and explicit preferences enables preference-based reinforcement learning agents to better understand human feedback, leading to improved feedback efficiency and task performance.
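As a rough illustration of how equal and explicit preferences can share one training signal, the sketch below uses the Bradley-Terry preference model with a cross-entropy loss, encoding an equal preference as the soft label 0.5. This encoding is a common convention in preference-based reinforcement learning and is an assumption here, not necessarily the method this work proposes. The function names `preference_prob` and `preference_loss`, and the per-step reward lists, are hypothetical.

```python
import math


def preference_prob(rewards_a, rewards_b):
    """Bradley-Terry probability that segment A is preferred over segment B.

    Each argument is a list of predicted per-step rewards for one segment.
    """
    diff = sum(rewards_a) - sum(rewards_b)
    # Numerically stable sigmoid of the difference in summed rewards.
    if diff >= 0:
        return 1.0 / (1.0 + math.exp(-diff))
    e = math.exp(diff)
    return e / (1.0 + e)


def preference_loss(rewards_a, rewards_b, label, eps=1e-12):
    """Cross-entropy between the predicted preference and the human label.

    label = 1.0: A preferred (explicit preference)
    label = 0.0: B preferred (explicit preference)
    label = 0.5: A and B judged equally good (equal preference, assumed encoding)
    """
    p = preference_prob(rewards_a, rewards_b)
    return -(label * math.log(p + eps) + (1.0 - label) * math.log(1.0 - p + eps))


# An equal-preference label is best fit when both segments get the same return.
loss_equal_match = preference_loss([1.0, 1.0], [1.0, 1.0], label=0.5)
loss_equal_mismatch = preference_loss([2.0, 2.0], [0.0, 0.0], label=0.5)

# An explicit label is best fit when the preferred segment gets the higher return.
loss_explicit_match = preference_loss([2.0, 2.0], [0.0, 0.0], label=1.0)
loss_explicit_mismatch = preference_loss([2.0, 2.0], [0.0, 0.0], label=0.0)
```

Under this assumed encoding, an equal-preference query still carries information: it pushes the reward model toward assigning similar returns to the two segments, rather than being discarded as uninformative.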