The authors present SplAgger, a method that combines permutation variant and invariant components to enhance meta-reinforcement learning performance.
Task inference sequence models are beneficial in meta-RL, even without task inference objectives.
Task inference sequence models are beneficial in meta-RL, even without task inference objectives.