핵심 개념
A novel multi-objective Deep Reinforcement Learning approach that learns allocation policies to jointly optimize efficiency and workload fairness in collaborative human-robot order picking systems.
초록
The content describes a novel approach to address the optimization problem in collaborative human-robot order picking systems. Key highlights:
- The problem considers both efficiency (total picking time) and workload fairness (standard deviation of picker workloads) as objectives, addressing the limitation of existing solutions that focus solely on efficiency.
- A multi-objective Deep Reinforcement Learning (DRL) approach is proposed to learn effective allocation policies that outline the trade-offs between the two objectives.
- A novel Aisle-Embedding Multi-Objective Aware Network (AEMO-Net) architecture is introduced to effectively capture regional information and extract representations related to efficiency and fairness.
- Extensive experiments demonstrate that the proposed approach can find non-dominated policy sets that outperform greedy and rule-based benchmarks on both efficiency and fairness objectives.
- The trained policies also show good transferability properties when tested on scenarios with different warehouse sizes.
통계
The total time to complete all pickruns should be minimized.
The standard deviation of the total workload (mass of picked items) across all pickers should be minimized.
인용구
"While optimizing efficiency (total picking time) is a dominant focus within both traditional and robotized warehousing settings, our study also takes into account the workload fairness, an objective often ignored in the literature."
"Existing solutions typically focus on deterministic scenarios and optimizing for efficiency. However, the sole focus on efficiency can negatively impact human well-being. If some pickers must pick much larger/heavier workloads than others, it can place considerable physical and mental strain on them."