Regression 대신 Classification을 사용한 Scalable Deep RL의 Value Functions 훈련
Value functions trained with categorical cross-entropy significantly improve performance and scalability in various domains, showcasing the potential of using classification instead of regression in deep RL.