Proposing a novel reinforcement learning algorithm named TOLE for controllable text generation with token-level rewards, enhancing robustness and performance.