AI systems can surpass human capabilities by leveraging easy-to-hard generalization through process-supervised reward models, enhancing performance on complex tasks.