Advancing Large Language Model Reasoning Capabilities with Preference Trees
EURUS, a suite of large language models optimized for reasoning, achieves state-of-the-art results on diverse benchmarks covering mathematics, code generation, and logical reasoning problems by leveraging ULTRAINTERACT, a newly-curated large-scale, high-quality alignment dataset designed for complex reasoning tasks.