Download Linnk AI
•
Research Assistant
>
Sign In
insight
-
Preference Optimization in LLMs
Curry-DPO: Enhancing Alignment with Curriculum Learning & Ranked Preferences
Curry-DPO improves LLM alignment using multiple preference pairs and curriculum learning.
1