HybridFlow: A Flexible and Efficient Framework for Reinforcement Learning from Human Feedback (RLHF)
HybridFlow is a flexible and efficient framework that combines single-controller and multi-controller paradigms to enable flexible representation and efficient execution of diverse RLHF dataflows.