Discovering Preference Optimization Algorithms with and for Large Language Models

Discovering Preference Optimization Algorithms with and for Large Language Models