research-papers
Meta-Rewarding Language Models: Self-Improving Alignment with LLM-as-a-Meta-Judge
Original Paper: https://arxiv.org/abs/2407.19594 By: Tianhao Wu, Weizhe Yuan, Olga Golovneva, Jing Xu, Yuandong Tian, Jiantao Jiao, Jason Weston, Sainbayar Sukhbaatar Abstract: Large Language Models (LLMs) are rapidly surpassing human knowledge in many domains. While improving these models traditionally relies on costly human data, recent self-rewarding