Chain of Hindsight Aligns Language Models with Feedback
