Query-Dependent Prompt Evaluation and Optimization with Offline Inverse RL
Original Paper: https://arxiv.org/abs/2309.06553
By: Hao Sun, Alihan Hüyük, Mihaela van der Schaar
Abstract: In this study, we aim to enhance the arithmetic reasoning ability of Large Language Models (LLMs) through zero-shot prompt optimization. We identify a previously overlooked objective of query dependency in such optimization