Dynamic Prompt Learning via Policy Gradient for Semi-structured Mathematical Reasoning

Dynamic Prompt Learning via Policy Gradient for Semi-structured Mathematical Reasoning