Poster
in
Workshop: Workshop on robustness of zero/few-shot learning in foundation models (R0-FoMo)

A Universal Prompt Generator for Large Language Models

Gurusha Juneja · Amit Sharma

presentation: Workshop on robustness of zero/few-shot learning in foundation models (R0-FoMo)
Fri 15 Dec 6:50 a.m. PST — 3:30 p.m. PST

Abstract:

Large language models (LLMs) rely heavily on high-quality, task-specific prompts. However, prompt engineering depends on clever heuristics and typically requires multiple iterations. Some recent works attempt to automate this process by improving upon human-written prompts, but creating high-quality prompts from scratch remains an unresolved challenge owing to its inherent complexity. In this work, we propose UniPrompt, a novel technique for generating high-quality, human-like prompts from scratch. To do so, we identify characteristic features of human-generated prompts, such as being detailed and consisting of multiple sections. UniPrompt takes as input a single-sentence description of the task and generates human-like, sectioned prompts using an auxiliary language model. We train the model in two stages. First, the model is finetuned on multiple tasks using a novel dataset curated with GPT-4 across over 500 tasks. Second, we align the auxiliary model to generate task-relevant (high-accuracy) prompts by collecting a prompt preference dataset and optimizing the model with Direct Preference Optimization (DPO). Importantly, UniPrompt is task-agnostic: once trained, it can generate prompts for any task. We find that UniPrompt outperforms human-generated prompts, GPT-generated prompts, and other prompt optimization techniques across diverse tasks in medicine, causality, and hate speech detection by up to 5.1%, 7.2%, and 11.1%, respectively.
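The second training stage relies on the published Direct Preference Optimization objective. As an illustration only (not the authors' code; all names here are hypothetical), the standard DPO loss for one preference pair — a chosen prompt versus a rejected prompt, scored against a frozen reference model — can be sketched as:

```python
import math

def dpo_loss(logp_chosen: float, logp_rejected: float,
             ref_logp_chosen: float, ref_logp_rejected: float,
             beta: float = 0.1) -> float:
    """Standard DPO loss for a single preference pair.

    logp_*      : summed log-probabilities of the chosen/rejected prompt
                  under the model being aligned.
    ref_logp_*  : the same quantities under the frozen reference model
                  (here, the stage-one finetuned model).
    beta        : scale of the implicit reward (a common default is 0.1).
    """
    # Implicit rewards are log-probability ratios against the reference.
    reward_chosen = beta * (logp_chosen - ref_logp_chosen)
    reward_rejected = beta * (logp_rejected - ref_logp_rejected)
    margin = reward_chosen - reward_rejected
    # Negative log-sigmoid of the reward margin: minimized when the
    # aligned model prefers the chosen prompt more than the reference does.
    return -math.log(1.0 / (1.0 + math.exp(-margin)))
```

When both models score the pair identically, the margin is zero and the loss equals log 2; as the aligned model increasingly favors the chosen prompt, the loss falls toward zero.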