NeurIPS Generating Human-Like Goals by Synthesizing Reward-Producing Programs

Poster
in
Workshop: Intrinsically Motivated Open-ended Learning (IMOL) Workshop

Generating Human-Like Goals by Synthesizing Reward-Producing Programs

Guy Davidson · Graham Todd · Todd Gureckis · Julian Togelius · Brenden Lake

Keywords: [ goal representations ] [ Program Synthesis ] [ quality-diversity ] [ contrastive learning ] [ goal programs ]

[ Abstract ] [ Project Page ]

[ OpenReview]

Abstract:

Humans show a remarkable capacity to generate novel goals, for learning and play alike, and modeling this human capacity would be a valuable step toward more generally-capable artificial agents. We describe a computational model for generating novel human-like goals represented in a domain-specific language (DSL). We learn a ‘human-likeness’ fitness function over expressions in this DSL from a small (<100 game) human dataset collected in an online experiment. We then use a Quality-Diversity (QD) approach to generate a variety of human-like games with different characteristics and high fitness. We demonstrate that our method can generate synthetic games that are syntactically coherent under the DSL, semantically sensible with respect to environmental objects and their affordances, but distinct from human games in the training set. We discuss key components of our model and its current shortcomings, in the hope that this work helps inspire progress toward self-directed agents with human-like goals.

Chat is not available.

Poster in Workshop: Intrinsically Motivated Open-ended Learning (IMOL) Workshop

Generating Human-Like Goals by Synthesizing Reward-Producing Programs

Guy Davidson · Graham Todd · Todd Gureckis · Julian Togelius · Brenden Lake

Poster
in
Workshop: Intrinsically Motivated Open-ended Learning (IMOL) Workshop