Poster

An Analytical Study of Utility Functions in Multi-Objective Reinforcement Learning

Manel Rodríguez Soto ⋅ Juan A Rodríguez-Aguilar ⋅ Maite López-Sánchez

[ Paper] [ Slides] [ Poster] [ OpenReview]

Abstract

Multi-objective reinforcement learning (MORL) is an excellent framework for multi-objective sequential decision-making. MORL employs a utility function to aggregate multiple objectives into one that expresses a user's preferences. However, MORL still misses two crucial theoretical analyses of the properties of utility functions: (1) a characterisation of the utility functions for which an associated optimal policy exists, and (2) a characterisation of the types of preferences that can be expressed as utility functions. As a result, we formally characterise the families of preferences and utility functions that MORL should focus on: those for which an optimal policy is guaranteed to exist. We expect our theoretical results to promote the development of novel MORL algorithms that exploit our theoretical findings.

Video

Chat is not available.