NeurIPS Revisiting Value Alignment Through the Lens of Human-Aware AI

Poster
in
Workshop: HCAI@NeurIPS 2022, Human Centered AI

Revisiting Value Alignment Through the Lens of Human-Aware AI

Sarath Sreedharan · Subbarao Kambhampati

Keywords: [ value alignment ] [ Theory of mind ] [ Mental modeling ]

[ Abstract ] [ Project Page ]

[ Poster] [ OpenReview] [ Topia]

Abstract:

Value alignment has been widely argued to be one of the central safety problems in AI. While the problem itself arises from the way humans interact with the AI systems, most current solutions to value alignment tend to sideline the human or make unrealistic assumptions about possible human interactions. In this position paper, we propose a human-centered formalization of the value alignment problem that generalizes human-AI interaction frameworks that were originally developed for explainable AI. We see how such a human-aware formulation of the problem provides us with novel ways of addressing and understanding the problem.

Chat is not available.

Poster in Workshop: HCAI@NeurIPS 2022, Human Centered AI

Revisiting Value Alignment Through the Lens of Human-Aware AI

Sarath Sreedharan · Subbarao Kambhampati

Poster
in
Workshop: HCAI@NeurIPS 2022, Human Centered AI