Ensemble Policies for Diverse Query-Generation in Preference Alignment of Robot Navigation

Publication Authors J. de Heuvel; F. Seiler; M. Bennewitz
Published in IEEE International on Human & Robot Interactive Communication (RO-MAN)
Year of Publication 2024
Abstract

To align mobile robot navigation policies with user preferences through

reinforcement learning from human feedback (RLHF), reliable and

behavior-diverse user queries are required. However, deterministic policies

fail to generate a variety of navigation trajectory suggestions for a given

navigation task configuration. We introduce EnQuery, a query generation

approach using an ensemble of policies that achieve behavioral diversity

through a regularization term. For a given navigation task, EnQuery produces

multiple navigation trajectory suggestions, thereby optimizing the efficiency

of preference data collection with fewer queries. Our methodology demonstrates

superior performance in aligning navigation policies with user preferences in

low-query regimes, offering enhanced policy convergence from sparse preference

queries. The evaluation is complemented with a novel explainability

representation, capturing full scene navigation behavior of the mobile robot in

a single plot.

Type of Publication Article
Lead Image No image
Lead Image Caption
Text
Images
Teaser Image 1
Teaser Image 2
Files and Media
Local Video File
Local PDF File
Settings
Versioning enabled yes
Short name deheuvel24roman
Layout
Blocks { "029f2e27-a19c-48cb-aae1-206c5e99311d": { "@type": "slate", "value": [ { "type": "p", "children": [ { "text": "" } ] } ], "plaintext": "" } }
Blocks Layout { "items": [ "029f2e27-a19c-48cb-aae1-206c5e99311d" ] }
Options
Categorization
Related Items
Contents

There are currently no items in this folder.