Low-level grounding in a multimodal mobile service robot conversational system using graphical models

Prodanov, Plamen; Drygajlo, Andrzej; Richiardi, Jonas; Alexander, Anil

Informations

Fulltext

Low-level grounding in a multimodal mobile service robot conversational system using graphical models

Prodanov, Plamen ; Drygajlo, Andrzej ; Richiardi, Jonas ; Alexander, Anil

In: Intelligent Service Robotics, 2008, vol. 1, no. 1, p. 3-26

Ajouter à la liste personnelle

Titre

Low-level grounding in a multimodal mobile service robot conversational system using graphical models

Auteur

Prodanov, Plamen. Perceptual Artificial Intelligence Laboratory, Signal Processing Institute, Swiss Federal Institute of Technology Lausanne (EPFL), Lausanne, Switzerland
Drygajlo, Andrzej. Perceptual Artificial Intelligence Laboratory, Signal Processing Institute, Swiss Federal Institute of Technology Lausanne (EPFL), Lausanne, Switzerland
Richiardi, Jonas. Perceptual Artificial Intelligence Laboratory, Signal Processing Institute, Swiss Federal Institute of Technology Lausanne (EPFL), Lausanne, Switzerland
Alexander, Anil. Clarifying Technologies Ltd, Oxford, UK

Type de document

Postprint

Langue

Anglais

Publié dans

Intelligent Service Robotics, 2008, vol. 1, no. 1, p. 3-26. Springer-Verlag

Autre version électronique

Publisher's version : https://doi.org/10.1007/s11370-006-0001-9

Classification

Mécanique

Mots clés

Service robots ; Spoken interaction ; Grounding ; Bayesian networks ; Efficient inference

Identifiant OAI-PMH

oai:doc.rero.ch:315603

Summary

The main task of a service robot with a voice-enabled communication interface is to engage a user in dialogue providing an access to the services it is designed for. In managing such interaction, inferring the user goal (intention) from the request for a service at each dialogue turn is the key issue. In service robot deployment conditions speech recognition limitations with noisy speech input and inexperienced users may jeopardize user goal identification. In this paper, we introduce a grounding state-based model motivated by reducing the risk of communication failure due to incorrect user goal identification. The model exploits the multiple modalities available in the service robot system to provide evidence for reaching grounding states. In order to handle the speech input as sufficiently grounded (correctly understood) by the robot, four proposed states have to be reached. Bayesian networks combining speech and non-speech modalities during user goal identification are used to estimate probability that each grounding state has been reached. These probabilities serve as a base for detecting whether the user is attending to the conversation, as well as for deciding on an alternative input modality (e.g., buttons) when the speech modality is unreliable. The Bayesian networks used in the grounding model are specially designed for modularity and computationally efficient inference. The potential of the proposed model is demonstrated comparing a conversational system for the mobile service robot RoboX employing only speech recognition for user goal identification, and a system equipped with multimodal grounding. The evaluation experiments use component and system level metrics for technical (objective) and user-based (subjective) evaluation with multimodal data collected during the conversations of the robot RoboX with users

Low-level grounding in a multimodal mobile service robot conversational system using graphical models

Prodanov, Plamen ; Drygajlo, Andrzej ; Richiardi, Jonas ; Alexander, Anil

In: Intelligent Service Robotics, 2008, vol. 1, no. 1, p. 3-26

Voir aussi

Exporter vers

Low-level grounding in a multimodal mobile service robot conversational system using graphical models

Prodanov, Plamen ; Drygajlo, Andrzej ; Richiardi, Jonas ; Alexander, Anil

In: Intelligent Service Robotics, 2008, vol. 1, no. 1, p. 3-26

Voir aussi

Liens

Partager

Exporter vers