Speech and language processing#

Your robot can recognize speech, handle dialogues, and synthesise voice in several languages. It is fully compliant with the ROS4HRI REP-155 ROS standard.

The entire speech pipeline runs on-board; no cloud-based services are used (and consequently, no Internet connection is required).

The following figure provides complete picture of the speech processing pipeline on PAL robots.

The main building blocks are:

the robot’s reSpeaker microphone;
the Vosk speech recognizer. Learn more about speech recognition on PAL robots;
a node monitoring incoming speech for wake-up/sleep keywords;
the RASA dialogue manager. Learn more about dialogue management;
the acapela speech synthesis library. Learn more about speech synthesis.

General documentation#

Dialogue management
Wake-up word detector
Internationalisation and language support

Tutorials#

Create, translate or update a chatbot
Trigger custom behaviours from the chatbot

How-to#

Speech synthesis (TTS)
How-to: Automatic Speech Recognition (ASR)

FAQ#

How to change ARI’s language?
What languages are supported by ARI?
How to change the ‘wake-up’ (or ‘sleep’) keywords?
What is the default wake-up word?

References#

Speech and language processing capabilities
ASR, TTS and dialogue management APIs
Default chit-chat/smalltalk capabilities of PAL’s robots