Speech and language processing#
Your robot can recognize speech, handle dialogues, and synthesise voice in several languages. It is fully compliant with the ROS4HRI REP-155 ROS standard.
The entire speech pipeline runs on-board; no cloud-based services are used (and consequently, no Internet connection is required).
The following figure provides complete picture of the speech processing pipeline on PAL robots.
The main building blocks are:
the robot’s reSpeaker microphone;
the Vosk speech recognizer. Learn more about speech recognition on PAL robots;
a node monitoring incoming speech for wake-up/sleep keywords;
the RASA dialogue manager. Learn more about dialogue management;
the acapela speech synthesis library. Learn more about speech synthesis.