Tutorial: Parrot the humanΒΆ

This is the first of a series of tutorials on how to use the communication subsystem of the robot. We will have code examples of increasing complexity, progressively leveraging additional features of the communication subsystem.

This tutorial is about how to program a simple parrot-like behavior.

🏁 Goal of this tutorial

By the end of this tutorial, you will know how to:

  • create a simple ROS 2 package interacting with the communication subsystem

  • use the ASR (automatic speech recognition) system to recognize human speech

  • use the How-to: Speech synthesis (TTS) capability to reply to the human

Pre-requisitesΒΆ

You should first have read how to:

How it worksΒΆ

The parrot_tutorial example node below shows how to subscribe to the audio transcription provided by ASR, and reply with the same sentence. To utter a sentence, we use the robot's How-to: Speech synthesis (TTS) capability, calling the /say action.

#!/usr/bin/env python3
# -*- coding: utf-8 -*-
from communication_skills.action import Say
from hri_msgs.msg import LiveSpeech
import rclpy
from rclpy.action import ActionClient
from rclpy.callback_groups import MutuallyExclusiveCallbackGroup
from rclpy.executors import ExternalShutdownException, MultiThreadedExecutor
from rclpy.node import Node
from rclpy.task import Future

# This demo subscribes to the speech-to-text output and triggers TTS
# with a reply.


class ASRDemo(Node):
    def __init__(self):
        super().__init__("asr_tutorial")

        self.asr_sub = self.create_subscription(
            LiveSpeech,
            '/humans/voices/anonymous_speaker/speech',
            self.asr_result,
            1)

        self.say_client = ActionClient(
            self,
            Say,
            "/say",
            callback_group=MutuallyExclusiveCallbackGroup())
        self.say_client.wait_for_server()

        self.tts_goal_future_handle = Future()

        self.get_logger().info("ASR demo ready")

    def asr_result(self, msg: LiveSpeech):

        # the LiveSpeech message has two main fields: incremental and final.
        # 'incremental' is updated as soon as a word is recognized, and
        # will change while the sentence recognition progresses.
        # 'final' is only set at the end, when a full sentence is
        # recognized.
        sentence = msg.final

        if not sentence:
            # no complete sentence recognized yet
            return

        self.get_logger().info("Understood sentence: " + sentence)

        goal = Say.Goal()
        goal.meta.priority = 1
        if sentence == "hello":
            goal.input = "Hello!"
        elif sentence == "how are you":
            goal.input = "I am feeling great"
        elif sentence == "goodbye":
            goal.input = "See you!"
        else:
            # parrot back anything else we heard
            goal.input = sentence

        self.say_client.send_goal(goal)


if __name__ == "__main__":
    rclpy.init()
    node = ASRDemo()
    executor = MultiThreadedExecutor()
    executor.add_node(node)
    try:
        executor.spin()
    except (KeyboardInterrupt, ExternalShutdownException):
        node.destroy_node()
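The reply-selection logic in asr_result can also be factored into a small pure function, which makes it easy to unit-test without a running ROS graph. The sketch below is our own refactoring (the name pick_reply is a hypothetical helper, not part of the PAL API); it keeps the canned replies and falls back to parroting the sentence, matching the tutorial's goal:

```python
# Sketch: the reply-selection logic, isolated from ROS.
# CANNED_REPLIES and pick_reply are our own names for this example.
CANNED_REPLIES = {
    "hello": "Hello!",
    "how are you": "I am feeling great",
    "goodbye": "See you!",
}


def pick_reply(sentence: str) -> str:
    """Return the canned reply if we know one, else parrot the sentence."""
    return CANNED_REPLIES.get(sentence, sentence)


print(pick_reply("hello"))         # -> Hello!
print(pick_reply("nice weather"))  # -> nice weather (parroted back)
```

Keeping the mapping in plain Python like this also makes it trivial to extend with more replies later, without touching the ROS plumbing.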

To run this ROS 2 node, you can start a PAL OS Docker container and create a ROS 2 Python package for it as described in Build and run a new ROS package.

Note that, among the imported libraries, we have rclpy (the ROS 2 Python client library) and the PAL libraries communication_skills and hri_msgs. Make sure to declare them all as dependencies in your package.xml:

<?xml version="1.0"?>
<?xml-model href="http://download.ros.org/schema/package_format3.xsd" schematypens="http://www.w3.org/2001/XMLSchema"?>
<package format="3">
  ...
  <depend>communication_skills</depend>
  <depend>hri_msgs</depend>
  <depend>rclpy</depend>
  ...
</package>
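If your package follows the standard ament_python layout, you will also need a setup.py exposing the node. The sketch below assumes a package named parrot_tutorial with the node code in parrot_tutorial/parrot.py; all names here are examples, so adapt them to the package you created:

```python
# Sketch of a matching setup.py for an ament_python package.
# The package and module names (parrot_tutorial, parrot) are examples.
from setuptools import setup

package_name = 'parrot_tutorial'

setup(
    name=package_name,
    version='0.0.1',
    packages=[package_name],
    data_files=[
        ('share/ament_index/resource_index/packages',
         ['resource/' + package_name]),
        ('share/' + package_name, ['package.xml']),
    ],
    install_requires=['setuptools'],
    entry_points={
        'console_scripts': [
            # exposes the node as: ros2 run parrot_tutorial parrot
            'parrot = parrot_tutorial.parrot:main',
        ],
    },
)
```

Note that a console_scripts entry point calls a main() function: if your script only has an `if __name__ == "__main__":` block, either wrap that block in a main() function, or run the script directly with python3 instead.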

Then, you can either:

Next stepsΒΆ

The next tutorial is yet to be written, check back later!