Controlling a robot using voice - Speech recognition module for robots

This article describes how to use the Parallax Say It Voice Recognition Module to control a Boe-Bot robot using your voice. Speech recognition is an important asset for a robot: it lets the robot interact with human beings through their most natural form of communication.

Figure: Voice Recognition Module for Parallax robots

The Say It Module is a board specially produced for Parallax by Veear (http://www.veear.eu).

In this article, we show you how to install the Say It Module on a Boe-Bot robot, how to configure it using the graphical interface supplied, how to test these settings, and finally how to generate BS2 code that triggers actions according to the recognised words, which is after all the ultimate goal. Before introducing the Say It Module, however, a quick recap of the fundamentals of speech recognition may prove useful.

For an introduction to the Parallax Boe-Bot robot, please refer to this article.

The fundamentals of speech recognition

Introduction and operation

Automatic speech recognition, together with speech synthesis, is part of what is known as speech processing.

The aim of speech recognition is to analyse a word or phrase picked up by a microphone and transcribe it as text on a computer (or equivalent) so that it can be used. The main uses of speech recognition are automatic dictation and voice applications over the telephone.

This is a two-phase operation:

1. The acoustic signal is captured and broken down into 30-millisecond segments for analysis. For each segment, an acoustic image is extracted: a vector of the main characteristics of the signal.
2. For each segment, the program then determines the phoneme most likely to correspond to the signal. The phoneme is the smallest unit of spoken language; English has between 44 and 47 phonemes, depending on the dialect. The program computes the probability that the segment matches each phoneme, then combines these probabilities with the pronunciation probabilities of words (a word is a sequence of phonemes, so certain series of phonemes are more likely than others simply because they form words) and with the probability of each word occurring in the target language (some words are more frequent than others).
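
The second phase can be sketched in a few lines of Python. This is a simplified illustration under stated assumptions, not the algorithm used by any particular recogniser: it assumes exactly one segment per phoneme, and the phoneme probabilities, the two-word vocabulary and the word frequencies are all invented for the example.

```python
import math

# Hypothetical output of the first phase: for each segment, the probability
# that it matches each phoneme (values invented for illustration).
segments = [
    {"s": 0.80, "t": 0.10, "o": 0.04, "e": 0.03, "p": 0.03},
    {"s": 0.10, "t": 0.70, "o": 0.10, "e": 0.05, "p": 0.05},
    {"s": 0.02, "t": 0.03, "o": 0.50, "e": 0.40, "p": 0.05},
    {"s": 0.03, "t": 0.03, "o": 0.02, "e": 0.02, "p": 0.90},
]

# Hypothetical pronunciation dictionary: a word is a sequence of phonemes.
pronunciations = {
    "stop": ["s", "t", "o", "p"],
    "step": ["s", "t", "e", "p"],
}

# Hypothetical word frequencies in the target language.
word_priors = {"stop": 0.6, "step": 0.4}


def word_score(word, segs):
    """Log-probability of a word: word frequency times per-segment matches."""
    phonemes = pronunciations[word]
    if len(phonemes) != len(segs):
        # Simplification: one segment per phoneme, so lengths must agree.
        return float("-inf")
    score = math.log(word_priors[word])
    for seg, phoneme in zip(segs, phonemes):
        # A small floor avoids log(0) for phonemes missing from a segment.
        score += math.log(seg.get(phoneme, 1e-6))
    return score


def recognise(segs):
    """Return the word with the highest combined score."""
    return max(pronunciations, key=lambda w: word_score(w, segs))


print(recognise(segments))  # prints "stop"
```

Working with logarithms turns the product of probabilities into a sum, which avoids numerical underflow when many segments are combined.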

To find out more:
http://en.wikipedia.org/wiki/Speech_recognition
Sphinx is an open source speech recognition software application: http://cmusphinx.sourceforge.net

In the context of robotics

Speech recognition is a natural goal in robotics, since speech is one of the most common forms of human communication. Human speech is referred to as natural language. Understanding it is very difficult, as its meaning depends on the context, the speaker, the moment, etc. Although robotics has made a great deal of progress in recent years, it has not yet mastered natural language.

Leaving aside highly advanced laboratory robots such as iCub, or the very impressive progress made in the military sector, the problems encountered in personal robotics are more mundane. Speech recognition on a mobile robot depends not only on the quality of the components but also on the environment (as always in robotics), which can be noisy, cause echoes, or alter how a sound is perceived because of its shape (for example, the robot may perceive a sound differently when it is moving along a wall). As you will see later, controlling a robot using your voice is more complicated than sitting at your desk and dictating to your computer through a microphone.

Installation and configuration of the Say It Module

The Say It Module

The Say It Module is a board measuring 6.3 cm by 1.3 cm that plugs into the application connector. It has a microphone at the top and a very useful LED telling the user when the module is in voice recognition mode. It can be used to record up to 32 commands (configured words).

Connecting the Say It Module to the robot

The Say It Module plugs into the application connector (circled in red on the picture below). The power switch must be at 0 when the board is plugged in.

Be aware that the way you connect the board is important. To avoid mistakes, the Vdd sign (Vdd is the 5V regulated input) is marked at the bottom left on the back of the Say It Module. The pin closest to this sign should therefore be connected to the connector marked Vdd on the application connector (in the bottom left of the diagram above).

Once connected, the Say It Module is in the following position:

Figure: Connecting the Say It Module on the Boe-Bot robot

Once the board is plugged in, download and install the Say It GUI application. This can be obtained by clicking on http://www.parallax.com/Portals/0/Downloads/docs/prod/comm/SayItGUI-1.1.5.zip

Once the software is installed, turn on the power to the Say It board by setting the Boe-Bot power switch to position 1, then launch the Say It GUI software. Specify the COM port to which the Boe-Bot robot is connected and press the Connect button (both circled in red in the figure below). Please note that the Debug Terminal of the BASIC Stamp Editor must be closed when you click the Connect button.

Figure: Connecting to the module using the Say It GUI

If this is the first time you are connecting the Say It Module, it will automatically ask you if you want to download something called a “Bridge” onto your BASIC Stamp. This BASIC Stamp add-on is required to operate the Say It Module. Click on Yes, and leave it to download the Bridge. When it has finished, the progress bar at the top right of the Say It GUI interface will be completely green.

Figure: Loading the Bridge for the Say It speech recognition module from Parallax

All done! You are now ready to start using the module.

If you need any further details, you can download the full documentation for the Say It Module in English at http://www.parallax.com/Portals/0/Downloads/docs/prod/comm/30080_SayItModulev1.2.pdf

A first test

Before changing the settings on the module, we are going to test the words pre-programmed on the module to get an idea of how it is used and of the various concepts we will be needing later.

To carry out the following tests, ensure your robot is connected to your computer using the USB cable and that you have connected the robot in the Say It GUI interface as explained previously in this document. Set the power switch on your robot to position 1 (i.e. board on but motors off).

In the Say It GUI interface, select the first line in the list on the left displaying the word Trigger, then click on the Group Test button as shown in the figure below.

Figure: Test of the speech recognition system in the GUI of the Say It Module from Parallax

When you press this button, the Say It Module is in listening mode for a few seconds and tries to recognise the word or words contained in the selected group (here the word “robot” is the only word configured in the Trigger group). Say the word “robot” close to the Say It board microphone. If it does not recognise the word, it will display a small window with the word “timeout”, indicating that it has not managed to recognise the word “robot” within the allotted time of the listening phase. If, on the other hand, it has recognised the word, it will highlight the word in light green in your interface (a light green that flashes for a few seconds).

You will have noticed that the LED on the Say It board lights up when the robot is in listening mode. This will prove very useful later on.

You can also practise with other groups, such as wordset group no. 1 (which contains eight words) or the other two wordset groups (which contain six and eleven words).

A bit more about concepts

In the previous section, we came across the trigger concept as well as groups and wordsets. Let’s look at these in more detail.

Triggers

This is the word used to trigger speech recognition, hence the name. The robot knows that after this word you are going to say a word that will correspond to a command.

Groups

The configured words are commands, as they are meant to trigger an action (we will do this later). Commands are separated into groups.

Wordsets

Wordsets are pre-programmed groups in which words cannot be added or deleted.
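
To make the relationship between these three concepts concrete, here is a minimal Python model of them. It is only a sketch for illustration: the class names `Group` and `SayItModel` are hypothetical and are not part of the Say It software, the wordset entries are placeholders, and counting the 32-command limit across all user groups is an assumption.

```python
from dataclasses import dataclass, field
from typing import List

# The module stores up to 32 commands; counting them across all
# user-editable groups is an assumption made for this sketch.
MAX_COMMANDS = 32


@dataclass
class Group:
    name: str
    commands: List[str] = field(default_factory=list)
    read_only: bool = False  # True for wordsets, which cannot be edited


class SayItModel:
    """Hypothetical in-memory picture of the module's configuration."""

    def __init__(self):
        self.groups = {}

    def add_group(self, name, commands=(), read_only=False):
        self.groups[name] = Group(name, list(commands), read_only)

    def user_command_count(self):
        return sum(len(g.commands) for g in self.groups.values()
                   if not g.read_only)

    def add_command(self, group_name, label):
        group = self.groups[group_name]
        if group.read_only:
            raise PermissionError(f"{group_name} is a wordset")
        if self.user_command_count() >= MAX_COMMANDS:
            raise OverflowError("no room left for another command")
        group.commands.append(label)


model = SayItModel()
model.add_group("Trigger", ["ROBOT"])
model.add_group("Group 1")
# Placeholder entries: the real wordset contents are fixed by the module.
model.add_group("Wordset 1", ["WORD_1", "WORD_2"], read_only=True)

for label in ["FORWARD", "STOP", "REVERSE", "RIGHT", "LEFT"]:
    model.add_command("Group 1", label)

print(model.groups["Group 1"].commands)
```

Trying `model.add_command("Wordset 1", "JUMP")` raises `PermissionError`, mirroring the fact that words cannot be added to or deleted from a wordset.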

Configuring your own commands

To add a new command, proceed as follows:

1. Select the group to which you wish to add the new command.
2. Click on the Add Command button, which resembles a cartoon speech bubble with a green cross on it.
3. Enter the name of the command in the line that has just been added to the right-hand window. The name can be any string.
4. Select the line you have just added by clicking on it (it should be highlighted in blue).
5. Click on the Command Training button, which resembles a cartoon speech bubble with a cog wheel on it. The following window opens:
6. Click on the Phase 1 button and say the word you wish to save once.
7. Click on the Phase 2 button (which will have replaced the Phase 1 button) and repeat the word corresponding to the command.

That’s it, your command is now configured. To test it, use the Group Test button as before. If you are not satisfied, start the configuration process again. Commands are deleted using the Remove Command button.

Now that you know how to add a command, you can populate your own groups.

Generating BS2 code

You have already seen that the Say It Module can recognise commands, and that we can also add commands. Let’s now see how to tell the robot to perform an action when it recognises a command.

The Say It GUI allows you to generate BS2 code (PBASIC) using the Generate Code button (the one showing a magician’s wand on a white sheet of paper).

Generating and testing code

In our example, we have added the following five commands to group 1:

FORWARD
STOP
REVERSE
RIGHT
LEFT

The interface asks you to name the generated file and choose where to save it. Once you have done this, you can close the Say It GUI and open the BASIC Stamp Editor, the Boe-Bot programming environment.

The generated code is quite long. Before testing it, you must change one parameter: VRGROUP. Its default value is 0, which means the program expects to recognise commands from group 0. In our example, we have added the five commands to group 1, so the VRGROUP value must be 1.

In the same way, in order for our test to be more relevant, we have modified part of the code as follows. Although this modification is not essential, it makes the test more eloquent (no pun intended!).

Figure: the generated code before the change and after the change

Run the modified program and test the various commands one after the other. The Debug Terminal should look like this:

The first line contains the word “error”, indicating that the module did not recognise any configured command during the listening phase. This shows how the generated code works: the listening phase runs in a loop, and each pass lasts five seconds. To change this duration, change the value of the VRA1 parameter in line 81 of the generated code.
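
The behaviour of this loop can be mimicked on a PC with a short Python sketch. It is not the generated PBASIC code: the queue stands in for the module, the function names are hypothetical, and the exact messages written to the terminal are assumptions.

```python
import queue

LISTEN_TIMEOUT_S = 5.0  # length of one listening phase, as described above


def listen_once(recognised, timeout_s=LISTEN_TIMEOUT_S):
    """Wait up to timeout_s for a recognised command; None means timeout."""
    try:
        return recognised.get(timeout=timeout_s)
    except queue.Empty:
        return None


def run(recognised, rounds, timeout_s=LISTEN_TIMEOUT_S):
    """Run several listening phases and collect one terminal line per phase."""
    lines = []
    for _ in range(rounds):
        command = listen_once(recognised, timeout_s)
        lines.append("error" if command is None else command)
    return lines


# Short timeouts keep the demonstration fast.
demo = queue.Queue()
print(run(demo, rounds=1, timeout_s=0.1))  # nothing was said: ['error']
demo.put("FORWARD")
print(run(demo, rounds=1, timeout_s=0.1))  # ['FORWARD']
```

A longer timeout gives the speaker more time but makes the program slower to report a failed recognition, which is the trade-off behind adjusting the listening duration.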

Adding robot commands

We have added a series of routines at the end of the final code allowing the robot to move forwards, backwards, etc. We have also added calls to these routines in the piece of code modified earlier, as shown in the following figure:

The code now allows you to control the robot using your voice! The complete code can be downloaded by clicking on this link.
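
The link between a recognised command and a movement routine can be pictured as a lookup table. The Python sketch below is an illustration, not the code from the download: it assumes the usual Boe-Bot arrangement of two continuous-rotation servos (left and right), with pulse widths expressed in BASIC Stamp 2 PULSOUT units of 2 µs. The values 650, 750 and 850 are the customary Boe-Bot figures and are assumptions here; a real robot may need calibrating.

```python
# Pulse widths in BS2 PULSOUT units (2 µs each). 750 units = 1.5 ms, which
# holds a centred continuous-rotation servo still. Each entry is
# (left servo, right servo); the values are assumptions, not taken from
# the article's code.
ROUTINES = {
    "FORWARD": (850, 650),
    "REVERSE": (650, 850),
    "LEFT": (650, 650),
    "RIGHT": (850, 850),
    "STOP": (750, 750),
}


def pulses_for(command):
    """Return the servo pulses for a command; unknown commands stop the robot."""
    return ROUTINES.get(command, ROUTINES["STOP"])


def pulse_ms(units):
    """Convert PULSOUT units (2 µs each) to milliseconds."""
    return units * 2 / 1000


for name in ["FORWARD", "LEFT", "STOP", "DANCE"]:
    left, right = pulses_for(name)
    print(name, pulse_ms(left), pulse_ms(right))
```

Falling back to STOP for anything unrecognised is a deliberate safety choice: a misheard word should leave the robot stationary rather than moving.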

To go further, example code is available from Parallax at http://www.parallax.com/Portals/0/Downloads/docs/prod/comm/SayItExampleCode.zip. In particular, it allows you to test the trigger concept.

All that remains for you to do now is test out this speech recognition module!

Generation Robots (http://www.generationrobots.co.uk)

Explicit prior permission is required for any use or reproduction
