Test Method and Test Report
To ensure the DUT performance, some tests can be performed to verify the following parameters:
Wake-up rate
Speech recognition rate
False wake-up rate
Response Accuracy Rate Under Playback
Response time
Test Room Requirement
These tests must be performed in a proper test room. The requirements for this test room include:
Size
Area: no smaller than 4 m * 3.2 m
Height: no lower than 2.3 m
Setup
The floor should be equipped with carpet, the ceiling should be equipped with common acoustic damping materials, and the wall should have 1 to 2 walls with curtains to prevent strong reflection.
Room reverberation time (RT60) within the range of [125, 8k] shall be within 0.2 - 0.7 seconds.
Do not use anechoic chamber.
Background noise: must < 35 dBA, best < 30 dBA
Temperature and humidity: 20±10°C, 50%±20%
Placement of DUT, external noise and voice:
Place the DUT, external noise and voice according the actual use scenario of your DUT.
Note
The RT60, background noise, and the placement of DUT, external noise and voice should be kept the same in all tests.
Test Case Design
When designing test cases, it’s suggested to factor in some or all of the following parameters based on the actual use scenarios of the product. For example,
- Different types of noises
White noise
Human noise
Music
News
… …
Test cases with multiple noise sources can also be added when necessary
- Different noise levels
< 35 dBA
45 dBA
55 dBA
65 dBA
- Different voice levels
54 dBA
59 dBA
64 dBA
- Different SNR
9 dBA
4 dBA
-1 dBA
Espressif Test and Result
In all the tests described in this section, the placement of DUT, external noise and voice can be seen in the diagrams below.
As seen in the diagrams above, place
The DUT 0.75 meters above the ground.
The voice 3 meters away from the DUT and 1.5 meters above the ground.
The external noise 45°C apart from the voice, 2 meters away from the DUT and 1.2 meters above the ground.
The sound pressure meter right above the DUT by 0.75 meters.
Wake-up Rate Test
Wake-up rate: the probability of the DUT correctly wakes up to a wake word.
Espressif’s Wake-up Rate Test and Result
Test Case |
Noise Type |
Noise Decibel |
Voice Decibel |
SNR |
Wake-up Rate |
---|---|---|---|---|---|
1 |
/ |
/ |
59 dBA |
/ |
99% |
2 |
White noise |
55 dBA |
59 dBA |
>= 4 dBA |
99% |
3 |
Human noise |
55 dBA |
59 dBA |
>= 4 dBA |
99% |
Speech Recognition Rate Test
Speech recognition rate: the probability of the DUT correctly recognizes the established command words when the DUT is in the speech recognition state.
Espressif’s Speech Recognition Rate Test and Result
Test Case |
Noise Type |
Noise Decibel |
Voice Decibel |
SNR |
Speech Recognition Rate |
---|---|---|---|---|---|
1 |
/ |
/ |
59 dBA |
/ |
91.5% |
2 |
White noise |
55 dBA |
59 dBA |
>= 4 dBA |
78.25% |
3 |
Human noise |
55 dBA |
59 dBA |
>= 4 dBA |
82.77% |
False Wake-up Rate Test
False wake-up rate: the probability of the DUT incorrectly wakes up to a random word (that is not a wake word).
Espressif’s False Wake-up Rate Test and Result
Test Case |
Noise Type |
Noise Decibel |
Test Duration |
Number of False Wake-up |
---|---|---|---|---|
1 |
Music |
55 dBA |
12 hours |
1 time |
2 |
News |
55 dBA |
12 hours |
1 time |
Response Accuracy Rate Under Playback
Interrupting wake-up rate: the probability of the DUT correctly responds to a wake word or a command word while playing sounds, such as music or TTS. This test is required for products with AEC feature.
Espressif’s Interrupting Wake-up Rate Test and Result
Test Case |
Noise Type |
Noise / Voice Decibel |
SNR |
Wake-up Rate |
Speech Recognition Rate |
---|---|---|---|---|---|
1 |
Music |
69 dBA / 59 dBA |
>= 10 dBA |
100% |
96% |
2 |
TTS |
69 dBA / 59 dBA |
>= 10 dBA |
100% |
96% |
Response Time Test
Response time: the time required for the DUT to respond to a command word. It’s measured as the time duration after a command word and before the DUT starts playing sound (see the diagram below).
Espressif’s Response Time Test and Result
Test Case |
Noise / Voice Decibel |
SNR |
Response Time |
---|---|---|---|
1 |
NA / 59 dBA |
/ |
< 500 ms |