Speech Recognition: End-Point Detection
The end point detection technique is applied to extract the region of interest from the raw speech signal. In other words, it removes the silent region from speech signals. The basic technique of end point detection is to find the energy level of a signal. Signal energy level is calculated in frames, where each frame consists of N samples. The frames are usually overlapped with the adjacent frames to produce a smooth energy line. Fig 1 shows the energy plot of “One”.

Fig 1: (a) Amplitude vs time plot of “One” (b) energy level of the signal
Accurate end point detection is important to reduce processing load and increase the accuracy of a speech recognition system. Basically there are two famous endpoint detection algorithms. First algorithm uses signal features based on energy levels and second algorithm uses signal features based on the rate of zero crossings. The combination of both gives good result, but nevertheless increases the complexity of the program and also the processing time.
Fig 2 shows the signal of “one” sampled at 8000Hz for 10650 samples or 1.33 seconds. Before the speech begins, the waveform started as silence for about 5000 samples. After the utterance, the signal remains in silence state again for about 2000 samples. Throwing the unwanted silence region, the processing time can be improved to 3650/10650 * 100 = 34.3% by assuming all the frames in the region of interest have been processed. The energy level of the signal is inspected and a threshold value is determined from the energy plot. Fig 3 shows the cropped signal, where the silence region has been eliminated, and the remaining region of interest are used for further processing.
Fig 2: (a) Original signal, (b) End-point detection by using the energy level of the speech signal

Fig 3: (a) Detected end point, (b) Cropped signal/region of interest

8 Comments:
HI!
I found your blog because I was looking for some information about algorithms to detect endpoints using energy. I would like to know if you have more information about this or if you can tell me where can I found it.
Thanks.
Hi,
it's me again I forgot to leave you my e-mail address: esaucastilloc@gmail.com!!
Thanks again!
Hi - Thanks for your info on end-point recognisition. Can you please let me know more details on end-point detection. I'm doing a project on VR for my engineering and your inputs will be really helpful.
Thanks again,
Ram
i want a matlab code for end point detection by using energy??
thanks alot
do you use neural network toolbox in matlab??
hi, i want Speech Recognition: End-Point Detection code plese sent to super_mon21@hotmail.com
hi..,i want matlab code for end point detection and the crooped signal.....sent to email id:srividyajv20@gmail.com
hye,
may i know more detail about end-point detection.And i want matlab code for end point detection and the crooped signal.....kindly please sent to email id: kids.hasrul@gmail.com
Post a Comment
<< Home