
A tool for real time Keyword Spotting in phone conversations.
The keyword spotting technology (KWS) enables the detection of word occurrences in audio or video streams. The detection consists in computing a score for word utterances. When the score exceeds a threshold, the time window in which the word is detected is stored. The result of KWS is a list of word occurrences.
KWS requires less resources than Automatic Speech Recognition (ASR) and can be easily integrated in embedded systems. Furthermore, KWS works better in low resolution / noisy audio files.
The area of application for KWS is given by developing more complex functionalities starting from the base functionality. The KWS can follow the utterance of a specific word or a specific set of words in any or given order by time line aligning the results obtained by searching a set of words in the same audio stream. On the other hand, each word or combination of words can trigger an action like record the occurrence to be later use or start a specific task.