How to Best Monitor Agent Script Adherence with Custom Vocab & Keyword Spotting

by Byron Mathias-Fuqua

IDENTIFYING THE PRESENCE OR ABSENCE OF A WORD OR PHRASE

Many call centers and call center monitoring software need a way to search through recordings to monitor agent script adherence. This allows work force managers to coach staff and business owners to monitor for any customer service issues. A popular question that we get from developers is, how can we use VoiceBase features to identify recordings that end with, “Thank you for calling company XYZ” with a high level of confidence that the entire phrase was present in the recording?

Great question! Here is a quick 3 step process to ensure you spot mandatory phrases:

1. ADD THE PHRASE TO CUSTOM VOCABULARY

Custom Vocabulary is a new pre-processing feature that allows you to specify per file, a set of terms or phrases that are important and should be preferentially recognized when transcribed. Without using our Custom Vocabulary feature you may notice a number of slight differences which will make it annoying to detect the phrase, “Thank you for calling VoiceBase”. For example, instead you may get “Thank you for calling voice space” or“Thank you fore calling VoiceBase” – where both of these phrases would result in false-negatives. By adding the correct phrase to Custom Vocabulary, the speech engine will know what to expect and will correct those similar phrases to be the expected one.

There are two ways to use Custom Vocabulary:

Adhoc: Add words or phrases to a job at upload time on the fly. Use this configuration at upload:

      {
         "configuration": {
              "executor": "v2",
              "transcripts": {
                 "vocabularies": [
                     {
                        "terms" : [
                           "Thank you for calling VoiceBase",
                           "Phrase Two",
                           "Phrase Three"
                        ]
                     }
                 ]
             }
          }
      }

Pre-defined: Create a group of words and phrases beforehand, then reference the group name at upload time. For example, create a group first by making a PUT request to /definitions/transcripts/vocabularies with this JSON body:

       {
           "vocabulary" : {
               "name": "agentScript",
               "terms": [
                           "Thank you for calling VoiceBase", 
                           "Phrase Two",
                           "Phrase Three"
               ]
           }
       }

Then apply that group at upload using this configuration:

{  
   "configuration":{  
      "transcripts":{  
         "engine":"premium",
         "vocabularies":[  
            {  
               "name":"agentScript"
            }
         ]
      }
   }
}

2. ADD THE PHRASE TO A KEYWORD SPOTTING GROUP

By adding the phrases to a Keyword Spotting group VoiceBase will be able to identify occurrences of the phrase and return that in the JSON response. Keyword Spotting groups work similar to Custom Vocabulary groups in the API – first, create a group with phrases, then apply that group at upload time.

To create a keyword spotting group send a PUT request to /definitions/keywords/groups/{groupname}

The body of the PUT request is a JSON object (Content-Type: Application/json) that looks like this:

{  
   "name":"agentScript",
   "keywords":[  
      "Thank you for calling VoiceBase",
      "Phrase Two",
      "Phrase Three"
   ]
}

And to apply that at upload time, use the following configuration:

{  
   "configuration":{  
      "keywords":{  
         "groups":[  
            "agentScript"
         ]
      }
   }
}

Your Custom Vocabulary and Keyword Spotting groups will look similar most of the time. There are some cases for single word spotting where you may be very sensitive to false positives, in that case only using Keyword Spotting is recommended.

3. COLLECT THE JSON RESPONSE

The results will be available in the JSON response at /media/{mediaId} and look like this:

  "groups": [
          {
            "keywords": [
              {
                "t": {
                  "unknown": [
                    2.05
                  ]
                },
                "name": "Thank you for calling VoiceBase"
              }
            ],
            "name": "thankvb",
            "type": "group"
          }
        ]

The 2.05 indicates one occurrence beginning at 2.05 seconds into the recording.

READY, SET, <GET>

Ready to try the VoiceBase speech API? You can check out the docs here, or read more about what’s possible here. Oh, and here are the Top 10 API Commands To Get Started, those might be helpful.

We can’t wait to see what you’ll detect.

Download our E-Book

Byron Mathias-Fuqua

Bryon has one of the most recognizable faces at VoiceBase. He has played a key role as one of our early sales engineers in onboarding many of our enterprise customers. You may have seen him at Twilio’s Signal, DreamForce, Enterprise Connect, or one of our other shows with a new demo to show off! When Bryon disconnects from the speech analytics world you can typically find him in his natural habitat on the beaches of Santa Cruz, California. He is also particularly gifted at the popular 90s craze, Dance Dance Revolution, but tries to keep it “low key”.

More From the Voice analytics blog

What Is Voice of the Customer?

L...

Predictive Analytics for Strategic Insights

Predictive analytics is an advanced form of data mining that leverages machine learning to identify patterns in voice recordings, intuit a speaker’s intent, and predict a future outcome — be it a sale, account cancellation, or one of many customized “X” signals your clients might request.

What Is Wrap-Up Time? 7 Ways to Reduce It

C...

custom vocabulary keyword spotting speech analytics