10th February 2021

Dana Shalev of NICE inContact discusses how voice identification can be used to positively impact contact centre performance.
Voice identification, also known as voice recognition or speaker recognition, is a biometric technology that is being adopted by call centers that must authenticate each caller before they can provide service.
Voice identification uses the innate biological characteristics of a person’s voice to create a voiceprint that is unique to that person. Its biometric properties make voice identification difficult to spoof.
It’s also easier for users, who no longer need to remember passwords or the answers to security questions. With voice identification, there is nothing to forget or lose. The authentication is part of the person.
Voice and speech recognition are not the same. Voice identification is about verifying the identity of the speaker. Speech recognition is about understanding the words being spoken. At the outset, digital personal assistants like Siri and Alexa could understand the verbal questions they were asked, but could not verify who was asking the question.
Today, Apple’s Siri uses voice identification technology to recognize the person who is speaking. Similarly, Amazon users can create a personal voice profile and train Alexa to recognize it.
As we can see, the use cases are different yet complementary. Voice identification is used to authenticate the speaker, while speech recognition is used to understand what the speaker is saying.
Voice identification technology captures and measures the physical qualities of a person’s voice when speaking as well as the unique biological parameters that combine to produce that voice.
These vocal qualities include:
A voiceprint is created by capturing the dominant frequencies and tones in spoken words and reducing them to a digital format that collectively creates a unique “voiceprint” for that person.
Voice identification technology relies on a microphone, plus two main methods for capturing a voiceprint (i.e. enrolling a customer) and then using the voiceprint template for real-time authentication.
The easiest voice identification method is Text-Independent, where vocal samples are captured from regular speech that is random and unplanned. Customers can say anything they want for their passphrase.
In advanced voice identification systems, the customer’s voice is captured passively during normal conversation. There is no need to formally speak and repeat a passphrase in order to enroll.
Alternately, voice identification may be Text-Dependent, where the same words or passphrases are used to capture the voiceprint at enrollment and subsequently to authenticate the customer each time.
Text-dependent voice identification and authentication may be static or dynamic. Static authentication uses the same passphrase every time. Dynamic authentication uses a randomized passphrase that is different each time.
For example, customers could be asked to say a number of pre-defined phrases at enrollment and any of these could be used for authentication. Likewise, unique phrases such as random number sequences, could be generated for each customer.
In any case, text-dependent voice identification requires active capture and enrollment of the customer.
Customer service call centers are the natural use case for voice identification. It’s a great way to authenticate customers quickly and reliably without the tedious security and authentication procedures that customers dislike.
Not surprisingly, most customers find voice identification more convenient and less risky than sharing personal IDs such as driver’s license or credit-card numbers over the phone.
The rise of the mobile app is also driving voice identification uses cases. Cyber threats have prompted many applications to require two-factor authentication. A single password is not enough.
Voice identification can be paired with facial recognition to authenticate users of mobile banking, finance, and other high-security apps. Voice identification is ideal for the mobile generation because it only requires a microphone which is standard equipment in most mobile devices.
Voice identification is also a hands-free and presence-free authentication method. Other forms of biometric authentication require the customer’s physical presence to scan a finger, a hand, or a retina, which may not be convenient or possible in a car or on public transport.
On the plus side, voice identification offers many advantages over other forms of user authentication as well as other biometrics:
Voice identification also has some disadvantages, including:
User authentication has become one of the great banes of our modern, connected existence.
Customers forget passwords, can’t remember their answers to security questions, and resent being treated with suspicion when they want to access their own bank account!
Voice identification technology simplifies the authentication process and makes it more secure.