Speech-based technologies offer the most natural user interface for messaging the human voice. Dramatic advances in Automatic Speech Recognition (ASR) and Text-to-Speech (TTS) enable capabilities such as voice-activated dialing and barge-in (using the voice to navigate during a call), which deliver a more natural user experience while driving the adoption of voice-enabled messaging solutions.
Voice-enabled messaging lets users sort through multiple media messages quickly and easily. Such a system frees users from entering touchtone commands to access their messages and helps them comply with the laws of a growing number of states that mandate hands-free cell phone operation while driving.
BCI Networks a speech-enabled directory dialer application acts as a virtual telephone operator, providing callers with immediate access to company directory information. By using advanced ASR and TTS technologies, Seven does the following:
Asks the caller to say the name of the person to whom they wish to speak
Retrieves the appropriate information from a database
Connects the caller to the correct extension
Speech-enabled directory provides workers with a convenient and efficient way to reach their colleagues whether they are working from the office or a remote location.
Automatic Speech Recognition (ASR)
Automatic speech recognition is rapidly entering the mainstream. Its algorithms provide a speech-enabled messaging system to hear and understand human speech. While early speech applications recognized only a small vocabulary of 20 to 30 words, the accuracy and vocabulary size of new ASR engines has dramatically improved, fueled by refined algorithms, dramatic increases in processing power, and lower costs. Today's speech systems support naturally spoken phrases and do not require prior training. Support for multiple languages is also becoming a standard feature of many systems.
Text-to-speech technology is a computer system's ability to translate text into synthesized speech, and allows email, fax messages, and Web-based text content to be read to human listeners, usually over the phone, in unified messaging and communications systems. Because it uses synthesized rather than digitized speech, TTS eliminates the need for scripting and the studio recording of human subjects, making it very inexpensive. It can also be updated quickly and sounds uniform.