By using the Google Speech Synthesis (GSS) plugin for UniMRCP Server, IVR platforms can use the Google Cloud Text-to-Speech API via the industry-standard Media Resource Control Protocol (MRCP), versions 1 and 2.
The Google Cloud Text-to-Speech API synthesizes natural-sounding speech and provides the following main features.
Supports 32 voices in 12 languages and variants, with more to come soon.
Exclusive access to DeepMind WaveNet voices that provide the most natural-sounding speech.
Customize your speech with SSML tags that allow you to add pauses, numbers, date and time formatting, and other pronunciation instructions.
Customize your speaking rate to be up to 4x faster or slower than the normal rate.
Customize the pitch of your selected voice, up to 20 semitones more or less than the default output.
Increase the volume of the output by up to 16 dB or decrease it by up to 96 dB.
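As a rough illustration of how the rate, pitch, and volume limits listed above fit together, the following Python sketch validates the three settings and assembles a request body in the shape used by the Cloud Text-to-Speech REST API. The field names (`speakingRate`, `pitch`, `volumeGainDb`, `audioEncoding`) reflect that API as commonly documented and should be checked against the current reference; the function and constant names are hypothetical, and the sketch does not show how the GSS plugin itself is configured.

```python
import json

# Limits taken from the feature list above: up to 4x faster or slower,
# +/-20 semitones of pitch, and -96 dB to +16 dB of volume gain.
SPEAKING_RATE_RANGE = (0.25, 4.0)
PITCH_RANGE = (-20.0, 20.0)
VOLUME_GAIN_DB_RANGE = (-96.0, 16.0)


def _check(name, value, bounds):
    """Return value unchanged if it lies within bounds, else raise ValueError."""
    low, high = bounds
    if not low <= value <= high:
        raise ValueError(f"{name}={value} is outside [{low}, {high}]")
    return value


def build_synthesize_body(ssml, language_code="en-US",
                          speaking_rate=1.0, pitch=0.0, volume_gain_db=0.0):
    """Assemble a synthesis request body (hypothetical helper, assumed field names)."""
    return {
        "input": {"ssml": ssml},
        "voice": {"languageCode": language_code},
        "audioConfig": {
            "audioEncoding": "LINEAR16",
            "speakingRate": _check("speaking_rate", speaking_rate, SPEAKING_RATE_RANGE),
            "pitch": _check("pitch", pitch, PITCH_RANGE),
            "volumeGainDb": _check("volume_gain_db", volume_gain_db, VOLUME_GAIN_DB_RANGE),
        },
    }


if __name__ == "__main__":
    body = build_synthesize_body("<speak>Welcome back.</speak>",
                                 speaking_rate=1.25, pitch=-2.0, volume_gain_db=6.0)
    print(json.dumps(body, indent=2))
```

Rejecting out-of-range values before the request is sent keeps configuration mistakes visible on the caller's side rather than surfacing as an API error.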
By using the Watson Speech Synthesis (SS) plugin for UniMRCP Server, IVR platforms can use the IBM Watson Text to Speech API via the industry-standard Media Resource Control Protocol (MRCP), versions 1 and 2.
The IBM Watson Text to Speech API performs text-to-speech conversion and supports the following main features.
Text is instantly synthesized into human-sounding speech and can be used for real-time conversion.
The text to speech API supports a variety of languages.
Through its support for SSML 1.0, the API allows changing certain characteristics of the generated voice output, such as speaking rate, volume, and pronunciation.
The text to speech API supports a variety of different male and female voices.
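To make the SSML 1.0 point above more concrete, here is a minimal Python sketch that builds a small SSML document with a `prosody` element controlling speaking rate and volume, plus an optional `break` for a pause. The helper name `build_ssml` is hypothetical. The sketch omits the `xmlns` and `xml:lang` attributes that the SSML specification defines for the root element, and some services may require them or may support only a subset of SSML elements.

```python
import xml.etree.ElementTree as ET


def build_ssml(text, rate=None, volume=None, pause_ms=None):
    """Wrap text in <speak>, optionally inside <prosody>, optionally followed by <break>.

    rate and volume are SSML 1.0 prosody labels such as "slow" or "loud".
    """
    speak = ET.Element("speak", {"version": "1.0"})

    prosody_attrs = {}
    if rate is not None:
        prosody_attrs["rate"] = rate
    if volume is not None:
        prosody_attrs["volume"] = volume

    # Only add a <prosody> wrapper when at least one attribute was requested.
    target = ET.SubElement(speak, "prosody", prosody_attrs) if prosody_attrs else speak
    target.text = text  # ElementTree escapes &, <, > on serialization

    if pause_ms is not None:
        ET.SubElement(speak, "break", {"time": f"{pause_ms}ms"})

    return ET.tostring(speak, encoding="unicode")


if __name__ == "__main__":
    print(build_ssml("Your order has shipped.", rate="slow", volume="loud", pause_ms=500))
```

Building the markup through an XML library rather than string concatenation means special characters in the prompt text are escaped automatically.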
By using the Amazon Web Services (AWS) Polly plugin for UniMRCP Server, IVR platforms can use the AWS Polly Text-to-Speech API via the industry-standard Media Resource Control Protocol (MRCP), versions 1 and 2.
AWS Polly is a Text-to-Speech service that uses advanced deep learning technologies to synthesize speech that sounds like a human voice.
Polly uses best-in-class Text-to-Speech (TTS) technology to synthesize natural speech with high pronunciation accuracy (including abbreviations, acronym expansions, date/time interpretations, and homograph disambiguation).
Polly ensures fast response times, which make it a viable option for low-latency use cases such as dialog systems.
Polly supports dozens of voices and multiple languages, offering male and female voice options for most languages.
Text-to-Speech conversion done in the cloud dramatically reduces local resource requirements. This enables support of all the available languages and voices at the best possible quality. Moreover, speech improvements are instantly available to all end-users and do not require additional updates for devices.
By using the Yandex Speech Synthesis (SS) plugin for UniMRCP Server, IVR platforms can use the Yandex SpeechKit Text to Speech API via the industry-standard Media Resource Control Protocol (MRCP), versions 1 and 2.
The Yandex SpeechKit Text to Speech API performs text-to-speech conversion and supports the following main features.
Yandex SpeechKit composes speech from more than a million individual phonemes, with intonation set by a neural network trained on numerous real-life examples.
The text to speech API currently supports four languages.
The response time of the API is fast enough to allow an efficient implementation of audio data streaming.
The text to speech API supports a variety of different male and female voices.
By using the Azure Speech Synthesis (SS) plugin for UniMRCP Server, IVR platforms can use the Microsoft Azure Speech API via the industry-standard Media Resource Control Protocol (MRCP), versions 1 and 2.
The Microsoft Azure Speech API performs text-to-speech conversion and supports the following main features.
Text is instantly synthesized into human-sounding speech and can be used for real-time conversion.
The text to speech API supports a variety of languages.
Through its support for SSML 1.0, the API allows changing certain characteristics of the generated voice output, such as speaking rate, volume, and pronunciation.
The text to speech API supports a variety of different male and female voices.
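Whichever plugin is used, the IVR platform asks for synthesis through the MRCP synthesizer method SPEAK. The Python sketch below shows, under stated assumptions, how an MRCP version 2 SPEAK request carrying SSML might be framed: a start line that includes the total message length in bytes, followed by the `Channel-Identifier`, `Content-Type`, and `Content-Length` headers and the body. The function name and the channel identifier value are made up for illustration; in practice the identifier comes from session setup, and MRCP version 1 uses a different framing carried over RTSP that is not shown here.

```python
def build_speak_request(channel_id, request_id, ssml):
    """Frame an MRCPv2 SPEAK request as bytes (illustrative sketch only)."""
    body = ssml.encode("utf-8")
    headers = (
        f"Channel-Identifier: {channel_id}\r\n"
        "Content-Type: application/ssml+xml\r\n"
        f"Content-Length: {len(body)}\r\n"
        "\r\n"
    ).encode("ascii")

    # The start line contains the length of the whole message, which itself
    # depends on how many digits that length has, so iterate until it is stable.
    length = 0
    while True:
        start_line = f"MRCP/2.0 {length} SPEAK {request_id}\r\n".encode("ascii")
        total = len(start_line) + len(headers) + len(body)
        if total == length:
            return start_line + headers + body
        length = total


if __name__ == "__main__":
    # The channel identifier below is a placeholder, not a real session value.
    message = build_speak_request("32AECB23433802@speechsynth", 1,
                                  "<speak>Hello, world.</speak>")
    print(message.decode("utf-8"))
```

An MRCP client library normally handles this framing, so the sketch is mainly useful for reading protocol traces when troubleshooting a plugin deployment.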