Speech Synthesis

Google Speech Synthesis

Price per Channel

$30.00

By using the Google Speech Synthesis (GSS) plugin for UniMRCP Server, IVR platforms can use the Google Cloud Text-to-Speech API via the industry-standard Media Resource Control Protocol (MRCP), versions 1 and 2.
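To make the MRCP side concrete, the sketch below assembles an MRCPv2 SPEAK request of the kind an IVR platform's MRCP client sends to a speech synthesizer resource. It follows the general message layout of RFC 6787. The function name `build_speak_request`, the channel identifier, and the request ID are hypothetical placeholders, and nothing here is specific to the GSS plugin; in practice the IVR platform's MRCP stack generates this message.

```python
def build_speak_request(channel_id: str, request_id: int, ssml: str) -> bytes:
    """Assemble an MRCPv2 SPEAK request (illustrative sketch, not plugin code)."""
    body = ssml.encode("utf-8")
    headers = (
        f"Channel-Identifier: {channel_id}\r\n"
        "Content-Type: application/ssml+xml\r\n"
        f"Content-Length: {len(body)}\r\n"
        "\r\n"
    ).encode("ascii")

    # The message-length field in the start line counts the whole message,
    # including the start line itself, so iterate until the value is stable.
    length = 0
    while True:
        start_line = f"MRCP/2.0 {length} SPEAK {request_id}\r\n".encode("ascii")
        total = len(start_line) + len(headers) + len(body)
        if total == length:
            return start_line + headers + body
        length = total


if __name__ == "__main__":
    # Placeholder channel identifier and request ID, chosen for illustration.
    message = build_speak_request(
        "32AECB23433802@speechsynth", 543257, "<speak>Hello, world.</speak>"
    )
    print(message.decode("utf-8"))
```

The loop is needed because the length field's own digit count contributes to the total it reports.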

The Google Cloud Text-to-Speech API synthesizes natural-sounding speech and provides the following main features.

Multilingual

Supports 32 voices in 12 languages and variants, with more to come soon.

WaveNet Voices

Exclusive access to DeepMind WaveNet voices that provide the most natural-sounding speech.

Text and SSML support

Customize your speech with SSML tags that allow you to add pauses, numbers, date and time formatting, and other pronunciation instructions.
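As an illustration of the SSML tags mentioned above, here is a minimal Python sketch that builds a document with a pause, a number, and a date. The `break` and `say-as` elements are standard SSML; the function name `build_ssml`, the sample sentence, and the chosen `interpret-as` and `format` values are assumptions made for this example, and the exact set of supported values depends on the synthesis engine.

```python
import xml.etree.ElementTree as ET


def build_ssml(greeting: str, amount: str, date_yyyymmdd: str) -> str:
    """Build an SSML document with a pause, a number, and a date (sketch)."""
    speak = ET.Element("speak")
    speak.text = greeting

    # Half-second pause after the greeting.
    pause = ET.SubElement(speak, "break", {"time": "500ms"})
    pause.tail = "Your balance is "

    # Ask the engine to read the digits as a cardinal number.
    number = ET.SubElement(speak, "say-as", {"interpret-as": "cardinal"})
    number.text = amount
    number.tail = " as of "

    # Ask the engine to read the digits as a date.
    date = ET.SubElement(
        speak, "say-as", {"interpret-as": "date", "format": "yyyymmdd"}
    )
    date.text = date_yyyymmdd
    date.tail = "."

    # ElementTree escapes characters such as "&" and "<" in the text.
    return ET.tostring(speak, encoding="unicode")


if __name__ == "__main__":
    print(build_ssml("Hello.", "42", "20240131"))
```

Building the document with an XML library rather than string concatenation keeps caller-supplied text from breaking the markup.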

Speaking Rate Tuning

Set the speaking rate up to 4x faster or 4x slower than the normal rate.

Pitch Tuning

Customize the pitch of your selected voice, up to 20 semitones more or less than the default output.

Volume Gain Control

Increase the output volume by up to 16 dB or decrease it by up to 96 dB.
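The three tuning ranges described above (speaking rate, pitch, and volume gain) can be sketched as a small validation helper. This is an illustrative sketch only: the class `VoiceTuning` and its methods are hypothetical, the dictionary keys mirror the `AudioConfig` field names of the Cloud Text-to-Speech REST API, and how the GSS plugin itself exposes these settings is not covered here.

```python
from dataclasses import dataclass

# Ranges taken from the feature descriptions: rate 0.25x to 4x,
# pitch +/-20 semitones, gain -96 dB to +16 dB.
RATE_RANGE = (0.25, 4.0)
PITCH_RANGE = (-20.0, 20.0)
GAIN_RANGE = (-96.0, 16.0)


def clamp(value: float, low: float, high: float) -> float:
    """Limit value to the closed interval [low, high]."""
    return max(low, min(high, value))


@dataclass(frozen=True)
class VoiceTuning:
    """Hypothetical container for the three tuning parameters."""

    speaking_rate: float = 1.0    # 1.0 is the normal rate
    pitch_semitones: float = 0.0  # 0.0 is the voice's default pitch
    volume_gain_db: float = 0.0   # 0.0 leaves the volume unchanged

    def to_audio_config(self) -> dict:
        """Return a dict shaped like an AudioConfig, clamped to valid ranges."""
        return {
            "speakingRate": clamp(self.speaking_rate, *RATE_RANGE),
            "pitch": clamp(self.pitch_semitones, *PITCH_RANGE),
            "volumeGainDb": clamp(self.volume_gain_db, *GAIN_RANGE),
        }


if __name__ == "__main__":
    # Out-of-range values are pulled back to the nearest limit.
    print(VoiceTuning(8.0, -30.0, 20.0).to_audio_config())
```

Clamping is one possible policy; rejecting out-of-range values with an error would be an equally reasonable design.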

Watson Speech Synthesis

Price per Channel

$30.00

By using the Watson Speech Synthesis (SS) plugin for UniMRCP Server, IVR platforms can use the IBM Watson Text to Speech API via the industry-standard Media Resource Control Protocol (MRCP), versions 1 and 2.

The IBM Watson Text to Speech API converts text to speech and supports the following main features.

Human Sounding Speech

Text is instantly synthesized into human-sounding speech and can be used for real-time conversion.

Languages

The text to speech API supports a variety of languages.

Voice Output Parameters

Through its SSML 1.0 support, the API lets you adjust characteristics of the generated voice output such as speaking rate, volume, and pronunciation.
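The SSML 1.0 elements behind these adjustments are `prosody` (rate and volume) and `phoneme` (pronunciation). The Python sketch below shows one way to generate them; the function name `prosody_ssml`, the sample word, and its IPA transcription are assumptions for illustration, and a given engine may accept only a subset of the standard attribute values.

```python
import xml.etree.ElementTree as ET

# Named rate and volume labels defined by SSML 1.0.
RATES = {"x-slow", "slow", "medium", "fast", "x-fast", "default"}
VOLUMES = {"silent", "x-soft", "soft", "medium", "loud", "x-loud", "default"}


def prosody_ssml(text: str, word: str, ipa: str,
                 rate: str = "medium", volume: str = "medium") -> str:
    """Wrap text in <prosody> and append a word with an explicit pronunciation."""
    if rate not in RATES:
        raise ValueError(f"unsupported rate label: {rate}")
    if volume not in VOLUMES:
        raise ValueError(f"unsupported volume label: {volume}")

    speak = ET.Element("speak", {"version": "1.0"})
    prosody = ET.SubElement(speak, "prosody", {"rate": rate, "volume": volume})
    prosody.text = text

    # The phoneme element overrides how the enclosed word is pronounced.
    phoneme = ET.SubElement(prosody, "phoneme", {"alphabet": "ipa", "ph": ipa})
    phoneme.text = word
    return ET.tostring(speak, encoding="unicode")


if __name__ == "__main__":
    print(prosody_ssml("You say ", "tomato", "təˈmeɪtoʊ", rate="slow", volume="loud"))
```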

Voices

The text to speech API supports a variety of different male and female voices.

Polly Speech Synthesis

Price per Channel

$30.00

By using the Amazon Web Services (AWS) Polly plugin for UniMRCP Server, IVR platforms can use the AWS Polly Text-to-Speech API via the industry-standard Media Resource Control Protocol (MRCP), versions 1 and 2.

AWS Polly is a Text-to-Speech service that uses advanced deep learning technologies to synthesize speech that sounds like a human voice.

High Quality

Polly uses best-in-class Text-to-Speech (TTS) technology to synthesize natural speech with high pronunciation accuracy (including abbreviations, acronym expansions, date/time interpretations, and homograph disambiguation).

Low Latency

Polly ensures fast response times, which make it a viable option for low-latency use cases such as dialog systems.

Large Portfolio of Languages and Voices

Polly supports dozens of voices and multiple languages, offering male and female voice options for most languages.

Cloud-based Solution

Text-to-Speech conversion done in the cloud dramatically reduces local resource requirements. This enables support of all the available languages and voices at the best possible quality. Moreover, speech improvements are instantly available to all end-users and do not require additional updates for devices.

Yandex Speech Synthesis

Price per Channel

$30.00

By using the Yandex Speech Synthesis (SS) plugin for UniMRCP Server, IVR platforms can use the Yandex SpeechKit Text to Speech API via the industry-standard Media Resource Control Protocol (MRCP), versions 1 and 2.

The Yandex SpeechKit Text to Speech API converts text to speech and supports the following main features.

Natural-sounding Speech

Yandex SpeechKit composes speech from more than a million individual phonemes, with intonation set by a neural network trained on numerous real-life examples.

Languages

The text to speech API currently supports four languages.

Real-time Synthesis

The API responds quickly enough to allow efficient streaming of audio data.

Voices

The text to speech API supports a variety of different male and female voices.

Azure Speech Synthesis

Price per Channel

$30.00

By using the Azure Speech Synthesis (SS) plugin for UniMRCP Server, IVR platforms can use the Microsoft Azure Speech API via the industry-standard Media Resource Control Protocol (MRCP), versions 1 and 2.

The Microsoft Azure Speech API converts text to speech and supports the following main features.

Human Sounding Speech

Text is instantly synthesized into human-sounding speech and can be used for real-time conversion.

Languages

The text to speech API supports a variety of languages.

Voice Output Parameters

Through its SSML 1.0 support, the API lets you adjust characteristics of the generated voice output such as speaking rate, volume, and pronunciation.

Voices

The text to speech API supports a variety of different male and female voices.

Universal Speech Solutions LLC