Determining what TTS markup to use

Last reviewed: 4/2/2012

HOW Article ID: H041202

The information in this article applies to:

  • VoiceMarkupKit 4

Summary

Synthesizers (i.e., speech APIs) support unique markup syntax. By generating TTS markup at runtime, your application can maximize the quality of TTS playback and offer your end users the flexibility of using various synthesizers with your application.

Voices of the same API type and version tend to reproduce speech in the same manner regardless of gender and language. This is because the synthesis engine and the language models are common for the voices.

The easiest implementation that can have the biggest impact you can make on the playback quality is through adjusting the speed (rate), emphasis, and pronunciation. For example, it is usually better to slow down for words with fewer syllables such as digits (1, 2, 3) and speed up for long multisyllable words such as product titles or last names.

The best way to determine ideal TTS markup is to audition a voice with various settings to determine what works best for your application. The Chant Developer Workbench is ideal for that. It provides a TTS markup editing surface that enables you to test text fragments as well as complete SSML documents.

More Information

Synthesizers (i.e., speech APIs) support unique markup syntax. VoiceMarkupKit supports the following synthesizers and their markup syntax:

SynthesizerSpeech APIMarkup Syntax
Cepstral (all languages)Cepstral SwiftW3C SSML
Microsoft SAPI 4 (all languages)SAPI 4SAPI 4 Control Tags
Microsoft SAPI 5 (all languages)SAPI 5SAPI 5 XML Markup, W3C SSML
Microsoft MSP (all languages)MSPSAPI 5 XML Markup, W3C SSML
Nuance RealSpeak Solo (all languages)Nuance RealSpeak SoloL&H Native Control Sequence, SAPI 5 XML Markup
Nuance Vocalizer Auotmotive (all languages)Nuance Vocalizer AuotmotiveL&H Native Control Sequence, SAPI 5 XML Markup
Nuance Vocalizer Mobile (all languages)Nuance Vocalizer MobileL&H Native Control Sequence
Nuance Vocalizer Network (all languages)Nuance Vocalizer NetworkL&H Native Control Sequence, SAPI 5 XML Markup, W3C SSML

To get started, open a new markup document within Chant Developer Workbench development and testing environment:

Marking Up Text
Marking Up Text : Highlight and click. It's that simple to markup text for enhanced speech synthesis.
SSML Editing
SSML Editing: Edit SAPI 4, SAPI 5, and W3C Speech Synthesis Markup Language (SSML) faster with built-in intelliprompt that suggest valid markup syntax.
TTS Playback
TTS Playback: Playback text-to-speech markup with a click of the button. Highlight specific text or playback the entire document.

Learn more about Chant VoiceMarkupKit »