Last reviewed: 3/23/2024 11:31:32 AM

<speak>

A speak element is the required root element of a W3C SSML document. It defines the beginning and end of the text markup.

<?xml version="1.0"?>
<speak version="1.0"
         xmlns="http://www.w3.org/2001/10/synthesis"
         xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
         xsi:schemaLocation="http://www.w3.org/2001/10/synthesis
                   http://www.w3.org/TR/speech-synthesis/synthesis.xsd"
         xml:lang="en-US">
  ... the body ...
</speak>

Attributes

version

Specifies the version of the specification to be used for the document and must have the value "1.0".

xml:base

Specifies a document's base URI explicitly.

xml:lang

Instructs the synthesizer to speak content in the indicated language.

xmlns

The URI of the SSML name space.

xmlns:xsi

The URI of the SSML schema instance.

xsi:schemaLocation

The URI of the SSML schema.

Children

<audio>, <break>, <desc>, <emphasis>, <lexicon>, <mark>, <meta>, <metadata>, <p>, <phoneme>, <prosody>, <say-as>, <sub>, <s>, and <voice>.

Parents

none

Source: Speech Synthesis Markup Language (SSML) Version 1.0