Last reviewed: 3/23/2024 10:57:24 AM
<viseme>
A viseme is the visual description of a phoneme in spoken language. The mstts:viseme element requests viseme output for all input text within its enclosing voice element.
<?xml version="1.0"?>
<speak version="1.0"
xmlns="http://www.w3.org/2001/10/synthesis"
xmlns:mstts="http://www.w3.org/2001/mstts"
xml:lang="en-US">
<voice name="en-US-JennyNeural">
<mstts:viseme type="FacialExpression"/>
Rainbow has seven colors: Red, orange, yellow, green, blue, indigo, and violet.
</voice>
</speak>
Attributes
type
The type of viseme output. One of:
- redlips_front – lip-sync with viseme ID and audio offset output
- FacialExpression – blend shapes output
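The following is a minimal sketch, not part of the SSML reference, of how a client could build a request document with either type value. The function name build_viseme_ssml and the constant VISEME_TYPES are hypothetical names chosen for illustration. The namespaces and default voice are copied from the example above. The sketch only builds the SSML string; sending it to a synthesizer is out of scope.

```python
from xml.sax.saxutils import escape, quoteattr

# The two type values listed in the attribute description above.
VISEME_TYPES = ("redlips_front", "FacialExpression")


def build_viseme_ssml(text, viseme_type="redlips_front",
                      voice="en-US-JennyNeural", lang="en-US"):
    """Return an SSML string that requests viseme output for `text`.

    Hypothetical helper: rejects type values not listed in this reference
    and escapes the text and attribute values so the result stays
    well-formed XML.
    """
    if viseme_type not in VISEME_TYPES:
        raise ValueError(f"unsupported viseme type: {viseme_type!r}")
    return (
        '<speak version="1.0" '
        'xmlns="http://www.w3.org/2001/10/synthesis" '
        'xmlns:mstts="http://www.w3.org/2001/mstts" '
        f'xml:lang={quoteattr(lang)}>'
        f'<voice name={quoteattr(voice)}>'
        # The viseme element is empty; it applies to the text that follows
        # it inside the same voice element.
        f'<mstts:viseme type={quoteattr(viseme_type)}/>'
        f'{escape(text)}'
        '</voice></speak>'
    )


if __name__ == "__main__":
    print(build_viseme_ssml("Rainbow has seven colors."))
```

Calling build_viseme_ssml(text, "FacialExpression") produces the same document with the blend shapes type. Any other value raises ValueError before a request is built.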
Children
none
Parents
voice
Source: Microsoft Azure Speech Synthesis Markup Language (SSML)