Last reviewed: 3/23/2024 10:54:26 AM
<say-as>
The say-as element specifies synthesizer-specific information about the type of text construct contained within the element to help with rendering the contained text.
<?xml version="1.0"?>
<speak version="1.0"
xmlns="http://www.w3.org/2001/10/synthesis"
xml:lang="en-US">
<voice name="en-US-JennyNeural">
<p>
Your <say-as interpret-as="ordinal"> 1st </say-as> request was for <say-as interpret-as="cardinal"> 1 </say-as> room
on <say-as interpret-as="date" format="mdy"> 10/19/2010 </say-as>, with early arrival at <say-as interpret-as="time" format="hms12"> 12:35pm </say-as>.
</p>
</voice>
</speak>
Attributes
interpret-as
Specifies the content type of the contained text construct as one listed in the table below.
format
Optional. Provides additional information about the precise formatting of the element's text for content types that might have ambiguous formats as one listed in the table below.
details
Optional. Indicates the level of detail to be spoken. For example, this attribute might request that the speech synthesis engine pronounce punctuation marks. There are no standard values defined for detail.
interpret-as | format | Interpretation |
---|---|---|
characters, spell-out | none |
The text is spoken as individual letters (spelled out). The speech synthesis engine pronounces:
As "T E S T."
|
cardinal, number | none |
The text is spoken as a cardinal number. The speech synthesis engine pronounces:
As "There are ten options."
|
ordinal | none |
The text is spoken as an ordinal number. The speech synthesis engine pronounces:
As "Select the third option."
|
number_digit | none |
The text is spoken as a sequence of individual digits. The speech synthesis engine pronounces:
As "1 2 3 4 5 6 7 8 9."
|
fraction | none |
The text is spoken as a fractional number. The speech synthesis engine pronounces:
As "three eighths of an inch."
|
date | dmy, mdy, ymd, ydm, ym, my, md, dm, d, m, y |
The text is spoken as a date. The format attribute specifies the date's format (d=day, m=month, and y=year). The speech synthesis engine pronounces:
As "Today is October nineteenth two thousand sixteen."
|
time | hms12, hms24 |
The text is spoken as a time. The format attribute specifies whether the time is specified by using a 12-hour clock (hms12) or a 24-hour clock (hms24). Use a colon to separate numbers representing hours, minutes, and seconds. Here are some valid time examples: 12:35, 1:14:32, 08:15, and 02:50:45. The speech synthesis engine pronounces:
As "The train departs at four A M."
|
duration | hms, hm, ms |
The text is spoken as a duration. The format attribute specifies the duration's format (h=hour, m=minute, and s=second). The speech synthesis engine pronounces:
As "one hour eighteen minutes and thirty seconds".
As "one minute and eighteen seconds".
This tag is only supported on English and Spanish.
|
telephone | none |
The text is spoken as a telephone number. The speech synthesis engine pronounces:
As "My number is area code eight eight eight five five five one two one two."
|
currency | none |
The text is spoken as a currency. The speech synthesis engine pronounces:
As "ninety-nine US dollars and ninety cents."
|
address | none |
The text is spoken as an address. The speech synthesis engine pronounces:
As "I'm at 150th Court Northeast Redmond Washington."
|
name | none |
The text is spoken as a person's name. The speech synthesis engine pronounces:
As [æd].
|
Children
none
Parents
<audio>, <emphasis>, <p>, <prosody>, <speak>, <s>, and <voice>.
Source: Microsoft Azure Speech Synthesis Markup Language (SSML)