Language Identifier
A language identifier is part of the grammar header that specifies the language to be used when performing decodes.
The format of the language identifier follows the convention set out by RFC 3066. The identifier is a language and country pair, such as "en-US" for United States English.
The following languages are supported. As new languages and dialects are added over time, refer to the Languages Available Overview article for the most current list — specifically the Grammar ASR column for languages supported when working with grammars.
The platform supports the following languages:
Language Code | Description |
da-DK | Danish acoustic model and dictionary |
de-CH | Swiss German acoustic model and dictionary |
de-DE | German acoustic model and dictionary |
en-AU | Australian English acoustic model and dictionary |
en-GB | U.K. English acoustic model and dictionary |
en-IN | Indian English acoustic model and dictionary |
en-US | American English acoustic model and dictionary |
es-CO | South American Spanish acoustic model and dictionary |
es-MX | Mexican Spanish acoustic model and dictionary |
es-US | North American Spanish acoustic model and dictionary |
fr-CA | French Canadian acoustic model and dictionary |
fr-FR | European French acoustic model and dictionary |
it-IT | Italian acoustic model and dictionary |
ja-JP | Japanese acoustic model and dictionary |
ko-KO | Korean acoustic model and dictionary |
nl-NL | Dutch acoustic model and dictionary |
no-NO | Norwegian acoustic model and dictionary |
pt-BR | Brazilian Portuguese acoustic model and dictionary |
pt-PT | European Portuguese acoustic model and dictionary |
ru-RU | Russian acoustic model and dictionary |
sv-SE | Swedish acoustic model and dictionary |
zh-CMN | Mandarin acoustic model and dictionary |
zh-CN | Cantonese acoustic model and dictionary |
To specify the language in a grammar, use the following syntax:
ABNF
language en-US;
GrXML
<grammar language="en-US" ... >
Digits-only Grammars
A non-standard language type with a suffix of "-di" was previously used to indicate that a language included digits only. This functionality is now obsolete — the Speech Engine automatically determines when it should use a digits-only acoustic model. Any language identifier with a "-di" suffix will be ignored by the Engine.
