Language Identifier

A language identifier is part of the grammar header that specifies the language to be used when performing decodes.

The format of the language identifier follows the convention set out by RFC 3066. The identifier is a language and country pair, such as "en-US" for United States English.

The following languages are supported. As new languages and dialects are added over time, refer to the Languages Available Overview article for the most current list — specifically the Grammar ASR column for languages supported when working with grammars.

The platform supports the following languages:

Language Code	Description
da-DK	Danish acoustic model and dictionary
de-CH	Swiss German acoustic model and dictionary
de-DE	German acoustic model and dictionary
en-AU	Australian English acoustic model and dictionary
en-GB	U.K. English acoustic model and dictionary
en-IN	Indian English acoustic model and dictionary
en-US	American English acoustic model and dictionary
es-CO	South American Spanish acoustic model and dictionary
es-MX	Mexican Spanish acoustic model and dictionary
es-US	North American Spanish acoustic model and dictionary
fr-CA	French Canadian acoustic model and dictionary
fr-FR	European French acoustic model and dictionary
it-IT	Italian acoustic model and dictionary
ja-JP	Japanese acoustic model and dictionary
ko-KO	Korean acoustic model and dictionary
nl-NL	Dutch acoustic model and dictionary
no-NO	Norwegian acoustic model and dictionary
pt-BR	Brazilian Portuguese acoustic model and dictionary
pt-PT	European Portuguese acoustic model and dictionary
ru-RU	Russian acoustic model and dictionary
sv-SE	Swedish acoustic model and dictionary
zh-CMN	Mandarin acoustic model and dictionary
zh-CN	Cantonese acoustic model and dictionary

To specify the language in a grammar, use the following syntax:

ABNF

language en-US;

GrXML

<grammar language="en-US" ... >

Digits-only Grammars

A non-standard language type with a suffix of "-di" was previously used to indicate that a language included digits only. This functionality is now obsolete — the Speech Engine automatically determines when it should use a digits-only acoustic model. Any language identifier with a "-di" suffix will be ignored by the Engine.

Was this article helpful?