Language Identifier

A language identifier is part of the grammar header that specifies the language to be used when performing decodes.

The format of the language identifier follows the convention set out by RFC 3066. The identifier is a language and country pair, such as "en-US" for United States English.

The following languages are supported. As new languages and dialects are added over time, refer to the Languages Available Overview article for the most current list — specifically the Grammar ASR column for languages supported when working with grammars.

The platform supports the following languages:

Language Code

Description

da-DK

Danish acoustic model and dictionary

de-CH

Swiss German acoustic model and dictionary

de-DE

German acoustic model and dictionary

en-AU

Australian English acoustic model and dictionary

en-GB

U.K. English acoustic model and dictionary

en-IN

Indian English acoustic model and dictionary

en-US

American English acoustic model and dictionary

es-CO

South American Spanish acoustic model and dictionary

es-MX

Mexican Spanish acoustic model and dictionary

es-US

North American Spanish acoustic model and dictionary

fr-CA

French Canadian acoustic model and dictionary

fr-FR

European French acoustic model and dictionary

it-IT

Italian acoustic model and dictionary

ja-JP

Japanese acoustic model and dictionary

ko-KO

Korean acoustic model and dictionary

nl-NL

Dutch acoustic model and dictionary

no-NO

Norwegian acoustic model and dictionary

pt-BR

Brazilian Portuguese acoustic model and dictionary

pt-PT

European Portuguese acoustic model and dictionary

ru-RU

Russian acoustic model and dictionary

sv-SE

Swedish acoustic model and dictionary

zh-CMN

Mandarin acoustic model and dictionary

zh-CN

Cantonese acoustic model and dictionary

To specify the language in a grammar, use the following syntax:

ABNF

language en-US;

GrXML

<grammar language="en-US" ... >

Digits-only Grammars

A non-standard language type with a suffix of "-di" was previously used to indicate that a language included digits only. This functionality is now obsolete — the Speech Engine automatically determines when it should use a digits-only acoustic model. Any language identifier with a "-di" suffix will be ignored by the Engine.


Was this article helpful?