lid

Values: false , true , language, language_model

Description:

The lid parameter enables you to use the ASR engine's Language Identification (LID) module to identify the language spoken in the input audio and automatically use an appropriate language model. To force the use of an an alternate model with a different "domain" name, specify it using lid=language_model.

Note: LID is only supported for English-Spanish and English-French language pairs.
  • lid=true - automatically selects the language identification model based on the LID and language models that are available.

  • lid=language - the alternative language to detect. The primary language is determined from the primary model. In this case an alternative language model of the specified language is automatically selected.

  • lid=language_model - the alternate language model.

  • lid=language:info - use this to decode all audio using the primary model, but provide language identification information in the transcript.

  • lid=false - lid is not used.

Note: LID will only be performed if the audio submitted has a sample rate of 8000.

The following parameters provide additional options when using the lid tag:

Table 1. Additional LID Options

Name

Values

Description

lidthreshold

float between 0 and 1 (default is 0)

Adjusts the confidence level required for the system to select the alternative language. Setting this option to values greater than zero will increase preference for the default model.

lidprior

float between 0 and 1 (default is 0.5)

Defines the prior probability distribution of the alternative lid language being spoken. The default value of lidprior is 0.5, which indicates to the ASR engine that there is an equal probability of the alternative lid language and primary language being spoken.

lidoffset= N

integer

Delay start of LID until specified (N) seconds into audio. If there is not enough audio left after offset, this will process preceding utterances in reverse.

When LID scoring is below the decision threshold, the ASR engine will transcribe the audio with the language model specified by the model tag (or the default model for the ASR configuration if model is not explicitly provided). The results are indicated by a lidinfo.langfinal element in the JSON output.

Language identification is a licensed optional feature.

For additional information about using the lid tag, see:

Receiving language identification information