V‑Blaze version 7.5 (October 2023)

V‑Blaze version 7.5.0-3, voci-webapi-3.1.1-1

New features and improvements

  • Added a new Out-of-Vocabulary (OOV) ASR flow.

    OOV is an ASR tuning feature designed to improve transcription accuracy for audio that contains brand- and industry-specific terminology. It extends existing language models with new words and gives those words preferential treatment.

    • Only language models version 7.5 and higher support OOV.

    • Added a new stream tag, oov, for user-provided OOV lists.

    • OOV lists are cached to optimize performance.

    • Hinting is now fully deprecated as a tuning feature in favor of OOV.

  • Improved date detection and formatting, and added logic to protect dates from numerical redaction.

  • subst_rules now supports rules separated by commas as well as by newline characters.

  • senti_rules now supports rules separated by commas as well as by newline characters.

  • Two-digit number sequences beginning with zero are now concatenated in transcript output. For example, two-digit sequences previously transcribed as 0 2, 0 3 are now output as 02, 03.

  • Manual sentiment is now enabled for all language models. Previously, manual sentiment was restricted to English.

  • Reduced the memory footprint of decodeworker processes by resetting them after they have processed the maximum number of streams.
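
As an illustration only, the two text-handling changes above (comma or newline separators for subst_rules and senti_rules, and concatenation of two-digit sequences that begin with zero) might be approximated as follows. This is a minimal sketch and not the V‑Blaze implementation: the function names split_rules and join_zero_led_pairs are hypothetical, the rule strings are placeholders rather than real rule syntax, and the sketch assumes rules do not themselves contain commas.

```python
import re


def split_rules(raw):
    """Split a rule string on commas or newlines (hypothetical helper).

    Surrounding whitespace is trimmed and empty entries are dropped.
    Assumes that an individual rule never contains a comma.
    """
    parts = re.split(r"[,\n]", raw)
    return [p.strip() for p in parts if p.strip()]


# A standalone "0", one space, then a standalone single digit.
# The lookarounds keep the pattern from touching longer numbers
# such as "10 2" or "0 25".
_ZERO_LED_PAIR = re.compile(r"(?<!\w)0 (\d)(?!\w)")


def join_zero_led_pairs(text):
    """Concatenate two-digit sequences that begin with zero (hypothetical helper)."""
    return _ZERO_LED_PAIR.sub(r"0\1", text)


if __name__ == "__main__":
    # Placeholder rule text; real rule syntax is not shown here.
    print(split_rules("rule one, rule two\nrule three"))
    # The example from the release note: "0 2, 0 3" becomes "02, 03".
    print(join_zero_led_pairs("0 2, 0 3"))
```

The lookarounds in the second helper are a deliberate choice: they restrict the change to isolated zero-plus-digit pairs, so other number sequences in the transcript are left untouched.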

Bug fixes

  • Resolved an issue that caused number translation to fail on uppercase internal substitutions.

  • Resolved an issue that caused the senti_list tag to fail when processing stereo audio.

  • Resolved an issue with sentiment calculation when sentiment-bearing phrases cross utterance boundaries. Sentiment for those words and phrases is now set correctly.

  • Resolved issues with Dutch models. Added the connecting word ën and resolved errors with hour and month conditional phrases.