V‑Blaze version 7.5 (October 2023)
V‑Blaze version 7.5.0-3, voci-webapi-3.1.1-1
New features and improvements
-
Added new Out-of-Vocabulary (OOV) ASR flow.
OOV (out-of-vocabulary) is an ASR tuning feature designed to improve transcription accuracy for audio that contains brand- and industry-specific terminology. OOV enhances existing language models with new words and preferential treatment for those words.
-
Only language models version 7.5 and higher support OOV.
-
Added new stream tag
oov
for user-provided OOV lists. -
OOV lists are cached to optimize performance.
-
OOV has fully deprecated Hinting as a tuning feature.
-
-
Improved date detection and formatting, and added logic to protect dates from numerical redaction.
-
subst_rules
now supports splitting rules on a comma in addition to newline characters. -
senti_rules
now supports splitting rules on a comma in addition to newline characters. -
Two-digit number sequences beginning with zero are now concatenated in transcript output. For example, two-digit sequences previously transcribed as
0 2
,0 3
are now output as02
,03
. -
Manual sentiment is now enabled for all language models. Previously, manual sentiment was restricted to English.
-
Improved memory footprint for decodeworker processes by resetting them after the maximum number of streams has been processed.
Bug fixes
-
Resolved an issue that caused number translation to fail on uppercase internal substitutions.
-
Resolved an issue that caused the
senti_list
tag to fail when processing stereo audio. -
Resolved an issue with sentiment calculation when sentiment-bearing phrases cross utterance boundaries. Sentiment for those words and phrases is now properly set.
-
Resolved issues with Dutch models. Added the connecting word ën and resolved errors with hour and month conditional phrases.