Platform features

Table 1. Voci ASR platform features
Feature	Real-time/Post-call	Local/V‑Cloud	Languages	Description
Out-of-vocabulary (OOV)	Post-call	Both	Rolling out to all languages; supported by models at version 7.5 and later	An ASR tuning feature that improves transcription accuracy for audio containing brand- and industry-specific terminology. OOV adds new words to existing language models and gives those words preferential treatment.
Voice activity detection	Both	Both	All	Algorithm that distinguishes human voice from noise and silence, so that only the parts of the audio recognized as speech are sent to the ASR engine. Configurable for a given use case.
Auto punctuation	Both	Both	All	Adds punctuation and capitalization. Fully punctuated transcripts significantly improve speech analysis by making the caller's intended meaning clearer.
Number translations	Both	Both	All	Controls whether certain words in transcribed text are converted into numeric digits and related conventional formats, including dollar amounts, wall-clock times, percentages, ordinals, web addresses, and telephone numbers. For example, with `numtrans` set to `true` (the default), the words “forty two percent” would be transformed into the text “42%”.
Transcoding	Post-call	V‑Cloud only	All	Determines whether V‑Cloud should use its built-in decoders to try to convert incoming audio into a supported format, if necessary.
Output formatting	Both	Both	All	Allows a customer to specify the transcript delivery format. The supported output formats are `json` (default), `jsontop`, `text`, and `noutts`.
Callbacks	Both	Both	All	Enable another application to receive the produced transcripts and interact with them directly, which allows automated production workflows for speech transcription.
Text redaction	Both	Both	All	Redacts numbers from a transcript. Automated numeric redaction reduces PCI/PII risk by finding and removing credit card and other sensitive numbers from audio and text.
Audio redaction	Both	Both	All	Replaces sensitive segments of an audio file with silence. Automated redaction reduces PCI/PII risk by finding and removing credit card and other sensitive numbers from audio and text.
Speaker separation (diarization)	Post-call	Both	All	Automatic speaker separation of customer and agent voices when both are recorded on one channel, enabling their utterances to be analyzed independently. This is referred to as diarization.
Global language coverage	Both	Both	All	Voci supports 30+ languages, accents and domains.
REST API	Both	Both	All	Voci provides several APIs for its products: `vociwebapi`, the V-Blaze self-hosted REST API; and `vcloud`, the V-Cloud API for Voci-hosted post-call ASR processing.
Protocol support	Both	Both	All	HTTP/HTTPS, WebSocket, MRCP/UniMRCP, AudioCodes, SIP
Platform integrations and connectors	Both	Both	All	Five9, 8x8, Calabrio, Genesys, Verint, Amazon Connect, and others
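
Several features in Table 1 are controlled per request, such as `numtrans` and the output format. The following is a minimal sketch of how a client might assemble and validate those two options before submitting audio. The helper name `build_options`, the key name `output`, and the lowercase `true`/`false` string encoding are assumptions made for illustration; only `numtrans` and the four output format values come from the table. Consult the API reference for the actual parameter names.

```python
# Output formats listed in Table 1; "json" is the documented default.
SUPPORTED_OUTPUTS = ("json", "jsontop", "text", "noutts")


def build_options(numtrans=True, output="json"):
    """Return request options as a dict of strings (hypothetical helper).

    `numtrans` is named in Table 1 and defaults to true there.
    The key name "output" and the "true"/"false" string encoding
    are assumptions, not confirmed API details.
    """
    if output not in SUPPORTED_OUTPUTS:
        raise ValueError(f"unsupported output format: {output!r}")
    return {
        "numtrans": "true" if numtrans else "false",
        "output": output,  # assumed key name
    }


if __name__ == "__main__":
    # Defaults from Table 1: number translation on, JSON output.
    print(build_options())
    # Plain-text transcript with number words left as spoken.
    print(build_options(numtrans=False, output="text"))
```

Validating the output format on the client side reports a mistyped value before any audio is uploaded.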