Platform features
Feature | Real-time/Post-call | Local/V‑Cloud | Languages | Description |
---|---|---|---|---|
Post-call | Both | Support rolling out to all languages; models version 7.5+ support OOV | OOV (out-of-vocabulary) is an ASR tuning feature designed to improve transcription accuracy for audio that contains brand- and industry-specific terminology. OOV enhances existing language models with new words and preferential treatment for those words. | |
Voice activity detection | Both | Both | All | Algorithm used to detect the differences between human voice, noise, and silence. Utilized to identify only the parts of audio that are recognized as speech and sent to the ASR; configurable for a given use case. |
Auto punctuation | Both | Both | All | Adds punctuation and capitalization. Fully punctuated transcripts significantly improve speech analysis by increasing the understanding of the caller's intended meaning. |
Number translations | Both | Both | All | Controls whether certain words in transcribed text are converted into numeric digits and related conventional formats, including dollar amounts, wall-clock times, percentages, ordinals, web addresses, and telephone numbers. For example, with |
Transcoding | Post-call | V‑Cloud only | All | Determines whether V‑Cloud should use its built-in decoders to try to convert incoming audio into a supported format, if necessary. |
Output formatting | Both | Both | All | Allows a customer to specify the transcript delivery format. The following outputs are supported: json (default), jsontop , text , noutts |
Callbacks | Both | Both | All | Callbacks are used to enable another application to receive and directly interact with the produced transcripts. Allows for automated production workflows for speech transcription. |
Text redaction | Both | Both | All | Redacts numbers from a transcript. Automated numeric redaction reduces PCI/PII risk by automatically finding and eliminating credit card and other sensitive numbers from audio and text. |
Audio redaction | Both | Both | All | Replaces sensitive segments of an audio file with silence. Automated redaction reduces PCI/PII risk by automatically finding and eliminating credit card and other sensitive numbers from audio and text. |
Speaker separation (diarization) | Post-call | Both | All | Automatic speaker separation of customer and agent voices when both are recorded on one channel, enabling their utterances to be analyzed independently. This is referred to as diarization. |
Global language coverage | Both | Both | All | Voci supports 30+ languages, accents and domains. |
REST API | Both | Both | All | Voci provides several different APIs for our products:
|
Protocol support | Both | Both | All | http/https, Websockets, MRCP/uniMRCP, AudioCodes, SIP |
Platform integrations and connectors | Both | Both | All | Five9, 8x8, Calabrio, Genesys, Verint, AWS Connect, and others |