The textinfo object
The textinfo object is included in a JSON transcript by default when any text is decoded from an audio file. To exclude the textinfo object, specify the stream tag textinfo = false when submitting audio for transcription. This element was first introduced with V‑Blaze version 7.3.
The textinfo object includes the following elements:
| Element | Sub-element | Type | Description |
|---|---|---|---|
| turns | | number | The number of distinct speaker turns detected in the audio. Calculated for stereo or diarized mono audio only. |
| wordtime | | array | An array with the following number values: |
| overtalk | | object | Metrics for overtalk throughout the audio file. Calculated for multi-channel audio only. |
| | count | array | Contains the following number elements: |
| | avgtime | number | The average duration of all overtalk occurrences. |
| | time | array | Contains the following number elements: |
| words | | number | The total number of words spoken in the transcribed audio file. |
| silence | | array | Contains the following number elements: |
| tags | | object | Generated when the request is submitted with the stream tag … Contains one key-value pair with the field name … |
The following JSON example shows a textinfo object generated from stereo audio:
    "textinfo": {
        "turns": 229,
        "wordtime": [
            945.62,
            0.702
        ],
        "overtalk": {
            "count": [
                92,
                0.402
            ],
            "avgtime": 1.19,
            "time": [
                109.44,
                0.116
            ]
        },
        "words": 3652,
        "silence": [
            401.64,
            0.298
        ]
    }
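
Because textinfo can be excluded with textinfo = false, and turns and overtalk are only calculated for some audio types, code that reads a transcript may want to treat all of these as optional. The following Python sketch shows one way to do that. The helper name summarize_textinfo and the words_per_turn value are hypothetical and not part of V‑Blaze, and the sketch assumes textinfo is a top-level key of the transcript object.

```python
import json
from typing import Optional


def summarize_textinfo(transcript: dict) -> Optional[dict]:
    """Return a few headline numbers from a transcript's textinfo object.

    Hypothetical helper, not part of V-Blaze. Returns None when textinfo is
    absent, for example when the audio was submitted with textinfo = false
    or no text was decoded.
    """
    # Assumption: textinfo sits at the top level of the transcript.
    textinfo = transcript.get("textinfo")
    if textinfo is None:
        return None

    words = textinfo.get("words")
    turns = textinfo.get("turns")        # stereo or diarized mono audio only
    overtalk = textinfo.get("overtalk")  # multi-channel audio only

    return {
        "words": words,
        "turns": turns,
        "overtalk_avgtime": overtalk.get("avgtime") if overtalk else None,
        # Derived here for illustration; textinfo does not report this value.
        "words_per_turn": words / turns if words is not None and turns else None,
    }


# A trimmed copy of the stereo example above, wrapped in a transcript object.
transcript = json.loads("""
{
  "textinfo": {
    "turns": 229,
    "overtalk": {"avgtime": 1.19},
    "words": 3652
  }
}
""")

summary = summarize_textinfo(transcript)
print(summary)
```

For a mono transcript without diarization, the same call would return None for turns, overtalk_avgtime, and words_per_turn rather than raising an error.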