Optionalbeam_Optionalcompression_Threshold for filtering out segments with high compression ratio, which often indicate repetitive or hallucinated text.
Optionalcondition_Whether to condition on previous text during transcription. Setting to false may help prevent hallucination loops.
Optionalhallucination_Optional threshold (in seconds) to skip silent periods that may cause hallucinations.
Optionalinitial_A text prompt to help provide context to the model on the contents of the audio.
OptionallanguageThe language of the audio being transcribed or translated.
Optionallog_Threshold for filtering out segments with low average log probability, indicating low confidence.
Optionalno_Threshold for detecting no-speech segments. Segments with no-speech probability above this value are skipped.
OptionalprefixThe prefix appended to the beginning of the output of the transcription and can guide the transcription result.
OptionaltaskSupported tasks are 'translate' or 'transcribe'.
Optionalvad_Preprocess the audio with a voice activity detection model.
The number of beams to use in beam search decoding. Higher values may improve accuracy at the cost of speed.