Transcribe audio files to text.
ai-video channel on
Discord.Bearer authentication header of the form Bearer <token>, where <token> is your auth token.
Uploaded audio file to be transcribed.
Hugging Face model ID used for transcription.
Return timestamps for the transcribed text. Supported values: 'sentence', 'word', or a string boolean ('true' or 'false'). Default is 'true' ('sentence'). 'false' means no timestamps. 'word' means word-based timestamps.