Generate a text-to-speech audio file based on the provided text input and speaker description.
ai-video channel on Discord.
Bearer authentication header of the form Bearer <token>, where <token> is your auth token.
Hugging Face model ID used for text-to-speech generation.
Text input for speech generation.
Description of the speaker, used to steer text-to-speech generation.
Successful Response
Response model for audio generation.
The generated audio.
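
The request described above could be assembled roughly as follows. This is a minimal sketch under stated assumptions, not the documented client: the endpoint URL and the JSON field names (model_id, text, description) are hypothetical placeholders, since this page lists only the field descriptions. Only the Bearer header format comes from the text above. The sketch builds the request object without sending it.

```python
import json
import urllib.request

# Hypothetical placeholder: the real endpoint URL is not given on this page.
EXAMPLE_URL = "https://example.invalid/text-to-speech"


def build_tts_request(url, token, text, description, model_id=None):
    """Build (but do not send) a text-to-speech request.

    The JSON keys below are assumed names for the three documented inputs:
    the Hugging Face model ID, the text, and the speaker description.
    """
    payload = {"text": text, "description": description}
    if model_id is not None:
        payload["model_id"] = model_id  # assumed key name

    return urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            # Header format as documented: Bearer <token>
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


if __name__ == "__main__":
    req = build_tts_request(
        EXAMPLE_URL,
        token="YOUR_TOKEN",
        text="Hello from the text-to-speech endpoint.",
        description="A calm speaker with a clear, steady voice.",
    )
    print(req.get_method(), req.full_url)
    print(req.get_header("Authorization"))
```

Passing the returned object to urllib.request.urlopen would send it. How the generated audio is encoded in the response is not specified here, so any response handling should be checked against the actual response model.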