Text To Speech
Generate a text-to-speech audio file based on the provided text input and speaker description.
The default Gateway used in this guide is the public
Livepeer.cloud Gateway. It is free to use but
not intended for production-ready applications. For production-ready
applications, consider using the Livepeer Studio
Gateway, which requires an API token. Alternatively, you can set up your own
Gateway node or partner with one via the ai-video
channel on
Discord.
Please note that the exact parameters, default values, and responses may vary between models. For more information on model-specific parameters, please refer to the respective model documentation available in the text-to-speech pipeline. Not all parameters might be available for a given model.
Authorizations
Bearer authentication header of the form Bearer <token>
, where <token>
is your auth token.
Body
Hugging Face model ID used for text to speech generation.
Text input for speech generation.
Description of speaker to steer text to speech generation.
Response
Response model for audio generation.
The generated audio.
Was this page helpful?