parler-tts/parler-tts-large-v1
. This model can generate speech with
customizable characteristics such as voice type, speaking style, and audio
quality.
text-to-speech
endpoint and to
experiment with the API, see the Livepeer AI API
Reference./text-to-speech
endpoint. Here’s an example of how to structure your request:
model_id
: The ID of the text-to-speech model to use. Currently, this should
be set to "parler-tts/parler-tts-large-v1"
.text
: The text you want to convert to speech.description
: A description of the desired voice characteristics. This can
include details about the speaker’s voice, speaking style, and audio quality.description
parameter.
Some aspects you can control include:
text-to-speech
pipeline, refer to
the Orchestrator Configuration guide.
text-to-speech
pipeline is based on competitor pricing.
However, we strongly encourage orchestrators to set their own pricing based on
their costs and requirements. Setting a competitive price will help attract more
jobs, as Gateways can set their maximum price for a job. The current recommended
pricing for this pipeline is 1.5e-6 USD
per character.
text-to-speech
pipeline, you must use a pipeline specific AI
Runner container. Pull the required container from
Docker Hub
using the following command:
text-to-speech
endpoint and experiment with the API in the
Livepeer AI API Reference.