Skip to main content
POST
/
audio-to-text
Audio To Text
curl --request POST \
  --url https://dream-gateway.livepeer.cloud/audio-to-text \
  --header 'Authorization: Bearer <token>' \
  --header 'Content-Type: multipart/form-data' \
  --form audio='@example-file' \
  --form model_id= \
  --form return_timestamps=true \
  --form 'metadata={}'
{
  "text": "<string>",
  "chunks": [
    {
      "timestamp": [
        "<unknown>"
      ],
      "text": "<string>"
    }
  ]
}
This page is still cooking... Expect big things soon!
Check the github issues for ways to contribute! Or provide your feedback in this quick form

Authorizations

Authorization
string
header
required

Bearer authentication header of the form Bearer <token>, where <token> is your auth token.

Body

multipart/form-data
audio
file
required

Uploaded audio file to be transcribed.

model_id
string
default:""

Hugging Face model ID used for transcription.

return_timestamps
string
default:true

Return timestamps for the transcribed text. Supported values: 'sentence', 'word', or a string boolean ('true' or 'false'). Default is 'true' ('sentence'). 'false' means no timestamps. 'word' means word-based timestamps.

metadata
string
default:{}

Additional job information to be passed to the pipeline.

Response

Successful Response

Response model for text generation.

text
string
required

The generated text.

chunks
Chunk ยท object[]
required

The generated text chunks.

Last modified on February 18, 2026