Audio To Text

curl --request POST \
  --url https://dream-gateway.livepeer.cloud/audio-to-text \
  --header 'Authorization: Bearer <token>' \
  --header 'Content-Type: multipart/form-data' \
  --form audio='@example-file' \
  --form model_id= \
  --form return_timestamps=true \
  --form 'metadata={}'

{
  "text": "<string>",
  "chunks": [
    {
      "timestamp": [
        "<unknown>"
      ],
      "text": "<string>"
    }
  ]
}

POST

audio-to-text

Audio To Text

curl --request POST \
  --url https://dream-gateway.livepeer.cloud/audio-to-text \
  --header 'Authorization: Bearer <token>' \
  --header 'Content-Type: multipart/form-data' \
  --form audio='@example-file' \
  --form model_id= \
  --form return_timestamps=true \
  --form 'metadata={}'

{
  "text": "<string>",
  "chunks": [
    {
      "timestamp": [
        "<unknown>"
      ],
      "text": "<string>"
    }
  ]
}

This page is still cooking... Expect big things soon!
Check the github issues for ways to contribute! Or provide your feedback in this quick form

Authorizations

Authorization

string

header

required

Bearer authentication header of the form Bearer <token>, where <token> is your auth token.

Body

multipart/form-data

audio

file

required

Uploaded audio file to be transcribed.

model_id

string

default:""

Hugging Face model ID used for transcription.

return_timestamps

string

default:true

Return timestamps for the transcribed text. Supported values: 'sentence', 'word', or a string boolean ('true' or 'false'). Default is 'true' ('sentence'). 'false' means no timestamps. 'word' means word-based timestamps.

metadata

string

default:{}

Additional job information to be passed to the pipeline.

Response

Successful Response

Response model for text generation.

text

string

required

The generated text.

chunks

Chunk · object[]

required

The generated text chunks.

Show child attributes

Last modified on February 18, 2026

Upscale

Segment Anything 2

⌘I

Gateway Knowledge Hub

Quickstart ⚡

Gateway Services & Providers

Run A Gateway

Gateway Tools & Resources

Technical References

Audio To Text

Authorizations

Body

Response