Skip to main content
POST
/
image-to-video
Image To Video
curl --request POST \
  --url https://dream-gateway.livepeer.cloud/image-to-video \
  --header 'Authorization: Bearer <token>' \
  --header 'Content-Type: multipart/form-data' \
  --form image='@example-file' \
  --form model_id= \
  --form height=576 \
  --form width=1024 \
  --form fps=6 \
  --form motion_bucket_id=127 \
  --form noise_aug_strength=0.02 \
  --form safety_check=true \
  --form seed=123 \
  --form num_inference_steps=25
{
  "frames": [
    [
      {
        "url": "<string>",
        "seed": 123,
        "nsfw": true
      }
    ]
  ]
}
This page is still cooking... Expect big things soon!
Check the github issues for ways to contribute! Or provide your feedback in this quick form

Authorizations

Authorization
string
header
required

Bearer authentication header of the form Bearer <token>, where <token> is your auth token.

Body

multipart/form-data
image
file
required

Uploaded image to generate a video from.

model_id
string
default:""

Hugging Face model ID used for video generation.

height
integer
default:576

The height in pixels of the generated video.

width
integer
default:1024

The width in pixels of the generated video.

fps
integer
default:6

The frames per second of the generated video.

motion_bucket_id
integer
default:127

Used for conditioning the amount of motion for the generation. The higher the number the more motion will be in the video.

noise_aug_strength
number
default:0.02

Amount of noise added to the conditioning image. Higher values reduce resemblance to the conditioning image and increase motion.

safety_check
boolean
default:true

Perform a safety check to estimate if generated images could be offensive or harmful.

seed
integer

Seed for random number generation.

num_inference_steps
integer
default:25

Number of denoising steps. More steps usually lead to higher quality images but slower inference. Modulated by strength.

Response

Successful Response

Response model for video generation.

frames
Media · object[][]
required

The generated video frames.

Last modified on February 18, 2026