image-to-text
pipeline converts images into text captions. This pipeline
is powered by the latest models in the HuggingFace
text-to-image
pipeline.
image-to-text
pipeline is:
ai-video
channel in Discord Server.image-to-text
pipeline:
Tested and Verified Diffusion Models
image-to-text
endpoint and to experiment
with the API, see the Livepeer AI API
Reference.image-to-text
pipeline, submit a POST
request to the Gateway’s image-to-text
API endpoint:
<GATEWAY_IP>
should be replaced with your AI Gateway’s IP address.model_id
is the diffusion model to use.image
is the path to the image file to be captioned.image-to-text
pipeline, refer to
the Orchestrator Configuration guide.
image-to-text
pipeline is based on competitor pricing.
However, we strongly encourage orchestrators to set their own pricing based on
their costs and requirements. Setting a competitive price will help attract more
jobs, as Gateways can set their maximum price for a job. The current recommended
pricing for this pipeline is 2.5e-10 USD
per input pixel
(height * width
).
image-to-text
endpoint and experiment with the API in the
Livepeer AI API Reference.