Create short video clips from text prompts, animate still images, or produce talking-head videos with lipsync. All results are saved to your Pixa workspace.

Text-to-video

Use the generate_media tool with media_type: "video" and a video model:
Generate a 5-second product reveal video of sneakers rotating on a white background.

Image-to-video

Pass a start frame as an attachment and describe the motion you want:
Animate this product photo with a slow zoom in and subtle background motion.
The model uses the attached image as the opening frame and animates from there.

Talking video

Attach a face image and provide a script; the model animates the face to speak it with natural lip movement and expression:
Make this face say: "Hi everyone, welcome to our product launch. Today we are introducing something special."

Lipsync

Attach both a face image and an audio file to sync lip movement to existing audio:
Lipsync this face image to the attached voiceover audio.
For lipsync, attach the face image and audio file together as separate attachments.
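
The four modes above differ only in what is attached and whether a script is supplied. The following Python sketch makes that decision explicit. The function name `pick_mode`, the file-extension sets, and the idea of representing attachments as file names are assumptions made for illustration; they are not part of the Pixa API.

```python
from pathlib import Path

# Hypothetical extension sets, chosen only for this illustration.
IMAGE_EXTS = {".png", ".jpg", ".jpeg", ".webp"}
AUDIO_EXTS = {".mp3", ".wav", ".m4a"}


def pick_mode(attachments: list[str], has_script: bool = False) -> str:
    """Return which video mode a request corresponds to."""
    exts = [Path(name).suffix.lower() for name in attachments]
    has_image = any(ext in IMAGE_EXTS for ext in exts)
    has_audio = any(ext in AUDIO_EXTS for ext in exts)

    if has_audio and not has_image:
        # Lipsync needs both files attached together.
        raise ValueError("lipsync needs a face image alongside the audio file")
    if has_image and has_audio:
        return "lipsync"
    if has_image and has_script:
        return "talking video"
    if has_image:
        return "image-to-video"
    return "text-to-video"


print(pick_mode([]))
print(pick_mode(["product.jpg"]))
print(pick_mode(["face.png"], has_script=True))
print(pick_mode(["face.png", "voiceover.mp3"]))
```

A check like this can catch a missing face image or audio file before a request is sent.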

Video parameters

The generate_media tool accepts:
| Parameter | Description | Example |
|---|---|---|
| prompt | Description of the video (required) | "slow pan across a sunlit kitchen" |
| model | Video model ID (required) | kling-v2.6-pro |
| media_type | Set to "video" for video generation | video |
| aspect_ratio | Output aspect ratio | 16:9, 9:16, 1:1 |
| output_format | File format | mp4, webm |
| num_variations | Number of variations | 1-4 |
| attachments | Start frame, face image, or audio | |
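
As a rough illustration of how these parameters fit together, the Python sketch below assembles an argument dictionary for generate_media and rejects values outside the examples listed above. It assumes those examples are the only accepted values, which this page does not confirm, and the helper name `build_video_args` and the list-of-strings shape for attachments are hypothetical.

```python
from typing import Optional

# Assumption: the example values listed for each parameter are exhaustive.
ASPECT_RATIOS = {"16:9", "9:16", "1:1"}
OUTPUT_FORMATS = {"mp4", "webm"}


def build_video_args(
    prompt: str,
    model: str,
    aspect_ratio: Optional[str] = None,
    output_format: Optional[str] = None,
    num_variations: Optional[int] = None,
    attachments: Optional[list] = None,
) -> dict:
    """Assemble generate_media arguments for a video request."""
    if not prompt or not prompt.strip():
        raise ValueError("prompt is required")
    if not model or not model.strip():
        raise ValueError("model is required")

    args = {"prompt": prompt, "model": model, "media_type": "video"}

    if aspect_ratio is not None:
        if aspect_ratio not in ASPECT_RATIOS:
            raise ValueError(f"unsupported aspect_ratio: {aspect_ratio}")
        args["aspect_ratio"] = aspect_ratio
    if output_format is not None:
        if output_format not in OUTPUT_FORMATS:
            raise ValueError(f"unsupported output_format: {output_format}")
        args["output_format"] = output_format
    if num_variations is not None:
        if not 1 <= num_variations <= 4:
            raise ValueError("num_variations must be between 1 and 4")
        args["num_variations"] = num_variations
    if attachments:
        # Hypothetical shape: a plain list of file references.
        args["attachments"] = list(attachments)
    return args


args = build_video_args(
    "slow pan across a sunlit kitchen",
    "kling-v2.6-pro",
    aspect_ratio="16:9",
    output_format="mp4",
    num_variations=2,
)
print(args)
```

Optional parameters are left out of the dictionary when unset, so any defaults applied by the tool itself are not overridden.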

Batch video generation

Generate multiple variations:
Generate 3 variations of a product reveal animation for this image.
To produce the same video in different aspect ratios for different platforms, ask for separate runs:
Generate this product demo in 16:9 for YouTube and 9:16 for TikTok.
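
Because each aspect ratio is a separate run, a script driving the tool would issue one request per platform. The Python sketch below expands one prompt into per-platform argument dictionaries. The platform-to-ratio mapping comes from the example above; the helper name `per_platform_runs` is hypothetical.

```python
# Ratios taken from the example prompt: 16:9 for YouTube, 9:16 for TikTok.
PLATFORM_RATIOS = {"YouTube": "16:9", "TikTok": "9:16"}


def per_platform_runs(prompt: str, model: str, platforms: list[str]) -> list[dict]:
    """Return one generate_media argument dict per requested platform."""
    runs = []
    for name in platforms:
        if name not in PLATFORM_RATIOS:
            raise ValueError(f"no aspect ratio known for platform: {name}")
        runs.append(
            {
                "prompt": prompt,
                "model": model,
                "media_type": "video",
                "aspect_ratio": PLATFORM_RATIOS[name],
            }
        )
    return runs


runs = per_platform_runs("product demo", "kling-v2.6-pro", ["YouTube", "TikTok"])
for run in runs:
    print(run["aspect_ratio"], run["prompt"])
```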

Tips

  • Write scripts as natural speech. For talking videos, avoid stage directions or formatting; just write what the person should say.
  • Start frame matters. For image-to-video, the quality and composition of your start frame directly affect the output.
  • Keep prompts focused. Describe one clear motion or scene rather than a complex sequence.
  • Async jobs. Video generation can take several minutes. Use get_job_status with sync: true to wait up to 25 seconds, or poll periodically for longer jobs.
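
The polling advice in the last tip can be sketched in Python as a loop with a deadline. This page does not describe the response shape of get_job_status, so the `state` field and the `completed` and `failed` values below are assumptions, and `get_status` is a stand-in callable for however your client invokes the tool.

```python
import time
from typing import Callable

# Hypothetical terminal states; the real status values are not documented here.
TERMINAL_STATES = {"completed", "failed"}


def wait_for_job(
    get_status: Callable[[], dict],
    timeout_s: float = 600.0,
    interval_s: float = 5.0,
) -> dict:
    """Poll get_status until the job reaches a terminal state or times out."""
    deadline = time.monotonic() + timeout_s
    while True:
        status = get_status()
        if status.get("state") in TERMINAL_STATES:
            return status
        if time.monotonic() >= deadline:
            raise TimeoutError("job did not finish before the timeout")
        time.sleep(interval_s)


# Demo with a fake status source in place of a real get_job_status call.
fake_statuses = iter(
    [{"state": "running"}, {"state": "running"}, {"state": "completed"}]
)
result = wait_for_job(lambda: next(fake_statuses), interval_s=0)
print(result["state"])  # prints "completed"
```

If each `get_status` call itself waits (for example with sync: true), a shorter `interval_s` may be reasonable, since the call already blocks for part of the interval.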