Create short video clips from text prompts, animate still images, or produce talking-head videos with lipsync. All results are saved to your Pixa workspace.
Text-to-video
Use the `generate_media` tool with `media_type: "video"` and a video model, for example `kling-v2.6-pro`.
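
As a minimal sketch, a text-to-video call might look like the request below when sent as an MCP `tools/call` message. The parameter names and the model ID come from the parameter table on this page. The JSON-RPC envelope follows the general MCP convention; how your client sends it (transport, authentication) is an assumption and is not shown here.

```python
import json

# Arguments for generate_media; names and example values are taken from this page.
arguments = {
    "prompt": "slow pan across a sunlit kitchen",
    "model": "kling-v2.6-pro",
    "media_type": "video",
    "aspect_ratio": "16:9",
    "output_format": "mp4",
}

# MCP tool calls are JSON-RPC 2.0 requests with the method "tools/call".
# The request id is arbitrary; most MCP clients build this envelope for you.
request = {
    "jsonrpc": "2.0",
    "id": 1,
    "method": "tools/call",
    "params": {"name": "generate_media", "arguments": arguments},
}

payload = json.dumps(request, indent=2)
print(payload)
```

In practice an MCP client library usually hides the envelope, so only the `arguments` object needs to be written by hand.
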
Image-to-video
Pass a start frame as an attachment and describe the motion you want.
Talking video
Attach a face image and provide a script. The model animates the face to speak the script with natural lip movement and expression.
Lipsync
Attach both a face image and an audio file to sync lip movement to existing audio.
For lipsync, attach the face image and audio file together as separate attachments.
Video parameters
The `generate_media` tool accepts:
| Parameter | Description | Example |
|---|---|---|
| `prompt` | Description of the video (required) | `"slow pan across a sunlit kitchen"` |
| `model` | Video model ID (required) | `kling-v2.6-pro` |
| `media_type` | Set to `"video"` for video generation | `video` |
| `aspect_ratio` | Output aspect ratio | `16:9`, `9:16`, `1:1` |
| `output_format` | File format | `mp4`, `webm` |
| `num_variations` | Number of variations | 1-4 |
| `attachments` | Start frame, face image, or audio | — |
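
To illustrate how these parameters combine across the modes described above, here is a hedged sketch of a helper that assembles the arguments. `build_video_args` is a hypothetical helper, not part of Pixa. The attachment values are placeholder strings, because this page does not specify the attachment format, and the 1 to 4 limit on `num_variations` is assumed from the example column.

```python
from typing import Optional


def build_video_args(
    prompt: str,
    model: str,
    attachments: Optional[list] = None,
    aspect_ratio: str = "16:9",
    output_format: str = "mp4",
    num_variations: int = 1,
) -> dict:
    """Assemble generate_media arguments for a video request (hypothetical helper)."""
    if not prompt or not model:
        raise ValueError("prompt and model are required")
    # Assumption: the "1-4" example in the table is the allowed range.
    if not 1 <= num_variations <= 4:
        raise ValueError("num_variations must be between 1 and 4")
    args = {
        "prompt": prompt,
        "model": model,
        "media_type": "video",
        "aspect_ratio": aspect_ratio,
        "output_format": output_format,
        "num_variations": num_variations,
    }
    # Include attachments only when the mode needs them.
    if attachments:
        args["attachments"] = list(attachments)
    return args


# Example ID from the table; other modes may require a different model.
MODEL = "kling-v2.6-pro"

# Text-to-video: prompt only, three variations in one request.
batch = build_video_args("slow pan across a sunlit kitchen", MODEL, num_variations=3)

# Image-to-video: the start frame is the single attachment (placeholder reference).
image_to_video = build_video_args(
    "the camera slowly zooms in", MODEL, attachments=["start-frame.png"]
)

# Lipsync: face image and audio file passed as two separate attachments.
lipsync = build_video_args(
    "sync the lips to the attached audio",
    MODEL,
    attachments=["face.png", "voiceover.mp3"],
    aspect_ratio="9:16",
)
```

Keeping validation in one place means an out-of-range `num_variations` fails locally rather than after a request has been submitted.
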
Batch video generation
Generate multiple variations in one request by setting `num_variations`.
Tips
- Start frame matters. For image-to-video, the quality and composition of your start frame directly affect the output.
- Keep prompts focused. Describe one clear motion or scene rather than a complex sequence.
- Async jobs. Video generation can take several minutes. Use `get_job_status` with `sync: true` to wait up to 25 seconds, or poll periodically for longer jobs.
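
The polling advice in the last tip could be sketched as a small loop. This is a rough illustration under stated assumptions: `call_tool` is a hypothetical callable standing in for whatever your MCP client uses to invoke a tool by name, the `job_id` argument name is assumed, and the `status` field with `completed` and `failed` values is assumed rather than taken from the Pixa reference.

```python
import time
from typing import Callable


def wait_for_job(
    call_tool: Callable[[str, dict], dict],
    job_id: str,
    timeout_s: float = 600.0,
    pause_s: float = 5.0,
) -> dict:
    """Poll get_job_status until the job finishes or timeout_s elapses."""
    deadline = time.monotonic() + timeout_s
    while True:
        # sync: true asks the server to hold the request for up to 25 seconds.
        # Assumption: the job is identified by a "job_id" argument.
        result = call_tool("get_job_status", {"job_id": job_id, "sync": True})
        # Assumption: the result carries a "status" field with these terminal values.
        if result.get("status") in ("completed", "failed"):
            return result
        if time.monotonic() >= deadline:
            raise TimeoutError(f"job {job_id} still running after {timeout_s} seconds")
        time.sleep(pause_s)
```

Because each `sync: true` call may already block on the server side, the pause between calls can stay short; the overall timeout is what bounds a multi-minute job.
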
Related
- Video Generation (User Guide) — in-app video workflow
- MCP Server — full MCP tool reference