wan2.7 is a unified entry for text-to-video and image-to-video. The backend automatically determines the mode based on the incoming parameters. Both modes are billed identically:
| Condition | Routes To | Mode Description |
|---|---|---|
Any of image_urls / image_with_roles / video_urls is provided | Image-to-Video | First-frame / First-last frame / Video continuation |
| None of the above parameters provided | Text-to-Video | Generate video purely from text description |
wan2.7"A cat chasing butterflies on the grass, bright sunshine, slow motion"image_with_rolesimage_urls, used to precisely specify the role of each imageFields for each object:url (string): image URL (supports http/https)role (string): image role, first_frame / last_frame, default first_frame"blurry, distorted, low quality"720P - Standard1080P - High definition (default)2 ~ 15 secondsDefault: 516:9 - Landscape widescreen (default)9:16 - Portrait1:1 - Square4:3 - Landscape3:4 - Portraittruetrue: add watermarkfalse: no watermark (default)≥0
With 2 images: the 1st is the first frame, the 2nd is the last frame. You can also use image_with_roles for precise specification.
| Requirement | Recommended Approach |
|---|---|
| Generate video from text only | Pass only prompt (no image/video) |
| Make an image “come alive” | Pass 1 image to image_urls |
| Control start and end frames | Pass 2 images to image_urls (first + last) |
| Extend an existing video | Pass video to video_urls |
| Make image move to music | Pass image + audio_url |
task_id upon submission. Use the Get Task Status endpoint to query generation progress and results.