HappyHorse 1.1 Video Generation
- Alibaba Cloud Bailian HappyHorse 1.1 video generation model (unified entry, single-model auto-routing)
- Auto-routes by parameters: T2V (prompt only) / I2V (first_frame_image) / R2V (image_urls)
- Supports 720P/1080P resolutions and any integer duration from 3 to 15 seconds
- Billed by resolution × duration (seconds) only, regardless of capability
POST
Authorization
All API endpoints require Bearer Token authentication.
Get your API Key: visit the API Key Management Page to get your API Key.
Add it to the request header: Authorization: Bearer YOUR_API_KEY
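
A minimal sketch of building the authentication header in Python. The helper name `auth_headers`, the environment variable `HAPPYHORSE_API_KEY`, and the JSON `Content-Type` are illustrative assumptions, not part of the documented API:

```python
import os


def auth_headers(api_key: str) -> dict:
    """Build request headers that carry the API Key as a Bearer token."""
    if not api_key:
        raise ValueError("API key must not be empty")
    return {
        "Authorization": f"Bearer {api_key}",
        # Assumption: the request body is sent as JSON.
        "Content-Type": "application/json",
    }


# Hypothetical environment variable name; falls back to a placeholder.
headers = auth_headers(os.environ.get("HAPPYHORSE_API_KEY", "YOUR_API_KEY"))
```

Reading the key from the environment keeps it out of source control; any other secret store works equally well.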
Mode Routing
happyhorse-1.1 is the unified entry for Text-to-Video / Image-to-Video / Reference-Image-to-Video. The backend automatically determines the mode based on incoming parameters. All modes are billed by the same rule (resolution × seconds only):
| Fields you pass | Routes To | Mode Description |
|---|---|---|
| prompt only | Text-to-Video (T2V) | Generate video purely from text |
| prompt + first_frame_image | Image-to-Video (I2V) | Animate from a first-frame image |
| prompt + image_urls (1–9 images) | Reference-Image-to-Video (R2V) | Generate a new scene from reference images |
Routing priority: first_frame_image > image_urls > prompt only.
Mutual exclusion rules: the two media fields (first_frame_image / image_urls) cannot be combined. A request that passes both returns 400 mixed_media_not_allowed.
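
The routing rules above can be mirrored on the client to catch mixed-media requests before they are sent. This is a sketch of the documented behavior, not the backend's actual implementation; `resolve_mode` is a hypothetical helper:

```python
def resolve_mode(payload: dict) -> str:
    """Return the mode a request body would route to: T2V, I2V, or R2V."""
    has_first_frame = bool(payload.get("first_frame_image"))
    has_references = bool(payload.get("image_urls"))
    if has_first_frame and has_references:
        # The API rejects this combination with 400 mixed_media_not_allowed.
        raise ValueError("mixed_media_not_allowed")
    if has_first_frame:
        return "I2V"
    if has_references:
        return "R2V"
    return "T2V"


print(resolve_mode({"prompt": "A horse galloping across a beach"}))
```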
Request Parameters
Video generation model name, fixed as happyhorse-1.1
Video content description, up to 2500 characters; cannot contain special tokens
Example: "A little girl walking down the road, cinematic feel"
First-frame image, triggers I2V (Image-to-Video). Supports URL or base64 (data:image/<mime>;base64,<payload>, the gateway uploads it to OSS automatically)
Mutually exclusive with image_urls
First-frame image requirements:
- Format: JPEG / JPG / PNG / BMP / WEBP
- Short side: ≥ 300px
- Aspect ratio: 1:2.5 to 2.5:1
- File size: ≤ 10MB
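
The first-frame requirements above can be checked locally before upload. The sketch below assumes the caller already knows the image's pixel dimensions, byte size, and format, and it treats 10MB as 10 × 1024 × 1024 bytes; `check_first_frame` is a hypothetical helper:

```python
ALLOWED_FORMATS = {"JPEG", "JPG", "PNG", "BMP", "WEBP"}
MAX_BYTES = 10 * 1024 * 1024  # assumption: 10MB is interpreted as 10 MiB


def check_first_frame(width: int, height: int, size_bytes: int, fmt: str) -> list:
    """Return a list of requirement violations; an empty list means OK."""
    problems = []
    if fmt.upper() not in ALLOWED_FORMATS:
        problems.append(f"unsupported format: {fmt}")
    if min(width, height) < 300:
        problems.append("short side must be at least 300px")
    # Width:height must lie between 1:2.5 and 2.5:1.
    # Integer arithmetic avoids floating-point edge cases at the limits.
    if 5 * width < 2 * height or 2 * width > 5 * height:
        problems.append("aspect ratio must be between 1:2.5 and 2.5:1")
    if size_bytes > MAX_BYTES:
        problems.append("file size must not exceed 10MB")
    return problems


print(check_first_frame(1280, 720, 2_000_000, "png"))
```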
Image array (R2V mode): 1–9 images, used as subject/style references to generate a new scene
Supports URL or base64
Mutually exclusive with first_frame_image
Reference image requirements:
- Format: JPEG / JPG / PNG / BMP / WEBP
- Short side: ≥ 720p recommended
- Aspect ratio: short / long ≥ 0.4
- File size: ≤ 10MB
- Count: 1–9 images
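
A comparable sketch for reference images, covering the count, the short/long ratio, and the size limit. The 720p short side is only a recommendation, so it is not enforced here. `check_reference_images` is a hypothetical helper and 10MB is again assumed to mean 10 × 1024 × 1024 bytes:

```python
MAX_REF_BYTES = 10 * 1024 * 1024  # assumption: 10MB is interpreted as 10 MiB


def check_reference_images(images: list) -> list:
    """Check a list of (width, height, size_bytes) tuples for R2V use."""
    problems = []
    if not 1 <= len(images) <= 9:
        problems.append("image_urls must contain 1 to 9 images")
    for index, (width, height, size_bytes) in enumerate(images):
        short_side, long_side = min(width, height), max(width, height)
        # short / long >= 0.4 is equivalent to 5 * short >= 2 * long.
        if 5 * short_side < 2 * long_side:
            problems.append(f"image {index}: short/long ratio is below 0.4")
        if size_bytes > MAX_REF_BYTES:
            problems.append(f"image {index}: file size exceeds 10MB")
    return problems


print(check_reference_images([(1280, 720, 500_000), (720, 1280, 800_000)]))
```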
Video resolution (affects billing)
Options:
- 720P - Standard
- 1080P - High definition (default)
Video duration in seconds (affects billing)
Supported range: any integer from 3 to 15
Default: 5
Aspect ratio
Supported formats:
- 16:9 - Landscape widescreen (default)
- 9:16 - Portrait
- 1:1 - Square
- 4:3 - Landscape
- 3:4 - Portrait
Whether to add a watermark to the generated video
- true: Add watermark
- false: Do not add watermark (default)
Random seed used to control the randomness of generated content
Value range: [0, 2147483647]. If omitted, a random seed is used.
- For identical requests, the model generates different results when receiving different seed values (e.g., omitting seed)
- For identical requests, the model generates similar results when receiving the same seed value, but exact consistency is not guaranteed
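
The parameter constraints above can be validated when building a request body. This is a sketch under assumptions: the field names `prompt`, `resolution`, `size`, and `seed` appear on this page, while `model`, `duration`, and `watermark` are assumed names for the corresponding parameters and should be checked against the live reference. `build_payload` is a hypothetical helper:

```python
RESOLUTIONS = {"720P", "1080P"}
SIZES = {"16:9", "9:16", "1:1", "4:3", "3:4"}
MAX_SEED = 2147483647


def build_payload(prompt, resolution="1080P", duration=5, size="16:9",
                  watermark=False, seed=None) -> dict:
    """Validate scalar parameters and return a text-to-video request body."""
    if not prompt or len(prompt) > 2500:
        raise ValueError("prompt must be 1 to 2500 characters")
    if resolution not in RESOLUTIONS:
        raise ValueError("resolution must be 720P or 1080P")
    # bool is a subclass of int in Python, so reject it explicitly.
    if isinstance(duration, bool) or not isinstance(duration, int):
        raise ValueError("duration must be an integer")
    if not 3 <= duration <= 15:
        raise ValueError("duration must be between 3 and 15 seconds")
    if size not in SIZES:
        raise ValueError("unsupported aspect ratio")
    payload = {
        "model": "happyhorse-1.1",  # assumed field name
        "prompt": prompt,
        "resolution": resolution,
        "duration": duration,       # assumed field name
        "size": size,
        "watermark": watermark,     # assumed field name
    }
    if seed is not None:
        if not 0 <= seed <= MAX_SEED:
            raise ValueError("seed must be in [0, 2147483647]")
        payload["seed"] = seed  # omit the field to let the service pick one
    return payload


print(build_payload("A little girl walking down the road, cinematic feel", seed=42))
```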
Response
Response status code, 200 on success
Response data array
Use Cases
Case 1: Text-to-Video T2V (Simplest Request)
Case 2: Text-to-Video T2V (Full Parameters)
Case 3: Image-to-Video I2V (first_frame_image)
Case 4: Reference-Image-to-Video R2V (multiple references)
Case 5: 720P to Save Cost
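
The five cases can be sketched as request bodies, shown here as Python dictionaries serialized to JSON. The image URLs are placeholders, and the field names `model`, `duration`, and `watermark` are assumptions (the others appear on this page). The endpoint URL is not given in this section, so sending the request is left out:

```python
import json

MODEL = "happyhorse-1.1"
PROMPT = "A little girl walking down the road, cinematic feel"

# Case 1: T2V, simplest request (defaults apply to everything else).
case1_t2v_minimal = {"model": MODEL, "prompt": PROMPT}

# Case 2: T2V with every optional parameter set explicitly.
case2_t2v_full = {
    "model": MODEL, "prompt": PROMPT, "resolution": "1080P",
    "duration": 10, "size": "16:9", "watermark": False, "seed": 12345,
}

# Case 3: I2V, triggered by first_frame_image (size is ignored in this mode).
case3_i2v = {
    "model": MODEL, "prompt": "slow push-in, cinematic, warm tones",
    "first_frame_image": "https://example.com/first-frame.png",
}

# Case 4: R2V, triggered by image_urls (1 to 9 reference images).
case4_r2v = {
    "model": MODEL, "prompt": "The same character in a snowy forest",
    "image_urls": ["https://example.com/ref-1.jpg",
                   "https://example.com/ref-2.jpg"],
}

# Case 5: 720P to reduce cost.
case5_720p = {"model": MODEL, "prompt": PROMPT, "resolution": "720P",
              "duration": 5}

print(json.dumps(case2_t2v_full, indent=2))
```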
Mode Selection Guide
| Requirement | Recommended Approach |
|---|---|
| Generate video from text only | Pass only prompt (T2V) |
| Make an image “come alive” (use it as the first frame) | Pass first_frame_image (I2V) |
| Generate a new scene from a set of reference images | Pass image_urls (1–9, R2V) |
| Save cost | Use resolution: "720P" |
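
Because billing depends only on resolution and duration, the saving from 720P can be estimated with simple arithmetic. The per-second rates below are placeholders for illustration only, not real prices; `estimate_cost` is a hypothetical helper:

```python
def estimate_cost(resolution: str, duration: int, price_per_second: dict) -> float:
    """Cost = per-second rate for the resolution x duration in seconds."""
    if resolution not in price_per_second:
        raise ValueError(f"no rate configured for {resolution}")
    if not 3 <= duration <= 15:
        raise ValueError("duration must be between 3 and 15 seconds")
    return price_per_second[resolution] * duration


# Placeholder rates; substitute the values from the current price list.
rates = {"720P": 1.0, "1080P": 2.0}
saving = estimate_cost("1080P", 10, rates) - estimate_cost("720P", 10, rates)
print(saving)
```

The mode (T2V, I2V, or R2V) does not appear in the function because it does not affect the price.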
Usage Tips
- Unified entry logic: input fields decide the mode. Note that the two media fields (first_frame_image / image_urls) are mutually exclusive
- size only effective in T2V/R2V: in I2V mode size is ignored; the output aspect ratio is determined by the input media
- Duration: 5–10 seconds is the sweet spot. Too short causes choppy motion; too long significantly increases upstream processing time
- First-frame image quality: a clear, well-composed image with a centered subject significantly improves I2V output
- Prompt writing: describe motion / camera / atmosphere (e.g., “slow push-in, cinematic, warm tones”) for better results than purely static scene descriptions
Query Task Results
Video generation is an async task that returns a task_id upon submission. Use the Get Task Status endpoint to query generation progress and results.
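
A minimal polling sketch for the async flow. The status call is injected as a function so the loop stays independent of the Get Task Status endpoint details. The `status` field name and the terminal values `succeeded` and `failed` are assumptions, not documented here:

```python
import time
from typing import Callable

TERMINAL_STATES = {"succeeded", "failed"}  # assumed status values


def wait_for_task(task_id: str,
                  fetch_status: Callable[[str], dict],
                  interval: float = 5.0,
                  timeout: float = 600.0,
                  sleep: Callable[[float], None] = time.sleep) -> dict:
    """Poll fetch_status(task_id) until a terminal state or the timeout."""
    deadline = time.monotonic() + timeout
    while True:
        result = fetch_status(task_id)
        if result.get("status") in TERMINAL_STATES:
            return result
        if time.monotonic() >= deadline:
            raise TimeoutError(f"task {task_id} did not finish in {timeout}s")
        sleep(interval)


# Usage with a stand-in for the real status call:
responses = iter([{"status": "pending"}, {"status": "succeeded"}])
final = wait_for_task("demo-task", lambda _id: next(responses), interval=0)
print(final["status"])
```

In real use, `fetch_status` would wrap an authenticated request to the Get Task Status endpoint. A polling interval of a few seconds is a reasonable starting point given that longer videos take longer to process.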