HappyHorse 1.0 AI video generator features
Joint audio-video architecture
HappyHorse 1.0 runs a unified 40-layer self-attention Transformer that processes text, image, video, and audio tokens simultaneously in a single forward pass. There are no cross-attention modules and no separate Foley post-processing stage. Audio is planned alongside motion from the start — lip-sync, ambient sound, and visual action are coherent by design, not stitched together after generation completes.
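To illustrate what "a single self-attention pass over all modalities" means in general terms, here is a minimal toy sketch. It is not HappyHorse's implementation: the embeddings are made-up values, the projections are the identity, and there is one head and one layer. It only shows that when text, image, video, and audio tokens sit in one sequence, every token attends to every other token without a separate cross-attention step.

```python
import math

def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def self_attention(tokens):
    """Single-head self-attention with identity projections (illustration only)."""
    dim = len(tokens[0])
    mixed = []
    for query in tokens:
        # Score the query against every token in the joint sequence.
        scores = [
            sum(q * k for q, k in zip(query, key)) / math.sqrt(dim)
            for key in tokens
        ]
        weights = softmax(scores)
        # Each output is a weighted average of all tokens, across modalities.
        mixed.append(
            [sum(w * tok[i] for w, tok in zip(weights, tokens)) for i in range(dim)]
        )
    return mixed

# Hypothetical 2-dimensional toy embeddings, one short list per modality.
modalities = {
    "text": [[1.0, 0.0], [0.5, 0.5]],
    "image": [[0.0, 1.0]],
    "video": [[0.2, 0.8], [0.3, 0.7]],
    "audio": [[0.9, 0.1]],
}

# One joint sequence: no modality-specific branch, no cross-attention module.
sequence = [tok for toks in modalities.values() for tok in toks]
mixed = self_attention(sequence)
```

In this toy setup the audio token's output already blends information from the video and text tokens, which is the intuition behind planning sound and motion together rather than aligning them afterwards.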
Video-edit mode with reference images
Upload an existing video clip and write a text instruction to modify it. HappyHorse 1.0 supports local edits — changing clothing, color, or specific attributes — and global edits such as style or background transformation, while preserving the motion and temporal structure of the original clip. Add up to 5 reference images to specify the exact target appearance for the edited output.
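A client-side check for the inputs described above might look like the following sketch. The `EditRequest` class and its field names are hypothetical and do not come from any HappyHorse or fal.ai SDK; only the limit of 5 reference images is taken from the text.

```python
from dataclasses import dataclass, field
from typing import List

MAX_REFERENCE_IMAGES = 5  # limit stated in the feature description

@dataclass
class EditRequest:
    """Hypothetical container for a video-edit job (not a real SDK type)."""
    video_path: str
    instruction: str
    reference_images: List[str] = field(default_factory=list)

    def validate(self) -> "EditRequest":
        # An edit needs both a source clip and a text instruction.
        if not self.video_path:
            raise ValueError("a source video clip is required")
        if not self.instruction.strip():
            raise ValueError("the edit instruction must not be empty")
        # Reference images are optional but capped.
        if len(self.reference_images) > MAX_REFERENCE_IMAGES:
            raise ValueError(
                f"at most {MAX_REFERENCE_IMAGES} reference images are allowed, "
                f"got {len(self.reference_images)}"
            )
        return self

request = EditRequest(
    video_path="clip.mp4",
    instruction="Change the jacket to red leather",
    reference_images=["jacket_front.png", "jacket_back.png"],
).validate()
```

Validating before upload avoids sending a request that would be rejected for exceeding the reference-image limit.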
Multilingual lip-sync in 7 languages
Native lip-sync is generated alongside video for English, Mandarin, Cantonese, Japanese, Korean, German, and French — all in the same single-pass architecture. Characters speak with synchronized mouth movements without a separate voice overlay or post-production alignment step. HappyHorse 1.0 also generates Foley sounds and ambient audio natively in the same generation pass.
Reference-to-video subject consistency
Upload reference images or reference videos to establish consistent character appearance, product identity, or visual style across generated clips. HappyHorse 1.0 reads the reference assets and carries their visual qualities (face structure, clothing, material texture) into the generated video, while motion and camera behavior follow the text prompt.
Multi-format output for all platforms
HappyHorse 1.0 outputs video at 720p or 1080p in five aspect ratios (16:9, 9:16, 1:1, 4:3, and 3:4), covering the landscape, portrait, and square formats used across social, streaming, and traditional media platforms. All outputs carry full commercial rights. The model is accessible through the official fal.ai API partnership, with Python and JavaScript SDK support.
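As a rough guide to what these options mean in pixels, the sketch below derives frame dimensions for each resolution and aspect ratio. It assumes that "720p" and "1080p" refer to the length of the shorter side of the frame; the source does not state the exact output dimensions, so treat the results as an estimate rather than a specification.

```python
ASPECT_RATIOS = ["16:9", "9:16", "1:1", "4:3", "3:4"]
SHORT_SIDE = {"720p": 720, "1080p": 1080}  # assumption: "p" = shorter side

def frame_size(resolution: str, aspect: str) -> tuple:
    """Return (width, height) in pixels under the shorter-side assumption."""
    w_ratio, h_ratio = (int(part) for part in aspect.split(":"))
    short = SHORT_SIDE[resolution]
    if w_ratio >= h_ratio:
        # Landscape or square: height is the shorter side.
        return short * w_ratio // h_ratio, short
    # Portrait: width is the shorter side.
    return short, short * h_ratio // w_ratio

# Build a lookup table of every resolution and aspect-ratio combination.
sizes = {
    (res, aspect): frame_size(res, aspect)
    for res in SHORT_SIDE
    for aspect in ASPECT_RATIOS
}

for (res, aspect), (width, height) in sizes.items():
    print(f"{res} {aspect}: {width}x{height}")
```

Under this assumption, 16:9 at 1080p comes out at 1920x1080 and 9:16 at 720p at 720x1280, which match the common landscape and vertical-video frame sizes.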