Aerial view of a picturesque European village at sunset. The village should feature traditional houses with red-tiled roofs clustered around a central church with a tall white spire.
A dynamic street scene with two young Asian women riding a bicycle together on a sunny day in an urban Japanese neighborhood. The woman in front wears a white t-shirt with red and white sailor-style collar trim, while her friend sits behind her.
A close-up shot of cola being poured into a clear glass filled with ice cubes. Dark, rich brown liquid should flow smoothly in slow motion with realistic liquid physics, creating small bubbles and splashes.
A close-up portrait shot of a young Asian woman with natural makeup, filmed against a soft grey background. Subject should be positioned in three-quarter view, looking slightly upward.
An animated scene of a cute cartoon cat playing an acoustic guitar, styled like a vintage concert poster. The cat should be round and plump, colored in light grey and white, with a happy closed-eye smile and simple whiskers.
Video of a happy Golden Retriever running playfully in a sunny park. The dog should be captured in slow motion, showing flowing golden fur and a joyful open-mouthed smile.
The largest open-source video generation model available today, delivering superior quality and performance.
Advanced 3D VAE architecture ensures smooth, natural motion and exceptional visual consistency throughout videos.
Superior text understanding capabilities for better text-to-video alignment and more accurate results.
Complete access to code and model weights on GitHub, enabling community contribution and improvements.
Outperforms previous state-of-the-art models with 68.5% text alignment and 96.4% visual quality scores.
Supports various video resolutions including 720p×1280p vertical resolution to suit different needs. Only 1280px720p is currently supported in generation. Stay tuned for more!