Try these ideas:
WAN 2.7 AI
Alibaba's Most Versatile Video Model
WAN 2.7 by Alibaba brings 7 generation modes in one model โ text-to-video, image-to-video, start-end animation, video continuation, AI video editing, audio-driven video, and multi-reference consistency. Generate 720p or 1080p videos up to 15 seconds with native audio on VO3 AI.
Generate Video with WAN 2.7
Try WAN 2.7 text-to-video and image-to-video directly. Select WAN 2.7 from the model dropdown to experience Alibaba's latest AI video technology.
Generate AI Model videos with AI
What is WAN 2.7?
WAN 2.7 is Alibaba's most advanced AI video generation model, representing a major leap from WAN 2.5. While WAN 2.5 offered basic text-to-video and image-to-video with fixed 5-second output, WAN 2.7 introduces 7 distinct generation modes with configurable duration (10-15 seconds), dual resolution options (720p/1080p), and native audio generation. Built on Alibaba's latest research, WAN 2.7 excels at maintaining visual consistency across complex multi-reference scenarios.
WAN 2.7 is available exclusively on VO3 AI through the KIE API infrastructure. Unlike single-purpose models, WAN 2.7 serves as a complete video creation toolkit โ from generating new content to editing existing videos, continuing scenes, and even driving visuals with audio input. The Reference-to-Video (R2V) mode uniquely supports up to 5 combined image and video references for unprecedented character and style consistency.
7 Generation Modes of WAN 2.7
Text to Video
Generate 720p/1080p videos up to 15s
from text prompts with native audio
Image to Video
Animate single images with first-frame
control and aspect ratio options
Start โ End Animation
Upload start and end frames โ
WAN 2.7 generates smooth motion between them
Video Continue
Extend existing video clips
seamlessly while maintaining style consistency
AI Video Edit
Edit videos with natural language instructions
โ change style, background, or mood
Audio to Video
Drive video generation with audio files
โ sync visuals to beats and rhythm
Reference to Video
Up to 5 image/video references +
voice for character and style consistency
Multi-Language Prompts
Supports both English and Chinese
prompts with intelligent prompt extension
TOOLS & DEMOS
WAN 2.7 in Action
Each mode shown with its input, prompt, and output. Click any card to try it yourself.
WAN 2.7 Technical Specifications
720p & 1080p Output
Dual resolution options for
both speed-optimized and quality-focused workflows
10s & 15s Duration
Configurable video length with
duration-based pricing for cost control
Native Audio Generation
Built-in audio synthesis that
matches generated visuals automatically
Prompt Extension
Intelligent prompt rewriting that expands
brief descriptions into detailed scenes
Fast Generation
Optimized pipeline delivers results in
2-5 minutes depending on settings
Character Consistency
R2V mode maintains character
identity across multiple reference materials
WAN 2.7 Pricing
Credits start from $2.99 ยท View all plans
Frequently Asked Questions About WAN 2.7
Start Creating with WAN 2.7
7 generation modes, 720p/1080p output, native audio. The most versatile AI video model available on VO3 AI.
WAN 2.7 AI video generator | Alibaba WAN 2.7 model | WAN 2.7 text to video | WAN 2.7 image to video | WAN 2.7 video continue | WAN 2.7 video edit | WAN 2.7 audio to video | WAN 2.7 reference to video | AI video generator 2026 | VO3 AI WAN 2.7






