Wan 2.6 is a multimodal model that generates videos and images from text descriptions. The new version can reuse characters from reference videos and build multi-shot narratives with synchronized audio and video.
Wan 2.6 introduces multimodal generation that combines video, image and text. Its headline feature, Starring, places characters from reference videos into new scenes while keeping their appearance and voice consistent. The system analyzes up to 150 reference frames to preserve the characters' look and voice timbre, and it supports up to three simultaneous references so that multiple entities can interact in the same scene.
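The limits quoted for Starring can be illustrated with a minimal pre-processing sketch. It is not Wan 2.6's actual interface, which the announcement does not describe: `ReferenceClip`, `sample_frame_indices` and `plan_references` are hypothetical names, and the sketch assumes the 150-frame cap applies to each reference separately and that frames are sampled at even intervals.

```python
from dataclasses import dataclass

# Limits taken from the announcement; how they are enforced is an assumption.
MAX_REFERENCES = 3          # "up to three simultaneous references"
MAX_REFERENCE_FRAMES = 150  # "up to 150 reference frames" (assumed per reference)


@dataclass
class ReferenceClip:
    """Hypothetical description of one reference video."""
    name: str
    frame_count: int


def sample_frame_indices(frame_count: int, limit: int = MAX_REFERENCE_FRAMES) -> list[int]:
    """Pick at most `limit` evenly spaced frame indices from a clip."""
    if frame_count <= 0:
        raise ValueError("frame_count must be positive")
    if frame_count <= limit:
        # Short clip: every frame fits within the cap.
        return list(range(frame_count))
    step = frame_count / limit
    return [int(i * step) for i in range(limit)]


def plan_references(clips: list[ReferenceClip]) -> dict[str, list[int]]:
    """Check the reference count and choose frames for each clip."""
    if not clips:
        raise ValueError("at least one reference clip is required")
    if len(clips) > MAX_REFERENCES:
        raise ValueError(f"at most {MAX_REFERENCES} references are supported")
    return {clip.name: sample_frame_indices(clip.frame_count) for clip in clips}


if __name__ == "__main__":
    plan = plan_references([ReferenceClip("hero", 300), ReferenceClip("sidekick", 90)])
    for name, indices in plan.items():
        print(name, len(indices), "frames selected")
```

Even sampling is only one reasonable choice here; a real pipeline might instead prefer frames where the character's face is clearly visible.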
The multi-shot narrative function converts simple prompts into structured video sequences, keeping characters, settings and atmosphere consistent across shots. This makes it possible to tell more complex stories than single-shot generation allows.
For video generation, Wan 2.6 produces 15-second clips at 1080p resolution with native audio-video synchronization. The system generates multi-speaker dialogue and natural lip-sync, with audio quality described as comparable to professional studio output. Compared with previous versions, it improves instruction following, motion physics and aesthetic control.
For image synthesis, the model offers control over lens and lighting parameters and can reference multiple images to keep the aesthetic consistent. The interleaved text-image generation function creates structured visual narratives that alternate between both formats, drawing on real-world knowledge and reasoning capabilities.
The model is designed for applications requiring visual and narrative coherence in multimedia content generation, from creating scenes with specific characters to producing sequences with complete narrative structure.