ElevenLabs integrates image and video generation into its audio platform

17/11/2025

ElevenLabs has introduced Image & Video, a beta feature that integrates visual generation models into its platform. Users can create images and videos, add voices and music, and export final content in a single workflow.

ElevenLabs integrates image and video generation into its audio platform

ElevenLabs has announced the launch of Image & Video in beta version, a new feature that incorporates visual generation models into its audio-specialized platform. The tool allows users to create visual and audio content from the same work environment.

The platform integrates several models for image creation, including Nanobanana, Flux Kontext, GPT Image, and Seedream. These images can be used as storyboards, thumbnails, or source material for video projects.

For video generation, ElevenLabs has incorporated models such as Veo, Sora, Kling, Wan, and Seedance. Users can refine results, compose multiple clips to create narratives, and apply quality improvements through upscaling functions. The platform also allows adding lip sync to generated videos using ElevenLabs voices.

Once visual elements are created, users can export them to Studio, the ElevenLabs editing environment. In Studio, it is possible to add narrations with voices from the library or custom clones, compose specific background music, and add sound effects. The system includes a unified timeline to adjust synchronization and refine narration before exporting final content.

The company has oriented this feature toward content creators, marketing teams, and educational producers. According to ElevenLabs, the tool allows creating product videos, social media content, and educational materials from ideation to final export.

This launch represents ElevenLabs' expansion beyond its audio specialization, integrating visual generation, editing, and sound production capabilities into a single platform. The Image & Video feature is available on the ElevenLabs Creative platform in its beta phase.

Key points

  • ElevenLabs launches Image & Video in beta version to create visual and audio content
  • Integrates image models such as Nanobanana, Flux Kontext, GPT Image, and Seedream
  • Includes video models such as Veo, Sora, Kling, Wan, and Seedance
  • Allows adding lip sync with ElevenLabs voices
  • Studio offers editing with voices, custom music, and sound effects
  • Unified timeline allows adjusting synchronization and narration
  • Aimed at creators, marketing teams, and educational producers
  • Available on the ElevenLabs Creative platform

Videos

Related AI

ElevenLabs

Generative Voice AI

Explore the most advanced text to speech and voice cloning software ever. Create lifelike voiceovers for your content or use our AI voice generator as an easy-to-use text ...

Lastest news

Trustpilot
This website uses technical, personalization and analysis cookies, both our own and from third parties, to facilitate anonymous browsing and analyze website usage statistics. We consider that if you continue browsing, you accept their use.