Google has taken a significant step forward in AI-based audiovisual content generation by introducing Veo 2 and Imagen 3, two models for creating realistic videos and images.
The new versions of Google's generative models mark a turning point in digital content creation. Veo 2 stands out for its ability to create videos at up to 4K resolution with an improved understanding of real-world physics and human movement, while also allowing precise control of cinematographic aspects such as lens type and visual effects.
The Imagen 3 model has also undergone significant improvements, now offering brighter and better-composed images, with the ability to reproduce various artistic styles with greater accuracy, from photorealism to anime. Google has also implemented an invisible SynthID watermark on all creations to identify them as AI-generated.
As a complement to these updates, the company has introduced Whisk, a new experimental tool that combines Imagen 3 with Gemini's visual understanding capabilities. This integration allows users to mix and modify existing images to create new personalized designs.
The new models are available through VideoFX and ImageFX in Google Labs, with plans to expand access to YouTube Shorts and other Google products next year. The company says it remains committed to responsible development and is rolling the models out gradually to ensure the quality and safety of these technologies.