Fueling the Future of Art: Unleashing Creativity with New Generative Media Tools
- Google introduces groundbreaking generative media models—Veo 3, Imagen 4, and Lyria 2—alongside Flow, an innovative AI filmmaking tool, to empower artists with unprecedented creative control.
- These tools offer advanced capabilities like video with audio, stunning image precision, powerful music composition, and cinematic storytelling, making artistic expression more accessible.
- With a focus on responsible AI development, Google collaborates with creative industries and implements tools like SynthID to ensure ethical use and prevent misinformation.
In an era where technology and art increasingly intertwine, we at Google are unveiling a suite of generative media models and tools designed to spark creativity across artistic domains. Today, we’re announcing Veo 3 and Imagen 4, our latest video and image generation models, alongside expanded access to Lyria 2 for music creation and the introduction of Flow, an AI filmmaking tool. These advancements, developed in close partnership with filmmakers, musicians, artists, and YouTube creators, represent significant leaps forward in media generation. They push the boundaries of what’s possible and give everyone, from seasoned professionals to aspiring creators, access to powerful tools for bringing their visions to life.

Let’s start with Veo 3, our state-of-the-art video generation model and a major step forward from its predecessor, Veo 2. For the first time, Veo 3 can generate videos with integrated audio, capturing ambient sounds like traffic noise in a bustling city scene or birds chirping in a serene park, and even producing dialogue between characters. Beyond audio, it excels at following text and image prompts, rendering real-world physics, and lip-syncing accurately. Imagine writing a short story in a prompt and watching Veo 3 turn it into a vivid, lifelike clip. Veo 3 is available today for Google AI Ultra subscribers in the United States through the Gemini app and Flow, as well as for enterprise users on Vertex AI.
Meanwhile, we’ve enhanced Veo 2 with new features tailored for filmmakers and creators. Informed by direct feedback from the creative community, these updates include reference-powered video capabilities for consistent character and scene design, precise camera controls for movements like rotations and zooms, outpainting to adapt video frames for different screen sizes, and object add-and-remove functions that maintain realistic scale and shadows. These tools, already accessible in Flow and soon rolling out to the Vertex AI API and other products, offer filmmakers unparalleled control over their projects, ensuring every shot aligns perfectly with their vision.
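To make the outpainting idea concrete, here is a minimal Python sketch of the geometry only: given a frame and a target aspect ratio, it computes the smallest canvas that contains the original frame and how many pixels of new content would have to be generated. The function name `outpaint_canvas` and its parameters are hypothetical, chosen for illustration; this is not Veo's API and says nothing about how the model fills the new area.

```python
from fractions import Fraction
from math import ceil


def outpaint_canvas(width, height, target_ratio):
    """Return (canvas_w, canvas_h, pad_x, pad_y) for the smallest canvas
    with the target aspect ratio that still contains the original frame.

    pad_x / pad_y are the total extra pixels (both sides combined) that an
    outpainting step would need to fill. Hypothetical helper, not a real API.
    """
    ratio = Fraction(*target_ratio)  # e.g. (16, 9) -> 16/9, exact arithmetic
    if Fraction(width, height) < ratio:
        # Frame is narrower than the target: keep the height, widen the canvas.
        canvas_h = height
        canvas_w = ceil(height * ratio)
    else:
        # Frame is as wide as or wider than the target: keep the width, add height.
        canvas_w = width
        canvas_h = ceil(width / ratio)
    return canvas_w, canvas_h, canvas_w - width, canvas_h - height


# A 4:3 frame adapted to a 16:9 screen needs 480 extra pixels of width.
print(outpaint_canvas(1440, 1080, (16, 9)))  # (1920, 1080, 480, 0)
# A 16:9 frame adapted to a square format needs 840 extra pixels of height.
print(outpaint_canvas(1920, 1080, (1, 1)))   # (1920, 1920, 0, 840)
```

Using exact fractions avoids floating-point rounding when comparing aspect ratios; rounding up ensures the canvas never crops the original frame.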
Complementing our video advancements is Flow, an AI filmmaking tool built with Google DeepMind’s most advanced models, including Veo, Imagen, and Gemini. Designed specifically for creatives, Flow enables users to craft cinematic clips and stories using natural language prompts. It provides a centralized platform to manage elements like cast, locations, objects, and styles, weaving them into cohesive, beautiful narratives. Currently available for Google AI Pro and Ultra plan subscribers in the U.S., with plans for international expansion, Flow is poised to become an indispensable asset for visual storytellers eager to explore the possibilities of AI in cinema.
Turning to image generation, Imagen 4 sets a new standard for quality and versatility. The model renders intricate details such as delicate fabrics, glistening water droplets, and lifelike animal fur with remarkable clarity, and it excels in both photorealistic and abstract styles. With support for a range of aspect ratios and resolutions up to 2K, Imagen 4 is well suited to high-quality prints and presentations. Its typography and spelling are also improved, which makes it a strong choice for designing greeting cards, posters, and comics. Imagen 4 is available now in the Gemini app, Whisk, and Vertex AI, and across Google Workspace tools like Slides, Vids, and Docs. A fast variant, up to ten times quicker than Imagen 3, is coming soon to accelerate the creative process even further.
For musicians and producers, Lyria 2 continues to break new ground in music composition and exploration. In April, we expanded access to the Music AI Sandbox, which is powered by Lyria 2 and provides experimental tools that spark new musical ideas for artists. Lyria 2 is now also accessible through YouTube Shorts and, for enterprises, via Vertex AI, offering powerful composition features. Additionally, Lyria RealTime, which powers MusicFX DJ, is available through an API and in AI Studio, enabling interactive, real-time music generation and performance. Throughout, feedback from the music industry has guided our work so that these tools truly empower artists as they experiment and perform.

At the heart of our mission lies a commitment to responsible AI development. Since its launch in 2023, SynthID has watermarked over 10 billion pieces of content (images, videos, audio, and text), helping to identify AI-generated material and combat misinformation. Outputs from Veo 3, Imagen 4, and Lyria 2 will continue to carry these watermarks. We’re also introducing SynthID Detector, a verification portal where users can upload content and check whether it, or any part of it, contains a SynthID watermark. This work reflects our commitment to ethical creation: as we unleash human creativity, we do so with integrity and transparency.
Collaboration with the creative community has been instrumental in shaping these tools. By working closely with industry professionals, we’ve tailored our models to meet real-world needs, ensuring they serve as genuine enablers of artistic expression. Whether you’re a filmmaker crafting a cinematic masterpiece with Flow, a designer creating breathtaking visuals with Imagen 4, a musician exploring new sounds with Lyria 2, or a storyteller bringing narratives to life with Veo 3, these tools are designed to amplify your creativity. They break down barriers, making it faster and easier than ever to transform ideas into reality.
As we look to the future, Google remains committed to pushing the frontier of generative media while fostering a responsible and inclusive creative ecosystem. These models and tools are just the beginning of what’s possible when technology and human imagination converge. We invite artists and creators everywhere to explore these innovations, to experiment, and to redefine the boundaries of art. Together, let’s fuel the future of creativity and build a world where every vision can find its voice.