Google has a new image generation model to show you.
Credit: Google
The pace of AI progress is showing no signs of slacking. Following ChatGPT’s big image upgrade a few weeks ago, it’s now Google’s turn to show off new models for generating videos and images from text prompts: We have Veo 3 (for video) and Imagen 4 (for images), announced during Google I/O 2025, and they come with some significant improvements.
Starting with Veo 3, it’s the next step up from the Veo 2 model that was pushed out to paying Gemini subscribers last month. Google says Veo 3 brings with it notable improvements in real-world physics (something AI video often struggles with) and details such as lip-syncing. In short: Your clips should look more realistic than ever.
There’s another important upgrade here, and that’s sound. Previously, Veo-made clips came without any audio attached, but the AI is now smart enough to add in suitable ambient sounds, including traffic noise, wildlife sounds, and even dialogue between characters.
Google has provided a few example videos to show off the new capabilities, as you’d expect, including Old Sailor. Of course, it’s impressive that a clip like this can be produced from a text prompt, and it’s up to a high standard in terms of realism: We’re not getting the six-fingered hands that we used to with AI.
Still, the usual hallmarks of artificial intelligence are evident: It’s a generic sailor, on a generic sea, speaking generic dialogue about the ocean. It’s a mashing together and averaging out of every video of the sea and old sailors that Veo 3 has been trained on, and may or may not match the original prompt (which Google hasn’t given).
Veo 3 is only available to those brave enough to pay $250 a month for Google’s AI Ultra plan, but Veo 2 is also getting some upgrades for those of us paying a tenth of that for AI Pro. It’s now better at control and consistency, according to Google, with improved camera movements and outpainting (expanding the view of a frame). It can also have a go at adding and removing objects from clips now.
Moving on to images: We have Imagen 4, the successor to Imagen 3. Here, we’re promised “remarkable clarity in fine details like intricate fabrics, water droplets, and animal fur,” plus support for higher resolutions (up to 2K) and more aspect ratios. You get top-tier results in both photorealistic and abstract styles, as per Google.

There are sheep as big as tractors in Google’s AI world.
Credit: Google
Google has also tackled one of the main problems with AI image generation, which is typography. Imagen 4 is apparently much better than the models that came before it in terms of making characters and words look cohesive and accurate, without any weird spellings or letters that dissolve into unintelligible hieroglyphics.
Imagen 4 is available now to all users, inside the Gemini app. Google hasn’t mentioned any usage limits, though presumably if you don’t have a subscription you’ll hit those limits more quickly, as is the case with Imagen 3 (there’s no fixed quota for these limits, and it seems they depend on general demand on Google’s AI infrastructure).
The carefully curated samples Google has provided look good, without any obvious errors or inaccuracies, just the usual AI sheen. Imagen 4 is faster than Imagen 3 too, Google says, with more improvements on the way: A variant of the model that’s 10x faster than Imagen 3 is going to be launching soon.
There’s one more image and video tool to talk about: Flow. It’s an AI filmmaking tool from Google that pulls together its text, video, and image models to help you stitch together successive scenes that are consistent, featuring the same characters and locations. You can use Flow if you’re an AI Pro or AI Ultra subscriber, with higher usage limits and better models for those on the more expensive plan.