Google takes on Sora with new AI video generator Veo


Since OpenAI unveiled its Sora generative AI video creation model earlier this year, nothing has come close in terms of sheer realism and quality of AI-generated motion visuals. Until now.

Amid the flurry of announcements at its annual I/O developer conference, Google today unveiled a new generative AI video model called Veo, made by researchers at its famed DeepMind AI division.

Google Veo is a generative AI video model capable of creating “high-quality, 1080p clips that can go beyond 60 seconds,” Google posted from its DeepMind account on the social network X. “From photorealism to surrealism and animation, it can tackle a range of cinematic styles.”

On its product page, Google says its goal with Veo is to “help create tools that make video production accessible to everyone. Whether you’re a seasoned filmmaker, aspiring creator, or educator looking to share knowledge, Veo unlocks new possibilities for storytelling, education and more.” The model supports text-to-video, video-to-video, and image-to-video transformations.


Google partnered with polymath artist Donald Glover, a.k.a. Childish Gambino, creator of the hit FX series Atlanta and a film and TV star to boot, to test some new capabilities through his creative studio, Gilga, using Google’s new Veo AI video generator.

As further evidence that Google Veo can produce stunning videos from its underlying AI model, DeepMind posted a number of them, along with their prompts, on its YouTube page and X account, including a neon city, realistic jellyfish swimming in the ocean…

✍️ Prompt: “Many spotted jellyfish pulsating under water. Their bodies are transparent and glowing in deep ocean.” pic.twitter.com/y9SmNd8NK0

— Google DeepMind (@GoogleDeepMind) May 14, 2024

Cowboys riding horses, spaceships traversing the void, and realistic human scenes…

✍️ Prompt: “A lone cowboy rides his horse across an open plain at beautiful sunset, soft light, warm colors.” pic.twitter.com/D8uKDZVWto

— Google DeepMind (@GoogleDeepMind) May 14, 2024

✍️ Prompt: “A woman sitting alone in a dimly lit cafe, a half-finished novel open in front of her. Film noir aesthetic, mysterious atmosphere. Black and white.” pic.twitter.com/vFVXr4Cvxi

— Google DeepMind (@GoogleDeepMind) May 14, 2024

The results are nearly indistinguishable from live action or skilled computer-generated animation, all made with text prompts.

According to a blog post by Eli Collins, Google VP of Product Management, and Senior Research Director Douglas Eck, Veo “provides an unprecedented level of creative control, and understands cinematic terms like ‘timelapse’ or ‘aerial shots of a landscape.’”

In addition, Veo can easily and quickly make high-quality edits to AI-generated videos or a user’s uploaded clips, even pre-recorded live-action footage, from text prompts, according to Google’s Veo product page.

“When given both an input video and editing command, like adding kayaks to an aerial shot of a coastline, Veo can apply this command to the initial video and create a new, edited video,” the company writes.

Further, Google says that Veo can achieve consistency between video frames, avoiding some of the bizarre and unsettling transformations and artifacts seen even in Sora. Veo does this by relying on “cutting-edge latent diffusion transformers” which “reduce the appearance of these inconsistencies, keeping characters, objects and styles in place, as they would in real life.”

To improve the results, Google “added more details to the captions of each video in its training data.” It continues: “And to further improve performance, the model uses high-quality, compressed representations of video (also known as latents) so it’s more efficient too. These steps improve overall quality and reduce the time it takes to generate videos.”

Google says all Veo videos are embedded with SynthID, its watermarking tool for tracking content credentials, ensuring they can be identified as AI-generated by discerning parties.

The model is said to be the culmination of years of research at DeepMind, building upon earlier advances including Generative Query Network (GQN), DVD-GAN, Imagen-Video, Phenaki, WALT, VideoPoet and Lumiere.

Unfortunately, Google is not making it public just yet. Instead, following in the mold set by OpenAI with Sora (which still remains unreleased to the public), Google wrote that it’s “available to select creators in private preview in VideoFX by joining our waitlist. In the future, we’ll also bring some of Veo’s capabilities to YouTube Shorts and other products.”
