Currently, Google’s VideoFX can be accessed just by a limited number of users, but the company is planning to expand the users’s number later this week.
So, the new video model Veo 2 is able to generate videos based on a text prompt or based on the given image. According to Google DeepMind, this new video AI model features different advanced capabilities compared with the first model and is able to generate a clearer video image.
Also, DeepMind stated that the Veo 2 video model can generate some more realistic motions and more fluid actions along with more realistic shadows and reflections.
It’s important to know that the Veo 2 video model has been trained on a vast number of videos because this is a common method in the AI journey of many artificial intelligence-based models. By analyzing various examples of data, they are able to learn patterns from the provided data in order to generate new content.
“Veo has been trained on high-quality video-description pairings. Video-description pairs are a video and associated description of what happens in that video.”, Eli Collins the VP of product at Google DeepMind stated.
In addition to Veo 2, DeepMind has also introduced a new upgrade for its commercial image generation AI model called Imagen 3. So, now all ImageFX users are able to create images with better quality and also in different styles such as anime, photorealism, or impressionism.
But the quality upgrade is not the only release, as Google made some changes to the UI interface as well, and now ImageFX users will see that their type prompts will be transformed into “chiplets” together with drop-down menus suggesting related words. These new “chiplets” allow users to improve their prompts or even choose from different auto-generated descriptions.
“This upgrade [to Imagen 3] also follows prompts more faithfully, and renders richer details and textures,”, DeepMind stated.