{"id":4860,"date":"2024-05-15T10:16:34","date_gmt":"2024-05-15T10:16:34","guid":{"rendered":"https:\/\/favtutor.com\/articles\/?p=4860"},"modified":"2024-05-15T10:16:36","modified_gmt":"2024-05-15T10:16:36","slug":"google-veo-access","status":"publish","type":"post","link":"https:\/\/favtutor.com\/articles\/google-veo-access\/","title":{"rendered":"Meet Veo: Google&#8217;s Hot New Text-to-Video AI (How to Access?)"},"content":{"rendered":"\n<p class=\"wp-block-paragraph\">At the Google I\/O 2024 developer conference, Google introduced Veo, its generative text-to-video competitor to <a href=\"https:\/\/favtutor.com\/articles\/sora-ai-video-generator-openai\/\">OpenAI\u2019s Sora<\/a>. Let&#8217;s find out more about it!<\/p>\n\n\n\n<p class=\"wp-block-paragraph\"><strong>Highlights<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Veo is Google DeepMind&#8217;s text-to-video generative model that can create high-quality, 1080p videos over 60 seconds long.<\/li>\n\n\n\n<li>It can seamlessly blend text prompts with reference images to generate videos that follow both inputs.<\/li>\n\n\n\n<li>It supports video editing by incorporating text instructions, including masked editing for specific areas.<\/li>\n<\/ul>\n\n\n\n<h2 class=\"wp-block-heading\"><strong>Introducing Veo by Google<\/strong><\/h2>\n\n\n\n<p class=\"wp-block-paragraph\">Veo is Google DeepMind\u2019s text-to-video generative model, setting a new benchmark in the field of video generation.<strong> Veo boasts the capability to generate high-quality, 1080p resolution videos lasting over a minute, spanning a diverse range of cinematic and visual styles.<\/strong><\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Along with video creation, it is also able to edit existing videos by incorporating text-based instructions thus modifying the videos as per the needs of the user.<\/p>\n\n\n\n<figure class=\"wp-block-embed is-type-video is-provider-youtube wp-block-embed-youtube wp-embed-aspect-16-9 wp-has-aspect-ratio\"><div class=\"wp-block-embed__wrapper\">\n<div class=\"jeg_video_container jeg_video_content\"><iframe title=\"Google DeepMind&#039;s text-to-video model Veo creates 60 second video\" width=\"500\" height=\"281\" src=\"https:\/\/www.youtube.com\/embed\/diqmZs1aD1g?feature=oembed\" frameborder=\"0\" allow=\"accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share\" referrerpolicy=\"strict-origin-when-cross-origin\" allowfullscreen><\/iframe><\/div>\n<\/div><\/figure>\n\n\n\n<p class=\"wp-block-paragraph\">Veo&#8217;s versatility extends to generating videos using both images and text prompts. By inputting a reference image alongside a text prompt, It seamlessly blends the visual style of the image with the instructions from the prompt, producing a breathtaking video that is based on both inputs.<\/p>\n\n\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-full\"><img decoding=\"async\" width=\"1000\" height=\"562\" src=\"https:\/\/favtutor.com\/articles\/wp-content\/uploads\/2024\/05\/image4-5.png\" alt=\"\" class=\"wp-image-4865\" srcset=\"https:\/\/favtutor.com\/articles\/wp-content\/uploads\/2024\/05\/image4-5.png 1000w, https:\/\/favtutor.com\/articles\/wp-content\/uploads\/2024\/05\/image4-5-768x432.png 768w, https:\/\/favtutor.com\/articles\/wp-content\/uploads\/2024\/05\/image4-5-750x422.png 750w\" sizes=\"(max-width: 1000px) 100vw, 1000px\" \/><figcaption class=\"wp-element-caption\">(<a href=\"https:\/\/deepmind.google\/technologies\/veo\/\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">Source<\/a>)<\/figcaption><\/figure>\n<\/div>\n\n\n<p class=\"wp-block-paragraph\">To enhance Veo&#8217;s ability to comprehend and adhere to prompts precisely, Google DeepMind enriched the training data with more detailed video captions. Additionally, the model utilizes high-quality, compressed video representations known as latent which will help to boost efficiency. These measures collectively improve overall video quality and reduce generation time.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\"><strong>Versatile Features of Veo<\/strong><\/h3>\n\n\n\n<p class=\"wp-block-paragraph\">Veo uses advanced natural language processing and visual semantics to accurately capture the details and tones specified in text prompts, rendering intricate details within complex scenes. It offers creative control, comprehending prompts for various cinematic effects, such as time-lapses, close-ups, or aerial shots of landscapes.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\"><strong>Time-lapse:<\/strong><\/p>\n\n\n\n<div align=\"center\"><<blockquote class=\"twitter-tweet\" data-conversation=\"none\" data-media-max-width=\"560\"><p lang=\"en\" dir=\"ltr\">\u270d\ufe0f Prompt: \u201cTimelapse of a water lily opening, dark background.\u201d <a href=\"https:\/\/t.co\/t5uLQ89E1Y\" target=\"_blank\">pic.twitter.com\/t5uLQ89E1Y<\/a><\/p>&mdash; Google DeepMind (@GoogleDeepMind) <a href=\"https:\/\/twitter.com\/GoogleDeepMind\/status\/1790435828709126264?ref_src=twsrc%5Etfw\" target=\"_blank\" rel=\"noopener\">May 14, 2024<\/a><\/blockquote> <script async src=\"https:\/\/platform.twitter.com\/widgets.js\" charset=\"utf-8\"><\/script><\/div>\n\n\n\n<p class=\"wp-block-paragraph\"><strong>Close-up:<\/strong><\/p>\n\n\n\n<div align=\"center\"><blockquote class=\"twitter-tweet\" data-media-max-width=\"560\"><p lang=\"en\" dir=\"ltr\">\u270d\ufe0f Prompt: \u201cExtreme close-up of chicken and green pepper kebabs grilling on a barbeque with flames. Shallow focus and light smoke. vivid colours.\u201d <a href=\"https:\/\/t.co\/LDHC8XGyJA\" target=\"_blank\">pic.twitter.com\/LDHC8XGyJA<\/a><\/p>&mdash; Google DeepMind (@GoogleDeepMind) <a href=\"https:\/\/twitter.com\/GoogleDeepMind\/status\/1790435838779642086?ref_src=twsrc%5Etfw\" target=\"_blank\" rel=\"noopener\">May 14, 2024<\/a><\/blockquote> <script async src=\"https:\/\/platform.twitter.com\/widgets.js\" charset=\"utf-8\"><\/script><\/div>\n\n\n\n<p class=\"wp-block-paragraph\"><strong>Veo&#8217;s cutting-edge technology extends beyond generating videos from scratch. It can seamlessly edit existing videos by incorporating text-based instructions, including adding or modifying specific elements within a scene. <\/strong><\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Additionally, It supports masked editing, enabling targeted changes within designated areas of the video. The example below shows how the videos can be edited as per requirements.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\"><strong>Initial: Prompt:<\/strong> Drone shot along the Hawaii jungle coastline, sunny day<\/p>\n\n\n\n<div align=\"center\"><blockquote class=\"twitter-tweet\" data-media-max-width=\"560\"><p lang=\"en\" dir=\"ltr\">2. Prompt: Drone shot along the Hawaii jungle coastline, sunny day <a href=\"https:\/\/t.co\/2yU7h8BSSL\" target=\"_blank\">pic.twitter.com\/2yU7h8BSSL<\/a><\/p>&mdash; Jv Shah (@JvShah124) <a href=\"https:\/\/twitter.com\/JvShah124\/status\/1790632940319195283?ref_src=twsrc%5Etfw\" target=\"_blank\" rel=\"noopener\">May 15, 2024<\/a><\/blockquote> <script async src=\"https:\/\/platform.twitter.com\/widgets.js\" charset=\"utf-8\"><\/script><\/div>\n\n\n\n<p class=\"wp-block-paragraph\"><strong>New:<\/strong> Drone shot along the Hawaii jungle coastline, sunny day. Kayaks in the water<\/p>\n\n\n\n<div align=\"center\"><blockquote class=\"twitter-tweet\" data-media-max-width=\"560\"><p lang=\"en\" dir=\"ltr\">11) Drone shot along the Hawaii jungle coastline, sunny day. Kayaks in the water <a href=\"https:\/\/t.co\/xyymAC0aXI\" target=\"_blank\">pic.twitter.com\/xyymAC0aXI<\/a><\/p>&mdash; Allen T (@Mr_AllenT) <a href=\"https:\/\/twitter.com\/Mr_AllenT\/status\/1790453678228361720?ref_src=twsrc%5Etfw\" target=\"_blank\" rel=\"noopener\">May 14, 2024<\/a><\/blockquote> <script async src=\"https:\/\/platform.twitter.com\/widgets.js\" charset=\"utf-8\"><\/script><\/div>\n\n\n\n<p class=\"wp-block-paragraph\">Veo&#8217;s advanced latent diffusion transformers address the problem of visual consistency and fluidity throughout the generated videos, preventing flickering, jumping, or morphing of characters, objects, and styles between frames, thereby enhancing the overall viewing experience.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">It can generate video clips exceeding 60 seconds, either from a single prompt or by stitching together a sequence of prompts that collectively narrate a story.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Using its remarkable capabilities, It aims to democratize video production, empowering seasoned filmmakers, aspiring creators, and educators alike to unleash their storytelling potential and share knowledge through captivating visuals.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">The tweet below shows how filmmakers can use Veo to bring ideas to life that would otherwise not be possible to implement.<\/p>\n\n\n\n<div align=\"center\"><blockquote class=\"twitter-tweet\"><p lang=\"en\" dir=\"ltr\">We put our cutting-edge video generation model Veo in the hands of filmmaker <a href=\"https:\/\/twitter.com\/donaldglover?ref_src=twsrc%5Etfw\" target=\"_blank\" rel=\"noopener\">@DonaldGlover<\/a> and his creative studio, Gilga.<br><br>Let\u2019s take a look. \u2193 <a href=\"https:\/\/twitter.com\/hashtag\/GoogleIO?src=hash&amp;ref_src=twsrc%5Etfw\" target=\"_blank\" rel=\"noopener\">#GoogleIO<\/a> <a href=\"https:\/\/t.co\/oNLDq1YlHC\" target=\"_blank\">pic.twitter.com\/oNLDq1YlHC<\/a><\/p>&mdash; Google DeepMind (@GoogleDeepMind) <a href=\"https:\/\/twitter.com\/GoogleDeepMind\/status\/1790436633478893574?ref_src=twsrc%5Etfw\" target=\"_blank\" rel=\"noopener\">May 14, 2024<\/a><\/blockquote> <script async src=\"https:\/\/platform.twitter.com\/widgets.js\" charset=\"utf-8\"><\/script><\/div>\n\n\n\n<h3 class=\"wp-block-heading\"><strong>How to access Veo?<\/strong><\/h3>\n\n\n\n<p class=\"wp-block-paragraph\">Like OpenAI&#8217;s Sora, Google&#8217;s Veo is not available to the public just yet. Currently, it is being shared with a select number of creators in a private preview inside VideoFX, their new experimental tool. Users can join a waitlist if they are interested in trying out Veo\u2019s capabilities. <a href=\"https:\/\/deepmind.google\/technologies\/veo\/\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">Click here<\/a> to apply for access to Veo.<\/p>\n\n\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-full\"><img decoding=\"async\" width=\"1000\" height=\"476\" src=\"https:\/\/favtutor.com\/articles\/wp-content\/uploads\/2024\/05\/image1-5.png\" alt=\"\" class=\"wp-image-4867\" srcset=\"https:\/\/favtutor.com\/articles\/wp-content\/uploads\/2024\/05\/image1-5.png 1000w, https:\/\/favtutor.com\/articles\/wp-content\/uploads\/2024\/05\/image1-5-768x366.png 768w, https:\/\/favtutor.com\/articles\/wp-content\/uploads\/2024\/05\/image1-5-750x357.png 750w\" sizes=\"(max-width: 1000px) 100vw, 1000px\" \/><\/figure>\n<\/div>\n\n\n<p class=\"wp-block-paragraph\">Once you click on the signup button, you will be <a href=\"https:\/\/aitestkitchen.withgoogle.com\/tools\/video-fx\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">redirected here<\/a>. Users can now join the waitlist on Google Labs to try some of its features in VideoFX.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Once you click on Sign in with Google, it will redirect you to login to Google Labs.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">After this, you can join the waitlist by filling out the <a href=\"https:\/\/docs.google.com\/forms\/d\/e\/1FAIpQLSeC6n1KQlaqRNUGNuNRt5Q7YeoyXsq828niw2ZvIoAtW1FtYQ\/viewform?resourcekey=0-qDKZCeB4G9nS9dttXGdnHQ\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">Google Form<\/a>. For now, Veo is only available in a few countries. You can search for your country in the dropdown provided to know whether it is available or not.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\"><strong>Conclusion<\/strong><\/h2>\n\n\n\n<p class=\"wp-block-paragraph\">With its impressive capabilities, Veo is shaping up to be a strong contender to OpenAI&#8217;s groundbreaking text-to-video model Sora. It aims to empower creators and educators and its potential to democratize video production is highly anticipated.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Google revealed its text-to-video AI tool Veo, that can create 1080p videos over 60 seconds long.. Also, how to access Veo?<\/p>\n","protected":false},"author":18,"featured_media":4869,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"jnews-multi-image_gallery":[],"jnews_single_post":null,"jnews_primary_category":{"id":"","hide":""},"footnotes":""},"categories":[57],"tags":[56,59,58,62,259],"class_list":["post-4860","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-ai","tag-ai","tag-generative-ai","tag-google","tag-sora","tag-veo"],"_links":{"self":[{"href":"https:\/\/favtutor.com\/articles\/wp-json\/wp\/v2\/posts\/4860","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/favtutor.com\/articles\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/favtutor.com\/articles\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/favtutor.com\/articles\/wp-json\/wp\/v2\/users\/18"}],"replies":[{"embeddable":true,"href":"https:\/\/favtutor.com\/articles\/wp-json\/wp\/v2\/comments?post=4860"}],"version-history":[{"count":2,"href":"https:\/\/favtutor.com\/articles\/wp-json\/wp\/v2\/posts\/4860\/revisions"}],"predecessor-version":[{"id":4870,"href":"https:\/\/favtutor.com\/articles\/wp-json\/wp\/v2\/posts\/4860\/revisions\/4870"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/favtutor.com\/articles\/wp-json\/wp\/v2\/media\/4869"}],"wp:attachment":[{"href":"https:\/\/favtutor.com\/articles\/wp-json\/wp\/v2\/media?parent=4860"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/favtutor.com\/articles\/wp-json\/wp\/v2\/categories?post=4860"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/favtutor.com\/articles\/wp-json\/wp\/v2\/tags?post=4860"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}