{"id":5481,"date":"2024-06-08T06:40:22","date_gmt":"2024-06-08T06:40:22","guid":{"rendered":"https:\/\/favtutor.com\/articles\/?p=5481"},"modified":"2024-06-08T06:40:46","modified_gmt":"2024-06-08T06:40:46","slug":"kling-ai-video-outputs","status":"publish","type":"post","link":"https:\/\/favtutor.com\/articles\/kling-ai-video-outputs\/","title":{"rendered":"Kling AI challenges SORA with Jaw-Dropping Video Outputs"},"content":{"rendered":"\n<p>While the whole world is waiting for the launch of <a href=\"https:\/\/favtutor.com\/articles\/sora-ai-video-generator-openai\/\" target=\"_blank\" data-type=\"link\" data-id=\"https:\/\/favtutor.com\/articles\/sora-ai-video-generator-openai\/\" rel=\"noreferrer noopener nofollow\">SORA AI<\/a>, this AI model has taken the world by storm lately with its impressive capabilities. This model is even up for open access and many developers are stating it produces far better videos than SORA.<\/p>\n\n\n\n<p><strong>Highlights:<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Chinese video platform company Kuaishou announces Kling, a powerful text-to-video generative AI model.<\/li>\n\n\n\n<li>Built upon the diffusion architecture with powerful 3D VAE technology.<\/li>\n\n\n\n<li>Can generate videos for up to 2 minutes while capturing bodily movements, video outputs, aspect ratios and much more.<\/li>\n<\/ul>\n\n\n\n<h2 class=\"wp-block-heading\"><strong>Kling AI: SORA\u2019s Chinese Competitor<\/strong><\/h2>\n\n\n\n<p>On 7<sup>th<\/sup> June 2024, the Chinese AI company Kuaishou <a href=\"https:\/\/kling.kuaishou.com\/\" data-type=\"link\" data-id=\"https:\/\/kling.kuaishou.com\/\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">announced<\/a> their latest text-to-video generating model called Kling AI.<\/p>\n\n\n\n<p><strong>The Kuaishou Big Model Team created Kling, a model for creating videos. Its strong video creation features enable users to quickly and simply produce artistic videos. This AI model is very impressive and can generate videos of up to 2 minutes!<\/strong><\/p>\n\n\n\n<p>It distinguishes itself by accurately reproducing real-world physics while producing two-minute films in pristine 1080p resolution at 30 frames per second. We know many text-to-video-generating AI models are already present out there, but it is the physical simulations that catch our eyes.<\/p>\n\n\n\n<p>Catching up to dynamic real-time simulations in today\u2019s generative world is not an easy task. SORA AI showed us how perfectly it was trained to be efficient in replicating these mechanisms, and now Kling AI is also doing the same.<\/p>\n\n\n\n<p>Here\u2019s a video generated by Kling where you can see a Chinese man sitting at a table and eating noodles with chopsticks:<\/p>\n\n\n\n<div align=center><blockquote class=\"twitter-tweet\" data-media-max-width=\"560\"><p lang=\"en\" dir=\"ltr\">Sora by OpenAI is insane.<br><br>But KWAI just dropped a Sora-like model called KLING, and people are going crazy over it. <br><br>Here are 10 wild examples you don&#39;t want to miss: <br><br>1. A Chinese man sits at a table and eats noodles with chopsticks<a href=\"https:\/\/t.co\/MIV5IP3fyQ\" target=\"_blank\">pic.twitter.com\/MIV5IP3fyQ<\/a><\/p>&mdash; Angry Tom (@AngryTomtweets) <a href=\"https:\/\/twitter.com\/AngryTomtweets\/status\/1798777783952527818?ref_src=twsrc%5Etfw\" target=\"_blank\" rel=\"noopener\">June 6, 2024<\/a><\/blockquote> <script async src=\"https:\/\/platform.twitter.com\/widgets.js\" charset=\"utf-8\"><\/script><\/div>\n\n\n\n<p>This doesn\u2019t at all feel generated to me! Look at the movements and expressions, it\u2019s as if the man was recorded! Kling AI is doing wonders.<\/p>\n\n\n\n<p>Notably, it is not the first video generation model attempt by China. <a href=\"https:\/\/favtutor.com\/articles\/vidu-ai-text-to-video-generator\/\" data-type=\"link\" data-id=\"https:\/\/favtutor.com\/articles\/vidu-ai-text-to-video-generator\/\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">Vidu AI<\/a>, the nation&#8217;s first Sora version, created a stir earlier this year when it was able to produce 16-second films in crystal-clear 1080p.<\/p>\n\n\n\n<p>China&#8217;s AI revolution is gaining momentum, with Kling in the forefront, and rivals are finding it difficult to keep up with this quickly changing environment. It will be interesting to see how the competition between Kling and SORA AI unfolds.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\"><strong>How To Access Kling AI?<\/strong><\/h3>\n\n\n\n<p>Despite rumours of the model being up for open access, there is no public information as to how you can access Kling\u2019s Video Generating Model. It&#8217;s reportedly available for invited beta testers via the Kwaiying (KwaiCut) app as a demo, with possible free access to the model coming in the near future.<\/p>\n\n\n\n<p><strong>Here\u2019s what you can do instead:<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Download the Kwaiying (KwaiCut) mobile app on the <a href=\"https:\/\/play.google.com\/store\/apps\/details?id=com.kwai.editor\" data-type=\"link\" data-id=\"https:\/\/play.google.com\/store\/apps\/details?id=com.kwai.editor\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">Play Store<\/a> or <a href=\"https:\/\/apps.apple.com\/pl\/app\/%E5%BF%AB%E5%BD%B1\/id1195860596\" data-type=\"link\" data-id=\"https:\/\/apps.apple.com\/pl\/app\/%E5%BF%AB%E5%BD%B1\/id1195860596\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">App Store<\/a>.<\/li>\n\n\n\n<li>The app&#8217;s interface is in Chinese language, so be ready to use some translators.<\/li>\n\n\n\n<li>Check out the Kling AI video creation tool on the app. It&#8217;s good if you can use this feature. If not, select &#8220;Beta Testing Access&#8221; from your profile options.<\/li>\n<\/ul>\n\n\n\n<p>By doing this you can request access to Kling using the Mobile App, but you can also request access using your email by sending an email to this ID: kling@kuaishou.com. You must include your profile information and a brief explanation of your interest in testing this model as a beta tester.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\"><strong>Kling\u2019s Model Architecture<\/strong><\/h3>\n\n\n\n<p>Kling\u2019s Model Architecture is quite intricate and yet simple. Kling creates vibrant scenarios by utilizing the Diffusion Transformer architecture to transform rich textual prompts. It produces immersive visual experiences.<\/p>\n\n\n\n<p>Thus, here we go again. This is another generative AI model built upon the diffusion architecture. SORA AI was also built upon a diffusion model and Stable Video 3D, was also built upon Stable Video Diffusion, which is also a diffusion architecture model.<\/p>\n\n\n\n<p><strong>Using a single full-body shot, KLING&#8217;s superior 3D face and body reconstruction technology can achieve full expression and limb movement drive thanks to its patented 3D VAE and variable resolution training support for different aspect ratios.<\/strong><\/p>\n\n\n\n<p>Thus, this is also a model that can adjust to different aspect ratios and movie\/image qualities. This change in regulation allows for the generation of videos in different styles and environments, with grand scenes and images.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\"><strong>Kling&#8217;s Mind-Blowing Video Outputs<\/strong><\/h2>\n\n\n\n<p>Here are some of the impressive features of Kling\u2019s Video Generating AI Model. Let\u2019s look into them:<\/p>\n\n\n\n<h3 class=\"wp-block-heading\"><strong>1) Lifelike Expressions<\/strong><\/h3>\n\n\n\n<p><strong>Kling\u2019s technology allows for accurate mimicking of lifelike expressions and body movements. This makes the objects look more realistic as if they were imported from real time.<\/strong><\/p>\n\n\n\n<p>This is all thanks to the 3D VAE and variable resolution methodology, which makes this model add life to almost any type of object in any environment.<\/p>\n\n\n\n<p>Look at this video obtained from Kling, where you can see a boy enjoying his hamburger and closing his eyes to enjoy the taste. This moment is surreal and what\u2019s even more impressive is that Kling perfectly captures the facial movements and perfectly illustrates the emotions.<\/p>\n\n\n\n<div align=center><blockquote class=\"twitter-tweet\" data-media-max-width=\"560\"><p lang=\"en\" dir=\"ltr\">7. <br><br>A Chinese boy wearing glasses enjoys a delicious cheeseburger with his eyes closed in a fast food restaurant <a href=\"https:\/\/t.co\/2x8SirLpFY\" target=\"_blank\">pic.twitter.com\/2x8SirLpFY<\/a><\/p>&mdash; Rowan Cheung (@rowancheung) <a href=\"https:\/\/twitter.com\/rowancheung\/status\/1798740692568826346?ref_src=twsrc%5Etfw\" target=\"_blank\" rel=\"noopener\">June 6, 2024<\/a><\/blockquote> <script async src=\"https:\/\/platform.twitter.com\/widgets.js\" charset=\"utf-8\"><\/script><\/div>\n\n\n\n<h3 class=\"wp-block-heading\"><strong>2)<\/strong> <strong>Bodily Movements<\/strong><\/h3>\n\n\n\n<p>Full-drive technology for facial expressions and limbs is realized using self-developed 3D face and body reconstruction technology along with backdrop stability and redirection modules.<\/p>\n\n\n\n<p>All it takes for Kling AI to enjoy the lively &#8220;singing and dancing&#8221; gameplay is a full-body shot. Kling attaches template actions to your input images and gives life to the image object for a particular scene.<\/p>\n\n\n\n<p>Take a look at this video created by Kling where you can see a Panda playing the guitar in a highly peaceful composure. It almost looks human-like when it plays the guitar. Are you able to distinguish? I can\u2019t.<\/p>\n\n\n\n<div align=center><blockquote class=\"twitter-tweet\" data-media-max-width=\"560\"><p lang=\"en\" dir=\"ltr\">7. Panda playing the guitar<a href=\"https:\/\/t.co\/6KwWrUdpwI\" target=\"_blank\">pic.twitter.com\/6KwWrUdpwI<\/a><\/p>&mdash; Angry Tom (@AngryTomtweets) <a href=\"https:\/\/twitter.com\/AngryTomtweets\/status\/1798777806480167324?ref_src=twsrc%5Etfw\" target=\"_blank\" rel=\"noopener\">June 6, 2024<\/a><\/blockquote> <script async src=\"https:\/\/platform.twitter.com\/widgets.js\" charset=\"utf-8\"><\/script><\/div>\n\n\n\n<p>Thus, you can upload the full body image of your favourite object, back it up with a prompt, and then you can see the object singing and dancing in a highly fashionable manner!<\/p>\n\n\n\n<h3 class=\"wp-block-heading\"><strong>3) Strong concept combination ability<\/strong><\/h3>\n\n\n\n<p>Based on a deep understanding of text-video semantics and the powerful capabilities of the Diffusion Transformer architecture, KeLing is able to transform users&#8217; rich imaginations into attractive videos.<\/p>\n\n\n\n<p>You can think of unrealistic situations and have Kling carve them out for you, out of nowhere!<\/p>\n\n\n\n<p>See this video below generated by Kling\u2019s AI model, a very unrealistic situation of a cat driving down the streets of a busy city. This video looks so real!<\/p>\n\n\n\n<div align=center><blockquote class=\"twitter-tweet\" data-media-max-width=\"560\"><p lang=\"en\" dir=\"ltr\">5. <br><br>A white cat driving in a car through a busy downtown street with tall buildings and pedestrians in the background <a href=\"https:\/\/t.co\/HvRgJ2PYWK\" target=\"_blank\">pic.twitter.com\/HvRgJ2PYWK<\/a><\/p>&mdash; Rowan Cheung (@rowancheung) <a href=\"https:\/\/twitter.com\/rowancheung\/status\/1798740465266872790?ref_src=twsrc%5Etfw\" target=\"_blank\" rel=\"noopener\">June 6, 2024<\/a><\/blockquote> <script async src=\"https:\/\/platform.twitter.com\/widgets.js\" charset=\"utf-8\"><\/script><\/div>\n\n\n\n<h3 class=\"wp-block-heading\"><strong>4)<\/strong> <strong>Large-scale reasonable exercise<\/strong><\/h3>\n\n\n\n<p>KeLing adopts a 3D spatiotemporal joint attention mechanism, which can better model complex spatiotemporal motion and generate video content with larger movements while conforming to the natural mechanisms of the real physical world.<\/p>\n\n\n\n<p>You can ask for whatever mechanisms you want, Kling will provide that with ease, while making it look real and natural. The body movements can be of any range and size, depending on your needs.<\/p>\n\n\n\n<p>Here\u2019s a video of a man dusking into the sunset while riding on his horse. This video was generated by Kling, and the mechanics look too good. It\u2019s almost as if it\u2019s from a movie, which was shot live.<\/p>\n\n\n\n<div align=center><blockquote class=\"twitter-tweet\" data-media-max-width=\"560\"><p lang=\"en\" dir=\"ltr\">5. A man riding a horse through the Gobi Desert with a beautiful sunset behind him, movie quality.<a href=\"https:\/\/t.co\/PAerK5ShCT\" target=\"_blank\">pic.twitter.com\/PAerK5ShCT<\/a><\/p>&mdash; Angry Tom (@AngryTomtweets) <a href=\"https:\/\/twitter.com\/AngryTomtweets\/status\/1798777798666133651?ref_src=twsrc%5Etfw\" target=\"_blank\" rel=\"noopener\">June 6, 2024<\/a><\/blockquote> <script async src=\"https:\/\/platform.twitter.com\/widgets.js\" charset=\"utf-8\"><\/script><\/div>\n\n\n\n<h3 class=\"wp-block-heading\"><strong>5) 2 Minute Videos<\/strong><\/h3>\n\n\n\n<p>Lastly here comes perhaps the most impressive feature. You can generate videos of up to 2 minutes. 2 minutes is too long for a generative AI model. Even SORA-created videos of only a minute long.<\/p>\n\n\n\n<p>Take a look at this video of a boy cycling generated by Kling. We have to give Kling credit for producing different scenes and environments for the whole duration of the boy cycling. It almost feels like the AI model won\u2019t run out of ideas as to how it can extend the video scenes.<\/p>\n\n\n\n<div align=center><blockquote class=\"twitter-tweet\" data-media-max-width=\"560\"><p lang=\"en\" dir=\"ltr\">6. Little boy riding his bike in the garden through the changing seasons of fall, winter, spring and summer.<a href=\"https:\/\/t.co\/LY8Wfvs3Po\" target=\"_blank\">pic.twitter.com\/LY8Wfvs3Po<\/a><\/p>&mdash; Angry Tom (@AngryTomtweets) <a href=\"https:\/\/twitter.com\/AngryTomtweets\/status\/1798777802684330218?ref_src=twsrc%5Etfw\" target=\"_blank\" rel=\"noopener\">June 6, 2024<\/a><\/blockquote> <script async src=\"https:\/\/platform.twitter.com\/widgets.js\" charset=\"utf-8\"><\/script><\/div>\n\n\n\n<h2 class=\"wp-block-heading\"><strong>Conclusion<\/strong><\/h2>\n\n\n\n<p>Kling\u2019s 3D VAE and aspect ratio capabilities make it highly demanding. Although SORA may be launched by the end of this year, developers are starting to feel OpenAI is falling behind following the release of Vidu and now Kling. Kling will take the video generation industry to the next level!<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Let&#8217;s discuss how Kling AI&#8217;s generative videos are mind-blowing and compete with OpenAI&#8217;s SORA on many levels.<\/p>\n","protected":false},"author":15,"featured_media":5485,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"jnews-multi-image_gallery":[],"jnews_single_post":null,"jnews_primary_category":{"id":"","hide":""},"footnotes":""},"categories":[57],"tags":[56,291,60,62,215,198],"class_list":["post-5481","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-ai","tag-ai","tag-kling","tag-openai","tag-sora","tag-text-to-video","tag-video-ai"],"_links":{"self":[{"href":"https:\/\/favtutor.com\/articles\/wp-json\/wp\/v2\/posts\/5481","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/favtutor.com\/articles\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/favtutor.com\/articles\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/favtutor.com\/articles\/wp-json\/wp\/v2\/users\/15"}],"replies":[{"embeddable":true,"href":"https:\/\/favtutor.com\/articles\/wp-json\/wp\/v2\/comments?post=5481"}],"version-history":[{"count":4,"href":"https:\/\/favtutor.com\/articles\/wp-json\/wp\/v2\/posts\/5481\/revisions"}],"predecessor-version":[{"id":5487,"href":"https:\/\/favtutor.com\/articles\/wp-json\/wp\/v2\/posts\/5481\/revisions\/5487"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/favtutor.com\/articles\/wp-json\/wp\/v2\/media\/5485"}],"wp:attachment":[{"href":"https:\/\/favtutor.com\/articles\/wp-json\/wp\/v2\/media?parent=5481"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/favtutor.com\/articles\/wp-json\/wp\/v2\/categories?post=5481"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/favtutor.com\/articles\/wp-json\/wp\/v2\/tags?post=5481"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}