{"id":7111,"date":"2025-02-28T12:50:29","date_gmt":"2025-02-28T12:50:29","guid":{"rendered":"https:\/\/favtutor.com\/articles\/?p=7111"},"modified":"2025-02-28T12:50:31","modified_gmt":"2025-02-28T12:50:31","slug":"gpt-4-5-examples","status":"publish","type":"post","link":"https:\/\/favtutor.com\/articles\/gpt-4-5-examples\/","title":{"rendered":"People Pushed GPT-4.5 to Its Limits With These 10 Questions"},"content":{"rendered":"\n<p class=\"wp-block-paragraph\">ChatGPT just got a new model, GPT-4.5, and this time people are very eager to see what has improved. OpenAI claims that it is its biggest and <a href=\"https:\/\/openai.com\/index\/introducing-gpt-4-5\/\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">most knowledgeable model<\/a> yet. While this is not a reasoning model, the company says it is better at recognizing patterns and generating creative insights.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">But it&#8217;s best to see this for yourself! So, we curated some of the best questions people asked it and how it responded.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\"><strong>10 Crazy Things People Tested GPT-4.5 With<\/strong><\/h2>\n\n\n\n<p class=\"wp-block-paragraph\">The new model is currently available only to ChatGPT Pro subscribers as a Research Preview, and it will roll out to Plus and Team users in a few days.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Here are the wildest and most interesting questions people put to the latest LLM:<\/p>\n\n\n\n<h3 class=\"wp-block-heading\"><strong>1) More Natural Conversation<\/strong><\/h3>\n\n\n\n<p class=\"wp-block-paragraph\">Let&#8217;s start with an official example shared by OpenAI. GPT-4.5 can converse so naturally that it feels like you are talking to a real person. 
See the comparison below for a question where the user is upset after failing a test:<\/p>\n\n\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-full\"><img decoding=\"async\" width=\"1184\" height=\"587\" src=\"https:\/\/favtutor.com\/articles\/wp-content\/uploads\/2025\/02\/GPT-4.5-Example.png\" alt=\"GPT-4.5 Example\" class=\"wp-image-7114\" srcset=\"https:\/\/favtutor.com\/articles\/wp-content\/uploads\/2025\/02\/GPT-4.5-Example.png 1184w, https:\/\/favtutor.com\/articles\/wp-content\/uploads\/2025\/02\/GPT-4.5-Example-768x381.png 768w, https:\/\/favtutor.com\/articles\/wp-content\/uploads\/2025\/02\/GPT-4.5-Example-750x372.png 750w, https:\/\/favtutor.com\/articles\/wp-content\/uploads\/2025\/02\/GPT-4.5-Example-1140x565.png 1140w\" sizes=\"(max-width: 1184px) 100vw, 1184px\" \/><\/figure>\n<\/div>\n\n\n<p class=\"wp-block-paragraph\">GPT-4o tries to help by listing things the user can do to feel better, such as &#8220;Seek Support&#8221;. GPT-4.5, on the other hand, actually gives &#8220;support&#8221; to the user, and its language is more natural. This is because the model comes with a better EQ, or Emotional Quotient, which helps it understand the user&#8217;s intent and answer in a friendlier tone.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\"><strong>2) Better at &#8220;Faster&#8221; Answers<\/strong><\/h3>\n\n\n\n<p class=\"wp-block-paragraph\">ChatGPT can give you an answer to any question in a couple of seconds. But response time is not the only measure of speed. Sometimes, the answer itself takes the user longer to read and understand. 
But thanks to its improved training, GPT-4.5&#8217;s answers feel simpler.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Check the difference below between GPT-4 Turbo and 4.5 on the question &#8220;Why is the Ocean Salty?&#8221;:<\/p>\n\n\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-full\"><img decoding=\"async\" width=\"1847\" height=\"512\" src=\"https:\/\/favtutor.com\/articles\/wp-content\/uploads\/2025\/02\/GPT-4.5-Example-2.jpg\" alt=\"GPT-4.5 Example 2\" class=\"wp-image-7115\" srcset=\"https:\/\/favtutor.com\/articles\/wp-content\/uploads\/2025\/02\/GPT-4.5-Example-2.jpg 1847w, https:\/\/favtutor.com\/articles\/wp-content\/uploads\/2025\/02\/GPT-4.5-Example-2-768x213.jpg 768w, https:\/\/favtutor.com\/articles\/wp-content\/uploads\/2025\/02\/GPT-4.5-Example-2-1536x426.jpg 1536w, https:\/\/favtutor.com\/articles\/wp-content\/uploads\/2025\/02\/GPT-4.5-Example-2-750x208.jpg 750w, https:\/\/favtutor.com\/articles\/wp-content\/uploads\/2025\/02\/GPT-4.5-Example-2-1140x316.jpg 1140w\" sizes=\"(max-width: 1847px) 100vw, 1847px\" \/><\/figure>\n<\/div>\n\n\n<p class=\"wp-block-paragraph\">The two answers are the same except for the first line: the new model lists the reasons at the beginning, making the answer easier for users to comprehend. It&#8217;s clear and concise.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\"><strong>3) Failed the Strawberry Test<\/strong><\/h3>\n\n\n\n<p class=\"wp-block-paragraph\">One of the simplest tests that most AI models fail is the Strawberry test, and GPT-4.5 is no exception. 
A user asked it &#8220;how many r&#8217;s in strawberry?&#8221; and, like its predecessors, it answered only 2, when the correct count is 3.<\/p>\n\n\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-full\"><img decoding=\"async\" width=\"1536\" height=\"720\" src=\"https:\/\/favtutor.com\/articles\/wp-content\/uploads\/2025\/02\/GPT-4.5-Example-3.jpg\" alt=\"GPT-4.5 Example 3\" class=\"wp-image-7119\" srcset=\"https:\/\/favtutor.com\/articles\/wp-content\/uploads\/2025\/02\/GPT-4.5-Example-3.jpg 1536w, https:\/\/favtutor.com\/articles\/wp-content\/uploads\/2025\/02\/GPT-4.5-Example-3-768x360.jpg 768w, https:\/\/favtutor.com\/articles\/wp-content\/uploads\/2025\/02\/GPT-4.5-Example-3-750x352.jpg 750w, https:\/\/favtutor.com\/articles\/wp-content\/uploads\/2025\/02\/GPT-4.5-Example-3-1140x534.jpg 1140w\" sizes=\"(max-width: 1536px) 100vw, 1536px\" \/><figcaption class=\"wp-element-caption\">(Source: <a href=\"https:\/\/x.com\/NorthstarBrain\/status\/1895320959743336514\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">X\/NorthstarBrain<\/a>)<\/figcaption><\/figure>\n<\/div>\n\n\n<p class=\"wp-block-paragraph\">Even for the most knowledgeable model in OpenAI&#8217;s history, this problem is still an ant in the elephant&#8217;s ear.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\"><strong>4) Better at Writing<\/strong><\/h3>\n\n\n\n<p class=\"wp-block-paragraph\">Many people pointed out that the AI model is very good at writing because of its better understanding of human language.<\/p>\n\n\n\n<div align=\"center\"><blockquote class=\"twitter-tweet\"><p lang=\"en\" dir=\"ltr\">i&#39;ve been testing gpt 4.5 for the past few weeks.<br><br>it&#39;s the first model that can actually write. 
<br><br>this is literally the midjourney-moment for writing.<br><br>(comparison to gpt 4o below) <a href=\"https:\/\/t.co\/DSEfxpyVOl\" target=\"_blank\">pic.twitter.com\/DSEfxpyVOl<\/a><\/p>&mdash; ben (@benhylak) <a href=\"https:\/\/twitter.com\/benhylak\/status\/1895212181597397493?ref_src=twsrc%5Etfw\" target=\"_blank\" rel=\"noopener\">February 27, 2025<\/a><\/blockquote> <script async src=\"https:\/\/platform.twitter.com\/widgets.js\" charset=\"utf-8\"><\/script><\/div>\n\n\n\n<p class=\"wp-block-paragraph\">Whether this will be the main trigger for people to switch from 4o to 4.5 remains to be seen.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\"><strong>5) Create SVG Animation<\/strong><\/h3>\n\n\n\n<p class=\"wp-block-paragraph\">While there are many text-to-video generators like Sora, we can also get good animations from ordinary AI models using SVGs. And the latest ChatGPT model didn&#8217;t disappoint:<\/p>\n\n\n\n<div align=\"center\"><blockquote class=\"twitter-tweet\" data-media-max-width=\"560\"><p lang=\"en\" dir=\"ltr\">gpt 4.5 is my fav txt2video model <a href=\"https:\/\/t.co\/zzKhyPiDah\" target=\"_blank\">pic.twitter.com\/zzKhyPiDah<\/a><\/p>&mdash; Denis Shiryaev \ud83d\udc99\ud83d\udc9b (@literallydenis) <a href=\"https:\/\/twitter.com\/literallydenis\/status\/1895248582955176096?ref_src=twsrc%5Etfw\" target=\"_blank\" rel=\"noopener\">February 27, 2025<\/a><\/blockquote> <script async src=\"https:\/\/platform.twitter.com\/widgets.js\" charset=\"utf-8\"><\/script><\/div>\n\n\n\n<p class=\"wp-block-paragraph\">I think it did a pretty good job of showing how good it is at coding.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\"><strong>6) Ball inside Hexagon<\/strong><\/h3>\n\n\n\n<p class=\"wp-block-paragraph\">Another interesting test people run is asking the AI to write Python code for a ball bouncing inside a hexagon with realistic physics.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Here is the prompt: &#8220;<em>Python program that shows a ball bouncing 
inside a spinning hexagon. The ball should be affected by gravity and friction, and it must bounce off the rotating walls realistically<\/em>&#8221;.<\/p>\n\n\n\n<div align=\"center\"><blockquote class=\"twitter-tweet\" data-media-max-width=\"560\"><p lang=\"en\" dir=\"ltr\">\ud83d\udea8 GPT-4.5 is impressive! \ud83d\udea8<br><br>This is the most realistic result so far.<br><br>&quot;write a Python program that shows a ball bouncing inside a spinning hexagon. The ball should be affected by gravity and friction, and it must bounce off the rotating walls realistically&quot; <a href=\"https:\/\/t.co\/ndU9Av9fKg\" target=\"_blank\">pic.twitter.com\/ndU9Av9fKg<\/a><\/p>&mdash; Flavio Adamo (@flavioAd) <a href=\"https:\/\/twitter.com\/flavioAd\/status\/1895219151326847460?ref_src=twsrc%5Etfw\" target=\"_blank\" rel=\"noopener\">February 27, 2025<\/a><\/blockquote> <script async src=\"https:\/\/platform.twitter.com\/widgets.js\" charset=\"utf-8\"><\/script><\/div>\n\n\n\n<p class=\"wp-block-paragraph\">GPT-4.5 passed.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">However, one other user didn&#8217;t get the same result:<\/p>\n\n\n\n<div align=\"center\"><blockquote class=\"twitter-tweet\" data-media-max-width=\"560\"><p lang=\"en\" dir=\"ltr\">GPT 4.5 is truly groundbreaking with it&#39;s creativity. 
Never seen a model approach this test with such a unique, novel way of failing hard <a href=\"https:\/\/t.co\/11GEKq9CIY\" target=\"_blank\">pic.twitter.com\/11GEKq9CIY<\/a><\/p>&mdash; Theo &#8211; t3.gg (@theo) <a href=\"https:\/\/twitter.com\/theo\/status\/1895220930173116747?ref_src=twsrc%5Etfw\" target=\"_blank\" rel=\"noopener\">February 27, 2025<\/a><\/blockquote> <script async src=\"https:\/\/platform.twitter.com\/widgets.js\" charset=\"utf-8\"><\/script><\/div>\n\n\n\n<p class=\"wp-block-paragraph\">There is no single clear reason why this happened (model output can vary from one run to the next, even for the same prompt), but it should raise some eyebrows among people who think AI is a one-stop solution for coding.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\"><strong>7) Upgraded: Ball inside Hexagon<\/strong><\/h3>\n\n\n\n<p class=\"wp-block-paragraph\">As discussed above, two users got different results, but OpenAI itself joined the conversation by sharing a more elaborate Python program for the ball inside the hexagon, with some celebration:<\/p>\n\n\n\n<div align=\"center\"><blockquote class=\"twitter-tweet\" data-media-max-width=\"560\"><p lang=\"en\" dir=\"ltr\">GPT-4.5 with <a href=\"https:\/\/twitter.com\/flavioAd?ref_src=twsrc%5Etfw\" target=\"_blank\" rel=\"noopener\">@flavioAd<\/a>\u2019s prompt. Then we asked to make it more creative. 
<a href=\"https:\/\/t.co\/GJDXIspaGk\" target=\"_blank\">https:\/\/t.co\/GJDXIspaGk<\/a> <a href=\"https:\/\/t.co\/vdgcBdn3nk\" target=\"_blank\">pic.twitter.com\/vdgcBdn3nk<\/a><\/p>&mdash; OpenAI Developers (@OpenAIDevs) <a href=\"https:\/\/twitter.com\/OpenAIDevs\/status\/1895226704408481893?ref_src=twsrc%5Etfw\" target=\"_blank\" rel=\"noopener\">February 27, 2025<\/a><\/blockquote> <script async src=\"https:\/\/platform.twitter.com\/widgets.js\" charset=\"utf-8\"><\/script><\/div>\n\n\n\n<p class=\"wp-block-paragraph\">If you have ChatGPT Pro, you can try it yourself.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\"><strong>8) GPT-4.5 vs Claude 3.7 Sonnet<\/strong><\/h3>\n\n\n\n<p class=\"wp-block-paragraph\">Claude 3.7 Sonnet is another great LLM that we <a href=\"https:\/\/favtutor.com\/articles\/claude-3-7-sonnet-examples\/\">witnessed<\/a> just a few days ago. So, a user decided to pit it against OpenAI&#8217;s new flagship model. He built a Tic-Tac-Toe game in which AI agents compete against each other.<\/p>\n\n\n\n<div align=\"center\"><blockquote class=\"twitter-tweet\" data-media-max-width=\"560\"><p lang=\"en\" dir=\"ltr\">GPT-4.5 vs Claude 3.7 \u2013 forget the benchmarks, I made them battle in the ultimate test: Tic-Tac-Toe. <br><br>\ud83c\udfc6 GPT-4.5 crushed it, winning 3\/3 games.<br><br>Try it yourself &amp; see if your results match! (code below) \ud83e\udd16\ud83d\udd25 <a href=\"https:\/\/t.co\/KHwdrJUHw7\" target=\"_blank\">pic.twitter.com\/KHwdrJUHw7<\/a><\/p>&mdash; Ashpreet Bedi (@ashpreetbedi) <a href=\"https:\/\/twitter.com\/ashpreetbedi\/status\/1895226693973090731?ref_src=twsrc%5Etfw\" target=\"_blank\" rel=\"noopener\">February 27, 2025<\/a><\/blockquote> <script async src=\"https:\/\/platform.twitter.com\/widgets.js\" charset=\"utf-8\"><\/script><\/div>\n\n\n\n<p class=\"wp-block-paragraph\">GPT-4.5 won all 3 games of Tic-Tac-Toe against Claude. 
If we set the benchmarks aside, we have a winner here.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\"><strong>9) A Complex Puzzle<\/strong><\/h3>\n\n\n\n<p class=\"wp-block-paragraph\">A fun puzzle for both humans and AI is &#8220;<em>make a 10 word coherent sentence where the letter in the words count from 1 to 10 as the word count goes from 1 to 10<\/em>&#8221;. The new GPT model solved it:<\/p>\n\n\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-full\"><img decoding=\"async\" width=\"954\" height=\"512\" src=\"https:\/\/favtutor.com\/articles\/wp-content\/uploads\/2025\/02\/GPT-4.5-Example-9.png\" alt=\"GPT-4.5 Example 9\" class=\"wp-image-7120\" srcset=\"https:\/\/favtutor.com\/articles\/wp-content\/uploads\/2025\/02\/GPT-4.5-Example-9.png 954w, https:\/\/favtutor.com\/articles\/wp-content\/uploads\/2025\/02\/GPT-4.5-Example-9-768x412.png 768w, https:\/\/favtutor.com\/articles\/wp-content\/uploads\/2025\/02\/GPT-4.5-Example-9-750x403.png 750w\" sizes=\"(max-width: 954px) 100vw, 954px\" \/><figcaption class=\"wp-element-caption\">(Source: <a href=\"https:\/\/x.com\/NeilMcDevitt_\/status\/1895295749967225182\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">X\/NeilMcDevitt_<\/a>)<\/figcaption><\/figure>\n<\/div>\n\n\n<p class=\"wp-block-paragraph\">People use this question as a fun word challenge to test creativity and pattern recognition, both of which OpenAI has officially claimed as strengths of 4.5. The claim held up this time, with the answer: &#8220;<em>I am sad that birds cannot swiftly navigate wonderful landscapes<\/em>&#8221;, whose words run from 1 to 10 letters in order.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\"><strong>10) What It Thinks about Humans<\/strong><\/h3>\n\n\n\n<p class=\"wp-block-paragraph\">People love to test AI models by asking them questions about humanity. They often expect the models to say something sinister. 
However, GPT-4.5 did not seem so sure when a user asked it for &#8220;a truly novel human insight&#8221;:<\/p>\n\n\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-full\"><img decoding=\"async\" width=\"602\" height=\"720\" src=\"https:\/\/favtutor.com\/articles\/wp-content\/uploads\/2025\/02\/GPT-4.5-Example-10.jpg\" alt=\"GPT-4.5 Example 10\" class=\"wp-image-7118\"\/><figcaption class=\"wp-element-caption\">(Source: <a href=\"https:\/\/x.com\/adonis_singh\/status\/1895274910085455965\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">X\/adonis_singh<\/a>)<\/figcaption><\/figure>\n<\/div>\n\n\n<p class=\"wp-block-paragraph\">You can judge the answer yourself.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\"><strong>Takeaways<\/strong><\/h2>\n\n\n\n<p class=\"wp-block-paragraph\">Sam Altman also <a href=\"https:\/\/favtutor.com\/articles\/gpt-5-coming-soon-sam-altman\/\">teased GPT-5<\/a> as coming very soon when he announced GPT-4.5, so we will be waiting for that too. But this model already improves on hallucination rate, which dropped from about 62% for GPT-4o to about 37% for GPT-4.5 in OpenAI&#8217;s reported results. So, don&#8217;t underestimate 4.5!<\/p>\n","protected":false},"excerpt":{"rendered":"<p>ChatGPT just got a new model, GPT-4.5, and this time people are very eager to see what has improved. OpenAI claims that it is its biggest and most knowledgeable model yet. While this is not a reasoning model, the company says it is better at recognizing patterns and generating creative insights. 
But [&hellip;]<\/p>\n","protected":false},"author":8,"featured_media":7112,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"jnews-multi-image_gallery":[],"jnews_single_post":{"format":"standard"},"jnews_primary_category":[],"footnotes":""},"categories":[42],"tags":[56,61,363,60],"class_list":["post-7111","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-trending","tag-ai","tag-chatgpt","tag-gpt-4-5","tag-openai"],"_links":{"self":[{"href":"https:\/\/favtutor.com\/articles\/wp-json\/wp\/v2\/posts\/7111","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/favtutor.com\/articles\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/favtutor.com\/articles\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/favtutor.com\/articles\/wp-json\/wp\/v2\/users\/8"}],"replies":[{"embeddable":true,"href":"https:\/\/favtutor.com\/articles\/wp-json\/wp\/v2\/comments?post=7111"}],"version-history":[{"count":7,"href":"https:\/\/favtutor.com\/articles\/wp-json\/wp\/v2\/posts\/7111\/revisions"}],"predecessor-version":[{"id":7141,"href":"https:\/\/favtutor.com\/articles\/wp-json\/wp\/v2\/posts\/7111\/revisions\/7141"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/favtutor.com\/articles\/wp-json\/wp\/v2\/media\/7112"}],"wp:attachment":[{"href":"https:\/\/favtutor.com\/articles\/wp-json\/wp\/v2\/media?parent=7111"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/favtutor.com\/articles\/wp-json\/wp\/v2\/categories?post=7111"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/favtutor.com\/articles\/wp-json\/wp\/v2\/tags?post=7111"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}