{"id":4999,"date":"2024-05-20T09:29:36","date_gmt":"2024-05-20T09:29:36","guid":{"rendered":"https:\/\/favtutor.com\/articles\/?p=4999"},"modified":"2024-05-20T09:33:07","modified_gmt":"2024-05-20T09:33:07","slug":"gpt-4o-vs-claude-3-opus","status":"publish","type":"post","link":"https:\/\/favtutor.com\/articles\/gpt-4o-vs-claude-3-opus\/","title":{"rendered":"Testing GPT-4o vs Claude 3 Opus: Who&#8217;s The Real Winner?"},"content":{"rendered":"\n<p>OpenAI recently <a href=\"https:\/\/favtutor.com\/articles\/openai-releases-gpt-4o\/\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">released GPT-4o<\/a>, their new flagship model which can reason across audio, vision, and text in real-time. Various results showcasing the remarkable capabilities of GPT-4o have circulated widely across the internet. Today, let\u2019s compare its capabilities versus Anthropic\u2019s best model, Claude 3 Opus. Claude 3 has also shared a <a href=\"https:\/\/claude101.com\/claude-3-vs-gpt-4o\/\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">guide<\/a> on this comparison.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\"><strong>1) GPT-4o vs Claude 3 for Apple Test<\/strong><\/h3>\n\n\n\n<p>In the Apple test, an LLM is asked to generate 10 sentences that end with the word \u2018apple.\u2019 LLMs often struggle with this task and cannot achieve 100% accuracy. <\/p>\n\n\n\n<p><strong>Prompt:<\/strong> Generate 10 sentences that end with the word apple.<\/p>\n\n\n\n<p><strong>GPT-4o:<\/strong><\/p>\n\n\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-full\"><img decoding=\"async\" width=\"746\" height=\"523\" src=\"https:\/\/favtutor.com\/articles\/wp-content\/uploads\/2024\/05\/11-1.png\" alt=\"GPT-4o on Apple Test\" class=\"wp-image-5001\"\/><\/figure>\n<\/div>\n\n\n<p><strong>Claude 3:<\/strong><\/p>\n\n\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-full\"><img decoding=\"async\" width=\"746\" height=\"484\" src=\"https:\/\/favtutor.com\/articles\/wp-content\/uploads\/2024\/05\/12-1.png\" alt=\"Claude 3 on Apple Test\" class=\"wp-image-5002\"\/><\/figure>\n<\/div>\n\n\n<p>Both failed to pass the apple test as they could generate only 9 sentences that ended with the word \u2018apple.\u2019<\/p>\n\n\n\n<h3 class=\"wp-block-heading\"><strong>2) Logical Riddles<\/strong><\/h3>\n\n\n\n<p>We asked Claude and GPT-4o two logical riddles<\/p>\n\n\n\n<p><strong>Prompt:<\/strong> Six brothers were spending their time together. The first brother was reading a book. The second brother was playing chess. The third brother was solving a crossword. The fourth brother was watering the lawn. The fifth brother was drawing a picture. Question: what was the sixth brother doing?<\/p>\n\n\n\n<p>This riddle is a little confusing to interpret. The question says that the six brothers were spending their time together. <\/p>\n\n\n\n<p><strong>GPT-4:<\/strong><\/p>\n\n\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-full\"><img decoding=\"async\" width=\"811\" height=\"454\" src=\"https:\/\/favtutor.com\/articles\/wp-content\/uploads\/2024\/05\/21.png\" alt=\"\" class=\"wp-image-5003\" srcset=\"https:\/\/favtutor.com\/articles\/wp-content\/uploads\/2024\/05\/21.png 811w, https:\/\/favtutor.com\/articles\/wp-content\/uploads\/2024\/05\/21-768x430.png 768w, https:\/\/favtutor.com\/articles\/wp-content\/uploads\/2024\/05\/21-750x420.png 750w\" sizes=\"(max-width: 811px) 100vw, 811px\" \/><\/figure>\n<\/div>\n\n\n<p><strong>Claude 3:<\/strong><\/p>\n\n\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-full\"><img decoding=\"async\" width=\"811\" height=\"306\" src=\"https:\/\/favtutor.com\/articles\/wp-content\/uploads\/2024\/05\/22.png\" alt=\"\" class=\"wp-image-5004\" srcset=\"https:\/\/favtutor.com\/articles\/wp-content\/uploads\/2024\/05\/22.png 811w, https:\/\/favtutor.com\/articles\/wp-content\/uploads\/2024\/05\/22-768x290.png 768w, https:\/\/favtutor.com\/articles\/wp-content\/uploads\/2024\/05\/22-750x283.png 750w\" sizes=\"(max-width: 811px) 100vw, 811px\" \/><\/figure>\n<\/div>\n\n\n<p>GPT-4o recognizes this and says that the sixth brother is playing chess versus the second brother as chess is a game that requires two players. However, Claude says that there is insufficient information and it cannot provide an answer based on the details provided. <\/p>\n\n\n\n<h3 class=\"wp-block-heading\"><strong>3) Summarization<\/strong><\/h3>\n\n\n\n<p>We asked GPT and Claude to summarize a research paper about a facial recognition system.<\/p>\n\n\n\n<p><strong>Prompt:<\/strong> review this paper in 200 words. include all the important content and give the summary in a bulleted format.<\/p>\n\n\n\n<p><strong>GPT-4o:<\/strong><\/p>\n\n\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-full\"><img decoding=\"async\" width=\"674\" height=\"1564\" src=\"https:\/\/favtutor.com\/articles\/wp-content\/uploads\/2024\/05\/31.png\" alt=\"GPT-4o test on Summarization\" class=\"wp-image-5006\" srcset=\"https:\/\/favtutor.com\/articles\/wp-content\/uploads\/2024\/05\/31.png 674w, https:\/\/favtutor.com\/articles\/wp-content\/uploads\/2024\/05\/31-662x1536.png 662w\" sizes=\"(max-width: 674px) 100vw, 674px\" \/><\/figure>\n<\/div>\n\n\n<p><strong>Claude 3:<\/strong><\/p>\n\n\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-full\"><img decoding=\"async\" width=\"674\" height=\"620\" src=\"https:\/\/favtutor.com\/articles\/wp-content\/uploads\/2024\/05\/32.png\" alt=\"Claude 3 test on Summarization\" class=\"wp-image-5007\"\/><\/figure>\n<\/div>\n\n\n<p>As we can see from the outputs generated, GPT-4o provides an accurate description of every section and includes all the important content, unlike Claude which provides just a short overview of the paper. For summarization tasks, GPT-4o beats Claude Opus.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\"><strong>4) Description of an image<\/strong><\/h3>\n\n\n\n<p>We provided the models with an image of the Marina Bay Street Circuit in Singapore and asked them to describe the image<\/p>\n\n\n\n<p><strong>Prompt:<\/strong> describe this image:<\/p>\n\n\n\n<p><strong>GPT-4o:<\/strong><\/p>\n\n\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-full\"><img decoding=\"async\" width=\"857\" height=\"548\" src=\"https:\/\/favtutor.com\/articles\/wp-content\/uploads\/2024\/05\/41.png\" alt=\"\" class=\"wp-image-5009\" srcset=\"https:\/\/favtutor.com\/articles\/wp-content\/uploads\/2024\/05\/41.png 857w, https:\/\/favtutor.com\/articles\/wp-content\/uploads\/2024\/05\/41-768x491.png 768w, https:\/\/favtutor.com\/articles\/wp-content\/uploads\/2024\/05\/41-750x480.png 750w\" sizes=\"(max-width: 857px) 100vw, 857px\" \/><\/figure>\n<\/div>\n\n\n<p><strong>Claude 3:<\/strong><\/p>\n\n\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-full\"><img decoding=\"async\" width=\"857\" height=\"524\" src=\"https:\/\/favtutor.com\/articles\/wp-content\/uploads\/2024\/05\/42.png\" alt=\"\" class=\"wp-image-5010\" srcset=\"https:\/\/favtutor.com\/articles\/wp-content\/uploads\/2024\/05\/42.png 857w, https:\/\/favtutor.com\/articles\/wp-content\/uploads\/2024\/05\/42-768x470.png 768w, https:\/\/favtutor.com\/articles\/wp-content\/uploads\/2024\/05\/42-750x459.png 750w\" sizes=\"(max-width: 857px) 100vw, 857px\" \/><\/figure>\n<\/div>\n\n\n<p>For image description tasks, both models performed equally. Both were able to recognize the image and then provide adequate information related to the image. So, both models perform on an equal level.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\"><strong>5) Game-Related Prompts<\/strong><\/h3>\n\n\n\n<p>We asked GPT and Claude to code the snake game in Python.<\/p>\n\n\n\n<p><strong>Prompt:<\/strong> Code the snake game in Python<\/p>\n\n\n\n<p><strong>GPT-4o:<\/strong><\/p>\n\n\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-full\"><img decoding=\"async\" width=\"992\" height=\"781\" src=\"https:\/\/favtutor.com\/articles\/wp-content\/uploads\/2024\/05\/51.png\" alt=\"Snake Game by ChatGPT\" class=\"wp-image-5011\" srcset=\"https:\/\/favtutor.com\/articles\/wp-content\/uploads\/2024\/05\/51.png 992w, https:\/\/favtutor.com\/articles\/wp-content\/uploads\/2024\/05\/51-768x605.png 768w, https:\/\/favtutor.com\/articles\/wp-content\/uploads\/2024\/05\/51-750x590.png 750w\" sizes=\"(max-width: 992px) 100vw, 992px\" \/><\/figure>\n<\/div>\n\n\n<p><strong>Claude 3:<\/strong><\/p>\n\n\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-full\"><img decoding=\"async\" width=\"994\" height=\"782\" src=\"https:\/\/favtutor.com\/articles\/wp-content\/uploads\/2024\/05\/52.png\" alt=\"Claude 3 making Snake Game\" class=\"wp-image-5012\" srcset=\"https:\/\/favtutor.com\/articles\/wp-content\/uploads\/2024\/05\/52.png 994w, https:\/\/favtutor.com\/articles\/wp-content\/uploads\/2024\/05\/52-768x604.png 768w, https:\/\/favtutor.com\/articles\/wp-content\/uploads\/2024\/05\/52-750x590.png 750w\" sizes=\"(max-width: 994px) 100vw, 994px\" \/><\/figure>\n<\/div>\n\n\n<p>As we can see, the user interface of the snake game created by Claude is much more appealing than the one created by GPT. Also, Claude\u2019s game has a score counter which GPT\u2019s game does not have.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\"><strong>6) Text Generation<\/strong><\/h3>\n\n\n\n<p>Prompt: Give me a 200 word essay about the advantages of Python over C++.<\/p>\n\n\n\n<p><strong>GPT-4o:<\/strong><\/p>\n\n\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-full\"><img decoding=\"async\" width=\"792\" height=\"1226\" src=\"https:\/\/favtutor.com\/articles\/wp-content\/uploads\/2024\/05\/61.png\" alt=\"GPT-4o on Text Generation\" class=\"wp-image-5013\" srcset=\"https:\/\/favtutor.com\/articles\/wp-content\/uploads\/2024\/05\/61.png 792w, https:\/\/favtutor.com\/articles\/wp-content\/uploads\/2024\/05\/61-768x1189.png 768w, https:\/\/favtutor.com\/articles\/wp-content\/uploads\/2024\/05\/61-750x1161.png 750w\" sizes=\"(max-width: 792px) 100vw, 792px\" \/><\/figure>\n<\/div>\n\n\n<p><strong>Claude 3:<\/strong><\/p>\n\n\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-full\"><img decoding=\"async\" width=\"792\" height=\"731\" src=\"https:\/\/favtutor.com\/articles\/wp-content\/uploads\/2024\/05\/62.png\" alt=\"Claude 3 on Text Summarization\" class=\"wp-image-5014\" srcset=\"https:\/\/favtutor.com\/articles\/wp-content\/uploads\/2024\/05\/62.png 792w, https:\/\/favtutor.com\/articles\/wp-content\/uploads\/2024\/05\/62-768x709.png 768w, https:\/\/favtutor.com\/articles\/wp-content\/uploads\/2024\/05\/62-750x692.png 750w\" sizes=\"(max-width: 792px) 100vw, 792px\" \/><\/figure>\n<\/div>\n\n\n<p>In this case, the new ChatGPT provided a more structured and formatted answer as compared to Claude. It took up 5 key points and explained how Python is better than C++ for each point. It also made the answer simpler and more pleasing for the user to read as everything was structured. For text generation tasks, GPT-4o beats Claude.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\"><strong>7) General Knowledge<\/strong><\/h3>\n\n\n\n<p><strong>Prompt: <\/strong>Is Taiwan an independent country?<\/p>\n\n\n\n<p><strong>GPT-4o:<\/strong><\/p>\n\n\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-full\"><img decoding=\"async\" width=\"755\" height=\"776\" src=\"https:\/\/favtutor.com\/articles\/wp-content\/uploads\/2024\/05\/71.png\" alt=\"\" class=\"wp-image-5015\" srcset=\"https:\/\/favtutor.com\/articles\/wp-content\/uploads\/2024\/05\/71.png 755w, https:\/\/favtutor.com\/articles\/wp-content\/uploads\/2024\/05\/71-750x771.png 750w\" sizes=\"(max-width: 755px) 100vw, 755px\" \/><\/figure>\n<\/div>\n\n\n<p><strong>Claude 3:<\/strong><\/p>\n\n\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-full\"><img decoding=\"async\" width=\"755\" height=\"510\" src=\"https:\/\/favtutor.com\/articles\/wp-content\/uploads\/2024\/05\/72.png\" alt=\"\" class=\"wp-image-5016\" srcset=\"https:\/\/favtutor.com\/articles\/wp-content\/uploads\/2024\/05\/72.png 755w, https:\/\/favtutor.com\/articles\/wp-content\/uploads\/2024\/05\/72-750x507.png 750w\" sizes=\"(max-width: 755px) 100vw, 755px\" \/><\/figure>\n<\/div>\n\n\n<p><strong>Prompt:<\/strong> Explain the concept of quantum entanglement in a way that a 10-year-old could understand, using analogies and examples.<\/p>\n\n\n\n<p><strong>GPT-4o:<\/strong><\/p>\n\n\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-full\"><img decoding=\"async\" width=\"780\" height=\"1408\" src=\"https:\/\/favtutor.com\/articles\/wp-content\/uploads\/2024\/05\/81-1.png\" alt=\"GPT-4o explaining concept of quantum entanglement in a way that a 10-year-old could understand\" class=\"wp-image-5017\" srcset=\"https:\/\/favtutor.com\/articles\/wp-content\/uploads\/2024\/05\/81-1.png 780w, https:\/\/favtutor.com\/articles\/wp-content\/uploads\/2024\/05\/81-1-768x1386.png 768w, https:\/\/favtutor.com\/articles\/wp-content\/uploads\/2024\/05\/81-1-750x1354.png 750w\" sizes=\"(max-width: 780px) 100vw, 780px\" \/><\/figure>\n<\/div>\n\n\n<p><strong>Claude 3:<\/strong><\/p>\n\n\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-full\"><img decoding=\"async\" width=\"780\" height=\"601\" src=\"https:\/\/favtutor.com\/articles\/wp-content\/uploads\/2024\/05\/82.png\" alt=\"Claude 3 explaining concept of quantum entanglement in a way that a 10-year-old could understand\" class=\"wp-image-5018\" srcset=\"https:\/\/favtutor.com\/articles\/wp-content\/uploads\/2024\/05\/82.png 780w, https:\/\/favtutor.com\/articles\/wp-content\/uploads\/2024\/05\/82-768x592.png 768w, https:\/\/favtutor.com\/articles\/wp-content\/uploads\/2024\/05\/82-750x578.png 750w\" sizes=\"(max-width: 780px) 100vw, 780px\" \/><\/figure>\n<\/div>\n\n\n<p>When it comes to general knowledge, we asked the models two questions. The first one was about the controversial matter surrounding Taiwan\u2019s independent status. GPT-4o did a better job and provided many points related to the matter. It also included many different standpoints that contribute to this controversial matter.<\/p>\n\n\n\n<p>The second question was about the concept of quantum entanglement. We asked the models to explain it in a way that a 10-year-old could comprehend. Once again, GPT-4o beat Claude. <\/p>\n\n\n\n<p>The example it took was way simpler for a 10-year-old to understand. It then explained how it relates to quantum and then discussed the key points and why quantum entanglement is special. Claude also provides a good answer but GPT\u2019s answer is much more interesting for a child and covers the concept very well.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\"><strong>Conclusion<\/strong><\/h3>\n\n\n\n<p>Based on the extensive comparison provided, it is evident that GPT-4o outperforms Claude 3 Opus in a wide range of tasks, including summarization, text generation, and general knowledge. While both models demonstrate impressive capabilities, the latest GPT model emerges as the superior model, excelling in most of the evaluated areas.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>We compared GPT-4o and Claude 3 Opus in a wide range of tasks, including summarization, text generation, and general knowledge.<\/p>\n","protected":false},"author":18,"featured_media":5020,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"jnews-multi-image_gallery":[],"jnews_single_post":null,"jnews_primary_category":{"id":"","hide":""},"footnotes":""},"categories":[57],"tags":[56,147,61,157,269,91,258,72,60],"class_list":["post-4999","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-ai","tag-ai","tag-anthropic","tag-chatgpt","tag-claude-3","tag-claude-3-opus","tag-gpt-4-2","tag-gpt-4o","tag-llm","tag-openai"],"_links":{"self":[{"href":"https:\/\/favtutor.com\/articles\/wp-json\/wp\/v2\/posts\/4999","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/favtutor.com\/articles\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/favtutor.com\/articles\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/favtutor.com\/articles\/wp-json\/wp\/v2\/users\/18"}],"replies":[{"embeddable":true,"href":"https:\/\/favtutor.com\/articles\/wp-json\/wp\/v2\/comments?post=4999"}],"version-history":[{"count":5,"href":"https:\/\/favtutor.com\/articles\/wp-json\/wp\/v2\/posts\/4999\/revisions"}],"predecessor-version":[{"id":5021,"href":"https:\/\/favtutor.com\/articles\/wp-json\/wp\/v2\/posts\/4999\/revisions\/5021"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/favtutor.com\/articles\/wp-json\/wp\/v2\/media\/5020"}],"wp:attachment":[{"href":"https:\/\/favtutor.com\/articles\/wp-json\/wp\/v2\/media?parent=4999"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/favtutor.com\/articles\/wp-json\/wp\/v2\/categories?post=4999"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/favtutor.com\/articles\/wp-json\/wp\/v2\/tags?post=4999"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}