TechDogs-"Google’s Gemini 1.5 Pro Calls Out A Video Generated By OpenAI’s Sora As Fake!"

Emerging Technology

Google’s Gemini 1.5 Pro Calls Out A Video Generated By OpenAI’s Sora As Fake!

By TD NewsDesk

TD NewsDesk

Updated on Mon, Feb 19, 2024

Overall Rating
The generative artificial intelligence (GenAI) race is heating up as companies battle it out to come up with the latest products and services, while also looking at the future of the industry.

It wasn’t long ago that Google rebranded its Bard chatbot as Gemini. In fact, it was just over 10 days ago, on February 8, 2024, that Google announced the company’s most capable model, Gemini 1.0 Ultra.

A week later, Google announced its next-generation model, Gemini 1.5, with a blog post co-authored by Sundar Pichai, the CEO of Google and Alphabet. Within the blog, Google even depicted results of how its model performed against competitors, which included the likes of GPT-4 Turbo, Claude 2.1 and even its own Gemini 1.0 Pro.

On the same day, OpenAI, one of Google’s competitors in the GenAI space, unveiled its latest product!

Sora, OpenAI’s text-to-video AI model was launched with great aplomb, announced through a post on X (formerly Twitter) that included a link to the product’s website, which featured almost 40 videos interspersed with the model’s information.

The immediate release of such a powerful platform made some question the timing of the announcement, leading to speculation that the move was done to take attention away from Google’s Gemini 1.5.

However, Google took the challenge in stride and came up with a reply to OpenAI’s move!

So, what was Google’s countermove? Let’s explore!
 

What Did Google Say?

 
  • After comparing Gemini 1.5 to competing brands in the AI space, Google’s Gemini 1.5 has now critiqued a video generated by OpenAI’s Sora. 
  • The results were posted via former Director at Google and Product Manager at Twitter, Gabor Cselle’s X account.
  • Essentially, the results raised questions about the authenticity of the output, highlighted several inconsistencies and tagged it as a “fake” video.
  • The prompt used by OpenAI to generate the video read, “Beautiful, snowy Tokyo city is bustling. The camera moves through the bustling city street, following several people enjoying the beautiful snowy weather and shopping at nearby stalls. Gorgeous sakura petals are flying through the wind along with snowflakes.”
  • However, Google’s Gemini 1.5 found inconsistencies with the timing of the snowfall and cherry blossoms, the unnatural form of snowfall and the attire worn by people in the video.
  • Ahead of this Gabor Cselle also posted a list of “The remaining problems in Gen AI video post-SORA”, which included 5 points, along with a string of comments that highlighted the industry’s current situation.

TechDogs-"A Screenshot Of OpenAI's Video Posted On X"  

What Did Gemini 1.5 Pro Say About The Video?

 
  • Google used the video posted on X by OpenAI announcing Sora in its prompt to analyze the video. The prompt put in read “Critique this AI generated video. Are there any inconsistencies or things that don't make sense?”
  • To which, the Gemini 1.5 responded: The video is of a snowy street in Japan, with cherry blossoms in bloom. There are a few inconsistencies that suggest the video may be AI-generated.
  • First, the snow is falling very heavily, but the cherry blossoms are still in bloom. Cherry blossoms typically bloom in the spring, when the weather is warm and there is no snow.
  • Second, the snow is falling in a very uniform way, which is not how snow typically falls in real life.
  • Third, the people in the video are not wearing any winter clothing, even though it is snowing heavily.
  • Overall, the video is visually appealing, but the inconsistencies suggest that it is not a real-life scene.

TechDogs-"A Screenshot Of The Video Showing Google's Gemini 1.5 Critiquing Sora's Video"
Upon release, OpenAI did mention the model’s weaknesses on its website and conveyed it could have problems simulating complex scenes and may not understand specific instances of cause and effect, for example, “a person might take a bite out of a cookie, but afterward, the cookie may not have a bite mark.”

This was also accompanied by OpenAI mentioning the model was in its testing phase as the company is looking for feedback.

Do you think OpenAI prematurely released its text-to-video model to compete with others? Was Google justified in highlighting the drawbacks in OpenAI’s model?

Let us know in the comments below!

First published on Mon, Feb 19, 2024

Liked what you read? That’s only the tip of the tech iceberg!

Explore our vast collection of tech articles including introductory guides, product reviews, trends and more, stay up to date with the latest news, relish thought-provoking interviews and the hottest AI blogs, and tickle your funny bone with hilarious tech memes!

Plus, get access to branded insights from industry-leading global brands through informative white papers, engaging case studies, in-depth reports, enlightening videos and exciting events and webinars.

Dive into TechDogs' treasure trove today and Know Your World of technology like never before!

Disclaimer - Reference to any specific product, software or entity does not constitute an endorsement or recommendation by TechDogs nor should any data or content published be relied upon. The views expressed by TechDogs’ members and guests are their own and their appearance on our site does not imply an endorsement of them or any entity they represent. Views and opinions expressed by TechDogs’ Authors are those of the Authors and do not necessarily reflect the view of TechDogs or any of its officials. All information / content found on TechDogs’ site may not necessarily be reviewed by individuals with the expertise to validate its completeness, accuracy and reliability.

Tags:

Artificial Intelligence (AI)Emerging Technology Generative Artificial Intelligence GenAI Google Gemini OpenAI Sora

Join The Discussion

  • Dark
  • Light