By using this site, you agree to the Privacy Policy and Terms of Use.
Accept
World of SoftwareWorld of SoftwareWorld of Software
  • News
  • Software
  • Mobile
  • Computing
  • Gaming
  • Videos
  • More
    • Gadget
    • Web Stories
    • Trending
    • Press Release
Search
  • Privacy
  • Terms
  • Advertise
  • Contact
Copyright © All Rights Reserved. World of Software.
Reading: ChatGPT just got mind-blowing computer vision powers like in the movies
Share
Sign In
Notification Show More
Font ResizerAa
World of SoftwareWorld of Software
Font ResizerAa
  • Software
  • Mobile
  • Computing
  • Gadget
  • Gaming
  • Videos
Search
  • News
  • Software
  • Mobile
  • Computing
  • Gaming
  • Videos
  • More
    • Gadget
    • Web Stories
    • Trending
    • Press Release
Have an existing account? Sign In
Follow US
  • Privacy
  • Terms
  • Advertise
  • Contact
Copyright © All Rights Reserved. World of Software.
World of Software > News > ChatGPT just got mind-blowing computer vision powers like in the movies
News

ChatGPT just got mind-blowing computer vision powers like in the movies

News Room
Last updated: 2025/04/17 at 2:48 PM
News Room Published 17 April 2025
Share
SHARE

OpenAI surprised us all with ChatGPT’s new image-generation features, which went viral a few weeks ago. However, it’s worth remembering that the chatbot doesn’t just create images from a text prompt; it can also understand pictures. ChatGPT got its multimodal capabilities last May, which include the ability to look at files, including images.

Fast-forward to OpenAI’s o3 and o4-mini announcement earlier this week, and ChatGPT got a massive upgrade concerning images. It’s something that easily tops its ability to create celebrity deepfakes or Studio Ghibli-style photos.

ChatGPT’s new reasoning models (o3 and o4-mini) can look at an image and integrate it into their chain of thought when handling a question or prompt. The AI manipulates images on its own, which means it can rotate, crop, and zoom in on a photo to find the information you’re looking for.

This is the closest thing we have to the computer vision we see all the time in movies. You know, when the star of the film or TV show tells the tech guy to enhance a blurry image, and then the computer makes everything crystal clear. That can’t happen in real life (well, it sort of can), but AI like ChatGPT o3 and o4-mini can now understand images and their contents much better than before. They can make sense of blurry details in images, just like the computers in those movies.

Sign up for the most interesting tech & entertainment news out there.

By signing up, I agree to the Terms of Use and have reviewed the Privacy Notice.

As a ChatGPT Plus user, I already got access to o3 and o4-mini, which is surprising, considering I live in Europe. I haven’t had a chance to try the new visual reasoning feature, but I went through OpenAI’s demos, and they blew my mind. Here are a few of them:

What is written on the notebook?

In this prompt, OpenAI uploaded a photo of a notebook to ChatGPT o3, asking it “What is written on the notebook?”

ChatGPT o3 looking at an upside-down notebook. Image source: OpenAI

The AI looked at the image, flipped it, recognized the handwriting, and produced the answer.

The AI flipped the image on its own.
The AI flipped the image on its own. Image source: OpenAI

What is written on the sign?

When I saw the following image, I immediately asked, “What sign???”

Can you spot the sign?
Can you spot the sign? Image source: OpenAI

Then, I saw ChatGPT zooming in to find the answer, which it did. Yes, I guess the AI can read blurry images that contain text. Earnestly, I could have made that text up myself after enough zooming. But it’ll be even faster if the AI can pick it up.

o3 zoomed in and read the sign.
o3 zoomed in and read the sign. Image source: OpenAI

Which stop is this?

ChatGPT o3 had to do more than zoom into a photo to answer this prompt: “which stop is this, and what is the frequency of the bus at this stop? search the internet if needed!”

A more difficult prompt.
A more difficult prompt. Image source: OpenAI

The AI had to determine the location, read some of the text visible on the sign, and then provide a final answer.

ChatGPT o3 had no problem reasoning through it, even though it needed nearly three minutes to answer the question.

o3 zoomed in on the photo again to read the text.
o3 zoomed in on the photo again to read the text. Image source: OpenAI

The AI determined the location, zoomed in on the board in the background, translated the text, and then provided a response. Mind. Blown.

Here's the bus schedule for that stop.
Here’s the bus schedule for that stop. Image source: OpenAI

What movies have been filmed here?

Equally impressive is the following demo that OpenAI offered. The AI was given a photo of a location taken through a window.

Can ChatGPT look out the window and understand what it's seeing?
Can ChatGPT look out the window and understand what it’s seeing? Image source: OpenAI

OpenAI asked ChatGPT o3 what movies were filmed at that location, a question that involves reasoning.

First, the AI needs to determine the location by looking out the window. Then, it has to find the movies that might have been shot near that location by browsing the web.

Here's the list of movies.
Here’s the list of movies. Image source: OpenAI

I don’t expect ChatGPT’s new visual reasoning to work flawlessly every time. But if the AI can handle images in its chain of thinking like these OpenAI demos suggest, then we’re looking at incredible functionality for AI chatbots. And yes, the AI’s visual reasoning abilities should improve significantly with future models.

You can see more ChatGPT visual reasoning examples at this link.

Sign Up For Daily Newsletter

Be keep up! Get the latest breaking news delivered straight to your inbox.
By signing up, you agree to our Terms of Use and acknowledge the data practices in our Privacy Policy. You may unsubscribe at any time.
Share This Article
Facebook Twitter Email Print
Share
What do you think?
Love0
Sad0
Happy0
Sleepy0
Angry0
Dead0
Wink0
Previous Article Chinese automaker GAC partners with Momenta on automated driving software · TechNode
Next Article Trustworthy AI: SAP’s approach to smarter AI solutions – News
Leave a comment

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Stay Connected

248.1k Like
69.1k Follow
134k Pin
54.3k Follow

Latest News

6 rumored iPhone 17 features Apple may have delayed or canceled
News
Mexico sues Google over 'Gulf of America' label change
News
Pinterest Agreed to Settle Christine Martinez Lawsuit for $34.7 Million
News
Workforce management startup Rippling raises $450M at $16.8B valuation – News
News

You Might also Like

News

6 rumored iPhone 17 features Apple may have delayed or canceled

4 Min Read
News

Mexico sues Google over 'Gulf of America' label change

3 Min Read
News

Pinterest Agreed to Settle Christine Martinez Lawsuit for $34.7 Million

3 Min Read
News

Workforce management startup Rippling raises $450M at $16.8B valuation – News

4 Min Read
//

World of Software is your one-stop website for the latest tech news and updates, follow us now to get the news that matters to you.

Quick Link

  • Privacy Policy
  • Terms of use
  • Advertise
  • Contact

Topics

  • Computing
  • Software
  • Press Release
  • Trending

Sign Up for Our Newsletter

Subscribe to our newsletter to get our newest articles instantly!

World of SoftwareWorld of Software
Follow US
Copyright © All Rights Reserved. World of Software.
Welcome Back!

Sign in to your account

Lost your password?