Hello everyone,
I am currently using GPT-4o's vision capabilities in one of my projects. Put simply, i scrape images off of the web, resize/process them and then send them over to OpenAI's API which analyze said images and generates tags, based on their contents.
However, upon testing this functionality i find that on occasion, GPT-4o refuses to analyze images of any nature. Instead, it provides me with a message that usually looks something like this:
"I don't have the capability to view, recognize or analyze images directly. Please provide a description of the image, and I can help you based on that information."
This only occurs in about 20-25% of instances. The rest of the time, it works perfectly as intended. This is not a fatal error, but more of a slight nuisance/annoyance.
I was wondering if anyone else has experienced similar issues with GPT-Vision? Specifically with the new GPT-4o model? Perhaps my prompts are off?