🌍 Public

Multimodal Miscues

As my recent interactions with Gemini proved, newfangled AI chatbots continue to find fascinating ways to drop the ball.
3-min read
SHARE
Multimodal Miscues
Image Source: Google Gemini via Claude Code

I'm preparing for my first speaking gig of 2026. The topic shouldn't surprise anyone these days: AI with a dash of citizen development.

As I do with all my talks, I bring my receipts in the form of data. Lots of data. To that end, why not enlist AI to help me generate compelling visualizations?

It's not a difficult ask. Standalone AI image generators have existed since 1973. These days, you need neither a dedicated tool to produce computer-generated JPG and PNG files nor a design program like Canva. Put differently, mainstream AI chatbots are multimodal. As you'll see in this post, they continue to make plenty of baffling non-text mistakes.

Prompting Gemini

I used Claude's deep research feature to generate some recent surveys and studies on AI in the workplace. I threw the results into Notion, took a screenshot of one promising survey, and then crossed over to Gemini. Here's my initial prompt:

Turn this picture into a bar chart.
Notion Image Uploaded to Gemini, January 28, 2026 | Click to enlarge it.

Here's its initial response:

Gemini Result, January 28, 2026 | Click on the image to enlarge it.

That's the data in a structured format. It's a good start, but Gemini failed to produce the requested chart. I responded with the following prompt.

Where's the bar chart? I just see a table.

And Gemini then gave me the abomination below:

Gemini Result, January 28, 2026 | Click on the image to enlarge it.

See a problem?

Sometimes, it's just quicker to pretend that Gemini, Perplexity, and ChatGPT don't exist.

I had to go old school.

AI as a Designer

For a recent post's featured image, I gave Google's much-hyped Nano Banana the following prompt:

Draw a picture in this style of Arnold Schwarzenegger watching a movie.

Nano Banana required four attempts to return something passable. Pictures of decapitated celebrities aren't my jam.

The Enshittification of Tennis Viewing: An AI Infographic
Explaining the madness that today’s fans suffer when they try to watch a match.

The AI Paradox Is Alive and Well

So smart one minute and so dumb the next. Expect plenty of hallucinationsβ€”and not just in text-based responses.

Such is life in the age of AI. Sometimes, it's just quicker to pretend that Gemini, Perplexity, and ChatGPT don't exist.

Feedback

What say you?

Before You Go…
If you'd like to support my writing efforts, I'd appreciate it.

TIP THE AUTHOR

Member discussion