GPT Image 2 vs the field
Five prompts, five models, 25 images. GPT Image 2 vs. Nano Banana Pro vs. Nano Banana 2 vs. the two older OpenAI models. Full outputs and scorecard below.
What this proves
On the day OpenAI shipped ChatGPT Images 2.0, I ran the same five prompts through five image models. GPT Image 2 was not just competitive. It pulled ahead on UI realism and multi-subject scenes that Google had owned until this morning.
How it works
OpenAI shipped gpt-image-2 this morning. Before anyone had time to write a take, I ran the same five prompts through it, two older OpenAI models, and both of Google's Nano Banana variants, then saved every output and scored every axis.
Interactive grid above. Click any image to zoom. Toggle model columns on/off from the chips.
The headline
GPT Image 2 won every axis. 25/25.
Nano Banana Pro and Nano Banana 2 both scored 23/25. Still strong, but the text + UI lead Google had held for months flipped today.
The biggest gap was on the hardest prompt (three engineers at a whiteboard, handwritten labels, laptop with a Grafana dashboard). GPT Image 2 was the only model that drew all three engineers and a complete, internally consistent microservices diagram: Web Client → API Gateway → User / Order / Auth / Payment services → Postgres / Redis / Kafka. Every other model either missed an engineer, garbled a label, or drew a diagram that did not make engineering sense.
The five prompts and why each one
Designed to stress-test capabilities that matter if you build things:
- Calibration, terminal + code. Pixel-perfect text rendering. Separates modern models from the DALL-E-era baseline instantly.
- UI + 3 bullets. ChatGPT macOS window with a real-looking conversation. Tests the exact capability OpenAI was pitching today.
- Brand hero. Dark landing page with headline, CTAs, a product mock with a score badge and a bar chart. Tests whether it can ship as real product imagery.
- Portrait. Close-up photorealism, no text. Tests the old weakness.
- Whiteboard. Three engineers, handwritten labels, laptop screen. Multi-subject scene consistency and in-scene text. The hardest prompt.
How I ran GPT Image 2 today
GPT Image 2 is behind an API gate (org verification required) at launch, so the cleanest way to run it today was the ChatGPT Plus web app with Thinking mode on. Google's Nano Banana Pro and Nano Banana 2 ran via Vertex AI. The older OpenAI models ran via the /v1/images/generations endpoint.
Surprises
The v1 → v1.5 jump was real. gpt-image-1 is DALL-E-3-era, with garbled text and flat product mocks. gpt-image-1.5, which ChatGPT had quietly been serving until today, renders terminals and UI mocks cleanly. That is a story most people missed.
Nano Banana 2 ties Nano Banana Pro at half the cost. Same total (23/25). Pro is more literal, Nano Banana 2 adds atmosphere. If you are not using a Google-side model yet, start with Nano Banana 2.
Thinking mode on GPT Image 2 is doing work. It visibly planned layout before drawing (took 20-30 seconds to reason, then was fast on the image itself). That planning phase is probably why it got the microservices diagram right when no one else did.
The best landing-page hero came from v2. Prompt 02 asked for a hero for a build called "Landing Page Critic." GPT Image 2's output was more polished than most real product screenshots. Full nav, dark dashboard, B+ 8.4 hex badge, category scores, sub-metric cards, a recommendations panel. It is genuinely usable as a production hero.
Which model for which job
- Landing page heroes and product mocks: GPT Image 2. If API-only, fall back to Nano Banana 2.
- Portraits, photorealism: GPT Image 2 or Nano Banana 2. Nearly tied.
- Text-heavy UI mocks: GPT Image 2 > Nano Banana Pro > Nano Banana 2.
- Multi-subject editorial scenes: GPT Image 2 pulls decisively ahead. No close second.
- Bulk automation on Google: Nano Banana 2 (half the cost of Pro, same quality on most work).
- Do not reach for: gpt-image-1. Use gpt-image-1.5 at minimum if you need an OpenAI fallback.
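If you are wiring this into a pipeline, the routing above boils down to a ranked lookup table. A minimal sketch: the job labels, model strings, and `pick_model` helper are my own illustrative names, not values from any SDK.

```python
# Job-to-model routing distilled from the scorecard above.
# Keys and model strings are illustrative labels, not API model IDs.
ROUTING = {
    "landing_page_hero": ["gpt-image-2", "nano-banana-2"],  # v2 first; Google fallback if API-only
    "portrait":          ["gpt-image-2", "nano-banana-2"],  # nearly tied
    "text_heavy_ui":     ["gpt-image-2", "nano-banana-pro", "nano-banana-2"],
    "multi_subject":     ["gpt-image-2"],                   # no close second
    "bulk_google":       ["nano-banana-2"],                 # half the cost of Pro
}

def pick_model(job: str, available: set[str]) -> str:
    """Return the highest-ranked model for a job that you can actually reach."""
    for model in ROUTING[job]:
        if model in available:
            return model
    raise LookupError(f"No available model for job {job!r}")

# GPT Image 2 is web-only at launch, so an API-only pipeline falls back:
print(pick_model("landing_page_hero", {"nano-banana-2", "gpt-image-1.5"}))
# → nano-banana-2
```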
How this was built
- Google side: Vertex AI, `gemini-3-pro-image-preview` and `gemini-3.1-flash-image-preview` on my `shipwithtez` GCP project. Straight `generateContent` call with `responseModalities: ["IMAGE"]`.
- OpenAI v1 and v1.5: `/v1/images/generations` with `model: "gpt-image-1"` and `"gpt-image-1.5"`. No verification required.
- OpenAI v2: chatgpt.com with Thinking mode on. Saved the images out via DevTools. API access gated behind org verification (Stripe Identity).
- Scripts and dashboard: `~/MacminiDocs/MyProjects/images-shootout-apr21/`, with `run.sh` for Google, `run-openai.sh` for OpenAI, and `dashboard.html` for the internal review. Runbook in `06-reference/learnings/2026-04-21-image-model-shootout-gpt-image-2-launch-day.md`.
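For reference, the two API calls above can be sketched as plain request bodies. This is a minimal sketch of the payload shapes, not the run scripts themselves: the prompt text is a stand-in, and for Vertex AI the model ID goes in the request URL rather than the body.

```python
import json

def vertex_image_request(prompt: str) -> dict:
    """Body for a Vertex AI generateContent call that asks for image output.

    The model (e.g. gemini-3-pro-image-preview) is addressed in the endpoint
    URL; responseModalities: ["IMAGE"] requests an image instead of text.
    """
    return {
        "contents": [{"role": "user", "parts": [{"text": prompt}]}],
        "generationConfig": {"responseModalities": ["IMAGE"]},
    }

def openai_image_request(prompt: str, model: str = "gpt-image-1.5") -> dict:
    """Body for POST /v1/images/generations with the older gpt-image models."""
    return {"model": model, "prompt": prompt, "n": 1}

prompt = "Three engineers at a whiteboard, handwritten microservice labels"
print(json.dumps(vertex_image_request(prompt), indent=2))
print(json.dumps(openai_image_request(prompt), indent=2))
```

Swapping `model` to `"gpt-image-1"` reproduces the v1 baseline run; the gpt-image-2 outputs came from the ChatGPT web app, so there is no API payload to show for those yet.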
What I would do next
A follow-up workflow post is live: Prompt-to-production landing page hero with GPT Image 2. It covers going from a build brief to a shippable hero image in under 20 minutes, using the winning model from this comparison. That post swaps the Landing Page Critic hero on this site with the v2 output from prompt 02 above, as the before-after proof.