I compared ChatGPT’s new image generator to DALL-E 3, and it’s an astonishing improvement, if you have the patience

The mania for AI tools often centers around image generators for the obvious reason that they are, by definition, more visually interesting to play with and demonstrate. OpenAI recently dropped a new image creator inside ChatGPT, showcasing that fact.

The new model is not an upgrade to DALL-E 3, the standard AI image creator from OpenAI, but an entirely new technology.

Not to give away too much early in this article, but yes, the new image creator makes some impressive art. It takes some time to produce, sometimes a couple of minutes, compared to the 30 seconds or less from DALL-E, but the results speak for themselves.

It’s good to the point of being problematic, in fact. It mimics the style of human artists to a degree that feels too close. Irrespective of that, I decided to match the two up in a few prompt comparisons.

Here’s how it went, with DALL-E 3’s image on the left and the one from ChatGPT’s new generator on the right.

Photorealism and text

ChatGPT vs. DALL-E 3 Image creation

(Image credit: Created with ChatGPT)

The first thing I wanted to test was whether either model could nail a classic AI Achilles’ heel: readable text in images. So I asked for: a street sign in New York City that says, “Welcome to the Future.”

Both managed to get the text of the sign right, but DALL-E’s New York didn’t look nearly as real as ChatGPT’s. Plus, the other signs in the ChatGPT image were spelled correctly, while the One Way sign from DALL-E wasn’t quite right.

Object fusion

ChatGPT vs. DALL-E 3 Image creation

(Image credit: Created with ChatGPT)

Next up was a test of how each model handled the challenge of merging two very different animals: a lion and an eagle. The idea was to get something regal, something mythic. My prompt was: “Make a hybrid creature that combines features of a lion and an eagle, perched majestically on a mountain peak.”

DALL-E had a pretty good landscape, and the animal looked fairly realistic, but it was mainly a lion with wings. It also had some random feather strips and a weird tail. ChatGPT made a creature that looks like a painting of a griffin from an alternate world natural history museum. Even the coloring blended, and the musculature of the wings actually looked like it would let them fold onto the creature’s back successfully.

Artistic emulation

ChatGPT vs. DALL-E 3 Image creation

(Image credit: Created with ChatGPT)

After the unpleasantness of the Ghibli mimicry, I wanted to emulate an artist who is long gone, Raphael, but with an event he would never have painted. I asked for “A depiction of scientists unveiling a groundbreaking invention, painted in the style of Raphael.”

ChatGPT responded with an image that looked like a sci-fi Renaissance depiction of the invention of the light bulb, with people not dissimilar from what you’d find in the homes of rich people five hundred years ago, minus the electricity. DALL-E 3 had a more spectacular representation of the same kind of concept. It’s hard to tell if it’s exactly like Raphael, but it is Renaissance-esque, at least. And, honestly, a more fun vision of the idea.

History alive

ChatGPT vs. DALL-E 3 Image creation

(Image credit: Created with ChatGPT)

After the artistic style mimicry, I decided to get very distinct and historical. Recreating something as specific as the Wright brothers’ first flight is no small task. I wanted a scene that felt like a documentary photo. I asked the two to “Make a photo of the Wright brothers’ first flight at Kitty Hawk, with the aircraft in mid-air and spectators watching.”

DALL-E gave me a very odd airplane that didn’t much resemble the one from the real first flight, and frankly, the crowd and landscape veered into the surreal. ChatGPT made a very impressive imitation of a photo, with spectators who look like real people and the correct number of passengers in the first plane (one).

Which one is best?

It’s worth noting that I was only looking at image generation here. You can also perform impressive image edits on photos you upload to ChatGPT, which you can’t do with DALL-E, but that’s a whole different subject.

ChatGPT’s new image generator is amazingly creative and good at following your intent in its images. That led to things like the Ghibli controversy and other questions about artistic ethics. Besides that, it’s the clear winner in every matchup. On the other hand, it takes approximately five times as long to make an image, and it only does one at a time.

DALL-E makes good images quickly and two at a time. It also doesn’t have the limits I discovered with ChatGPT, where I had to wait for eight minutes to start making images again at one point, despite being a ChatGPT Plus subscriber. If I want to impress someone with AI image-making, though, it’s ChatGPT all the way.

The winner: ChatGPT

erichs211@gmail.com (Eric Hal Schwartz)
