ChatGPT Can Now Generate Images, Too

September 24, 2023

ChatGPT can now generate images — and they are shockingly detailed.

On Wednesday, OpenAI, the San Francisco artificial intelligence start-up, released a new version of its DALL-E image generator to a small group of testers and folded the technology into ChatGPT, its popular online chatbot.

Called DALL-E 3, it can produce more convincing images than previous versions of the technology, showing a particular knack for images containing letters, numbers and human hands, the company said.

“It is far better at understanding and representing what the user is asking for,” said Aditya Ramesh, an OpenAI researcher, adding that the technology was built to have a more precise grasp of the English language.

By adding the latest version of DALL-E to ChatGPT, OpenAI is solidifying its chatbot as a hub for generative A.I., which can produce text, images, sounds, software and other digital media on its own. Since ChatGPT went viral last year, it has kicked off a race among Silicon Valley tech giants to be at the forefront of A.I. with advancements.

OpenAI has long offered ways of connecting its chatbot with other online services, including Expedia, OpenTable and Wikipedia. But this is the first time the start-up has combined a chatbot with an image generator.

DALL-E and ChatGPT were previously separate applications. But with the latest release, people can now use ChatGPT’s service to produce digital images simply by describing what they want to see. Or they can create images using descriptions generated by the chatbot, further automating the generation of graphics, art and other media.

In a demonstration this week, Gabriel Goh, an OpenAI researcher, showed how ChatGPT can now generate detailed textual descriptions that are then used to produce images. After creating descriptions of a logo for a restaurant called Mountain Ramen, for instance, the bot generated several images from those descriptions in a matter of seconds.

The new version of DALL-E can produce images from multi-paragraph descriptions and closely follow instructions laid out in minute detail, Mr. Goh said. Like all image generators — and other A.I. systems — it is also prone to mistakes, he said.