Skip to main content

The context you need, when you need it

When news breaks, you need to understand what actually matters — and what to do about it. At Vox, our mission to help you make sense of the world has never been more vital. But we can’t do it on our own.

We rely on readers like you to fund our journalism. Will you support our work and become a Vox Member today?

Join now

The text-to-image revolution, explained

How programmers turned the internet into a paintbrush.

Joss Fong
Joss Fong is a founding member of the Vox video team and a producer focused on science and tech. She holds a master’s degree in science, health, and environmental reporting from NYU.

Beginning in January 2021, advances in AI research have produced a plethora of deep-learning models capable of generating original images from simple text prompts, effectively extending the human imagination. Researchers at OpenAI, Google, Facebook, and others have developed text-to-image tools that they have not yet released to the public, and similar models have proliferated online in the open source arena and at smaller companies like Midjourney.

These tools represent a massive cultural shift because they remove the requirement for technical labor from the process of image-making. Instead, they select for creative ideation, skillful use of language, and curatorial taste. The ultimate consequences are difficult to predict but — like the invention of the camera, and the digital camera after it — these algorithms herald a new, democratized form of expression that will commence another explosion in the volume of imagery produced by humans. But, like other automated systems trained on historical data and internet images, they also come with risks that have not been resolved.

The video above is a primer on how we got here, how this technology works, and some of the implications. And for an extended discussion about what this means for human artists, designers, and illustrators, check out this bonus video:

You can find this video and all of Vox’s videos on our YouTube channel.

More in Video

Video
What would J.R.R. Tolkien think of Palantir?What would J.R.R. Tolkien think of Palantir?
Play
Video

How The Lord of the Rings lore helps explain the mysterious tech company.

By Benjamin Stephen
America, Actually
The progressive plan to reclaim the working classThe progressive plan to reclaim the working class
Podcast
America, Actually

Progressive caucus chair Rep. Greg Casar on his movement’s new playbook.

By Astead Herndon
Video
The Department of Holy WarThe Department of Holy War
Play
Video

What Pete Hegseth’s fascination with the Crusades can tell us about the war in Iran.

By Nate Krieger
Video
Live Nation lost. Will anything change for ticket prices?Live Nation lost. Will anything change for ticket prices?
Play
Video

A jury ruled Live Nation and Ticketmaster a monopoly, but what that means for ticket prices is not so simple.

By Frank Posillico
Eating the Ocean
Why are states unleashing millions of these fish?Why are states unleashing millions of these fish?
Play
Eating the Ocean

America’s fishing paradox.

By Nate Krieger
Video
Why Americans can’t escape credit card debtWhy Americans can’t escape credit card debt
Play
Video

Credit card APRs are now as high as 20 percent.

By Frank Posillico