CLIP interrogator reveals what your robo-artist assistant sees

Ever since DALL•E hit the scene, I’ve been wanting to know what words its model for language-image pairing would use to describe images:

Mobile DALL•E app idea: capture an image, then use GPT-3 to generate descriptive text, then use that as prompts for #dalle.

I want to know how to produce lettering like this: pic.twitter.com/dprziqKWzE
— John Nack (@jnack) June 5, 2022

Now the somewhat scarily named CLIP Interrogator promises exactly that kind of insight:

What do the different OpenAI CLIP models see in an image? What might be a good text prompt to create similar images using CLIP guided diffusion or another text to image model? The CLIP Interrogator is here to get you answers!

Here’s hoping it helps us get some interesting image -> text -> image flywheels spinning.

Nackblog

Musings on photography, illustration, mobile apps, and more

CLIP interrogator reveals what your robo-artist assistant sees

Leave a Reply Cancel reply