Timezone: »

 
Discovering the Hidden Vocabulary of DALLE-2
Giannis Daras · Alex Dimakis
Event URL: https://openreview.net/forum?id=jxeSZaVzpmg »

We discover that DALLE-2 seems to have a hidden vocabulary that can be used to generate images with absurd prompts. For example, it seems that Apoploe vesrreaitais'' means birds andContarra ccetnxniams luryca tanniounons'' (sometimes) means bugs or pests. We find that these prompts are often consistent in isolation but also sometimes in combinations. We present our black-box method to discover words that seem random but have some correspondence to visual concepts. This creates important security and interpretability challenges.

Author Information

Giannis Daras (University of Texas, Austin)
Alex Dimakis (University of Texas, Austin)

More from the Same Authors