Could OpenAI’s text-to-image generator have invented its own language?

OpenAI’s incredible text-to-image generator DALL-E appears to have created its own written language, according to Giannis Daras, a computer science Ph.D. student at the University of Texas at Austin.

Daras asked the algorithm to create an image of “farmers talking about vegetables,” which in turn generated an unknown word, “vicootes.” When he put the word “vicootes” back into the algorithm, DALL-E created pictures of vegetables.
The image of the farmers also contained the nonsense text “poploe vesrreaitars.” When Daras put the phrase into the system, the algorithm generated images of birds.
“It seems that the farmers are talking about birds, messing with their vegetables!” Daras wrote on Twitter.

A known limitation of DALLE-2 is that it struggles with text. For example, the prompt: "Two farmers talking about vegetables, with subtitles" gives an image that appears to have gibberish text on it.

However, the text is not as random as it initially appears... (2/n) pic.twitter.com/B3e5qVsTKu
— Giannis Daras (@giannis_daras) May 31, 2022

The Theory: In a paper that’s yet to be peer-reviewed, Daras states that DALL-E has generated a hidden vocabulary underlying or working in parallel with its primary function. He adds that the discovery of DALL-E’s secret language creates many interesting security and interpretability challenges.

The Counter-Theory: Daras also underlines that the results aren’t always consistent. For example, the prompt “Contarra ccetnxniams luryca tanniounons” generates pictures of bugs, while other times, it generates images of “mostly animals.”

And some others believe the theory doesn’t hold up. Research analyst Benjamin Hilton argued, “My best guess? It’s random chance. Or just maybe (if you really press me) ‘Apoploe vesrreaitais’ looks like a binomial name for some birds or bugs.” He added, “To me this is all starting to look a lot more like stochastic, random noise, than a secret DALL-E language.”
Others believe it could be DALL-E’s ability to create shortcuts by turning images into code.

If I try a cartoon, or a 3D render, DALL-E generates lots of bugs (some of which can fly) and no birds.

My best guess? It’s random chance. Or just maybe (if you really press me) “Apoploe vesrreaitais” looks like a binomial name for some birds or bugs.

(6/15) pic.twitter.com/hC3g2B9HRS
— Benjamin Hilton (@benjamin_hilton) May 31, 2022

Could OpenAI’s text-to-image generator have invented its own language?

Redditor breaks ChatGPT's brain with alphabet overload

Brain-reading AI device returns walking power to paralyzed man

Viral AI-generated Pentagon explosion image rattles Twitter

Blizzard banks on AI for next-level game design

Apple restricts employees from using ChatGPT

Influencer creates an AI version of herself that will date you for $1 a minute