Managed to get the new Deep Floyd image generation model to work in a Colab notebook https://colab.research.google.com/github/huggingface/notebooks/blob/main/diffusers/deepfloyd_if_free_tier_google_colab.ipynb
It's supposed to be able to handle words... it's not quite there yet though. Here's what I got for: a photograph of raccoon in the woods holding a sign that says "I will eat your trash"
@simon wow that's pretty close! I was so frustrated trying to get Dall-E to put words in an image. Is this a known-nit-so-great corner of AI image generation?