Chapter 8. Creative Applications of Text-To-Image Models

This chapter will present creative applications that leverage text-to-image models and increase their capabilities beyond just using text as control. We will start with the most basic ones and then move on to more advanced ones.

Image-to-Image

Even though generative text-to-image diffusion models like Stable Diffusion can produce images from text from a fully noised image, as we learned in Chapters 4 and 5, it is possible to start from an already existing image instead of a fully noised image. That is, add some noise to an initial image and have the model modify it partially by denoising it. This process is called image-to-image, as an image is transformed into another image based on how much it is noised and based on the text prompt.

With the diffusers library, we can load a image-to-image pipeline to load the class. For example, let’s explore how to use SDXL for this ...

Get Hands-On Generative AI with Transformers and Diffusion Models now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.