Multimodal Prompting for Image Generation

Note: Supporting code and data are available here.

Figure 1. An illustration of Multimodal Prompting: an arbitrary composition of images and text can be used as a prompt to the Stable Diffusion model using the method presented in this article.

In the past few months, image generation models have made groundbreaking progress. While some of these models have been available behind black-box user interfaces (or only for fixed prompts) (Citation: Yu, Xu et al....

October 1, 2022 · 11 min · Nihal Jain