The Hidden Language of Diffusion Models [ICLR'24]

Text-to-image diffusion models have demonstrated an unparalleled ability to generate high-quality, diverse images from a textual prompt. However, the internal representations learned by these models remain an enigma. In this work, we present Conceptor, a novel method to interpret the internal representation of a textual concept by a diffusion model. This interpretation is obtained by decomposing the concept into a small set of human-interpretable textual elements. Applied over the state-of-the-art Stable Diffusion model, Conceptor reveals non-trivial structures in the representations of concepts. For example, we find surprising visual connections between concepts, that transcend their textual semantics. We additionally discover concepts that rely on mixtures of exemplars, biases, renowned artistic styles, or a simultaneous fusion of multiple meanings of the concept. Through a large battery of experiments, we demonstrate Conceptor's ability to provide meaningful, robust, and faithful decompositions for a wide variety of abstract, concrete, and complex textual concepts, while allowing to naturally connect each decomposition element to its corresponding visual impact on the generated images.

NEW! code is published under the google research repo

Description

Official implementation of the paper The Hidden Language of Diffusion Models. The paper proposes a method dubbed Conceptor to produce explanations for text-to-image diffusion models.

Concept Explanation with Conceptor

Given a concept of interest (e.g., a president) and a text-to-image model, we generate a set of images to visually represent the concept. Conceptor then learns to decompose the concept into a small set of interpretable tokens, with the objective of reconstructing the generated images. The decomposition reveals interesting behaviors such as reliance on exemplars (e.g., "Obama", "Biden").

Single-image Decomposition with Conceptor

Given a single image from the concept, our method extracts the tokens in the decomposition that caused the generation of the image. For example, a snail is decomposed into a combination of ladybug and winding due to the structure of its body, and the texture of its shell.

Concept Manipulation with Conceptor

Our method enables fine-grained concept manipulation by modifying the coefficient corresponding to a token of interest. For example, by manipulating the coefficient corresponding to the token abstract in the decomposition of the concept sculpture, we can make an input sculpture more or less abstract.

Citing our paper

If you make use of our work, please cite our paper:

@article{chefer2023hidden,
        title={The Hidden Language of Diffusion Models},
        author={Chefer, Hila and Lang, Oran and Geva, Mor and Polosukhin, Volodymyr and Shocher, Assaf and Irani, Michal and Mosseri, Inbar and Wolf, Lior},
        journal={arXiv preprint arXiv:2306.00966},
        year={2023}
}

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

The Hidden Language of Diffusion Models [ICLR'24]

NEW! code is published under the google research repo

Description

Concept Explanation with Conceptor

Single-image Decomposition with Conceptor

Concept Manipulation with Conceptor

Citing our paper

Files

README.md

Latest commit

History

README.md

File metadata and controls

The Hidden Language of Diffusion Models [ICLR'24]

NEW! code is published under the google research repo

Description

Concept Explanation with Conceptor

Single-image Decomposition with Conceptor

Concept Manipulation with Conceptor

Citing our paper