Imagen (text-to-image model)

Imagen
Imagen
	File:Illuminated Valley in the Afternoon (Imagen 4.0).webp An image generated with Imagen 4. Partial prompt: Softly illuminated afternoon valley with meandering river
Developer	Google DeepMind
Initial release	May 2022; 3 years ago
Stable release	Imagen 4 / 20 May 2025; 11 months ago
Repository	{{URL\|example.com\|optional display text}}Lua error in Module:EditAtWikidata at line 29: attempt to index field 'wikibase' (a nil value).;
Engine	Lua error in Module:EditAtWikidata at line 29: attempt to index field 'wikibase' (a nil value).
Type	Text-to-image model
Website	Imagen website

Imagen is a series of text-to-image models developed by Google DeepMind. They were developed by Google Brain until the company's merger with DeepMind in April 2023.^[1] Imagen is primarily used to generate images from text prompts, similar to Stability AI's Stable Diffusion, OpenAI's DALL-E, or Midjourney.

The original version of the model was first discussed in a paper from May 2022.^[2] The tool produces high-quality images and is available to all users with a Google account through services including Gemini, ImageFX, and Vertex AI.^[3]

History

Imagen's original version was first presented in a paper published in May 2022. It featured the ability to generate high-fidelity images from natural language.^[2] The second version, Imagen 2 was released in December 2023.^[4] The standout feature was text and logo generation.^[5] Imagen 3 was released in August 2024.^[6] Google claims that the newest version provides better detail and lighting on generated images.^[7] On 20 May 2025 at Google I/O 2025 the company released an improved model, Imagen 4.^[8]

Technology

Imagen uses two key technologies. The first is the use of transformer-based large language models, notably T5, to understand text and subsequently encode text for image synthesis. The second is the use of cascaded diffusion models providing high-fidelity image generation. Imagen generates image in three stages, starting from a base of 64x64, then upsampled to 256x256 and 1024x1024.^[2] Imagen 4 generates image up to 2k. ^[9]

Capabilities

Imagen can generate photorealistic images from text prompts.^[3] It can also create various styles, such as cinematic, 35mm film, illustration, and surreal. Like most text-to-image generative AI models, Imagen has difficulty rendering human fingers, text, ambigrams and other forms of typography.

The model can generate images in five aspect ratios, namely 9:16, 3:4, 1:1, 4:3, and 16:9. Imagen can also refine already generated images by editing existing text prompts.^[7]

References

^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
^ ^a ^b ^c Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
^ ^a ^b Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
^ ^a ^b Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).

External links

Imagen website

[1] Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).

[:0-2] Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).

[:2-3] Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).

[4] Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).

[5] Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).

[6] Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).

[:1-7] Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).

[:3-8] Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).

[9] Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).

[1]

[2]

[3]

[4]

[5]

[6]

[7]

[8]

[9]

Imagen (text-to-image model)

Contents

History

Technology

Capabilities

See also

References

External links

Navigation menu

Imagen (text-to-image model)

History

Technology

Capabilities

See also

References

External links

Navigation menu

Search