Google’s Imagen 3: The Latest in Text-to-Image AI Technology

slacerna August 17, 2024

The latest version of Google’s text-to-image model, Imagen 3, is now available to the public. Teased in May at Google I/O, Imagen 3 is the tech giant’s most advanced AI image generator, rivaling models such as Midjourney, DALL-E 3, and xAI’s largely unfiltered Grok-2.

Advanced Safety Measures

Unlike Elon Musk’s Grok-2, which has gained notoriety for generating copyrighted images and deepfakes of public figures, Imagen 3 reflects a more cautious approach on Google’s part. The company states that it “used extensive filtering and data labeling to minimize harmful content in datasets and reduce the likelihood of harmful outputs.” Additionally, images created by Imagen 3 carry Google’s SynthID digital watermark to help identify their provenance.

Superior Performance

Beyond its safety protocols, Google claims that Imagen 3 offers greater versatility in understanding prompts, higher-quality images, and improved text rendering, which remains an ongoing challenge for all AI image models. Users who have tested Imagen 3 report impressive results, although some Reddit users have criticized the model for being too restrictive in the types of images it will generate.

Availability

If you’d like to explore Imagen 3 and test its boundaries, the model is currently available via ImageFX and Vertex AI. Google says it will soon be integrated into Google AI features in Workspace and into Gemini on both web and mobile platforms.

About the Author

Cecily is a tech reporter at Mashable who covers AI, Apple, and emerging tech trends. Before earning her master’s degree at Columbia Journalism School, she worked with startups and social impact businesses for Unreasonable Group and B Lab. She also co-founded a startup consulting business for emerging entrepreneurial hubs in South America, Europe, and Asia. Find her on Twitter at @cecily_mauran.
