Google Whisk is a new way to create AI visuals using image prompts – here’s how to try it

Google Whisk uses images as inputs instead of text-based prompts
It’s built on Google’s Imagen 3 generative AI model
The experimental tool is free to try for users in the US

Google’s new AI tool makes it easier to create and remix your visual concepts. Instead of asking you to describe what’s in your mind’s eye, Whisk lets you input three image prompts: one for subject, one for scene and one for style. Whisk takes care of the rest, making it a more intuitive way to experiment with different ideas.

While most of the best AI image generators require you to write a detailed prompt, Whisk handles that behind the scenes. When you drop pictures into the web-based Whisk interface as inspiration, Google’s Gemini model automatically analyzes them and writes a detailed caption for each. These captions are then fed into the Imagen 3 model to create a matching image.

For example, you could drop in an image of a car as the subject and a photo of a rural landscape for the scene. You could then add a watercolor as the style to see what Whisk creates. Hit the button and you’ll get a pair of images based on your inputs.
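
To make that flow concrete, here is a minimal Python sketch of the three steps described above: caption each input image, combine the captions into one prompt, then generate a pair of images. It is an illustration only, not Google’s implementation. Whisk is a web tool, and every name here (ImageInput, caption_image, compose_prompt, generate_images) is hypothetical. The two model calls are replaced by simple stand-in functions that return text.

```python
from dataclasses import dataclass


@dataclass
class ImageInput:
    role: str         # "subject", "scene" or "style"
    description: str  # stand-in for the picture itself


def caption_image(image: ImageInput) -> str:
    """Hypothetical stand-in for the captioning step (Gemini, per the article)."""
    return f"{image.role}: {image.description}"


def compose_prompt(captions: list[str], extra_details: str = "") -> str:
    """Join the per-image captions into one editable text prompt."""
    prompt = "; ".join(captions)
    if extra_details:
        prompt += f"; details: {extra_details}"
    return prompt


def generate_images(prompt: str, count: int = 2) -> list[str]:
    """Hypothetical stand-in for the image model (Imagen 3, per the article)."""
    return [f"image {i + 1} for [{prompt}]" for i in range(count)]


# The car, landscape and watercolor example from the text.
inputs = [
    ImageInput("subject", "a car"),
    ImageInput("scene", "a rural landscape"),
    ImageInput("style", "watercolor"),
]
prompt = compose_prompt([caption_image(item) for item in inputs])
results = generate_images(prompt)  # results arrive as a pair
```

The point of the sketch is that the text prompt still exists in the middle of the process. That is why Whisk can let you reveal and edit it when the results miss the mark.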

From here, it’s easy to remix the images. The interface allows you to specify additional text-based details to tweak the outcomes. You can also easily drop in different source images or roll the dice if you’re in need of inspiration. New results appear in pairs in the feed, making it an intuitive way to ideate. You can also choose to refine images by revealing the text prompt and adding more details.

Whisk it up

While Whisk is designed to eliminate the need for text-based prompts, Google includes the option to refine the written prompts because results won’t always match up to the source material.

In a blog post about the experimental tool, Google explains that Whisk “captures your subject’s essence, not an exact replica.” It’s only as effective as Gemini’s analysis of the images you submit. While this is generally very impressive, it can’t get inside your mind: you might expect Whisk to pull out one detail from an image, only for it to focus on another.

The post explains further: “Since Whisk extracts only a few key characteristics from your image, it might generate images that differ from your expectations. For example, the generated subject might have a different height, weight, hairstyle or skin tone. We understand these features may be crucial for your project and Whisk may miss the mark, so we let you view and edit the underlying prompts at any time.”

Even with these shortcomings, Whisk is an interesting application of Google’s existing AI tools. The underlying generative models are the same ones you’d be using if you were chatting with Gemini via its text interface. By relying on image inputs, though, Whisk is a more accessible and intuitive way for visual creators to play with their ideas.

Based on early feedback from digital creatives, Google refers to Whisk as “a new type of creative tool” which is intended for “rapid visual exploration, not pixel-perfect edits.”

How to try Google Whisk

Google Whisk is currently only available to users in the US. If you’re based there, you can try it out via your web browser at labs.google/whisk.

The experimental tool is completely free to play with. Data from your experience with Whisk will be fed back to Google to help refine and develop future AI products.
