Today on AI For Humans The Newsletter!
Google’s new image generator is incredibly easy to use
OpenAI just unleashed hordes of AI agents
And Pika’s got another generative video party trick to show you
Plus, our can’t-miss AI feature of the week!

Welcome back to the AI For Humans Newsletter!

AI Image generation is something we’ve been tracking here for a VERY long time — some of our first episodes of the pod were about how exciting early Midjourney models were. But as of 2025, we’ve gotten pretty close to parity on photorealism and control in initial AI image generation - it’s mostly been solved.

However, a nagging problem in the space has been photo manipulation. What if you want to remove something easily from the photo? Or swap out something different? Or change the color of the background but keep the main subject the same?

I recently forgot to take my picture in Miami. Now I have it!

Enter the new multi-modal <takes deep breath> Gemini 2.0 Flash (Image Generation) Experimental.

Google’s new model is a massive step-up (good deep dive here) in being able to ask for changes to existing photos, better comprehend what you’re asking for, and unlock all sorts of photo manipulation that wasn’t possible before.

— # (#)

In context, "multimodal" refers to the capability of a model to process and integrate multiple types of data inputs, such as text, images, audio, and video. This allows the model to understand and generate content that combines these different data forms, leading to more comprehensive and versatile outputs.

The use cases we’ve seen are broad and fascinating: Change clothes on your subject with ease, change of subject but keep the same style, and maybe our favorite one… create start and end frames of a kids movie keeping a consistent character throughout.

Is it perfect? Far from it. Kevin had a pretty terrifying experience attempting to make a new headshot and many users have pointed out that it’s VERY good at removing watermarks from images.

But this is a new world of democratizing the tools around image manipulation and giving users a LOT more control over how AI image gen works. Will it kill photoshop? Eh, prob not but it does give AI imaging tools a lot more power.

3 Things To Know

More Agents Are On The Way
OpenAI released a toolkit for developers to build their own agents. Similar to ChatGPT Operator, these agents can navigate the web autonomously to do your bidding. It means we’ll likely see more contextual agents, those that perform specific jobs better than a generalist agent like Operator.

Try Google’s New Deep Research for Free!
Deep Research may be 2025’s most impactful AI development for everyday work. Google first launched one last year, but it felt like a link roundup newsletter. OpenAI made big headlines with their excellent implementation, followed closely by Perplexity giving you 5 free queries per day. But Google’s back with a new version, built atop the Gemini 2 family of models, and early feedback points to it being a big improvement.

The Latest Pikaffect - Morph
The folks at Pika Labs keep shipping specific generative video transformations and this is a fun one. Provide an image of the subject, and have it animate you morphing into a new visual. Does it feel a little 2023 face swap party-tricky? It sure does. But we’ve never been above cheap tricks for a laugh here.

We 💛 This - Midjourney Image Retexture

Last week we talked through Runway’s new method to restyle any video clip. In short you simply provide it with the first frame of the video, remade in a different style, with the same structure.

But how do you restyle a single frame from a video? A few ways to do this:

Magnific - Powerful, but heckin spendy
Adobe Firefly structure reference - Kinda works but the output images aren’t that impressive
Gemini Flash image generator - The new kid on the block, works pretty well but if you’re going for specific aesthetics it may struggle
👑 Midjourney image retexture - Probably the best tool for this on the market at the moment, if you want very high quality output images

It’s part of their “Full Editor”, available to anyone either (1) on an annual plan, (2) who has been sub’d for a year or more, or (3) has generated more than 10k Midjourney jobs.

In Midjourney simply upload your original image, and prompt to your heart’s delight to “retexture” the image. Then plug it into Runway’s first frame feature to restyle the video clip:

The original
Two intricately detailed steampunk robots
Two babies, shot on a smartphone camera
Two bright green cartoon anthropomorphic slime people
Two victorian characters from an old fashioned painting

Are you a creative or brand looking to go deeper with AI?
Join our community of collaborative creators on the AI4H Discord
Get exclusive access to all things AI4H on our Patreon
If you’re an org, consider booking Kevin & Gavin for your next event!

Google Launches A Sneaky-Good Image Generator

3 Things To Know

We 💛 This - Midjourney Image Retexture

Keep Reading

AI For Humans - The Newsletter