Category Archives: AI/ML

Ideogram enables personalized generation using just a single image

July 30, 2025AI/MLjnack

The AI generator—of which I’ve been a longtime fan—has introduced the ability to upload a single image of a person (or cat!), then use it in creating images. It’s hard to overstate just how long people have wanted this kind of control & simplicity.

For a deeper look, here’s a quick demo from the team:

Photoshop’s new Harmonize looks rad

July 29, 2025AI/MLjnack

Having worked on such features going all the way back to (OMG) 2002, I can’t wait to try this out!

#Photoshop beta new feature! Seamlessly composite images with Harmonize! pic.twitter.com/J4biVk702N

— Paul Trani (@paultrani) July 29, 2025

And yet, I must wait: as to why the beta isn’t available for me to download via the CC desktop app…

😉

Google Photos adds GenAI features

July 28, 2025AI/ML, Google Photos, Photographyjnack

The app promises to let you turn static images into short videos and transform them into fun art styles, plus explore a new creation hub.

I’m excited to try it out, but despite the iOS app having been just updated, it’s not yet available—at least for me. Meanwhile, although I just bit the bullet & signed up for the $20/mo. plan, the three video attempts that Gemini allowed me today all failed. ¯\_(ツ)_/¯

Childhood drawings brought to life through Midjourney video

July 25, 2025AI/MLjnack

Even though I got absolutely wrecked for having the temerity to use one of my son’s cute old drawings in an AI project last year (no point in now digging up the hundreds of flames it drew), I still enjoy seeing this kind of creative interpretation:

My mom sent me a 30-year-old drawing…

That I made for her when I was a kid.

Naturally, I animated with Midjourney..

+ It’s not perfect
+ But it’s awesome
+ And it captures the innocent chaos
+ Of my childhood imagination.

Yes, my mother is fighting a dragon…

With a… pic.twitter.com/hMd1dOdoXR

— Rory Flynn (@Ror_Fly) July 6, 2025

Gemini enables image-to-video

July 24, 2025AI/MLjnack

Man, am I now gonna splash out for another monthly subscription? I haven’t done so yet, but these results are pretty darn impressive:

To turn your photos into videos, select ‘Videos’ from the tool menu in the prompt box and upload a photo. … The photo-to-video capability is starting to roll out today to Google AI Pro and Ultra subscribers in select countries around the world. Try it out at gemini.google.com. These same capabilities are also available in Flow, Google’s AI filmmaking tool.

Higgsfield Soul: Generate -> Inpaint -> Animate

July 3, 2025AI/MLjnack

Okay, so this isn’t precisely what I thought it was at first (video inpainting), but rather an creation->inpainting->animation flow. Still, the results look impressive:

How it works:

→ Generate an image in Higgsfield Soul
→ Inpaint directly with a mask and a prompt
→ Combine with Camera moves, VFX, and Avatars to turn static edits into living, speaking visuals pic.twitter.com/ENHqdA3WHm

— Higgsfield AI (@higgsfield_ai) July 3, 2025

John Oliver vs. AI slop

June 29, 2025AI/MLjnack

“What a fun way to celebrate the destruction of our shared objective reality!” :->

But honestly this is a really insightful, hilarious, and eye-opening tour through the charms & many, many discontents of our new world:

Google steps up virtual try-on with Doppl

June 27, 2025AI/ML, Try-onjnack

As I’ve noted previously, Google has been trying to crack the try-on game for a long time. Back in the day (c. 2017), we really want to create AR-enabled mirrors that could do this kind of thing. The tech wasn’t quite ready, and for the realtime mirror use case it likely still isn’t, but check out the new free iOS & Android app Doppl:

In May, Google Shopping announced the ability to virtually try billions of clothing items on yourself, just by uploading a photo. Doppl builds on these capabilities, bringing additional experimental features, including the ability to use photos or screenshots to “try on” outfits whenever inspiration strikes.

Doppl also brings your looks to life with AI-generated videos — converting static images into dynamic visuals that give you an even better sense for how an outfit might feel. Just upload a picture of an outfit, and Doppl does the rest.

AI brings people to tears—of joy

June 23, 2025AI/ML, Photographyjnack

Several years ago, MyHeritage saw a huge (albeit short-lived) spike in interest from their Deep Nostalgia feature that animated one’s old photos. Everything old is new again, in many senses. Check out Reddit founder Alexis Ohanian talk about how touching he found the tech—as well as tons of blowback from people who find it dystopian.

Damn, I wasn’t ready for how this would feel. We didn’t have a camcorder, so there’s no video of me with my mom. I dropped one of my favorite photos of us in midjourney as ‘starting frame for an AI video’ and wow… This is how she hugged me. I’ve rewatched it 50 times. pic.twitter.com/n2jNwdCkxF

— Alexis Ohanian (@alexisohanian) June 22, 2025

Little Happy AI Trees

June 22, 2025AI/ML, Illustrationjnack

“Today we won’t need our paints or our brushes, or our joy.” Oh boy…

View this post on Instagram

A post shared by Joe Nunnink (@nunnigram)

Greg the Stormtrooper

June 21, 2025AI/MLjnack

I’ve heard people referring to the recent release of Google’s Veo 3 as the ChatGPT moment for video generation—that is, a true inflection point at which a mere curosity becomes something of real value. The spatial & character coherence of its output, and especially its ability to generate speech & other audio, turn it into a genuine storytelling tool.

You’ve probably seen some of the myriad vlogger-genre creations making the rounds. Here’s one of my faves:

Better than any recent Disney Star Wars show or movie. pic.twitter.com/iikJIVP8pq

— MERICA MEMED (@Mericamemed) June 8, 2025

Jawas gone wild

PartCrafter 3D promises to make multiple useful 3D meshes from a single image

June 16, 20253D, AI/MLjnack

You had me at “editable 3D Lego Vader minifig from image of Lego Vader.” 🙂 Check it out:

On classic cars & the feeling of craft

June 8, 2025AI/ML, Designjnack

John Gruber recently linked back to this clip in which designer Neven Mrgan highlights what feels like an important consideration in the age of mass-generated AI “designs”:

I think that was what mattered is that they looked rich, they looked like a lot of work had been put into them. That’s what people latch onto. It seems it’s something that, yes, they should have spent money on, and they should be spending time on right now.

Regardless of what tools were used in the making of a piece, does it feel rich, crafted, thoughtfully made? Does it have a point, and a point of view? As production gets faster, those qualities will become all the more critical for anything—and anyone—wishing to stand out.

“A surrealist design engine no one asked for”

June 4, 2025AI/ML, Illustrationjnack

A while back, Sam Harris & Ricky Gervais discussed the impossibility of translating a joke discovered during a dream (“What noise does a monster make?”) back into our consensus waking reality. Like… what?

I get the same vibes watching ChatGPT try to dredge up some model of me and of… humor?… in creating a comic strip based on our interactions. I find it uncanny, inscrutable, and yet consequently charming all at once.

“Hey ChatGPT, based on what you know about me, please create a four-panel comic you think I’d like…” https://t.co/U7WRfShGRh

— John Nack (@jnack) June 4, 2025

The new Flux rocks for image restoration

June 2, 2025AI/ML, Photographyjnack

Please tell me Adobe is hiding off screen, secretly cooking up magic. Please…

Meanwhile, you can try it yourself here.

Flux Kontext via @replicate

Kontext just put every photo restoration company out of business. This took 6 seconds. pic.twitter.com/a3vT9BhLZY

— Adam Hails (@Clearstory3D) May 30, 2025

It’s remarkable how good kontext is at this, as a general purpose editing model rather than a specialised restoration one. https://t.co/pesZOzmir3 pic.twitter.com/29TnU4U1Zl

— fofr (@fofrAI) May 31, 2025

New Google virtual try-on tech

May 28, 2025AI/ML, Try-onjnack

Take it away, Marques:

Google’s new AI “Try On” feature pic.twitter.com/KYZfue3Poh

— Marques Brownlee (@MKBHD) May 20, 2025

To try it yourself:

Opt in to get started: Head over to Search Labs and opt into the “try on” experiment.

Browse your style: When you’re shopping for shirts, pants or dresses on Google, simply tap the “try it on” icon on product listings.

Strike a pose: Upload a full-length photo of yourself. For best results, ensure it’s a full-body shot with good lighting and fitted clothing. Within moments, you can see how the garment will look on you.

“Kafkaesque Workplace Theater”

May 27, 2025AI/MLjnack

Sounds like kind of an awful band, doesn’t it? How about “Prompt Washing & the Insight Decay Spiral?” (Take that, Billy Corgan.)

This list from Brad Koch puts a finger directly on some of the maladaptive behaviors we’re seeing in our new cognitive golden age…

Microsoft Photos adds interactive relighting

May 19, 2025AI/ML, Photographyjnack

Man, for 18 years (yes, I keep the receipts) I’ve been wanting to ship an interactive relighting experience—and now my team has done it! Check out the quick demo below plus details on DP Review.

Higgsfield debuts Ads

May 17, 2025AI/MLjnack

Sigh… having quickly exhausted my paid credits, Imma have to up my subscription level, aren’t I? But these are good problems to have. 🙂

Meet Higgsfield Ads: turn one product pic into a studio ad in seconds.

Upload a product photo, pick one of 40+ templates, and walk away with a cinematic spot.

Retweet, like, comment “Ads” and the link with a detailed guide and 150+ use-cases hits your DMs. pic.twitter.com/LjBg9Ostxg

— Higgsfield AI (@higgsfield_ai) May 15, 2025

Every prompt is awesome! LegoGPT

May 14, 2025AI/MLjnack

Text-to-brick FTW!

We’ve released the code for LegoGPT. This autoregressive model generates physically stable and buildable designs from text prompts, by integrating physics laws and assembly constraints into LLM training and inference.

This work is led by PhD students @AvaLovelace0, @kangle_deng,… pic.twitter.com/wdhkF5NBgC

— Jun-Yan Zhu (@junyanz89) May 9, 2025

Quick tutorial: Runway References

May 7, 2025AI/MLjnack

Identity preservation FTW—though I’ve yet to test this feature with my own face & will reserve judgement a bit until I’ve done so:

[Via Jan Kabili]

Krea introduces “GPT Paint”

May 6, 2025AI/ML, GPT-4ojnack

Continuing their excellent work to offer more artistic control over image creation, the fast-moving crew at Krea has introduced GPT Paint—essentially a simple canvas for composing image references to guide the generative process. You can directly sketch, and/or position reference images, then combine the input with prompts & style references to fine-tune compositions:

introducing GPT Paint.

now you can prompt ChatGPT visually through edit marks, basic shapes, notes, and reference images.

available now on Krea Image. pic.twitter.com/oHiPIedUNz

— KREA AI (@krea_ai) May 1, 2025

Historically, approaches like this have sounded great but—at least in my experience—have fallen short.

Think about what you’d get from just saying “draw a photorealistic beautiful red Ferrari” vs. feeing in a crude sketch + the same prompt.

In my quick tests here, however, providing a simple reference sketch seems helpful—maybe because GPT-4o is smart enough to say, “Okay, make a duck with this rough pose/position—but don’t worry about exactly matching the finger-painted brushstrokes.” The increased sense of intentionality & creative ownership feels very cool. Here’s a quick test:

I’m not quite sure where the spooky skull and, um, lightning-infused martini came from. 🙂

Runway adds References

May 1, 2025AI/MLjnack

This looks amazing for character consistency! See thread for more examples.

GPT-4o infographics: Faraway, so close!

April 30, 2025AI/ML, GPT-4o, Infographicsjnack

Things are night-and-day better than they were just a month ago (in the dark DALL•E days), but would you like your owl with FEAFERS?

Oh, ChatGPT, you are *almost* good at infographics… But what’s with the EATON and FEAFERS? pic.twitter.com/K7vDRjRsdP

— John Nack (@jnack) April 16, 2025

“When Identity Preservation Goes Wrong”

April 29, 2025AI/ML, GPT-4ojnack

Hah hah oh nooooo… Enjoy some creepy & unintended fun from GPT-4o:

ChatGPT prompted 74 times

“Create the exact replica of this image, do not change a thing”

This is why I say you need to start a new chat after each edit pic.twitter.com/LTFjQebA5e

— A.I.Warper (@AIWarper) April 28, 2025

GPT-4o image creation is coming to Designer!

April 23, 2025AI/ML, GPT-4o, Illustrationjnack

Having created 200+ images in just the last month via this still-new image model (see new blog category that gathers some of them), I’m delighted to say that my team is working to bring it to Microsoft Designer, Copilot, and beyond. From the boss himself:

5/ Create: This one is fun. Turn a PowerPoint into an explainer video, or generate an image from a prompt in Copilot with just a few clicks.

We’ve also added new features to make Copilot even more personalized to you, plus a redesigned app built for human-agent collaboration. pic.twitter.com/m1oTf53aai

— Satya Nadella (@satyanadella) April 23, 2025

Fun recent GPT-4o explorations

April 23, 2025AI/ML, GPT-4ojnack

Just sharing a few things I’ve been trying.
For Easter, my cousin’s sweet pup as sweet treats:

Closing out Easter by turning my cousin’s dog into peeps, Paas, Jelly Belly, and more. pic.twitter.com/OnJhjiM6Z7

— John Nack (@jnack) April 21, 2025

Bespoke felt ornaments FTW:

Getting an early start on Christmas, visualizing friends’ cars as felt ornaments (GPT-4o + @higgsfield_ai): pic.twitter.com/FkAjp8FWdz

— John Nack (@jnack) April 21, 2025

Creating cozy slippers from an A-10 Warthog:

“You’re now the proud owner of the most dangerously cozy footwear in the sky. Plush, cartoon A-10 Warthogs with big doe eyes and turbine engines ready to warm your toes and deliver cuddly close air support. Let me know if you want tiny GAU-8 Gatling gun detailing on the front.”… pic.twitter.com/lKLRJGALaw

— John Nack (@jnack) April 18, 2025

StarVector: Text/Image->SVG Code

April 22, 2025AI/ML, Illustrationjnack

Back at Adobe we introduced Firefly text-to-vector creation, but behind the scenes it was really text-to-image-to-tracing. That could be fine, actually, provided that the conversion process did some smart things around segmenting the image, moving objects onto their own layers, filling holes, and then harmoniously vectorizing the results. I’m not sure whether Adobe actually got around to shipping that support.

In any event, StarVector promises actual, direct creation of SVG. The results look simple enough that it hasn’t yet piqued my interest enough to spend my time with it, but I’m glad that folks are trying.

StarVector official app is out on Hugging Face

Generating Scalable Vector Graphics Code from Images and Text pic.twitter.com/4nIr0eHJzG

— AK (@_akhaliq) March 24, 2025

“You eat what you are”

April 21, 2025AI/MLjnack

There’s a way-higher-than-zero chance that you won’t want to check out this AI rendering; just sayin’. 🙂

AI is getting out of hand pic.twitter.com/4nLya89kyY

— Charly Wargnier (@DataChaz) April 7, 2025

That Happy Meal feel

April 17, 2025AI/ML, GPT-4ojnack

Sure, the environmental impact of this silliness isn’t great, but it’s probably still healthier than actually eating McDonald’s. :-p

Having a ball turning my family into Happy Meal figures. (See prompt in quoted post from @firatbilal) https://t.co/9Az1Om6BBe pic.twitter.com/bB55ebBQ3r

— John Nack (@jnack) April 17, 2025

Tangentially, I continue to have way too much fun applying different genres to amigos:

My love language is turning friends’ family photos into GPT-4o-powered illustrations. pic.twitter.com/5qYzmhtxFq

— John Nack (@jnack) April 8, 2025

AI Logofluff

April 15, 2025AI/ML, Design, GPT-4ojnack

ChatGPT has famous marks marching to fuzz:

How about using the same prompt to create fluffy logos? https://t.co/SIVCbhmJ1x pic.twitter.com/m4wyF7zADM

— Gizem Akdag (@gizakdag) April 13, 2025

And now Microsoft Designer has me feeling truly warm & fuzzy:

Google, Dolphins, and Ai-i-i-i-i!

April 14, 2025AI/MLjnack

Three years ago (seems like an eternity), I remarked regarding generative imaging.,

The disruption always makes me think of The Onion’s classic “Dolphins Evolve Opposable Thumbs“: “Holy f*ck, that’s it for us monkeys.” My new friend August replied with the armed dolphin below.

I’m reminded of this seeing Google’s latest AI-powered translation (?!) work. Just don’t tell them about abacuses!

Meet DolphinGemma, an AI helping us dive deeper into the world of dolphin communication. pic.twitter.com/2wYiSSXMnn

— Google DeepMind (@GoogleDeepMind) April 14, 2025

[Via Rick McCawley]

How MCP is like electricity & the REA

April 10, 2025AI/MLjnack

Wait, first, WTF is MCP? Check out my old friend (and former Illustrator PM) Mordy’s quick & approachable breakdown of Model Context Protocol and why it promises to be interesting to us (e.g. connecting Claude to the images on one’s hard drive).

Don’t mind me, just turning the dog into a guy, because why not?

April 9, 2025AI/ML, GPT-4ojnack

The better question may be, what are you waiting for? 😉 Let’s roll, ChatGPT:

Fun! Here’s my guy Seamus as a sleepy dude. pic.twitter.com/JZ7XOT7Kws

— John Nack (@jnack) April 9, 2025

Draw 3D-styled characters with Google Gemini

April 7, 2025AI/ML, Illustrationjnack

Check out this fun toy:

[1/8] Drawing → 3D render with Gemini 2.0 image generation… by @dev_valladares + me

Make your own at the link belowhttps://t.co/sy8poJZYuQ pic.twitter.com/DkGT6DRUsb

— Trudy Painter (@trudypainter) April 4, 2025

Apparently I’m over my quota, so sadly the world will never get to see a Ghiblified rendering of my crudely drawn goldendoodle!

Friday’s Microsoft Copilot event in 9 minutes

April 6, 2025AI/MLjnack

The team showed of good new stuff, including—OMG—showing how to use Photoshop! (On an extremely personal level, “This is what it’s like when worlds colliiiide!!”)

As it marks its 50th anniversary, Microsoft is updating Copilot with a host of new features that bring it in line with other AI systems like ChatGPT or Claude. We got a look at them during the tech giant’s 50th anniversary event today, including new search capabilities, Copilot Vision which will be able to analyze real-time video from a mobile camera. Copilot will also now be able to use the web on your behalf. Here’s everything you missed.

00:00 Intro and Copilot Agents
2:07 Copilot for planning
2:30 Copilot AI podcast generating
3:18 Copilot Shopping
3:39 Copilot Vision
4:07 Copilot feature use cases demo
6:16 Researcher, Copilot Studio, custom agents
6:53 Copilot Memory
7:23 Custom Copilot appearances
8:48 Outro

Rustlin’ up some Russells

April 4, 2025AI/ML, GPT-4ojnack

2025 marks an unheard-of 40th year in Adobe creative director Russell Brown’s remarkable tenure at the company. I remember first encountering him via the Out Of Office message marking his 15-year (!) sabbatical (off to Burning Man with Rick Smolan, if I recall correctly). If it weren’t for Russell’s last-minute intervention back in 2002, when I was living out my last hours before being laid off from Adobe (interviewing at Microsoft, lol), I’d never have had the career I did, and you wouldn’t be reading this now.

In any event, early in the pandemic Russell kept himself busy & entertained by taking a wild series of self portraits. Having done some 3D printing with him (the output of which still forms my Twitter avatar!), I thought, “Hmm, what would those personas look like as plastic action figures? Let’s see what ChatGPT thinks.” And voila, here they are.

Click through the tweet below if you’re curious about the making-of process (e.g. the app starting to render him very faithfully, then freaking out midway through & insisting on delivering a more stylized, less specific rendition). But forget that—how insane is it that any of this is possible??

Can you show ChatGPT 5 portraits of legendary Adobe creative director Russell Brown and get a whole set of action figures? Yep! pic.twitter.com/gLTIcGqLJ0

— John Nack (@jnack) April 4, 2025

“The Worlds of Riley Harper”

April 3, 2025AI/MLjnack

It’s pretty stunning what a single creator can now create in a matter of days! Check out this sequence & accompanying explanation (click on the post) from Martin Gent:

I tried to make this title sequence six months ago, but the AI tools just weren’t up to it. Today it’s a different story. Sound on!

Since the launch of ChatGPT’s 4o image generator last week, I’ve been testing a new workflow to bring my characters – Riley Harper and her dog,… pic.twitter.com/SMgjDnJWH1

— Martin Gent (@martgent) April 3, 2025

Tools used:

@OpenAI‘s ChatGPT 4o (Images)
@hedra_labs (Lipsync)
@elevenlabsio (Voice & Sound Effects)
@SunoMusic (Music & Lyrics – made with v3 six months ago)
@Kling_ai (Animation)
@higgsfield_ai (Animation)
@ideogram_ai (Title Lockup)
@topazlabs (Upscaling)

Severance, through the animated lens of ChatGPT

April 2, 2025AI/ML, GPT-4ojnack

People can talk all the smack they want about “AI slop”—and to be sure, there’s tons of soulless slop going around—but good luck convincing me that there’s no creativity in remixing visual idioms, and in reskinning the world in never-before-possible ways. We’re just now dipping a toe into this new ocean.

ChatGPT 4o’s new image gen is insane. Here’s what Severance would look like in 8 famous animation styles

1/8:
Rankin/Bass – That nostalgic stop-motion look like Rudolph the Red-Nosed Reindeer. Cozy and janky. pic.twitter.com/5rFL8SGttS

— Bennett Waisbren (@BennettWaisbren) March 27, 2025

See the whole thread for a range of fun examples:

4/8:
Pixar – Clean, subtle facial animation, warm lighting, and impeccable shot composition. pic.twitter.com/FNWgPccHcI

— Bennett Waisbren (@BennettWaisbren) March 27, 2025

OMG AI KFC

April 1, 2025AI/MLjnack

It’s insane what a single creator—in this case David Blagojević—can do with AI tools; insane.

I’m blown away!

This KFC concept ad is 100% AI generated!

My friend David Blagojevic (he’s not on X) created this ad concept for KFC and it’s incredible!

Tools used: Runway, Pika, Kling AI, Google DeepMind Veo2, Luma AI, OpenAI Sora, upscaled with Topaz Labs and music… pic.twitter.com/u9ics8M51x

— Salma (@Salmaaboukarr) March 31, 2025

It’s worth noting that creative synthesis like this doesn’t “just happen,” much less in some way that replaces or devalues the human perspective & taste at the heart of the process: everything still hinges on having an artistic eye, a wealth of long-cultivated taste, and the willpower to make one’s vision real. It’s just that the distance between that vision & reality is now radically shorter than it’s ever been.

New generative video hotness: Runway + Higgsfield

March 31, 2025AI/MLjnack

It’s funny to think of anyone & anything as being an “O.G.” in the generative space—but having been around for the last several years, Runway has as solid a claim as anyone. They’ve just dropped their Gen-4 model. Check out some amazing examples of character consistency & camera control:

Today we’re introducing Gen-4, our new series of state-of-the-art AI models for media generation and world consistency. Gen-4 is a significant step forward for fidelity, dynamic motion and controllability in generative media.

Gen-4 Image-to-Video is rolling out today to all paid… pic.twitter.com/VKnY5pWC8X

— Runway (@runwayml) March 31, 2025

Here’s just one of what I imagine will be a million impressive uses of the tech:

First test with @runwayml‘s Gen-4 early access!

First impressions: I am very impressed! 10 second generations, and this is the only model that could do falling backwards off a cliff. Love it! pic.twitter.com/GZS1B7Wpq0

— Christopher Fryant (@cfryant) March 31, 2025

Meanwhile Higgsfield (of which I hadn’t heard before now) promises “AI video with swagger.” (Note: reel contains occasionally gory edgelord imagery.)

Now, AI video doesn’t have to feel lifeless.

This is Higgsfield AI: cinematic shots with bullet time, super dollies and robo arms — all from a single image.

It’s AI video with swagger.

Built for creators who move culture, not just pixels. pic.twitter.com/dJdQ978Jqd

— Higgsfield AI (@higgsfield_ai) March 31, 2025

Fun with empowering existential dread

March 30, 2025AI/MLjnack

It’s so good, it’s bad! 😀

Currently asking ChatGPT for faux-German words like
Überintelligenzchatbotrichtigkeitsahnungsscham:

“The bizarre cocktail of joy, panic, and existential dread a product manager experiences when an AI answers a tough product question better than they could.” pic.twitter.com/yVZVdoKZi9

— John Nack (@jnack) March 30, 2025

Virtual product photography in ChatGPT

March 29, 2025AI/ML, GPT-4ojnack

Seeing this, I truly hope that Adobe isn’t as missing in action as they seem to be; fingers crossed.

In the meantime, simply uploading a pair of images & a simple prompt is more than enough to get some compelling results. See subsequent posts in the thread for details, including notes on some shortcomings I observed.

A quick test of ChatGPT virtual product photography, combining real shoes with a quick render from @krea_ai/@bfl_ml Flux:
“Please put these shoes into the image of the basketball court, held aloft in the foreground by a man’s hand.” pic.twitter.com/k1AhTdHFcs

— John Nack (@jnack) March 28, 2025

See also (one of a million tests being done in parallel, I’m sure):

Still experimenting with chatgpt4o

prompt: “model wearing cap provided”

not bad pic.twitter.com/FObSXeyxOS

— Salma (@Salmaaboukarr) March 26, 2025

Depression-era Ghibli

March 27, 2025AI/MLjnack

We’re speed-running our way through the novelty->saturation->nausea cycle of Studio Ghibli-style meme creation, but I find this idea fresher: turn Ghibli characters into Dorothea Lange-style photos:

https://t.co/gl8U6x9Q8m

— Sterling Crispin (@sterlingcrispin) March 26, 2025

Ideogram 3.0 is here

March 27, 2025AI/MLjnack

In the first three workdays of this week, we saw three new text-to-image models arrive! And now that it’s Thursday, I’m like, “WTF, no new Flux/Runway/etc.?” 🙂

For the last half-year or so, Ideogram has been my go-to model (see some of my more interesting creations), so I’m naturally delighted to see them moving things forward with the new 3.0 model:

I don’t yet quite understand the details of how their style-reference feature will work, but I’m excited to dig in.

Meanwhile, here’s a thread of some really impressive initial creations from the community:

We launched Ideogram 3.0 just three hours ago, and we’ve already seen an incredible wave of striking images. Here are 16 of our favorites so far:

1/ @krampus76 pic.twitter.com/tbwfMfkvg5

— Ideogram (@ideogram_ai) March 26, 2025

LegoGPT

March 26, 2025AI/MLjnack

The family that bricks together, sticks together? 🙂

Worked great on this group shot as well—with the exception of disappearing one cousin! pic.twitter.com/pZzXfPurv3

— John Nack (@jnack) March 26, 2025

ChatGPT reimagines family photos

March 26, 2025AI/ML, GPT-4o, Illustrationjnack

“Dress Your Family in Corduroy and Denim” — David Sedaris
“Turn your fam into Minecraft & GTA” — Bilawal Sidhu

Entire ComfyUI workflows just became a text prompt.

Open an image in GPT-4o and type “turn us into Roblox / GTA-3 /Minecraft / Studio Ghibli characters” pic.twitter.com/rCXclZklq5

— Bilawal Sidhu (@bilawalsidhu) March 26, 2025

And meanwhile, on the server side:

ChatGPT when another Studio Ghibli request comes in pic.twitter.com/NF5sy24GlU

— Justine Moore (@venturetwins) March 26, 2025

Google’s “Photoshop Killer”?

March 23, 2025AI/MLjnack

Nearly twenty years ago (!), I wrote here about how The Killing’s Gotta Stop—ironically, perhaps, about then-new Microsoft apps competing with Adobe. I rejected false, zero-sum framing then, and I reject it now.

Having said that, my buddy Bilawal’s provocative framing in this video gets at something important: if Adobe doesn’t get on its game, actually delivering the conversational editing capabilities we publicly previewed 2+ years ago, things are gonna get bad. I’m reminded of the axiom that “AI will not replace you, but someone using AI just might.” The same goes for venerable old Photoshop competing against AI-infused & AI-first tools.

In any case, if you’re interested in the current state of the art around conversational editing (due to be different within weeks, of course!), I think you’ll enjoy this deep dive into what is—and isn’t—possible via Gemini:

Specific topic sections, if you want to jump right to ’em:

00:00 Conversational Editing with Google’s Multimodal AI
00:53 Image Generation w/ LLM World Knowledge
02:12 Easy Image Editing & Colorization
02:46 Advanced Conversational Edits (Chaining Prompts Together)
03:37 Long Text Generation (Google Beats OpenAI To The Punch)
04:25 Making Spicy Memes (Google AI Studio Safety Settings)
05:48 Advanced Prompting (One Shot ComfyUI Workflows)
07:19 Re-posing Characters (While Keeping Likeness Intact)
08:27 Spatial 3D Understanding (NO ControlNet)
10:42 Semantic Editing & In/Out Painting
13:46 Sprite Sheets & Animation Keyframes
14:40 Using Gemini To Build Image Editing Apps
16:37 Making Videos w/ Conversational Editing

Happy birthday, Adobe Firefly

March 21, 2025Adobe Firefly, AI/MLjnack

The old (hah! but it seems that way) gal turns two today.

The ride has been… interesting, hasn’t it? I remain eager to see what all the smart folks at Adobe have been cooking up. As a user of Photoshop et al. for the last 30+ years, I selfishly hope it’s great!

Welcome to the world, #AdobeFirefly! https://t.co/R92lBktZIQ

We have great stuff you can out try right now, plus so much brewing in the lab. Here’s a quick preview: pic.twitter.com/hIaW9EpMor

— John Nack (@jnack) March 21, 2023

In the meantime, I’ll admit that watching the video above—which I wrote & then made with the help of Davis Brown (son of Russell)—makes me kinda blue. Everything it depicts was based on real code we had working at the time. (I insisted that we not show anything that we didn’t think we could have shipping within three months’ time.) How much of that has ever gotten into users’ hands?

Yeah.

But as I say, I’m hoping and rooting for the best. My loyalty has never been to Adobe or to any other made-up entity, but rather to the spirit & practice of human creativity. Always will be, until they drag me off this rock. Rock the F on.

Adobe to offer access to non-Firefly models

March 20, 2025Adobe Firefly, AI/MLjnack

Man, I’m old enough to remember writing a doc called “Yes, And…” immediately upon the launch of DALL•E in 2022, arguing that of course Adobe should develop its own generative models and of course it should also offer customers a choice of great third-party models—because of course no single model would be the best for every user in every situation.

And I’m old enough to remember being derided for just not Getting It™ about how selling per-use access to Firefly was going to be a goldmine, so of course we wouldn’t offer users a choice. ¯\_(ツ)_/¯

Oh well. Here we are, exactly two years after the launch of Firefly, and Adobe is going to offer access to third-party models. So… yay!

Even more news today! We are expanding our footprint in the @Adobe ecosystem to offer more choice to their creators pic.twitter.com/A4tHRkb25h

— Black Forest Labs (@bfl_ml) March 19, 2025