- Realtime:
- Oh look, I’m George Clooney! Kinda. You can be, too. FAL AI promises “AI inference faster than you can type.”
- “100ms image generation at 1024×1024. Announcing Segmind-Vega and Segmind-VegaRT, the fastest and smallest, open source models for image generation at the highest resolution.”
- Krea has announced their open beta, “free for everyone.”
- How incredible would it be to have realtime generative brushes like this?
- Drawing to Video, made using Vizcom -> Leonardo -> Pika.
- 3D generation:
- ByteDance has released ImageDream (image to 3D)
- SceneWiz3D offers “A new approach to create high-fidelity 3D scenes from text and 3D object control”
- Image -> depth -> geometry using Marigold + Blender
- 3D for fashion, sculpting, and more:
- This is what Adobe Substance & a notional 3D mode of Firefly Text-to-Image should feel like.
- Outfit Anyone + Animate Anyone = virtual try-on + movement.
- Sculpting/rendering via Adobe Substance 3D Modeler + Dreams + Unbound + Krea.
- AnimateDiff v3 was just released.
- Instagram has enabled image generation inside chat (pretty “meh,” in my experience so far) and in Stories creation: “It allows you to replace a background of an image into whatever AI generated image you’d like.”
- “Did you know that you can train an AI Art model and get paid every time someone uses it? That’s Generaitiv’s Model Royalties System for you.”
How-to: Combining Photoshop + ComfyUI
It’s a little nerdy even for my blood, but some of my teammates swear by these techniques for connecting Photoshop to a hosted instance of Stable Diffusion, letting you guide the process via a Photoshop doc and/or custom-trained styles:
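For the curious, here’s a minimal sketch of the round trip these plugins make under the hood, assuming a ComfyUI instance listening at 127.0.0.1:8188 and a workflow you’ve exported via ComfyUI’s “Save (API Format)” option; the file name, node ID, and prompt text below are just placeholders:

```python
import json
import urllib.request

# Hypothetical workflow exported from ComfyUI via "Save (API Format)"
with open("photoshop_img2img_workflow.json") as f:
    workflow = json.load(f)

# Tweak the positive-prompt node before queueing. Node "6" is whatever
# your exported graph uses -- inspect the JSON to find the right ID.
workflow["6"]["inputs"]["text"] = "watercolor illustration, soft light"

# Queue the job on the hosted ComfyUI instance.
req = urllib.request.Request(
    "http://127.0.0.1:8188/prompt",
    data=json.dumps({"prompt": workflow}).encode("utf-8"),
    headers={"Content-Type": "application/json"},
)
with urllib.request.urlopen(req) as resp:
    print(json.loads(resp.read()))  # e.g. {"prompt_id": "...", ...}
```

The Photoshop connectors linked above essentially automate this loop: export the active doc, post it to the server, and pull the generated result back in as a new layer.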
“I Draw Better Than AI!”
Hah—I can dig this finger-rich pin from Pictoplasma.

To the moon! Insta360 makes a satellite
What if your tiny planet—a visual genre I’ve enjoyed beating halfway into the ground—were our actual planet? Insta360, on whom I’ve spent crazy amounts of money buying brilliant-if-maddening gear, has now sent their devices to the edge of space:

Catch up on great new Illustrator features in 60 seconds
Let’s talk vector generation, 3D mockup support, image-to-type, and more. Take ‘er away, Deke!
Promising 3D research from Adobe
AI image generation is getting *crazy* fast
Gemini is bonkers
I mean, seriously, what even is all this?? I can’t explain; just please watch.
- 0:00 Intro
- 0:19 Multimodal Dialogue
- 1:32 Multilinguality
- 2:04 Game Creation
- 2:31 Visual Puzzles
- 3:17 Making Connections
- 3:39 Image & Text Generation
- 4:06 Logic & Spatial Reasoning
- 4:55 Translating Visuals
- 5:27 Cultural Understanding
Baby, You Can Drive My Bricks
I’ve had way too much fun creating custom Lego sets based on friends’ & family’s rides, so to help others do it, I’ve made my first custom GPT, “Baby You Can Drive My Bricks.” Take it for a spin & let me know what you create!

Pika Labs “Idea-to-Video” looks stunning
It’s ludicrous to think that these folks formed the company just six months ago, and even more ludicrous to see what the model can already do—from video synthesis, to image animation, to inpainting/outpainting:
Our vision for Pika is to enable everyone to be the director of their own stories and to bring out the creator in each of us. Today, we reached a milestone that brings us closer to our vision. We are thrilled to unveil Pika 1.0, a major product upgrade that includes a new AI model capable of generating and editing videos in diverse styles such as 3D animation, anime, cartoon and cinematic, and a new web experience that makes it easier to use. You can join the waitlist for Pika 1.0 at https://pika.art.
“Emu Edit” enables instructional image editing
This tech—or something much like it—is going to be a BFD. Imagine simply describing the change you’d like to see in your image—and then seeing it.
[Generative models] still face limitations when it comes to offering precise control. That’s why we’re introducing Emu Edit, a novel approach that aims to streamline various image manipulation tasks and bring enhanced capabilities and precision to image editing.
Emu Edit is capable of free-form editing through instructions, encompassing tasks such as local and global editing, removing and adding a background, color and geometry transformations, detection and segmentation, and more. […]
Emu Edit precisely follows instructions, ensuring that pixels in the input image unrelated to the instructions remain untouched. For instance, when adding the text “Aloha!” to a baseball cap, the cap itself should remain unchanged.
And for some conceptually related (but technically distinct) ideas, see previous: Iterative creation with ChatGPT.
“We’re on a mission from God…”
On the off chance you missed me over the last week or so, it’s due to my being off in Illinois with the fam, having fun making silliness like this:
NBA goes NeRF
Here’s a great look at how the scrappy team behind Luma.ai has helped enable beautiful volumetric captures of Phoenix Suns players soaring through the air:
Go behind the scenes of the innovative collaboration between Profectum Media and the Phoenix Suns to discover how we overcame technological and creative challenges to produce the first 3D bullet time neural radiance field NeRF effect in a major sports NBA arena video. This involved not just custom-building a 48 GoPro multi-cam volumetric rig but also integrating advanced AI tools from Luma AI to capture athletes in stunning, frozen-in-time 3D visual sequences. This venture is more than just a glimpse behind the scenes – it’s a peek into the evolving world of sports entertainment and the future of spatial capture.
Phat Splats
If you keep hearing about “Gaussian Splatting” & wondering “WTAF,” check out this nice primer from my buddy Bilawal:
There’s also Two-Minute Papers, offering a characteristically charming & accessible overview:
GenAI demos from Russell Brown
It’s always great to learn from the master—especially when he’s making “spaghetti western” literal!
- The power of selections with Generative Fill
- Create watercolors and other art styles with Generative Fill
- Manage the stacking order of Generative layers

Iterative creation with ChatGPT
I’m really digging the experience of (optionally) taking a photo, feeding it into ChatGPT, and then riffing my way towards an interesting visual outcome. Here’s a gallery in which you can see some of the journeys I’ve undertaken recently.
- Image->description->image quality is often pretty hit-or-miss. Even so, it’s such a compelling possibility that I keep wanting to try it (e.g. seeing a leaf on the ground, wanting to try turning it into a stingray; see the sketch after this list).
- The system attempts to maintain various image properties (e.g. pose, color, style) while varying others (e.g. turning the attached vehicle from a box truck to a tanker while maintaining its general orientation plus specifics like featuring three Holstein cows).
- Overall text creation is vastly improved vs. previous models, though it can still derail. It’s striking that one can iteratively improve a particular line of text (e.g. “Make sure that the second line says ‘TRAIN’”).
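For the API-curious, that image->description->image loop can be approximated outside ChatGPT, too. Here’s a minimal sketch using OpenAI’s Python SDK; the model names, image URL, and leaf-to-stingray prompt are just illustrative, and results will be exactly as hit-or-miss as noted above:

```python
from openai import OpenAI

client = OpenAI()  # assumes OPENAI_API_KEY is set in your environment

# Step 1: have a vision-capable model describe the source photo.
description = client.chat.completions.create(
    model="gpt-4o",  # any vision-capable model works here
    messages=[{
        "role": "user",
        "content": [
            {"type": "text",
             "text": "Describe this object's shape, color, and texture."},
            {"type": "image_url",
             "image_url": {"url": "https://example.com/leaf.jpg"}},  # placeholder
        ],
    }],
).choices[0].message.content

# Step 2: riff on that description when generating the new image.
result = client.images.generate(
    model="dall-e-3",
    prompt=f"A stingray with the shape, color, and texture of: {description}",
    size="1024x1024",
)
print(result.data[0].url)
```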


GenFill vs. eternal dog-pant mysteries
Hah! This is my kind of ridiculous Adobe social content. 🙂 Happy Friday.
The Young & The Spiderverse
Man, I’m inspired—and TBH a little jealous—watching 14-year-old creator Preston Mutanga make amazing 3D animations, as he’s apparently been doing for fully half his life. I think you’ll enjoy the short talk he gave covering his passions:
The presentation will take the audience on a journey, a journey across the Spider-Verse where a self-taught, young, talented 14-year-old kid used Blender, to create high-quality LEGO animations of movie trailers. Through the use of social media, this young artist’s passion and skill caught the attention of Hollywood producers, leading to a life-changing invitation to animate in a new Hollywood movie.
Hands up for Res Up ⬆️
Speaking of increasing resolution, check out this sneak peek from Adobe MAX:
It’s a video upscaling tool that uses diffusion-based technology and artificial intelligence to convert low-resolution videos to high-resolution videos for applications. Users can directly upscale low-resolution videos to high resolution. They can also zoom-in and crop videos and upscale them to full resolution with high-fidelity visual details and temporal consistency. This is great for those looking to bring new life into older videos or to prevent blurry videos when playing scaled versions on HD screens.
Adventures in Upsampling
Interesting recent finds:
- Google Zoom Enhance. “Using generative AI, Zoom Enhance intelligently fills in the gaps between pixels and predicts fine details, opening up more possibilities when it comes to framing and flexibility to focus on the most important part of your photo.”
- Nick St. Pierre writes, “I just upscaled an image in MJ by 4x, then used Topaz Photo AI to upscale that by another 6x. The final image is 682MP and 32000×21333 pixels large.” (See the quick math after this list.)
- Here’s a thread of 10 Midjourney upsampling examples, including a direct comparison against Topaz.
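Since upscale factors compound per axis, and pixel counts grow with the square, here’s the quick math behind Nick’s numbers:

```python
# 4x in Midjourney, then 6x in Topaz: 24x per axis, 576x the pixel count.
scale = 4 * 6
print(32000 / scale, 21333 / scale)   # ~1333 x 889: the plausible original size

final_mp = 32000 * 21333 / 1e6
print(final_mp)                        # 682.656, i.e. the quoted 682MP
```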
Demos: Photoshop Generative AI tips
Demos: Using Generative AI in Illustrator
If you’ve been sleeping on Text to Vector, check out this handful of quick how-to vids that’ll get you up to speed:
- Welcome to Generative AI in Illustrator
- Generate artwork from text with Text to Vector Graphic (Beta)
- Explore creating stunning patterns with Text to Vector Graphics
- Tips for making your best artwork with Text to Vector Graphic (Beta)
- Tips: Take Your Text to Vector Graphic (Beta) patterns to “Wow!”
- Tip: Control your pattern color with Text to Vector Graphic (Beta)
Ai + AI FTW
Check out this quick demo of Illustrator’s new text-to-vector & mockup tools working together:
AI generated Logos onto any surface. pic.twitter.com/qY4tEkVK0Q
— Riley Brown (@rileybrown_ai) October 29, 2023
360º AI: Skybox adds new sketching & style features
Directly sketch inside a 360º canvas, then generate results:
And see also the styles these folks are working to bring online:
Sneak peek: Project Glyph Ease
Easy as ABC, 123?
Project Glyph Ease uses generative AI to create stylized and customized letters in vector format, which can later be used and edited. All a designer needs to do is create three reference letters in a chosen style from existing vector shapes or ones they hand draw on paper, and this technology automatically creates the remaining letters in a consistent style. Once created, designers have flexibility to edit the new font since the letters will appear as live text that can be scaled, rotated or moved in the project.
DreamCraft 2D->3D tech looks wild
Can you imagine something like this running in Photoshop, making it possible to re-pose objects and then merge them back into one’s scene?
Project Primrose: Animated fabric (!) from Adobe
The week before MAX, my teammate Christine had a bit of a cough, and folks were suddenly concerned about the Project Primrose sneak: it’d be awfully hard to swap out presenters when the demo surface is a bespoke dress made by & for exactly one person. Thankfully good health prevailed, and she was able to showcase Project Primrose:
Here’s a bit more info about the tech:
We propose reflective light-diffuser modules for non-emissive flexible display systems. Our system leverages reflective-backed polymer-dispersed liquid crystal (PDLC), an electroactive material commonly used in smart window applications. This low-power non-emissive material can be cut to any shape, and dynamically diffuses light. We present the design & fabrication of two exemplar artifacts, a canvas and a handbag, that use the reflective light-diffuser modules.
Come work on Firefly!
We’re looking to meet great PMs, engineers, data scientists, and more; come check out open roles!

Reflect on this: Project See Through burns through glare
Marc Levoy (professor emeritus at Stanford) was instrumental in delivering the revolutionary Night Sight mode on Pixel 3 phones—and by extension on all the phones that quickly copied their published techniques. After leaving Google for Adobe, he’s been leading a research team that’s just shown off the reflection-zapping Project See Through:
Today, it’s difficult or impossible to manually remove reflections. Project See Through simplifies the process of cleaning up reflections by using artificial intelligence. Reflections are automatically removed, and optionally saved as separate images for editing purposes. This gives users more control over when and how reflections appear in their photos.
A couple of fun Photoshop-After Effects collabs
Matthew Vandeputte used a mix of Generative Fill and Content-Aware Fill to make these rad little animations in After Effects:
[Via Tom Hightower]
New features come to Lightroom Classic
Lens Blur, HDR, Point Color, and more: Katrin Eismann breaks down the update in this overview, and Matt Kloskowski shows the features in action here:
Remembering Microsoft’s “Big-Ass Table”
TBH I still kinda want one. 🙂
“Boing!” New Google animation tech looks fun.
My old teammates Richard Tucker, Noah Snavely, and co. have been busy. Check out how their Generative Image Dynamics work makes it possible to interactively add small, periodic motion to photos:
My recent Firefly demos & previews
I got to spend time Friday live streaming with the Firefly community, showing off some of the new MAX announcements & talking about some of what might be coming down the line. I hope you enjoy it, and I’d welcome any feedback on this session or on what you’d like to see in the future.
Check out Illustrator’s new font-identifying Retype feature
Short & sweet:
Adobe Illustrator has this feature called Retype (beta). With it you can select an image in Illustrator and enter Retype (beta) to determine the fonts that were used (at least close matches) in the JPG! It will also do the same for text that has been outlined. It’s amazing!
One sketchy vector erector: Project Draw & Delight
This tech lets you augment text-based instructions with visual hints, such as rough sketches and paint strokes. Draw & Delight then uses Firefly to generate high-quality vector illustrations or animations in various color palettes, style variations, poses, and backgrounds.
Check out my MAX talk on the potential of Generative AI in education
I got to spend 30 minutes chatting with educator & author Matt Miller last week, riffing on some tough but important questions around weighty, fascinating stuff like what makes us human, what we value around creativity, and how we can all navigate the creative disruptions that surround us.

Hear how Adobe generative AI solutions are designed to continually evolve, develop, and empower educators and students from kindergarten to university level. Generative AI is expected to have a significant impact on the creativity of students. It has the potential to act as a powerful tool that can inspire and enhance the creative process by generating new and unique ideas. Join Matt Miller, author and educator, and John Nack, principal product manager at Adobe, for this exciting discussion.
In this session, you’ll:
- Learn how Adobe approaches generative AI
- Hear experts discuss how AI affects teaching and learning
- Discover how AI can make learning more personalized and accessible
What if 3D were actually approachable?
That’s the promise of Adobe’s Project Neo—which you can sign up to test & use now! Check out the awesome sneak peek they presented at MAX:
Incorporating 3D elements into 2D designs (infographics, posters, logos or even websites) can be difficult to master, and often requires designers to learn new workflows or technical skills.
Project Neo enables designers to create 2D content by using 3D shapes without having to learn traditional 3D creation tools and methods. This technology leverages the best of 3D principles so designers can create 2D shapes with one, two or three-point perspectives easily and quickly. Designers using this technology are also able to collaborate with their stakeholders and make edits to mockups at the vector level so they can quickly make changes to projects.
The wonderful miniatures of Henry Sugar
Wes Anderson & crew are back to making delightful miniature worlds, this time for “The Wonderful Story of Henry Sugar.” Enjoy three charming minutes, won’t you?
What’s even better than Generative Fill? GenFill that moves.
Back in the day, I dreaded demoing Photoshop ahead of the After Effects team: we’d do something cool, and they’d make that cool thing move. I hear echoes of that in Project Fast Fill—generative fill for video.
Project Fast Fill harnesses Generative Fill, powered by Adobe Firefly, to bring generative AI technology into video editing applications. This makes it easy for users to use simple text prompts to perform texture replacement in videos, even for complex surfaces and varying light conditions. Users can use this tool to edit an object on a single frame and that edit will automatically propagate into the rest of the video’s frames, saving video editors a significant amount of texture editing time.
Check it out:
Adobe Project Poseable: 3D humans guiding image generation
Roughly 1,000 years ago (i.e. this past April!), I gave an early sneak peek at the 3D-to-image work we’ve been doing around Firefly. Now at MAX, my teammate Yi Zhou has demonstrated some additional ways we could put the core tech to work—by adding posable humans to the scene.
Project Poseable makes it easy for anyone to quickly design 3D prototypes and storyboards in minutes with generative AI.
Instead of having to spend time editing the details of a scene — the background, different angles and poses of individual characters, or the way the character interacts with surrounding objects in the scene — users can tap into AI-based character posing models and use image generation models to easily render 3D character scenes.
Check it out:
Generative Match: It’s Pablos all the way down…
Here’s a fun little tutorial from my teammate Kris on using reference images to style your prompt (in this case, her pet turtle Pablo). And meanwhile, here’s a little gallery of good style reference images (courtesy of my fellow PM Lee) that you’re welcome to download and use in your creations.
Tutorial: How to generate images in different styles using reference image library in Adobe Firefly
It’s a new feature! Write a prompt and experiment with styles from the library or upload your own as a reference.
#AdobeMAX #CommunityxAdobe
Try here: https://t.co/c9g7CGiuBU pic.twitter.com/cSt3cmMNR3
— Kris Kashtanova (@icreatelife) October 12, 2023
Important protections for creators in Generative Match
I’m really happy & proud that Firefly now enables uploading your own images & mixing them into your creations. For months & months, this has been users’ number 1 feature request.
But with power comes responsibility, of course, and we’ve spent a lot of time thinking about ways to discourage misuse of the tech (i.e. how do we keep this from becoming a rip-off engine?). I’m glad to say that we’ve invested in some good guidelines & guardrails:
First, we require users to confirm they have the right to use any work that they upload to Generative Match as a reference image.
Second, if an image’s Content Credentials include tags indicating that the image shouldn’t be used as a style reference, users won’t be able to use it with Generative Match. We will be rolling out the ability to add these tags to assets as part of the Content Credentials framework within our flagship products.
Third, when a reference image is used to generate an asset, we save a thumbnail of the image to help ensure that the use of Generative Match meets our terms of service. We also note that a reference image was used in the asset’s Content Credentials. Storing the reference image provides an important dose of accountability.
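For the technically curious, that second guardrail presumably amounts to a manifest check at upload time. The sketch below assumes a Content Credentials (C2PA) manifest already parsed into a dict; the assertion label is purely hypothetical, since Adobe hasn’t published the actual tag name:

```python
def style_reference_allowed(manifest: dict) -> bool:
    """Check a parsed Content Credentials (C2PA) manifest for a
    do-not-use-as-style-reference tag. The assertion label below is
    hypothetical; the real tag name hasn't been published."""
    for assertion in manifest.get("assertions", []):
        if assertion.get("label") == "adobe.style_reference":  # hypothetical label
            if assertion.get("data", {}).get("use") == "notAllowed":
                return False
    return True

# Usage sketch: reject the upload before it ever reaches the model.
# if not style_reference_allowed(manifest):
#     raise PermissionError("Content Credentials forbid style-reference use.")
```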
To be clear, these protections are just first steps, and we plan to do more to strengthen protections. In the meantime, your feedback is most welcome!

Introducing Firefly Text to Vector, plus many Illustrator enhancements
I’m delighted to say that the first Firefly Vector model is now available (as a beta—feedback welcome!) in Illustrator. Just download your copy to get started. Here’s a quick tour:
And more generally, it’s just one of numerous enhancements now landing in Illustrator. Check ’em out:
Introducing Generative Match in Firefly
Hey everyone—I’m just back from Adobe MAX, and hopefully my blog is back from some WordPress database shenanigans that have kept me from posting.
I don’t know what the site will enable right now, so I’ll start by simply pointing to a great 30-second tour of my favorite new feature in Firefly, Generative Match. It enables you to upload your own image as a style reference, or to pick one that Adobe provides, and mix it together with your prompt and other parameters.
You can then optionally share the resulting recipe (via “Copy link” in the Share menu that appears over results), complete with the image ingredient; try this example. This goes well beyond what one can do with just copying/pasting a prompt, and as we introduce more multimodal inputs (3D object, sketching, etc.), it’ll become all the more powerful.
All images below were generated with the following prompt: a studio portrait of a fluffy llama, hyperrealistic, shot on a white cyclorama + various style images:

Sneak peek: Object-Aware Editing Engine
Powered by Firefly, in development now:
Windows Copilot + Adobe Express
Google promises interactive creation of dynamic, looping videos
My old teammates Richard Tucker, Noah Snavely, and co. have been busy. Check out this quick video & interactive demo:
Excited to share our work on Generative Image Dynamics!
We learn a generative image-space prior for scene dynamics, which can turn a still photo into a seamless looping video or let you interact with objects in the picture. Check out the interactive demo: https://t.co/GLPBVpouJY pic.twitter.com/6h1Qq0kL2G
— Zhengqi Li (@zhengqi_li) September 15, 2023
80lv notes,
According to the team, they trained the prior using a dataset of motion trajectories extracted from real-life video sequences that featured natural, oscillating motions like those seen in trees, flowers, candles, and wind-blown clothing. These trajectories can then be applied to convert static images into smooth-looping dynamic videos, slow-motion clips, or interactive experiences that allow users to interact with the elements within the image.
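To make “oscillating motion trajectories” a bit more concrete: the idea is that every pixel gets a small periodic displacement over time, and each output frame is just the still image warped along those trajectories. Here’s a toy sketch with OpenCV; it’s emphatically not the paper’s learned model (which predicts per-pixel motion spectra from the photo itself), just a hand-rolled sinusoidal displacement field with made-up amplitude and frequency values:

```python
import cv2
import numpy as np

img = cv2.imread("still_photo.jpg")                # any still photo
h, w = img.shape[:2]
ys, xs = np.mgrid[0:h, 0:w].astype(np.float32)

# Made-up motion field: sway amplitude falls off toward the bottom of the
# frame, as if foliage up top moves while the ground stays put. The paper
# *learns* this per-pixel field from real videos instead.
amp = 3.0 * (1.0 - ys / h)                         # max sway in pixels (hypothetical)
freq = 0.5                                         # oscillations per second (hypothetical)

frames = []
for t in np.linspace(0.0, 1.0 / freq, 60, endpoint=False):  # one period = seamless loop
    dx = amp * np.float32(np.sin(2 * np.pi * freq * t))     # periodic horizontal shift
    warped = cv2.remap(img, xs + dx, ys,
                       interpolation=cv2.INTER_LINEAR,
                       borderMode=cv2.BORDER_REFLECT)
    frames.append(warped)
# Write `frames` out with cv2.VideoWriter (or imageio) for a looping clip.
```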
Photoshop introduces Parametric Filters
Man, if you feel like you can’t keep up with technology while you do your day job, just please know that the same is true even at the company where I work, regarding my old app. At least we have smart folks like Deke McClelland to show us what’s been happening:
“Sky Dachshunds!” The future of creativity?
Here are four minutes that I promise you won’t regret spending as Nathan Shipley demonstrates DALL·E 3 working inside ChatGPT to build up an entire visual world:
I mean, seriously, the demo runs through creating:
- Ideas
- Initial visuals
- Logos
- Apparel featuring the logos
- Game art
- Box copy
- Games visualized in multiple styles
- 3D action figures
- and more.
Insane. Also charming: its extremely human inability to reliably spell “Dachshund!”

