Category Archives: AI/ML

Fun with Omni: Changing cams, plus Wolverine

July 6, 2026AI/ML, Google Omnijnack

Karen X. Cheng shows a subtle but powerful model capability:

View this post on Instagram

A post shared by Karen X (@karenxcheng)

Meanwhile Christian Cantrell is looking sharp, is not downright superheroic:

What I thought would be a quick test of Gemini Omni video editing in https://t.co/hmRbEam8vP turned out to be a pretty wild study in agentic creative processes. (1 of 3) pic.twitter.com/44HptL00HC

— Christian Cantrell (@cantrell) July 6, 2026

Come build with Gemini Omni Flash!

July 5, 2026AI/ML, Google Omnijnack

I’m thrilled to say that the first big launch of my Chapter 2 at Google is here! You can now build on Gemini Omni Flash in Google Enterprise Agent Platform (aka Vertex); see docs.

TBH I was so busy helping get the release out the door, and then taking some much needed rest over the Fourth of July break, that I’ve hardly had a chance to post useful info. I’ll fix that soon! In the meantime, here’s our little intro sizzle reel:

Gemini Omni: Our new model is a leap forward in world understanding, multimodality, and editing—letting you generate any output from any input, starting with video.

Coming soon to developers and enterprise customers via the Gemini API and the Gemini Enterprise Agent Platform… pic.twitter.com/af9oElAODp

— Google Cloud (@googlecloud) June 2, 2026

Of beat labs & photo shoots

June 25, 2026AI/ML, Photographyjnack

It’s always cool to see how creators are embracing new tools:

Flow Music and Believe bring next-gen tools to artists: “We’re teaming up with the global artist development company Believe to bring Google Flow Music and Lyria 3 Pro directly to artists.”
Accelerating fashion photo production with Gemini 3 Pro Image: “AddGlow helps fashion brands and retailers bypass the logistical bottlenecks of traditional photoshoots, providing a retail-specific dashboard specifically designed for enterprise-scale creative production.”

Quick tour: Creating Flow tools with natural language

June 23, 2026AI/ML, Google Flowjnack

My teammate Anika & I got to meet the other day with a really big creative brand the other day (more details to share soon, I hope), and they got excited about delivering super focused, relevant experiences for their customers by building on the Flow agent & apps. Here’s Anika offering a concise tour of how to create, share, and remix the latter:

With Google Flow Tools, you can now use natural language to create bespoke tools and workflows.

In this video, you’ll see how you can:

Explore and try the various Tools within Google Flow
Remix a tool to make it even more relevant for your unique workflow
Create your… pic.twitter.com/uW7hY3Sdfd

— Google Flow (@FlowbyGoogle) June 18, 2026

“Google Just Turned Street View Into a Video Game”

June 21, 20263D, AI/MLjnack

As Bilawal puts it,

At Google I/O 2026, DeepMind shipped Maps Imagery Grounding for Genie 3 — their real-time world model can now generate interactive 3D worlds conditioned to any of the 280 billion Street View images Google has captured over 20 years. Pick a location on Google Maps, choose a style, drop in a character, and walk around.

Check out his accessible & illuminating tour of the new tech:

Aleph 2.0 uses Nano Banana for precise video transformation

June 17, 2026AI/MLjnack

Wonder Twin powers, activate:

Introducing Runway Aleph 2.0.

Edit videos using AI, keeping camera and motion all consistent.

Select specific frames in your video to reprompt with GPT Images 2.0 or Nano Banana 2, then apply it to the entire video.

Here’s my full tutorial: pic.twitter.com/csCZ8KLal1

— Jerrod Lew (@jerrod_lew) May 27, 2026

Omni Teapot

June 15, 20263D, AI/ML, Google Omnijnack

My 16yo is lowkey impressed that at Adobe I got to work with Utah Teapot creator Martin Newell. At this point, anything that impresses a teen is very welcome. 🙂

I wonder what he’d think of Gemini Omni turning real teapots into geometry just by saying the word:

Relighting video with Omni in Flow

June 7, 2026AI/ML, Google Flow, Google Omni, Relightingjnack

Sometimes it’s the seemingly simplest applications of tech that can be the most repeatably powerful. Here’s a quick demo of using a simple sketch of a lighting layout to direct Google Omni Flow in relighting an in-studio video:

Looks like you can relight a scene according to a top-down lighting diagram. @FlowbyGoogle pic.twitter.com/bVV280pM7F

— László Gaál (@laszlogaal_) May 22, 2026

Puppetry + AI FTW: Behind the Scenes with Timmy TPU

June 6, 2026AI/ML, Design, Google Omnijnack

I love the blend old-school puppetry, 3D animation, Gemini Omni, and the latest experimental video tools that went into creating TPU Training Day, the short film that debuted during Google I/O 2026.

I know you’ve heard it a million times, but it bears repeating: AI isn’t a substitute for human creativity, or in many cases even for traditional techniques. It’s just a whole new toolbox that can multiply our expressive powers.

And here’s the film itself:

Check out Google Flow Agent

June 3, 2026AI/ML, Google Flow, Google Omnijnack

Did I have “Google makes cool, extensible, AI-powered creative tools” on my 2026 Bingo card? I did not—and I’m happy to be wrong! Check this out:

Introducing Google Flow Agent

Google Flow Agent can help you plan and reason through complex creative tasks with your inputs, all while under your control.

It’s built with Gemini models and brings a deep understanding of your project to help with everything from early… pic.twitter.com/e1qnNNuFYh

— Google (@Google) May 29, 2026

According to the docs, you can use the Agent to:

Brainstorm and plan: Chat with the Agent to outline storyboards, develop visual mood boards, and turn high-level concepts into actionable prompts.
Generate new media: Ask the Agent to generate videos or images and select the best model to generate with.
Edit assets directly: Ask the Agent to edit selected media from your project.
Batch generate: Ask the Agent to create multiple variations of an asset at once.
Organize your assets: Ask the Agent to rename specific files, group selected media into a new Collection, or archive unused assets.
Add context & references: Drag media into the Agent prompt box from your device or project. You can also select multiple assets and let the agent know which media you are referring to.

3D typography using Omni + Flow

May 31, 20263D, AI/ML, Google Flow, Google Omni, Typographyjnack

Check out this cool little technique:

View this post on Instagram

A post shared by Ariane Gebhardt (@ariane.gif)

This is especially wild when you consider where typography stood just a couple of years ago—for which I’ll forever be kinda nostalgic. 🙂

Nano Banana & Flux are great & all, but I legit miss when DALL•E was gakked up on mescaline. pic.twitter.com/JBxPanywoF

— John Nack (@jnack) January 14, 2026

A beautiful moment of expressivity unlocked

May 30, 2026AI/ML, Shit That Actually Mattersjnack

Despite—or maybe because—of my line of work, I have some genuinely mixed emotions about AI. Is it about empowerment, devaluation, theft, magic? Yes. It is all, as my wife would say of me, A Lot™.

Alongside whatever else it may be, however, the tech can be a genuine enabler of human expressivity. If you don’t believe me, just take 90 seconds to read & watch this heartfelt moment:

Last year, I went to Mexico City and taught AI tools to a small group of filmmakers and artists.

My favorite moment in was when I walked away from the group and came back to find this artist crying because she had finally been able to visualize a story she wanted to produce… pic.twitter.com/WCGnQPyk0B

— Minh Do (@minhsmind) May 30, 2026

Vibe-code your own VFX apps & more Google Flow Tools

May 29, 2026AI/ML, Google Flow, Google Omnijnack

Democratize all the apps!!

I think this new platform will be a major sleeper hit:

With Google Flow Tools, you can build creative workflows customized to fit your creative process. Explore a gallery of premade Tools built by creatives, remix existing ones to fit your needs, or create your own from scratch by just typing a description of what you want to create. You can shape and iterate on these Tools fluidly, adjusting them for individual projects, or singular clips and images. All users can explore Tools in Google Flow, and Google AI subscribers can create custom Tools from scratch or remix existing ones.

Check out the ways some artists have been spinning up tools & putting them to work:

“Mastering Gemini Omni: The Ultimate Video Prompting Guide”

May 27, 2026AI/ML, Google Omnijnack

From nailing text rendering to camera control, these tips should help you get the best possible results.

Google Earth + Omni = Drone magic

May 27, 20263D, AI/ML, Google Omnijnack

My friend Bilawal, who used to work in Google’s Geo group (Earth, Maps, and more), has created an eye-popping faux-drone video using Omni Flash:

Gave google omni a sketched camera path and asked it to generate drone POV footage. pic.twitter.com/cQZFMtOkEi

— Bilawal Sidhu (@bilawalsidhu) May 26, 2026

Here’s another exploration, inspired by Bilawal’s:

けっこうヤバかった https://t.co/Doi85TMvzf pic.twitter.com/jLX9T83ryS

— KEETY｜AIクリエイター (@KEETY2591756) May 27, 2026

Google Omni preserves marital harmony

May 21, 2026AI/ML, Google Omnijnack

The Flavawagon rides again—emotionally safely! 😀

Context: 25 years ago she helped me flame the original Flav. https://t.co/dxukPMwd3q pic.twitter.com/oLBd3t4E7T

— John Nack (@jnack) May 19, 2026

Gemini Omni in a nutshell

May 20, 2026AI/ML, Google Omnijnack

Here’s a great little one-minute explainer, featuring a couple of fun examples I hadn’t yet seen:

Awesome examples of Omni video transformation

May 20, 2026AI/ML, Google Omnijnack

This is such a wild, game-changing feature:

Gemini Omni is a major leap in world understanding & multimodal editing! It can take photos, video & audio and build entirely new scenes. Over time it’ll be able to handle any input & any output – starting w/ video

You can even give it your own videos & iterate on your ideas: pic.twitter.com/VrHPJKRJXH

— Demis Hassabis (@demishassabis) May 19, 2026

I think Carlos gets it exactly right: “I think many are focusing on the wrong aspect of the Gemini Omni model when comparing it to Seedance 2.0, since conceptually they are entirely different things. This is a model for editing videos (like Nano Banana) like we’ve never had before!“

Creo que muchos están enfocando mal el modelo de Gemini Omni al compararlo con Seedance 2.0 cuando conceptualmente son cosas distintas.

Este es un modelo para editar vídeos (a la Nano Banana) como nunca antes habíamos tenido! pic.twitter.com/eaqngSnbCD

— Carlos Santana (@DotCSV) May 19, 2026

i don’t think you understand how insane omni is pic.twitter.com/wvj4B4a59B

— Sam Sheffer (@samsheffer) May 19, 2026

Editing videos is where Gemini Omni Flash really shines. It is so incredibly capable.

> Make it New Year’s Eve with fireworks. Update the clock

London launched the fireworks early. https://t.co/cTGMPbT3tZ pic.twitter.com/c3Kh1y2KO5

— fofr (@fofrAI) May 19, 2026

I need Omni fireflies in @AdobeFirefly (Yo dawg…) https://t.co/BqvcTPo905

— John Nack (@jnack) May 19, 2026

“Nano Banana for video” is here!

May 19, 2026AI/ML, Google Omni, Nano Bananajnack

I’m so pleased to be playing a very small role in bringing breakthrough video transformation to the world. Check out the new Gemini Omni:

The team writes,

We’re introducing Gemini Omni, where Gemini’s ability to reason meets the ability to create. Omni is our new model that can create anything from any input — starting with video. With Omni, you can combine images, audio, video and text as input and generate high-quality videos grounded in Gemini’s real-world knowledge. You can also easily edit your videos through conversation.

Today, we’re rolling out the first model in the Omni family: Gemini Omni Flash, to the Gemini app, Google Flow and YouTube Shorts. In time we will support output modalities like image and audio.

Conversational video editing is the real breakthrough:

Check it out & let us know what you think!

Putting in the mental reps

May 18, 2026AI/ML, Idle Philosophizingjnack

I keep finding myself thinking of this observation from Paul Graham:

“In preindustrial times most people’s jobs made them strong. Now if you want to be strong, you work out. So there are still strong people, but only those who choose to be. It will be the same with writing. There will still be smart people, but only those who choose to be.“

To reiterate from a previous post, quoting Keep the Robots Out of the Gym:

Think very carefully about where you get help from AI.

I think of it as Job vs. Gym.

If we’re working a manual labor job, it’s fine to have AI lift heavy things for us because the actual goal is to move the thing, not to lift it.

This is the exact opposite of going to the gym, where the goal is to lift the weight, not to move it.

He argues for identifying gym tasks (e.g. critical thinking, problem solving), and for those use just your brain (with minimal AI assistance, if any).

My primary metric for this is whether or not I am getting sharper at the skills that are closest to my identity.

Try personalized image creation via Gemini

May 17, 2026AI/ML, Google Photos, Nano Bananajnack

As I often said back in the day, Google’s longstanding mission is to “organize the world’s information and make it useful.” A lot of that information is photographic, and a lot of that information is private; hence the value and power of Google Photos. It knows (with your blessings) who’s who, what places are important, and so on.

Now Nano Banana can leverage that info to make fun and beautiful things on your behalf.

Since you can already organize and label groups of people and pets in your library, those labels provide the context that Gemini needs to make your images feel truly yours…

With those labels in place, you can simply ask Gemini to “create a claymation image of me and my family enjoying our favorite activity” and Gemini can generate that specific image for you automatically. You can also experiment with different styles like watercolors, charcoal sketches or oil paintings. You can turn a quick idea into a custom creation, saving you the trouble of searching for, downloading and re-uploading files just to see a concept come to life.

Google Earth + Nano Banana? Go Go Godzilla!

May 13, 20263D, AI/ML, Nano Bananajnack

I love this kind of simple, scrappy creativity,:

Google Earth now allows importing ANY 3D model, so I used @tripoai on @fal to get Godzilla into Tokyo, and my Chrome extension to transform it into a movie scene pic.twitter.com/150iiy5UjX

— Blendi (@BlendiByl) May 13, 2026

Here’s the Chrome extension:

Capture any Google Earth 3D view
Transform with AI (Nano Banana Pro) into cinematic shots
Generate videos (Veo 3.1) with customizable duration and audio

GenFill + Vividon = Magic

May 12, 2026AI/ML, Photography, Relightingjnack

It’s insane what we can do now—from object removal to lighting changes—that was simply out of the question even a year ago.

Check out this little progression of edits, starting with the newly enhanced Generative Fill in the Photoshop beta, followed by a couple of steps of Remove, followed by a pass with Vividon & a few tweaks in Camera Raw (running inside PS):

Photoshop GenFill + https://t.co/FoIY1oy8vJ relighting = Magic #F35 #MoffettField pic.twitter.com/4JM7lQJnt9

— John Nack (@jnack) May 12, 2026

Nutty & I’m here for it. Per PetaPixel,

Co-founder and Chief Innovation Officer Marcus Kurn adds that the ability to deliver two or three lighting variations alongside every final image is a real differentiator: “once you start delivering two or three lighting variations with every final image, your clients will never want to go back.”

Vividon relighting comes to Photoshop

May 11, 2026AI/ML, Relightingjnack

“No prompting, no friction. Just incredible results.”

As I mentioned back in January, Vividon offers new generative relighting tech that promises amazing realism & identity preservation:

Vividon places every relight on its own Photoshop layer. Adjust opacity, change blend modes, paint in or out exactly what you want, or remove it entirely. Your original always stays untouched.

Check out a 10s demo below, and visit their site for a more interactive preview:

And here’s a full 2-minute tour:

“A vehicle that cares back”

May 11, 2026AI/ML, Idle Philosophizingjnack

“People will forget you said, people will forget what you did, but people will never forget how you made them feel.” — Maya Angelou

I’ve reflected on this maxim countless times over the last couple of years, as I’ve considered the relationships I want with AI—particularly with notional creative partners. I want a partner who cares—who (which?) actually takes the time to get to know me, asking thoughtful questions, noodling on answers, and genuinely taking my feedback to heart.

I thought of this while listening to Stewart Brand talking to Ezra Klein the other day. Check out this poetic & provocative passage:

Well, it wound up that, basically, most of the book is Chapter 2, “Vehicles.” And the land vehicle that humans have used for 6,000 years is a horse, and the horse takes a lot of maintenance.

I’ll read something here from the book, if I may. There’s this philosopher named Albert Borgmann who wrote:

You cannot remain unmoved by the gentleness and conformation of a well-bred and well-trained horse — more than a thousand pounds of big-boned, well-muscled animal, slick of coat and sweet of smell, obedient and mannerly, and yet forever a menace with its innocent power and ineradicable inclination to seek refuge in flight, and always a burden with its need to be fed, wormed and shod, with its liability to cuts and infections, to laming and heaves. But when it greets you with a nicker, nuzzles your chest and regards you with a large and liquid eye, the question of where you want to be and what you want to do has been answered.

And I end with: “I wonder if that might come again someday — a vehicle that cares back.”

The scarily beautiful animation of Sincitium

May 5, 2026AI/ML, Illustrationjnack

Side note: “Macrófago” is 100% the best word I’ve learned all week.

Sincitium is finally here.

We are pleased to present our latest piece: a concept trailer created specifically for the @runwayml Big Pitch Contest. For this project, we wanted to explore a completely different aesthetic from our usual studio style, and this film is the result of… pic.twitter.com/FHKkZWjjJg

— Contanimation (@Delachica_) May 4, 2026

AI filmmaking turns a (creepy, fun) corner

May 4, 2026AI/MLjnack

This is the first time I can recall watching a genuine narrative (not a handful of gee-whiz demo shots) made with AI & not really caring about the production details. We’re turning the inevitable corner where it’s just the quality of ideas & narrative that’ll matter—not so much how the proverbial sausage was made.

WE FOUND SOMETHING IN… [THE DEAD MALL]
Seedance 2.0 Omni-Reference

A girl gang and their scooter storms into an abandoned mall at 2 a.m. Inside, they stumble on something that has no business being there. Instead of running, they stay, and what follows spirals into total,… pic.twitter.com/fC25Q24w8H

— DAN · MXVDXN (@mxvdxn) May 1, 2026

FuruFuru crashes the set

May 4, 2026AI/MLjnack

Is it still brainrot if it’s really skillfully done, like several of these clip-bombing bits from FuruFuru? Check it out & be the judge:

I am obsessed with this Japanese man using AI video to put himself into movies

(he’s on IG at @ai_am_furufuru) pic.twitter.com/ePeVPkpEG4

— Justine Moore (@venturetwins) May 3, 2026

Deeper in the Flow state: Object insertion, and Doodle to edit

May 3, 2026AI/ML, Google Veojnack

Tap the pencil icon on any clip to insert objects directly into videos or remove elements, without changing anything else:

You can also draw or annotate on an image. Flow understands your doodles and incorporates them into your final frame. You can doodle directly in Flow instead of turning to a separate editing app.

Change camera angle in Google Flow

May 2, 2026AI/ML, Google Flow, Photographyjnack

Speaking of changing angles in photos & video, Google Flow now enables changing camera angle and motion in existing clips:

A couple more examples:

See yourself from a new angle in Google Photos

May 1, 20263D, AI/ML, Google Photos, Photographyjnack

Get some fresh perspective from our amazing teammates in research:

Today we are announcing a new approach to fix scene alignment after a photo was taken. Our method, now available as part of the Auto frame feature in Google Photos, uses machine learning (ML) models to understand the scene and its spatial layout and uses generative AI to imagine the photo from that new perspective. In contrast to classical photo editing, our method interprets a photo as a 3D scene — think of a real moment frozen in time — and change the camera position automatically within that space.

How to get the most from Nano Banana

April 30, 2026AI/ML, Nano Bananajnack

My new teammates have posted a series of detailed tips & tech specs (e.g. you can upload as many as 14 images together with a prompt). Check it out!

1. Introduction to Nano Banana

The Models: Overview of Nano Banana 2 (powered by real-time web search) and Nano Banana Pro (built for high-end reasoning).
Core Strengths: Deep reasoning capabilities, accurate visual rendering, and premium features like text rendering and upscaling (2K/4K).

2. Technical Specs at a Glance

Context Windows: Up to 131,072 input tokens for Nano Banana 2.
Versatility: Supports multiple aspect ratios (from 1:1 to 21:9) and up to 14 reference images in a single prompt.
Safety: Built-in SynthID watermarking and C2PA credentials for responsible AI use.

3. Best Practices for Prompting

Be Specific: Focus on concrete details regarding subject, lighting, and composition.
Positive Framing: Describe what should be there (e.g., “empty street”) rather than what shouldn’t.
Director’s Perspective: Use cinematic terms like “low angle,” “bokeh,” or “aerial view.”

4. Five Powerful Prompting Frameworks

Image Generation: Using the [Subject] + [Action] + [Context] + [Style] formula.
Image Editing: Utilizing “Semantic Masking” to change specific parts of an image via text.
Real-Time Data: Leveraging web search to create visuals based on current events or weather.
Text Rendering: How to get legible, localized text in over 10 languages within your images.
Creative Direction: Advanced tips for controlling lighting (e.g., Chiaroscuro), camera hardware (e.g., GoPro vs. Fujifilm), and film stock.

5. The Creative Ecosystem

How to combine Nano Banana with other models like Gemini (for prompt engineering), Veo (for video keyframes), and Lyria (for AI soundtracks).

Photoshop, 3D, and redemption

April 29, 20263D, AI/MLjnack

“Being early is the same as being wrong.” — Marc Andreessen, Vol. ~900

We put 3D into Photoshop nearly 20 years ago, and it got used by nearly 20 people total, lol. For many of the past several years, it was on the team’s “gotta throw overboard, as soon as we can find time” list—but happily that time was never found.

I am so glad to see this foundation now finding a meaningful niche, and I have high hopes for its generative future. Posing a person or thing directly is so much more intuitive than trying to precisely describe an outcome via prompt, and simple 3D manipulation + generative rendering could well deliver game-changing best of both worlds.

ついに新機能「オブジェクトを回転」が正式版に追加されました#PR #AdobePhotoshop #AdobePartner @creativecloudjp pic.twitter.com/f93OYx2hWo

— タマケン | デザイン (@DesignSpot_Jap) April 28, 2026

Sketching to control Nano Banana in Photoshop

April 29, 2026AI/ML, Illustration, Nano Bananajnack

Just like it says on the tin. Check it out:

Photoshop is the most powerful way to use Nano Banana 2

In photoshop you can sketch and control exactly where everything goes in your nano banana 2 generation

Here’s how I’ve been using it: #AdobeFireflyAmbassadors #Ad #AdobePartnerModels pic.twitter.com/c7YzV55JNS

— Allen T. (@Mr_AllenT) April 6, 2026

Canva’s new Magic Layers converter is really impressive

April 28, 2026AI/ML, Designjnack

As generative imaging models like Nano Banana get increasingly adept at rendering text-heavy layouts, the ability to convert those layouts into native text/image compositions is of course hugely valuable for editing. Check out Canva’s new Magic Layers feature:

Tus Posters con GPT Images 2.0 por fin son 100% editables

Con Canva puedes separar las capas y personalizar cada texto o imagen. Se acabó el conformarse con lo que te dé la IA: ahora el diseño es 100% tuyo

Te explico cómo hacerlo pic.twitter.com/UudG6Kk4zP

— ImPaul (@impaulxyz) April 26, 2026

I couldn’t resist trying it out with a silly infographic I made using the new ChatGPT image model, and dang if it didn’t do a pretty a great job:

“LooseRoPE” promises super intuitive illustration & compositing

April 27, 2026AI/ML, Illustrationjnack

Man, it must be nearly 20 years ago that we started envisioning drag-and-drop-simple composition and compositing in Photoshop—back when gradient-domain painting & blending was the emerging hotness. After plenty of false starts, could these simple interaction patterns finally become mainstream? Maybe! I must know more of this witchcraft:

Do you like image editing? Don’t like prompt engineering? Want to see what a giraffe-duck hybrid looks like?
If you answered yes at least once, you may like our new #SIGGRAPH2026 paper: LooseRoPE, which presents a new, prompt-free way to edit images using simple visual cues pic.twitter.com/JMzMDHJ9wE

— Etai Sella (@etai_sella) April 23, 2026

You can now verify Google AI-generated videos in the Gemini app

April 24, 2026AI/MLjnack

You can now check if a video was edited or created with Google AI directly in the Gemini app.

Just upload a video and ask something like, “Was this generated using Google AI?” Gemini will scan for the imperceptible SynthID watermark across both the audio and visual tracks and use its own reasoning to return a response that gives you context. For example, it might say: “SynthID detected within the audio between 10-20 secs. No SynthID detected in the visuals.”

Uploaded files can be up to 100 MB and 90 seconds long.

Scout with Maps, animate with Veo

April 22, 2026AI/MLjnack

Check out this super cool mashup between Google Maps & my new product, Veo (video generation):

280 billion Street View images + generative AI = Maps Imagery Grounding.

At #GoogleCloudNext, we announced that brands can now generate beautiful AI visuals, all anchored in Street View.

For example, when storyboarding, filmmakers can visualize a scene – like a spaceship taking… pic.twitter.com/epwc0GvAm2

— Miriam Daniel (@miriamkdaniel) April 22, 2026

The team writes,

With Maps Imagery Grounding, a film studio can use a laptop to quickly visualize a scene at a specific place, like Washington Square Park in New York City—before scouts ever set foot on set. It’s easy to use: just type a prompt like “generate an image of a futuristic spaceship hovering in front of the Washington Square Arch” into the Gemini Enterprise Agent Platform and enable grounding with Google Maps Imagery in settings. In seconds, you can storyboard your creative vision with an accurate image—and you can even use Veo to animate the scene.

Slick 360º camera control for Nano Banana

April 21, 2026AI/ML, Nano Banana, User Interfacejnack

Check out this cool little UI from Flora:

Get every angle from one product shot. Camera control rotates around any image in a full 360. Use it on PDPs, campaign stills, lifestyle, whatever you shot last week.

Now live in FLORA. pic.twitter.com/AqmxlAnGCZ

— FLORA © (@floraai) April 21, 2026

Did you know that Google Slides can make you into a video avatar?

April 16, 2026AI/MLjnack

I had no idea! And yet here “I” am, thanks to this new-to-me feature. In at least this first test, the visual likeness is very good, the gestures are a little off, and the voice is that of someone else (not shocking, as the creation flow asked me to read aloud only a couple of numbers):

“AI will never suffer from bipolar disorder and autism like me”

April 14, 2026AI/MLjnack

Spending four minutes listening to Diplo’s thoughts on how art will be made going forward, and specifically on the value of quirky, messy, world-experiencing humans will be a good use of your time, I promise. The machine needs us ghosts.

if you are a creative you need to adapt or just like give up and become an uber driver until everyone has a waymo. I know it’s not cool or classy to speak like this but i’m not gonna candy coat the future – it is what it is . sorry for bad new’s my purist . there will always need… https://t.co/SXswII51wv

— diplo (@diplo) April 14, 2026

“A rare look at how Hollywood is already using AI”

April 13, 2026AI/MLjnack

I’ve been sending this video to friends & family to explain what the heck it is I actually, y’know, do for a living. (It’s somehow related to enabling all this!)

Here’s a good summary from Gemini:

Digital Clones for A-listers (0:33–1:56): The Creative Artist Agency (CAA) is helping actors create and store secure digital doubles of their likeness and vocal inflections. This serves as a “vault” to protect their intellectual property and assert rights against unauthorized use.
Deep Voodoo’s AI Innovations (2:15–3:54): Founded by Trey Parker and Matt Stone of South Park, this studio uses proprietary facial scanning and AI to perform tasks like real-time de-aging for projects like the TV series Before and Billy Joel‘s recent music video.
Production Efficiency and Ethics (6:03–7:40): Director Darren Aronofsky and filmmaker Eliza McNitt utilized Google’s Veo 3 model for the short film Ancestra. AI allowed them to create complex cosmic visuals and even recreate a newborn baby digitally to avoid the ethical concerns of filming with a real infant.
Commercially Safe AI Tools (8:00–9:10): Asteria Film Company, co-founded by Natasha Lyonne and Bin Moser, focuses on building “commercially safe” AI models trained strictly on licensed materials to avoid copyright infringement, emphasizing that learning to use AI is an essential skill for modern filmmakers.
The Human Element (4:48–5:13): Despite the rapid evolution of AI, industry unions like SAG-AFTRA emphasize that human performers bring a unique, special quality to projects that algorithms cannot replicate, advocating for guardrails to ensure AI serves as a tool for creators rather than a replacement.

“Sketch to Vector” comes to Illustrator

April 8, 2026AI/ML, Illustration, Nano Bananajnack

Nano Banana + Adobe tech FTW! Here’s a quick look:

And here’s a deeper dive:

Reflection removal comes to Photoshop

April 7, 2026AI/ML, Photographyjnack

Here’s a practical, down-to-earth application of AI from my old teammates Dana & co.:

Phota launches, promising maximum identity preservation

April 5, 2026AI/ML, Nano Banana, Photographyjnack

Phota—about which I expressed some initial misgivings, given its ability to rewrite memories—has launched Phota Studio & their API. From what I can tell, it builds upon a Nano Banana foundation and adds personalization that relies on uploading dozens of images of each individual in order to maximize identity preservation:

With Phota, for the first time, you can generate, edit, and enhance photos while keeping your identity intact, every time.

We’re not building a generic foundation model. We build personal models about you, and about the people and pets around you. At the center are profiles, built from your personal album that learn the details of your appearance that make you recognizable as yourself: how you smile, your eye color, and how your face looks from different angles. Your personal model is private and only used by you.

Today, we introduce Phota Studio and Phota API, powered by our photography model that brings flagship image model capabilities, personalized to you.

With personalization, an image model stops being just playful and starts becoming useful for photography.

With Phota Studio, you… pic.twitter.com/UFOW32Vpvh

— Phota Labs (@PhotaLabs) March 26, 2026

Here’s a quick thread in which I tried inserting myself into a couple of images, using both Phota’s model (which depended on my uploading 30+ images of myself) and just Nano Banana straight out of the Gemini app:

Currently having fun Phota-bombing historical events in @PhotaLabs, which mixes their custom, identity-optimized model with @NanoBanana: pic.twitter.com/f3atvLkbxM

— John Nack (@jnack) April 6, 2026

FreePik enables 3D photo shoots

April 1, 20263D, AI/MLjnack

I love seeing progress like this: upload a product pic, convert it to 3D, and photograph it on a virtual set:

Your next 3D photo shoot will be done with AI

3D Scenes generates full environments from any image

→ Place your objects in the scene
→ Move the camera like a real shoot
→ Consistent lightning and detail across every angle

Available now on Freepik pic.twitter.com/blLN6fN1YW

— Freepik (@freepik) March 26, 2026

Go from 2D to a 3D

Upload your product photo → AI builds the scene around it → navigate freely in a 3D space

Rotate, zoom and explore every angle pic.twitter.com/waJf70Bdmn

— Freepik (@freepik) March 26, 2026

Photoshop’s Remove Tool gets smarter

March 31, 2026AI/ML, Photographyjnack

“Now with more distractions” isn’t usually the kind of thing one would tout—but as you’ll see, it’s just the kind of smarts people want for clean-up work:

Photoshop’s Remove Tool is getting a HUGE upgrade with more distractions.

A LOT more! pic.twitter.com/EYHp3Dilmm

— Howard Pinsky (@Pinsky) March 27, 2026

Runway debuts Multi-Shot

March 26, 2026AI/MLjnack

Here’s a fun, ultra-simple way to turn an image (or just a prompt) into a short, multi-shot narrative:

Introducing the Multi-Shot App. An easy way to go from a simple prompt to a thoughtfully crafted scene. All with dialogue, sound effects, intentional cuts, pacing and cinematic framing. Start from an image or go purely Text to Video for total creative exploration. Available now… pic.twitter.com/ek5uuuVf06

— Runway (@runwayml) March 26, 2026

Just for fun I fed it this image…

…and this prompt (based on an all-too-true story):

A family of Lego people and their dog gaze around Yosemite’s most iconic vista, then reminisce about that time they got stuck there in the snow in their VW van, expressing hope that they don’t get stuck again!

Check out the results:

I Love(art) to move it, move it…

March 25, 2026AI/ML, User Interfacejnack

I’ve long quoted James Ratliff, the super sharp designer behind Adobe’s Project Graph (who’s recently decamped to Figma), in nicely phrasing how the process of generating & refining ideas generally starts broad/declarative (searching, prompting) and moves towards fine-grained methods (selecting, moving, etc.):

I see an increasing number of tool & model creators mixing modalities—even in the Gemini Super Bowl ad featuring a mom & daughter drawing a simple circle to show where they’d like to add a dog bed.

I’m eager to check out Lovart’s take on the possibilities, especially for animation:

⚡️ New on Lovart: Move Object

→ Select any object with rectangular or lasso tool
→ Move it wherever you want
→ Prompt optional modifications
→ One clean, consistent image

No masks. No layers. No re-roll. pic.twitter.com/Sw800icnsu

— LovartAI (@lovart_ai) March 25, 2026

Update: Here’s a look at the UI, in which you can move & scale the selection rectangle, as well as the before & after images:

Spline enables agentic 3D creation

March 25, 20263D, AI/MLjnack

“3D scenes, websites, games, apps,” promises Spline. “Describe anything and Omma builds it for you in seconds.”

Omma combines code generation (LLMs), 3D AI mesh generation, and Image generation all in one place for you to build and ship. Deploy to production, assign custom domains, and more.