All posts by jnack

OMG AI KFC

April 1, 2025AI/MLjnack

It’s insane what a single creator—in this case David Blagojević—can do with AI tools; insane.

I’m blown away!

This KFC concept ad is 100% AI generated!

My friend David Blagojevic (he’s not on X) created this ad concept for KFC and it’s incredible!

Tools used: Runway, Pika, Kling AI, Google DeepMind Veo2, Luma AI, OpenAI Sora, upscaled with Topaz Labs and music… pic.twitter.com/u9ics8M51x

— Salma (@Salmaaboukarr) March 31, 2025

It’s worth noting that creative synthesis like this doesn’t “just happen,” much less in some way that replaces or devalues the human perspective & taste at the heart of the process: everything still hinges on having an artistic eye, a wealth of long-cultivated taste, and the willpower to make one’s vision real. It’s just that the distance between that vision & reality is now radically shorter than it’s ever been.

New generative video hotness: Runway + Higgsfield

March 31, 2025AI/MLjnack

It’s funny to think of anyone & anything as being an “O.G.” in the generative space—but having been around for the last several years, Runway has as solid a claim as anyone. They’ve just dropped their Gen-4 model. Check out some amazing examples of character consistency & camera control:

Today we’re introducing Gen-4, our new series of state-of-the-art AI models for media generation and world consistency. Gen-4 is a significant step forward for fidelity, dynamic motion and controllability in generative media.

Gen-4 Image-to-Video is rolling out today to all paid… pic.twitter.com/VKnY5pWC8X

— Runway (@runwayml) March 31, 2025

Here’s just one of what I imagine will be a million impressive uses of the tech:

First test with @runwayml‘s Gen-4 early access!

First impressions: I am very impressed! 10 second generations, and this is the only model that could do falling backwards off a cliff. Love it! pic.twitter.com/GZS1B7Wpq0

— Christopher Fryant (@cfryant) March 31, 2025

Meanwhile Higgsfield (of which I hadn’t heard before now) promises “AI video with swagger.” (Note: reel contains occasionally gory edgelord imagery.)

Now, AI video doesn’t have to feel lifeless.

This is Higgsfield AI: cinematic shots with bullet time, super dollies and robo arms — all from a single image.

It’s AI video with swagger.

Built for creators who move culture, not just pixels. pic.twitter.com/dJdQ978Jqd

— Higgsfield AI (@higgsfield_ai) March 31, 2025

Fun with empowering existential dread

March 30, 2025AI/MLjnack

It’s so good, it’s bad! 😀

Currently asking ChatGPT for faux-German words like
Überintelligenzchatbotrichtigkeitsahnungsscham:

“The bizarre cocktail of joy, panic, and existential dread a product manager experiences when an AI answers a tough product question better than they could.” pic.twitter.com/yVZVdoKZi9

— John Nack (@jnack) March 30, 2025

Virtual product photography in ChatGPT

March 29, 2025AI/ML, GPT-4ojnack

Seeing this, I truly hope that Adobe isn’t as missing in action as they seem to be; fingers crossed.

In the meantime, simply uploading a pair of images & a simple prompt is more than enough to get some compelling results. See subsequent posts in the thread for details, including notes on some shortcomings I observed.

A quick test of ChatGPT virtual product photography, combining real shoes with a quick render from @krea_ai/@bfl_ml Flux:
“Please put these shoes into the image of the basketball court, held aloft in the foreground by a man’s hand.” pic.twitter.com/k1AhTdHFcs

— John Nack (@jnack) March 28, 2025

See also (one of a million tests being done in parallel, I’m sure):

Still experimenting with chatgpt4o

prompt: “model wearing cap provided”

not bad pic.twitter.com/FObSXeyxOS

— Salma (@Salmaaboukarr) March 26, 2025

Depression-era Ghibli

March 27, 2025AI/MLjnack

We’re speed-running our way through the novelty->saturation->nausea cycle of Studio Ghibli-style meme creation, but I find this idea fresher: turn Ghibli characters into Dorothea Lange-style photos:

https://t.co/gl8U6x9Q8m

— Sterling Crispin (@sterlingcrispin) March 26, 2025

Ideogram 3.0 is here

March 27, 2025AI/MLjnack

In the first three workdays of this week, we saw three new text-to-image models arrive! And now that it’s Thursday, I’m like, “WTF, no new Flux/Runway/etc.?” 🙂

For the last half-year or so, Ideogram has been my go-to model (see some of my more interesting creations), so I’m naturally delighted to see them moving things forward with the new 3.0 model:

I don’t yet quite understand the details of how their style-reference feature will work, but I’m excited to dig in.

Meanwhile, here’s a thread of some really impressive initial creations from the community:

We launched Ideogram 3.0 just three hours ago, and we’ve already seen an incredible wave of striking images. Here are 16 of our favorites so far:

1/ @krampus76 pic.twitter.com/tbwfMfkvg5

— Ideogram (@ideogram_ai) March 26, 2025

LegoGPT

March 26, 2025AI/MLjnack

The family that bricks together, sticks together? 🙂

Worked great on this group shot as well—with the exception of disappearing one cousin! pic.twitter.com/pZzXfPurv3

— John Nack (@jnack) March 26, 2025

ChatGPT reimagines family photos

March 26, 2025AI/ML, GPT-4o, Illustrationjnack

“Dress Your Family in Corduroy and Denim” — David Sedaris
“Turn your fam into Minecraft & GTA” — Bilawal Sidhu

Entire ComfyUI workflows just became a text prompt.

Open an image in GPT-4o and type “turn us into Roblox / GTA-3 /Minecraft / Studio Ghibli characters” pic.twitter.com/rCXclZklq5

— Bilawal Sidhu (@bilawalsidhu) March 26, 2025

And meanwhile, on the server side:

ChatGPT when another Studio Ghibli request comes in pic.twitter.com/NF5sy24GlU

— Justine Moore (@venturetwins) March 26, 2025

RIP to ZIP

March 24, 2025Designjnack

Oh man, this vid from Aaron Draplin—stalwart hoarder of obsolete removable media—gave me all the feels, and if you’re a creative of a certain age, it might give you them, too:

View this post on Instagram

A post shared by Aaron James Draplin (@draplin)

Google’s “Photoshop Killer”?

March 23, 2025AI/MLjnack

Nearly twenty years ago (!), I wrote here about how The Killing’s Gotta Stop—ironically, perhaps, about then-new Microsoft apps competing with Adobe. I rejected false, zero-sum framing then, and I reject it now.

Having said that, my buddy Bilawal’s provocative framing in this video gets at something important: if Adobe doesn’t get on its game, actually delivering the conversational editing capabilities we publicly previewed 2+ years ago, things are gonna get bad. I’m reminded of the axiom that “AI will not replace you, but someone using AI just might.” The same goes for venerable old Photoshop competing against AI-infused & AI-first tools.

In any case, if you’re interested in the current state of the art around conversational editing (due to be different within weeks, of course!), I think you’ll enjoy this deep dive into what is—and isn’t—possible via Gemini:

Specific topic sections, if you want to jump right to ’em:

00:00 Conversational Editing with Google’s Multimodal AI
00:53 Image Generation w/ LLM World Knowledge
02:12 Easy Image Editing & Colorization
02:46 Advanced Conversational Edits (Chaining Prompts Together)
03:37 Long Text Generation (Google Beats OpenAI To The Punch)
04:25 Making Spicy Memes (Google AI Studio Safety Settings)
05:48 Advanced Prompting (One Shot ComfyUI Workflows)
07:19 Re-posing Characters (While Keeping Likeness Intact)
08:27 Spatial 3D Understanding (NO ControlNet)
10:42 Semantic Editing & In/Out Painting
13:46 Sprite Sheets & Animation Keyframes
14:40 Using Gemini To Build Image Editing Apps
16:37 Making Videos w/ Conversational Editing

Happy birthday, Adobe Firefly

March 21, 2025Adobe Firefly, AI/MLjnack

The old (hah! but it seems that way) gal turns two today.

The ride has been… interesting, hasn’t it? I remain eager to see what all the smart folks at Adobe have been cooking up. As a user of Photoshop et al. for the last 30+ years, I selfishly hope it’s great!

Welcome to the world, #AdobeFirefly! https://t.co/R92lBktZIQ

We have great stuff you can out try right now, plus so much brewing in the lab. Here’s a quick preview: pic.twitter.com/hIaW9EpMor

— John Nack (@jnack) March 21, 2023

In the meantime, I’ll admit that watching the video above—which I wrote & then made with the help of Davis Brown (son of Russell)—makes me kinda blue. Everything it depicts was based on real code we had working at the time. (I insisted that we not show anything that we didn’t think we could have shipping within three months’ time.) How much of that has ever gotten into users’ hands?

Yeah.

But as I say, I’m hoping and rooting for the best. My loyalty has never been to Adobe or to any other made-up entity, but rather to the spirit & practice of human creativity. Always will be, until they drag me off this rock. Rock the F on.

Adobe to offer access to non-Firefly models

March 20, 2025Adobe Firefly, AI/MLjnack

Man, I’m old enough to remember writing a doc called “Yes, And…” immediately upon the launch of DALL•E in 2022, arguing that of course Adobe should develop its own generative models and of course it should also offer customers a choice of great third-party models—because of course no single model would be the best for every user in every situation.

And I’m old enough to remember being derided for just not Getting It™ about how selling per-use access to Firefly was going to be a goldmine, so of course we wouldn’t offer users a choice. ¯\_(ツ)_/¯

Oh well. Here we are, exactly two years after the launch of Firefly, and Adobe is going to offer access to third-party models. So… yay!

Even more news today! We are expanding our footprint in the @Adobe ecosystem to offer more choice to their creators pic.twitter.com/A4tHRkb25h

— Black Forest Labs (@bfl_ml) March 19, 2025

Inconvenient versions of everyday objects

March 19, 2025Designjnack

Heh—let’s get funcomfortable with Katerina Kamprani:

View this post on Instagram

A post shared by Art & Calligraphy (@artsdawn)

Roblox lets players create 3D objects simply by describing them

March 18, 20253D, AI/MLjnack

I guarantee you, the TTP for the feature is less than the length of this 45-second promo. :-p

Putting Gemini editing to the test

March 17, 2025AI/MLjnack

Here’s a little holiday-appropriate experiment featuring a shot of my dad & me (in Lego form, naturally) at my grandmother’s family farm in County Mayo. Sláinte!

A little St. Paddy’s fun testing Google @GeminiApp‘s conversational editing abilities on Lego pics from Ireland: pic.twitter.com/LPCD0D3igi

— John Nack (@jnack) March 17, 2025

Happy glitter-free St. Pat’s

March 17, 2025Miscellaneousjnack

“When are we gonna start jazzing things down?? St. Patrick’s Day should be shit!” :-p

View this post on Instagram

A post shared by Garron Noone (@garron_music)

Wardrobe upgrades courtesy of Gemini

March 16, 2025AI/ML, Try-onjnack

Speaking of reskinning imagery (see last several posts), check out what’s now possible via Google’s Gemini model, below. I’ve been putting it to the test & will share results shortly.

Alright, Google really killed it here.

You can easily swap your garment just by uploading the pieces to Gemini Flash 2.0 and telling it what to do. pic.twitter.com/pNPBkIdRqy

— Halim Alrasihi (@HalimAlrasihi) March 14, 2025

Photoshop gets new background-removal skills

March 14, 2025AI/MLjnack

This enhanced capability, which apparently now uses a cloud-hosted model, looks really promising. See before & after:

The Photoshop Beta also has some pretty wild improvements to Remove Background pic.twitter.com/yu7u8ISbMW

— Howard Pinsky (@Pinsky) March 13, 2025

Another example:

https://t.co/VuXQVHMkN1 pic.twitter.com/mcy0nQ3b6m

— Howard Pinsky (@Pinsky) March 14, 2025

Runway reskins rock

March 13, 2025AI/ML, Illustrationjnack

Another day, another set of amazing reinterpretations of reality. Take it away Nathan…

3 tests of Runway’s first frame feature. It’s very impressive and temporally coherent. Input is a video and stylized first frame. ✨

First example here is a city aerial to: circuit board, frost, fire, Swiss cheese, Tokyo. #aivideo #VFX pic.twitter.com/Y7HST74uBy

— Nathan Shipley (@CitizenPlain) March 6, 2025

…and Bilawal:

Playing guitar, reskinned with Runway’s restyle feature — pretty epic for digital character replacement.

I’m genuinely impressed by how well the fretting & strumming hands hold up.

Not perfect yet, but pulling this off would basically be impossible with Viggle or even Wonder… pic.twitter.com/UJBS9c8U1a

— Bilawal Sidhu (@bilawalsidhu) March 7, 2025

PikaSwaps nails virtual try-on

March 12, 2025AI/ML, AR/VR, Try-onjnack

This temporally coherent inpainting is utterly bonkers. It’s just the latest—and perhaps the most promising—in myriad virtual try-on techniques I’ve seen & written about over the years.

This is effortless fashion

Made with @pika_labs Pikaswaps feature pic.twitter.com/BE9LDP8eAR

— Jessie_Ma (@ytjessie_) March 12, 2025

Mystic structure reference: Dracarys!

March 11, 20253D, AI/ML, Illustrationjnack

I love seeing the Magnific team’s continued rapid march in delivering identity-preserving reskinning

IT’S FINALLY HERE!

Mystic Structure Reference!

Generate any image controlling structural integrity Infinite use cases! Films, 3D, video games, art, interiors, architecture… From cartoon to real, the opposite, or ANYTHING in between!

Details & 12 tutorials pic.twitter.com/brw4Dx39gz

— Javi Lopez (@javilopen) February 27, 2025

This example makes me wish my boys were, just for a moment, 10 years younger and still up for this kind of father/son play. 🙂

Storyboarding? No clue! But with some toy blocks, my daughter’s wild imagination, and a little help from Magnific Structure Reference, we built a castle attacked by dragons. Her idea coming to life powered up with AI magic.
Just a normal Saturday Morning.
Behold, my daughter’s… pic.twitter.com/52tDZokmIT

— Jesus Plaza (@JesusPlazaX) March 8, 2025

Behind the scenes: AI-augmented animation

March 10, 2025AI/ML, Illustrationjnack

“Rather than removing them from the process, it actually allowed [the artists] to do a lot more—so a small team can dream a lot bigger.”

Paul Trillo’s been killing it for years (see innumerable previous posts), and now he’s given a peek into how his team has been pushing 2D & 3D forward with the help of custom-trained generative AI:”

Traditional 2d animation meets the bleeding edge of experimental techniques. This is a behind the scenes look at how we at Asteria brought the old and the new together in this throwback animation “A Love Letter to Los Angeles” and collaboration with music artist Cuco and visual… pic.twitter.com/3eWSdgckXn

— Paul Trillo (@paultrillo) March 7, 2025

“Fast food, but make it Lego”

March 7, 2025AI/MLjnack

Here’s a fun use of Flux->Minimax (see workflow details):

Fast food, but make it Lego.
byu/Sad-Ambassador-9040 incomfyui

Charmingly terrible AI-made infographics

March 6, 2025AI/ML, Illustration, Infographicsjnack

A passing YouTube vid made me wonder about the relative strengths of World War II-era bombers, and ChatGPT quickly obliged by making me a great little summary, including a useful table. I figured, however, that it would totally fail at making me a useful infographic from the data—and that it did!

Just for the lulz, I then ran the prompt (“An infographic comparing the Avro Lancaster, Boeing B-17, and Consolidated B-24 Liberator bombers”) through a variety of apps (Ideogram, Flux, Midjourney, and even ol’ Firefly), creating a rogue’s gallery of gibberish & Franken-planes. Check ’em out.

Currently amusing myself with how charmingly bad every AI image generator is at making infographics—each uniquely bizarre! pic.twitter.com/U3cs8ySoVa

— John Nack (@jnack) March 6, 2025

Surrealism blooms through Pika

March 5, 2025AI/MLjnack

Check out this delightful demo:

By combining @pika_labs Pikaframes and @freepik, I now have the magical ability to jump through space and time and in this example, music becomes a transformative element teleporting this woman to a new location. This is how it’s done. 1/6

The videos below are fully narrated… pic.twitter.com/06WtgI50ZV

— Travis Davids (@MrDavids1) March 3, 2025

Individual steps, as I understand them:

Generate image (in this example, using Google Imagen).
Apply background segmentation.
Synthesize a new background, and run what I think is a fine-tuned version of IC-Light (using Stable Diffusion) to relight the entire image, harmonizing foreground/background. Note that identity preservation (face shape, hair color, dress pattern, etc.) is very good but not perfect; see changes in the woman’s hair color, expression, and dress pattern.
Put the original & modified images into Pika, then describe the desired transformation (smooth transition, flowers growing, clouds moving, etc.).

VSCO introduces Canvas

March 4, 2025AI/ML, Photographyjnack

Another day, another ~infinite canvas for ideation & synthesis. This time, somewhat to my surprise, the surface comes from VSCO—a company whose users I’d have expected to be precious & doctrinaire in their opposition to any kind of AI-powered image generation. But who knows, “you can just do things.” ¯\_(ツ)_/¯

View this post on Instagram

A post shared by VSCO | Photo & Video Editor (@vsco)

NeRFtastic BAFTAs

February 28, 20253D, AI/MLjnack

The British Academy Film Awards have jumped into a whole new dimension to commemorate the winners of this year’s awards:

The capturing work was led by Harry Nelder and Amity Studio. Nelder used his 16-camera rig to capture the recent winners. The reconstruction software was a combination of a cloud-based platform created by Nelder, which is expected to be released later this year, along with Postshot. Nelder further utilized the Radiance Field method known as Gaussian Splatting for the reconstruction. A compilation video of all the captures, recently posted by BAFTA, was edited by Amity Studio

[Via Dan Goldman]

Lego together creative AI blocks in Flora

February 26, 2025AI/MLjnack

Looks promising:

Introducing FLORA, Your Intelligent Canvas.

Every creative AI tool, thoughtfully connected. pic.twitter.com/SUHrHtrQmn

— weber (@weberwongwong) February 26, 2025

Their pitch:

Create workflows, not just outputs. Connect Blocks to shape, refine, and scale your creative process.
Collaborate in real time. Work like you would in Figma, but for AI-powered media creation.
Discover & clone workflows. Learn from top creatives, build on proven systems and share generative workflows inside FLORA’s Community.

Sigma BF: Clean AF

February 25, 2025Design, Photographyjnack

Refreshingly simple design!

Is it for me? Dunno: lately the only thing that justifies shooting with something other than my phone is a big, fast zoom lens, and I don’t know whether pairing such a thing with this slim beauty would kinda defeat the purpose. Still, I must know more…

Here’s a nice early look at the cam plus a couple of newly announced lenses:

Perhaps image-to-3D was a mistake…

February 22, 20253D, AI/MLjnack

Behold the majesty (? :-)) of CapCut’s new “Microwave” filter (whose name makes more sense if you listen with sound on):

https://youtube.com/shorts/bshQXczbZdw?si=aFwvtgs-fKf2wl8x

As I asked Bilawal, who posted the compilation, “What is this, and how can I know less about it?”

Impressive product insertion in Ideogram

February 21, 2025AI/ML, Ideogramjnack

Slightly funky UI (I’d never have figured this out on my own), but amazing identity preservation! (Why can’t I do anything like this in Photoshop…?)

EditIQ edits single long shots into multiples virtual shots

February 19, 2025AI/MLjnack

Check it out (probably easier to grok by watching vs. reading a description):

From the static camera feed, EditIQ initially generates multiple virtual feeds, emulating a team of cameramen. These virtual camera shots termed rushes are subsequently assembled using an automated editing algorithm, whose objective is to present the viewer with the most vivid scene content.

Controlling video generation with simple props

February 18, 2025AI/MLjnack

Tired: Random “slot machine”-style video generation
Inspired: Placing & moving simple guidance objects to control results:
Check out VideoNoiseWarp:

Every now and then something comes along that feels like it could change everything… NoiseWarp + CogVideoX lets you animate live action scenes with rough mockups!

ComfyUI nodes by @Kijaidesign https://t.co/AziU049jbg pic.twitter.com/eZsXJ38lxv

— Ingi Erlingsson (@ingi_erlingsson) January 21, 2025

Analog meets AI in the papercraft world of Karen X Cheng

February 14, 2025Adobe Firefly, AI/ML, Generative Filljnack

Check out this fun mixed-media romp, commissioned by Adobe:

This video combines AI-generated elements (balloon, kite, surfboard, and backgrounds) with my own real-world practical effects and stop motion.

I made this for @Adobe Firefly and I’ll share tutorial tomorrow!

Thanks @Adobe for sponsoring my art #AdobePartner #AdobeFirefly pic.twitter.com/dPLrzCchH9

— Karen X. Cheng (@karenxcheng) February 12, 2025

And here’s a look behind the scenes:

Here’s the tutorial! This video combines AI-generated elements (balloon, kite, surfboard, and backgrounds) with my own real-world practical effects and stop motion.

I made this for #AdobeFirefly
Thanks @Adobe for sponsoring my art #AdobePartner pic.twitter.com/yUZtMlwk2r

— Karen X. Cheng (@karenxcheng) February 13, 2025

YouTube + Veo

February 13, 2025AI/MLjnack

The YouTube mobile app can now tap into Google’s Veo model to generate video, as shown below. Hmm—this feels pretty niche at the moment, but it may suggest the shape of things to come (ubiquitous media synthesis, anywhere & anytime it’s wanted).

View this post on Instagram

A post shared by Bilawal Sidhu (@bilawal.ai)

A cool Firefly image->video flow

February 12, 2025Adobe Firefly, AI/MLjnack

For the longest time, Firefly users’ #1 request was to use images to guide composition of new images. Now that Firefly Video has arrived, you can use a reference image to guide the creation of video. Here’s a slick little demo from Paul Trani:

Firefly Video (beta) is now available to everyone! Give it a whirl and share your results!https://t.co/sOeN1pwXcV #adobefirefly #communityxadobe pic.twitter.com/ZOvkqKSq9T

— Paul Trani (@paultrani) February 12, 2025

Titles: Severance Season 2

February 11, 2025Designjnack

Building on the strong work from the previous season,

Berlin’s Extraweg have created… a full-blown motion design masterpiece that takes you on a wild ride through Mark’s fractured psyche. Think trippy CGI, hypnotic 3D animations, and a surreal vibe that’ll leave you questioning reality. It’s like Inception met a kaleidoscope, and they decided to throw a rave in your brain. [more]

Google Photos will flag AI-manipulated images

February 10, 2025AI/ML, Google Photosjnack

These changes, reported by Forbes, sound like reasonable steps in the right direction:

Starting now, Google will be adding invisible watermarks to images that have been edited on a Pixel using Magic Editor’s Reimagine feature that lets users change any element in an image by issuing text prompts.

The new information will show up in the AI Info section that appears when swiping up on an image in Google Photos.

The feature should make it easier for users to distinguish real photos from AI-powered manipulations, which will be especially useful as Reimagined photos continue to become more realistic.

Minimalist mograph in ChatGPT’s Super Bowl spot

February 10, 2025Design, Illustrationjnack

I really love the way the visual medium (simply black & white dots) enriches & evolves right alongside its subject matter in this ad for ChatGPT, and I hope we get to hear more soon from the creative team behind it.

DeepSeek meets Flux in Krea Chat

February 9, 2025AI/MLjnack

Conversational creation & iteration is such a promising pattern, as shown through people making ChatGPT take images to greater & greater extremes:

pic.twitter.com/7SQwBMyrlv

— No Context Shitposting (@NoContextCrap) February 8, 2025

But how do we go from ironic laughs to actual usefulness? Krea is taking a swing by integrating (I think) the Flux imaging model with the DeepSeek LLM:

Krea Chat is here.

a brand new way of creating images and videos with AI.

open beta out now. pic.twitter.com/dbHX31l92A

— KREA AI (@krea_ai) February 7, 2025

It doesn’t yet offer the kind of localized refinements people want (e.g. “show me a dog on the beach,” then “put a hat on the dog” and don’t change anything outside the hat area). Even so, it’s great to be able to create an image, add a photo reference to refine it, and then create a video. Here’s my cute, if not exactly accurate, first attempt. 🙂

A mind-blowing Gemini + Illustrator demo

February 6, 2025AI/MLjnack

Wow—check out this genuinely amazing demo from my old friend (and former Illustrator PM) Mordy:

In this video, I show how you can use Gemini in the free Google AI Studio as your own personal tutor to help you get your work done. After you watch me using it to learn how to take a sketch I made on paper to recreating a logo in Illustrator, I promise you’ll be running to do the same.

Cadbury celebrates “Unsung heroes of daily life”

February 5, 2025Designjnack

What an amazingly simple, charming idea for a campaign. Swipe through the post to see the clever applications:

View this post on Instagram

A post shared by Marketing Mentor (@marketingmentor.in)

MatAnyone promises incredible video segmentation

February 3, 2025AI/MLjnack

What the what?

this looks insane, MatAnyone

Stable Video Matting with Consistent Memory Propagation pic.twitter.com/tt1k23raYv

— AK (@_akhaliq) February 3, 2025

Per the paper,

We propose MatAnyone, a robust framework tailored for target-assigned video matting. Specifically, building on a memory-based paradigm, we introduce a consistent memory propagation module via region-adaptive memory fusion, which adaptively integrates memory from the previous frame. This ensures semantic stability in core regions while preserving fine-grained details along object boundaries.

Premiere Pro now lets you find video clips by describing them

February 2, 2025AI/MLjnack

I love it: nothing too fancy, nothing controversial, just a solid productivity boost:

Users can enter search terms like “a person skating with a lens flare” to find corresponding clips within their media library. Adobe says the media intelligence AI can automatically recognize “objects, locations, camera angles, and more,” alongside spoken words — providing there’s a transcript attached to the video. The feature doesn’t detect audio or identify specific people, but it can scrub through any metadata attached to video files, which allows it to fetch clips based on shoot dates, locations, and camera types. The media analysis runs on-device, so doesn’t require an internet connection, and Adobe reiterates that users’ video content isn’t used to train any AI models.

Goodbye, endless scrolling. Hello, AI-powered search panel. With the all-new Media Intelligence in #PremierePro (beta), the content of your clips is automatically recognized, including objects, locations, camera angles & more. Just input your search to find exactly what you need. pic.twitter.com/cOYXDKKaFI

— Adobe Video & Motion (@AdobeVideo) January 22, 2025

Celebrating the skate art of Jim Phillips

January 31, 2025Illustrationjnack

If you’re like me, you may well have spent hours of your youth lovingly recreating the iconic designs of pioneering Santa Cruz artist Jim Phillips. My first deck was a Roskopp 6, and I covered countless notebook covers, a leg cast, my bedroom door, and other surfaces with my humble recreations of his work.

That work is showcased in the documentary “Art And Life,” screening on Thursday in Santa Cruz. I hope to be there, and maybe to see you there as well. (To this day I can’t quite get over the fact that “Santa Cruz” is a real place, and that I can actually visit it. Growing up it was like “Timbuktu” or “Shangri-La.” Funny ol’ world.)

8-bit Lego stop-motion mayhem

January 30, 2025Designjnack

Up my alley, or way up my alley?? 🙂

View this post on Instagram

A post shared by Rogier Wieland Studio (@rogier_wieland_studio)

Breakthrough interview with the DeepSeek co-founder

January 29, 2025Miscellaneousjnack

Huge if true. 😉

wow, found a rare interview of a DeepSeek co-founder talking about his first AI startup exit a few years ago pic.twitter.com/AWDUhbeRfh

— Trung Phan (@TrungTPhan) January 29, 2025

SNL goes God mode

January 29, 2025Illustrationjnack

“The whole solar system honestly slaps…” -God

This is 100% how all of my younger colleagues’ conversations sound to me. 🙂

Gemini turns photos into interactive simulations (!)

January 28, 2025AI/MLjnack

Check out this wild proof of concept from Trudy Painter at Google, and click into the thread for details.

Photos → Creative Code using Gemini

I built an experiment that turns photos into interactive @p5xjs sketches using Gemini 2.0 Flash.

Unlike UI generators, this creates code that mimics the *behavior* of what’s in the image – like smoke swirling or ripples spreading.

Check… pic.twitter.com/BbhYqUmZxA

— Trudy Painter (@trudypainter) January 23, 2025

Quick fun with Krea, Flux, custom training, and 3D

January 27, 20253D, AI/MLjnack

Putting the proverbial chocolate in the peanut butter, those fast-moving kids at Krea have combined custom model training with 3D-guided image generation. Generation is amazingly fast, and the results are some combo of delightful & grotesque (aka “…The JNack Story”). Check it out:

God help you, though, if you import your photo & convert it to 3D for use with the realtime mode. (Who knew I was Cletus the Slack-Jawed Yokel?) pic.twitter.com/nuesUOZ1Db

— John Nack (@jnack) January 27, 2025