Monthly Archives: September 2022

Meta introduces text to video 👀

September 29, 2022AI/MLjnack

Every. Single. Week. There’s. A. Breakthrough.

Text to video by Meta: https://t.co/S8AZXdNeov pic.twitter.com/vzGXrR7WEp

— Suhail (@Suhail) September 29, 2022

Per the site,

The system uses images with descriptions to learn what the world looks like and how it is often described. It also uses unlabeled videos to learn how the world moves. With this data, Make-A-Video lets you bring your imagination to life by generating whimsical, one-of-a-kind videos with just a few words or lines of text.

Completely insane. DesireToKnowMoreIntensifies.gif!

DALL•E is now available to everyone

September 28, 2022AI/ML, DALL•Ejnack

Whew—no more wheedling my “grand-mentee” Joanne on behalf of colleagues wanting access. 😅

Starting today, we are removing the waitlist for the DALL·E beta so users can sign up and start using it immediately. More than 1.5M users are now actively creating over 2M images a day with DALL·E—from artists and creative directors to authors and architects—with over 100K users sharing their creations and feedback in our Discord community.

You can sign up here. Also exciting:

We are currently testing a DALL·E API with several customers and are excited to soon offer it more broadly to developers and businesses so they can build apps on this powerful system.

It’s hard to overstate just how much this groundbreaking technology has rocked our whole industry—all since publicly debuting less than 6 months ago! Congrats to the whole team. I can’t wait to see what they’re cooking up next.

NVIDIA’s GET3D promises text-to-model generation

September 27, 20223D, AI/MLjnack

Depending on how well it works, tech like this could be the greatest unlock in 3D creation the world has ever known.

The company blog post features interesting, promising details:

Though quicker than manual methods, prior 3D generative AI models were limited in the level of detail they could produce. Even recent inverse rendering methods can only generate 3D objects based on 2D images taken from various angles, requiring developers to build one 3D shape at a time.
GET3D can instead churn out some 20 shapes a second when running inference on a single NVIDIA GPU — working like a generative adversarial network for 2D images, while generating 3D objects. […]
GET3D gets its name from its ability to Generate Explicit Textured 3D meshes — meaning that the shapes it creates are in the form of a triangle mesh, like a papier-mâché model, covered with a textured material. This lets users easily import the objects into game engines, 3D modelers and film renderers — and edit them.

See also Dream Fields (mentioned previously) from Google:

Photoshop-Stable Diffusion plugin adds inpainting with masks, layer-based img2img

September 26, 2022AI/MLjnack

Christian Cantrell + the Stability devs remain a house on fire:

The new version of the @StableDiffusion plugin for @Photoshop brings in-painting (masking) and layer-based img2img. Download it for free here:https://t.co/gqFWpABQLY pic.twitter.com/zMvq93WI8o

— Christian Cantrell (@cantrell) September 26, 2022

Here’s a more detailed (3-minute) walk-through of this free plugin:

Demo: Generating an illustrated narrative with DreamBooth

September 25, 2022AI/ML, Illustrationjnack

The Corridor Crew has been banging on Stable Diffusion & Google’s new DreamBooth tech (see previous) that enables training the model to understand a specific concept—e.g. one person’s face. Here they’ve trained it using a few photos of team member Sam Gorski, then inserted him into various genres:

From there they trained up models for various guys at the shop, then created an illustrated fantasy narrative. Just totally incredible, and their sheer exuberance makes the making-of pretty entertaining:

New Photoshop-Stable Diffusion plugin integrates with local or hosted engine

September 22, 2022AI/MLjnack

$89 from Flying Dog Software, available via the Adobe Exchange. Promised features (which I haven’t tried) include:

Support of your own Server and Stability Cloud
Text2Image, Inpainting (2 variations), and Image2Image
Preview Screen
Modifiers Library
Working on Selection
Tiling, Face Reconstruction, Multi Server Management and more

Generative dancing about architecture

September 20, 2022After Effectsjnack

Paul Trillo is back at it, extending a Chinese restaurant via Stable Diffusion, After Effects, and Runway:

Using @OpenAI dall-e 2 AI to dream up what a Chinese restaurant skyscraper might look like. The fluid frame interpolation was made with @runwayapp new super slow motion setting that also uses the power of AI to morph between frames. #aiart #ai #architecture #vfx #dalle pic.twitter.com/HMUvUyqsxA
— Paul Trillo (@paultrillo) September 15, 2022

Elsewhere, check out this mutating structure. (Next up: Falling Water made of actual falling water?)

I've been testing interpolated animations with Stable. Unlike FiLM, Deforum's notebook maintains consistency across all frames.#StableDiffusion #Deforum #AIart pic.twitter.com/zsrRRWRw1x
— Iban (@1ban3gaNa) September 17, 2022

Lexica adds reverse-image search

September 19, 2022AI/MLjnack

The Stable Diffusion-centered search engine (see a few posts back) now makes it easy to turn a real-world concept into a Stable Diffusion prompt:

Just added reverse image search to Lexica. You can upload a photo and it’ll return the most similar Stable Diffusion images and their prompts.

This makes it very to turn a real world concept into a Stable Diffusion prompt.https://t.co/0YdmzHqYNy pic.twitter.com/8t0UlfqG7W
— Sharif Shameem (@sharifshameem) September 19, 2022

This seems like precisely what I pined for publicly, albeit then about DALL•E:

Mobile DALL•E app idea: capture an image, then use GPT-3 to generate descriptive text, then use that as prompts for #dalle.

I want to know how to produce lettering like this: pic.twitter.com/dprziqKWzE
— John Nack (@jnack) June 5, 2022

Honoring creators’ wishes: Source+ & “Have I Been Trained”

September 18, 2022AI/MLjnack

I’m really excited to see this work from artists Holly Dryhurst & Mat Herndon. From Input Mag:

Dryhurst and Herndon are developing a standard they’re calling Source+, which is designed as a way of allowing artists to and opt into — or out of — allowing their work being used as training data for AI. (The standard will cover not just visual artists, but musicians and writers, too.) They hope that AI generator developers will recognize and respect the wishes of artists whose work could be used to train such generative tools.
Source+ (now in beta) is a product of the organization Spawning… [It] also developed Have I Been Trained, a site that lets artists see if their work is among the 5.8 billion images in the Laion-5b dataset, which is used to train the Stable Diffusion and MidJourney AI generators. The team plans to add more training datasets to pore through in the future.

The creators also draw a distinction between the rights of living vs. dead creators:

The project isn’t aimed at stopping people putting, say, “A McDonalds restaurant in the style of Rembrandt” into DALL-E and gazing on the wonder produced. “Rembrandt is dead,” Dryhurst says, “and Rembrandt, you could argue, is so canonized that his work has surpassed the threshold of extreme consequence in generating in their image.” He’s more concerned about AI image generators impinging on the rights of living, mid-career artists who have developed a distinctive style of their own.

And lastly,

“We’re not looking to build tools for DMCA takedowns and copyright hell,” he says. “That’s not what we’re going for, and I don’t even think that would work.”

On a personal note, I’m amused to see what the system thinks constitutes “John Nack”—apparently chubby German-ish old chaps…? 🙃

Google & NASA bring 3D to search

September 18, 20223D, AR/VRjnack

Great to see my old teammates (with whom I was working to enable cloud-rendered as well as locally rendered 3D experiences) continuing their work.

NASA and Google Arts & Culture have partnered to bring more than 60 3D models of planets, moons and NASA spacecraft to Google Search. When you use Google Search to learn about these topics, just click on the View in 3D button to understand the different elements of what you’re looking at even better. These 3D annotations will also be available for cells, biological concepts (like skeletal systems), and other educational models on Search.

Lexica: Search for AI-made art, with prompts

September 15, 2022AI/ML, Illustrationjnack

The makers of this new search engine say they’re already serving more than 200,000 images/day & growing rapidly. Per this article, “It’s a massive collection of over 5 million Stable Diffusion images including its text prompts.” Just get ready to see some… interesting art (?). 🙃

Stable Diffusion img2img support comes to Photoshop

September 14, 2022AI/MLjnack

More awesome work from Christian Cantrell in his free plugin. Check it out:

Runway previews text to video

September 13, 2022AI/MLjnack

“Days of Miracles and Wonder” Vol. ∞…

I think that Runway founder Cristóbal Valenzuela is right about the primacy of this design affordance. 😉

🐵 https://t.co/AczheFyyxv pic.twitter.com/zhtzHqwkFS

— John Nack (@jnack) September 8, 2022

AI + James Joyce = Poetry in motion

September 12, 2022AI/ML, Illustrationjnack

Lovely work from Glenn Marshall & friends:

'Consonance' is my project that explores how AI interprets the spoken word. This is an excerpt from a James Joyce novel. I use his words exactly for the prompt, in the style of artist John Lavery.
Funded by @futurescreensni
A collaboration with @HeaneyCentre + Armchair & Rocket pic.twitter.com/8sGgipjZeb
— Glenn Marshall (@GlennIsZen) September 8, 2022

AR: Stepping inside famous paintings with a boost from DALL•E

September 11, 2022AI/ML, AR/VR, DALL•Ejnack

Karen X. Cheng & pals (including my friend August Kamp) went to work extending famous works by Vermeer, Da Vinci, and Magritte, then placing them into AR filter (which you can launch from the post) that lets you walk right into the scenes. Wild!

View this post on Instagram

A post shared by Karen X (@karenxcheng)

Insta360 announces the X3

September 10, 2022Photographyjnack

Who’s got two thumbs & just pulled the trigger? This guuuuuy. 😌

Now, will it be worth it? I sure hope so.

Fortunately I got to try out the much larger & more expensive One R 1″ Edition back in July & concluded that it’s not for me (heavier, lacking Bullet Time, and not producing appreciably better quality results—at least for the kind of things I shoot).

I’m of course hoping the X3 (success to my much-beloved One X2) will be more up my alley. Here’s some third-party perspective:

Relight faces via a slick little web app

September 9, 2022AI/ML, Photography, Relightingjnack

Check out ClipDrop’s relighting app, demoed here:

🔥🔥 Today we, @clipdropapp are launching our 💡image relighting #AI application 💡

The app allows you to apply professional lights to your portrait images 📸 in real time ⚡

Try it now! it is free 🙂https://t.co/5mTsmLrp9R #photography #MachineLearning pic.twitter.com/SrQ09SsfDm

— Onur Tasar (@onurxtasar) September 7, 2022

Fellow nerds might enjoy reading about the implementation details.

Stable Diffusion arrives in Photoshop 🤖🎨🔥

September 8, 2022AI/MLjnack

Great work from developer Christian Cantrell! I’d love to know what you think of this.

AI art -> “Bullet Hell” & Sirenhead

September 7, 2022AI/ML, Illustrationjnack

“Shoon is a recently released side scrolling shmup,” says Vice, “that is fairly unremarkable, except for one quirk: it’s made entirely with art created by Midjourney, an AI system that generates images from text prompts written by users.’ Check out the results:

Midjourneyで生成した絵を使って横スクロールシューティングゲームを作ってみた pic.twitter.com/M6HUMhzKkW
— Nao_u (@Nao_u_) August 13, 2022

Meanwhile my friend Bilawal is putting generative imaging to work in creating viral VFX:

View this post on Instagram

A post shared by Billy3d (@billyfx.ig)

“Dreamcatching”: Generative AI for music vids

September 6, 2022AI/ML, Illustrationjnack

Trippy!

A bit behind the scenes:

Magdalena Bay has shared a new Felix Geen directed video for “Dreamcatching.” The clip, multi-dimensional explored through cutting-edge AI technology and GAN artwork, combined with VQGAN+CLIP, is a technique that utilizes a collection of neural networks that work in unison to generate images based on input text and/or images.

“Little Simple Creatures”: Family & game art-making with DALL•E

September 3, 2022AI/ML, DALL•E, Illustrationjnack

Creative director Wes Phelan shared this charming little summary of how he creates kids’ books & games using DALL•E, including their newly launched outpainting support:

John Oliver gets DALL•E-pilled

September 2, 2022AI/ML, DALL•E, Illustrationjnack

Judi Dench fighting a centaur on the moon!
Goose Pilates!

Happy Friday. 😅

DALL•E outpainting arrives

September 1, 2022AI/ML, DALL•Ejnack

Let the canvases extend in every direction! The thoughtfully designed new tiling UI makes it easy to synthesize adjacent chunks in sequence, partly overcoming current resolution limits in generative imaging:

We just released a new edit interface for DALL·E that lets you use Outpainting to expand beyond the original borders of an image!

You can use this to make images with different aspect ratios, or arbitrarily large images like murals or magazine covers. pic.twitter.com/OW4lC6HQFl
— David Schnurr (@_dschnurr) August 31, 2022

Here’s a nice little demo from our designer Davis Brown, who takes his dad Russell’s surreal desert explorations to totally new levels:

Outcropping! Now you can spend your credits faster. So much fun though. #dalle2 pic.twitter.com/amq41nNe5t
— Davis Taylor Brown (@Davistaylorbro) September 1, 2022

Nackblog

Musings on photography, illustration, mobile apps, and more