Interesting, interactive mash-ups powered by AI

May 7, 2021 | AI/ML, Photography | jnack

Check out how StyleMapGAN (paper, PDF, code) enables combinations of human & animal faces, vehicles, buildings, and more. Unlike simple copy-paste-blend, this technique permits interactive morphing between source & target pixels.

From the authors, a bit about what’s going on here:

Generative adversarial networks (GANs) synthesize realistic images from random latent vectors. Although manipulating the latent vectors controls the synthesized outputs, editing real images with GANs suffers from i) time-consuming optimization for projecting real images to the latent vectors, ii) or inaccurate embedding through an encoder. We propose StyleMapGAN: the intermediate latent space has spatial dimensions, and a spatially variant modulation replaces AdaIN. It makes the embedding through an encoder more accurate than existing optimization-based methods while maintaining the properties of GANs. Experimental results demonstrate that our method significantly outperforms state-of-the-art models in various image manipulation tasks such as local editing and image interpolation. Last but not least, conventional editing methods on GANs are still valid on our StyleMapGAN. Source code is available at https://github.com/naver-ai/StyleMapGAN.
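To make the "spatial dimensions" idea a bit more concrete, here is a minimal NumPy sketch of my reading of the abstract. It is not the authors' code (that lives in the repo linked above). The function names, the tensor shapes, and the choice of per-channel normalization are all assumptions made for illustration. The first two functions contrast AdaIN-style modulation (one scale and shift per channel) with a spatially variant version (a scale and shift at every pixel). The third shows how local editing could work if each image's latent is a small feature map: blend two such maps with a mask, then decode the result.

```python
import numpy as np

def normalize(x, eps=1e-5):
    """Normalize each channel of x (shape C x H x W) over its spatial positions.
    Assumption: per-channel statistics, chosen here only for illustration."""
    mean = x.mean(axis=(1, 2), keepdims=True)
    std = x.std(axis=(1, 2), keepdims=True)
    return (x - mean) / (std + eps)

def adain(x, scale, shift):
    """AdaIN-style modulation: one scale and one shift per channel,
    applied identically at every pixel. scale and shift have shape (C,)."""
    return normalize(x) * scale[:, None, None] + shift[:, None, None]

def spatial_modulation(x, scale_map, shift_map):
    """Spatially variant modulation: scale and shift are full C x H x W maps,
    so each location in the image can be styled differently."""
    return normalize(x) * scale_map + shift_map

def blend_stylemaps(w_src, w_ref, mask):
    """Hypothetical local-editing helper: mix two spatial latents.
    mask has shape (H, W) with values in [0, 1]; 1 means 'take the reference'."""
    m = mask[None, :, :]
    return m * w_ref + (1.0 - m) * w_src

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    w_src = rng.normal(size=(4, 8, 8))   # stand-in for the source image's latent map
    w_ref = rng.normal(size=(4, 8, 8))   # stand-in for the reference image's latent map

    # Take the right half from the reference and keep the left half of the source.
    mask = np.zeros((8, 8))
    mask[:, 4:] = 1.0
    mixed = blend_stylemaps(w_src, w_ref, mask)
    print(mixed.shape)
```

Fractional mask values give a soft transition between the two sources, which is roughly the "interactive morphing" effect described above. In a real pipeline the blended map would be fed to the generator; that step is omitted here.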

2 thoughts on “Interesting, interactive mash-ups powered by AI”

earth says:

May 8, 2021 at 2:08 pm

content aware fill + segmentation brushes + sensei = this?

1. jnack says:
  
  May 9, 2021 at 2:32 pm
  
  It’s definitely fun to think about how techniques like these could manifest in Adobe apps. Do you have specific ideas?
  
