{"id":23636,"date":"2026-03-05T10:12:52","date_gmt":"2026-03-05T18:12:52","guid":{"rendered":"https:\/\/jnack.com\/blog\/?p=23636"},"modified":"2026-03-05T10:12:52","modified_gmt":"2026-03-05T18:12:52","slug":"speak-it-see-it-with-kreas-new-voice-mode","status":"publish","type":"post","link":"https:\/\/jnack.com\/blog\/2026\/03\/05\/speak-it-see-it-with-kreas-new-voice-mode\/","title":{"rendered":"Speak it -> See it, with Krea&#8217;s new voice mode"},"content":{"rendered":"\n<p>I try not to curse on this blog, doing so maybe a dozen times in 20+ (!!) years of posting. But circa 2013-2017, when I saw what felt like uncritical praise for Adobe&#8217;s <a href=\"https:\/\/jnack.com\/blog\/2017\/01\/13\/voice-driven-photo-editing-here-we-go-again\/\">voice-driven editing prototypes<\/a>, I called <em>bullshit<\/em>.<\/p>\n\n\n\n<p>The high-level concept was fine, but the tech at the time struck me as the worst of both worlds: the imprecision of language (e.g. how does a normal person know the term &#8220;saturation,&#8221; and how does an expert describe exactly how much they want?) combined with the fragility of traditional selection &amp; adjustment algorithms.<\/p>\n\n\n\n<p>Now, however, generative tech can indeed interpret our language &amp; effect changes\u2014and in the case of Krea&#8217;s new realtime mode, in a highly responsive way:<\/p>\n\n\n<blockquote class=\"twitter-tweet\">\n<p lang=\"en\" dir=\"ltr\">introducing Voice Mode. <\/p>\n<p>speak as you draw and get changes in real-time. <\/p>\n<p>available now in Krea iPad. <a href=\"https:\/\/t.co\/c6mHHjupmW\">pic.twitter.com\/c6mHHjupmW<\/a><\/p>\n<p>\u2014 KREA AI (@krea_ai) <a href=\"https:\/\/twitter.com\/krea_ai\/status\/2028496804124496057?ref_src=twsrc%5Etfw\">March 2, 2026<\/a><\/p>\n<\/blockquote>\n<p> <script async=\"\" src=\"https:\/\/platform.twitter.com\/widgets.js\" charset=\"utf-8\"><\/script><\/p>\n\n\n<p>Whether or not voice <em>per se<\/em> becomes a popular modality here, closing the gap between idea &amp; visual is just so seductive. To emphasize a <a href=\"https:\/\/jnack.com\/blog\/2026\/02\/18\/ui-realtime-generation-the-undiscovered-country\/\">previously made<\/a> point:<\/p>\n\n\n<blockquote class=\"twitter-tweet\">\n<p lang=\"en\" dir=\"ltr\">We simply have not started rethinking interactions from the grounds up. <\/p>\n<p>So many possibilities wide open when you think of human &#8211; AI in micro feedback loops vs automation alone or classic back and forth. <a href=\"https:\/\/t.co\/iVKb02SbdU\">https:\/\/t.co\/iVKb02SbdU<\/a><\/p>\n<p>\u2014 tuhin (@tuhin) <a href=\"https:\/\/twitter.com\/tuhin\/status\/2023920352586588647?ref_src=twsrc%5Etfw\">February 18, 2026<\/a><\/p><\/blockquote>\n<p> <script async=\"\" src=\"https:\/\/platform.twitter.com\/widgets.js\" charset=\"utf-8\"><\/script><\/p>","protected":false},"excerpt":{"rendered":"<p>I try not to curse on this blog, doing so maybe a dozen times in 20+ (!!) years of posting. But circa 2013-2017, when I saw what felt like uncritical praise for Adobe&#8217;s voice-driven editing prototypes, I called bullshit. The high-level concept was fine, but the tech at the time struck me as the worst [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"","sticky":false,"template":"","format":"standard","meta":[],"categories":[66,2,7],"tags":[],"_links":{"self":[{"href":"https:\/\/jnack.com\/blog\/wp-json\/wp\/v2\/posts\/23636"}],"collection":[{"href":"https:\/\/jnack.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/jnack.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/jnack.com\/blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/jnack.com\/blog\/wp-json\/wp\/v2\/comments?post=23636"}],"version-history":[{"count":1,"href":"https:\/\/jnack.com\/blog\/wp-json\/wp\/v2\/posts\/23636\/revisions"}],"predecessor-version":[{"id":23637,"href":"https:\/\/jnack.com\/blog\/wp-json\/wp\/v2\/posts\/23636\/revisions\/23637"}],"wp:attachment":[{"href":"https:\/\/jnack.com\/blog\/wp-json\/wp\/v2\/media?parent=23636"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/jnack.com\/blog\/wp-json\/wp\/v2\/categories?post=23636"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/jnack.com\/blog\/wp-json\/wp\/v2\/tags?post=23636"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}