“Stunning”—Midjourney update wows AI artists with camera-like feature

Midjourney 5.2 allows
Enlarge / Midjourney 5.2 allows “zooming out” on synthesized images. The original synthetic image is shown in the red dotted box here.

On Thursday, Midjourney unveiled version 5.2 of its AI-powered image synthesis model, which includes a new “zoom out” feature that allows maintaining a central synthesized image while automatically building out a larger scene around it, simulating zooming out with a camera lens.

Similar to outpainting—an AI imagery technique introduced by OpenAI’s DALL-E 2 in August 2022—Midjourney’s zoom-out feature can take an existing AI-generated image and expand its borders while keeping its original subject centered in the new image. But unlike DALL-E and Photoshop’s Generative Fill feature, you can’t select a custom image to expand. At the moment, v5.2’s zoom-out only works on images generated within Midjourney, a subscription AI image-generator service.

On the Midjourney Discord server (still the official interface for Midjourney, although plans are underway to change that), users can experiment with zooming out by generating any v5.2 image (now the default) and upscaling a result. After that, special “Zoom” buttons appear below the output. You can zoom out by a factor of 1.5x, 2x, or a custom value between 1 and 2. Another button, called “Make Square,” will generate material around the existing image in a way that creates a 1:1 square aspect ratio.

David Holz, the creator of Midjourney, announced the new v5.2 features and improvements on the Discord server Thursday night. Aside from “zoom out,” the most significant additions include an overhauled aesthetic system, promising better image quality and a stronger “–stylize” command that effectively influences how non-realistic an image looks. There’s also a new “high variation mode,” activated by default, that increases compositional variety among image generations. Additionally, a new “/shorten” command enables users to assess prompts in an attempt to trim out non-essential words.

Despite the immediate rollout of v5.2, Holz emphasized in his announcement that changes might occur without notice. Older versions of the Midjourney model are still available by using the “/settings” command or the “–v 5.1” in-line command argument.

A comparison between output from Midjourney v3 (left), v4 (center-left), v5 (center-right), and v5.2 (right) with the prompt "a muscular barbarian with weapons beside a CRT television set, cinematic, 8K, studio lighting."
Enlarge / A comparison between output from Midjourney v3 (left), v4 (center-left), v5 (center-right), and v5.2 (right) with the prompt “a muscular barbarian with weapons beside a CRT television set, cinematic, 8K, studio lighting.”

For fans of this new image synthesis art form that is sometimes called “synthography” by proponents such as Julie Wieland, the changes in v5.2 are welcome ones, with some Midjourney users calling them “stunning” and “mindblowing,” which are not unusual superlatives in the hype-friendly world of AI at the moment. But fans would likely argue that Midjourney’s visual improvements do justify the astonished reactions among themselves.

The latest update is part of a series of quality improvements since March 2022, when the model generated relatively ill-defined imagery that lacked detail. Most recently, Midjourney introduced v5.0 in March and v5.1 in May of this year, both of which improved realism and image detail. The v5 model series introduction allowed the creation of realistic images of Pope Francis and Donald Trump that sparked concerns about deepfakes on social media.

Despite the excitement over the new features among Midjourney enthusiasts, image synthesis remains highly controversial among some artists due to how these AI systems are trained, utilizing millions of scraped images from the web without artist consultation, credit, or permission. Midjourney has never officially revealed the exact contents of its training data. Adobe is attempting a more ethical path forward with Firefly, but Venture Beat recently reported that active artist consent is still marginal.

For now, it’s hard not to appreciate Midjourney’s eye-opening technical advancements while still wondering if there is a more ethical path forward for this technology—one that pleases artists, both traditional and synthographer alike.

Source: Ars Technica

Posted in Uncategorised and tagged .