How Midjourney improves the generation of graphics

BY Anna Mitranka

Midjourney, along with DALL-E and Stable Diffusion, is one of the most popular image generators using artificial intelligence and machine learning.

Based on a textual description (prompt), it is able to generate hyper-realistic graphics, photos, 3D graphics or illustrations in a variety of styles. How can it help the agency? This is briefly discussed in the article.

How does Midjourney work?

Midjourney’s algorithms generate images from hundreds of millions of graphics published on the Internet. Every few months, the company releases a new model with slightly different features. Currently, the default is version 5.2, which generates the most detailed results with better colors, contrast and composition. This version also handles prompt interpretation better.

The latest version is currently 5.1, and features even greater consistency, better sharpness, greater accuracy in responding to text commands, and fewer unwanted edges or text artefacts.

However, if the results of our searches are not satisfactory, it is worth comparing them with the results obtained with earlier versions of the program. Simply add the — version parameter or its abbreviation — v to the command prompt, or configure the settings using the /settings command.

midjourney

How do I use Midjourney?

There is currently no free version available. Midjourney offers three subscription levels, Basic, Standard and Pro, starting at $8 per month.

The service is accessible via a bot on the official Midjourney server on the Discord platform. We can use it by sending a direct request to the bot, or by inviting the bot to our own server on Discord. To generate an image, use the /imagine command at the command prompt. The algorithms will generate four images. We can use the U (upscale) button to enlarge the selected one. We can also generate new iterations of the selected variant by pressing the V (Variations) button. If you want the algorithm to generate images from scratch for a given prompt, click on the arrow button (refresh icon).

Speed is a major advantage of Midjourney. Pictures are literally created before your eyes. In fast mode, you can create an image in about 1 minute. In slow (relaxed) mode it takes much longer. However, a good prompt is essential to ensure that this time is not wasted.

Interested in how the Midjourney can be used in your business?

How do you create a prompt?

On midjourney.com we find this definition of a prompt:

“A prompt is a short phrase of text that Midjourney Bot interprets to create an image. Midjourney Bot breaks down the words and phrases in the prompt into smaller chunks, called tokens, which can be compared to training data and then used to generate an image…”. (Source: https://docs.midjourney.com/docs/prompts).

To get results as close to our expectations as possible, we need to learn how to create advanced prompts. Such a prompt should specify elements such as:

Subject: character, animal, place, object, etc.

Technique: photography, painting, illustration, sculpture, drawing, tapestry, etc.

Environment: at home, in the garden, under water, in the city, etc.

Lighting: soft, neon, ambient, studio, etc.

Color: light, muted, bright, black and white, pastel, etc.

Mood: happy, calm, melancholic, etc.

Composition: portrait, close-up, overhead, panorama, etc.

Also important is the use of parameters that instruct the algorithms to change the way an image is searched. These are described in detail and illustrated in the ‘User Guide’ section of the Midjourne website. The most important ones are:

  • Aspect Ratio (–aspect or –ar ) – changes the aspect ratio of the resulting image. The aspect ratio is the ratio of the width to the height of the image. Example parameter: –ar 16:9.

  • Chaos (–chaos – the higher the parameter, the more unusual and unexpected the generation.

  • No (–no) – negative hints, –no plants will try to remove plants from the image.

  • Quality (– quality or — q) Specifies the amount of time to generate an image (for the current model, only accepts the following values: .25, .5 and 1). Higher value = more time to generate graphics.

  • Repeat (or repeat or — r) Runs the task repeatedly.

  • Seed (– seed) The Midjourney bot uses a seed number to create a visual noise field, such as TV interference, as a starting point to generate the initial image grids. Seed numbers are generated randomly for each image, but can be specified using the –seed or –sameseed parameter. Using the same seed number and cue will generate similar final images.

  • Stop (–stop)Allows you to stop the job mid-process.

  • Style (– style or — s)
    Allows you to modify the default aesthetics of a given version of Midjourney.
    Version 5.2 and 5.1 accept –style raw. Images made with –style raw have a lower level of automatic beautification. Niji 5 version accepts –style cute –style scenic –style original or –style expressive.

  • Stylize (– stylize or — s) The lower the stylize value, the lower the artistic value of the image. High values are artistically more interesting images, but less related to the hint.

  • Tile (– tile) Generates images that can be used as repeating tiles to create seamless patterns.

  • Version (– version or –v) The parameter already described above, which allows you to select the version of the algorithm.

  • Video (– video) Allows you to create short videos from generated images.

  • Weird (-weird or — w) Experimental function that allows you to get unusual results, modified with bizarre and unexpected elements.

Commands

Another important category is commands. The latest version 5.2 introduced the /shorten command.

/shorten

Analyzes promet and highlights important words and suggests which words can be removed.

Other commands include:

/blend

Allows you to blend selected images together.

/describe

Creates four sample prompts based on the uploaded image.

/remix

Allows you to experiment with successive variations of images.

/info

Displays user account information, as well as current tasks.

/settings

Displays the settings of the Midjourney bot.

midjourney

Wersja 5.2 – ulepszenia

In addition to the aforementioned /shorten command, the new version of Midjourney has been enhanced with interesting features worth exploring like Zoom Out, Make Square and customizable variations.

Zoom Out

Now you can use the Zoom Out function, which mimics the zoom out in a photografic camera, generating a scene around the generated subject. Just generate the image, then use the Zoom buttons underneath. The zoom factor is x1.5, x2 or a custom value from 1 to 2.

Finally – we also have Custom Zoom, a function that allows you to manipulate the aspect ratio of the image – for example, after setting Custom Zoom, you can edit the aspect ratio (–ar) and set the desired Zoom value.

Make Square

There is also a Make Square function, which allows you to get a square format from a post-obtained graphic. For images with aspect ratio other than 1:1, this function will fill in the missing space so that a square format graphic is produced.

Configurable variations

Users now have Vary Strong and Vary Subtle buttons to modify the results without changing the prompt. Vary Strong allows a stronger modification of the result, Vary Subtle a softer one. The changes achieved by this feature are not predictable, and thus allow experimentation with the results.

Midjourney – pros and cons

Midjourney’s advantage is certainly its ease of use. The tool makes it easy to create graphics, even for beginners in the world of AI. The use of parameters allows you to adapt the operation of the algorithm to the expected effects and gives you the possibility to control the effects to a large extent.

The drawback of the tool is still the limited realism. And not just in terms of common errors such as the number of fingers. It is simply that the algorithm can interpret textual queries in unpredictable ways. This means that changing one word in the prompt can result in a completely different image being generated.

Do you want to use Midjourney effectively for your purposes?

Midjourney and agency work

Advertising agencies have been quick to embrace the possibilities of Midjourney.

It is particularly useful for visualising ideas – whether for internal brainstorming or for presenting ideas to clients. In just a few moments we can create an image that allows the client to feel the concept rather than just imagine it. Midjourney allows us to go beyond the limitations of image banks and create the most unlikely graphics or images on demand. It’s also a great tool for creating mood boards. In our experience, it works quite well as an aid to storyboarding – although as consistency can still be a challenge, this tool should be seen more as an aid to the work and a way of getting it done faster.

Generated graphics:

Hyper-realistic image of a simple monochromatic cup of coffee with a latte art pattern at the top. The latte art pattern includes musical notes and a heart. A green cup on a monochromatic pink background. –ar 4:5 –s 750 –v 4 -.

Illustration showing the entire globe in space. A little boy and a little white bear are standing on top of the globe. The boy is holding a candle in his right hand and stroking the bear with his left. 

midjourney

Transparent spray plastic bottle for detergent filled with many layers of soil, sand, grass and a single small daisy flower. White bacground. Hyperrealistic graphics. –ar 4:5

A hiperrealistic lollipop made of fried chicken, red background –ar 4:5 –s 750 –v 5 –c 50

A bottle of olive oil at yellow background, surrounded by yellow fruit and vegetables, flat com position, monochromatic, essential, hyperrealistic –ar 4:5 –s 750 –v 5 –c 50

A storyboard full color sketch depicting people packing crates of olives onto the ca r’s trunk in an Olive Garden, wide frame, full-body image –ar 7:4 –s 750 –v 5

Summary:

In conclusion, Midjourney represents a significant advancement in the field of AI-generated graphics. Its ability to interpret natural language prompts and produce detailed, realistic images opens up new possibilities for creativity and efficiency in graphic design. While it has its limitations, the ongoing improvements and updates to Midjourney suggest a promising future for AI in the creative industries.

FAQ: Exploring Midjourney in Graphic Generation

  1. What is Midjourney?
    • Answer: Midjourney is an AI-based image generator that creates hyper-realistic graphics, photos, 3D graphics, or illustrations from textual descriptions.
  2. How does Midjourney work?
    • Answer: Midjourney uses algorithms to generate images from a vast database of internet graphics, interpreting natural language prompts to create visuals.
  3. What are the subscription levels for Midjourney?
    • Answer: Midjourney offers three subscription levels: Basic, Standard, and Pro, starting at $8 per month.
  4. What are the pros and cons of Midjourney?
    • Answer: Midjourney is user-friendly and versatile but sometimes limited in realism and can interpret prompts unpredictably.
  5. How is Midjourney beneficial for advertising agencies?
    • Answer: Midjourney aids in visualizing ideas quickly, creating unique graphics, and assisting in storyboarding, enhancing creativity and efficiency.

Anna Mitranka
copywriter

Hi! Are you interested in this topic and would like to discuss similar activities in your company?

Fill out the form and schedule a free 30-minute strategy call!

    Dziękujemy za wypełnienie formularza!

    Wkrótce skontaktujemy się z Tobą, by umówić zakres i termin spotkania. Na tej podstawie dobierzemy eksperta, który poprowadzi konsultacje.

    Życzymy udanego dnia :) zespół Neon Shake