How to fine-tune your AI images with these simple prompting techniques

Andrew
5 min readOct 10, 2022

Generated an AI image, it’s close but not quite what you want? In this article, I will teach you a few simple prompting techniques to let you dial-in the details of your images.

(If you are new to AI art, read this article first which covers the basic of creating a prompt.)

Software

We will use this Stable Diffusion GUI for this tutorial. See my quick start guide for setting up in Google’s cloud server. Note that many of the techniques outlined in this article only works on this software.

1. Adjust keyword strength with () and []

Use () to increase the weight of a keyword. Use [] to decrease. Below is a famous example taken from Web UI’s feature showcase. This is the original image:

Egg and Bacon

You can increase the emphasis of egg in the image by adding () to the keyword egg:

Egg and bacon, emphasize on Egg

Likewise, you can do the same for bacon:

Egg and bacon, emphasize on bacon

As you can see, the more weight you put on a keyword, the more bias the image towards the keyword. The effect of multiple parenthesis is multiplicative. Each parenthesis represents 1.1 times increase in weight. In other words:

(keyword): 1.1
((keyword)): 1.21
(((keyword))): 1.33

Similarly, the effect of multiple brackets is

[keyword]: 0.9
[[keyword]]: 0.81
[[[keyword]]]: 0.73

Use [] to suppress unwanted objects

Celebrity name is a strong effect. Some names are strongly associated with certain objects. For example, Steve Jobs is highly associated with the Apple logo (see image example below) because they often appear together in the training data. What if you just want to generate a portrait of Steve Jobs? One method is to generate a lot of them. By shear chance, you will get an image you want AND without the Apple logo. But this could unnecessarily take up resources. Alternatively, you can suppress the keyword Steve Jobs to reduce the chance of generating the Apple logo, an association, in the hope that the main effect, Steve Jobs, still remains. You would usually need to play with how many [] to achieve the desired effect.

Steve Jobs is strongly associated with Apple logo.

2. Suppress unwanted objects with negative prompts

Another way to suppress objects or styles you do not want is to use negative prompts. In fact, many practitioners use boilerplate negative prompts. Below is an example for portraits.

((((ugly)))), (((duplicate))), ((morbid)), ((mutilated)), [out of frame], extra fingers, mutated hands, ((poorly drawn hands)), ((poorly drawn face)), (((mutation))), (((deformed))), ((ugly)), blurry, ((bad anatomy)), (((bad proportions))), ((extra limbs)), cloned face, (((disfigured))), out of frame, ugly, extra limbs, (bad anatomy), gross proportions, (malformed limbs), ((missing arms)), ((missing legs)), (((extra arms))), (((extra legs))), mutated hands, (fused fingers), (too many fingers), (((long neck)))

All you need to do is to list everything you do not want to see.

3. Blend two keywords

What if you want to blend two faces? Keyword swapping would allow you to do that. In fact, this is an important technique to create new looks. Celebrity name is a strong effect. By using names of actors and actresses, it is easy to generate good-looking faces. But the downside is the faces are too recognizable. Everyone can tell who he or she was, and you lost novelty. Keyword swapping allows you to blend two readily recognizable faces together to form a new one.

Use syntax

[person1 : person2: amount]

to blend two faces at different degree. Below is an example

Keyword swapping from Joe Biden to Donald Trump

The last number ranges from 0 to 1. As an example, if the number of sampling steps is 40, specifying amount = 0.75 swaps the keyword joe biden to donald trump at step 40*0.75=30 steps.

This technique can be used to generate novel faces. Below is an example.

[Emma Watson: Amber heard: 0.85], 40 steps:

Blending two faces.

Full prompt is

[Emma Watson: Amber heard: 0.85],((Victorian)) , Feminine,((Perfect Face)), ((arms outstretched above head)), ((Aype Beven)), ((scott williams)) ((jim lee)),((Leinil Francis Yu)), ((Salva Espin)), ((oil painting)), ((Matteo Lolli)), ((Sophie Anderson)), ((Kris Anka)), (Intricate),(High Detail), (bokeh)

You can go one step further and blend 4 names together:

[Evan Rachel Wood: Jennifer Lawrence: 0.75], [Jennifer Aniston: Jennifer Connelly: 0.85], 40 steps:

Blending 4 faces.

Full prompt:

[Evan Rachel Wood: Jennifer Lawrence: 0.75], [Jennifer Aniston: Jennifer Connelly: 0.85], Feminine,((Perfect Face)), ((arms outstretched above head)), ((Aype Beven)), ((scott williams)) ((jim lee)),((Leinil Francis Yu)), ((Salva Espin)), ((oil painting)), ((Matteo Lolli)), ((Sophie Anderson)), ((Kris Anka)), (Intricate),(High Detail), (bokeh)

You got the idea. This technique allows you to generate novel faces with precise facial features you want. Just pick the celebrities with certain looks and blend them together with keyword swapping!

Summary

In this article, we have gone through techniques to control image generation by adjusting strength of keywords, adding negative keywords and blending keywords. These are invaluable tools to dial-in your image. Hope you find these techniques useful. In the next post, I will cover how to redraw an area of the image with inpainting. Stay tuned!

(This article originally appears in my blog)

--

--

Andrew

I write about AI and internet business. Check out my new stable diffusion site: https://stable-diffusion-art.com/