Experiment on Reference Parameter in Midjourney — cref, sref

Florent
5 min readMay 22, 2024

--

Summary

  • The character reference parameter makes the referenced image the main theme or subject.
  • If the referenced image has a distinct figure, the model uses it as the main subject and mixes other elements like style and color with the prompt and style references.
  • Without a distinct object, the model closely mimics the original referenced image’s composition, borrowing some concepts from the prompt or style reference.
  • The model struggles with phrases that objectively describe the image, such as ‘barrel distortion’.
  • The character reference parameter strongly preserves the referenced image’s subject, so the style parameter is better for borrowing concepts without the subject.

Character Reference ( — cref)

  • Charater reference parameter enables the model to create images of the same character in the prameter.
  • (My interpretation) The parameter brings the subject (or alike) into an image generated, and it has a little impact on style and medium compared to style reference parameter.

Style Reference ( — sref)

  • Style reference parameter influences the style or aesthetic of images you want Midjourney to make
  • (My interpretation) Midjourney’s model borrows the style in the parameter — such as color, medium, composition — not subject in it. It has an impact overall but the subject.

References for the experiment & Characteristics

[A] Mondrian’s work from Shutterstock, [B] A fashion photo from Capture Magazine — https://www.capturemag.com.au/advice/the-future-of-fashion-photography

Each reference has a kind of converse color spectrum and dimensional expressions.

  • [A] Mondrian’s work: De Stijl, Cubism, Straight lines, Strict angles, Solid Colors
  • [B] A Fashion Photo: Black & White, Woman in wild eyelines, dramatic facial expression, Bold circles

Prompt

a perspective in highly barrel distortion, yellow and green, a woman gazing at the front without emotion — cref [REFERENCE 1]— sref [REFERENCE 2]— ar 16:9

Additionally, I wanted to know if a phrase in terms of perspective, arrangement and color when using reference parameters, so added ‘a perspective in highly barrel distortion’ which is also known as ‘fish-eye lens’ and ‘yellow and green’ phrase.

Result

Comparision

Case 1: cref = Mondrian’s work, sref = A Fashion Photo

Case 1

In regard to references

  • Mondrian’s work dominated almost all of the image from style to subject.
  • A Fashion photo’s existence of ‘woman’ and characteristic ‘circle pattern’ has mingled into the image, and they go hand in hand such in a way of the original composition.

In regard to prompt phrases

  • Both ‘barrel distortion’ and ‘green and yellow aren’t taken into account.
  • The expression of ‘gazing at the front’ is depected in only one image.

Case 2: cref = Mondrian’s work, sref = A Fashion Photo

Case 2

In regard to references

  • The woman almost identical to A Fashion Photo became a main subject
  • Straight lines and strict lines from Mondrian’s work were adapted to its background, mixed with the circle pattern above the woman in the original Fashion Photo.

In regard to prompt phrases

  • ‘Yellow and green’ is well applied.
  • ‘Barrel distortion’ isn’t in place.

Takeaway & Interpretation

  • Character reference parameter has Midjourney model make an image whose the referenced image is the main theme or subject.
  • I speculate that if the character-referenced image has a distinctive figure, the model use it as main subject and mix the other substances in cref image — such as style, color, pattern — with prompt and style references.
  • If it has no distinguishable object in cref image, Midjourney model might strongly compose the image in the almost identical way of the original referenced image, partly borrowing some concepts in prompt or style reference
  • Again, like previous experiment, the model can’t properly finish the job related to phrase describing the image as an object such as ‘barrel distortion’ (objective description).

I really wanted to apply fish-eye lens style to the image, so I figured out how to realize it. According to the official document of Midjourney, it’s posslbe to use multiple refernce parameters, so I’ve got new famous reference.

Harry Styles!

Prompt — sref 1 = A Fashion Photo, sref 2 = Harry Styles Album Cover

fish-eye lens photography, yellow and green background, a woman gazing at the front without emotion at the center — sref https://s.mj.run/YqHVLOMjQg8 https://s.mj.run/H0zVvl8yPBE

Result

Prompt — cref = A Fashion Photo, sref = Harry Styles Album Cover

fish-eye lens photography, yellow and green background, a woman gazing at the front without emotion at the center — cref https://s.mj.run/YqHVLOMjQg8 — sref https://s.mj.run/H0zVvl8yPBE

Result

Takeaway & Interpretation

  • A prompt which Midjourney’s model cannot interpret in an intelligible way should be referenced with style parameter to adapt the style such as arrangement and composition
  • Character reference parameter strongly maintains the subject in the referenced image, so if you want to just take some concepts of photo, it would be more effective to use style parameters.

--

--