Hello everyone! My friend and I are working on a project together and we want to keep track of our progress and which prompt produced which image. We tried saving everything into a shared folder, but that didn't work out well because we wanted a better way to view the information. Is there a tool that can help us keep track of the prompts, trials, and other information, something similar to Weights and Biases?
#📝|prompting-help
1 messages · Page 12 of 1
hi, newbie question, what is this? is it a different UI tool other than Auto1111?
Yeah. Apparently we now have Comfy UI and Invoke (which I've not used yet)
Auto needs a good categorization system for add-ons now 🙂
We should also have some security system, or at least user reports for safety ratings
@drowsy chasm I'm not sure if this helps you, but the Houdini interface for Stable Diffusion, called MLOPs, now contains a field for writing information into the metatags of the JPGs it generates. I typically write my positive/negative prompts, the model, CFG, steps, and image guide strength. It's nice to be able to go back to an image a few weeks later and discover what prompts you used to generate it.
this sounds amazing! thank you so much will test it out
I see a lot of text to image for qr codes but how do i do image to image ones
you mean an image to qr code ?
kinda. Im looking for something like controlnet shuffle but it shuffles it into a qr code
What's the prompt to hide a hand? It keeps appearing if front of the character's face but I can't get it to just.. not be there
How important/relevant/effective to use characters like ( ) | to separate semantically distinct things in a prompt? Or, I saw people using BREAK between smaller phrases. Is there a guide or a common knowledge regarding those? Thanks!
The prarenthesis are used to add weight to a concept contained within. Not sure about the vertical pipe symbol. I've never used that. I believe the BREAK is used on long prompts because there is an initial 75 token limit. The break signals the processor to add another 75 so tokens aren't lost.
In my experience, good sentence structure is a better approach to distinguish separate concepts rather than symbolic tricks. However, Stable Diffusion is not very good at managing more than two concepts. Often, portions of the second concept you introduce will bleed into the first. That's why its hard to render 'some dude in a black suit dancing with some chick in a purple sequin dress'.
thanks a lot, that's very helpful 🙏
@thorn dock | is i believe from dynamic prompt extension
https://github.com/adieyal/sd-dynamic-prompts#basic-usage
and break could be in prompt as well when you are using Regional prompter, it change all keyword in BREAK
hello there
i have a problem with regional prompting
sometimes the ai switches around the sides of the screen
and sometimes it only looks at one prompt
with:
a portrait of a woman with black hair and red eyes
ADDCOL a portrait of a woman with blonde hair and blue eyes
i sometimes get
here it got the sides correct
but the eyes are purple
and the last one is literally perfect but switched sides again
uh
what do i do?
its pretty confusing
How to avoid this?
Click on hires fix
how do i give negative prompts
Anyone know how to use Temporal Kit? I followed this tutorial along to 15:30 when he says to click Prepare EBsynth and nothing happens, I don't get the key or frames folder created or filled. https://www.youtube.com/watch?v=rlfhv0gRAF4&ab_channel=enigmatic_e
edit: I manually created those 2 folders and then it worked
This is a tutorial on how to install and use TemporalKit for Stable Diffusion Automatic 1111. This extension uses Stable Diffusion and Ebsynth.
HOW TO SUPPORT MY CHANNEL
-Support me by joining my Patreon: https://www.patreon.com/enigmatic_e
SOCIAL MEDIA
-Join my discord: ...
@thin creek i have solved your hair issues. But red eye i cant will try again. I just checked common prompt and in prompt i have
.
.
Woman portrait ADDCOMM
blonde hair with Green eye ADDCOL
black hair with blue eye
I have not single wrong blonde black side
But trying on photorealism i have not anime.
i found some similar model
ooooh. i will try putting them at the back of the line
thx a lot
do you have regional prompter extension? @thin creek
yes
ok
is there a way to fade the left image into the right?
or merge 2 different styles in the middle
i seem to have troubles using 2 different loras for each side. it just applies both to the whole image
try apply them in each collumn not before ADDCOMM. What is before ADDCOMM is for whole image. If you put it there then this is the reason.
i have it like:
stuff i want ADDCOMM
<style1> style1 ADDCOL
<style2> style2
the styles in <> are loras
o.k. tried and result is very bad. try move them in ADDCOMM line. I am going to try something as well
kk
Ok. Thanks
@thin creek doesnt work. But i have only loras for general things, not for persons.
well....it works with inpaint
np, pitty it doesnt work, as well probably very depends on model
only weird part is that here i only coloured the bottom left part
yet it changed the eye completely
i think it is because it is on half of the image.
How about generate some image without loras, send it into img2img and here use controlnet iP2P with lora?
is this where I would gofor help downloading or no?
#🤝|tech-support @hybrid harness
iP2P?
Thank you
never heard of it
it is part of controlnet
you have statue of Eva, and you can type turn her into statue of Adam
wow your help with regional promting was amazing!
i am not sure....
look
works perfectly
just how i wanted
idk where the horse comes from
but looks cool too i guess
have you use addcomm? probably horse is part of your figure, i mean some word that triggering horse.
hmm
lora:castlevania_style_offset:1 ADDCOMM
(beautiful moon: 1), dark sky ADDCOL
ADDROW
ADDCOL
a portrait of a beautiful female knight with blonde hair in armor
this is what i got
nice npc for a dnd campaign
knight can trigger it i believe, not sure about castlevania
nvm
i got sth cool instead
@orchid ore i cant fix the eyes but i am already happy
that is controlnet, for sure good to install,
yes 😄
but the horse is the only one whoi enjoys it
ill try it out
this is weird ... why is it related to age..
girl 6-12
female 17-25
whenever i use these tags i get age difrences
girl shouldn't mean kid female...
woman weird as well would suppose >18
@teal valve That's exactly what the word 'girl' means {a female child or adolescent}. A woman is an adult female. I find it incredibly shocking to see so many prompters using the term "1girl."
adolescent is older than 12, isnt?
Yeah but then allot of tags like 1girl, catgirl, doggirl and so on end up backfiring at you
And i gotta go around for example i go with
Mature female, fox ears and big tail.
Kinda anoying that i can't simply use a single prompt for that
try use magical "-" in mature-female. Or adult-woman or whatever. It is mighty symbol
Hello everyone, I would like to know if does exist any guide, to learn about the "base" for an optimal syntax/structure for a prompt...?
Which model has best or most variants of "horns" and what is the best way to prompt them using inpaint sketch?
1girl is a booru tag for anime models if you want a single girl/woman in the picture. it won't work right if you aren't using an anime model
Same with 1boy
Can I take one of the generated images and get the AI to modify only parts of it? I need the character to have longer hair and I want to remove the random blue jewel on a necklace it gave him.
@drowsy gazelle you know what? it is working with photorealistic non anime models as well. With some probably better and with some probably worse, model i am trying work well 1girl
you can send it in img2img and there choose inpainting @strange galleon
anyone know what model this image used? it wasnt listed on thee civit ai gallery ppost
i want to know since it looks like QQ/tencet AI generation style
let me pull up some example images
have you tried if it has some metadata?
let me see
well, the image didnt have any, but the other images in he same post had used anything v33 and the taiwanese food lora
There are a lot of realistic mixes that also have some anime in them. Example: I've made anime stuff with deliberate before.
so at least something. It is difficult get exact model. With prompt and seed probably posible, but difficult.
i think that 1 girl Blip or Clip understand as well. Mmnt
i did it with 2.1 but cant find it somewhere, so i dont know 🙂
is there a way to fix a great image that has clipping problems? ie a backpack strap going though the shoulder
So sometimes when i click on the lora i want to use it does something and othertimes it doesnt. What is the difference between calling model keywords and calling a lora model in the form of lora::1(name)
hiiiii, my name is Florencia, I'm wanting to launch a beta version of a cooking app and I want to know what kind of prompts can work to generate recipe images. If you can give me a hand I would be very grateful
I don't really think SD can make recipes like that? I think ChatGPT would be better IMO.
SD can hardly handle a few words without garbling everything
Hi guys, I am trying to re create a character from a book series. I was wondering if anyone can help me with it. The character's name is "Skippy the magnificent"
do you guys have any recommendations for sprite making websites
and sprite as in the pixel sheet
Guys, how can I make better full body images without the face lose quality? I'm suffering a bit on that
Any prompts for showing less skin? She's a warrior not a pole dancers.
Prompt - Masterpiece, best quality, a beautiful dragon-woman, fantasy, Dungeons and Dragons, detailed red hair, detailed dragon yellow eyes, detailed black dragon wings, Serrated Horns, wearing yellow medieval armour
Negatives - Sexy, Big Breasts, NSFW, lewd, risque, nude, hat, laser, lightsaber, lowres, bad anatomy, bad hands, bad faces, text, error, missing fingers, extra digit, fewer digits, cropped, worst quality, low quality, normal quality, jpeg artifacts, signature, watermark, username, blind, bad eyes, ugly eyes, dead eyes, blur, helmet
Hi! Im using txt2img.py to generate images, is does not has negative prompt as parameters but I did some research and it looks like if you give the model for example a prompt a man riding a horse - tree, dog, cat:-5.0 the prompt will be a man riding a horse and everything after - will be negative prompt, and the -5.0 will be the negative weight, can someone confirm it?
I'm assuming it mainly from some issues on github and this blog post by stability ai https://stability.ai/blog/stablediffusion2-1-release7-dec-2022
Hey guys, can anyone help me with a prompt? How can I force angles like these? I got that by pure luck. I had this in the prompt (front view:1.5), (straight on:1.5) (upper body:1.5) plus a lot of other stuff.
Most of the times, the subject of the picture is just sideways while facing the camera
like this, but facing the camera.
how do i make a prompt for spiderman so they colors of the suit is different then blue and red
I wonder if looking at viewer would work
That only makes the head turns to the viewer, the body still stays at an angle most of the times
anyone have any tips on showing 2 characters with extreme size differences? for example, peter pan and tinkerbell
@calm spindle front view,
from front,
They are fine and should work, however the anime models have numerous biases, one of them having long hair forcing a side view to show it off.
Therefore, forcing the pose with controlnet is a good option.
ah, that may be it
Ill search a bit for controlnet
Thank you
Welp
(from front:1.5),
blue sky,
(white clothing:1.5),```
really dont get how these things works tbh
You cam do that with Tiled Diffusion Extension with Prompt Control
Or Regional Prompter
use : portrait, medium shot, frontal shot
1 girl, (white clothing:1.5), looking at front
blue sky in background
.... add to negative : (side view:1.5) .... this will slightly increase your chances of getting that pose without controlnet
try it with:
(wearing green gold spidernet suit)
@silver valley @buoyant charm badass, these both work great, thanks
RP uses a bit unfortunate terms (base prompt and common prompt) that I always forget how to use properly. It's explained in the readme but I wish they chose something more intuitive...
ok some people dont know this but you can fuse two things with | like this cat|fox
what i dont know how weight here works is it like this ? (((cat))) or do we use numbers ?
Each pair of parentheses is the same as multiplying it by 1.1. so 3 parentheses= 1.331.
You can write (cat:1.3) to get a similar effect.
I think the numbers are better because it's cleaner and more obvious what number you're going for. With the parentheses, you might screw something up by not closing them right
if you mean the weight in relation to the general prompt, I use this form ((cat|fox)), I have never tried the other one
How do I remove NSFW filter from automatic 1111
so i would write (cat:1.3|fox:1.5) ????
you should question this in the automac 1111 discord there are plenty of people who make NSFW content who are experts for your needs 🙂
Discord
Link
Ya, it took me a while to figure it all out, but it's very powerful once I got the hang of it
If I use upscaling, the images take way too long to load, even when using the medvram command
is there any other way to make things faster?
Are you generating and upscaling at the same time?
Normally I generate an image and then upscale it afterwards if I like it
Here's a tutorial on upscaling - you can do it multiple times on one picture
Does anyone know what style is this?
I can't replicate it in sd
do you happen to have the original image file with the metadata?
metadata2go.com
This online metadata viewer will show you all hidden metadata info of audio, video, document, ebook & image files. Online exif data viewer without installation!
If you had the original file you could use https://github.com/SupaGruen/StableDiffusion-CheatSheet to check for all the relevant data
Tried it: No Stable Diffusion EXIF data detected
then i would suggest image searching for similar styles or trying to contact the artist
Not sure if this is the right channel but why do I get this half shade on the eyeglobe when inpainting and how do I get clearer eyes that is not so hazy? Is it higher res, settings or prompt?
i have a dreambooth model on myself that i trained using realisticvision 2.0, 30ish pictures at 1500 steps. i also tried with 2000 steps and got the same results. i'm finding the model only gets my face right with some serious prompting. i've found that "detailed ___ face" and "natural skin" get me close sometimes, but other than that it's a guessing game. does anyone have any tips?
Yeah, about my previous question
I have slow results because my GPU is a 1660
apparently it operates prompts at 1/5 or 1/6 speed of a 2060 so I'm thinking about replacing my GPU with that
is it worth the investment?
invest in cloud gpus, you can get good gpu's (rtx 30 series) for less than 0.2 cents per hour
Basically, you rent powerful gpus on the cloud. e.g vast.ai
And you pay only for the duration you use it
do some research, it's worth checking it out
Hi, can someone explain what the 'AND' function does in a prompt? hard to search for it since and is a commonly used word lol, TIA
Here is the wiki Information for AND:
https://github.com/AUTOMATIC1111/stable-diffusion-webui/wiki/Features#composable-diffusion
ah thanks, i was on there but didn't spot that, cheers. i still don't quite understand what it is doing, it does something that i like though
hey anyone here got some experience with the extension grounded sam's prompts?
talking about the grounded dino section
Does anyone have any tips or good resources I can read on how to generate specific scenes? Im finding it hard to wrangle SD into doing it through prompting, and can't find images close enough to it to use controlnet to do it
What promt should i use to get images in this Art Style
Hey y'all I'm new to Stable, how do I make good prompts? I heard there's some logic behind it but idk
Sure, thanks man!
Hey - How can I add specific elements to my image like aloe vera leaves using prompts
Is there any way to prevent hires.fix from changing the image like this? Left is Highres fix.
I am using DPM++ SDE Karras as Sampling Method
Sure you need to select an ESRGAN based upscaler like esrgan4x anime6b
Then set the denois to 0.25 for example.
The lower the denois the less the image get changed
Thank you!
Thank you agian @silver valley, its working flawlessly. Have been able to quickly generate them without hires to get a preview and then use hires when I am ok with it 🙂
I changed to UniPC as sampler, 4x UltraSharp with a denoise of 0.4
Np, looks awesome!
Does SDXL on Clipdrop support negative prompts?
There is no separate field for it, is there perhaps some special syntax for it, or is that simply not a thing on Clipdrop?
I don't think they have negative prompts
Alright. Thanks for the info!
it's giving me some very bad results lately, what happened?
"doctor holding patient's hand in bed and hiding medical bill behind his own back"
all generated images are like this, no sight of the medical bill (or the doctor's back)
How can I "isolate" parts of a prompt? For example if I want a picture with two cars, one red with large wheels and one blue with small wheels, how do I prevent the prompt from bleeding over?
Hi, can somebody help me with this?:
Type: IMG2IMG
Image: potrait photo of me
Prompt: full body shot, bald
**Expected output: a full body shot of me bald
Actual output: the same potrait photo of me, but bald**
If you are using ComfyUI this person demonstrated their workflow to achieve something to this effect:
Found it while googling about your question because I was wondering something similar
reddit
1,596 votes and 173 comments so far on Reddit
Anyone know of a good model for styles like this? This is from reddit, don't even think it's ai but still the style I'm looking for
It's an original illustration
search in Civitai for Ghibli? study it is very likely that SD recognizes this study/style without a model but in Civitai there are them
I'm tryng to generate an image of a Throne, but every single throne I get got a king on it
I want just the throne
so do we all ^^
other than the glossy hair any suggestions for making this more realistic?
How would I go about reverse engineering an image to get prompts that generate similar images?
I know about clip decoders and midjourney's "Describe" feature, but what is the best one to use with stable diffusion done locally or free online?
Automatic1111 webui has a clip interrogate feature in img2img that can be used for that.
I’ve tried using it but it loads forever, and nothing happens- though everything else works perfectly
Longest I’ve let it sit for is 30 minutes- don’t know if it’s supposed to take longer
You should then check the console when its running
If it runs into an error you can post a Screenshot in #🤝|tech-support
Np 🙂
add "No Humans" add "object concept" to positive .... add humans and people to negative ... try removing Portrait reemplace for full view or full shot .... you can add Throne Focus .... use object concept art or "Gold Throne concept art" ..... check effect of : removing the part that mentions Carl Karssson.....
What's "object concept"?
Gold throne? but is a old tag in boru models maybe work ...
@worn dome Now I'm seeing the effect of some words from your original prompt,
conflicts between details and full view ... or excesive attention to "object" word and no to "white background" part .... (Txt2Img).. but you are in Img2img... (image guiance) ..@worn dome
@worn dome ok image whit metadata..
anyone know of a good Lora or just general prompting to get something simliar to this? tried many variations of hockey mask covered in goo, SD doesn't really know what a hockey mask is, or just thinks its a surgical mask
Recommended positive/negative for prompting out hands? Found some amazing LORAs but they have some serious problems with fingers.
Without knowing which are those Lora that you mention, I don't understand you well, has tried with embedding / loras--- that improve the generation of hands? .... https://civitai.com/tag/negative
If you really don't want hands on composition you can try
"hands behind back", "hands in pockets", "hands behind head", maybe "hands out of frame" will work... or "No hands/Handless"
Browse negative Stable Diffusion models, checkpoints, hypernetworks, textual inversions, embeddings, Aesthetic Gradients, and LORAs
Thank you kindly, I thought I had found all the hand negs but I had not. badhandv4 is helping tremendously. I try to keep details of the art to a minimum as it's adult-oriented and I know this is a SFW server.
hello, in negative prompts, does putting brackets make a difference in the image?
like for example: (worst quality, low quality:1.4), or worst quality, low quality:1.4,
is there a difference?
It's just my opinion, I haven't read the theory behind it, but I think that if the model has a certain influence from models based on tag systems, then if it does have an influence, for the quality descriptors you mention it's hard to see why. crossed attention has a lot of influence, but for other tags such as excluding a specific color or a certain clothing, those changes are more clearly appreciated ...
May be a dumb question but Google hasn't helped. Tried Anything and a few others, but I can't get the effect I want. I want anatomy such as the nose size, eye shape, lips etc to be carried over to the anime style image so it really looks like the person and not an anime approximation. It seems the way I'm doing it, it's just the vague hair style, skin color and eye color of the original image. Any prompts to help this?
'' , '' when writing prompts is necessary ?
@shy sun....really anime no is real 😄 .. and real no is anime ... You may have a more favorable result if you only apply a style that adds typical anime strokes and colors to a photograph (Img2Img)... but that has been available for a long time in photography programs and smartphones
@sullen orbit ?
@obtuse torrent ''1woman,(dark skin:1.2),earrings,hairband,ponytail,hair ribbon,hair clip,long hair '' for example, these '' , '' are needed to separate the words, or not?
@sullen orbit normally if is it have less of 75 words (tokens) in** amime models** (tag sytem models ?) "," It is not "necessary" but it has consequences .... for exmaple ... compound words can be tokenized wrong... or when you go past 75 tokens A1111 will not split the prompt the same way it would if there was a "," in it.... resulting in a different relative weight assignment and that can cause some changes in the generation ---
(It's my opinion, I may be wrong, let's see if someone else contributes something)
can someone help me to create a character like this in automatic1111
why is it impossible to simply prompt a servant pouring a drink into a Kings cup
it always bugs out and never works
could always do depth map, blender, or openpose
hello, im not sure if this is the right place to ask. I don't need an answer but more of the concept so i know what the theoeretical path looks like.
I've noticed that models really influence the look of characters, specifically im using it for anime, the faces, the eyes, you can really tell what model someone is using from how the style is(at least, thats how it feels being 1 week new to this stuff). I see the model explained as the 'artist' (in really simple terms) so it makes, but what if i wanted to generate something with quite a specific style?
IF we use persona 5 as an example (this image), something super unique, really simple, if i wanted to theoeretically make something similiar, am i correct in thinking that the only possible way to do this would be to train a lora/model/something? which obv would need a lot of references and realistically might not be doable hence the thereoetical, or am i understanding this all wrong? I'm pretty sure from everything im seeing the answer is training but i just wanted to check here first before i dive into it
Well it would be hard to do it without a model but not impossible
do you have an example of the style or character you want to use
hmm, if we stick with the example above, i wanna convert an already existing character into that style
my very noob and limited guess is its not possible but maybe im missing something obvious or smth :O
Is there a current list of known artists in SDXL ?
What Promt For Greek Statue Like this ?
Question for you prompting wizards...
When blending a prompt, such as [ cat | dog ] , does it work the same to add more words to that blend, such as [ cat | dog | chicken ] ?
yes
and if you need 75% cat and 25% dog ---> [cat|cat|cat|dog] ... you got the idea?
https://github.com/AUTOMATIC1111/stable-diffusion-webui/wiki/Features#alternating-words @umbral veldt
Greek Statue ?
Greek Statue ? , "[god name] god/goddess of [god/goddes ambit description]"
So you couldn't weight words like this to get the same effect? [(cat:0.75)|(dog:0.25)]
IDK..
test it,
I have never done it ,
@umbral veldt
The way I see it, that way of writing it maintains the same relative weight between cat and dog... but it would make both lose relative weight against the rest of the prompt... test is necessary ..
Hello everyone, i have a stupid question, every time i put i color inside my prompt, i always see that a lot of other elements inside the producted image will be of that color, there's some way to prevent this?
You can add extensions like controlnet or regional prompter, which help to subdivide the composition or guide the generation in a better way, but you must familiarize yourself with them to take advantage of them...@north canyon
https://github.com/Mikubill/sd-webui-controlnet.git
https://github.com/hako-mikan/sd-webui-regional-prompter#2d-region-assignment-experimental-function
How do you combine two words together if you don't have access to the [dog|cat] format?
Is there another way to do it, like dog AND cat?
@drowsy gazelle .... the [dog|cat] format? .... is a feature of A1111... webUI... it.. alternate 2 (or more) words in each step of generation ..... IDK about others UIs ...
Yeah I'm using another version that doesn't have it
sorry chaps, how do you actually ad negative prompts?
like in a1111?
Whenever I try to generate a character in 1960x1080 (landscape) it generates multiple of the same object twice or thrice, sometimes merging the hands together. How do I make it so it only generates one of the character and fills the rest with background?
This is something common when it comes to directly generating this type of resolution.
I recommend generating an initial image at a low resolution and then achieving that with tools like Hires.fix and/or upscalers
For example:
653x360 initial
Hires.fix (x2)
- another upscaler (x1.5)
or any other similar combination
maybe you can generate a:
980 x 540 + Hires.fix (x2)
... Add "clone" "cloned" "Duplicate" "jpeg artifacts" .. to negative
Thanks. I messed around with some hires.fix settings on automatic1111. What should my hires steps and denoising strengths be set to ideally?
if you set steps at 0 .....it is normally is fine (repeat the same value that you have for the normal generation) ...D.S. depends a bit on the scaling method, normally 0-51 is possible...
05-30 if your base builds are fine and don't need much change
30-51 if you need to give the AI more freedom to alter what is generated at the beginning....
Alright. Thanks
for anime models? it is tag "upside-down" ... "lying", "on_back", "legs_up" and you need a lucky seed ||to no get a NSFW pose||
Denoising strength can only be 0 to 1 in webui. Only setting it to 1 gives me a clear image, 0-0.5 generated a blurry image. I set scaling to the same as normal
My mistake, when I told you numbers, I was talking percentages... I assumed you knew that the D.S. goes from 0 to 1 (0% to 100%)
@keen epoch
Can someone help me with getting a samurai cowboy witb a oni mask holding a lever action, I'm only able to use this servers bot
FAQ: Why are my images blurry?
In order to ensure a safe experience, the DreamStudio website has a NSFW classifier that will detect and blur any potential NSFW images. While in most cases the classifier will appropriately identify NSFW images, there may be occasional false positives due to the nature of how these systems work. We will continue to work on and improve the classifier to make false positives less and less likely! You are not charged for any images that are blurred
Why is that option not available in img2img?
is there a way to img2img this image to both remove the white background and add a landscape as well as make the actual character look different/more epic? im using automatic1111 if that helps
As far as I'm concerned Stable Difussion doesn't work with Alfa channel, but you can get it into Photoshop and use this tool, then work the selection and get the character over a transparent bg, then generate your bg separately in Stable Diffusion and blend the two images together in Photoshop
I'm trying to generate some images of a royal palace background, I don't need something specifically, but I could do with some prompting help cos I'm not sure what should I write down
I've got here some reference images, but I don't need the bg to be exactly like them
Hello! I'm not really sure if this is a right place to ask, so sorry..
I'm looking for a way to restyle an image. I made a 3D sketch with some character sitting in a car, and I want to restyle it using one of the models I have. I tried to use img2img, but it does not save the structure of the shot. Character sometimes in wrong pose or car is wrong e.t.c. Is there a good tutorial on restyling images. All I saw on YT are for photos and people, but some shots I need are without characters at all..
I'm using local web-UI (Automation111 or something like this, do not remember)..
if i understand properly, but i doubt, i would use img2img with about 0,3 denoise strenght. Default is too strong, if remember properly about 0,7 probably. @wicked fossil
Oh, yes, it does help, thank you! Weird parameter name for a beginner user 🙂
glad it helps
Hi guys, I'm new to SD. The faces on my characters look off and I don't know how to fix it, anybody that could help?
What am I doing wrong?
This is kind of the result I was going for:
(face-wise)
can anyone suggest an extension for a1111 to automatically run a prompt through different checkpoints and/or different loras?
Thats already in build in auto1111 called X/Y/Z prompt. At the bottom in txt2img
What type of prompt should be used depends on the model, when the basses have a presentation page where they recommend the settings, and other things... depending on whether the model understands the concepts or not, phrases like "look wise" "wise gaze" can help you or tags like "tsurime" or emotions/personality traits like "cool", "confident", "self-reliant"..you can also try other models or options hires.fix or scripts for better details
Almost there, just need to get rid of the finger feet.
this was after 30 steps
I'm gonna give it some more steps to work on. Like 40.
my stable diffusion is behaving really weird and I need help with my prompts.
My prompt was: A magnet shaped like a U, 2d game asset
Negative prmopt: Detailed background
and this is what I got
Why the heck does this keep happening!
It gets even worse when I upscale, these random green blurs on certain parts
Thats mostly if your model is missing a vae
But I'm not D:
oof is my question getting away now 😦
Which one did you use?
clear VAE
What model do you use ?
It happens on any model I use, but this specific one is azovya RPG artist tools v3
I'm mainly using stable diffusion for game assets and it has worked better before
Try an other vae like 84000-mse
Clear vae is for anime models
It does it even with anime models though
Can you show me an example?
here is a better answer:
A magnet shaped like a U, 2d game asset
Negative prompt: detailed background
Steps: 20, Sampler: Euler a, CFG scale: 7, Seed: 2350699442, Size: 512x512, Model hash: cc6cb27103, Model: model, Version: v1.4.0
You should try an other model
which model?
Maybe this one:
https://civitai.com/models/47800/game-icon-institutemode
<游戏图标研究所- luxiaoyu >"AI-Game Icon Institute" Here is the link to the method of correctly using the model: https://www.bilibili.com/video/BV1L...
You download the model. And put it into the models/stable-diffusion folder
Thats it 🙂
oh xD
Then you can select it in the dropdown
just in here?
I can't find one of the examples! But I have seen it! Is the VAE thing the only possibility? If so I can troubleshoot from there on my own
@vapid crater In the stable-diffusion folder
Yes
mine is called stable-diffusion-webui 🤔
Alright, ty, I can likely find a fix on my own with that.
For photorealistic models you need a vae like 84000 mse and for anime for example the kl-f8-anime2 vae
@vapid crater fourth from bottom folder
OH
Stable-diffusion-webui/models/stable-diffusion
ah the one I already have is there
ty!
what a weird file extension name gameIconInstitute_v30.safetensors
Yea .Safetensor are the new file extensions for models(checkpoints)
ah
Make sure they are over 1.99gb in size if you Download more. Everything below dont go in this folder
what's different between models? Is it the data they have or the algorithm?
yeah that one I just downloaded is 3.97GB
The models contain different training data. The model you had used before is the official SD 1.5 version.
The Community models are better trained or specificly trained on subjects and are based on the 1.5 model
aha
so if I want a model for game assets I should look for a model with lots of images related to game assets in its data
cuz they are trained on a bunch of images right?
Yes the one i linked is such a game asset model
aha
do I need to type game asset in my prompt tho?
sometimes it helps but if the model is already trained for game assets it should be useless right?
Yea i would try game icon, game asset, 2d, icon, U shaped magnet:1.2,
what does :1.2 do?
It gives the word more attention
wow that's very cool
You can use (word) or word:1.1
I also tried to use this as an image prompt but it was too similar even though Denoising strength was very low
wait I meant CFG scale
what if I want to have a slight 3D shape on it?
but still from a platformer angle
and smoother curves
It should generate it more 3d then 2d
I cant try it out. Not at the pc rn, but later
wtf I got this
game icon, game asset, 2d, icon, U shaped magnet:1.2,
Negative prompt: detailed background
Steps: 20, Sampler: Euler a, CFG scale: 7, Seed: 1140004488, Size: 512x512, Model hash: cc6cb27103, Model: model, Denoising strength: 1, Version: v1.4.0
with the image prompt
my stable diffusion isn't making any sense
uh I thought i did but apparently not
game icon, game asset, 2d, icon, U shaped magnet:1.2,
Negative prompt: detailed background
Steps: 20, Sampler: Euler a, CFG scale: 7, Seed: 1388204124, Size: 512x512, Model hash: c112297163, Model: gameIconInstitute_v30, Denoising strength: 0.62, Version: v1.4.0
BRUH
wait I copied the wrong info it used this: Negative prompt: detailed background
Steps: 20, Sampler: Euler a, CFG scale: 7, Seed: 4265811788, Size: 512x512, Model hash: c112297163, Model: gameIconInstitute_v30, Denoising strength: 0.9, Version: v1.4.0
my image prompt seems to have done 0 effect
Okay what if you try the prompt in txt2img ?
game icon, game asset, 2d, icon, U shaped magnet:1.2,
Negative prompt: detailed background
Steps: 20, Sampler: Euler a, CFG scale: 7, Seed: 1206477096, Size: 512x512, Model hash: c112297163, Model: gameIconInstitute_v30, Version: v1.4.0
this is text2img
not even a u shape
not even 2d
i made this but with CN. Which means download tons of models.
it looks a lot like my image prompt tho
it is your image as source
Seems like the model is more for 3d assets
oh oof
Can you try.
Two pole magnet, red and white. U shaped, 2d, comic style
I'm looking to get something like this but I took this straight from google
I thought the AI could achieve that with denoising strength adjustments
why else would you use your image prompt if it gives the same results
okay
maybe horse shoe would be better than U shape
it seems like it only understands "red white"
Two pole magnet, red and white. U shaped, 2d, comic style
Negative prompt: detailed background
Steps: 20, Sampler: Euler a, CFG scale: 7, Seed: 1858827973, Size: 512x512, Model hash: c112297163, Model: gameIconInstitute_v30, Version: v1.4.0
that was text2img
hmmm okay
sometimes I feel like drawing my own stuff is easier than using AI 😂
If you want some context, this is all the art I've managed to make with AI so far but now I can't even make a fucking magnet! 😡
Is this a joke? I got this by using the prompt "Magnet"
Draw a magnet and put it in img2img maybe
I did
It's thinking fridge magnet then
I used this as my prompt
and this is the best result I could get
maybe this model has zero images of U-shaped magnets
what's lora?
It's like a model but instead of being an entire thing of art it's taught a specific concept, for example you can teach it how to recreate specific characters, specific clothing, hairstyles, the list goes on
you mean in here?
No, tbh I've only dipped my toe into loras, it's not made in the UI, you make it elsewhere, I am not the most knowledgeable
oh
Well into making them. I'm very experienced using them!
Yea its something that a model may didnt know
Maybe not even the base 1.5
Or maybe with an other word
but it seems like I need a different model though, any tips for similar stuff like this? It's 2D but still slightly 3D
Maybe some kind of tech lora knows how to make a magnet, but that would be a shot in the dark
oof
Hmm maybe an anime model or toonyou
I can at least be proud of being the first to encounter a problem in 2023
cuz you are almost never the first 😂
I encountered a similar problem. I can never make one of my characters because she has a veil covering her eyes, and no models have been trained on eye veils
oh lol
hmm ok
I guess I will just make my own in blender with an almost 2D camera angle and then just ask the AI for textures
cuz I want it to look old and rusty
Guys I'm trying to generate a set of photo realistic character images in stable diffusion using Random names in my prompt. I'm planning to make a data set out of this results and do a dreambooth training.
Right now I'm struggling to generate the character in all angles like front,side,3/4 views, and back. I'm not getting similar faces in each angle.
I'm using epic realism for this. Any lead or help would be appreciated.
@vapid crater try shape of letter C
the fact it can't even understand shape U is surpricing
mine are purple on one of my checkpoint models
You might need to be more specific on that one
You then need a vae file for the model
In img2img you need to use the sd upscale script
Its nearly the same
anyone can help me to make better this photo
here
1girl,armlet ,bangs ,bikini ,purple bikini ,blunt bangs,bracelet ,braid ,braided ponytail ,breasts ,earrings ,eyebrows visible through hair ,flower ,flower bracelet ,flower on liquid ,hair flower ,hair ornament ,hairclip ,highres ,jewelry,large breasts ,long hair ,navel ,necklace ,parted lips,partially submerged ,purple flower ,purple hair ,purple nails,side-tie bikini ,sitting ,solo,stomach ,swimsuit ,tassel,tassel earrings,thigh strap ,violet eyes ,water ,wet ,white flower,raiden_shogun,aesthetic,more flowers,(exceptional, best aesthetic, new, newest, best quality, masterpiece, extremely detailed, anime, waifu:1.2)
<lora:raiden shogun_LoRA:1>
1girl,armlet ,bangs ,bikini ,purple bikini ,blunt bangs,bracelet ,braid ,braided ponytail ,breasts ,earrings ,eyebrows visible through hair ,flower ,flower bracelet ,flower on liquid ,hair flower ,hair ornament ,hairclip ,highres ,jewelry,large breasts ,long hair ,navel ,necklace ,parted lips,partially submerged ,purple flower ,purple hair ,purple nails,side-tie bikini ,sitting ,solo,stomach ,swimsuit ,tassel,tassel earrings,thigh strap ,violet eyes ,water ,wet ,white flower,raiden_shogun,aesthetic,more flowers,(exceptional, best aesthetic, new, newest, best quality, masterpiece, extremely detailed, anime, waifu:1.2)
<lora:raiden shogun_LoRA:1>
i am new to this thing and peoples are doing much better things than me
just asked for can i make it better
ohh i copied wrong thing
lowres, ((bad anatomy)), ((bad hands)), missing finger, extra digits, fewer digits, blurry, ((mutated hands and fingers)), (poorly drawn face), ((mutation)), ((deformed face)), (ugly), ((bad proportions)), ((extra limbs)), extra face, (double head), (extra head), ((extra feet)), monster, logo, cropped, worst quality, jpeg, humpbacked, long body, long neck, ((jpeg artifacts)), deleted, old, oldest, ((censored)), ((bad aesthetic)), (mosaic censoring, bar censor, blur censor)
here
yeah
https://huggingface.co/YoungMasterFromSect/Trauter_LoRAs collection of Anime LoRAs that include characters from various series. Every day I try atl...
np np
oh i saw them
and any guide for imprinting
i will
yep
inpainting
its works weird
doesnt fix limbs
ohh
last question
why my pc crashes
i have rx 6600xt
and randomly crashes when using sd
for temps
yep
so i need to wait some time
low res?
which gpu you have
ohh
you can do sd with laptop
i want to buy a laptop but i am scared for sd to run poorly
it is hot?
laptop model?
i am planing to buy more budgets
like asus tuf
or hp victus
people dont reccomend that one usually
ohh i found problem probably
i use 512x512
than upscaling it to 2k
i dunno
from highres fix
selecting animeb6
something like that
it just works
in same res
why am i getting out of memory error
i hate this
8
changed sampling steps and res
experimanting
but nothing changed
oh my
this looks good
in image
difference is so much
do you make it with control net right?
still out of memory error
with 8 gb vram...
i need absoulty an nvidia gpu
same res
1280 720
changed nothing
--xformers doesnt work on amd
yeah
i entered some commands
export COMMANDLINE_ARGS="--lowvram --no-half-vae --opt-sub-quad-attention --opt-split-attention-v1 --autolaunch"
low
okey
thanks anyway
i am going to buy an 3060 laptop because of you
i am going uni next year
so i need a laptop
i am in other country saddly...
some people said 6 gb vram isnt enough
but you use it pretty smooth i guess
i learned my lesson
amd is not for sd
absoultly
it is pain
i have a lot of problam installing on linux
and then
this memory issue
I gave up on SD and made my own magnet lol
I thought it could give me more details and more interesting results
but I do need better textures
maybe I should try to use this as an img2img prompt
never heard of that
🤔
2070 super or 3060
which one is better for sd
2070 is waaaay cheaper in my country
and i can trade my rx 6600 xt with that
i guess 2070 is way way way better than rx 6600xt
i have some graph about it
if we say
3050 is on par with 2070 on ai
stil better than my gpu
but 3050 is 50 class gpu
2070 is 70
probably 2070 is more powerful at ai
It would be awesome to see Apple silcon on a list like this
you are right...
intel is that bad
so i need mux
asus tuf has mux switch
in budget class
its pretty neat
yep
i will do that
but i certainly buy an nvidia gpu
stable diffusion is so much fun
what are those Stock texts tho?
interesting
oh you used that google image as a prompt xD
I was also hoping I would get an image faster by using SD instead of making my own but it was faster and easier by making my own in blender xD
it looks like a helmet xD
magnet helmet 2023
lol
Hi, how do I stop these weird blurry artifacts appearing when I do small-scale inpainting?
(I have whole picture checked, for the record)
the picture is blurry on a lot of spots... do you mean blurry spots where your mask ends? i think this can happen if you have the "Mask blur" option set too high
If the "Mask blur" is not the thing you are looking for and instead your problem is that the newly drawn skin or background is too blurry, maybe try "highly detailed skin texture", "sharp", "natural skin". Common negative prompts to prevent blurry skin: "airbrushed" "cgi" and sometimes "doll"
Does someone have an idea how i might get this pose (not exactly, lookalike is fine)? I´m already struggling to control her fingers a little bit...
Do generated images have any kind of metadata so I can use it to know which prompts were used to make it?
This is not always the case...often there are options that allow you to determine what generation data is added to the metadata...it is not uncommon that some authors choose not to add them or only a few of them, also and cases that use an external editing program alter or delete them.
So you're saying that, by default, they're not added?
I don't remember which is the default setting.. also A1111 has had many updates.. and not all people generate with A1111 ........ if you use A1111 you can see that in settings
hii!!! i need help, please. I want to make a Spanish-style card but changing card number 12 (which is a king) for that of a gaucho
I'm using the stability api service and struggling to get results that reflect my prompt. I've perhaps handicapped myself as I'm looking to generate some 300 images with a common style. In this case I've selected 'water colours'. Here is an example prompt: a watercolour depicting a user zooming a picture on a mobile phone
I should note that I'm also forcing the aspect ratio to create a 'swatch'.
So I'm trying to generate an image of an orphaned child (to represent software that is no longer supported) 'a watercolour depicting a lost child' and I get the reponse - Invalid prompts detected - so I'm guessing more overly zealous censhorship
hey
What prompt should i use to create a image from a reference image
Please help
Hello, I'm new to Stable Diffusion. I used a lot Midjourney and loved it. I saw Stable Diffusion and wanted to try because it's "unlimited to use". Do you know why is it soo bad with things like characters or stuff from pop cultur thing. I tryed to do some Mass Effect and lol characters and it never worked not even close. Even with the image to image. Maybe I'm just using it wrong so I came here to find if you can give me any advise for this
A prompt that describes the input image
Hey do you use a webui for SD ?
The base model of SD cant create every character from games or movies. So people started to train their own files to get specific characters. These trained files are called Loras. And they can be loaded together with a model to get a good image of the Character.
Yes the Automatic11111 thing like this
Do you know where I can find this ?
Sure you can find different models and loras on Civitai.com
Make sure to place models(checkpoints) in the models/stable-diffusion folder and Loras in models/loras
now I'm wondering how MJ can get so many pop-culture characters (relatively) right. Are they all trained in the model itself or does their process pull up LORAs / TIs in the background, in response to prompt keywords?
Thank you so much !
I think they mostly mix stuff in the background to get the stuff you want.
Np, i would suggest to first try different models. Maybe some of them know more characters. If not then search for loras of the chars.
Here is a guide on how to use Loras:
#🤝|tech-support message
how can you prevent more than one character appeared
Using a smaller resolution like 768x512 and then upscale it instead og using higher resolution at the start
More like 512x768 or 768x1024 as its a Portrait format
ahh 2:3
You can fuse two thing together with the | symbol between them like this cat|fox you make this symbol when you press "alt gr" + the button left from the "Y"
hey guys, as anyone found a way to replicate those "mini-x" kind of setup and looks almost as you are transposing what you want into a diorama sort of thing? I've seen it often done with videogames like "mini-thelastofus", "mini-cyberpunk" and they look as if they all were isometric in nature
would be really cool to build upon it
is there a way to generate a prompt x amount of time before it randomize? i use wildcards and dynamic prompt
Is it true that my SD is not using the internet when running?
A challenging prompt no ai has yet come close to mastering: A bear with six eyes
Since apparently no person has ever drawn this it is impossible for any system to imagine it. If anyone can get SD to comply I'll be very impressed.
I need some help, I'm not that experienced with stable (or other Ai generators) but I'm getting a problem that i not faced before
in preview my arts look kinda fine but when they finish they look kinda weird like that
should I up the sampling steps? I'm using 60 rn
are you using any loras? alsoi whats your cfg at?
what are the lora weights at? ive seen this happen with multiple loras in the past, they usually dont work well together at higher weights
id try lowering the lora weights and see if that helps if theyre both at :1 or something
8 cfg is fine btw
I'll try lowering a bit, maybe its a lora conflict
I removed all loras and tryied with 100 steps, got a bit better result, I'll try to insert the loras slowly and get the results that I want, any suggestion in improving the quality of the details in the image? still looks kinda unfocused to me
Lora style 0.2 - 0.5 (you can increase it more if the style does not show well)
.. character Lora 0.5 - 0.9 (in this range the character should be shown relatively well and leave room for the other Lora)
the Sampling steps depend of the Sampling method you use, but in general there is little benefit to going beyond 50.
to test the image of: 10-20.
to generate with a good seed of: 24-50
thanks for the advice, I lowered the style lora and got some pettry good results, I'll be more mindful of the sampling steps but I'm getting some consistent results with 60-70
@cosmic linden ok......pastelmix recomendations (uploader)...
Guide
For the settings or parameters, I recommend using these settings.
Sampler: DPM++ 2M Karras
Steps: 20
CFG Scale: 7
Hires. Fix: On
Upscaler: Latent (MUST!)
Hires Steps: 20
Denoising Strength: 0.6 ()
() I prefer using 0.6 since it's the sweet spot of this model. If you can find a better setting for this model, then good for you lol.
Latent upscaler is the best setting for me since it retains or enhances the pastel style. Other upscalers like Lanczos or Anime6B tends to smoothen them out, removing the pastel-like brushwork.
------- HF bye

@cosmic linden normally on their presentation page the models bring a brief description and recommendations for use, and sometimes known problems are mentioned... it is always good to review them well... and if you have more time read questions asked by other users and the answers of the authors..
I generally read it and look into other generations to have ideias of what people are using, I just messed with the steps but I overlooked the Latent upscaler, I'll look into it
I'm not very versed on all of this so thanks for the patience
NP, sooner rather than later there will be topics in which you have more experience and you will be advising me... besides, the fun is in experimenting and more than once trying something not recommended will give you better results
大广赛要用,所以我训练了这个lora 它可能不是那么听话,但是对我来说已经足够了 如果对你也有帮助 请给个好评 这是我坚持下去的动力! I had to use it for the Creative Advertising Art Competition for University Stu...
there is something very refreshing about uncharted territory
where everyone can figure something out for the first time
i've learned a lot from noobs experimenting (and really am still myself a noob experimenting) and thats super super cool
How can I force stable diffusion to use one type of color ?
Like I say red and that came blue 💀
And also, I'm interested to use controle net. If you know how can I do it with 1.5 and the webui
hey guys, i want to try and paint rest of the face with inpaint tool, is it possible? if its not, what extension can do it?
i just get stripes and thats it
for example blonde hair:1.3, green shoes:1.3
I've got a ship inside the black area and I don't want it to be there. I'm trying to erase with inpaint and I dont get it
Prompt: sea
Negative prompt: ship, blurred, bad quality
Someone could help me?
what are your img2img settings
Changing denoising strength I got a better result, but still a bit rought difference in the colour of the sea
hmm thats pretty hard. SD always tries to add something
you woule be faster by editing it with a editing tool like Pain.NET
what's that?
free image editing tool, not as good as photoshop but its okay
I've got Photoshop
It´s sometimes helpful with some tricky edits SD is probably not prepared to do yet
the best i got with 1.5 inpainting model and denois 0.8
for inpaint you should use specific inpainting models
nyway to use stable diffusion more as a filter than generation? As in, generating art in a certain style and wanting to convert another image to that style as well?
use the content aware fill or generative fill in photoshop, those are great for removing small unwanted objects like that ship
content aware fill is in beta?
Yes
cos I don't have Photoshop Beta
Content aware fill is in the regular photoshop, Generative fill is beta
I had, but the free trial expires
Perfect!
im trying to make a character lay down, but all im getting is a pose with her rm crossed, i have a prompt lay on back but it aint working
H
(Img2Img) Any tips for prompting a 2D cartoon image towards being something more like a 3D CGI/Photorealistic image? I’m close but not quite there yet.
Ok thx
Hello guys.. playing ewith the deform and somehow, the images get very fast abstract.. and it's not part of my prompt.. smth in the setting.. please any help ❤️
what is the best sampling method for anime?
I want to create a small cute shibi, with blonde hair and blue eyes, for the backround juste a beautiful color.
But i dont know exactly how to do it with speacial prompt any experts here ?
lower the denoise strength to like 0.3 or 0.4
this one?
there are models (checkpoint) that facilitate this, and models specialized in them (LoRA, Lyco), look for them in civitai, by the way check the prompts they use... there is always a possibility that other models somehow understand the concept and generate it ... https://civitai.com/tag/chibi
Browse chibi Stable Diffusion models, checkpoints, hypernetworks, textual inversions, embeddings, Aesthetic Gradients, and LORAs
Anyway to use stable diffusion more as a filter than generation? As in, generating art in a certain style and wanting to convert another image to that style as well?
And what do you call the area beyond full shot?
🤷
@wanton girder
The following terms may not work on all models or be understood in the same way.... if there is some realism and photography in them: Wide open view, Extreme wide open view, panoramic, scenery, etc..
How can I tell my AI ton focus on one thing ? Like I want a guy with a helmet that look like a camera, but never something near a camera came out
to fuse two things use the | symbol you can make it with alt gr+ the button left from Y than use it like this helmet|camera should it be more an camera not an helmet use weights () they tell the ai to generate from this word more like this helmet|(camera) , the ai dont understand context but words. enjoy @sly fable
text is hard, has anyone good tips ?
Oooooh ok thank you !
@sly fable ah i forgot , to make weight heavier just add more () to an word like cat,(cat),((cat)) you only need one of them and dont use the same word over and over
no problem
Ok tysm !
and use nagative promts too, so you can avoid mixing with other data that has same vibe but totally different content,
I will !
as an example when you want an real photo, just tipe in negative promt waht you dont want like ,3d,render,oil painting, and so on
@sly fable
Ok ok ty ! You help me a lot
its a little sad that here is not an guide with all the promt tricks and good promt words to get what we want.
It is !
thats for you and me to make, stabilityAI just trained the model and let us play with it. I'm sure theres guides out there in the internet if you checked, or you could write your own 😉
Is this the right place to ask for prompt tips?
If so ... how do I keep all characters in my renders from having the same expression?
For example, (one woman smiling) and (three men frowning) can have them all smiling or frowning...
#1100170312106127410 can do that for you
How? Oh you mean the damn unicorn thing lol
If anyone has any prompting-help advice on how to have multiple characters with different expressions, I'd be way grateful
what would be a good prompt to turn my photo into an 80s anime
I struggle to put multiple characters with different outfits in an image.
For example, I want the AI to draw a man and a woman sitting and chatting with each other in a cafeteria, both wearing different outfits. How the hell am I supposed to describe that to SD without it mixing things up?
Hi! I have a question. I would like to know if it's possible to use stable diffusion in this way: I upload my own sketch (png) and then make it inot a rendered image. For now I found a lot of tutorials regarding using the sketching option within the stable diffusion program but I don't know if my png sketch that I uploaded is really used by the model or if it's just not visible for it. Can anyone help me?
What is the best input format to make sure my prompt is understood? For example, if i were to use piercing blue eyes as a prompt how do i avoid it being understood as "piercing blue" eyes?
you could try using your sketch as an image with controlnet. check out the gallery for examples https://github.com/Mikubill/sd-webui-controlnet
Can some1 explain me what exacily "BREAK" doing in prompt?
In Regional prompting are keywords like ADDCOL, ADDROW, ADDCOM turned to word break in picture name.
But in A1111 here is the answer
https://github.com/AUTOMATIC1111/stable-diffusion-webui/wiki/Features#break-keyword
thanks, I'll try it!
there is a MJ style I like but fail to reproduce. any idea to get those brush strokes in sky?
Van goghs a starry night over a huge cyberpunk city, recursive network graphs within neural networks made of stars and galaxies in space, moebius + pascal campion, incredibly detailed + sharpen + professional lighting, cinematic, awe inspiring --ar 16:9
any tips for sdxl prompting, i dont get good results
not completely bad but compared to others not really good
i struggle with extremly blurry results
Anyway to use stable diffusion more as a filter than generation? As in, generating art in a certain style and wanting to convert another image to that style as well?
Hi, is there a way to try different models in the bot rooms? Not sure how to point to different models. Any tips?
Hey there SD community. I had a hopefully somewhat quick question for you I'm guessing one of you can answer without an issue!
I was wondering if you guys know a way to put a generated image from SD, infront of a pure white background. An example is below. I am able to do this right now by first uploading a pure white background into inpaint, mask an area, and then generate an image. Which works well enough, but if I want to use the shape of a specific image and get control net involved, the quality takes a big hit, and then the pure white background disappears. Do any of you know a way I can do this in a much more straightforward fashion? Any help would be great. An example is below. If it helps, I already have rembg installed and can remove backgrounds from generated images. Thanks!
how do i do weighted promts and other things
is there a wiki for it
?
im using sdxl bot
Hi there, im curious how you make animals fur change colors in a prompt? I am having alot of trouble.
how do i make my results less creepy
my prompts are 1girl, portrait, full body, brown eyes, extremely detailed eyes, standing:1.2, oversized (t-shirt), (black shorts), platform boots, monotone background, long (black hair), cute face, style of Masashi Kishimoto and for the negatives ((blurry)), nsfw, out of focus, large breasts:1.1, 1boy, (deformed:1.3), poorly drawn face, poorly drawn eyes, lowres, ugly, mutation, mutated, worst quality, bad quality, twins, 2girls, 3girls, 4girls, multiple people, 2boys, 3boys, 4boys,
i don't like how it looks
i want it more anime like
i'm also using sd webui if that's important
Anyone knows how I can mix multiple faces into a single individual? I dont want to get multiple characters to appear in the image.
how can i share a reference image for SD to dream it?
anyone there?
Please DM me the answer in case i dont reply its very much useful for me
add blurry as an negative promt and high detail,sharp, to the positive ones ?
use the fuse promt, i has explained it 2 days ago #📝|prompting-help message
tried allready 😅 , i kinda gave up with sdxl for now
@kindred garnet can you give me your promt and tell me what you wanted to be generate ,eventually can i find out the problem
Hello, what's a smart way to find if the style of an artis (that I might use in my prompt) is actually recognised by Stable Diffusion ? Is there a test you can think of that I could use ? Maybe trying to generate the artist's work (e.g. a vase with sunflowers by Vincent Van Gogh) ?
Your model needs a vae file for color correction. And you should try a resolution of 512x768
Looks pretty anime-like to me.
And speaking of VAE
@silver valley Are those good enough?
I don't understand much about VAE, my friends told me to get these.
You should get the kl-f8-anime2 vae too. Its very good for anime
Can this discord server support img2img input?
Please see the pinned comments in the bot sections for all information on SDXL. You can also see #📣|announcements for the latest release information
Hopefully this question is pretty basic, I'm a newer to the AI space so I am still trying to experiment with tools to understand more about SD. That being said does anyone know what I would need to do to make two more variations of this image with the head in a different position? I'm going for King Ghidorah
From what I have been reading that seems like the way to go, I'm not familiar with Controlnet either. Does anyone have any experience doing something similar to this? Controlnet settings that you might use to adjust this image to a different pose?
I'm sure someone might have a better method but the best I can think of is to controlnet tile that and text2img until you get a different pose. Usually you can use openpose to change poses but that's reserved for humanoid figures I think. Mabey openpose face could help but unsure, could give that a few generations to see if it works.
An alternative I would try if I were to do what you're trying to do is edit the image around and flip/tilt it to whatever pose you want it to be in then img2img that or controlnet tile that.
That's a great idea! that would definitely help with adding some variation to the heads, It's always the simplest solutions haha
you need canvas inpainting and control net so you need an amatuer freundli tool that can use free models is bug free and has its own discord to help.
you need InvokeAI first search for their discord.
Its an software like paint but very simple and easy to use in there discord are many people who can explain to you how to do it.
anyone got any suggested prompts for making sure head, torse, legs, feet are all facing the same direction?
i keep having issues, especially with back shots, where the head faces away but the rst of the body still faces the viewer
like this
there will always be failures, unless it's a build guided in some way... regional prompter or Controlnet multiposes ...
yeah im using controlnet
middle row, second from the right. This happens a lot
always in that spot
and it's always in back shots with ot without CN
so i assume its something to do with back shots
@tight light that is a product of the way the image is generated without a specific guide you will always have these problems appearing, the same for upside down or unusual poses
try to get a good template for controlnet to guide you, it's hard work with the editor... good luck
any particular reason on why abyssorangemix2hard is generating black images? i thought that was the whole point
i generated this character, is there anyway, that i can consistently generate this same character ( face, hair, ears, look)
copy the seed and up the batch count
Please some can help here. I am trying to generate images with size of width:1216 height:832 but i always get images with 1024x1024
using which UI?
using bot on this discord
try to resize so pixel count adds up to 1048576
what is the input of negative prompts on the bot? --negative prompt?
negative_prompt
danke
after you write dream, if you click outside of the box you can see
the additional parameters you can add
i am putting this prompt but still getting the 1024x1024 image resolution .. Red leaves tree with white trunk , best quality, outdoors, close-up, high detail, 2k style:Photographic width:1216 height:832
that doesn't add up to a mega pixel
1216*832 = 1011712 (nvm don't think this matters as long as it's close to a megapixel)
you could try genning & using the resize option on the prompt, might get close to the resolution you want
wait
i checked the bot mentions and those are 832x1216
i think the height is bigger though rather than width
for some reason
i saw some other prompts using same size and their images match the same resolution but when i try with same prompt it's coming out at 1024x1024
your last one is 832x1216
but the wrong dimension is altered
also i think other people probably hit the resize button
and got a random dimension
one more thing i am confused some one entered along prompt they did not got token limit error when i tried i get LONG PROMPT message with this strawberry milkshake, bar menu, canon eos r 3, f / 1. 4, iso 2 0 0, 1 / 1 6 0 s, 8 k, raw, unedited, symmetrical balance, beautiful volumetric cinematic lighting, masterpiece, best quality, award-winning, coherent, exquisite cinematography, film still, RAW, shot on 35mm. 108MP, HDR, intricate extremely-realistic textures
Negative: unrealistic, semi-realistic, smooth plastic details, uninteresting, ordinary, ugly, plain, dull, sparse, painting, airbrushed, illustration, drawing, toy, doll, digital art, vector art, clipart, comic, 3d rendered. low-resolution, low-quality, smeary/smudged, overexposed, oversaturated, image grid, watermark
your prompt has a lot of commas, /, isolated letters/numbers/etc.
those are all individual tokens
you can have a near paragraph long prompt as long as all the words are tokens by themselves
each comma and space is a token
you could avoid using commas entirely a lot of the time
i thought comma was not counted
what was the prompt that went through? i think there is a 77 token limit still
i just copied some one else prompt and tried the image resolution for me is same 1024x1024 as the other member's image was proper size mentioned in Prompt of this one .Mother's Day poster, , lovely child sending flowers to beautiful mother with big wavy hair, shining big eyes, background in a warm home, perspective, light and dark contrast, warm colors, bright background, style:Pixel Art width:1152 height:896
i think commas count as tokens, but there's special attention applied to tokens within a certain range from a comma (at least in webui)
i just checked now the original message had LONG PROMPT warning . its correct 77 token limit is there for everyone
this image has same size as mentioned in prompt width:1152 height:896
and when i try with same prompt i get width:1024 height:1024
there might be a special interaction if you get the size via pressing the resize button vs manually inputting size
whenever i get a size through resize button it just works
might not be open to manual size testing right now
Hi, why can't I use commands : style and aspect It shows me an option. Not a valid choice.
actually i noticed width/height aren't valid options? you have to use aspect i think
click style/aspect and select from the list. "landscape" isn't a valid aspect for instance
Well, I didn't notice that, thank you
Thanks for the help i was just checking the words in prompt and manually using size now it works perfectly as you mentioned.
I'm trying to make objects and the issue is that it produces them at an angle, how is it I could make it only generate images that face the camera head on?
I don't know what keywords to use
front view
I'll try
it's producing better results but I still can't get it to be precise, any tips?
updated my model to v4.5 and it's a lot better
anyway to find out what seed i used? since i generated an image after this one
parameters
white background
anime girl with light brown hair, her hair covering right eye, brown red eye, fox ears, brown hair, smile, right eye covered up by hair, by hair, bust shot, close up, blue bow tier dress, bright brown hair, hair covering right eye, two red marks on face, hair coveres right eye, red mark left side of cheek, red mark right side of cheek, one eye,
Negative prompt: low quality, worse quality, brown hair, bow tie, white hair, bow tie on hair,
Steps: 40, Sampler: DPM2 a Karras, CFG scale: 8.5, Seed: 845172862, Size: 768x768, Model hash: 876b4c7ba5, Model: cetusMix_Whalefall2, Clip skip: 2, ControlNet 0: "preprocessor: reference_only, model: None, weight: 1, starting/ending: (0, 1), resize mode: Crop and Resize, pixel perfect: False, control mode: Balanced, preprocessor params: (-1, 0.5, -1)", Version: v1.4.1
umm
go there
drop the image u generated
theres the seed
might be obvious but here goes the seed
anyone got any questions feel free to dm me lol
any idea how i can get rid of framed pictures?
always struggling with that
ok this is pretty dark, but it has black bars on left and right
Maybe I just have no idea what I'm doing, but I always get really disheartened trying to use stable diffusion to rough out a character for a story. Like, I feel helpless trying to fight the AI from just doing whatever it wants. I want a character with blue hair, yellow eyes, with black steer horns sprouted from her head, and it basically ignores half of what I say. The horns? If they ever happen (like 1% of the images) it's the wrong kind. It REFUSES to do blue hair. Even when putting it at the top of negative prompts, it DEFAULTS to blonde hair, and blue eyes.
I just don't understand how I've seen so many AI recreations of well known characters, accurate down to the intricacies of the outfits or the extremely unique hair styles. I try to steer it with the prompts and instead of refining as I tell it what not to do, it shrugs its digital shoulders and goes, "you're wrong, blonde hair is awesome."
Two times in a row trying different variations it gives me this instead of images,any idea what i might be doing wrong please?
- It depends vastly of what prompt (and negative prompt) you're using. I'm guessing you're using stuff like
blue hair woman red eyes green shirt with horns. The AI does not understand really well how those words work together, how they fit, what's the context for them, etc. So with a prompt like that you can easily end up witha woman with red hair green eyes and red shirt and hornsevery word you used will be there.... Just not how you expected it. So you've got to learn how to write prompts correctly, try to use attention/emphasis syntax https://github.com/AUTOMATIC1111/stable-diffusion-webui/wiki/Features#attentionemphasis, use theBREAKkeyword in your prompt, etc... - to avoid any kind of color contamination, cf : https://civitai.com/models/18840/no-more-color-contamination-read-description
- the cfg value you're using can also impact how strongly the AI should follow your prompt. Don't overdo it tho.... anything past 10 will usually yield deep fried results if you're not using some special techniques. (not gonna talk about those for now at least)
- same for the clip skip value. tldr leave it to 1 unless specifically told so by the model's author/if it's a derivative from "that one" model
- people usually recreate well known characters using loras
"Loras?"
Thanks for the response, I'll definitely be checking that out. Nothing more frustrating than having an extremely capable tool and feeling like you're just playing with rocks.
Lora, yup https://stable-diffusion-art.com/lora/
also cf #🤝|tech-support message
Hello, I'm new in this community! I was wondering, why does it come out with the kid? I only want the woman. My prompt is this: mature, motherly, blonde, 80's mom
because you have motherly as a prompt, and the model probably knows that mothers have children
search google for mother, youll see that almost every picture has a baby in it
ty!!
Ty!!, Sorry im new
Hello, I want to share refer image to SD to make a new one, what prompt should i use?
Someone help
My image always get messed up once I include Gal Gadot lora. Any idea why?
a 32 yo (gldot:0.3 | IU1:0.6),long hair, dark theme, soothing tones, muted colors, high contrast, (natural skin texture, hyperrealism, soft light, sharp),red background,simple background. lora:FilmVelvia3:0.4 lora:Gal_Gadotv3:1 lora:LORA_iu_v35:1
I am combining her face with another famous celeb. But still, even if I just use Gal Gadot, things are messed up. Either the bg goes awry or the body bones look really bad.
weird question, but what would be the best model to make photorealistic catboys for a meme?
So uughh, probably a dumb question:
For some reason, I’ve been struggling with img2img. I have the program installed locally, and I am trying to have the AI add one matching thing to the image without changing anything else, or the base style of the image itself. I’ve messed with the settings and the image seems to always come out re-painted, messed up, or the thing I specified just doesn’t get added.
I’m a bit new to this, so I’m pretty sure I just don’t know the correct settings. 😛
For that you need to Inpaint into the image.
Load the image into Inpaint tab in img2img.
Mask the part where you want something to add. Then prompt for it.
Select Mask only and denois 0.5-0.9 for example
You need to lower the lora strenght by turning the 1 to 0.5 for example
That got the area selected, but nothing is happening when I try to add something. I have moved the denoising slider all over the place, but for whatever reason, it refuses to edit the selection.
Not sure if Im missing an addon or something
is there a format for a prompt to merge/mix 2 loras or text inversions of differnt faces?
i have tried [xxx:yyyy:1.5] where x y is trigger word
^^^
basically trying to make a world leader into a catboy as a meme, any model suggestions?
Thank you! That seems to have done it.
also you don't want to use "restore faces" option for anime style output.
hey guys, Im doing a female photo and I would like her to have a lil catholic cross necklace is there a way to emphasis I want a small cross and not like a "50 cent" one ? lol and also how to avoid getting cross on other piece of clothing ? any tips ? (im getting a lot of double cross)
I guess "cross necklace" is like a double use cuz necklace alone works pretty fine .. maybe with inpainting ..
try Cruxific instead of cross
ears ?
yes PS... or inpaint .. or: .... black_hair, streakked_hair, red_hair + RNGod bless
yeah its w.e hmu if u have any questions
It has nothing to do with what I asked...
You can try lower the value of the lora. But i would recommend using a ghibli lora instead. Thats trained on the ghibli style
OK thank you
How did you manage to prompt 2 different people in the same image?
Let alone with different ages and clothing?
any prompt about cat? please
Does SDXL understand natural language? I was using 1.5 & 2 mostly with comma separated keywords but no setence.
does anyone know what dreambooth repo works best with AMD GPU
Is it possible to tell @delicate kernel to select one of the 2 images and then make alterations to it?
what would be the prompt to get this pov, you see the front side of the person, just the back?
if i'm generating an image and i see something in the live preview process, is there a way to stop it right there so i can try to hone in on that image result before it continues getting processed? the end results are sometimes drastically different and i see some live previews that i really love
How do i get a higher quality image from A1111? The images generated aren't bad, just when i zoom in they tend to be pretty pixalated.
what is your height/width set to ?
521 width 720 height normally, changes on what i'm making
that would contribute to low res, but you can use a 2x or 4x upscaler to get larger higher quality images, sometimes lowres negative prompt can help a little too but im far from an expert
Don't think my computer could handle much more then what i have, 12gb gpu doesn't seem to be enough. Thanks for the help regardless
Whats your gpu?
You should be able to easily upscale with 12gb vram
6700 XT
Dunno
So far i keep getting the "Not enough Vram" message when i try it
I have tested this card and can upscale 512x768 by 2
Do you use the recommended Commandline args for amd ? In your Webui-user.bat
Yes
--medvram --no-half-vae --opt-sub-quad-attention --opt-split-attention-v1 --autolaunch
git pull
Looks good. But you also can remove --no-half-vae. Shouldnt be needed for 6000 series
Alright
Then try 512x768.
Highres fix, upscale by 2.
Denois 0.5
Hires steps 10
Upscaler Esrgan4x
Just got RuntimeError: Could not allocate tensor with 655360 bytes. There is not enough GPU video memory available!
@tired vigil Okay then completely restart SD and try upscale by 1.8
Or 1.5 to test.
Then you also need the Tiled Diffusion Extension. Install that and then only enable the Tiled Vae.
That will help to get the upscale done.
Then you should be able to upscale by 2
Ive generated this image with an AMD 6700XT.
You can load it into PNG-info tab to check the Settings used.
can someone tell me how to prevent multiple bodies from being added? it looks great in the live preview until the last few seconds itll add a conjoined body or an extra body in
You need to lower the base resolution and then upscale the image
its currently set to 512x512 and upscaling to 1024x1024
The 1.5 based models got trained on 512x512 and the 2.1 models on 768x768 resolution
Ohh okay then show your upscale Settings
Probably the denois is to high
Okay first disable restore faces. That breaks faces when using together with Highres fix.
Set hires steps to 10 or 15
Then set the denois to 0.55
If you use esrgan4x as upscaler you can use a denois of 0.5 or lower
that seems to work great except i lost her full body
Okay, you can also set an other base resolution. For example 512x768 is good for portrait style or full body.
Im off now gn
thanks for your help, gn!
you can use settings tab for that...
.....the fewer steps you will get more intermediate images....
consider that this will increase the space when saving each generation
thank you i will experiment with that
is there a way to exclude specific seeds ?
umm, never generated something like that but probably something among the lines of purple_mist purple_mist_background
could also run that through a controlnet process and keep the pose/perspective etc
ty!!
also how would i go from just the right upper part of the face, where only an eye and some hair is visible
how can i minimize/reduce the amount of this "text" jibberish
Can you share the actual image? so can get the meta data for the prompt.
I would like to do some testing and let you know.
hi, i have a stupid question, whaty do you type in to get multiple people showing? i tried "2 people" or "1girl 1boy" but doesnt do anything
@reef snow You can try using controlnet with an image of multiple people.
i'm gonna try it, thanks 🙂
usually when you just want to show something specifically you just say stuff like legs_out_of_frame, or only_legs_in_frame, leg_focus
so in the current stable diffusion I always had good luck generating larger women with the "curvy" keyword, but now even with () around it, it doesn't seem to want to generate anything but a slender woman? any advice?
So umm you could try highlighting the word that you want, press alt or control i dont remember and press the up arrow key a couple of times to let it know that you really want curvy in the gen
TY I'll try this!!
quick question, is this formula correct?
[from:to:When to make the change]
I'm trying to make something like this
a photo of living room with royal furniture style, and a carpeted floor, and a cabinet, white ceiling designed with [a painting of heaven: starry night :20]
but no results
Hello!
I recently started using Stable Diffusion running in a local instance. I've learned most of what I know from the great guides here: www.stable-diffusion-art.com/beginners-guide/
Currently I'm trying to create character illustration/portraits of the party members for my first D&D campaign, using D&Diffusion checkpoint.
My main question currently is** is there a way to "restrain the reach" of adjectives in my prompt to specific part of it**.
For example, I'm trying to create a Dragonborn/Lizardmen with Gold scales, wearing an armour made of black scales, which is understandably confusing for Stable diffusion.
Is it possible to "retrain the reach" of the colour in the prompt?
Another question is the effect and usage of punctuation such as quotation mark and commas. Does they have an effect? My test haven't found much, but I see a lot of prompt using at least commas.
The only think I could find about this subject is the use of exclamation mark to increase the strength of some word.
when have 0.xx format ---> 0.20 (at 20% of steps)
curvy may have a conflict with Ethiopian (stereotype) 
if the emphasis() doesn't work, you can try changing curvy .. to chest and hips
you mean [something to replace : replace with : 0.2]?
so just writing 20 doesn't work?
@simple finch
any value.... "when" refers to a percentage of the steps.....
The change will be made after x% of the steps...
the format to write a percentage is 0. xx where xx is the % .. 0.xx is between 0.01 and 0.99
Ok got it, Thankss.
It all depends on how tokenize the model, but in general in short prompts (-75 tokens) the effect is close to 0, in longer prompts the grouping effect that A1111 performs appears... when using the punctuation to separate and group "packages" of at most 75 tokens (packages have the same weight among themselves)... then variations appear since the same token will have different relative weight depending on whether it is grouped in a package of more or less tokens
ah thank you, that makes sense! I'll definitely play around with it, totally didn't think about that
Thank you for your answer.
So if I understand properly, with the default settings, the punctuation doesn't have much effect?
only if your prompt is short < 75 ..( there may still be differences due to misunderstood compound words or descriptive phrases that lose meaning) ... I recommend that you do some tests keeping seed and other parameters constant, and thus appreciate the effect of adding or not separators and/or punctuation marks