I've run into problems when I'm loading lots of models and switching between them often: my generations become corrupted. A lot of neutral gray backgrounds and sort of uncooked-looking results. When it occurs I just restart the webui. Hasn't occurred for a long while though. Actually, when I was using Linux with AMD, before I swapped to Windows 11 with Nvidia, I had to reboot the whole machine when the issue came up
#📝|prompting-help
sorry to dig up old messages. i'm combing logs to see if anyone has mentioned tips for wrangling the new a1 update
Yeah, not sure what happens, but I've noticed it likes to generate the same person for a while. For a long time I couldn't get it to stop generating the same woman wearing denim, very frustrating.
Still see denim pop up now and again, even though it's a negative prompt now, lol
100% not what I wanted it to generate, haha
Any tips on getting good/better eyes? I've made a realistic looking character and the eyes always look fake, no matter how many seeds I go through
Yes, tick the 'Restore Faces' box, game changer for me.
Thank you! I have used it before but for some reason I just forgot about it lol. I will use it
Had a shower thought -- "What about an extension that, all it does, is reads the positive prompt and populates the negative with antonyms?"
let's see
Any suggestions for getting like a smug/cocky looking smirk on a character's face? I've tried everything I can think of, but the character always has more of a cute or wholesome smile.
I'm new and learning, but I get the impression that trying to capture different emotions is going to be hard 
DAMNIT what is the antithesis of greg rutkowski?
can u b specific? any link? name of the suggested lora or TI? thx.
All the loras that i downloaded are giving me bad results, some weeks ago this wasnt the case. Today I downloaded a lot more loras and models, and now it seems to be very bad at understanding the loras i put in the prompt. Does somebody know why this is happening. Is it maybe because i downloaded too many loras or/and models?
Are you trying to get a specific character using loras?
Yes also, but i tried different loras, they all seem to not give great output
I learnt that using both a lora + TI made for the same character together gives good results. Just asking in case: which folder do you put your loras in?
stable-diffusion\stable-diffusion-webui\models\Lora here
So I just deleted all my models and loras and only redownloaded the ones I want to use now, and it looks much better.
Sorry for late answer. That's good! I still recommend combining Lora and TIs of the same character, the results are good, for me at least it was really good
Good luck 🙂
Using a1111 webui, if it matters. Is there a way to write the seed into the prompt? As well as other settings?
Hello everyone, i'm running a dnd campaign with some friends and i'd like to use stable diffusion to generate some character portraits and environmental pictures to help them visualise what i'm describing. Which models would you recommend to achieve this objective? I'm not exactly a huge fan of anime style so i'd rather use models that are as far away from this bias as possible
currently i have novel ai and f22 installed
A protogen model might do you well
i'll take a look thanks for the suggestion
where do you guys get your models from? I'm using https://civitai.com/, are there any other recommended sources?
I'm kinda new to SD tbh, i've used a few times last year and never touched it again
What's a good negative prompt for making sure the head and feet in an image don't get cut off?
id recommend charturner for more angles for the characters and RPG. I haven't tried them yet but VinteProtogenMix, Cheese Daddy's Landscapes mix, and A to Zovya RPG Artist's Tools might also help. PlanIt! may be good for specific items if you need images for those. It does great exploded and patent views. The RPG models are centered around fantasy so they might get somewhat close but its hard to say
Hey whats the problem?
Hey
I'm making tshirt design
Got this output
How do I remove the tshirt mockup??
@silver valley
Tried a lot but can't get the tshirt thing removed
I just need that lion vector
I would remove the "no" from the negative prompt and just add "t-shirt" and "shirt",
then try adding "silk screen t-shirt design" to the positive prompt, I saw someone using this and it worked well
You could cut out the lion and put it into img2img
You mean photoshopping it out? Then img to img?
Yea if you want a similar image with only the lion
Alright Got yaa
But you cant get the exact same image without the shirt
That'll be okay, just need something really similar
Yea then img2img
Thanks man, really appreciate
Happy to connect 😉
No problem 🙂 hope it works 👍
Good day, I am trying to generate representative images / illustrations for scientific papers, e.g. if a paper is about a fly, it would make sense to show a fly. So far so good. But it would also be good to represent the scientific area, e.g. neuroscience, or the concepts, such as "Feeding, Motivation, Brain circuits, Neuronal activity". To be honest I am not quite sure what it should look like, but the model didn't seem to include that or related things. And I do want to avoid anything that looks like text. If I ask it to generate a photograph with a camera model it seems to generate, say, a nice fly. But can I go beyond that? Ideally this would be without a human in the loop. Does anyone have some advice or resources? Thank you.
Composition is really nice, what sort of prompts did you use to get the center in focus like that with the rest blurred?
without any human in the loop, that's quite some challenge for now.
First of all, you can do this, but I wouldn't go without any supervision at all.
You managed to have the picture of the fly, and a simple prompt like "wildlife photography of a fly, 70mm, taken for national geographic" for example will give you that, 90% of the time.
I'm not sure I understand what you tried that didn't work. The thing is, some ideas just won't fit into such a prompt template: "wildlife photography of a Brain circuit, 70mm, taken for national geographic" for example will just be junk. To cover a wide range of subjects, you will need to describe what you want inside the picture in quite different ways. The prompt can't just follow one template every time.
So that's where I think you would need a GPT type text model, that would be the "interpreter" between the human giving the subject, and SD making the picture. The model would build a corresponding prompt for your SD to draw it.
And the last piece I would add to that puzzle: a CLIP feedback. You'll get lots of results, and if you want to automate this without human supervision, you're going to need some model like CLIP that will make a "caption" of your different possible pictures, and select the one that has the largest score/chance to fit the bill.
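A rough sketch of that selection step, for anyone who wants to try it. This is not a real CLIP call: the vectors are placeholders for what a CLIP text encoder and image encoder would return, and `pick_best`, `min_score` and the file names are made up for the example. The idea is only that each image gets a similarity score against the text and anything below a cutoff is rejected.

```python
import math

def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)

def pick_best(text_vec, image_vecs, min_score=0.0):
    """Return the name of the best-matching image, or None if all score too low.

    text_vec:   embedding of the subject text (placeholder for a CLIP text embedding)
    image_vecs: dict of image name -> embedding (placeholder for CLIP image embeddings)
    """
    if not image_vecs:
        return None
    best_score, best_name = max(
        (cosine(text_vec, vec), name) for name, vec in image_vecs.items()
    )
    return best_name if best_score >= min_score else None

# Toy 2-d vectors standing in for real embeddings.
candidates = {"fly.png": [0.9, 0.1], "junk.png": [0.1, 0.9]}
print(pick_best([1.0, 0.0], candidates, min_score=0.5))
```

The cutoff is what lets the pipeline throw away a whole batch when nothing resembles the subject, instead of always keeping the least bad picture.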
Adding some human in the middle though, you can make this quite fast once you get how prompts for different subjects work, or ask for chatGPT's help
Thank you for your feedback, that is very useful. I am using ChatGPT to extract the concepts and research subject. How would suggest using it to help with the prompt?
I used it for help in prompt making in the past.
First of all, I explain to it what SD is, then I show example of prompts to it, and then I ask to give me prompts for a given idea.
#1019361238234443776 message
you can adapt this by mostly changing the example prompts with things that relate to your field though
Great, thank you. I will try that. And also the idea with using CLIP is a good one to follow up on. That would for example help where the resulting image no longer resembles a fly.
Do you have a suggestion on how to avoid images that contain text (or something that looks like text)?
it's hard, even with a tidy prompt, to get 100% good results, so having a way to parse the bad one seems essential here
negative prompting "text" works quite well for me
But if it still goes through, it's usually because you have some tokens in your prompt that inspire text. like "journal cover" would push text in. any token used in a context that would include text usually will tend to do that
Is anyone here using Latent Couple and has seen/resolved this kind of artefact? I see this a lot on my generations where two sets of eyes are being created (seemingly photo-realistic ones inside a larger illustration/anime-esque eye).
I am trying to generate a city. What prompt can I add so it's viewed from the ground?
I've tried a bunch of stuff but it just keeps doing this eagle eye type of view from afar
I think I've seen someone else with this issue and they used terms such as "street level" or "street view".
alright i will give it a go, thank you 
Sweet. That actually seemed to work!
It does add some "street" elements such as cars, but can get rid of those easily. At least I have a starting point now, thanks a lot haha
More than welcome! It might help to describe the scene more fully, such as "empty roads" or "deserted", depending what look you are going for,
I'm new to making Ai art and started using AOM3, but I'm still not getting much how to use it and how to paste generation data, is anyone free to help me?
Welcome to the fun Lucas!
If you have some generation data, you can paste it into the prompt field and there is an icon that will read all that data and set the configuration for you.
This blue icon with a white arrow.
I tried using it but the generated image is not the same
Ah, you are trying to replicate a specific image? There are a lot of things that could be preventing you from getting the same result.
Some things are... unfortunately designed, such as "Clip skip" being a global setting rather than a generation parameter. I presume they did this to keep things simpler, but it is quite a 'gotcha'.
what does the "|" vertical bar do in automatic1111? How does it differ from a comma "," when prompting?
https://github.com/AUTOMATIC1111/stable-diffusion-webui/wiki/Features read on prompt matrix
If you put [cow|dog], every step it will alternate between cow and dog
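To make the alternation concrete, here is a tiny sketch of which word would be active at each sampling step. It only mimics the behaviour described above and is not the webui's own code; `active_word` is a made-up helper and steps are counted from 0 here.

```python
def active_word(options, step):
    """Pick the option that is active at a given sampling step (0-based)."""
    return options[step % len(options)]

# [cow|dog] over the first four steps
schedule = [active_word(["cow", "dog"], step) for step in range(4)]
print(schedule)  # ['cow', 'dog', 'cow', 'dog']
```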
Scratch that, seems like Face Restoration was set to 0.5 weight for some reason.
not sure if this is the right place to ask but does anyone have any good cyberpunk-esque prompts for a futuristic city?
This bit of information might be relevant to your earlier question. https://www.reddit.com/r/StableDiffusion/comments/10c797w/what_is_eta_noise_seed_delta/
Can any of you help with getting SD to produce weapon icon artwork? For custom magic weapons in D&D. I can't seem to get it to return anything that looks like those kind of icons.
There are models and loras that can generate that kind of icon artwork: https://civitai.com/models/1239/stylized-rpg-game-icons
Depends what you are looking for specifically, but it should give you start.
anyone have any ideas on how to fix this issue?
it insists on adding reflections that are the same color as the background on the helmet edges and chest
so when i use the green screen mask tool in premiere it takes out those parts of him too
Has anyone been able to turn pictures of things like cars into illustrations? I'm having very little luck getting something close to the original image
Man, negative prompting is such a lawless wasteland. People are just all over the place. Has anyone done any testing to see if their negative prompts actually have any effect?
How to get stable diffusion to understand a particular color? Lets say i want a certain shade of pink and I want it to understand the color using hexcode... can we do it?
No you cant use hex codes,
but you can name the color and hope sd knows it. Here is a big list of colors:
https://rexwang8.github.io/resource/ai/modifiers
Is there any way to prevent the constant close ups and portrait shots? I have both of those in the negative prompt and I have medium shot and action shot weighted in the positive.
Could it be the "enhanced face" and "r/eyes" parts of my prompt are overriding the negative prompt?
how can i command
Hello, I used to come to the prompt rooms where I could see people's prompts and take inspiration from them. Is there a way of doing that? Anyone showing prompts and results?
hey !
multiple answers here
1/ no bot currently to prompt with
2/ you can get access to the archive channels with the previous prompts by checking #👥|roles and assigning yourself some
@rich oxide
FAQ: I'm new here! How do I get started and is there a bot?
Welcome! There's currently no bot on the server to generate your images. Start by heading over to #1072220168534642768 to get yourself situated and find the channels you are looking for! Please make sure you review our #✍🏼|rules-and-tos and feel free to assign yourself some #👥|roles as well! Find answers to any questions you may have at our #1072229020520947753. There are many ways of accessing Stable Diffusion, take a look at #1080946152318443610 to start your journey!
oh ok, thanks @lone badge the archive channels maybe are what I am looking for. I will add myself to a role and try
oh and Lexica seems also cool ! thanks. Was it in the FAQ ? completely missed
lots of little things in the /faq, but all are in the #1072229020520947753 for easier access. This channel, as well as #1080946152318443610 are quite up to date, we try to keep it up that way
you should find lots of links to access the AI or different subjects
guides, tools, ...
What is a good prompt to make a random background from a sketch?
random isn't the best skill of SD...
You would mostly use extensions like Dynamic prompts, to have a list of lots of possibilities and it would take one from it
Well, I have to ask that because I have a sketch here, and I need to have some form of a background as an idea for said background.
@lone badge What I want is to color this and generate a random background for said person.
let me try to just scribble it
trying for canny
scribble no good
a girl dancing in a dress, background
Steps: 20, Sampler: Euler a, CFG scale: 7, Seed: 1894469186, Size: 512x640, Model hash: 4d4f85a738, Model: Base_1.x_sd_v1-5_vae, ControlNet-0 Enabled: True, ControlNet-0 Module: canny, ControlNet-0 Model: controlnetPreTrained_cannyV10 [e3fe7712], ControlNet-0 Weight: 1.25, ControlNet-0 Guidance Start: 0, ControlNet-0 Guidance End: 1
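If you ever want to read a settings line like this in a script, a minimal sketch could look like the following. `parse_settings` is a made-up helper, not the webui's own parser, and it assumes no value contains a comma (quoted values with commas would need more care).

```python
import re

# key: value pairs separated by commas; assumes values contain no commas
PARAM_RE = re.compile(r"\s*([^:,]+):\s*([^,]*)(?:,|$)")

def parse_settings(line):
    """Split a 'Steps: 20, Sampler: Euler a, ...' line into a dict of strings."""
    return {key.strip(): value.strip() for key, value in PARAM_RE.findall(line)}

settings = parse_settings(
    "Steps: 20, Sampler: Euler a, CFG scale: 7, Seed: 1894469186, Size: 512x640"
)
print(settings["Sampler"], settings["Seed"])
```

Everything stays a string, so convert the fields you need yourself, for example `int(settings["Seed"])`.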
but like I said, adding dynamic prompt, I get lots more variety
I would need to describe a background to have anything though
and this would not make it random
perhaps I could use magic prompts to make something good.
Also, what kind of settings do you run in the canny to make that work?
this was with just the base canny settings, your picture was quite OK for it
Also, thanks man for the dynamic prompts. It's going to be great to have a thing to randomize stuff, though do you know of a function in said Dynamic prompts where you just place the description randomizer in the beginning only, and not on the rest of the prompt outside of the description?
Put in a file a list of those alterations you want, one per line. Save that txt file inside the wildcards folder, in the extension. Name it for example alteration.txt
Then use __alteration__ in your prompt and it will use one in the file at random. Put it at the start of the prompt if you want
Prepare multiple files like that and call for the one you want
Really great stuff
Like random_color.txt
And just placing the keyword within the description would just be confined there?
Or lots of things
Also, why can't I use Magic prompts and just do the same thing?
Yeah it would act as an alias and be replaced by one of the values, keeping the rest of the prompt working
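Roughly what that substitution does, as a sketch. This is not the extension's code: the real thing reads the .txt files from its wildcards folder, while here a plain dict stands in for those files, and `expand_wildcards` is a made-up name.

```python
import random
import re

def expand_wildcards(prompt, wildcards, rng=None):
    """Replace each __name__ in the prompt with a random entry from wildcards[name].

    wildcards: dict of name -> list of lines (stand-in for the .txt files).
    Unknown names are left untouched.
    """
    rng = rng or random.Random()

    def pick(match):
        options = wildcards.get(match.group(1))
        if not options:
            return match.group(0)
        return rng.choice(options)

    return re.sub(r"__([A-Za-z0-9_-]+?)__", pick, prompt)

wildcards = {"alteration": ["a beach at sunset", "a neon city street"]}
print(expand_wildcards("__alteration__, a girl dancing in a dress", wildcards))
```

Only the `__alteration__` token changes between runs; the rest of the prompt is passed through as written.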
Maybe you can, i just don't know every extension ^^
How would I just confine the Magic prompts within the description then?
Like I said I don't know how that extension works.
But the wiki seems to say it works about the same
https://github.com/adieyal/sd-dynamic-prompts
I keep getting this weird line cut in my image
anyone know how to fix that
modelshoot style, female superhero, green and red costume with flames coming from her hands, (EVA plugsuit combat armor:1.2), flying through the air, Anne Stokes, character art, a character portrait, white background, character portrait, masterpiece, (art by Alex Ross:1.2), detailed face, painting by Ed Blinkey, Atey Ghailan, Studio Ghibli, by Jeremy Mann, Greg Manchess, Antonio Moro, alphonse mucha, trending on ArtStation, trending on CGSociety, Intricate, High Detail, Sharp focus, dramatic, photorealistic painting art by midjourney and greg rutkowski
Negative prompt: (((canvas frame))), cartoon, 3d, ((disfigured)), ((bad art)), ((deformed)),((extra limbs)),((close up)),((b&w)), wierd colors, blurry, (((duplicate))), ((morbid)), ((mutilated)), [out of frame], extra fingers, mutated hands, ((poorly drawn hands)), ((poorly drawn face)), (((mutation))), (((deformed))), ((ugly)), blurry, ((bad anatomy)), (((bad proportions))), ((extra limbs)), cloned face, (((disfigured))), out of frame, ugly, extra limbs, (bad anatomy), gross proportions, (malformed limbs), ((missing arms)), ((missing legs)), (((extra arms))), (((extra legs))), mutated hands, (fused fingers), (too many fingers), (((long neck))), Photoshop, video game, ugly, tiling, poorly drawn hands, poorly drawn feet, poorly drawn face, out of frame, mutation, mutated, extra limbs, extra legs, extra arms, disfigured, deformed, cross-eye, body out of frame, blurry, bad art, bad anatomy, 3d render, (watermark:1.2)
Steps: 60, Sampler: DPM++ 2M Karras, CFG scale: 7, Seed: 1951662759, Size: 792x1224, Model: neverendingDream_bakedVae, Denoising strength: 0.75, Mask blur: 4
In my experience, yes. Describing facial features like that tends to make those concepts large enough in the result to show (by bringing them closer to "the camera").
Not quite true. You just have to describe the background as a loose concept like "interesting background", rather than something specific like "dance room background".
Well ,thanks, you may very well be right on that one, sorry for that
No need to apologise. We're all learning!
Hi, newbie here, how did the guy do this?: https://www.reddit.com/r/architecture/comments/11qe6v0/ai_is_a_game_changer_tool_for_architectural/?utm_source=share&utm_medium=ios_app&utm_name=iossmf
it would be amazing for my architecture studies and ive installed stable diffusion, but have no idea how to „edit“ sketches like this 1:1. He also seems to be using text2img, rather than img2img which ive tried and failed with
it's controlnet
the initial inspiration was to find out potential ideas for backgrounds. I've seen some anime models that can make nice looking backgrounds, seemingly at random. I wanted to know what an AI would consider a contextually appropriate background
These are generally what I was trying to have the model generate, so that I could take reference for what it thought contextually made sense as a background (and other details, such as leg positioning or clothing choice)
It's mostly pointless if the AI can't perceive the sketch and concept so you kind of have to let it run a little wild
for posterity, I used controlnet on scribble mode, most of the info should be on the png metadata
Used abyssorangemix(? can't really remember) with EasyNegative and described general actions, saying something like "most detailed background" without specifying a background
had to brute force generate like 40-50 images but got lucky with the first generation pictured above
might do better with 768x512 next time
what exactly is that? sorry if obvious but i just got into everything today 🙂
new tool, you can use it to generate poses or generate from images. A more strongly enforced img2img: basically it takes your image and sort of draws it in at every step, which forces the AI to conform to your initial image at every step
normal img2img basically just uses your initial image as its seed image, which can and will just be transformed over time outside of generally basic shapes and colors
ill look into it, thanks! is there any comprehensive guide that you know of by any chance?
I didn't figure it out from a comprehensive* guide, the controlnet page should probably have everything you need. Install it like an extension. https://github.com/lllyasviel/ControlNet
another thing i didnt get are those pickle/safetensor files, are they basically just other file formats for models?
I think so. I just slap safetensors in, I think ckpt files have the potential for malware so safetensors are always preferred.
cheers
ControlNet is a pretty powerful and advanced tool, i would suggest you to learn the basics of generating images first before trying it
thanks for the tip, anything you can recommend to try out or learn first? wanna get into it asap tbh, since right now its the time of the semester where i can benefit the most from such a tool
If you started today, i would suggest first to play around with txt2img.
Try to generate something, maybe a character from a series or a specific place. Try to describe it and learn how to adjust the prompt to get closer to the result you want.
Try different models too, download a few and test them
Try different steps, cfg scale and Resolution
oh yeah, think ive got the basics down with the prompts, been using dalle and all the other stuff in their webbrowser forms for a while now, just installing diffusion locally now was pretty overwhelming with all the new file formats and technical terms, but ive been trying out everything this whole day
that stuff is very gpu memory heavy isnt it, now im starting to feel my 3070s meager 8gb
Ah nice! For technical Questions regarding everything of local sd you can also ask in #🤝|tech-support
appreciated!
reddit seems down right now so hard to say on your architectural question
but i did quite some architectural tests with someone yesterday
controlnet is really good for that
Yea same for my 1080
yeah at the worst time of all, its basically just a sketch that is prompted into an amazing looking render
whats that?
ive got this if you wanna try
should do the trick
what is it supposed to be ?
an office interior ?
an empty interior so far, the lighter part of the ceiling is sloped upwards
It's a program that reduces VRAM usage so you can run higher resolutions and heavier tasks in SD.
You have to edit webui-user.bat and add --xformers
after set COMMANDLINE_ARGS=
then restart the bat to use it
You can also add --autolaunch after it
having some fun with it
Then you dont need to copy the IP every time
it's working OK but would work better without the fake light on the ground
yeah if you like i can look if i still have the original drawing with just the lines
no don't worry
as a proof of concept in 3 clicks
that should be quite all right
I used a style randomiser
not all are good, and it has a hard time understanding the windows here
I'll try another mode
damn thats really fucking amazing still
what model are you using? saw that the guy on reddit used realistic vision 1.4
let's try this
amazing, ill try to get the controlnet thing installed aswell then
a suburban house, RAW photo, 8k
with styles on, but those are a little strong x)
still just wtf
do i add it in the same file or into the row below?
thats honestly crazy, i can totally see myself saving hours of work with that stuff
Same line
This was using mlsd mode, best for architecture, but worse for all other things : it extracts only the straight lines
Mlsd or canny ?
mlsd here
cheers, works perfectly
this is using depth, less crazy stuff
lovely, any tips on how to tell it specifically which parts are windows for example in that interior drawing i sent, just describing where they are if it dosnt get it?
also installed controlnet now as an extension, where do i go from here
You need the models files for it from here:
https://huggingface.co/webui/ControlNet-modules-safetensors/tree/main
Download all that start with Control_
These go into the extensions/sd-webui-controlnet/models folder
which ones would you say are relevant for me there for architecture render-like stuff
sorry if im bombarding with questions rn but youre helping tremendously!!
Mlsd is for architecture but take all of them to try them
gotcha
Depth and canny and normal map can be useful too
For architecture
Openpose is for Characters
Like I said a couple of days ago, still waiting for 2.1 mlsd fingers crossed LOL
I don't feel like bothering thibaud again since he gave us scribble unannounced. I already made 1 request I don't want to push him
got them in, dont see them in the UI though
model selection rather
They are not under the normal Selection
Look at the bottom of txt2img
There should be ControlNet
Just curious...is your uni not offering ai courses now? How about school policy, do they allow a.i. renders?
bottom txt2img with a triangle icon, click to open control net.
I see some archi uni offering courses on ai
i think my uni is pretty far behind in that regard, there are no AI courses afaik but i dont see why they wouldnt be allowed since theyre just another tool in my usecase at least. Probably thered be issues if people submitted 100% ai generated stuff
@dense egret here is a simple guide for ControlNet:
https://www.reddit.com/r/StableDiffusion/comments/119o71b/a1111_controlnet_extension_explained_like_youre_5/
but like that it basically just does what one can do in photoshop just quicker
I bet that is what's gonna happen. people handing in lots of a.i. gen stuff with some photoshop fixes here and there
Are you in a uni in north america?
cheers i see it now down there, just had the wrong one enabled in the installations selection i believe. Got 2 different sources from github earlier
watch a youtube tutorial, you'll pick it up very quickly
nah switzerland, pretty backwater place here haha
if you can grasp rhino grasshopper (most archi students do), SD and control net is piece of cake
well...the initial learning is piece of cake. how to control the a.i. isn't so straight forward
a.i. still has a mind of its own. No one can claim absolute mastery on prompting/prompt engineering.
haha yeah i noticed today. Wasnt even aware you could run that stuff locally till today, everyone i know is using the basic browser stuff and that seems kinda unsuitable for professional use rn
by the way, there are plugins for blender with stable diffusion...img to 3d...or back and forth. One day it will be in rhino grasshopper (if not already since rhino is also python based)
no wait there is already
damn seems ive been barely scratching the surface of those tools
i remember seeing someone pulling some parametric a.i. image, went through chat gpt for a python script, and made a battery for grasshopper that way
I am in the field and I also feel like just scratching the surface. I mostly post the non-serious hobby stuff here.
cool, what do you do exactly?
I am in your field
switzerland isn't a backwater place for architecture study...ETH you must know it
ETH is at the forefront of architectural education
ah gotcha, nice to be able to exchange some AI knowledge for that field then haha
yeah i applied there aswell but wanted to move asap so i just took the next best thing. Might consider it for my masters if they take me, which is questionable
oh sorry this is prompting-help if you wanna chat, chat private. sorry all for spamming here.
alrighty
so im trying it rn with controlnet but it dosnt really seem to take my draft rn, ill check if theres anything i overlooked
watch a youtube vid on control net. several ones from a month ago are good beginner ones. watch like olivio's or aitrepreneur ones for example.
thanks will do
Yea Aitrepreneur has some good ones
what is precisely meant actually with "Invert colors if your image has white background", do i have to draw white on black...?
Mostly used for scribble i think
right i need a little help with my prompts, the txt is what i've got and i keep getting deformed faces, any suggestions?
Sampling Method is Euler a Steps 20
Seed -1
advice would be don't worry about the face and just do inpainting on the ones you like using this method:
https://www.reddit.com/r/StableDiffusion/comments/10xdzt9/tiptrick_this_inpainting_trick_is_so_badass_im/
It depends on the resolution of the image and how far the characters face is from the viewer. The nearer the face, the more Pixels it gets for a good face
So close up and portrait will give you the best faces. Also Upscaling can help with images like yours
how can i create a prompt to generate an idea for a mascot in my highschool?
does the resolution slider actually influence the results? i noticed when im setting it to 1000x1000 rather than 800x800 it always offsets the floor down in controlnet interior sketch renders
Anyone using Deforum? I'm having a hard time getting my character to change clothes. For example: in frames 0 - 15 the character should be wearing a suit, and from 15 - 25 a jacket with normal clothes. But the character keeps the suit on the whole time instead and the clothes don't change
What's the exact differences between (prompt), ((prompt)), and (((prompt)))? I think I've seen [prompt] too. Pretty basic question but I don't think I've seen it explained anywhere formally.
The number of "()" sets how much weight you are giving to the enclosed word. The more pairs, the more weight on that part of the prompt. "<>" is used for extra networks like hypernetworks and LoRAs. "[]" does the opposite of "()" and lowers the weight. Hope this helps. 🙂
Yeah that's sort of what I figured, the only reason I was wondering was because there's the prompt of (prompt:1.2) ,which is what I usually use instead of double parentheses. I guess I can narrow my question down further to what the differences of (prompt:1.2) or whatever number vs (((prompt))), having never used triple parentheses before.
I am not 100% sure, but I think the ((())) came first, and were later replaced by (prompt:1.2).
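They both do the same kind of thing: each pair of () multiplies the weight by 1.1, so (((prompt))) works out to roughly (prompt:1.33).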
Ahhh I see
But don't take my word on it - I am guessing here.
The important thing to know is that they both raise the weight in the same way, only the amount differs.
Yeah I'm watching some videos, made in the recent weeks, and the prompts that I'm seeing mainly have triple parentheses in some of them so I'm guessing the artists use them out of habit. Was unsure if they do the same thing as (prompt:weight)
I don't use ((())) anymore because it's easy to mismatch them.
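For comparing the two notations, here is a small sketch that turns a single bracketed term into a number. It follows the rule documented for the a1111 webui (each "()" pair multiplies by 1.1, each "[]" pair divides by 1.1, and "(text:1.2)" sets the weight directly). `bracket_weight` is a made-up helper that handles one term only, not a full prompt parser.

```python
import re

ROUND_MULT = 1.1        # each ( ) pair multiplies attention by 1.1
SQUARE_MULT = 1 / 1.1   # each [ ] pair divides attention by 1.1

def bracket_weight(term):
    """Return (inner text, weight) for one term such as '((x))', '[x]' or '(x:1.2)'."""
    term = term.strip()
    # explicit form: (text:1.2)
    explicit = re.fullmatch(r"\(([^()\[\]:]+):\s*([0-9]*\.?[0-9]+)\)", term)
    if explicit:
        return explicit.group(1).strip(), float(explicit.group(2))
    weight = 1.0
    # peel off one matching pair of brackets at a time
    while len(term) >= 2:
        if term[0] == "(" and term[-1] == ")":
            weight *= ROUND_MULT
        elif term[0] == "[" and term[-1] == "]":
            weight *= SQUARE_MULT
        else:
            break
        term = term[1:-1].strip()
    return term, round(weight, 4)

for example in ["(prompt)", "(((prompt)))", "[prompt]", "(prompt:1.2)"]:
    print(example, bracket_weight(example))
```

So triple parentheses land a bit above (prompt:1.2), and the explicit form is easier to read and harder to mismatch.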
I have built an app based on Stable Diffusion that lets users write prompts and generate AI images. I hold this model on AWS S3, and run the inference on EC2 instances. I want to let users use the prompt writing syntax made available in AUTOMATIC1111/stable-diffusion-webui so they can have fine control over the weighting of each token, but i want to build my own prompt-writing UI. Does anyone know where in the WebUI repo i can grab just the syntax, and not the UI?
WebUI is a Python project, so the prompt parsing should be in the Git repository. Have a look at modules/prompt_parser.py.
I am going CRAZY with the many embeddings used in CivitAI prompts that I can't find anywhere.
I end up with washed out results that don't match the sample images, and I think it might be due to the missing embeddings.
I have searched CivitAI for them, but no luck.
When the AI images end up washed out compared to the prompt sample images, does it mean I am missing an embedding? I need some help here.
Anyone?
Good morning everyone!
Hello 😄
How do I go about making prompts and making something that looks attractive? Is there a guide and general list of negative words to get started with?
i would like to generate a cheerful man who is running at a summer event throwing paint bombs with other colleagues (friends) and has already been hit by a paint bomb somehow. I don't know what to write to make the paint run down his body / to have him covered with it. Anybody?
or possibly water bombs
have you tried throwing phrases that describe it and seeing the results?
Hello,
Does anyone know what type of prompt I can use to tell Stable Diffusion to create a new image resembling the style of another image?
- I have this black lady in a graphic novel / comics style (not being a native speaker, I'm not sure if this best describe the style I'm looking for),
- And I would like to tell Stable Diffusion to create a male yoga practitioner, a thin and lean female ballet dancer, and other characters who are all different in sex, size, etc.
- but they should all share the same drawing style. Is there a way to do that and for all the generated images to look as if they had been drawn by the same artist?
is there any way to create clean line art of a pic? tried canny but doesn't work well
hey all, is it generally better to use underscore in prompts instead of a regular blank space?
anyone have an idea how I can make this hair go up less and look more rounded?
hey guys, how come my ControlNet doesn't remotely resemble the annotator results?
Edit: just realised I didn't set up the models properly, should be good now.
anyone that knows how I can make the body? I can't just do it like this cuz I don't have enough vram
I am trying to find a way to isolate concepts. a dress made of vines or red car as part of my prompt without vines taking over my entire image or everything colored or tinted red. or stop being unable to get white hair because an earlier part of the prompt mentioned black in a completely different context.
use weight on those terms and use underscore to connect two terms together, which isn't perfect but it can yield better result
like (red_car:1.2) (white_hair:0.9)
photo of a Woman in (dress made of green vines)1.3 walking down a street, woman long red hair, and there are vines everywhere and the hair has a green tint.
woman long red hair at least got the hair somewhat red, long red hair got taken over completely by the green
But there are vines everywhere and the street is ignored entirely in favor of vines for a background.
try photo of a Woman in (dress made of green_vines)1.3 walking down a street, woman long_red_hair
If we were to go back in time it really seems like () should have been used for concept grouping, ie (red car) to ensure the red car is a single concept not red and car separately.
Underscores helped a bit with getting the green out of the hair but green still everywhere else and the background is still just vines and no street
photo of a Woman walking down a street, wearing a (dress made of green_vines)1.3, woman long_red_hair still has no street, just green in the background, but photo of a Woman walking down a street, woman, long_red_hair has a street, so it's not that my model can't do streets.
I should note this whole time the dress has been more green cloth than 'made out of vines' to begin with, despite vines in the background, but I didn't feel I could push the made out of vines part until I get the other things under control.
Ok, seems like the vines were overweighted. Lowering to 1.1 got just some vines on a street but still just a green cloth dress. I can't get the dress to be made of vines or the street to not have vines on it.
FAQ: How do I use the Stable Diffusion website & what do all the settings do?
Check out these videos for a great overview of the website and how to use it! https://youtu.be/014J2Yo1aGI | https://youtu.be/M3jCa6qTHpQ
why not use inpaint?
I can't get inpaint to look as good as if it were generated originally. Always some kind of visible seaming or pattern disruption (lines bending oddly, slightly mismatched color tones, etc.) Or does weird things due to the mask shape not being pixel perfect (shapes that were fine become slightly odd as it tries to conform prompt to the contours of the mask). That may be me doing things wrong, of course.
How to generate two subject with different attributes ? eg: Left woman with blue hair and right woman with green hair … can it really be done in prompts ? Or is there any other tool to help with that ? The purpose is not to generate two subjects , the purpose is to generate two subject with different attribute , hair color , dress color , etc.
You could generate the two people next to each other and then color the hair manually.
Latent couple is an extension that lets you use multiple prompts for different part of the picture. This would be my go-to solution for this
I tried this out and it seems hit or miss. Maybe I am doing it wrong but I can't get super reliable stuff out of it. Also would prefer a way to do this without yet another plugin in the workflow, though that may not be possible. Still think this is what parenthesis should have been used for.
It's mainly a limitation of the models still. Like sometimes, saying something is blue makes everything blue. This will progress with time, but right now, without more conditioning like that, I'm not sure how to do it
I find it really surprising this was not something anyone put in at the very base level when SD syntax was first being developed.
Some other techs have helped since then, like clip guidance, but I have more hopes on next models to solve this.
how can I get a black non-glare material for armor in a prompt?
any recommendations for image to text pls?? thanks in advance)
https://github.com/Extraltodeus/multi-subject-render Try this to see if it'd help somehow?
There's no way around the problem that SD tends to mix objects when they have different attributes (even different animals in the same prompt are often very challenging and luck-based). At least you can't get around it with prompt engineering or built-in settings.
or try latent couple and paint a blue blob left and green blob right and prompt accordingly.
i'm working on a project and i need help creating a prompt, if anyone is very good at prompting feel free to send me a dm
is it possible to control the "UNet Weight" and "TEnc Weight" in the prompts when using Lora?
how would i make a prompt that just adds NFS unbound effects to the images for a batch image convert in automatic 1111
▷ Stable Diffusion
▷ Prompt: a professional photo of a lauging american god warrior, muscle, young, (hdr:1.3), very beautiful hair, hyperdetailed, cinematic, warm lights, intricate details, hyperrealistic, dramatic, complex background, (muted colors:1.2), old Hollywood movie,modelef3
▷ Negative prompt: obese, (deformed, distorted, disfigured:1.3), poorly drawn, bad anatomy, wrong anatomy, extra limb, missing limb, floating limbs, (mutated hands and fingers:1.4), disconnected limbs, mutation, mutated, ugly, disgusting, blurry, amputation
▷ Steps: 100, Sampler: DPM++ 2M Karras, CFG scale: 9.5, Seed: 1615613750, Face restoration: CodeFormer, Size: 512x768, Model hash: 74da23045a, Model: modelef3
▷ Model Download: https://drive.google.com/file/d/1rRW1VX_psfLwryn6IZJirSnPNU0tJl4c/view?usp=sharing
this seems like a nice model :
#1047197565365538826 is the place to share those around here, you'll have lots more visibility
Most people host their models on civitAI or on Hugging face for free, it may be more fitting than a google drive link, long term
is there a list with negative prompts for more specific things? ik there is a pinned list but thats focused around people and i feel if i use it with animals it might get messed up
I am trying to create a reflection of a old bar in a mirror I haven't seen my first result yet but wouldn't mind having some advice if you already did something like that in the past
Any ideas on how to make my character look from a distance? I want to make a scene where the character is seen from afar first and gradually gets closer, I'm sorry if my English isn't 100% correct, thanks
Aerial view perhaps
Anyone can tell me how to create a negative prompt with stable diffusion when you only have a "command line" interface? What's the syntax? Thx
Is there a log anywhere that outputs your prompts/seed for the images? I know I can click on them afterwards and see but I accidentally reset it.
There's no such log in any webui, from what I can tell.
If you lose the prompt there's no way to get it back other than your memory
Ah damn. I appreciate the reply though.
I think that's a nice thing node based UIs are gonna offer that the whole UI setup will be saved in your img in a different file format.
ola
hello !
Where can I find denoise strength in automatic1111? Is it the noise multiplier option in the settings?
its in txt2img if you select highres fix, or in img2img
Thanks!
I was trying to add a monkey hanging off a building with inpainting and I absolutely couldn't get anything but random shapes or slight variations on the background (depending on strength). Nothing remotely monkey.
Anyone have any ideas what the difference is between the interrogation models for generating prompts from an image, or where there exists documentation for any of it? I've got a giant list of options and downloading them one by one and testing them is not fun, lol. Some of them are quite big too.
well, it's a little like for upscaling models, all are a little more specialized on detecting specific pictures.
This was a nice HF space to compare some that existed, but you can't just load your list in it to test them :/
https://huggingface.co/spaces/nielsr/comparing-captioning-models
for the two tone hair tag, how do i specify which two colors i'd like? do i just add two other colors to the tag list and it'll work like that?
for example lets just grab two random colors doesnt matter would it go like this
(two-tone hair, red, blue) or do i need to do (two-tone hair), red, blue ?
Oh nice, well it's something at least. I've been captioning things manually because the interrogators have just been so awful, but it's becoming such a chore, especially with large datasets. I wish there was more detailed documentation about these things.
same, maybe one day, but it takes time, and things move a lot so guides get deprecated fast too
ik there are a lot of ways to fix hands but what's the best most efficient method?
inpaint full res with nothing more than luck on the seed
or, longer I think, using depth map controlnet on top
oof
are there any models trained on hands?
any tips on how to get 2 different persons in one prompt? whenever I try to have 2 different faces, they kinda get merged into two persons with the same face 🤔
the models aren't really good with that part, yeah. tokens get mixed together. ask for something blue and maybe everything will be.
Using targeted conditioning, you can do such things. The first extension that comes to mind is "latent couple", which lets you use multiple prompts in the same picture and target specific parts of the picture with each prompt
Ah yes I heard of latent couple, will try that out
is there a syntax list for prompting in SD anywhere? I'm using automatic1111. Just found out about [ | ] last night and would like to see the full list if there is more I don't know about. Also would like to know if commas are important when describing the same item, i.e. wild long blonde hair in tight messy bun vs blonde hair, long hair, wild hair, messy bun, etc.
There is also the Cutoff extension for getting better control of colours applying: https://github.com/hnmr293/sd-webui-cutoff
Be warned with Latent Couple, it does not seem to work with Loras very well. Any Lora will be applied to the entire image. The extension "Composable Lora" should help with this but it does not, in my experience, work at all (there is still "leaking" of Lora from other subprompts and it tend to burn the image). You can avoid it by setting the weight very low (e.g. 0.2) but then the Lora does not apply much at all.
thanks. will check it out
How to install and enable it?
If it isn't in the extension list in the UI, you can do Install from URL tab in the Extensions tab.
It will then show up as an expandable area in the UI.
my eyes and face are almost always messed up, past a certain point of complexity in the prompt
model: 2DN
prompt: "woman on beach, hyper detailed, perfect face, detailed eyes, (full body)"
negative prompts: "easynegative, bad-artist, bad-image-v2-39000, bad_prompt_version2, bad_quality, ng_deepnegative_v1_75t, verybadimagenegative_v1.1-6400, vile_prompt3"
40 sampling steps
if i remove full body from the prompt and let it just draw the face the faces are really good (4th image)
i seem to have this issue with almost every model i use though so it might be something with my settings? i'm not sure
restore faces is off
sometimes it's this bad
Any tips for making a photo look "amateurish"? Like it was taken with a phone camera, or a still from a home video.
The first image is closer to what I'm looking for, like it was taken by a phone camera by someone standing with the group. But 95% of the results I'm getting are like the second, where the composition looks extremely fake and phony(in a non-AI way), like something from a fashion catalog or professional photoshoot.
I'm not getting any changes from adding 'amateur', 'amateur photo', 'cameraphone photo', 'home video', etc. Or from negative weighting 'professional', 'photoshoot', 'model', etc.
Use inpaint to do the faces
With an inpaint model
How do I prompt for certain characters from certain series? I've tried San from Mononoke and the prompt provided was san (Mononoke hime) . Is this the syntax: [character name] ([series name])
i'll try that out ty
Also I highly recommend InvokeAI for inpaint/outpaint jobs
Try searching for models that are trained on those TV characters? Other than that I am no prompt master Xd
i'll try that one out ty
i'm not sure if the syntax matters that much as long as you prompt it early on enough, add () if you need to add importance to it
Ok thank you
i keep getting backgrounds even if i specify no background as the first prompt, how do i make sure it doesn't do a background?
Hello! Anybody know of an architecture prompt that can get close to this? https://i.pinimg.com/564x/70/fd/a7/70fda734f757ff55c7445762e39f4a9e.jpg
That's normal for images that haven't been upscaled
The nearer the face to the viewer the better the face
Portrait and Close up gives the best result
i see
i was told to do inpaint, could i get closeup level quality from inpainting?
Yea it will get better but not on lowres. You cant give it more pixel than it has available for the face.
But then upscaling will fix it
Hi, how to create an Image from a face on artbreeder? I have the face, now I want to create a photo of guy in a shirt, having that head, standing in a garden. Every prompt I do creates weird 'art'
can anyone tell me what a value does after a word e.g. ponytails: (1.2) < this value, or what it is used for. thanks!
https://github.com/AUTOMATIC1111/stable-diffusion-webui/wiki/Features (attention/emphasis section)
Does more sampling steps mean better image. Ik they typically do but where do I get to a point of diminishing returns. Recently I’ve been doing 300-1000 but I feel like it’s too much
M1 Pro suddenly cannot start stable-diffusion. It used to work fine before, but now it is giving the following error in the attachement
images typically get to acceptable levels at 20 steps. I typically go between 20-50. You can see your images' details, composition change throughout each step. I find few advantages going beyond 50-100. Higher steps don't give me more details, just different kinds of details. If you want a high res, high detail image, go through a proper high-res fix and then img2img, and extras' upscaling mix of upscaling methods. Read yesterday's discussion on general-with-image around 6pm on ultimate SD upscaling extension to start
often at higher steps, the image look less like the prompt i wrote, or sometimes it sways to certain words on the prompt that are not what i'd consider important, even if I use :numbers to emphasize some words. so diminishing return to me often start when I go beyond 50 steps
Wouldn’t having more sampling steps decrease deviation from the prompt and home the image more towards the prompt and negative?
not always the case in my uses. Have you noticed that it got more accurate? I often find what I want between 20-50 steps, then beyond it justs wanders off to lalaland
No, I just assumed that if it was checking its work more then it would be more accurate, but I have noticed lots of the examples for the models were generated with fewer than 100
has anyone figured out what i have to tell gpt for it to give me a prompt that is formatted correctly?
my current attempts have been hit or miss despite providing 8 example prompts
that was having correct results for me at some point, but not that great either tbh : #1019361238234443776 message
never thought about including that bit about dall e
the only problem is that it doesnt know about TI, loras, and models
although for gpt4 someone could make a plugin that sent it the image and told it what the model was called and the activation phrase since it can understand images
This is the best one ive gotten yet
Prompt: A peaceful forest scene with a waterfall in the background, photorealistic, realistic, lush foliage, (8k:1.5), modelshoot style, serene, tranquil, highly detailed, (waterfall:1.25), crisp focus, (sun rays:1.5), dappled light, misty, atmospheric, autumn colors, (animals:1.2) grazing in the meadow, wildlife, art by Thomas Kinkade
Negative prompt: Disfigured, kitsch, ugly, oversaturated, grain, low-res, deformed, blurry, bad anatomy, poorly drawn, mutation, mutated, extra limb, missing limb, floating limbs, disconnected limbs, malformed hands, out of focus, long neck, long body, surreal, childish, mutilated, mangled, old, text, b&w, monochrome, conjoined twins, multiple heads, extra legs, extra arms, fashion photos, meme, distorted, abstract, psychedelic, grotesque.
ok i think ive found the sweetspot
I didn't try to use it with LoRA yet.
The one you got is quite nice there!
im hoping to get the gpt4 api and experiment but it just depends on how long itll take for them to approve my access
its still struggling to include prompt matrixes but i think im almost there
Couldn't find a "no-dumb-questions" channel so I'll go ahead and ask here. I see that SD 2.1 is generally argued to be quite powerful once you learn to use negative prompts. I'm using the huggingface diffusers library and note that you can use negative prompts for SD 1.5 as well. Is this indeed the case? Can you be succesful with negative prompts in 1.5? If not, why not?
You can use negative prompts with any model from my understanding
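On the "why does it work with any model" part: as far as I understand it, the negative prompt is not a trained feature. In classifier-free guidance the sampler compares a prediction for your prompt against a prediction for an "unconditional" prompt, and the negative prompt simply takes the place of that unconditional one. In diffusers that is what the `negative_prompt` argument of the pipeline call feeds. Below is a toy sketch of the arithmetic with plain lists standing in for the noise predictions; `guided_noise` and the numbers are made up for illustration.

```python
def guided_noise(cond, uncond, scale):
    """Classifier-free guidance: move away from `uncond` towards `cond`.

    `cond` is the prediction for the positive prompt, `uncond` the prediction
    for the empty prompt (or for the negative prompt when one is given), and
    `scale` is the CFG scale.
    """
    return [u + scale * (c - u) for c, u in zip(cond, uncond)]


positive = [1.0, 0.0]          # toy prediction for the positive prompt
empty = [0.0, 0.0]             # toy prediction for an empty prompt
negative = [0.5, 1.0]          # toy prediction for a negative prompt

print(guided_noise(positive, empty, 7.5))     # no negative prompt
print(guided_noise(positive, negative, 7.5))  # pushed away from the negative
```

Since this happens at sampling time, it applies to SD 1.5 checkpoints as well; whether 2.1 responds better to negatives is a separate question about how it was trained.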
i see - so it just works better with SD 2.1? Any reason why? Did they change anything when training to enable that?
idk about that Xd, I just know it works
Thanks! Any clue which channel is best to ask about technical details like that?
You're in the best place for that already, just donno the fancy technical stuff perhaps others do.
Ok, i'll hang out a bit then, thanks again
np
either try "leaning towards" or use the Controlpose and controlnet extension from auto1111 if you use auto1111 that is
.....
I pulled something off
But idk how I did it
And my words are not working
I tried those words but no results

Even upped the weights
Hello, I get so frustrated with bad hands. Is there any prompt to consistently get the hands off screen or behind the character?
Try something like "Hands behind back/head".
"Hands on hips" does also work well in many scenarios.
And Idk but maybe "Hands out of Frame" in positive or "hands in frame" negative
thanks I am going to try
hello to all ! rookie question !!! there is any tutorial to explain the logic and the structure of a good "prompt" 🤓
I don't know, for me it was more learning by doing.
Looking at the picture and thinking of prompts that could help.
Or you could look at others stuff from https://civitai.com/ and so on.
And ofc some information from https://github.com/AUTOMATIC1111/stable-diffusion-webui/wiki/Features
Especially interesting the explanation of the sequences and ( ) etc
thank you for answer
Also, look at this good guide in the pinned section : #📝|prompting-help message
I would say though, yeah, at first, experimentation is key
this is very good !!!
How to make it work/look better.... i typed "Minecraft Steve" and this is nothing like it
Minecraft Steve finding diamonds in a cave
How do I stop the AI from rendering only my character's legs?
(((full body))) doesn't help
I need some understanding about loras
if I dont use the lora code but only the trigger word
Will it still get me the character from the lora or not?
by only using the trigger word?
better put some weight on it
like that, I guess
lora:stLouisLuxuriousWheels_v1 is trigger word and 0.8 is weight
no. it wont work with Lora. tried.
thanks guys
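To make the distinction concrete: in the WebUI a LoRA is loaded by a tag of the form `<lora:filename:multiplier>`, and the trigger word is just ordinary prompt text that the LoRA was trained to respond to, so the trigger word alone does not load anything. Here is a small sketch that pulls such tags out of a prompt. The function name `split_lora_tags` and the example prompt are made up, and the regex only covers the simple `name:weight` form.

```python
import re

# Matches <lora:name> or <lora:name:0.8>; other tag variants are not handled.
LORA_TAG = re.compile(r"<lora:([^:>]+)(?::([0-9.]+))?>")


def split_lora_tags(prompt):
    """Return (prompt text without tags, list of (lora name, weight))."""
    tags = [(name, float(w) if w else 1.0) for name, w in LORA_TAG.findall(prompt)]
    text = LORA_TAG.sub("", prompt)
    text = re.sub(r"\s{2,}", " ", text).strip()   # tidy leftover double spaces
    return text, tags


text, tags = split_lora_tags(
    "portrait of st louis, <lora:stLouisLuxuriousWheels_v1:0.8> red dress"
)
print(text)
print(tags)
```

If `tags` comes back empty, the prompt contains only trigger words and no LoRA will be applied.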
Could someone help me for this effect?
https://www.tiktok.com/@proteinique/video/7210483375159741702
I've found a way with "Prompt Traveling", but even when I write good prompts, the results are totally random and not like the images I've generated.
With Seed Travel it is also absolutely random.
Does anyone have a good way to create those videos with more control?
you can "re-use seed"
How do you do that?
Is there someone I can dm I just have a few questions
ask here so everyone can help xD
Alright
So
When I generate a character
The rendering looks good
Like all the way but when it finished it looks like crap
Why is that
Like the end result could have been perfect but somehow got worse at the end
What do I do please help
hey guys, any ideas about how to make a "mask possession scene" like in the movie "The Mask"
I'm trying to try to create something like a scene where a mask is possessing someone and this person desperately tries to remove it from their face
Try less steps? The preview is from one of the first few steps
like how many?
There’s an extension that allows you to save all the step images. Or, just guess and check. Or use x/y/z plot to compare a range of Steps/CFG settings, etc
Xyz is at the bottom in Scripts dropdown
ty
do you have a guide or something for the extension?
I haven’t actually used it but it’s in Extensions tab. Probably straight forward. There should be a link to the github there
One of the many things I keep forgetting to get around to
This is assuming you are using Automatic1111
yup
#1072220168534642768 a boy
how do i stop getting creepy faces like this?
That is pretty typical of lowres gens. SD upscale will fix this no sweat. Check out the scripts under the img2img tab, choose SD upscale, and set your denoising no higher than 0.3 or 0.4. Less denoise = more of the original image, more denoise = less of the original image.
This is the fix for lowres faces and eyes, and pretty much everything else that might be 'close but not quite'.
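A rough way to see why low denoising keeps more of the original: in diffusers-style img2img pipelines, the strength decides how far back into the noise schedule the image is pushed, so only about `steps * strength` of the requested steps are actually run. The sketch below shows that arithmetic; `effective_steps` is a made-up helper, and the WebUI may count steps differently depending on its settings.

```python
def effective_steps(steps, denoising_strength):
    """Approximate number of denoising steps actually run in img2img.

    Assumes the diffusers-style rule int(steps * strength), capped at `steps`.
    """
    return min(int(steps * denoising_strength), steps)


for strength in (0.25, 0.5, 1.0):
    print(strength, effective_steps(40, strength))
```

At strength 1.0 every step runs and the source image is essentially replaced, which matches the "more denoise = less of the original" rule of thumb.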
Is there any difference between how it handles (nature, stars, moon, tent) and (nature), (stars), (moon), (tent)?
That's an excellent question. I'm not sure, but it wouldn't be hard to test. Using the same prompt, settings and seed, just replace the first version with the second and see if the output is affected.
I'll give it a shot and see if I notice anything. Thanks.
Z will make a separate grid for each z value, consisting of the x/y combos
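As a sketch of how that layout works out: every Z value gets its own grid, and inside each grid every X value is combined with every Y value. The helper `xyz_jobs` below is hypothetical and only enumerates the combinations; it does not call the WebUI.

```python
from itertools import product


def xyz_jobs(x_values, y_values, z_values):
    """Map each z value to its grid of (x, y) combos, listed row by row."""
    grids = {}
    for z in z_values:
        # Rows follow y, columns follow x, like the grid image.
        grids[z] = [(x, y) for y, x in product(y_values, x_values)]
    return grids


# Example: Steps on X, CFG scale on Y, sampler on Z (values are arbitrary).
jobs = xyz_jobs([20, 30], [5, 7], ["Euler a", "UniPC"])
for sampler, grid in jobs.items():
    print(sampler, grid)
```

The total image count is `len(x) * len(y) * len(z)`, so it grows quickly once all three axes are in use.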
I only get different poses with controlnet when I put my denoising strength on 1, but then it completely deforms my init img...
i tried it and it doesn't help, i still get those really creepy faces
you trying for anime specifically?
yeah
Do you have fix faces enabled?
fix faces is for real human faces, bad to use on anime
what model are you using?
yeah turn it off
I'm curious as to your prompt and whether you're using negatives
here's my positives
brown hair, brown eyes, 1 boy, standing, small breasts, fully clothed, gothic, bangs, black oversized hoodie, black pants, brown shoes, single color background, Masashi Kishimoto, long hair, cute
and here's my negatives
blurry, big breasts, girl, (deformed:1.3), (blurry:0.9), poorly drawn face, mutation, mutated, worst quality, bad quality, twins, multiple people
ooops i left breasts in there
You have a lot of color descriptors tied to items in there - the model will usually blend those together and yield unspecific results. There's an extension called Cutoff that you can use to fine-tune the effect of a token's permeation through the rest of the prompt. Works really well from my testing.
Let's rewrite that prompt a bit -
((blurry)), nsfw, out of focus, large breasts:1.1, 1girl, (deformed:1.3), poorly drawn face, poorly drawn eyes, lowres, ugly, mutation, mutated, worst quality, bad quality, twins, 2girls, 3girls, 4girls, multiple people
This is closer to how I'd format a prompt. It may or may not help.
ohhhh i see thank you
so try to be more descriptive?
much better
not perfect obviously
but still
Kind of? It's more a matter of how you format the prompt. Also using a lot of colors in quick succession can throw things off - I recommend looking into the Cutoff extension. It has shown to be excellent at isolating specific colors / tokens and keeping them from being pervasive elsewhere in the prompt.
^ as you can see the hoodie and boots did not come out right. Tinkering with their position in the prompt could help. Cutoff could also help a lot with this.
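(hello) emphasizes the word, [hello] de-emphasizes it. In the WebUI each pair of ( ) multiplies attention by 1.1 and each pair of [ ] divides it by 1.1. (hello:1.3) emphasizes the word by 30%, (hello:0.5) de-emphasizes it by 50%; the number needs the parentheses around it to count as a weight.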
o ok
You should check out the #1072013871730131004 channel
Lotta awesome folks in there that love to help out
And a ton of cool prompting ideas are shared all the time!
thank you very much
You could inpaint the faces
If I mask the area of the face and use something like "perfect face", then I usually get good results.
hello, any negative prompt to fix or hide mutated fingers? my negative prompts don't seem to work recently
((bad hands)), missing finger, fewer digits, ((mutated hands and fingers)), ((mutation)), (bad_prompt:0.8)
thats what i used currently
Hello, can someone help me how to get a result like this using model and prompts?
I like how it looks like aquarelle
Has anyone encountered this error before?
[DEADLINE_EXCEEDED] Invalid prompts detected
my prompt seems to be just normal words?:
A fierce pirate with a heart of gold. He has an eyepatch, a hook for a hand, and a peg for a leg.
for prompt optimisation I can recommend to try out PromptPerfect (https://promptperfect.jina.ai/)
Check out they have reverse from image to prompt thingy now https://twitter.com/hxiao/status/1638265632742416386?s=20
Any pointers using 3D openpose+ controlnet? When I supply 4 controlnet models (canny/openpose/depth/normal) scaled to the same res as my image this is my result.
I'm trying to create more consistent hands/feet on my models
Seems like very few have discovered this, but if you want to get your prompt, seed number and other settings used to generate an image in text2image in Stable Diffusion (Automatic1111), all you have to do is open Notepad or another text editor and drag your picture into it. All of that information is coded into the header at the beginning of the image file.
to add to this - you can use the PNG info tab in SD to do this, or use the "image browser" extension as well. Just learned that today.
You can also activate "save json with generation data for each image" in the webui settings
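If you'd rather pull that text out with a script than with Notepad, here is a minimal stdlib-only sketch that walks the PNG chunks and collects the text entries. It assumes the WebUI stores the generation info in a `tEXt` chunk under the key `parameters` (that key is my assumption here), and it skips compressed or international text chunks. `read_png_text` is a made-up name.

```python
import struct

PNG_SIGNATURE = b"\x89PNG\r\n\x1a\n"


def read_png_text(data):
    """Return a dict of keyword -> text for every tEXt chunk in PNG bytes."""
    if not data.startswith(PNG_SIGNATURE):
        raise ValueError("not a PNG file")
    chunks = {}
    pos = len(PNG_SIGNATURE)
    while pos + 8 <= len(data):
        # Each chunk: 4-byte length, 4-byte type, data, 4-byte CRC.
        length, ctype = struct.unpack(">I4s", data[pos:pos + 8])
        body = data[pos + 8:pos + 8 + length]
        if ctype == b"tEXt":
            key, _, value = body.partition(b"\x00")
            chunks[key.decode("latin-1")] = value.decode("latin-1")
        elif ctype == b"IEND":
            break
        pos += 12 + length
    return chunks


def read_parameters(path):
    """Read the assumed 'parameters' entry from a PNG file, or None."""
    with open(path, "rb") as handle:
        return read_png_text(handle.read()).get("parameters")
```

Calling `read_parameters("some_image.png")` on a WebUI output should give back the same prompt and settings text the PNG Info tab shows, provided the image was not re-saved by a tool that strips metadata.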
any way someone can help me on how you get this specific style? 
Where did you find the pictures at?
@sullen saffron https://civitai.com/models/11457/jinx-league-of-legends I believe this is correct. That's the info for the right image.
Yeah, reverse image search on both images and just browsed a bit.
ooh thank you!
lovely reflexes
Epic manga mermaid
I have a question: is it possible to change only the white background but leave the pictured object (a fire extinguisher) untouched? I want to test the possibilities for product-related AI, but I don't know where to start, because I'm a fresh beginner in AI picture generation.
For a start, just a background would be nice, but I fail^^
I'd be glad if anyone has some tips for me👍
this isn't the best tool for that, it will tend to change the object too
you would need to have a mask, and the AI can ignore what is masked
but if you go that way, you need a precise mask, and even then, the AI isn't good at making colors out of white background.
You'd better generate backgrounds and paste it on it maybe
ok thx 4 answer
there is an option I'm missing^^ an AI-based background changer
maybe one day...^^
hum
well I see options
the thing is, the fire extinguisher will change a little bit still
like, using controlnet would help a ton to keep the details, but some would still shift, like the inscriptions
i am a developer and contributor to models. if you guys want to participate in a challenge: show me a person partially obstructed by something like a wall. for example, a person peeking around a corner, or a person hanging over a wall, so that you would only see their head, torso and arms
it doesn't look like this is possible, even with controlnet.
it is easy enough to get someone hanging off a wall from behind, sitting on top of a wall, etc.
but in none of these situations is the person obstructed
this is quite the nut to crack
indeed, i can't think of any synthesized image yet where the person was partially obstructed by a large thing
i have seen characters hallucinated in the background, but not this sort of midground object
rough mockup
that generates useful controlnet preprocessed images
I get some correct ones using segmentation mode
upper body, a videogame character climbing a ruined wall, pixel art
Steps: 10, Sampler: UniPC, CFG scale: 5, Seed: 1371007450, Size: 512x512, Model hash: c4947fa2c4, Model: DB_mix_realbiter_v10, ControlNet-0 Enabled: True, ControlNet-0 Module: segmentation, ControlNet-0 Model: controlnetPreTrained_segV10 [b9c1cc12], ControlNet-0 Weight: 0.9, ControlNet-0 Guidance Start: 0, ControlNet-0 Guidance End: 0.8
not 100%
but some
and to be honest, if you look at the segmentation picture :
you should edit it in paint and make it from that
with no preprocessor
just using segmentation model
also using "upper body" in the prompt
yeah i just made some progress with depth too
segment is a good idea i haven't tried that yet
i do have upper body
I want to make this look much more realistic, I'm using the latest version of realisticVision model
Especially with the water
The clouds in the back need more variation, and the beach near the water needs more details, like mussels or a crab walking around, maybe some light weeds. Imho it's a bit too clear. But a great picture overall 🙂
i think some postprocessing can do it
this is great yeah
Wdym?
Like img2img?
I've never done postprocessing before
realistic ultra detailed portrait photograph of fit woman, real skin, (Highly detailed beautiful face), hdetailed eyes, luscious lips, (moody lighting:1.2), depth of field, bokeh, 8k uhd, dslr, HDR, by (James C. Christensen:1.2|Jeremy Lipking:1.1),(sharp focus:1.2),
Why did the system give her hideous purple skin?
Looks like she comes right out a fight
Please how can I create this.
The prompt is above. It was an accident. I wasn't trying to do it.
Does anyone have an idea how to get better fur, or can share their experience with that? I just started with AI pictures, so I'm a beginner, and I'm trying to make a fox. It's very cute and nice, but getting fluffy fur is hard. https://prompthero.com/BalrogDx Sometimes it's like little spikes, sometimes it looks too animated and the fur pattern repeats. Does anyone have an idea to make it better?
Hey guys,
I've a question about Seeds.
I want to have a fixed Seed and 2 Images.
But both Images should be different.
Is it possible to do something in the prompt like "Background Desert|Snow"?
So that only one of them is used per image?
Then I could generate images until I'm happy with one 🤔
this photo taken by Leica M11 and Lens 35MM F/2, an indian 60years old man wearing white shirt and white dhoti and wear a white towel in his head and standinga alone by side of the road, in the light rain.
Hello, does anyone know how I would go about removing facial hair from a face? Should I use inpainting?
You can use the X/Y/Z plot script and try Prompt S/R
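To make the Prompt S/R suggestion a bit more concrete: as far as I understand it, Prompt S/R takes a comma-separated list, searches the prompt for the first entry, and renders one image per entry with that text swapped in, all on the same seed. Here's a rough Python sketch of just the text substitution (the function name `prompt_sr` and the fox prompt are made up for illustration; this mimics the idea, it is not the webui's code):

```python
def prompt_sr(prompt: str, terms: list[str]) -> list[str]:
    """Mimic Prompt S/R: the first term is the search text, every term replaces it in turn."""
    search = terms[0]
    if search not in prompt:
        raise ValueError(f"{search!r} does not appear in the prompt")
    # One prompt per term; the first result is the unchanged original prompt.
    return [prompt.replace(search, term) for term in terms]


for variant in prompt_sr("a fox sitting in the Desert, fluffy fur", ["Desert", "Snow"]):
    print(variant)
```

So for the desert/snow case you would write the prompt with "Desert" in it and put `Desert, Snow` in the Prompt S/R values field.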
2023 one modern woman, clear skin, Brown Eyes, Brown Eyebrows, Blonde Wave Long Hair, Apricot Lips, Model, Very Realistic, High Definition, 85.0mm lens, ƒ/1.6, ISO 100, Canon EOS 5DMark IV, dramatic Focus Out, Gray Lighting Background --ar 1920:1080
A stunningly beautiful girl. Cinematic in nature, this hyper-detailed scene is filled with insane details and beautifully color graded using Unreal Engine. The use of DOF, Super-Resolution, Megapixel, Cinematic Lightning, Anti-Aliasing, FKAA, TXAA, RTX, SSAO, Post Processing, Post Production, Tone Mapping, CGI, VFX, and SFX has created an insanely detailed and intricate world. The hyper maximalist approach and hyper realistic Volumetric and Photorealistic rendering bring out the ultra photoreal and ultra-detailed aspects of the scene. With 8K and super detailed visuals, this scene bursts with full color and Volumetric lightning, using HDR to create a realistic and breathtaking environment. Powered by Unreal Engine and rendered in 16K with sharp focus, the intricate details of this scene are truly mesmerizing.
so what's the trick to ignoring the color composition in img2img?
like "I just want this pose, completely change everything else" and I'm also using controlnet
in the same vein, how do you get controlnet to not make everyone bald and naked if you're using something like Magic Poser as the base pic
Any model/lora suggestions for trying to make visual novel style backgrounds?
are there some guidelines around the best way to describe character direction
ie
- from behind
- from front
- from left side
- from right side
or should it be something like
- facing left
- facing forward
etc?
What are you prompting exactly?
Chinese martial arts, movie backlight
@tiny thistle
FAQ: How do I generate images? Is there a bot on the server?
Currently, there is no bot on the server that generates images. However, there are plenty of other ways such as the official https://beta.dreamstudio.ai/ website or by running Stable Diffusion locally using your own hardware! Check out #1080946152318443610 for more details! You can also stop by #1025467151206854736 for any issues you experience while using DreamStudio or #🤝|tech-support for any problems you encounter while installing it locally!
Question about prompt syntax,
What do brackets do, if used around a single word, without weight numbers or anything?
If it's a way to group terms, like (red, cape), (blue, suit), then what advantage does it have over commas?
without numbers, it acts as if you had 1.1 as a weight
(cape) = (cape:1.1)
(a whole lot more in the wiki on that)
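If it helps, here's a tiny Python sketch of how the un-numbered brackets stack, going by the wiki: each pair of ( ) multiplies attention by 1.1 and each pair of [ ] divides it by 1.1. The helper name `nested_weight` is just something I made up, and it only handles a single token wrapped in brackets with no explicit `:number`:

```python
def nested_weight(token: str) -> float:
    """Effective weight of a token wrapped only in () or [] with no explicit number."""
    weight = 1.0
    while len(token) >= 2:
        if token[0] == "(" and token[-1] == ")":
            weight *= 1.1   # each () pair boosts attention by a factor of 1.1
        elif token[0] == "[" and token[-1] == "]":
            weight /= 1.1   # each [] pair reduces attention by a factor of 1.1
        else:
            break
        token = token[1:-1]  # peel off one layer of brackets
    return round(weight, 4)


for example in ["cape", "(cape)", "((cape))", "[cape]"]:
    print(example, nested_weight(example))
```

So a bare word is 1.0, one pair of parentheses is 1.1, and two pairs is 1.21.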
Wasn't my question but I appreciate that link haha. It was very helpful. 👍
Sorry what's the question then hehe ? If you use parenthesis, they will be interpreted as weights, even without numbers.
You can add \ before a parenthesis to escape it, so it's used in your prompt as a literal parenthesis. I'm not sure that has a big impact, though, and the grouping you want won't apply either way; the models would need more training on that kind of prompt structure first.
You can currently use Latent Couple, an extension that lets you use multiple prompts at once, each targeting only part of the picture, to help group some tokens on a given subject. I haven't seen that kind of thing possible natively through prompting alone yet
Oh no no, I was just saying I wasn't the one that asked anything. Pretty sure you still answered their question just fine.
oh my bad ^^ well happy to have helped then x) I was between two things and didn't notice it was someone different
Awesome. Thank you 🙂
What prompts help me to change the distance to the object?
those tend to work to some extent
Do the acronyms work as prompts? or just the full phrase?
nope, the acronyms don't work, use the longer version
this infographic is for something completely different than SD
but it shows well the different terms
usually, photography terms work quite well
using different lens focal lengths in the prompt, for example, works wonders
thanks, I'm trying to use it to generate Heroes of Might and Magic type character art, but it keeps lopping off their heads
70mm won't give the same perspective as something else for example, I don't remember all rn
have you tried :
1/ using img2img on very high denoising ? example with a sword that kept being out of frame : https://www.reddit.com/gallery/yjnak3
2/ use controlnet with openpose model ? that help force picture composition, and even choose the pose of the character
but to stay prompt-focused, the earlier tips are useful to me, as well as "square picture, centered subject" sometimes
Thank you :))
Thanks! I'll take a look at those!
some others exist. I often use "half body shot", and "upper body" for the upper part of someone
I have been focusing the past month on fantasy character portraits, but now that I have switched to landscapes of high-fantasy worlds, I am having difficulty with finding a good model/checkpoint or Lora Style. Not sure if anyone has any suggestions, here.
Ideally something where I wouldn't have to make very bloated prompts in order to achieve a somewhat consistent result.
Does anyone know how to create art like this in Stable Diffusion? Which model to use and what prompt?
Hi guys. I'm totally new to AI Stable Diffusion today. Had a long day of fun learning how to install models and things and had some attempts at some illustrations and some came out nice.
I'm actually a digital painter and have currently painted this image. https://prnt.sc/NjjCqaNqxhOl
I was hoping to figure out a way to populate the background and foreground with trash and broken, old, overgrown carnival rides. Does anyone know what might be the best prompts/models to use for this? I'd like it to stay in my painting style of course.
Just a very quick example of what I mean. I've found backgrounds to be pretty challenging so what I do is just edit in something close to what I want like this and then use img2img to slowly iron out the rest.
Thanks a lot. 🙂 That's actually what I was trying to do today (img2img with ControlNet). With the results I was getting, it looked like I had just added some random image into the background. No style copying, no color matching, etc. I'm sure there is a way to get it like I painted it. Just trying to figure it out.
Yeah, I'm not sure how well it'd pick up a detailed background like that but img2img is pretty good at picking up stuff like architecture styles, common room themes, nature, and what not. Just takes a lot of messing with all the settings.
Any ideas how I can have SD draw out like cartoon burrito mascots? Nothing ive put is working so far.
from a few days ago - yes i sorted out a workflow that works quite well
this was the journey to trying to position the character
it was actually given to me as a challenge. it's very hard to get the hands not to look like feet lol
at this stage i actually have a pipeline where everything in unity is runtime rendered to pixel art
naturally, not realtime
not yet
has anyone come across a library that translates a json style schema to prompts?
ie a rough example (which would require something like LatentCouple):
{
"canvas": {
"height": 512,
"width": 512,
"depth": 100
},
"global": {
"description": [
"an analog photo of a couple walking on a city street", "golden hour", "morning"
]
},
"objects": [
{
"position": {
"x": 140,
"y": 105,
"z": -50,
"width": 100,
"height": 160
},
"description": [
"a man wearing a black suit and top hat ", "1800s renaissance fashion", "smiling", "waving"
],
"globalWeight": 0.5
},
{
"position": {
"x": 220,
"y": 105,
"z": 20,
"width": 120,
"height": 180
},
"description": [
"a woman wearing a grey frock turning at waist", "1800s renaissance fashion", "laughing", "dancing"
],
"globalWeight": 0.5
},
{
"position": {
"x": 0,
"y": 256,
"z": 0,
"width": 50,
"height": 160
},
"description": [
"a cropped 1800s era building", "dust and grime"
],
"globalWeight": 0.7
}
]
}
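I don't know of a library for this either, but the translation step itself is small. Here's a rough Python sketch under some assumptions: the function name `schema_to_prompts` is made up, the sub-prompts are joined with " AND " (the separator Latent Couple style setups use between regions, as far as I know), the boxes are turned into fractions of the canvas, `z` is ignored, and `globalWeight` is passed through untouched because its meaning isn't defined above. Mapping the boxes onto the extension's own region settings is left out.

```python
def schema_to_prompts(schema: dict) -> dict:
    """Flatten the JSON-style scene description into prompt strings plus region boxes."""
    canvas = schema["canvas"]
    global_prompt = ", ".join(s.strip() for s in schema["global"]["description"])
    regions = []
    for obj in schema["objects"]:
        pos = obj["position"]
        regions.append({
            "prompt": ", ".join(s.strip() for s in obj["description"]),
            "weight": obj.get("globalWeight", 1.0),
            # Box as fractions of the canvas: (left, top, right, bottom).
            "box": (
                pos["x"] / canvas["width"],
                pos["y"] / canvas["height"],
                (pos["x"] + pos["width"]) / canvas["width"],
                (pos["y"] + pos["height"]) / canvas["height"],
            ),
        })
    combined = " AND ".join([global_prompt] + [r["prompt"] for r in regions])
    return {"global": global_prompt, "regions": regions, "combined": combined}


# Shortened version of the schema above, with only the building object.
example = {
    "canvas": {"height": 512, "width": 512, "depth": 100},
    "global": {"description": ["an analog photo of a couple walking on a city street", "golden hour"]},
    "objects": [
        {
            "position": {"x": 0, "y": 256, "z": 0, "width": 50, "height": 160},
            "description": ["a cropped 1800s era building", "dust and grime"],
            "globalWeight": 0.7,
        },
    ],
}
result = schema_to_prompts(example)
print(result["combined"])
print(result["regions"][0]["box"])
```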
Hello everyone, hope you're all well. I've been messing about with Stable Diffusion for a few days now and I was wondering if I could get directed to some guides for prompts, or if a kind soul here had a few tips. When I have a look at CivitAI and the generation data that's used for the images on there, the way it's laid out is like a foreign language to me. Just want to know how I can turn my simple fantasy man prompt into something a bit more fantastical! Thanks for any help!
and if there's something blatant on this server that I've missed that explains this then I do apologise
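On the generation data looking like a foreign language: below the prompt (and the negative prompt, if any) it's mostly one long line of `key: value` settings separated by commas. Here's a small Python sketch that splits such a line into a dictionary so each setting can be read on its own. The helper name `parse_settings` is made up, the sample is a shortened settings line of the kind posted in this channel, and values that themselves contain commas are not handled:

```python
import re


def parse_settings(line: str) -> dict:
    """Split 'Key: value, Key: value' pairs into a dict (values with commas are not handled)."""
    pairs = re.findall(r"\s*([^:,]+):\s*([^,]+)", line)
    return {key.strip(): value.strip() for key, value in pairs}


settings = parse_settings(
    "Steps: 10, Sampler: UniPC, CFG scale: 5, Seed: 1371007450, Size: 512x512"
)
for key, value in settings.items():
    print(f"{key:10} -> {value}")
```

Steps, sampler, CFG scale, seed and size are the same controls you see in the webui, just written out as text.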
I saw these from a MJ AI user and thought they were really neat. Is this type of imagery easier on MJ than stable diffusion? What type of prompts would get this effect?
anyone know how I can improve their faces? I've tried upscaling and inpainting, but that technique causes errors in SD.
Do you use prompts like "perfect face, detailed eyes, both eyes the same, detailed face" or negative prompts like "disfigured face, ugly eyes, imperfect eyes, skewed eyes, unnatural face"?
only in the negative prompt, I'll try putting them in the prompt too
i see people using ( )s or (( )) and :1.1 :1.4, what do those do exactly?
() puts more attention on that part of the prompt, same idea with :1.2 etc
Using the [Alternating | Prompts] thing in Automatic1111's webui, is there a way to make it lean more towards one of the two alterations?
I've tried adding like [((Alternating)) | Prompts], which does change the image.. But I don't know if it's actually working in the way I want, or if it's just.. Coincidence?
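A thought on leaning one way: going by the wiki, alternation just cycles through the options in order, one per sampling step. So the extra parentheses change the attention weight of that word, not how many steps it gets. One workaround that might be worth trying (I haven't verified how it looks in practice) is repeating an option, e.g. [Alternating|Alternating|Prompts]. Here's a quick Python sketch of the step schedule; the function name is made up and it only models the cycling, not the sampler:

```python
from collections import Counter


def alternation_schedule(options: list[str], steps: int) -> list[str]:
    """Which option is active on each sampling step, cycling through the list in order."""
    return [options[i % len(options)] for i in range(steps)]


even = Counter(alternation_schedule(["Alternating", "Prompts"], 20))
biased = Counter(alternation_schedule(["Alternating", "Alternating", "Prompts"], 21))
print(even)    # each option gets half of the steps
print(biased)  # the repeated option gets two thirds of the steps
```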
are there any guides (websites) i could learn from?
Are you using Automatic1111's webui?
Well, if it is, a good place to start is https://github.com/AUTOMATIC1111/stable-diffusion-webui/wiki/Features.
It goes over a lot of the stuff, like the ( )'s and :1.1's and all that.
Outside of that, I've just been looking around on Youtube for a lot of stuff!
FAQ: How do I use the Stable Diffusion website & what do all the settings do?
Check out these videos for a great overview of the website and how to use it! https://youtu.be/014J2Yo1aGI | https://youtu.be/M3jCa6qTHpQ
Does anyone have some different prompts for exposure?
The background is better exposed for me and the subject is very faint, you can't see any details, what prompts would you try?
how do i tell the AI not to generate this texty thingy?
using "text" in negative prompt mostly.
Those do happen from time to time, more or less depending on how your prompt calls for them (passively for sure, but like a "magazine" would have text for example)
I use "text, name, watermark" as negative prompts for those issues
i have
Negative prompt: text, signature, ui, user interface, symbol, sigil, watermark, name
And it's still on the picture?
Maybe Inpaint could help? But Idk
A captivating and elegant photograph of a 1930s female influencer, radiating glamour and sophistication in a classic Art Deco setting. The subject, adorned with luxurious attire and accessories, is posed gracefully against a backdrop featuring geometric patterns and gilded accents. The photograph is taken using a vintage Graflex Speed Graphic large format camera, paired with a Carl Zeiss 135mm f/3.5 Planar lens, known for its remarkable sharpness and signature rendering of tones. The image is shot on Kodak Tri-X 400 black and white film, imparting an authentic and timeless atmosphere. The camera settings are meticulously chosen to create a striking portrait: an aperture of f/5.6 for a balanced depth of field, an ISO of 400 for optimal tonal range, and a shutter speed of 1/60 sec to capture the subtle nuances of the subject's expression. The scene is lit with a classic Hollywood-style lighting setup, featuring a key light positioned at a high angle, casting dramatic shadows that accentuate the contours of the influencer's face, and a fill light to gently soften the shadows and reveal intricate details. This evocative and nostalgic portrait perfectly embodies the essence of the 1930s era, transporting viewers to a time of elegance, opulence, and unmistakable style.
Hoping someone has some guidance for this.
Hi! I currently have Automatic1111 with SD v1.5 installed locally, and I'm trying to reproduce locally some results I got on Adobe Firefly, but I'm not really getting there. What should I do? Let me show examples:
pencil drawing of a human ear, close-up
That's my prompt on Adobe Firefly and the overall look I'm wanting to achieve in SD
if I input this same prompt locally, the results are these
playing a little more with the prompt I'm able to reach some other things:
human ear, handdrawn, pencil drawing, black strokes on white background, detailed, patent-style drawing, academic drawing, realistic
my question is: how do you guys think I could get closer to Firefly's results, in terms of overall art style (Firefly's seems more "pencil-y") and framing (SD tends to crop the subject out of the frame)?
Thanks a lot!
Best sampling method for people?
guys, check out this test I made.
I used the same prompt in Dall-E 2, Adobe Firefly and SD 1.5 (Automatic1111), "photo of a human eye".
what can I do in SD to start to approach Dall-E's and Firefly's results? I tried adding lots of details to my prompt in SD, tried messing with CFG scale, steps and sampling methods, but nothing worked in that direction.
what would you guys try in SD to achieve that more realistic, less zoomed and less saturated look?
thanks a lot!
you should try a different model, for example Realistic Vision, Deliberate, DreamShaper, Protogen
I guess I don't know exactly how to switch models. I installed the 1111 following a youtube tutorial, not so much of an expert. can you please help me here? just linking to some article or help page would be nice. thanks a lot!
You can use different models for different artstyles, genres etc,
They are listed here for example:
https://civitai.com/
If you download a model (.ckpt or .safetensors, usually over 2 GB), it goes into the models/Stable-diffusion folder
After that you can reload the webui and select the model from the dropdown
oh, I see. will try something like that
just so I can understand it better: what model am I using right now?
you're using the official Stable Diffusion 1.5 model
ok, and it should be better with dreamy images in opposition to realistic, is that it?
the official model has no specialization in what it's trained on, it's basically a mixture of half the internet. It can generate everything, but nothing all that well xD
custom models like Realistic Vision are trained on high-res, realistic images, so you'll get that as output too
oh, I see. will do some research. thanks a lot!
no problem 🙂 you also need negative tags to get better results
yup, trying to make them work as well (but SD generally just ignores them in my experience :/ )
here is a human eye, made with realistic vision
I bawl
might be a stupid question but is (keyword) equal to (keyword:1.0)?
I'm trying to get a character that's sort of Eliot Ness meets Punisher, but sci-fi. Getting some interesting characters, but it's not really hitting the sci-fi elements very well.
#1047610792226340935 tesla
hi, I'm wondering if there's a way to prompt changing the color of the hair without changing the color of the eyebrows or pupils altogether
Is there a) a basic tutorial/resources for prompting with stable diffusion. I'm constantly getting bad results. 😂 And is there b) a more specific tutorial/resources for prompting to create logos?
I'm really struggling to get pictures of non ultra muscular men, using all the prompts I can think of and yet every image of a male is gigachad lol
Any tips?
Is it possible to generate a low quality version of the image while I'm prompt engineering?
My specs are bad and it takes a lot to generate an image
Why is my euler a model generating bad faces
Same. It seems to be generating lower quality images than yesterday.
How can I go about generating something like The Thing from Fantastic Four? Like a humanoid with Rocky skin?
My girlfriend is taller than me, how to force AI to make her taller than me on the generated pictures? Maybe only 5% of my pictures can handle that if I use words like taller/ shorter/ smaller
Hi guys, I trained a LoRA on my pictures. During generation the previews at 50% and 75% look perfect, but the final image doesn't. Is there a way to stop the image at 50% or 75% and work from that step?
Hi guys, I'm trying to create a shape that would have the surfacing/style/aesthetic of this image... could someone please help me with the prompt?
how would you build the prompt?
I am trying to find a way to say "single wing" in a negative prompt that will not result in preventing ALL wings. Anyone have a suggestion?
I want it to feel free to add wings where necessary, but not to only add one because that's stupid.
I would include that image as part of an img2img prompt since there's no clear way to describe that that wouldn't confuse an AI (and an NI, for that matter!)
you can screenshot it while it's on that step
help, I tried to change the expression in this image through inpaint sketch, and circles formed around the mouth that aren't blended properly. Could anyone help me with the prompt?
There's a setting for outputting in-process images, but I don't remember what it is so I didn't answer. There are a tonne of reasons why someone might not want to work with a screenshot, not the least of which is "then you have to manage to crop it exactly".
(I am not sure the setting is exposed in Automatic1111 -- I was using colab when I used it, and that was several months ago)
The prompt is not going to help with that. That is a limitation of the AI itself. You'll need to postwork if it's not getting it right.
Fortunately, Gimp and Krita are free, so it won't cost you a penny.
(Your other option is to modify the code to try and find out why it's doing that and then fix it, but if you asked this, I don't think you're on that level yet.)
How do I tell if Im invoking the wildcards properly?
is there any prompt that gives the general feel of a model. like something basic that can be compared with others to see the differences?
I see thanks for the reply
Anyone know prompts for this image?
Hello, could you help me figure this out? I wanted to put the Awaiting Tongue Module, but my Stable doesn't see it, what's wrong?
Any idea how to force AI to make my girlfriend taller than me? None of them can get it 🤷
Not even controlnet?
Can someone help me with prompting arm holding food and giving it to character in front of the viewer? It's meant to be POV
is there a prompt to focus only on the body, like full body, instead of the background?
actually I meant generating it only with the prompt, without editing. I want to make that picture.
ControlNet is only an extension, and with openpose you can even change the pose of your generated characters and also change their height
FAQ: Why are my images blurry?
In order to ensure a safe experience, the DreamStudio website has a NSFW classifier that will detect and blur any potential NSFW images. While in most cases the classifier will appropriately identify NSFW images, there may be occasional false positives due to the nature of how these systems work. We will continue to work on and improve the classifier to make false positives less and less likely! You are not charged for any images that are blurred
can try with dblx embedding
i'm looking to place 2 characters next to each other, wondering if i can use 2 loras for them
I just saw this, but one thing you can try is putting the image into img2img with a low denoising strength. That can sometimes paint over seams from inpainting. It doesn't always work and may alter other details, but it is worth a try.
Upscaling can do the same thing.
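For anyone doing the low-denoise img2img pass through the API instead of the UI, here's a rough Python sketch of the request body. Assumptions: the webui is started with the `--api` flag and the body is POSTed to `/sdapi/v1/img2img`; the field names `init_images`, `prompt` and `denoising_strength` are how I understand that API; the helper name, the placeholder bytes and the 0.25 default are just illustrative. It only builds the payload and doesn't send anything:

```python
import base64
import json


def build_img2img_payload(image_bytes: bytes, prompt: str, denoising_strength: float = 0.25) -> dict:
    """Build an img2img request body; a low denoising strength keeps most of the original image."""
    if not 0.0 <= denoising_strength <= 1.0:
        raise ValueError("denoising_strength must be between 0 and 1")
    return {
        # The init image is sent as a base64-encoded string.
        "init_images": [base64.b64encode(image_bytes).decode("ascii")],
        "prompt": prompt,
        "denoising_strength": denoising_strength,
    }


# Placeholder bytes stand in for the contents of a real PNG file.
payload = build_img2img_payload(b"placeholder image bytes", "portrait, smiling")
print(json.dumps(payload)[:80])
```

If the seams are still visible you can nudge the strength up a little, at the cost of more changes elsewhere in the picture.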
online help website, health clinic, premium modern design, colors - white, blue, green, blue.
Minimalism
anyone?
you can try 'detailed eyes' but the best way to do it is with upscaling
thanks for replying
np. SD has trouble drawing things small enough to fit, so if you can generate a larger base image or upscale it, it will often look a lot better. Most upscaling methods sorta 'redraw' as it goes to smooth things out, too
does anyone know how to fix this issue where the camera is like lower than the character? i use openpose as well but idk how to change it to be the same height
I encountered the same issue when moving or stretching it too far.
You could try using the Posex extension. It's the same but in 3D
Any tips for generating just one single realistic looking eyeball?
how do i use highres fix? i haven't been able to find an extension or anything similar
nevermind im blind
whats the best overall upscaler to choose for highres?
Would the blender plug-in solve this or does it just send a 2d translation? I’m guessing the latter and therefore prob same issue.
Would be cool to have some camera info as a controlnet model though. You could supplement the depth and normal maps to essentially map it in 3D space
Like depth2points in nuke
Yea exactly, in Blender you just create an image too
But i saw a new extension that looked like 3d Blender in the webui and its not the Posex extension
Sounds cool. Would love to see it.
Does anyone know if there is already a model trained on this style, and tips to recreate it? This is from my absolute favorite artist, Chiara Bautista, and I really want to make some art in her style. Before I start training myself I'd like to know if there's already a method.
can someone explain how to use the wildcard extension? I'm kind of confused
does anyone have knowledge on the difference of prompting in img2img and text2img
they seem like they need different styles of prompting
Should I use very detailed captions? I used BLIP captioning, but it described all my images very vaguely, e.g. "a woman taking a picture of herself in a mirror", without describing what she is wearing, etc. Is there such a thing as over-detailing the captions? Also, one caption says she is taking a picture in a mirror when she's actually leaning against a glass wall. Do I mention the glass wall, or is that over-detailing it?
hello, i need help with the model Counterfeit-V2.5: what are "the magic words" to get long ponytail hair? (?? 🙏 )
hey for me this worked with Counterfeit:
thanks i hope it works for me..
I've been trying like mad to find a way to prompt something like these, the illustration style, and I just can't get there. These are all generated by SD 1.5, but the artist hasn't shared their prompts (https://tengyart.ru/gorod-nakhodka-v-nejroseti-stable-diffusion-1-5/). Can anyone point me in the right direction? I don't think the artist is using a model, but I honestly don't know.
How to make something as beautiful as this one?
like this? sry cant get her running xD
it depends on the model used and the tags
also highres fix does the rest
Can you recommend me a tutorial for that?
Does anyone have tips on making better use of a textual embedding? I'm using the RealisticVision2.0 model with my own embedding and the results, while beautiful, only bear a passing resemblance to the intended subject.
How to make it so that there are two characters who interact with each other and are totally different? Writing red hair woman, and yellow hair man it totally mixes, once two people have red, once both yellow, etc. I used "Latent Couple" also, but when the characters interact with each other there are still problems.
heya, i need some help to get a detail right in my images. i'm trying to create an image where the girl is holding a monster energy can, but even after figuring out how to make the hands look decent, the can keeps coming out malformed. can't seem to find a good prompt to stop that from happening https://i.imgur.com/PQt5XmZ.png
you can look at the Promptbook for basics:
https://openart.ai/promptbook
Upscaling is a different thing
Anyone knows how to fix the leg on the right? tried drawing it in photoshop and inpainting and this is the best result
Is there a better way to zoom in on inpainting instead of just ctrl+scroll to zoom the whole window?
I'm using the api to generate images but sporadically I get these type of images back (like 2 halves of an image in 1 image). Is there any way to avoid this?
Anyone know what to write in the prompt to get this style of art? I’ve been trying on and off for months to get smth similar to use for my characters, but i just cant get a similar look
Why are some details enclosed in brackets in the prompt?
Hi all. If I have an image of a person, is it possible to change their clothes, hair, background etc while keeping them the same person?
bump
Hi guys, need little help with prompt generated by chatgpt:
The image shows an ice-filled cooler with an array of different beer bottles and cans inside. The cooler is surrounded by a large group of friends smiling and laughing, with some holding beers in their hands. The sun is setting behind them and there is a beautiful, clear sky in the background. The scene is filled with joy and anticipation as they look forward to the weekend ahead
I'm trying to repeat the same thing but with more correct keys, but I can't draw people around a beer box
you can try using inpainting
you can try adding this TI to your prompt https://civitai.com/models/4629/deep-negative-v1x hopefully that can fix it
it's emphasis: the higher the number, the stronger https://github.com/AUTOMATIC1111/stable-diffusion-webui/wiki/Features#attentionemphasis
in Easy Diffusion I can use {} to generate multiple prompts, e.g. portrait of {old, young} woman. That would result in two images, one old and one young woman. Is there a way to do this in Automatic1111?
Hi all, I'm looking for help with my prompt
I want to make a book cover for children. How can I limit the API output to content suitable for children under 12, for example, so I don't get anything weird?
Yes with the x/y/z script in txt2img and then select Prompt S/R, Hover over it for more info
thanks, I'll take a look
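If you'd rather keep writing prompts in the {old, young} style, the expansion is easy to do yourself and then feed the results to the webui one by one. Here's a rough Python sketch based only on the behavior described above (one prompt per option); the function name `expand_braces` is made up, and nested braces or commas inside an option are not handled:

```python
import itertools
import re


def expand_braces(template: str) -> list[str]:
    """Expand every {a, b, ...} group into one prompt per combination of options."""
    groups = re.findall(r"\{([^{}]*)\}", template)
    choices = [[option.strip() for option in group.split(",")] for group in groups]
    # Swap each group for an empty placeholder, then fill in every combination.
    skeleton = re.sub(r"\{[^{}]*\}", "{}", template)
    return [skeleton.format(*combo) for combo in itertools.product(*choices)]


for prompt in expand_braces("portrait of {old, young} woman"):
    print(prompt)
```

With two groups in one template you get every combination, so the number of prompts grows quickly.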
You want to make a cover with a child on it?
No, I have a description of the cover, and I don't want weird things like NSFW or anything like that in the result, because the image is for children.
Hm then add a lot of negative tags in the prompt, like nsfw, nude, gore, blood, etc
It's the opposite that I want lol
In the negative prompt of course
So that you don't get that
What webui are you using ?
I'm using php curl
Okay but whats your output ?
Like the API needs an interface i think
I dont have much plan of apis sry
I've already a pro plan 😛
I get different results
But for example :
I think image like this isn't for children
Yes for sure xD
I need image for this example, a kraken like this :
It's more for children haha
I want a prompt that gives more images like the second one
wow, that's really powerful, thanks for showing me
Still not what I want
Yes you can also compare everything with it 😄 np
This is the prompt : Kiri, the majestic kraken and Lira, the beautiful mermaid in the depths of the ocean. 4k, 8k, smooth brush book cover, disney style, cartoon
The kraken has disappeared haha
Do you have an idea @silver valley ?
What model are you using?
I have the version if you want: 436b051ebd8f68d23e83d22de5e198e0995357afef113768c20f0b6fcef23c8b
Or what is the best model for my project ?
Thats just a hash?
Yes
Maybe try Dreamshaper or Mo-Di
Do you have a link?
Dreamshaper is pretty versatile:
https://civitai.com/models/4384/dreamshaper
Mo-Di just aims for a modern Disney-like style
https://huggingface.co/nitrosocke/mo-di-diffusion
hmm what can I write in the version input? @silver valley
```php
$data = [
    "version" => "436b051ebd8f68d23e83d22de5e198e0995357afef113768c20f0b6fcef23c8b",
    "input" => [
        "prompt" => $prompt,            // Prompt for the model
        "guidance_scale" => 8,          // Scale for classifier-free guidance, range 1 to 20
        "prompt_strength" => 0.8,       // Prompt strength when using init image. 1.0 corresponds to full
        "num_inference_steps" => 50,    // Number of denoising steps, range: 1 to 500
        "num_outputs" => 4,             // Number of images to output, range: 1 to 4
        "seed" => 3940025417,           // Used to limit randomness of output
        "height" => 640,                // Height of output image
        "width" => 448,                 // Width of output image
    ],
];
```
why are you using the stable diffusion cli version ?
That's what I found when I was doing my research haha
ohh okay, you may want to use an easier program with more features like the Automatic1111 webui for SD
the cli version you use supports only the bare minimum for creating images
if you have a nvidia gpu with minimum of 4gb vram you can just install it
Here is a tutorial:
https://www.youtube.com/watch?v=VXEyhM3Djqg
I'll call my server operator haha
you can install that on a server too with a docker container
also works on linux
here is the source for the sd webui:
https://github.com/AUTOMATIC1111/stable-diffusion-webui
Okok, but it's for a personal project with a friend, and right now I'm at work.
yea if your pc at home can handle it, you can install it there
Thanks a lot for your help. And for now, can I test with the cli version?
sure but like i said its just the bare basics
if you need help for the installation you can ask me again in #🤝|tech-support
Okok, we have a call with my friend tonight, we'll install it then I think 😛
Okok no problem, thanks so much!
no problem 🙂
and if you have it installed then we can talk about how to get good images for your project 😄
Nice 🤣 😆
I'm trying to update the Automatic1111 UI. I read online that I should edit my .bat file by adding a "git pull" line between the set and call lines. I did it, but when I try to launch it, it gives this error: fatal: not a git repository (or any of the parent directories): .git
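That error generally means the folder the .bat file runs in was not created with `git clone` (for example it was downloaded as a zip), so there is no `.git` folder for `git pull` to work with. Here's a small Python sketch that checks this the same way git does, by looking for `.git` in the folder and each of its parents. The function name is made up; run it from inside your webui folder:

```python
from pathlib import Path


def find_git_root(start: str):
    """Return the nearest folder at or above `start` that contains .git, or None."""
    folder = Path(start).resolve()
    for candidate in [folder, *folder.parents]:
        if (candidate / ".git").exists():
            return candidate
    return None


root = find_git_root(".")
print(root if root else "not a git repository: `git pull` will fail here")
```

If it finds nothing, the usual way out is a fresh `git clone` of the webui repository, then moving your models folder over.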
BNJ
Thank you
Any way to use prompt tags for random facial expressions?
Hi, new to Stable Diffusion. I'm trying to make an old portrait with 3 people in it look like a photo. Almost got there, then suddenly 1 of the people was removed. How do I keep the work I've done but add the extra person back? Thank you 🙂
help, my generations keep coming out like this for some reason and i have no idea why
i dont think i did anything wrong with the steps
that could be the weight of the lora
or the cfg scale fix
I really need some help. I'm trying to make some characters in stable diffusion but it keeps adding this orange tint to all my characters. Does anyone know what I could do to counter that?
Thats the result I got
See what I mean, so much god damn orange lmaoo
I require some help. I am attempting to generate a stained glass image of a girl, like the first attached image. However, it consistently comes up with the girl next to the stained glass window rather than having her be the stained glass window. Can anyone help?
Current Prompt:"Human girl, Brown hair, Teenager, Brown eyes, Pink hoodie, looking to the right, stained glass "
I mean I know of some good negative prompts that might help with making the image less... Cluttered IG?
here's what I use
"lowres, text, error, cropped, worst quality, low quality, jpeg artifacts, ugly, duplicate, morbid, mutilated, out of frame, extra fingers, mutated hands, poorly drawn hands, poorly drawn face, mutation, deformed, blurry, dehydrated, bad anatomy, bad proportions, extra limbs, cloned face, disfigured, gross proportions, malformed limbs, missing arms, missing legs, extra arms, extra legs, fused fingers, too many fingers, long neck, username, watermark, signature, bad eyes"
I mean you could put something like "Tan" or "Orange Tint" into the negative prompts
you may have already done this but maybe try "stained glass of human girl"
I have not tried that! Thank you for the suggestion!
That seems to have fixed it! Thank you!
Not sure how to go from here. I like the image, but I would like to make the sheep:
Larger (like a boss from a video game)
Larger horns (basically the size of their torso)
More spirally
Malnourished/skinny
But, whenever I add keywords like "large frame, malnourished, skinny," etc. it doesn't really do anything. How could I fix this?
Don't know how effective it would be, but you could try doing keywords such as 'Small', 'Small horns', 'Straight horns', 'Fat', etc.
Or you could take it to Inpaint and mess with it there
Yet again, not sure how well either would work
Still kinda new to SD
but eh, might as well try and help where I can
i have no clue how inpaint works but thats probably the answer
Okay so
send to inpaint
Draw over what you want gone
and put in keywords to tell SD what you want in its place
So like
Here
Give me a sec
Here is our base image
This is what I've drawn
I put in the prompt of "Sun"
And now we wait for the image to generate
These are my settings
Wait
Messed it up
1 sec
Nope
seems I'm just wrong
Trudeau as a Chinese spy
immaculate
i just can not for the life of me get this right. How do I make these goats very skinny?
i want to make it that they are so skinny you can see the ribs
hey folks, just launched UnPrompt to help AI artists! 🎨
UnPrompt lets you search 30m AI generated images + prompts instantly - all free to use and remix! New images are added every day. Would appreciate a quick comment and upvote on producthunt if you find this useful! ⬆️
how should i be adjusting the denoising strength on hires fix? Is lowering it going to help with smaller details, while raising it helps with things like multiple heads?
is there a way i can read through all the prompts listed within a model?
i want to see if there are obscure prompts i would use in CGI that it's not picking up on, so i'm wondering if the prompt even exists for the AI to draw from in the first place. i just want to look at the prompt listing
it seems like everything i prompt turns to shit. New at this and i've downloaded some models but all the images are weird looking
an example
Prompts are not in the model...
I'm feeling the same. I'm just about to finish a training run with 18 images, 3000 steps, learning rate 0.0002, so we'll see if it actually learned anything. I spent like 6 hours going through every text file correcting the BS prompts it came up with. Like I said in general chat, it thinks every girl is carrying a knife LMFAO. Last time I tried just something simple like a cat, it had knives for claws =_= I can't seem to win lol
This is why I want to find a detailed breakdown of all the prompts that a local SD can pull its information from, so I can better direct it with the text documents for the images.
It seems what I'm asking isn't a widely asked question, otherwise I'd imagine people would have put out videos and guides on what prompts are available and on why some prompts get used when to us they are plainly false. Is it because the details can't be seen properly when the images are processed? If so, I'd like to be able to correct it, but with so little info on this subject it's very, very difficult, just scouring the internet looking and hoping for some information.
I need to know which prompts are known and which are not, so I can better choose prompts to fit the images.
Hi, I'm having some trouble getting a character to do something with one hand while holding something in the other, any advice?
Great, now make it so we can filter on stable diffusion
How can I add detail to an image, specifically the clothing? working with an anime character
Also how the heck do I inpaint fingers properly
Where ai
Does anyone know the preferred syntax for 1.5
So it's my first time trying to use Stable Diffusion. If I'm trying to colour an existing painting that only has grey values, how would I go about that?
I'm guessing I first put it to img2img, but then what?
idk if this would be the right channel for this btw, sorry in advance if it isn't
To modify in img2img, you just set the prompt to what you want and then mess around with the variables for a while. I would set original image strength high, and then slowly iterate through new generations, sending the best ones back to img2img.
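If you ever run this loop from a script instead of the UI, keeping the original image strong corresponds to a low denoising strength in the AUTOMATIC1111 web UI. A rough sketch of one pass, assuming the web UI is started with `--api` and the body is POSTed to `/sdapi/v1/img2img`; the field names are my understanding of that API, and the default values are only placeholders:

```python
import base64


def build_img2img_payload(image_bytes: bytes, prompt: str,
                          denoising_strength: float = 0.35) -> dict:
    """Build a request body for one img2img pass over an existing image."""
    if not 0.0 <= denoising_strength <= 1.0:
        raise ValueError("denoising_strength must be between 0 and 1")
    return {
        # The API expects the source image as base64 text.
        "init_images": [base64.b64encode(image_bytes).decode("ascii")],
        "prompt": prompt,
        # Low values stay close to the original, high values repaint more.
        "denoising_strength": denoising_strength,
        "steps": 25,
    }
```

Feeding the best result back in as `image_bytes` for the next call mirrors the "send back to img2img" step described above.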
Could also look into ControlNet, might be better for re-colors, not sure.
There is no such thing as prompts listed within the model. Do you mean what tags the training data was given? If so, you're talking about 5 billion images, not easy to look through.
i'll try thanks!
I'm more confused now. So if the AI is trained on so many images, what I'm curious about is what prompts it has come up with.
I think you're misunderstanding the tech in general. Think of SD like a human artist that has seen a lot of pictures. Just because they can draw a "red cat in a cowboy hat" doesn't mean they've thought of the prompt before.
There's no list of prompts that work, you can prompt anything.
So if the AI doesn't use the correct prompts or anything remotely close, how would one make the AI use the correct prompt instead of what it thinks is correct? Because training so far has yielded few results for me. Then again, I'm still new to this and I don't understand even the basics of the tech, let alone the advanced stuff.
I apologize if I seem like a broken record. I know teaching and explaining things to people who flat out do not understand can be aggravating, especially when it's explained in easy ways and the information doesn't sink in, so if I'm slow on the uptake with the explanations, I apologize.
Oh, okay so I'm not very familiar with training. You're saying that the image descriptions you are giving don't seem to be working?
For making a Lora?
Yeah, it's a mix of the image getting wrong prompts and the prompts I insert being almost completely ignored, even when I put in prompts like this:
a woman running on a beach with a blue top and black shoes on her feet and the sea in the background, 1girl, beach, blurry_foreground, day, denim, depth_of_field, desert, figure, focused, lips, long_hair, looking_at_viewer, ((running)), motion_blur, nose, ocean, outdoors, photo_(medium), photo_background, photorealistic, realistic, red_hair, red_lips, reference_inset, sand, shore, shorts, sky, smile, solo, swimsuit, water
thats what the image prompts should be but instead i get
a woman lying on a beach with a blue top and black shoes on her feet and the sea in the background, 1girl, beach, beach_umbrella, lying, blurry, blurry_background, blurry_foreground, day, denim, depth_of_field, desert, figure, focused, lips, long_hair, looking_at_viewer, running, motion_blur, nose, ocean, outdoors, photo_(medium), photo_background, photo_inset, photorealistic, realistic, red_hair, red_lips, reference_inset, sand, shore, shorts, sky, smile, solo, swimsuit, water
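A quick way to see exactly where two tag lists like these disagree is to compare them as sets. A minimal sketch, using shortened versions of the two captions above; the helper names are made up for illustration:

```python
def normalize(tag: str) -> str:
    """Trim whitespace and strip emphasis wrappers such as ((running))."""
    tag = tag.strip()
    # Only strip when the whole tag is wrapped, so photo_(medium) is left alone.
    while tag.startswith("(") and tag.endswith(")"):
        tag = tag[1:-1].strip()
    return tag


def tag_set(prompt: str) -> set:
    """Split a comma-separated prompt into a set of normalized tags."""
    return {normalize(t) for t in prompt.split(",") if t.strip()}


def diff_tags(wanted: str, got: str):
    """Return (tags only in `got`, tags only in `wanted`), each sorted."""
    wanted_tags, got_tags = tag_set(wanted), tag_set(got)
    return sorted(got_tags - wanted_tags), sorted(wanted_tags - got_tags)


# Shortened versions of the two tag lists above.
wanted = "1girl, beach, ((running)), motion_blur, photo_(medium)"
got = "1girl, beach, beach_umbrella, lying, blurry, running, motion_blur, photo_(medium)"

extra, missing = diff_tags(wanted, got)
print("extra tags:", extra)
print("missing tags:", missing)
```

For the shortened lists the extra tags are `beach_umbrella`, `blurry` and `lying`, and nothing is missing, which matches the complaint that unwanted tags are being added rather than the wanted ones dropped.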
This is where I'm stuck and at a loss, so I thought that maybe a local SD didn't have access to as many prompts as, say, DreamStudio.
Also, I believe it is a LoRA, I'm not sure. I was just trying to do an img2img, but it is being very selective about which prompts it will use, see, and integrate into the image. This is leaving me with a deformed girl like she's straight out of the Dead Space games, or like the mutated human girl in Sons of the Forest with multiple limbs.
When I tried training SD I took a bunch of anime pictures, 19 to be exact, but I used 18 because I was almost completely filling my 11 GB of VRAM, so I know I can't do more than that. My training tab settings are as follows:
Embedding learning rate: 0.0002
Gradient clipping: disabled
Hypernetwork learning rate: not touched
Batch size: 2
Gradient accumulation steps: 9
Log directory: textual_inversion
Prompt template: custom_subject_filewords.txt, with "a photo of" as its only line, going by what the videos show
Width and height: standard sizes
Max steps: 3000
Save an image to log directory every N steps (0 to disable): 50, and 50 for the other section as well
Save images with embedding in PNG chunks
Drop out tags when creating prompts: 0.1
Latent sampling method: deterministic
I've already taken the images, run them all through preprocessing, and manually corrected every single one's prompts, then trained it, but I'm still seeing odd random suggested prompts when adding images to img2img with the embedding name from the training alongside the correct prompts.
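One thing that falls out of those numbers: batch size 2 with 9 gradient accumulation steps means 18 images per weight update, which is the whole dataset. A small sketch of that arithmetic, assuming each counted training step is one optimizer update after accumulation (I have not confirmed that this is how the training tab counts steps):

```python
def training_summary(num_images: int, batch_size: int,
                     grad_accum_steps: int, max_steps: int):
    """Return (images seen per weight update, passes over the full dataset)."""
    images_per_update = batch_size * grad_accum_steps
    passes_over_dataset = max_steps * images_per_update / num_images
    return images_per_update, passes_over_dataset


# Values taken from the settings listed above.
images_per_update, passes = training_summary(
    num_images=18, batch_size=2, grad_accum_steps=9, max_steps=3000
)
print(f"{images_per_update} images per update, {passes:.0f} passes over the dataset")
```

Under that assumption every one of the 3000 steps sees all 18 images, so the embedding goes over the same small set 3000 times.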
anyways 3:15am gonna sleep ill check back in the morning guys
any suggestions for the prompting?
my friend has a dnd character that is fleshless and has ball joints in the arms and yellow eyes. The yellow eyes somehow makes the whole face yellow.
I would try txt2img and describe the character precisely.
Then an important step is to set the resolution to 512x768 for portrait view. That will get better results
! ok
Also add tags like blurry, low resolution, multiple limbs, lowres to the negative prompt
And some quality tags like high quality, masterpiece, 4k to the positive prompt
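Put together, the advice above amounts to a description plus quality tags in the positive prompt, a handful of negative tags, and a 512x768 portrait size. A minimal sketch that just assembles those pieces; the function name is made up, and the example description is paraphrased from the question:

```python
QUALITY_TAGS = ["high quality", "masterpiece", "4k"]
NEGATIVE_TAGS = ["blurry", "low resolution", "multiple limbs", "lowres"]


def build_portrait_settings(description: str) -> dict:
    """Combine a character description with the suggested tags and portrait size."""
    return {
        "prompt": ", ".join([description.strip(), *QUALITY_TAGS]),
        "negative_prompt": ", ".join(NEGATIVE_TAGS),
        "width": 512,
        "height": 768,
    }


settings = build_portrait_settings(
    "fleshless character with ball joints in the arms, yellow eyes"
)
for key, value in settings.items():
    print(f"{key}: {value}")
```

The values can be typed straight into the txt2img tab; nothing here depends on a particular model.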
I enter the description like this "{ "prompt" : "attractive girl", "Model" : "deliberate_v2", "Negative prompt" : "", "width" : 512, "height": 350, "sampler_index": " DDIM", "script_args": ["abg remover"], "steps" : 25 }" into Stable Diffusion, but the background is not removed from the output image. Please help me to leave the output image when entering description with
I want a result like this, but I don't know how to enter the description in code. Can someone help me?
why do you need it as a command?
are you using the auto1111 api?
yes i am programming an app using stable diffusion
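A rough sketch of how that request body might be built from Python, assuming the AUTOMATIC1111 web UI is running with `--api` and the body is POSTed as JSON to `/sdapi/v1/txt2img`. The key names (`negative_prompt`, `override_settings` with `sd_model_checkpoint`, `script_name`) are my understanding of that API and are not confirmed anywhere in this thread. Whether the background remover can be selected through `script_name` at all is an assumption, so check the extension before relying on it.

```python
from typing import Optional


def round_to_multiple_of_8(value: int) -> int:
    """Snap a size to the nearest multiple of 8 (assumed to be what the web UI expects)."""
    return max(8, round(value / 8) * 8)


def build_txt2img_payload(prompt: str, model: str, width: int = 512, height: int = 350,
                          steps: int = 25, sampler: str = "DDIM",
                          negative_prompt: str = "",
                          script_name: Optional[str] = None) -> dict:
    """Build a txt2img request body using the key names the API is assumed to accept."""
    payload = {
        "prompt": prompt,
        "negative_prompt": negative_prompt,  # not "Negative prompt"
        "width": round_to_multiple_of_8(width),
        "height": round_to_multiple_of_8(height),
        "sampler_index": sampler.strip(),  # drops the stray space in " DDIM"
        "steps": steps,
        # Assumed way to pick the checkpoint for a single request.
        "override_settings": {"sd_model_checkpoint": model},
    }
    if script_name is not None:
        # Hypothetical: only works if the extension registers a selectable script.
        payload["script_name"] = script_name
    return payload


payload = build_txt2img_payload("attractive girl", "deliberate_v2", sampler=" DDIM")
print(payload)
```

Compared with the JSON in the question, this renames `Model` and `Negative prompt`, trims the sampler name, and rounds the height of 350 up to 352.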
ahh okay
#resources
#1080946152318443610 here you go 🙂
I went to the original link of the remove-bg extension, but there is no guide for people using it from code. Please help.
Anyone know of a model I could use to generate images in that sort of style? Most models I find online are for portraits. I can't find one for drawings of animals, for example (though that might be too specific), since I'm trying to make something similar to this picture. The model that comes by default with auto1111 doesn't really give me satisfying results, nor do any other models I've tried, though that might be an issue on my end, I'm still very inexperienced.
hey
FAQ: How do I generate images? Is there a bot on the server?
Currently, there is no bot on the server that generates images. However, there are plenty of other ways such as the official https://beta.dreamstudio.ai/ website or by running Stable Diffusion locally using your own hardware! Check out #1080946152318443610 for more details! You can also stop by #1025467151206854736 for any issues you experience while using DreamStudio or #🤝|tech-support for any problems you encounter while installing it locally!
we don't have a bot for prompting on the server
but you can make some on the website in that FAQ I just linked
I can use this one for free?