#💬|general-chat
1 messages · Page 41 of 1
I wanted to do an image of a guy with his tongue stuck on a cold pole...no can do
so then I tried a person licking a sucker..nope. How about just sucker in mouth? nope. Holding a sucker? kind of
I'm having slightly better success with Controlnet but still, the details are all wonky
throwing or kicking a ball - often get a giant soccer ball, or a giant baseball, or multiple balls
or a hat that looks like a baseball
or a soccer ball as a hand
I don't know if sd-webui has the same features like InvokeAI, where you can mask out a specific part and then recreate it. That is still something i am missing or am i wrong?
It is the unified canvas.
a11111111 does that
Hey I'm in the middle of setting up Stable Diffusion for the first time using Automatic1111 as the webinterface while following along a youtube guide: my question is how do custom models work exactly? Do I still first have to install e.g. Stable Diffusion 1.4 before then putting custom models into the folder or are they effectively the same?
Like can i skip downloading 1.4 and just replace that part of the video with for example Realistic Vision 1.3 or does Realistic Vision 1.3 merely tell the base model of SD 1.4 how to behave?
I don't know if you can skip installing the original models but you can try.
yeah i had a hunch itd be like that because otherwise surely the realistic vision page on civitai would specifically mention for what version its made right?
usually they do mention
like to make absolutely sure im at this point right now https://youtu.be/VXEyhM3Djqg?t=486
Realistic Vision is a 1.5 model
so am i good to ONLY drop the realistic vision.safetensor in that folder and then follow the rest of it normally?
but I dont think models use other models. It works on its own
Here is something strange: i get the best results for my needs using a NSFW model. Even i don't create NSFW images. It depends so much...
I think so Good
im a bit spooked about the installation to go off a suspicion
does anyone know for sure before i potentially waste 40 minutes of installing the wrong thing
also @potent fox i think its mainly because it's a lot easier for an AI to figure out human anatomy by training on naked images
even if it clothes them after
Anyone tried both Dreambooth and LORA methods? Wondering if I might have better success generating objects if I try DB method instead
what time would you waste?
main reason why id rather skip 1.5 is because theres seemingly no safetensor version of that one available
it has to download a bunch of stuff first no?
in the video i linked he said it took him 20 minutes on a really good pc
a model based on 1.5 has 1.5 in it
im not sure what its downloading at that point exactly
all of this is rather confusing
yes, but the models are self contained files
I even used 3 different Lora's at the same time.
the models are individual files that go in the models dir
unless you're mixing models, you select one at a time when you generate
if you generate with the vision you mentioned, you aren't using the 1.5 base
hmm alright ill give it a go then
what it downloads are all the dependencies for SD to run and to enable easy update afterwards. You chave the web-ui which is updated nearly daily which loads checkpoint models, which can be based on 1.4, 1.5 or even 2.1. the models are technically the version of SD you are using. There are hundreds of models you can use
You can use all your Lora's in your prompt even without the additional networks settings, which do not show up for me lately.
as far as security goes since you said the web ui updates nearly daily
is that an avenue for someone to inject bad code or is it safe so to speak
it is possible but very unlikely. SD changes fast so the webui needs to be updated
is there a way to force updates off regardless?
don't add git pull in your bat
unless you manually enable updates it wont update
Just use <lora:name of your lora file:weight>
then you update only when you want
even when you update, you can roll back to a previous version
i can set it up first for the install then remove the gitpull right
Your safest route is always use known checkpoint hosts, dont go to shady sites and get safetensor files if possible
I don`t know why SD is connected to the Metaverse. Really no clue at all.
if you want to git pull each time you run, add it to webui-user.bat
https://i.imgur.com/vACZqw9.png this is what you mean right?
No , literally "Git Pull"
git clone is used for the first install. git pull is to update
by hand, open a CMD window in your stable-diffusion-webui dir, in that CMD type git pull
if you dont git pull, the web-ui doesnt update
you can also add it to the BAT so it updates each time you run it
nope, I'll just sit here and watch some of the preeminent pioneers of this technology empty a glock-19 full auto into their feet for no reason at all while the actual technology experiences a meteoric rise in popularity and development around them while they. . uh, do something with the metaverse apparently

lol
the meeeeeeeeeeeeeeeeeeeeeeeeeeta veeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeeerse!
Be sure to have full write access to the folder recursively. I had problems using git pull because of that.
when you enter the metaverse, the part of your brain they remove is given to you in a glass jar
the only time the webui would have to be updated is if a model im using pushes an update right?
no. models and the web-ui are independent
if the person you go out with is taller, you will also have to up date
alright ure getting banned
I think that frank has frankly some kind of issue here...
btw all this stuff installing right now is contained within the AI folder i created right
its not installing somewhere else as well?
the web-ui gets regular updates because improvement, new features etc are happening all the time. SD is moving like lightning and the A1111 webui needs to be updated or people complain or move to another UI
The git pull works fine. You just add it every now and then to the webui-user.bat
ok well naturally it says unable to connect after it gives me the local url
dummy me expecting it to work first try
I also add --no-half to the commandline. For some reasons.
ok but how on mothers green earth do i troubleshoot this now
i have no idea whats going on at any point how am i expected to figure this one out now
You don't need the xformers module. It should work anyways.
ah wait i got an error message below the url
Interrupted with signal 2 in <frame at 0x000001629373B720, file 'C:\AI\stable-diffusion-webui\webui.py', line 173, code wait_on_server>
in the webui-user.bat edit the line with "set PYTHON=" add the path to the python exe
Terminate batch job (Y/N)?
ye i did that already
That's strange. Please verify you installed Python 3.X and the comment of pure_fire.
open a cmd and type "python --version"
a cmd anywhere?
C:\AI\stable-diffusion-webui>python --version
Python 3.10.6
C:\AI\stable-diffusion-webui>
Can you update to python 3.10.9?
Type git --version
git version 2.39.2.windows.1
it does work. you shouldnt have issues
set PYTHON="C:\Users\Steve\AppData\Local\Programs\Python\Python310\python.exe"
try remove the ""
what do i do after i remove those?
relaunch the bat
okk it works
nice
the guy in the video had the " but oh well
Where might i ask about dreambooth?
Strange. It seams the "" are needed when you have spaces in the path. Or maybe they also need to be escaped then.
"" are normally needed for spaces
But i didn't think they make problems if there are no spaces.
Soo... Anyone know where i can ask for some help with dreambooth?
have fun. its incredibly addictive
hmm alright another question as a test i just copied peoples prompts/settings to see if my setup generates the same image
but it does not
is there another thing im missing perhaps
ahh id imagine resolution maybe?
model
Probably the model u are using
to get the same image you need exaclty all the settings right, but even then the GPU you have can change the output depending on how old your GPU is
right, the GPU might not generate the same details
well yeah im just going down the realistic vision civitai page
where people post images they generated with it
Why would anyone create the same image as anyone else anyways? This is not art. Sorry.
what was the original image resolution?
yeah thats waht imeant
i can only imagine resolution as the difference maker right
cuz that one isnt being listed
yeah i was just testing if it works properly
that will make a difference
You can get the same result with a fixed seed, the same model and exactly the same prompt and negative prompt including all the other settings...maybe.
it also doesn't mention if it had restore faces on, hires fix, what for restore faces
what does restore faces do?
it fixes imperfections on the face...sometimes
Only on low resolution generations.
You could give examples in the images channel.
not sure this image is within the rules
What image? You can post it on my server if you dare.
i am generating now
euler is good
DPM2++SDE Karras if i remember correctly.
at 512x512 I got face only
Euler-A is fast and good. But not so high details IMHO
The models are trained on 512x512 images usually, but you can go to higher resolutions using some tricks.
again
right. the image shown with this is seated, down to around the knees
I keep getting face only
there's nothing in the prompt about her pose, the clothes, nothing
Come on. Get it done 😉
is there a difference in generation speed in nvidia vs amd btw?
the prompt is "face of a thai woman with a happy face..."
"a face of a thai woman with a happy face, big beautiful lips, long lashes, extremely beautiful, 4k, 8k, trending on artstation, greg rutkowski"
that's going to make a face
I can't test the prompt since i shut down my rig. But i expected something to show up somewhere 🍭
I dont think that image was generated with that prompt
could be the face, then more added with outpainting
btw Loras are basically just adding certain generation training to the existing model right?
I get just the face every time
Oh my guy, you are getting to it surely. But then, please show me your result!
gonna take a while until i figure out a good balance of the negative prompts
is there like a staple that you should put in all of them
that dont impact generation negatively much at all
I usually always do "low quality, messy"
Okay, it looks like a portrait. Maybe you could outpaint it to include something else, but i guess you would get away better if you used a body shot.
Woohooo, have fun!
you can also try downloading bad prompt
right, but Im using the prompt the creator supposedly used
And what did you expect?
an image similar to that supposedly generated
but looking at the prompt, I don't think that image came from that prompt
Can you show me the image which came from the prompt?
What resolution did you use for generating the prompt?
512 x 768 and 512x512
hello guys !
@vestal dew Amazing. The given prompt is basically worthless and can't produce that output. Except they were using a specifically trained model.
Especially the amount of negative prompts is ridiculous.
used 3090 came in with 2 fans that are nicked, and scratches here and there 
but its a 3090 for sure
A 3090 is for sure a good thing for this.
buy in person.
how much u got it for
Free 3090's for everyone.
NVIDIA should put them out cheaply and evolution gets faster 😉
btw someone mentioned outpainting
where do i find that in automatic1111?
or is outpainting basically just changing aspect ratio and setting it to resize and fill
Good question. i also wonder about that. It is easier in InvokeAI.
But i want to stay within sd-webui.
Extensions tab > Available > press the load button, search for "openOutpaint extension"
you will need to add --api to launch arguments
Our vibrant communities consist of experts, leaders and partners across the globe. They are developing cutting-edge open AI models for Image, Language, Audio, Video, 3D and Biology.. AI by the people, for the people. Learn more here 
where do i find the launch arguments?
ok i checked 3090 gpu z, all numbers are good, now i just gotta run it and see if any issues arise
this speed is enjoyable
still amazes me how the person who used directml to make it run for AMD was able to do it, with the issues that it had
btw how do you increase or reduce the weight of a prompt phrase?
some versions of a1111 also open a launch window
there's a box in there for launch arguments, but you have to press SAVE after adding them
so many people have 3090's
anyone know?
in novelai it used be {{}} to make the prompt phrase have more weight and (()) to reduce it
how does it work in automatic1111
highlight the text, hold Ctrl and press up/down arrows
for some reason outpaint made the canvas and image tiny
I closed, loaded a new UI window, and now it's fine
check the wiki
so if i understand this correctly only a few models are capable of doing outpainting?
Hi there! Where can I find models with safe tensors checked or maintained by Stable Foundation? Also, are there the same for upscalers?
anyone knows how to get controlnet?
What in the world is going on?
since when does this exist?
amazing
I run A11111111, try to generate something. Before generating, all the settings change to some hires fix, and then cuda error
I am new to this. Any resources available that I can use to get started with SD?
Yep! #1072220168534642768 has some installation tutorials and such.
Awesome. Thanks a ton:)
how can there be an ETA for Generate Forever?
bleeding edge code. thats how
It's odd. When I ran it earlier today, it didn't show any ETA. Now it's treating each generate as separate, even though it keeps generating
can someone one help me with this?
I tried changed the width & height to 512x592 but it gave me this error
Image
then, i tried not to change the size of the image i am trying to generate but still gave me this kind of error
can anyone help me out?
RuntimeError: Sizes of tensors must match except in dimension 1. Expected size 148 but got size 74 for tensor number 1 in the list./
Looks to me like a tensor is too small
ControlNet is absolutely crazy. especially with OpenPose. if you haven't messed with it yet, do yourself a favor and give it a try. just mentioning this because someone did the same a few days ago and that's when i learned about it
you using the a1111 webui?
changed, BTW, from bigger or smaller?
I just added the "open pose editor" other extension... my day is gone into this already I think
you'll be delighted to know that microsoft paint 3d provides a lot of easy shapes for building cool compositions off. some fun to be had there
great tip, I hadn't though about it yet
I'm no 3D designer, blender and others are too far from my understanding for now, so a simple tool like that is great
I do know blender and such but they're such a bulldozer when sometimes just a shovel is needed
yep, like starting visual studio to edit an XML file
Michael, that's a no go for here
you've been blocked once, warned once
next I kick
I didn't see, he was already spamming
banned
Instead of shapes, at least for people, what about images of people made in DAZ or Poser or Blender?
paint 3d has lots of other 3d models in a huge community gallery. but yeah others are good too. theyr'e less "accessible" but work just as well
i only throw paint 3d out there because its free for windows users
UM
As I was generating images, I made an edit to one in photoshop, added a tattoo. The next generated image had a similar tattoo in the same place.
Any news on SD 2.2 or 3.0?
3.0 should be 1.5-2 years
lol
Think they’ll be popular? Or do you think people will stick with 1.5
didnt they just announce 3.0 is coming soon/next
Depends on the tools
If they did then they are sucking the pipe so hard. They can see how much of a failure 2.x has been they may have just up and shat the bed to ignore 2.x and jump straight to 3.0. WHICH, by the way, is when 1024x1024 was originally supposed to come.
2.x was definitely a bomb
I have no idea as jumping to 3.- this fast would show they utterly failed with 2.0
they were like almost anti-hype levels of release
Basically admitting their failure. Don't get me wrong because they had to do what they did.
this tweet is what i was referring to
guessing from that 3.0 could be out by end of march/early april
I seen some also say maybe Deep Floyd = 3.0 and thats what hes talking about, but either way something hot and new coming in haha
Yep, looks like this is why he has been so quiet because of the utter disgrace of 2.0 amongst the community. Too bad
Deep Floyd they said originally would be early to mid feb but it kept getting pushed back
as long as the phrase is like 3 words haha
it struggles with like an actual long sentence from what ive seen
Oh, damn. 😦
Anyway if DF is just for text then why would there be an opt out from 3.0?
they had a test bot certain people could use to test it, but its been down for about a week now (DF) so people been wondering if that doesnt mean its release is imminent
DF isnt just for text, its a whole new bigger model
text is just something it can do
but i think controlnet yeets some of its features
Well, I can barely run SD now
yeah people are saying youll need about 16 gig to run DF, but thats just some users guesses
and seen lots say would need a new ui too wont run in auto
If DF is 1024x1024 I can see that, yes
This is why I am due for a new card (every 3 gens) but Nvidia is too greedy for me so I sit here with a 1060 6gb. Next card has to have a min of 16gb or I will not even look at it.
yeah
24 would be ideal to future proof
and even that is probably only a few years future proof haha
3 gens 24gb will be like my 6 or 8gb is now.
If things keep going as they are.
Sad thing is that no matter how great the 4090 is it is not worth the street price for it. 1200. 4080 800. 4070ti 600. 4060 399-449.
Sad thing is AMD is just as bad and Intel can't get its stuff together. I have hopes they can since they are Intel but never good with gpus. Not bad cards for a real first go though and priced right.
Now if they gained marketshare you bet they would be gouging us too.
4090 is only 1200$?
Those prices I mentioned is what the cards should be at. 4090 out the door should be no more 1399.
I am accounting for inflation not their greed.
wow
Does anyone know what the last tweet of Emad can mean? He says "Some fun AI-generated media dropping Sunday... Future is coming fast, time to nap..." Text to video maybe?
well I didn't see it. Now I'm teased
Would gladly share a screenshot of his tweet but it isn't possible to share here
How does selling art generated with dreamlike diffusion model works? I’m a bit confused by the license and copyright of it even after reading them. I thought if you generated it from SD, you would own them, but generating with a specific model card would prevent you from selling / gaining profits?
yep I saw where you shared it
(Im not good at that, I prefer not to answer, I don't want to say stupid things about picture rights...)
it was once explained to me like this: some models used training images that weren't supposed to be used...so if you use a model that did that you can't sell it
i know that's a -really- dumbed down version of the whole explanation, but it explained enough to me
Can someone give me a quick guid on how i can generate images using this server?
I’m totally new here
Is there any curated list by Stable Foundation with safe models?
we have a forum on this discord with lots of models, #1047197565365538826
but what do you mean by safe ? no malware ? try to use models shared as .safetensors, those are made to prevent the risk ckpt can have.
If you use automatic for example, there is an automatic test on unpickle to protect you, and it should, unless you manualy disable it.
Outside of that, there is Hugging face and CivitAI that have implemented a scan of ckpt that are uploaded, and they flag dangerous things too.
https://huggingface.co/
https://civitai.com/
But no, we don't have a curated list like you are asking, those options seemed fiting for now
you can't directly here, there is no bot. lots of infos in #1072220168534642768 and #1072229020520947753 , but I'll also link the current FAQ on this
Welcome ! There is no bot currently to generate your images on discord. You may want to start by taking a look at the #1072220168534642768 channel. You can access Stable diffusion in different ways : 1️⃣ the official website, https://beta.dreamstudio.ai/. The easiest and fastest way to access Stable diffusion with 200 free credits. For any question on it, you can find help in the #1025467151206854736 channel. 2️⃣ Installing Stable diffusion on your computer. There are numerous projects that let you do that, and you will find help in the #🤝|tech-support channel. 3️⃣ Running Stable diffusion in the cloud, through rented GPU services, using notebooks. You can find lots of them shared and discussed over in the #1011228442399883294 channel.
I see, thank you! The automatic you were talking about is the AUTOMATIC1111 SD GUI? If so, are you saying the software already performs a test automatically, without me configuring anything?
if you don't go and add "--disable-safe-unpickle", it does test that the ckpt is safe, yes
I see, thanks a lot!
no problem, happy to see people care for their safety !
yo, so does Auto1111 have a guidance scale option for Img2Img generations?
i switched from cdmr2's easy diffusion to auto1111 recently so i can use LoRAs but it seems its missing the guidance scale feature
the CFG scale is an option but that doesnt really have the same effect on the image as the guidance scale in cdmr2's
it should be the same, the CFG Scale that you found, yes. it's strange that it doesn't act the same way, this is a base torch paramater that isn't really processed more by the tools, it's sent to torch as is through the Stable Diffusion pipeline...
What difference do you see between the two features ? Maybe there is another feature in Auto that covers it too
the guidance scale in cdmr2's easy diffusion changes how close the img2img result is to the image you provide, while the Auto1111 CFG scale adjusts the effect the prompt has on the original image, which sounds similar on paper but if the guidance scale is set to max it'll literally just give you the same image with minor changes, and the CFG scale on 1 just gives you a noisy mess that vaguely resembles the image you provided
easy diffusion has two options for this same thing, one thats prompt strength and the other thats guidance scale
Goof morning, everyone! How are we all today?
Ok, so yes, this is named "denoising" in automatic. the lower it is, the more the prompt is important. The higher it is, the more the input image is important.
CFG Scale is how strong the prompt will influence the output, just like it does in text2img
it's quite sensitive, I mostly keep it between 0.4-0.6
no problem !
this is so useful
hum
wait
I inverted my thoughts
it's the other way around
the way to remember is this : the noise is the input picture
the more you denoise, the further from it you can get
Good
right i see
Awesome! How's the project going?
I gave up on the whole "SD making every chunk of a giant game world" thing and went for a more pragmatic approach
Sometimes to best way to solve problems is to not have them in the first place
It would be difficult ngl
Plus, you game will be more structured, and a more playable experience with more cohesion
So, what is your new approach?
yeah but... SD + GPT ?
It can create game dev documents lmao
🤣
Something worse lol
Hello i'm wondering, is there anyway to merge the content of multiple images in the SD WebUI? I have one image that I really like the artstyle of, and another that I like the content of, and i'm wondering if i can mix the two, because i saw an article of a similar ai doing it but they never had any links or references
What would y'all say is the best general model right now? like say instead of just using basic 1.5 and you want something with that and 'generally' more, is there one? i mainly use unstablephotoreal but it's obviously tilted towards photo or realism, is there any general 'upgrade' all rounder model ? There's a ton of specialisation models but its getting a bit much
Cyvugiigxuxjckck
laughs
What does that m e a n?
🤣
It just means basic character portraits and scrolling backgrounds
For now
There are multiple ways to combine styles. You can do it through training the model yourself (Dreambooth, etc), using LORA, using textual inversion embedding from places like the Library of Concepts, by inpainting and outpainting, etc.
ControlNet looking pretty spicy ngl
I haven't used it much myself tbh LOL
I have so many things I am working on that I just c a n ' t
I might want to train my own ControlNet but it sounds like I'd need 50-100K images and 1-2 weeks of compute
I don't know the exact numbers myself, but like anything, it's a lot of work
hi guys , trying to fine tune the protogen model , where and how do i find the path from huggingface?
if I change the filename of SD 1.5ckpt to model.ckpt webui comes up with an error and says it cant locate model.ckpt. What am I doing wtong?
why are you renaming it
some older tools required to rename the model, but I don't think any of the current UI demands that
you should be good with the first name. Though, renaming it shouldn't pose that problem either
this is strange, you reloaded the list of models in the UI, or it doesn't even start anymore ?
hmm, my Textual inversion embeddings aren't loading anymore
after a recent 1111 update
anyone else ever have this issue?
there is this issue talking about that, opened yesterday :
https://github.com/AUTOMATIC1111/stable-diffusion-webui/issues/8088
I have a 3080
but I guess it's the same issue
mine straight up says on launch that they were skipped
yep, but I'm not sure the other guy diagnosed correctly the origin of the problem, I doubt it's linked to the card type itself
ok so not the same
it just says it skips them
and no why at all
I can't post a screenshot
let's move to #🤝|tech-support
Generally, what's the best sampling method?
Hi,
I am conducting a short survey for the purpose of a college project. I would be grateful if you could fill this short form 🙂
Im renaming it becuase webui seems to look for model.cpky by defaault
My img2img using controlnet with a blurry image used to sharpen the features, but recently It's been outputting images of the same bluriness as my input. I forgot what helped me sharpen the features. Anyone has ideas?
hard to say. i used to jump straight to DPM++ SDE but now i'm checking out 2M and DDIM instead a bunch. i can get great results from low step counts. Though to be honest, i dont really know what the best is.
Karras are mostly identical to their counter parts but do it in less steps. i've never seen a good explanation of what the karras samplers do
hmmm ok
DDIM seems to get the fastest results i've seen around 20 - 25 steps
maybe I need better prompts 💀 my results are kinda messy
Why do AI still suck when it come to make hands, it can make most body parts and even the sexy bits rather okay, but whats up with 12 ginger hands?
ai learns how patterns relate to concepts. arms and faces are more definable as a pattern. fingers usually have a finger next to them, so you end up with 10 fingers on one hand. or fingers are often bent in millions of different shapes, so you get mangled hands
also these models are barely 6 months old. controlnet helps
so many shapes those can take... even the privy bits don't change that much compared to fingers. fingers can cross multiple times, each of the 5 has so many possible possition, and they are always one masking the other, it gets really hard for it to get good.
I tried training multiple hand models, never managed anything good
guys i have the perfect solution for ur hand needs, its called mittens.safetensors
lol yeah, I resorted to never making hands again too
to be fair
almost every artists have struggled with fingers
its part of the journey
could we say we are .... mid .... journey ?
looks for the door
hey guys, quick question - would I be better off getting a 16gb graphics card or would a 12gb suffice for most SD related stuff?
hi is there SD artist here who can do a quick commission, i need to see portfolio to start. realism portrait
if you don't intend to do big finetuning with lots of concepts at once, 12 is currently enough for just about everything. hard to say what the future has in stock for us though
Guys would you say this is an AI generated pic slightly edited in photoshop ? https://twitter.com/ArkhamVideos/status/1629308707572989954
Cause to me it looks hella AI but people say im wrong
I don't think so, maybe a little model training on a basic level - just need something a little beefier than my current 6gb
the artifacts on the left make it hard to tell, but I would say no, it's the official tweeter using the main character models, it would make little sense
training on 12 is possible, even dreambooth
Nah the tweeter is a fanaccount
it's just not the faastest and you can't go high in batch size so the more data you add, the longer it could take to learn.
And look at the weird details, this doesnt look at all like an 2023 preorder bonus picture, harley has a weird hole in the stomache
then yeah, possibly. and the artifacts could be here to help hide it
or it's just a screen capture in a moving video
Ah maybe yes, somethings clearly wrong but the overall quality reminds me very much of cheaply generated AI pics
guys how do you prompt ugly ppl
why would they use ai art?
its obviously them using their low quality assets
hey, how do you install and then use checkpoints?
if its only speed im sacrificing i can live with that, i won't be doing much, thanks for the advice
hey @mild ridge
@fervent thunderhey there Guizmus
it's quite easy usually
you download the model (.ckpt or .safetensors), you put it in your models folder, you reload the dropdown menu at the top left of the UI and BAM, the new model apears there. you can select it and then use it to generate pictures
some precisions
prefer safetensors, they are.... safer ^^
thank you
some models do require doing something more, downloading .yaml file, those are the 2.X models
it's usualy indicated on the model
there are some other things but it's already enough to start 😉 I'll loose you if I continue
thats what microsoft thought for bing but then people are just using it to confirm their extreme political beliefs instead. chat bots might have some unintended social consequences. i think "her" from director spike jonze (sabotage music video), is just the tip of the iceberg of unintended uses
sure i'll ask "are chatbots needed?" and i have concerns about them, but i'm pretty sure we're on rails here and theres no getting off this ride. no sense fighting it. would one be useful here to help onboard people? probably not. it'd get immediately abused and is no more useful than #1072229020520947753
Looks like Karla Ortiz now has a twin in Europe trying to do the same thing: https://nltimes.nl/2023/02/22/mauritshuis-hangs-artwork-created-ai-place-loaned-vermeer
"Artist Eva Toorenent advocates for legislation and rules for what she calls 'unethical technology' with her organization, European Guilt for Artificial Intelligence Regulation (EGAIR). 'While Midjourney makes a lot of money with this software, the artists and creators whose work is involuntarily included in this dataset see nothing in return,' Toorenent said to the Volkskrant. 'Without the work of human artists, this program could not generate any works at all. The higher the quality of art in the dataset, the higher the quality of the AI art."'
"Toorenent called it bizarre that the Mauritshuis singled out an AI creation. 'That is quite something. With this, the museum is actually saying: we think this is okay.'"
Apparently this individual also has her own equivalent of the Concept Art Association that's trying to lobby the EU, called the European Guilt for Artificial Intelligence Regulation (EGAIR)
"Whilst artificial intelligence may be useful for some professions, Toorenent believes that within the art profession, AI should be a no-go."
She might be even more extreme than the Concept Art Association, as she wants a total ban rather than the CAA's goal of a 2% max limit.
European Guilt lul
Hey guys, I have a friend of mine who's a tech guy from Silicon Valley, he would like to chat and interview a few people that are interested in ai generated art, like people who have used stable diffusion.. Anyone interested?
The hands don't look like AI hands. The imperfections in the hands, that is.
She wants kickbacks.
Yaml sounds like a Godzilla monster: Godzilla vs Space Monster Yaml
what
What is the ideal size for a Lora preview image?
heads up not to click on shady links btw
looks like ass
Hey guys, for the same price is getting a used 3090 or a new 4070ti for the same price better?
regarding model training, how should I prepare the source images and what quantity is ideal?
If you can get a 24GB 3090, yes, but otherwise no. The higher VRAM will allow you to process higher max resolution images on txt2img. 4070ti 12GB is fast, and ideally the best price to performance for non-4K gaming right now (best goal for gaming is 1440P on a 144hz monitor ultra settings). I'm saving up for a 4090 for Stable Diffusion so I'll have max speed + the 24GB VRAM.
The things holding me back are the used (no idea what the previous user did) and double sided vram are prone to high temps and higher failure rate. I searched up online and per tomeshardware the 4070ti is faster in processing using xformer than cards besides 4080/90. But I have no experience in this and would be a grief purchase if I need more vram than 12gb down the road.
For gaming, under 4K they are equal. Although 3090 consumes quite a bit more power. (400w vs 250w I think?)
In general, used cards are fine, but if it's the same price then it's not the same price because the used one will be worth more with the warranty.
the new one*
Plus 3000 series cards are old now so there's no real reason to keep supporting that ecosystem. 4000 will be the last of this generation (PCIE 4.0) so you'll be able to ride out the driver updates for many years.
I also read that there were transient spikes on the 3090 which can kill your psu. I don’t know if that’s fixed now or not
I just got a Response from the seller to get the deal on the 3090 for $150 less than what I pay for the 4070ti. Should I take the bite? If so I will return the 4070ti I bought
nah
Just re-sell your 4070ti just before the warranty is up for close to full value and upgrade to a 4090 then.
I see, thanks for the input, hopefully 12gb vram will be sufficient for the next few years
hi
hey ya'll -- i'm about to post a video2video workflow i've been experimenting with on twitter
and i wanted to know which of these two videos i should put in the first tweet to make for a more impactful hook to read the rest of the thread
ah, can't post videos here 🙂
hi
maybe in #🎥|animation you should get feedbacks, and you can post those
guys ive made a good picture but ive used random seed -1
how do i now use that seed to try and fix the image
Thanks! I got some feedback in other communites. Here's the thread if anyone is curious: https://twitter.com/bilawalsidhu/status/1629632110456602631
What video card do you have for video in SD? What is recommended?
Use png info, it'll give you all image info, including seed it generated image with
i got it
I know there is a lot of instruction on building poses, was wondering if there is model that does "more" simple poses rather then the full on OpenPose
Jeez i havent slept
Ffff
Guys if I want to teach somethint like woman looking left, I need to do dreambooth right?
is there an easy way for auto1111 to do the same img2img settings run for 100+ images in my foldeR?
i see gen chat is just questions now huh
Like we need more tutors
Jesus hire more tutors for us
Raaahhh!!!
Hahahah
Anyway I am so tired now
you are willing to pay?
Is any way i can train preexisting model and retrain how i am totally messed
batch
check batch tab
ty
people make video using those batch + control net
Does anyone know any AI to generate music?
Hello
i didn't see those toms benchmarks
https://www.tomshardware.com/news/stable-diffusion-gpu-benchmarks the one i saw. maybe theres more?
i got a 4080 so if you are legit with that information, i'll go hit up my copium dealer rn
E
i think too if a psu died because of a spiked load, that psu was on it's last legs anyways. They really shouldn't be that fragile
4090, 4080, 4070ti faster, 3090 can do more training and larger images because of memory
I was referring to the 24gb vram on the 3090. I already tested on my 4070ti using the exact same settings they used with x formers and got similar results (19-21it/s). But due to the limited vram I’m not sure if it’s worth to change to a 3090 because when training it’s either yes or no
yes but vivy was saying 4070ti beats 4090 and 4080 somehow. makes me curious
I searched up online and per tomeshardware the 4070ti is faster in processing using xformer than cards besides 4080/90 maybe i misunderstood this?
oh yeah okay, you meant its faster than all but those two
mystery solved gang
also, the 4070ti has dlss3 for those sweet free frames
Yea there’s that for gaming. Without a doubt it’s the better buy for that purpose. But since I often hear vram is king in air training this making me curious
If you weren’t to count the subscription part, is Novel ai still worth it? I’ve used control net with the anything v3 model on stable diffusion but the constant unfixable memory issues are making me consider novel ai. Although I’m not sure if it’s nearly as good as stable diffusion with anything v3 and control net or if it will even save storage and work faster.
Has anyone here ever used Novel. Ai?
but more vram for training...so 3090 better than 4070ti there
got my ram in and lost my hot timings as I had finally won the silicon lottery so no more 14,14,14,14 vs 16,18,18,18 but... @vestal dew #🏞|general-with-images message
Can someone point me in the right direction with a post or online resource about a problem? I trained models using photos of myself through a couple different google colabs and when I test them right there in google colab the images generated look like me. When I copy the model into my locally run (on my pc) stable diffusion and select that trained model, the images generated don’t look like me anymore. I’ve looked here and haven’t seen this question asked yet but I’m sure it has. Thank you!
that's a lot of vram
tips on getting a different character into the pose that a character in the 'initial image' is in?
I'm trying to use 'midjourney diffusion' to get a midjourney style
i was thinking of buying specific 14s but i've never really built towards fast timing. just usually go for mhz. what kind of diff do timings make?
good 14 14 14 14 ddr4 dimms that max my alderlake are 2.5x the price i've found
still got me thinking though
Guys how do i even generate a specific anime character
People who use stable diffusion in tiktok are generating detailed anime characters
I use the same model
But whats the prompt
danbooru tags, loras, etc.
danbooru is um, its an anime porn site it turns out. i asked the same question, and it's a huge rabbit hole
butttt, it's well organized and all of the anime models are trained on sets like it
Yk how to like generate a specific anime Character
I legit comment on whats the model and prompt and settings but they won't reply
so you use the booru tag for that character, cause all the images of it are trained with that tag
also loras
https://civitai.com/ get them there
U mean prompt?
you put the tag in the prompt yeh
Which tag
the booru tag
no
I cant comprehend alot about computer stuff only a bit
anyways, i've given you enough breadcrumbs to explore and discover! glhf
So yeah you can get annoyed by me
Aight

Well time spam there comment section again
got a lot of no help, huh?
Hello. Im not sure what it's called. Are there freely available models that are able to preserve face, clothes, body of actors in generated images between different prompts? Preferably for anime. Maybe some prompt helper that adds very verbose descriptions of generated 1boy/1girl
prompt helper is linked above for anime. the booru tag autocomplete extension for web ui
That's just tag aitocomplete. Already using it, very useful but i need something different: personality preservation of my actors
custom actors?
Imagine your actor has blue hair, certain face features, some estimated height etc. I want to show this actor in different situations
Controlnet may do it
same seed, same description, especially if you use some mix of existing characters from used tags like the booru tags
I am no expert on controlnet because I cannot run it
What do ya'll do with your generated images collections?
that link is specific to your topic, not a double dog link to unrelated info
Keep them to help generate future images
I curate out the ones I like, but have amassed so many at this point it's sort of silly...
I do too, and then I'm left with hundreds I like, but none are perfect
I have a Titan RTX
It’s a few years old but has 24gb of vram and should hold me over till a Ti or Titan drops
Prolly not worth upgrading to the 4090 for me. But that’s the card I’d get if I was buying today
curate them eventually
Days when you'd require 24GB VRAM to run Dreambooth 🥲
Hey SD gang ! How is it going this fine time of the day ?
I went for the 4070ti because I read db can be used with 12gb now, but weiui uses 4gb just having it open

At little but on this I spent all night getting it to work. I am blessed as this is a first gen ryzen b450 so is 50/50 all four slots would work. Just the two new chips 16-20-20-38 but adding two more 18-22-22-48.
That first one going from 14 to 18 is a bit big but in the real world not noticeable I have been told, and if it is not by much.
doesn't them both down to a common compatible speed?
this stuff is the limits of my hardware know
dual rank is slower (twice as many chips) than single rank so the single rank has to bow down to the dual rank.
due to more chips timings has to be slower. We are speaking of nanoseconds
adds up though. Funny thing is a 3600mhz CL18 is exactly the same speed as a 3200 CL16
I was just lucky. TBH, 48 gigs I will trade any imperceptible speed loss for.
When I go to bed I get to ram test all night.
i've got 32 and my usage has never had issues. i'll probably just keep that bank size and worry about high performance ssds and timings next
i figure i'm throttling my system performance by 15% with the old dimms i've got in the board
Does anybody know how to generate a specific ANIME CHARACTER
The people who post ai generated anime characters from tiktok wont tell me how they do it
I have the same model yet they idk how they generate a specific anime character
Anime is one of the biggest community around here, you should ask there, you'll get answers easily. Sorry not sure, outside of using a dedicated model or some embeddings for that character.
#1072013871730131004
Personally I prefer ram over pagefiles. I just did a test and bumped up my mhz to 3333 from 3200 using the same timings of the new ones. I will know how it goes when I memtest them tonight.
gl
Yeah, I think the fastest this board can do is 34xx but if I had a second gen ryzen I could get the next bios and do 3600.
It does see it as 3600 and has the xmp for it but this board can't handle that without the newer bios. All good.
Now when I gen and it goes into ram it shouldn't be eating my nvme m.2 anylonger
Please check Project AI....
you can use Waifu Diffusion 1.5 and generate Textual Inversion embeddings
Can i use controlnet with my face module ?
Depends what you mean with your face module, but from what I saw for now, there isn't anywhere where you can't use controlnet in auto, except in upscaling or training
Btw .. i have been using colab sd 1.5
On 1200 steps i get a great model but on 2000 i get bad
13 picture of my face with iphone 13 pro max
And suggest ?
Any *
ho ok. you meant a model you trained
well yes, you can use controlnet on those, no problem for that
as for your question on training itself, there are multiple things to know there
the quantity of steps you need will mostly depend on how many pictures you used, and on the "Learning Rate", those are 2 of the main training parameters here.
after a point, yes, training strats to get worse
1a-6 learning rate
it's called "overtraining", and there are some things you can do to mitigate that, but not a lot
mostly you can try and use more varied pictures of your face. think different lights and backgrounds
second you can add regularisation data, or class data, it's called different things depending where you train
Wheres the best place to train
it's another "concept" than your face, it's complementary data to train on so that the model doesn't go full "tunel vision" on you
Sadly i got only 6 vram on my pc so i use colab
I said "where", I meant "how", what tool you use. even on collab there are multiple options
and it's hard to say what is the best in a generic sense, but given what you are saying, the easiest to get how all this works is the best right now, this can be quite coplicated sometimes. I use TheLastBen's version of dreambooth for that personally
nope another one
but it's about the same
Shivam is really good too
I made most of my models in it
U train on sd 1.5 to ?
mostly yes, having a hard time to get it right on 2.X
depending on the subject I'm training, I can also train on other things
like waifu diffusion is a nice base if you want to train anime
Btw .. more then 10 pictures is needed ?
How many picture for best result ?
Of my face
as many pictures as you can while keeping quality and diversity high.
The way I make a dataset is, I start from an empty folder and I add 1 by one each pics I like from my selection, starting from the best.
Each new pic I add, I must answer myself "what does it add? what is in this one and not in any others ? ". If nothing is added, I don't pick that new one
Same for duplications. Everything that repeats too much gets trained. so for your face it's cool but if it's always the same wall behind ? it gets trained too
10 is enough
for face, I go from 6 to 12
and I train for 100 steps per picture as a base, and add up to 50% more steps
so if you select 10 pics, that would mean training for 1k steps first, start doing checkpoints from there and train 500 more steps
compare the different models it gives and keep the best one
so I’ve been looking into SD a bit, and Automatic 1111 keeps showing up everywhere. What exactly is it?
Its a webui to use Stable-diffusion localy on your PC for creating images
oh
so it uses my pc power to generate images essentially?
hm ok
do I need to train it to make images?
only stable diffusion I’ve used is the one through novel ai
Nope, there are so called model files, these are pretrained
You can download and use them
but if I wanted to I could make my own?
Yes depending on your GPU vram
hm
Well my laptop isn’t too powerful, but I’ve been really wanting to make an ai model I can keep consistent
With 6-8gb vram you can train a lora that can help for consistency of a character
Its a file usable in the webui (its a light model or an Addition to it)
There are a lot of loras for well known Anime Chars for example
is it possible to have a Lora that can make multiple characters?
Yes this is possible
No problem, if you want to try the local webui (automatic1111) i can link you a tutorial and for technical Problems there is #🤝|tech-support
hm
would u say it’s the best one I can use?
or is it more of like a preference thing
There are multiple Webuis, Automatic1111, InvokeAI, SD webui V2, Sygil webui, and Shark.
The most used and the most feature rich of those is Automatic1111,
so i would go with this it also can handle gpus with less vram pretty good.
ok sick, I’d totally love to try it
@signal trail Nice make sure your GPU has more than 2gb vram for it to work and has to be from nvidia, if your on Linux it will work with amd too or Shark.
Here is the tutorial for Automatic1111 for nvidia gpus:
https://m.youtube.com/watch?v=VXEyhM3Djqg
HI guys(these days it means Gils as well) 😜 , I saw some info on the internet, but could not find anything concrete on a API for Cinema4D, using SD for rendering. Anybody know where to look?
anyone tried sd generation and training on a 4090 laptop gpu?
first, you make your text using any other image tool. black text on white background
here is a tutoriel https://www.reddit.com/r/StableDiffusion/comments/119j8qr/clear_text_using_controlnet/
here is another one https://www.reddit.com/r/StableDiffusion/comments/113yg8f/yet_another_controlnet_post_fancy_titles_and_text/
they didn't use the same mode, you can get your text with different effects depending on the mode
this one is a great example too
how do i run run the programm lol?
with the webui-user.bat?
i have already installed it and used it once
can someone controlnet a photo of a rainbow? i wanna see what happens
yooo wus new
Long time no see, how you been? Can you spare a real one £50k?
https://www.youtube.com/watch?v=cVxQmbf3q7Q
@wise stratus have you seen this? apparently the math in Stable Diffusion's noise function is bad
hello, Sunny! I am great
tired, though xD
a lot of exciting research
how about you?
(had a meeting with advisor to finallly release my paper.... he said he didn't understand anything and now I have to rewrite it after a long-ass meeting)
why is it so much harder to explain things than to make them xD
Hey Sunny !
I'm awake. And h u n g r y for breakfast! (Suddenly, all I feel like acting like it's a 90's breakfast commercial. "I AM A KID!" She yells as the nabs the box of Trix.)
HAHAHAH
Nooooo
You know. I have been writing research. And I personally decided that professionalism was never going to be a part of it from the beginning. That idea was straight tossed out the window like the hot garbage it was...cuz ADHD 👏
what HAHAHAHAHA
But if you need help reading, I'd be happy to read
How can I generate things here?
not sure if I understand, you mean professionalism is writing research is hot garbage? That's an unusual take 😛
No, I mean my ability to write professionally in one go is hot garbage from ADHD and I'll die from boredom
probable search the channels or the search bar for instructions on how to use the bot. Not sure if it's active right now, though
unless you mean generating in your machine, then search for install instructions
Hiiiiii
I see, so how do you do it then?
I write pieces, usually. Typically I write bulletins to organize the information and gradually string it together. It looks like a hot mess during the process, but once you thread it together, it works
And if I stop mid sentence because my brain just dies because it decided it's going to work on another sentence, I just do that, and resume when my brain is ready for that sentence
It looks like an apocalyptic landscape while writing. I'll color code or highlight things to keep organized.
You know, I think AI like this will soon lead to an explosion of personalized media and we won't have to wait for companies to produce it anymore
But knowing how capitalism works they'll probably try to lock it down somehow
But what if I really want the Mario Infinity War crossover
I don't know the future. All I can say is that, imho, we can continue to build and do cool things, and show those cool things to other people and innovate and help people
Which, frankly, ya'll be cool
So keep rockin' on ❤️
yeah, funny how every writer has their own quirky little process
I just sit down for 6 hours and write the whole thing continuously
xD
I used to do that, and for my one novel, I am actually re-writing it like this
So, what is your paper about?
oh you have a novel? that's cool.Me too, actually. I'm learning to write novels, so I have a couple exercises.
as in novels I wrote to learn
what's your novel about?
Wdym lock it down?
eh... in general terms.... zero-shot textual inversion.
so like suppose you want to do that yellow bird among white flowers example from last time
you could generate an embedding for a yellow bird and then put it with white flowers and the model wouldn't mix up the colors
Restrict the use of copyrighted material to zero or charge a lot of money to allow it
but you don't need previous images in my method
As in, characters, locations, etc
Like Mickey Mouse
@lofty parcel oh, well im fully on that side then
is there any guide for mixing models?
will send*
ehrm once you accept the friend request if you want 😛
Yup! It's something I have been working on for a long time. I decided to re-write it, somewhat in a non-traditional way, but I think the format works. What's your novel about?
Of allowing people to mess around with copyrighted characters, right?
ty very much
No, protecting the rights of owners of those copyrighted characters
I do have a dream novel to write, but I decided to first write smaller ones to hone my skills. The premise of the first one is kinda cold war meets medieval fantasy? As in, magic has been just discovered and it's super destructive like atomic bombs so there's a tension building and the story revolves a little around espionage
Oh
It's a fantasy, somewhat dark fantasy, I suppose. But not traditional, per se. Like me, very different. Each character has their own issue to overcome, but you get to see their thought patterns, reasons for being, as if you were that person. It's about learning to reconcile with the difficult things in life, especially the unfair things, and choosing to do good anyway. Also, there's many dragons in it, a lot of magic, somewhat of a steampunk setting, and characters of all ages.
Personally I like the part of the Internet where people can make fanart and memes because it would be too much trouble for companies to shut them all down
though it doesn't really read like a suspense thing, but rather like idk a heroic fantasy novel xD and, on the core, it's philosophically about apathy and a lost capacity to dream being restored. (e.g. "why a future?" and that question being resolved as the story progresses)
Right, so what you were working on before?
Also, there's #🏞|general-with-images
I like that. Send it to me please, I'll enjoy it if you write well 😛
Fanart and memes are a different thing
Sounds cool!
yeah but I don't want anyone at the conferences saying I "promoted my paper on social media" and cutting me off xD
Lol, I do
I guess you could argue that once we can make things indistinguishable from official sources we could have some sussy Mickey Mouse propaganda psyop
Ah, yeah, understandable
that' some unusual confidence for a writer haha! usually ppl self-doubt too much
Thats why "we" pursue ways to let some of those nightmare become a reality
Glaze is a start, but not the end
nice out-of-context phrase for an AI dicussion xD
Oh, i meant STOP them from become a reality
suuuuuure.... 🤣
I mean I just want the freedom to make my Among Us Infinity War Super Mario spinoff personally
You can, but you cant showcase it as your own creation and cant profit from it
If you'd walked my path in life, you'd know why. XD I've had to claw my way to get here LMAO. I'm certainly not perfect, but when it comes to certain types of writing (not all), my writing packs a punch because I draw from real life experience
Media will become so saturated that there probably won't be any more profit to be made lest companies lock the tech behind a paywall and copyright holders gain a share of profits
I have plenty of self doubt otherwise, hahaha!
now I got curious about your life story! haha any chance I can get that friend request in?
also so curious to read it now, especially to check if brag matches fact! And also because it sounds good
Thats if this tech gets out of hand from regulators, companies etc
I hope for Adobe to do their job on this
And others too
I don't mind sharing most things with ppl lol
so, anyway, yeah. I've been working on this for quite some time now. It is amazing no one else published the same idea so far, because I'm really late to the party
The thing is, content never has to be published for people to consume it for their own enjoyment
And once that's possible, those companies won't make any money from their own creations if the generator is good enough
That's why they'll want to stop people from getting technology that powerful
Artificial scarcity would be required to maintain the current system
Things do move pretty quickly in this space. But that means there's a lot of room for growth in a lot of directions
AI at some point could become a godlike entity
If this happens then a different system would be needed
A radical change of order
These incentives, combined with safety concerns and the vast compute that companies have, puts the field at high risk of being locked down and permanently closed source, be it by law or sheer inaccessibility of hardware
If AI becomes this powerful it should be used for areas some people wont like it to be involved to
I don't think it requires AGI, I think we're already coming up on it this decade
A longform video synthesizer would be nothing short of incredible, and I don't think we're too far away
I want AI to be used in several areas before AGI anyway
I dont talk about some "fluffy" stuff only tho
I like fluffy stuff
SD can make some very fluffy cats
I can't wait for UniPC sampler to be in AUTO1111
Fluffy cat approval
yes
where i can generate pictures from text
a
Hey Ima
You can check in the #1072220168534642768 and #1072229020520947753 but in 2 words
1/ not on this server directly, no bots
2/ there are multiple websites to use it for free quite a lot
3/ you can install it at home if you have a good enough computer
4/ you can also run it in the cloud if you don't have the computer for it
Welcome ! There is no bot currently to generate your images on discord. You may want to start by taking a look at the #1072220168534642768 channel. You can access Stable diffusion in different ways : 1️⃣ the official website, https://beta.dreamstudio.ai/. The easiest and fastest way to access Stable diffusion with 200 free credits. For any question on it, you can find help in the #1025467151206854736 channel. 2️⃣ Installing Stable diffusion on your computer. There are numerous projects that let you do that, and you will find help in the #🤝|tech-support channel. 3️⃣ Running Stable diffusion in the cloud, through rented GPU services, using notebooks. You can find lots of them shared and discussed over in the #1011228442399883294 channel.
custom models with midjourney quality https://www.youtube.com/watch?v=cVxQmbf3q7Q
Question how do I uninstall SD ? I plan to install it again later under Invoke Ai.
I'm using A1111 currently.
it's as simple as deleting the A1111 folder. If you didn't add options specificaly for it, everything is inside
it's a program that "runs", but doesn't install in the classical sense, not A1111
Okay. Just am not sure unfortunately the store of my life..
I'll start with deleting the folder.
Is it installed into Python ?
should I uninstall that also.
I can confirm to you that it is enough for it to disapear, no need to delete python
plus you need python for invokeAI too I think
Yea.
sorry to bother everyone but where do i put my lora file in stable diffusion
in the LORA folder, I believe it's inside the Models folder by default
Vb-n
Lora folder.
Hello all! I'm starting to use ControlNet and it's cool! But i dont really understand thé difference between canny pose, normal, hed etc. I understand thé difference in the process of creation but dont really understand when WE have to use one of them rather than another. I dont really Saw différence between each results so i dont see how i Can decide one or the other? Have you Seen cases where one IS more useful than the other? If yes, un which situation?
french ? 🙂 damn autocorect :p
It's a hard question, people are still discovering it. We see lots of uses in for example :
- depth for black text on white background, lets you do some real nice texts https://i.redd.it/nud53zng1uja1.jpg
- depth for hands + openpose for body to control entirely the composition https://www.reddit.com/gallery/11c4m4q
- canny for lots of general use
- scribble/fake_scrible to make a drawing into something real https://www.reddit.com/gallery/11bpvvj
But the thing is, there is also that new possibility of using more than 1 controlnet at once, that brings in even more crazy stuff.
It's mostly uncharted territory being explored right now
Hey I had a question regarding prompt creation how exactly do stuff like this work farm, ranch, ((night)), cloudy, what exactly does (( )) do?
(()) means the weight is 2x
it makes it more likely to express that, but it also means less likely to express other prompts
() is an operator that increases attention, weight on that word
https://github.com/AUTOMATIC1111/stable-diffusion-webui/wiki/Features#attentionemphasis
prompt order also matters
Got It thanks I appreciate it I would always see it and never know what it did
if you're using A1111 you can do it like this
(prompt:xx) where xx is the weight
(night:2) makes the weight of the prompt 2
Alright
you can also do [night:day:xx] where xx is the number of steps done with the first one, and then the rest of the steps are done with the second one
so you can mix things
Alright once again thanks I appreciate it
also there are tricks like if you do "blue pants, long hair" vs "long hair, blue pants" - the result will be different
You guys are tons of help. Loving reading all this
you can also mix these
you're on the server with groupies for this tech, it helps
Is the seed value what’s important when you want the same image but want to slightly tweak it?
I count myself in at least
exactly. The seed describes the noise, the base image that will be worked on. Same seed = same pic for all other params the same
if you vary on one, you can check what it does
([fred:bob:12] wearing a blue shirt:1.5) weights the whole thing 1.5x, and then Fred and Bob would be maybe Lora tags you're using to get a certain character, Fred expressed for 12 steps, Bob the rest
resolution also changes the image, as does seed, sampler, and of course model
you may particulary like "Variating seed", that lets you use a secondary seed, and test variation of the first seed
https://github.com/AUTOMATIC1111/stable-diffusion-webui/wiki/Features#variations
you can also mix models in the "checkpoint merger" tab
I just started last week learning all this. It’s been a little overwhelming but I keep telling myself to just go one step at a time, build my skills and knowledge one brick at a time and be patient. It’s been exciting learning
that is the method, yes. This is a very very rich tech, and using SD is the most custum it gets. There are numerous features to understand, and lots of those, you need to take a step back, and think about what it brings to the table. Lots of those are game changing
I hope you keep enjoying it and we keep seeing you curious and sharing your pics around 🙂
I will share for sure. Right now I’m dividing my time learning more about prompts using models I’ve found and then learning to create my own models. I really want to create my own models
it's a really fun part of the thing for sure
but it's best to try and understand the most of image making first, since it will be hard to understand what you do right and wrong when you start training without that
Thanks for the advise
chatgpt works the same way - many don't realize.
You have to consider order, weight of prompts, styles
I dont understand why Tortoise TTS never progressed like this...with different models etc.
do you think it is good practice to change the LORA's file names to their trigger words?
if it works for you, then do it
but how do LORA names make you remember the trigger word when you're using the AI?
UI, I meant
why not put it in a spreadsheet?
Hey, so I've been having trouble rendering stuff with stable diffusion. It loads all the way, but then I just get a blank screen instead of the image. I don't think it's because of too many prompts since I tried it with just 4-5 prompts on both negative and positive. Do you think it's because of my computer not being strong enough or a memory issue?
what are the specs?
2070 super 16gb ram intel i7
I do it with an i7 and a GTX 970
so it isn't your system
Is the 2070 AMD or NVIDIA?
If it's the same as their trigger words, then I remember them!
Nvidia
then it's fine, whatever helps you
your card and system are better than mine, and I can generate
maybe
try it 512x512 without hires fix checked
alright honestly I think that might be the issue cause I've rendered almost like 9 thousand images all high res 😅
but you said you can't render, it just stops
Yeah every now and then It happens at random
like Ill generate lets say 6 perfectly fine and then maybe the next 2 fail or something like that
Usually I just ignore it but since I'm here now I thought I might as well ask
then it renders fine again?
sometimes mine stops working, but then I have to start it again
yup it's weird 6 will render fine then maybe the next 2 fail and then it renders fine
and then once again every now and then the render breaks but then goes back to being fine
I've only had one instance where it just completely doesn't render
Unstable Diffusion discord has a technical support channel
Thanks for the answer and yes my laptop's auto-correction plays tricks on me 😉
So it's interesting and the logo is really top. On the other hand, I personally don't run the AI on my own computer but I use Replicate. So I can't mix several models at the same time. If I use Replicate it's because I have a website and I propose to my users to test the AIs via Replicate's API.
So I tried on Replicate to make a logo, I went on this page : https://replicate.com/jagilley/controlnet-depth2img (Ithink it's the good model?)
And I tried with a text written in black on a white background.
When I launched the modèle I got this error message;
Not enough memory available to process your request. Try reducing the size or number of any file inputs or outputs.
More details
CUDA out of memory. Tried to allocate 3.06 GiB (GPU 0; 14.76 GiB total capacity; 9.11 GiB already allocated; 462.88 MiB free; 12.26 GiB reserved in total by PyTorch) If reserved memory is >> allocated memory try setting max_split_size_mb to avoid fragmentation. See documentation for Memory Management and PYTORCH_CUDA_ALLOC_CONF
Do you know where this can come from?
With other ControlNet templates I get either the same errors or less sharp images... or I can't see the letters.
controlnet uses a secondary model during generation and as such, costs more VRAM. This could be the reason why you run into CUDA errors, this is a classic OOM error. Try reducing the width and/or height to reduce thise happening.
you'll need to increase the virtual machine's vram if you don't want to reduce the dimensions
Hi
How do I diffuse?
hey new people 🙂
It's weird because the image is only 20.7ko and I already made the template work on Replicate with much bigger images ???
Welcome ! There is no bot currently to generate your images on discord. You may want to start by taking a look at the #1072220168534642768 channel. You can access Stable diffusion in different ways : 1️⃣ the official website, https://beta.dreamstudio.ai/. The easiest and fastest way to access Stable diffusion with 200 free credits. For any question on it, you can find help in the #1025467151206854736 channel. 2️⃣ Installing Stable diffusion on your computer. There are numerous projects that let you do that, and you will find help in the #🤝|tech-support channel. 3️⃣ Running Stable diffusion in the cloud, through rented GPU services, using notebooks. You can find lots of them shared and discussed over in the #1011228442399883294 channel.
the image is small, but to make an image in SD, it goes through latent space and loads the model, and this costs VRAM,lots, GBs. Adding Controlnet loads a second model. there is a different one depending on the mode you are using, but all those up the VRAM you need to run this. So it could explicate the difference.
Still though, you have quite enough vram here, if you are in 512x512 it should work smoothly. Do you have xformers running to help with the vram ?
/start
#1072220168534642768 maybe ?
I don't know if you know Replicate?
The main thing is that users put models online (here ControlNetDepth) and then you can use them via an api, because it is exactly Replicate that runs models in the cloud at scale. So it doesn't run on my computer but on the contrary it runs on a Nvidia T4 GPU which should do the trick. On the other hand, maybe the model that has been deposited on Replicate has different settings than the one used by AVEROK ...
well I know of replicate, but I didn't think it would be compatible with controlnet models, since they aren't models in the classic sense, no ckpt, and work in complement of the classic models. Like, you can't just run a controlnet, you need a base model too, right ?
Tbh, this is still quite new, I could have some misconceptions
does anyone have problems opening sd via сolab notebook?
hey guys, how to install on ubuntu OS?
the webUI? Git clone the repo and run the webui.sh script
how do i get to Dreambot-archive role
new emoji!
noice
what sorts of problems? What notebook are you using? Do you have a GPU assigned to you?
it is giving an error at the end 😦
Just another (hopefully helpful) hint from me: if you changed something and then generate the image, generate it once more. For me it oftens changed from worse to perfect then.
can anyone help get me to the dreambot archive roles? I would like to retrieve my images there
Does anyone have gradio live forum or Discord? its offline on google colab
hey guys quick security question: how safe is installing controlnet as it isn't a safetensors file?
Em que sala eu posso criar imagens?
ControlNet is just an extension.
hello everyone, i would like to start with SD but i am so overhelmed with all the information so apologize upfront if this is a silly question. please can someone advice how can i know what is the official most actual version of SD?
lets go to #🤝|tech-support so you can be more specific about the error
Is anyone using an Nvidia tesla card? Are they worth it? Are they more difficult to use in any way?
Considering something like the P40.
no problem, feel free to ask. SD is just a method to generate images. If youre refering to the SD models, then the last version is 2.1 and you need something like the WebUI to work with it. If youre planning then, to gen some imagens, Id install the WebUI, since its the most complete interface to use SD. You can use any model you want with it. Theres also InvokeAI, it used to lack lots of features but its really good these days. Most ppl just use the WebUI tho. You can get it here: https://github.com/AUTOMATIC1111/stable-diffusion-webui
Then follow the instructions for you build. If you get an error or something just ask for help in #🤝|tech-support (but remember to search the wiki and the issues on GitHub first, most of the time the errors are already solved there)... So yeah, its pretty much installing the WebUI, downloading some models from hugginface/civitasai/models-embeddings channel and having fun...
thank you very much for answer and being helpful 🫶
🙏
oh, nah dude that guy who makes the video is a fraud, he just lies constantly
lul
I dont remember the exact name of the dataset used. It was something like 5B something. So yeah, around 5 billion images. Im gonna try to find the article about it
laion 5b
idym
I did! 🧋 
Gggggaaaaaayyyeeeeeee
😎 slurp
Question for everyone. Other than 512X512 sized renders what other sizes have you all had good success with? 512 seems to be the most successful for me
Depends on what resolution is model trained on , most of them mention which resolution works best.
For 1.5 it's mostly 512x512, yea..
Making it wider or higher is inconsistent, but sometimes works...
There might be a trick to try to pull higher resolutions out of model, but eh. who knows.
I did some runs with 1280x700
No idea why this numbers, but it it did okay-ish somewhere around 33% of the times
(maybe I got lucky and it's actually less)
Anyway...depends from model.
Look what author is saying, what people prompting, if there's nothing , you can look at models it made with to see it's resolutions...
That makes sense that it would be mostly based on how the model was created
I finally figured out what I had been doing wrong with the model I trained - novice error / omission in my prompts. Im very happy with what it’s doing now. How do people generally share their work here in this group?
posting links to https://civitai.com/ model?
Yes, it stopped working on me this morning
Thank you
Yeh its luck
You can increase ur luck by training a model on 1024 if u have the vram for it
768 models suck ass for the most part anyways, so until we get a decent model, 512 is the norm which can be stretched consistently to about 678 before it starts doing weird shit
Keep in mind that all outputs are high fidelity, any output will look way better than any jpeg out there when resized 2-4x its original size.
Hey guys, anybody know how to run a publicly accessible SD without using Gradio and their crappy services?

