#💬|general-chat
1 messages · Page 47 of 1
What do you mean by connect? And what is your goal?
a service that can tell if a face looks similair to another face
want to create folders for each similair face
so i can categorize them
Does anyone know if there's a nice open source text2video generator out there? similar to runway gen-1?
Hey guys, I've only took a few days away and now I see all kind of posts with lora tag, what is lora? is it like dreambooth?
dreambooth meets textual inversions. Almost as high quality as a dreambooth training, but can be used on any related model like an embed, small file sizes too
fast and easy to train
ah, I see, thanks a lot for the info.
theres also LoCon, LoHa too
any idea if its possible to do lora using 8gb vram or colab?
i think so yes. it's designed for low vram training
If we make comparisons with dreambooth, do we get almost the same quality or dreambooth still wins?
dreambooth wins but the margin is a lot tighter. also, there's now tools to take two models, and take the difference between those and turn it into a lora
and you can merge loras too
i'm still researching this all too so , take what i say with a grain of salt. i might be misunderstanding some things but its hard to tell. so much to take in
exactly, you miss one week, and you get back and its like a new world here.
2.1 has controlnet models now too
is it any good? I still use 1.5
I tried 2.0 and it gave me really bad result.
and I heard the 2.1 is not that much of a big difference.
if only we had 1.7 : (
2.1 can produce good results as long as you know how to use it
yeah, the workflow is changed.
training is a lot more important, but not enough people are training for it.
So peopple just assume it's bad.
people like to get quick results so it makes sense.
I did make an embedding that can improve results, but it's for digital art specifically
not to mention 2.0 trained with wrong values.
Yeah, I have never used 2.0, I only use 2.1 occasionally.
are there any rumors about a new model?
openclip instead of the old closed off clip models that can't be further refined due to licensing restrictions
so it's a brand new pretrained model on 2 and forward
not rumors. Actual work being done. 3 is getting the ground work laid. datasets are being built
Doubt they will release it to the public.
Im just hoping that 3 has the best of both worlds, the speed and trainability of 2 and the quality of 1
Why not? Stability Foundation is focused on open source.
they didn't release 1.5 for months till it got leaked.
That doesn't mean they will never release it to the public.
Hopefully I'm wrong and you are right.
I see the laoin 5b images are of different sizes. How did SD train with different size images?
I hope so too.
where i can make images in this discord channel ?
you cant anymore
pretty sure
@peak apex
Currently there is no bot on the server that generates images. However, there are plenty of other ways such as the official https://beta.dreamstudio.ai/ website or running Stable Diffusion locally using your own system resources! Check out #1080946152318443610 for more details! You can also stop by #1025467151206854736 for any issues you experience while using the website or #🤝|tech-support for any problems you encounter while installing it locally!
Has anyone tried to sell images they created with stable diffusion?
Many of the commercial services don't sell the ability for NSFW generations because then it opens them up to all the regulations and laws around hosting and producing pornographic images. Look into stable hoard. i don't use it so i dont know my way around it but i've heard they've made it simple to use.
community cloud of stable diffusion hosts that donate their compute power to the hoard for free
So - is there a plan to solve the "hands" problem?
Guys, could anyone help with automatic1111 on collab with controlnet?
does it have to be in the img2img tab? and do you need an input img in the img2img tag, besides the controlnet input img that is
you can download on your pc automatic 1111 stable diffusion and draw nsfw
With controlnet you can generate pretty good hands. There are addons for openpose editor with hands already prepared and you just need to select one and the AI will generate nearly perfect hands
That's the only thing you can do about that. We'll see what the future brings
can somebody give tip how to make good prompt?
Welp... You need to add lots of details in the prompt as well as tags with which the AI will modify the images (lighting, style, etc.). Here is a list I used before I remembered most of them: https://www.reddit.com/r/StableDiffusion/comments/yn2b6j/stable_diffusion_tags_explained_what_tags_exactly/
I used this method but one time I start use detailed negative promt and low detail classic prompt but it makes detailed haotic art
when I use very detailed classic prompt it draw what I want but lovres
Then use high res. fix
does it work with local automatic 1111?
It will make your image twice as large and will add some details
Of course
this is option or extension?
Can't send images here...
ok, I`ll google it
Does anyone here use Deforum to create animations? I'm curious if you have any advice about keeping the prompt subjects in frame. Sometimes the prompt objects kind of disappear and all you see is a solid color or a few areas of color.
I tried it, but can't say I have experience with it
how do you upload a depth image to controlnet in automatic1111 colab? i mean, not uploading a picture and getting the depth map, but uploading a depth map directly
same as uploading the picture to make a depth map, but you just put preprocessor on "none", since you are giving it an "already processed" picture
ffs how it didn't occur to me! Thanks! another quicky, how do i add more models or loras to it? do i need to download the models and upload to the drive or can i just link them somewhere?
you mean more controlnet models ? or more LoRAs/Dreambooth models ?
but in both case it should be that, yeah
you download them, put them in your drive in the right folder
and it will see them on next start
Again, Thanks!
Say, do i have to also upload an image to the img2img input? it doesn't work without it, but with it, it "ruins" the desired output
go into txt2img if you don't want img2img input lol ^^
or maybe I'm missing something
I mean in controlnet... will it still wofk in text2img?
yeah. I mainly use controlnet on txt2img personally
it's different
it doesn't "seed the colors" in
meaning you can have a lot different content in the picture
but it still follows
(can't link pics here)
i see, in img2img, which input "comes first"? if i upload a specific picture of a tree, upload a picture of a man to controlnet and prompt to generate a robot, will it retain the man's shape still?
Nice! i think i remember seeing it on reddit
the init image in img2img will be the initial noise. it will be the initial image that SD will modify, keeping it very close (or less depending on the controlnet settings you use)
same in txt2img, it will let the controlnet image just be a control, a way to push outputs in a given direction, but it won't force the first picture entirely as initial noise
good example there
in that case, the picture of the tree would be the initial noise, it would be the picture you would get if you set it to 0 steps, or denoising to 0
the picture of the man, in controlnet, would depend on the controlnet model you use. but it would basicaly force the output to be in the shape of a man
has anyone figured out why hands are such a problem?
is it really that complex a problem?
and the prompt robot would force the image to create a robot as much as it can.
so in the end, it would morph the pixels of the tree, and try to make a robot in some kind of green color inside the shape of a man
it's quite complex yeah. lots of intersections, positions, angles to see them from, ... and detail level changes a lot in the sources.
controlnet makes a great tool for that though, it makes great hands using depth map
Thanks Guizmus.
so img2img input comes first, and then the prompt and controlnet comes together and push the model into your wanted output
So I tried using controlnet on hugface and ... it was bad. it was like SD1.5 quality. How finicky is controlnet?
there are quite a lot of details. you can go very far in it. the main tips I could give after some hours on it :
1/ go into the settings tab, and push "controlnet quantity" higher. you can have depth + canny + normal map at once for example, and it brings more fidelity
2/ don't hesitate to use use an inpaint mask with it too, so that some part of the picture don't get modified at all.
3/ try to play with the "threshold values" on canny mode, and the "preview annotator result". This will give you an idea of the details that were picked up by controlnet, and that will be forced on your output. you can change those values for more or less fidelity to get better quality
let me show you a hands example in #🏞|general-with-images
ok
do you ever make 1.5 embeds?
not that much, no. I did some text inversion long time ago, but mostly felt in love with fine tuning full models. I work on a bigger dataset on my side, but LoRA don't seem to lend the quality I want, compared to classic dreambooth
i was curious if you knew if training 1.5 embeds you need to use the standard model
i was trying to make an embed using a realisticvision model since thats what im using and it comes out all funky
Im currently working on a larger whole model myself, but ive always liked embeddings the most. How do you get through all the hard dataset making and captioning? Im finding it hard to get motivated to do it. I only have 100 images so far (already captioned) and probably need around 2-400 more.
where do you even make loras in auto?
dreambooth extension I am pretty sure
oh right
I always have used a base model since I noticed that training on other models always came out bad.
ok so I guess you cant train on anything but the base model then
I thought maybe I was doing something wrong with a vae or god knows what else haha
It's best to train on the base model and it will still work well on other models.
My embeddings are all mostly pretty good.
fair enough. i was just getting into this embeds a bit. inspired by those reddit posts of that guy putting himself in historical scenes. doing that with myself. but i thought since i used a realistic vision model an embed of myself would be best from that model too but i guess not lol
Train it on the base model and use it on the realisticvision model
what do you do for your text file? do you label them to match the photos or just give a file name like 'chaz is standing by the ocean' or just 'chaz(1), chaz(2), chaz(3), etc' ?
Hi all - I have a specific application and not sure where to start. In Architecture, we frequently create construction documents in packages that are not suited to rendering. However clients frequently ask for renderings, and much time is wasted converting black and white axio drawings into colorful renderings to please the client. Are there any good SD applications that would keep the general structure of a drawing, but be able to change the style to a rendered look? Or even something to feed a 3d model into and then have it supply some textures that are generally popular in the architectural renderings category of internet images.
I have never trained a subject, only styles, which is a little different. So in case I were to tell you something wrong, I suggest looking up a tutorial that covers training a subject.
for sure, let me find you some examples
hey, thanks for "showing love" but with messages such as "1" and "2", you are mostly pinging users you never interacted with. Welcome around but please participate and not just play with that bot that way.
controlnet not working...after giving command...showing this in command prompts..
controlnet not working...after giving command...showing this in command prompts.. Loading model: control_sd15_canny [fef5e48e]███████████████████████████████████████████| 16/16 [00:04<00:00, 5.74it/s]
Loaded state_dict from [D:\stable-diffusion-webui\extensions\sd-webui-controlnet\models\control_sd15_canny.pth]
ControlNet model control_sd15_canny [fef5e48e] loaded.
Loading preprocessor: canny
0%| | 0/16 [00:01<?, ?it/s]
Error completing request
Arguments: ('task(4eqh3zlvc0ora2o)', 0, 'robot doll in a futuristic city', '', [], <PIL.Image.Image image mode=RGBA size=512x512 at 0x28199DDF880>, None, None, None, None, None, None, 20, 0, 4, 0, 1, False, False, 1, 1, 7, 1.5, 0.75, -1.0, -1.0, 0, 0, 0, False, 512, 512, 0, 0, 32, 0, '', '', '', [], 0, True, True, <scripts.external_code.ControlNetUnit object at 0x0000028199E1FBE0>, False, False, False, False, '1:1,1:2,1:2', '0:0,0:0,0:1', '0.2,0.8,0.8', 150, 0.2, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, '<ul>\n<li><code>CFG Scale</code>
couldn,t add the whole response
An update on the copyright situation, involving suspected systematic removal of images. Many of the reposted images with altered logos bearing resemblance to a particular TV show franchise survived after I removed the logos. However, there are still occasional, random, but systematic removal of some of my posts after that.
At first I thought it was because the name of the show, or a name that resembles the name of that show appeared in the write up/title of post. So I added dots(.) and hyphens (-) in between the letters to avoid my post being spotted and removed/reported. It worked for a day or two. Then in a day or two I post new parodies, and they get reported/removed despite the altered title/text. So today I try to remove any text or title related to the show. The only way for anyone to know the reference to the show is if they are familiar with the main character. Yet again, today, 1 of my post gets removed within about 5 minutes of posting on instagram. There was never an official complaint sent to me, and I have no idea on why some of my post get targeted.
The posts I made a few days ago bare an altered title in the write up, contained character images that look very close to the main character of the show, and they survived. But today, I made several attempts to downplay the resemblance, yet somehow the new post get struck in 5 minutes or so, its like they have a bot that remembers the images in the post, and strikes at them whenever I repost it, despite me trying to change all kinds of words to avoid detection. Yet the old post that bare greater resemblance survive all along.
I tested more today by posting a second post with a set of images that also bare noticeable resemblance to that show, but started off posting it with a write up that bare no mention to the show. Despite having a recognizable character, that post stayed on all the hashtag spaces and did not get reported/struck down.
I am worried that there is a bot looking for posts that textually mention anything close to the show, and reports them. And that instagram remembers the associated images, so that even if I re-word the post, they will just simply strike it down again.
But I am unsure as to how they find and gauge what to report/strike down. My older post bare more resemblance, with both text/images close to the show. Yet they survived for days now. I am just worried that they will keep tightening the noose, such that any new material I post, so long as there is even a slight hint of resemblance, will be reported/struck down.
If they keep doing that on instagram. I'd consider using twitter for this set of images/stories involving this franchise. I am suspicious of a systematic crack down of a.i. art related to this franchise since I've seen all kinds of super-hero a.i. art surviving, but I've seen no one post this franchise (maybe there were, but were similarly quickly struck down by the intellectual property owner). This is a shame that the franchise cannot even allow fanart made in a.i. art form. There are drawings of those heroes on devianart and they were never censored like this.
i keep getting "AttributeError: 'Options' object has no attribute 'sd_lora'" when using Lora
how can i fix it?
getting controlnet error like this Loading preprocessor: canny
0%| | 0/20 [00:01<?, ?it/s]
Error completing request
just found out about invoke
seems so convenient
woulda saved me a lot of time and effort
hmm whats invoke?
no, it's another tool, that isn't as popular as automatic
it has some really great things going too though
less. but when it is, it brings features that rocks. but yeah, that's the big problem of it
cool. ill have to check it out
it's very solid, and continues to work, it has a lag behind though
more "designer" oriented
let me show a "speedrun" using it lol
its a webui but also just like the whole entire process
really not the same approach
Hello
it does all the suffering of python being unwilling to cooperate in a one click installer
quite awesome imo
Does anyone know how a 1070 would perform in SD?
Got loads of GPU choices, want to spend as little as possible lol
yeah i see that auto installer. very nice
soooo pleasant
gonna check it out when i move over to my rig later
see, invoke has a thing for that too, it can check if you meet the requirements (which id imagine you do)
tho idk how many vram you have
i think minimum recommended is like 4 GB
it runs on my 1050ti so uhh
1070 is 8GB, the best choice i have around this price range is a 1080TI (11GB) or a Titan X Pascal which would be 12GB
idk
The question is, how large of a render would this cap out at?
i would love to render in 1080p lol
i dont think theres a size limit on any gpu, its more of a question of how long itd take
correct me if im wrong tho
doesnt going larger than 512 just make the model funky? like it starts stretrching more the bigger you go with image size
yeah
but yeah basically these models are designed to work in 512x generally and dont really know how to deal with larger resolutions, so your best bet is generating one at 512 or at absolute most 1024 and then upscaling
Would that fuck my Vram?
which i think some webuis do automatically with a checkbox
or use a 2.x model you have more grace upscaling since its 768 base
Ah, so instead of natively rendering at 1080p you upscale?
yeah
Interesting because ive seen some really high end renders that just cannot be possibly upscaled
i can never find a good 2.x model i really like. maybe i just am liking the faster speed still with 1.5 models lol
generally makes for better results that way
So are we thinking 8GB of VRAM is enough?
Got a guy with a 1080 and pretty ready to purchase
id think so
i have never tried anything over 4 tho so uhh
im not the best guy to ask
sd got super lobotomized when it updated to 2.0 or whatever right?
because they uhh
removed all nsfw from the dataset i think? which means way less images of humans which means way less knowledge of anatomy which means way worse humans on generated images
i heard smth like that
like for legal reasons either all nsfw or all images of children had to go, you cant have both in the same AI
you can uhh
guess why
so they removed all nsfw right before 2.0 is what ive heard
which is fair on their part tbh
yeah I think thats what happened
2.x stuff just seems a struggle to me to get good results
hopefully the next official model brings the wow factor back
unlikely but hope dies last
both stable and dalle 2 seemed to be the big news makers last year, then they both just faded to the midjourney updates i think
i was looking at dalle2 the other day, first time in months, i assumed theyd have updated it somehow, but it seemed the same
midjourney more like uhhh mid journey
like its mid
like its pretty uhh average like its alright but like yknow like
yeah im not paying for it myself, but they at least keep pushing out models that wow the general public
and supposedly they have their next update about to release
main reason im using sd is because you can put all sorts of things in it
yeah for sure
like theres models and hypernetworks and embeddings and i still have no clue what a lora is but theres that as well i guess
thats what makes it so cool for me
lora seems like just another type of embed to me
only you can control the weighst easier with a slider
yeah i mean it probably does what a hypernet does just in a different way
anyways its late gn guys tomorrow ill check out invoke and like idk make a big chungus hypernet or smth
What are decent system requirements to run SD?
to run minimum, 4GB
to run comfortably, 8GB
to have the full experience with some training in it, 12GB
to have the mega full experience with the motherload training or very fast/big pictures, anything above
depends a lot on your price range
if you have access to 16GB and upwards, it gets quite fast too
like I make 8 pictures of 640x512 in 10 seconds, on 24GB
Ok, and you are talking about VRAM or RAM?
VRAM
you don't do anything on RAM, almost, in SD
there are some small benefits of more RAM, but not on the picture generation themselves
And If I buy a server, can I see how much of a VRAM it has? Because normally there are only RAM shown
there are other requirements that I could go on about, like:
processor speed, disk speed, for faster loading of SD. not that useful in picture generation
processor speed and ram for swapping models faster/keeping more things in cache
you will need to check for that yes, but people tend to refer to it more as "renting GPU" around here
I do it at home so I'm not sure
would it be possible to use RAM to generate images thatd be funny i think
you should be able to run it on vram though
first by removing from VRAM both process that take the most usually : windows and discord
then colab
tbh colab is really good right now
fair enough thats what i been doing so far
it's enough to train or make pictures for 2 hours per day for free
I get it's not unlimited
but it's quite the offer still
VRAM is part of RAM, correct?
VRAM is part of your GPU
nope, it's another type of RAM that is directly inside the GPU
most people that I see that do big training/have big requirements and do it online (and I mean 100+GB VRAM), do it through notebooks, either on google colab or other online services that rent GPU that way
if you intend to buy a server for your home for that though, go more on the A100 road
look at the #1011228477954998273 thread though
there is a very good comparison pinned
I will check it out.. Thank you for detailed answers, very much appreciated
not only do you need a gpu, but go team green (nvidia) who has a monopoly on cuda and rtx. I have not looked at amd for over a decade because of those 2 things being important in games/simulation. I don't like the monopoly but thats how it is now
If you just blindly buy a GPU and wanna save a buck and go team red amd, you will shoot yourself in the foot in the world of ai art (so far)
yeah i just go with nvidia for years now
no complaints either. all the cards ive had have been great
Hey, im new to stable diffusion and dont understand some things. When downloading Citivai models and using a LORA model, there are training images given. Do I need to download another software to use these training images, or is it possible in stable diffusion itself?
its all in the files you downloaded
I meant do I have to train it? I have the files, Im just not sure how to train the AI
training is already done. you just use the files
select the model and prompt away
add the lora to the prompt and it uses that as well
@delicate oxide
i just watched ntts vid
got it, thanks
Hi, can someone tell me where are the bots to generate images?
How to see how much of VRAM does google colab provide? using !nvidia-smi I can't see VRAM
When I go to resources it says
GPU RAM
5.8 / 15.0 GB
Does it mean that I'm using 5.8 and available is 15 GB?
And does this mean that I can run SD on a server with 8 GB of VRAM easily?
we don't have bots anymore
yea but you are limited on time.
It can give you 6 hours daily ... or less, depending on how busy it is and how often you use it.
More often you use it - less time it'll give you...my min was 2.5h or so I think
You never know when it'll kick you off ¯_(ツ)_/¯
that's why I bought all the stuff for a new system, so I could do more of this on my computer. Net system won't work, of course, so $2K wasted. But I tried.
Hey, so I was watching a video about what Textual inversion is and how it works, and I have an issue : https://i.imgur.com/t6aR44n.png
I don't understand what the 'Text embedding' in this schema represents(I have a little idea, which it is the float numbers representing something about some images, likely rgb/posX/posY)
But more so I don't understand where it comes from. To me I thought this thing was inside a model, and not something from outside plugged onto it
(I'm a total newbie at this)
How do I make a model that produces 512x768 generations well?
Do I use instance images with those dimensions?
My guess is yes
I don't see anyone talking about the latest samplers, such as uni_pc_bh2
Hello guys please when trying to render the video output by uncheking "skip video for run all" on Google collab, I got this error "ValueError : max() arg is an empty sequence"
Someone has a solution to this please? Thanks
Hello, help please, trying to create a image , and it's works only with CPU but i have a GPU 3060 and this Is not recognized, somebody have the same problem ( l'm working in Ubuntu).
Thanks, i try with this.👍
I pray it works
hi i'm new to SD and wonder if i could get some advice on where i should be placing the models off Civitai example i.e Type Checkpoint merge vs LORA
do i put all Lora files in Models > Lora & Checkpoints in Model>Stable-diffusion
Checkpoints (.ckpt), .Safetensor(bigger than 2gb) going into models/stable-diffusion
Lora files go into models/lora
Hi! I am the developer of Sci-Fi Diffusion. I have been away from the AI and Stable Diffusion scene for a few months now because all of the hate I got made me a bit sad (not from this community obv).
However I want to return back to it. What happened while I was gone? What major tools/software are new? I heard of stuff like ControlNET. Also any major update on finetuning/training models?
lmao. the haters have multiplied. But advances in other fields, like voice and text generation, but with SD, multi-controlnet, and better video consistency
Oh hi I think I remember you. To be honest I didn't get sad, Just that haters' logic made my brain malfunction and I got tired of trying to explain to them in a calm manner lol.
Anyway gotta get back to this stuff
and also Sci-Fi Diffusion 2.0
the latest video butteryness has a tweet calling it theft with 100k likes, my brain is having a hard time with it too lol
Twitter? Nah that place is good. good for sorting out people whom should not be interacted with 
but a 13b gpt model that can fit into 8GB that is looking to be able to perform like chatgpt is being worked on, and the open assistant has some preliminary releases, high level gpt bots won't be closed source for very long
Lets go FR
Also I love how SD is opening the gap with MJ now LOL
I told people that this being Open Source is a force no corporate funding can cope with
Sure MJ is more user friendly
but the vastness of SD
I love it
OpenSource is the best!
yea, I just used the MJ bot recently and it was ok, but just compared to the stable horde bot there are so few options beyond prompting to manipulate your generation
yea lots of good work being put in, it used to be like 5 options and no models
Im thinking of bringing AI to engineering when I get enough funding or knowledge (or both)
So I can make my OpenRC project
OpenRC ?
like openTX?
Ahhh
Like Voron Design (very cool stuff look it up if interested with 3D Printers)
Basically what they do is make 3D printer designs that are open source, and suprisingly its the best printer out there. Since its open source and not made by a company or anything, any shortcoming or problem is solved really fast and there are billions of mods/customizations
I want to apply the same concept, but for making RC vehicles, from toy vehicles for kids to learn to professional rc vehicles
Oh right my bad i meant RC everything
Anyway, thanks for the chat guys, Good day to all of you! Its nice to be back
right on 🙂
To answer your Question, yes ControlNet is the big thing. Together with openpose editor and posex, and depth lib you have the power to get nearly any Pose you like
Loras are trainable by the Kohya tool.
How to get stable diffusion to understand a particular color? Lets say i want a certain shade of pink and I want it to understand the color using hexcode... can we do it?
I'm guessing it is possible, but not sure it's been done before. If I wanted a specific shade of a color, I might try and train a LoRA or embedding on JUST that color. I have no idea how this would turn out. Definitely not feasible to do this for every hex x)
definitively in the category of "possible but damn near impossible" right now. The few people that needed that for now did use another tool instead. With the good prompt, you sometime manage to get a flat color, on the most used pantomes, but no consistency
hello
Hey Ryan !
does anyone know why sd does not recognize the Annotator i give to it
it understands normal picture poses
but doesnt work with open pose editor
you'll better ask in #🤝|tech-support so you can post pictures of the problem, it must be just a wrong setting or something like that ?
ok thanks
You need to select None as Preprocessor
does anyone has a good video or documentation about the install and prepare of stable diffusion web ui
Sure here is a good tutorial:
https://m.youtube.com/watch?v=VXEyhM3Djqg
Here is the sd webui repo:
https://github.com/AUTOMATIC1111/stable-diffusion-webui
thx mate
For help feel free to ask in #🤝|tech-support
al right
What do you mean limited on time? If I buy a server shouldn't I have VRAM available all the time?
whats the command prompt to rename an entire folder of files to name_numbersequence, Where name is whatever file title I want and numbersequence is generated to go sequentially from 1 to as many files as there are
nevermind found an easy solution
^that's the perfect code question for chatgpt. Super accurate with basic requests like that. Saves me a lot of time.
Not sure about paid options , refer to docs in this case
You are referring to google colab?
yea
Oh I was thinking about some server provider other than google colab..
And one question
I'm making a bot that generates images based on SD.. And I have a problem, I was testing it on google colab, but seems to be blocking, like I can't generate 3-4 images at the time with different prompts.. Do I have to use something to parallel those things so it can generate more prompts at the same time?
oh, I have something for you
It can generate 8 different prompts at the same time
on TPU, not GPU tho
I never tested it and idk if I will be able ever, since I'm banned on kaggle now lol
Erasing concepts from ready diffusion models - https://github.com/rohitgandikota/erasing
Paper - https://arxiv.org/abs/2303.07345v1
And how many different prompts could/ve bot on this server generate?
It generates images , you make prompts , but 8 still max , since it's 8 tpu units or whatever they are called
idk if you can put more into one thing
Oh ok thanks for the help
Does anybody knows if stable diffusion bot had a limit of number of prompts at the same time?
oi
hi, what is the min gpu to run this software, i am using gtx 960
the addon for blender says no gpu required, can it utilize an amd gpu or is it cpu only?
You need 4GB VRAM minimum for the WebUI spins.
2GB definitely won't render 512x512
Does anyone know if automatic1111 supports multi GPU render? (Not sli)
i am planning to upgrade the gpu, is it ok for 3060ti, or i can wait for the 4060ti because i need new gpu to play games soon as well
3060ti would probably be the ideal budget option, as it has such a high VRAM pool
Ideally you would look for something like a 2080ti, or 3070ti. Not the newer stuff.
Makes more sense to buy used
Well yeah you need more ram to actually make the images but the better models will do it faste
Toms hardware did a benchmark tests for stable diffusion please don’t mind my voice input
so no need to have a powerful cpu?
This is my build here, I only use an i5 cpu https://pcpartpicker.com/list/DV8LRv
I'm happy with the 4070ti, although there is a lot of hate for it out there. But it cranks out images very swiftly and I haven't felt like I'm missing out on anything yet due to only 12gb vram
My opinion may change on that if I get more involved with model training, or if some vastly superior open source software hits that is more vram hungry
Very high end i5 though lol, beats every i9 before it
While generating images, task manager shows that CPU usage is < 10%
the CPU has no bearing on Stable Diffusion
how much ram when used?
https://twitter.com/dannypostmaa/status/1635630263148355585
Did anyone see this?
Its a damn interesting output.... any idea as to how this was done?
Where do I type my prompts, can someone help me please?
You could probably get similar results inpainting with t2I style transfer plus depth/canny
Looks like inpainting
with controlnet perhaps
Thank you so much for your information
I have a gtx 1660 and I was thinking of upgrding to a rtx 3050 is it worth it?
ummhh....its not.....cuz the flow of the fabric is there! How does one achieve that!!
canny tolerance could be set to ignore the wrinkles while maintaining the shape. depth could have a lower weight allowing some variation in that regard
...and t2i style transfer
I believe would have similar results, but might not be what he did he could have some method thats less convoluted
Got it... do you suggest any paper to articles to see the implementation in a bit detail?
Saw this today, don't know if its been discussed, but seems like this is another misguided attempt to 'reverse engineer' source images from an AI tool generated image: https://www.stableattribution.com/
There's tutorial videos on YT how to use controlnet, and t2i style transfer
cool ui, stupid tho
@regal snow .... yes i have tried controlnet.... but I dont think it will solve the issue of putting on the fabric....
do you think it should? If yes, then please suggest what should i try in that....
Controlnet will trap the generation, so with that active and you inpaint her chest and arms, the resulting image should remain constrained to that shape
Then just include another controlnet with the t2i style transfer.
It should force that style to be transfered into the inpainted area
the arms would have to be a second round of inpainting
but how to you crack a)the image to have the same fabric b)to have the flow of the fabric(cuz the tshirt flatlay has different flow and the person wearing it has different)
to answer that I'll just be repeating what I already said
- canny - 2) depth (0.5 strength) - 3) t2i style transfer --- should have similar results
okay @regal snow .... i will try that...
Howdy! Working on a Photoshop plugin. Made our first little demo video: https://www.youtube.com/watch?v=o-cY66Ent6s (not sure where I should post this...)
isnt there already a photoshop plugin
Yes
There are a few... there's an official one with limited functionality... there's one you can use with a local server that is a bit hard to use... trying to make something that Photoshop users will like
the demonstration is nice
Thanks
how to install xformers on windows?
Just edit the webui-user.bat and add the cmd argument --xformers
well,
https://github.com/AUTOMATIC1111/stable-diffusion-webui/wiki/Xformers
So long as you have a Pascal, Turing, Ampere, Lovelace or Hopper card. Need to check your GPU specs to see
if not then I believe you need to build xformers from source.
will it work with nvidia 3090?
yes
I was reading that xformers will make generations no longer "deterministic" (perfectly reproducable) - but I haven't experienced that yet. I've only had deterministic results as far as I can tell.
it is deterministic
it's not hardware agnostic
meaning it will give different results on different computers
but the same results on the same computer with the same settings
Ahhhhhhhhhhhhhh very interesting
and even the small differences are usually very minor
the language isn't very clear in the wiki on that
like, when I reproduce an example of a sample on civitai, I don't see the differences
but yep, I have experienced it with trying to help other users on very specific details sometimes
it's also why it's harder to install than other things : it needs to be compiled locally, or a precompiled wheel corresponding to your environment needs to be downloaded
I tried both ways, and if you have the wheel (like automatic does) it's a lot easier and faster ^^
all I needed for my 750TI was to add --xformers. I was pleasantly surprised.
yep, it's quite automated in automatic (lol)
I discovered the pain when I was trying to install dreambooth by hand, in the early days
apparently, I got into this at just the right time. automatic1111s web ui has great low-mem support, basically everything just works. And thetn unipc is added the other day, cutting my generation time in half.
I'm trying the new toy in town right now, comfyUI
great potential too
node based UI
Command line option: --lowvram to make it work on GPUs with less than 3GB vram (enabled automatically on GPUs with low vram)
wooo. yeah, looking at it on github now
can't really say more, I used it for 10 minutes for now ^^
but the video sold me on it
just seem to need some adaptation time maybe
are there models available that are better at creating textures and patterns?
I m not able to use the dreambots, how to get access in those channel ??
@solid kite
Currently there is no bot on the server that generates images. However, there are plenty of other ways such as the official https://beta.dreamstudio.ai/ website or running Stable Diffusion locally using your own system resources! Check out #1080946152318443610 for more details! You can also stop by #1025467151206854736 for any issues you experience while using the website or #🤝|tech-support for any problems you encounter while installing it locally!
Jae can I ask you something about dreambot? Implementation wise
Sure, although I'm not the right person to ask since I didn't work on it myself
So I'm trying to make a bot similar to dreambot.. And I have a question about it.. When I use prompt it usually takes 35-45 seconds to generate an art.. But when I use 2-3 different prompts at the same time then I can see that GPU is working for each prompt a bit of a time, so that ends up around 120 seconds to generate each art.. Can that be managed to finish it sooner (back to originally 35-45 seconds) or not? (I'm using google colab with 6GB of VRAM)
I think "comfy" is the wrong name for something that's making me giggle like a horde of over-caffeinated ferrets
Now I'm waiting for a webui for an txt2ai that will take a text prompt to set up the nodes in comfyui to generate an image.
#1011228667659178055 or #🤝|tech-support would give you a better answer here but you probably need more vram or to not use batch processing
It does feel they are a little lying on the "comfy" prefix
I'll dig deeper when i have time. But I'm sure we can share node configuration templates or something
I'm sure gonna make myself a library of useful modules
Ok thank you for you answer.. And one more thing, if I don't use batch processing that will end up something like this:
- user gets image in 40 seconds
- user gets image in 80 seconds
- user gets image in 120 seconds
Something like this, correct? Since it will be in queue
yeah since you're limited by compute
Understood, thank you for you time an help, very much appreciated
As soon as i saw that comfy ui was nodebased, i got excited by that but, my immediate thought was " A lot of people will not be comfortable with this"
Another UI paradigm i thought could benefit these workflows a lot is the editor for Winamp's Advanced Visualization Studio
What's the maximum number of words/tokens accepted by SD?
Typically 75 tokens. If you're using an alternate webui like Auto1111, you can do multiple batches of 75 tokens at once and it will evenly distribute weights between the batches, effectively allowing infinite tokens.
in which channel to use the AI?
@fervent thunder
Currently, there is no bot on the server that generates images. However, there are plenty of other ways such as the official https://beta.dreamstudio.ai/ website or by running Stable Diffusion locally using your own hardware! Check out #1080946152318443610 for more details! You can also stop by #1025467151206854736 for any issues you experience while using DreamStudio or #🤝|tech-support for any problems you encounter while installing it locally!

Thanks. So a picture paints 75 words huh?
give or take, yep
With A1's ui there's also the BREAK keyword that isn't commonly known about. it deals with that batching you're talking about. Instead of automatically cutting the batch at the 75 token marker, you can tell it where to end one batch and start the next, giving you better control over the attention of each batch before it's averaged together.
Ok
Yeah, downloaded Automatic1111
yes
it goes in the models/stablediffusion dir
Automatic1111\stable-diffusion-webui\models\Stable-diffusion On mine, automatic1111 is the main dir
Ok sir
Ok
Thank you
I don't know what emaonly is for that model
Oh ok
ckpt and safetensors are the same saved in a different way
not sure if that explains adding models
that's about updating a1111
Ok
Who wants separate channels for contest stuff and actual product announcements?
so you are from the stable diffusion server!
guys where i need to put orangemix.vae.pt ?
In automatic1111?
yes
Models > VAE
have you seen the comfy tutorial? It's a visual novel. 
ok thanks
didn't get the time to really yet
but soon
I'm getting to the end of my day
it doesnt change tho output images i didnt understand in the settings is at automatic
wuick question, using controlnet, are there models (depth, canny, etc) which are smaller in size? all i see is several gb size for them
I am looking for someone who can show me how to take real life photos and make them into visual novel style backgrounds with controlnet or without. This is a paid gig.
I;... I just started it
and even if you said it
it had me surprised
lol is it like this the whole tutoriel ?
are there hidden feature to be unlocked by seducing the teacher or something ?
this is even just very instructive to explain how Stable Diffusion works
Thank you! so, dumb question, these are safetensors, like Loras, but do i just put them in the regular controlnet models folder?
yes, CN models folder
safetensor is safer to use than ckpt, but essentially the same thing in use
it was cool seeing ren'py used educationally--at the same it made me laugh
the author definitely has some opinions. 👀
Pls post in #🤝|tech-support
ok sorry'
Np but there you get more help
Hi
so now i can't generate images from discord?
1.5 if you want to delve deeper into UI / extension customization. 2.1 if you're playing for fun.
I watched a tutorial that said the weighted value of LoRA additional networks shouldn't go over 1 even if it's the sum of multiple LoRAs. Why? I tried it and I didn't see much of an issue, I know that if that weight value goes too high pictures start to look weird
Now? That's been for like 2 months, Chico.
whats comfy ui?
anyone know if its possible to get SD to work with tiff files?
or dpx
any professional image format
it's least comfy ui you can use 😄
But with it you can generate different things on different areas of image
can't you just convert it?
Ok am I going crazy or was the "member" role that makes everyone's name green darkened?
when writing captions for a subject in dreambooth, do you describe it using the name it will be referred to (ex. ukj) or an actual description of the character? (ex. a young man)
Hi I'm new
Me too
typing...
A node based UI for stable diffusion
Deliberate is an EPIC model. any more like this?
I am by far an expert, and have a lot of unanswered questions myself, but from what I understand, the captions (as in the side-car text files) should include very detailed descriptions of everything in the image that you want the ability to change through prompting. In other words, what doesn't get described, becomes un-editable through prompting. So for a single subject, you might consider describing everything in an image (background, lighting, camera angle, surrounding details, etc) except key features of the subjects standard look, since you want those to come through in every generation.
Fam, please correct me off I'm off base here.
Hey guys, I'm not sure what terms to include in a google search for this... how do I add a picture of someone to use as a reference for a character, and have the engine reference that picture everytime I use the name, say, Goliath, in my prompt? Sorry if this is a very newbie question.
You'll need to train a model or an embedding. Do you have SD installed locally?
I do
I highly recommend this guys channel. He covers almost everything you need to know to get started. For training, I'd recommend starting with a Textual Inversion first. They are fast to train and might help build your foundation of knowledge for how training works. After that, you might consider training a LoRa or DREAMBOOTH, which are a little more advanced and time consuming.
ok thank you I will check that video!
@knotty turtle Do you know if it would be accurate or not if I gave 4 headshots in one image and then prompted "Group of men standing together"
accurate in terms of representing those people accurately
Can someone point me to where it explains prompt value I.E masterpiece:1.x Still learning and not sure what this is actually called
Thank you soo much I will read now

that's a timeout
😬
he's welcome to come back but he didn't answer to my warning yesterday, and just posts the same thing today with more data to contact him
at least interact with the community and go on about things related to AI
this had no link
like, linkedin exists...
Oh, for sure. I block people who do that. It's like, c'mon
@vast ingot it's a throwaway / spam account. I can p much guarantee that pfp is fake and the guy is just looking to scam some kids.
Whatsapp is extremely sus, just another red flag
most likely yes. problem is, it's not really possible to be sure in all cases. I try to keep everyone to the same rules, and it's rare we jump the gun on a ban, but since it's 2 times the same in 2 days, I'll check if I upgrade to ban with the team
You... you mispelled the word sentence, in a sentence about upgrading a sentence, and then upgraded your sentence by correcting the spelling of sentence.

that's a sentenception
You may have already tried this while I was away, but I don't recommend trying to train multiple individuals in one model/embedding (if that's what you were meaning). It is possible to generate an image with multiple people, but is much more advanced and involves training each person as a LoRa on each of the people and then using a multi layered approach with a plugin called ControlNet and the open pose editor
You can train multiple ppl in one lora
Oh, you can?! I had no idea
h3ll0 :3
lora can function like dreambooth. you can finetune as many concepts as you'd like at once. you'll just need to adjust setting for the quality not to go down too much, test a lot your datasets to find the right mix for every concept to be train and learned at the same time.
to be fair though, if you go to those extends in efforts, you should do a dreambooth instead imo
Dreambooth is good if u aren't planning to share ur shit right
But if u are then Lora should be better I feel
depends on the bandwidth and for what
a good dreambooth is always nice for a future mix
mix mixed with more mix tends to lower the quality accross the board, starting from trained checkpoints a new mix can be best
but yeah, LoRA have their merits for sure
like, in Auto, you cannot use multiple checkpoints at once
but you can compose a prompt with multiple LoRA
comfyUI solves that though, you can make pictures from multiple models at once now
I always felt like training Lora = training dreambooth
Since u can just merge loras into ur model anyway
it is very close but no cookie for me. I find some difference in quality, and I can't stand that, I prefer to go dreambooth, but yeah, it's because I can with lots of disk space and bandwidth
What I like about loras is u can mix on the fly
I usually do like 3-4 loras at 0.25 str each
they do train quite fast too
they are the current best compromise in quality/speed/size for sure, and quite versatile
but still, I would advice mostly 1 thing to every trainer out there (no pokemon)
keep your datasets, keep them in the highest possible quality, and tag there.
Do crop and resize, but keep the originals.
You never know what training tomorrow will ask
what do you mean ? full finetune in lora ?
full finetune is just a nice word for dreambooth with captions on a large dataset
so if you put enough layers on your lora to retain enough data
it should work
Like can I resume training someone else's lora
I'm not sure I see any function for that in the official repository
but
like you said, you can merge a lora into a checkpoint
and train on that checkpoint as if you were continuing the training the person did
as long as you merge the LoRA on the same model it was trained on
but I may have just missed the option
plus there are 3 different LoRAs implementations currently at least
When y'all are captaining for an individual subject, do you describe the key features of that individual or leave them out?
I've seen YT'ers describe doing it both ways, and am not sure which is best.
It's my understanding that leaving those key features out will make them more malleable in your outputs, but maybe that's not ideal if you're trying to create a recognizable character.
For example, I'm trying to make a model for myself. I'm bald and have a beard. Should I mention bald and beard in every caption, or leave that out
Anyone played with the MidJourney 4 model from Hugginface into SD 1.5
Leave it out
Is dreamstudio Stable Diffusion v1.5 the same as runwayml/stable-diffusion-v1-5 from hugging face?
https://stable-diffusion-ui.github.io/. is this a legit website to download stable diffusion for mac
Hello world
Yes but its not for Mac.
Try either:
https://diffusionbee.com/
Or Automatic1111:
https://github.com/AUTOMATIC1111/stable-diffusion-webui/wiki/Installation-on-Apple-Silicon
Does this work with automatic 1111?
@vast ingot my main impression of comfy so far: it is more efficient in both RAM and VRAM usage (than auto1111's). I can generate images and use other 3D apps at the same time with comfy on my wee 750TI
I think it's really the future, but it needs lots of setup for a given project, and currently, the UI can take some time for you to wire correctly and put the good values everywhere.
but the versatility will be worth the effort
What's your guys' workflow for captioning large datasets?
Do you use a specific app or interface?
I'm brute forcing this like a pleb opening each file individually in a text editor and a Windows image previewer and ... maaan, it's a slog
My workflow is to download someone elses and click the ❤️ button (:
First pass using an auto caption tool, but not always.
Then i use bulk rename utility, a software that lets you alter multiple file's name at once in lots of ways
Thank you. Are you using this utility to somehow create/edit captions in your txt files? Or just for renaming files? Or am I just misunderstanding how captioning works? Haha.
I've been using BLIP and deepbooru to get things started and then opening each text file to finetune descriptions.
Was hoping to find a utility for streamlining the process of opening each text and image file for comparison and editing
It depends on your training tool and options. Mine uses the files names, but there is also an option to caption inside a txt file, yes. I prefer to just have the caption as the name of the picture
Not sure how to mass edit text files easily
I know in notepad++ you can search and replace text in all files in a folder though
Using regular expression and what not if needed
I was thinking notepad++ or something like atom
Interesting. Okay. I wasn't trying to mass edit the text files, just hoping for a utility that would look at both the training image and it's associated text file at the same time to save me a few seconds from opening and closing hundreds of files and risking me accidentally opening the wrong two files and miss-captioning the wrong image (as I've already accidentally done - heh). But maybe captioning just the filename would work better/faster than the text file way. I wasn't aware I could do that.
Do you use A1111 to do that? I don't see the option to caption inside the file name
What extension do you use specifically so i can check ? I don't train on automatic, i use EveryDream. A dedicated training tool for dreambooth based finetuning
I'm actually not sure. Heh. I thought it was just built into Auto, but maybe it's part of the DreamBooth ext...
For me, the main tab is "Train", then "Preprocess images"
there is some training built in automatic yes
text inversion and hypernetworks
the preprocess is about the same, let me start it, I'll check
I was actually just looking at the every dream tool. Someone on a different server told me about it, but I have absolutely no clue how to install or use it. I'm doing some research on that now. I'm not a programmer so this stuff is a bit out of my depth
windows/nvidia ?
Yes. Both
1/ you need to have python 3.10.X in your path
check that by typing "python --version" in any console
2/ git clone https://github.com/victorchall/EveryDream2trainer cd EveryDream2trainer windows_setup.cmd
that's about it
the doc is quite complete
I have python at this location but it still says
you should check in #🤝|tech-support , this is strange if you are sure the path is correct. but also I see a " just before the C: that may not fit here
Thank you for the suggestions. I'm running Python 3.10.9
if you're interested in seing the different types of training there is, we updates the #1080946152318443610 recently
I also made a little guide on it https://trainingsd-methods.carrd.co/
Wonderful. Thank you very much
it stays very superficial, it's an introduction into looking more in the method you choose
but I also try to define most concepts in the terminology
Sweet. Understood. I've done a considerable amount of research on the different methods and am pretty set on using dreambooth for optimal quality. And I've done some trainings with basic settings and automated captioning. The results have been pretty good, but I'd like to dig in deeper to understand the best way to train through proper captioning. I'm still very new to this process. Right now I'm just trying to develop a workflow for speedily captioning large data sets
this tool suite, associated with everydream, is quite sweet for catpionning
it's the full suite, not the trainer itself
it lets you prepare the datasets
If some are interested, 2 stability.ai employees working on DeepFloyd are participating in wandb fully connected today, talking about WandB and DeepFloyd + txt2Img
Hi, i am new to stable diffusion. I have questions about text2img and controlnet image generation. Is there people can help me? I am Turkish so if i contact with a Turkish person, i express myself better. Can those who can help write me from PM? Thanks in advance.
I can try and answer your questions in simple english here if you want
but I can't talk turkish
Thanks. then I try to explain my thoughts simply.
I want to replace the characters in a illustration I found on a site like Deviantart with other characters I want. But which extensions are good for this, which models would be better? I don't know exactly what settings I need to make.
Anime characters
ok
so you want to keep your background
and change the characters
only the faces, or the whole person ?
Yes actually i keep the pose of the characters, and change the characters
Whole person i think
controlnet is your solution here
I will need to show some pictures
and we cannot in this channel
let's move to #1011634831467221033
Ok
comfy can probably do that
Is EveryDream 2 a newer version of EveryDream or a different utility all together?
https://github.com/victorchall/EveryDream2trainer
there are 3 tools
EveryDream : the suite of tools for dataset preparation
EveryDreamTrainer : the first version of everydream, modifying the official dreambooth script
EveryDream2Trainer : the second version, a complete rewrite of original code
there is not a lot of reasons to use EveryDreamTrainer anymore imo
Oh, geez. And there I was already out of my depth. Might be time for me to head back to shore. lol
Okay, thanks for that. I'll have to do some more reading
it's a complex thing ^^ I've been in it quite a lot, you can also ask if you'd like
we can move to #🔧|finetune too if needed
Thank you. Really appreciate that. I'll see what kind of trouble I can get into on my own, but might bother you later if (when) I get caught in the weeds
hello guys , i am learning more about stable diffusion and i was asking where can i find the PNGInfo script ?
inside stable diffusion files
yeah was adjusted since we have other roles with a similar color!
I see, cool!
is 59.46s/it good?
Depends on your situation but no, not so much if just making a picture
how to make it better?
you should go into #🤝|tech-support , they may have some diagnosis to try
I'm not sure without more context
mk
59.46 it/s?!
my current image is generating at like 4
granted, it's a big size 😄
if it's s/it it's bad, but it/s would be good. you said it both ways
is there a wiki where i can read what exactly is the difference between hypernetworks, embeddings and LoRA ?
If I'm training an embedding and I close stable diffusion then come back, will it continue where it left off?
Anyone using AUTO1111, I'm trying to find a away to get around the 2000px limit on img2img
I’ve got a request for anyone here, if they’re willing to help me with something for a minute-
I’m trying to get the img2img alternative script running with the 2.1 model, but it seems to fail halfway through.
Can anyone else try running a 2.1 model with the img2img alternative script? I’d love to know if the errors are just isolated to me or if the script is broken for everyone
You can't see the error in your CMD and look it up on google?
It's actually not an error. It just stops halfway through the process
first the image gets re-noised into latent noise, and that part seems to work, and then it doesn't de-noise it. It seems to fail on the first iteration so I get returned an image that's one big wash of color
and the console stops and closes at that time, or goes back like everything is all right ?
just all ok according to it
can you show us the settings in the UI ?
yeah
it may be something in there that prevents steps to happen for any reason
I didn't use that script yet tbh
are pictures allowed in this channel?
no, let's move to #🤝|tech-support
hopping over there!
Sometimes i wish i was still on top of the coding game and was sharp still. I'd do stuff to a1's to make it use controlnet for hires fix. i should start migrating to comfyui where the nodes are
hi
So my wife was scrolling thru Tiktok and saw these really cool semi-animated images that at first sight seem to partly been made using Stable Diffusion. I'm sure the creator used some kind of extensions to further enhance the images. I could be totally wrong, but if anyone here understands the creation of these kinds of semi-animated pics, I would love an overview of how to create them. The user who posted this on TT is: at vblank_digital . Thks
animations are typically polished out in another app like blender or da vinci. SD just generates all the detail needed to create them. Depthmaps are combined with the image to produce interesting results
a lot of it is still artistic hand and skill, rather than star trek "computer make me an animation!" level yet
This is good to know! I appreciate u sharing your insights.
We have a channel dedicated to #🎥|animation as well ^^
could anyone send me a tutorial on how to install a model from huggingface
search what you want
download the safetenser/chkpt file
put it in webui/models/stable-diffusion folder
ty
Do you make embeds?
Hello everyone! I asked my friend what AI he uses and he gave me a link to this server. please explain what and how is done here.
lmao this Zephyr dude is a chatgpt bot
I can't speak English well, so I'm using translator.
i have but i'm no expert. i've just wung it everytime. used automatic1111''s mostly and once stable tuner. training i've mostly been reading about lately. might abandon embeds for now while i learn loras and locons
Does anyone have experience training embeds? I'm doing it locally on stable diffusion UI and it's... A) Taking a long long time, B) not giving me the most accurate portrayals of the person I'm training it on.
Is there a better/easier way?
how many images are you using? im also trying to get better at it so i dont have a lot of advice really
Trying to decide if you should name the actual images with a prompt or just all the same name like 'chaz sitting in a cafe' and 'chaz standing in front of a cathedral' or just 'chaz(1), chaz(2), chaz(3) etc'
I am using 5
I tried to run it overnight... but it seems to have just stopped running by itself
at image #138~ instead of 3000
i found if my text file and image names didnt match up it could stop but idk
They match
instead of putting the name I'm iterating on gCro I put a desciptor of the image though
For example, instead of "gCro sitting on a chair", I put, "a man sitting on a chair"
Send me anything you want
browsing loras on civitai
wow this artists style looks cool let me check their actual real art proceeds to witness the most spine chillingly detailed and realistic genshin impact porn humanity has by far managed to concieve
i hate it here
Hello everyone. Can anyone help me extract an "image url" from the stability_api objects generated from the following call? How can I get the URL that contains the generated image?
answers = stability_api.generate(prompt="expansive landscape rolling greens with blue daisies and weeping willow trees under a blue alien sky, artstation, masterful, ghibli",steps=30,cfg_scale=8.0,width=512,height=512,samples=1,sampler=generation.SAMPLER_K_DPMPP_2M)
There's a community of lucky ⚡ 4090 ⚡ owners hidden away on discord where we talk about all things SD and AI. If you'd like to be part of it feel free to reach out 😄
hey all, Ive been messing around with controlnet for the last couple of days, using the sd21 models from last week... So i think this is awesome, but im having some issues weighting the controlnet and find im consistently having to push models I like up to way higher CFG scales to get the result I want, does this seem correct? If I have a model and config that generates images I really like, or consistently cool stuff, and then try and use a controlnet on it, it stops doing what I liked about it, and I have to fiddle with config scale values (like going from 7-8.5 usually to 11-15) and samplers to find something that works. Anyway, I find when googling this it seems to be lacking any real discussion or have yet to find others talking about it, so it makes me think maybe something is weird my side. Any thoughts/knowledge appreciated, or if you've seen this addressed/discussed anywhere id love to know, otherwise ill assume im doing something incorrect 🙂
Don't use too many images, make sure the are all quite varied in terms of background and don't use the Webui, use python and train the embedding manually
is there a way to use loras on invoke btw?
cant seem to find the button on the interface
how many is too many?
try less than 10 images, you really don't need many
set COMMANDLINE_ARGS=--vae-path "models\Stable-diffusion\vae-ft-mse-840000-ema-pruned.ckpt" i ve this in .bat file but i get this error Couldn't find VAE named vae-ft-mse-840000-ema-pruned; using None instead file is in place it is 326.851 kb , why i got this error ?
damn. I guess Midjourney 5 dropped
I'm so desperate for some new models I may open the wallet lol
what's new with mj v5? I am seeing similar things to v4
when v3 went to v4, it was a wow moment.
SD will always have the nsfw edge i guess lol
no idea. I just read a bit ago it came out today
I am seeing the rating party images here, but nothing that the current crop of SD models can't do. especially the newer 2.1 stuff
too pricey for me still, Im just anxious for a new model like I said lol
maybe its just easier prompting yet again...as the article said
it is already pretty easy
that steam punk woman on the above link look less gaunt/edgy than the usual midjourney v4 retro-fashion model look that i usually see. more in line with the stuff i get on like rmada's stuff on 2.1
one thing i hope sd can solve is multiple faces in a scene. Unless you inpaint to fix each one, there are always messed up faces in a crowd. at least midjourney v4 show recognizable faces, even though they usually look like a bunch of creeps to me for some reason lol.
here is a v5 portrait by someone. Looks like something that 2.1 classic neg and other analog model for 2.1 can already do comfortably
https://www.reddit.com/r/midjourney/comments/11s1udj/v5_almost_getting_text_almost_lmao/ another one here. the text is better though lol.
https://www.reddit.com/r/midjourney/comments/11sd9sb/sexy_sax_hands/ the fingers and hands are not bad though. tutting putin lol.
hey guys, is it possible to leverage the power of 2 separate computers for SD generations?
yea, you can generate more images with 2 separate computers...but if you are thinking of doing an SLI type thing, I don't know if it is possible.
Do you know how? 
Oh nvm yeah I was thinking SLI type of thing
https://www.instagram.com/reel/CpLK2p7qOSG/?igshid=YmMyMTA2M2Y%3D How hard would it be to do a video like this with stable diffusion and a custom trained model?
does anyone know what section the image in #🏞|general-with-images is
Should work but it’ll be slow, and you’ll be restricted to minimum resolutions
anyone know how long it’ll take to upload anything v4.5 to google colab?
trying it out now, this is definitely not ready for prime time
on a random image, 6 HOURS
and I have a pretty beefy pc
6 hours?
someone else did his in less than 60 mins with high settings
i didnt try it myself yet
high is just too much glazing, very visible for the most part
the important part is the render quality
in terms of the glazing, the best option was medium
but even then, for images with white it becomes VERY obvious up until the lowest options
true
but the artwork from Ortiz was glazed beautifully
they merged so nice together
exactly this reminds me a lot of the iphone planet photo, perfect color and palette selection to hide the downsides of using GLAZE
^^
another artwork i looked at being glazed was almost looking like someone ate too much mushrooms
the background was white tho
really hoping for StabilityAI to release a competing model to GPT3/4 sometime soon
they dont have anything with such features tho
as of now
plus i doubt they will have any strong enough benefits to convince people to switch from OpenAI to them
somehow like Krita vs Photoshop
although, Stability AI has ofc much more ressources than Krita
a big one would be having a lot more transparency about what its trained on
people would value that
having unrestricted and actually open (🙃) AI would be an instant sell to a lot of people, myself included
sure. but it is a way to differentiate themselves
and its the whole reason why stable diffusion became so popular and displaced DALL-E
you can prompt whatever you want. not what we say you can prompt
for as much as Emad talks about the problems with GPT3/4 I would be surprised if he had no plans on doing anything about it
theres huge potential there for stability AI
SD is psychic
earlier today in the midst of generating totally different images, it made an image of a crashed motorcycle
hi
a few hours later, neighbor's son was working on his motorcycle, riding it to test, crashed it
ambulance came
What fool would pay for GPT 4 ?
people are paying for it just to goof around trying to get it to write silly things
Hi
Can you find any error messages throughout?
I prefer my technology to be a black box which is in held captive in a walled garden, humanity is simply not ready to make it's own decisions
your right
only the few should make our decisions, it just feels right
Are you on Windows?
How are you running it?
extension
you shouldn't need to, you can just load them in the webui
I'm making a comic but its not marvel style
@fervent thunder your working on comic I was think of doing the same
Have you made any before?
Never
First time, I posted an image in the general with images
It will probably be bad, but its more of an experiment, the whole thing is done with A.I
you didn't do the words with AI
Yes I did
how do you get words?
Not image but text
oh
I used gimp for the speech bubble and words
also, the characters look different in each panel
Ye that’s to be expected
They will look slightly diff I will try to improve it later
I made a text embed and I tried dreambooth a while back haven’t made a Lora yet
Doing dream booth in the webui was terrible last time I checked I ended up using python
DAVID.G9
just downloaded stable diffusion, took me an hour to set it up 💀 my lord i have a lot to learn
Did you do it with youtube tutorials or with automatic1111 docs?
There's nothing really to learn, just download them (you can do it there - https://civitai.com/)
And drop into \stable-diffusion-webui\models\Stable-diffusion folder
oh yea and change model inside webui
Ic thank you
/chat
Damn, I really want glaze to be open sourced...
hello
hey there and welcome
I have some questions about stable diffusion...
well, this seems like the good server 🙂
do tell us 🙂
it's not really rush hour yet though ^^ quite calm for now
so, I became really interested about it and already downloaded it couple of times. I have an amd card so I thought it would be a good ideas to download it on linux
it was arch linux
I think it's how you got to do it on amd yes
when I tried to download it on Ubuntu, it didn't really work for some reason
and then I started searching in the internet
damn, too bad it's the simplest to set up
I know, I really tried everything
🫂
I will try again today
but
my question is
what'S the difference between stable diffusion and webui?
both have good ratings
what's better for what usecases?
Stable diffusion is the brick, the ciment that makes the pictures.
WebUI is one of its implementations
there are other UIs for it too
like invokeAI or comfyUI
you should take a look at our #1080946152318443610 maybe
we should move to #🤝|tech-support to continue
for links and images
Comfyui is awesome, I prefer the workflow
how do you call those hair styles for ladies with like a bulb from their hair in english?
not sure, ask chatgpt ? x)
hey all, is it generally better to use underscore in prompts instead of a regular blank space?
it's a bun chatgpt was at capacity
not that I know of, no, unless you start training a model on such a prompting patern
i thought that most models used the Danbooru tagging pattern, is that not the case?
depends really on what you call most
I don't use anime model personnally
so never
x)
even then, you'd have to ask #1072013871730131004 , they may know more
it could be just replaced by space in captionning to conform with the current naming patern
not even trying to do anime though, its just that i found the tagging so complex that it shocked me
there are tags for bangs over one single eye and other crazy specific things like that
the main models trained by S.AI were on some part of LAIONB
so not danbooru
yes those are quite crazy
it's one of the reasons there is so many anime models out there
the tagging system was already in place and going full force
so it's quite easy to make datasets
when merging models, do the tags remain the same?
like, can i take a realism model and merge 10% of anime into it to get the crazy tags?
Like I said, you should ask anime on this. I think they use the crazy tags in quite a lot of models but not with underscores
And when merging, it merges the same tokens
So if merging things trained completely differently, it can have adversary effects
Goodmorning everyone!
I'm surely not the first person to come here and say that I'm developing a new Stable-Diffusion based software with a team of developers, but I was wondering if any of you would be interested in sharing with me (both here or on private messages) their own workflow in order to produce AI generated pictures in order to help us to improve it and make everything easier.
Since I'm the lead designer for the project and I'm also a traditional illustrator and ex-Adobe certified trainer I might have some ideas on the most efficient workflow but I might be biased towards a specific approach to the software!
(the beta will be released in the next months)
new AI software is always excited
the unified canvas from InvokeAI is the future imo
where do I create images
InvokeAI looks great but I'm not sure that "unified canvas" is a necessary approach!
Right now we're working hard on a whole different aspect for the UI. Let's say it will be a lot more focused on the "creative process" with a zone focused on brainstorming ideas and a different area just for evolving those ideas with tools to fine-tune the "promoted" results.
@main grotto
Welcome! Start by heading over to #1072220168534642768 to get yourself situated and help find the channels you are looking for! Please make sure you review our #✍🏼|rules-and-tos and feel free to assign yourself some #👥|roles as well! Answer any questions your may have at our #1072229020520947753. There are many ways of accessing Stable Diffusion, take a look at #1080946152318443610 to start your journey!
I find that the unified canvas gives you more artistic freedom since you can outpaint easily and add more to it, but I only just started working with it. but I believe you want more a technical approach of sort?
(mainly look at #1080946152318443610 on the FAQ I just linked, there are ways to make pictures on there)
We're working for the professional sector, but I want to just give more freedom to the user (and we'll absolutely have Out-painting in our software - I believe it's necessary). You know, if you want to create an illustration or a design, it's important to respect the proportions of the final product.
InvokeAI from what I understand also works for the professional sector, and they give payed plans for companies but for the casual user it is free
That's the same thing we want to do ^_^
great
Also we want to offer custom model and split the "pro" subscription (for those who wants to pay that) with the creators of models and plug-in active on their software / we're still thinking about how to do that - but we want to give a big economic boost to those who wants to work for this community :3
And we want to give the users the possibility to save their progress and keep working the next day from every step they made along the journey towards the end result - which will give a lot more value to the artistic interventions of the user (like: The AI didn't do this art on its own, I did this whole thing because I made the whole journey to this point)
Can you guys please talk to InvokeAI so us casual users can have everything in one app/webui? of course you can still offer these subscriptions separately as two businesses.
kind of like adobe but different companies offering their products in one unified app
I'll contact them as soon as we'll be able to! I don't see anyone as a competitor!
you took the 2 channels that you arent able to lol
I don't think there are any other than those x)
Good morning, everyone! How are we today?
Great, thank you, what about you? ^_^
Trying to get my hands warm, haha! I'm freezing and after my morning pills, I only found one one of my arthritis gloves. 🤣 I think the other took a trip behind something. I'm just going to hide under these blankets and pretend I found it
Usually a warm tea helps :3
Yes! I'll take all of the warm tea!
So, what's everyone up to today?

