#💬|general-chat
where would the smaller ones go?
The smaller files aren't models. These are additional files that can be used together with models.
On civitai.com you will find a lot of models and small files as well.
Check the description (info card). If it's, for example, a LoRA, it goes into models/Lora
ah ok
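A rough sketch of the "check the file type, then pick the folder" advice above. The folder names are what a default automatic1111 install usually has, so treat them as an assumption and check your own install; `target_path` and the example filename are made up for illustration.

```python
from pathlib import PurePosixPath

# Assumed default automatic1111 layout; names may differ in your install.
TARGET_DIRS = {
    "checkpoint": "models/Stable-diffusion",
    "lora": "models/Lora",
    "vae": "models/VAE",
    "embedding": "embeddings",
}


def target_path(file_type: str, filename: str) -> str:
    """Return the relative path where a downloaded file would be placed."""
    key = file_type.strip().lower()
    if key not in TARGET_DIRS:
        raise ValueError(f"unknown file type: {file_type!r}")
    return str(PurePosixPath(TARGET_DIRS[key]) / filename)


# Hypothetical filename, just to show the call.
print(target_path("LoRA", "chalk_dust.safetensors"))
```

The file type is whatever the civitai info card says (Checkpoint, LoRA, VAE, and so on).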
i tried updating my auto1111 and it broke 😭
Then make a screenshot of the whole cmd log for #🤝|tech-support
We can fix it
what is the negative prompt for?
for things you don't want to be in the picture, basically
ok i assumed so just checking
Is inpaint sketch broken? Having an issue where when I send an image to it via text2img, it just... disappears. dramatically. Like it flies off the canvas.
also what is the point of using parentheses in a prompt instead of commas? is that just specific to some models?
Parentheses are used to group tokens and add emphasis. So say you have a prompt that contains "A banana split with chocolate fudge" and it's not generating the fudge? You can try "A banana split with (chocolate fudge:1.2)" to further emphasize that SD needs to add some chocolate fudge to the generations.
ty 👍
Mhm. Just keep in mind that if the model doesn't know what chocolate fudge is, has no reference for it, you'll be outta luck regardless
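To make the `(text:weight)` idea concrete, here is a small sketch that pulls explicit weights out of a prompt string. It is only an illustration with a simple regex, not the parser the webui actually uses, and `explicit_weights` is a made-up helper name.

```python
import re

# Matches "(some text:1.2)" with no nested parentheses inside.
WEIGHTED = re.compile(r"\(([^():]+):([0-9]*\.?[0-9]+)\)")


def explicit_weights(prompt: str) -> list[tuple[str, float]]:
    """Return (text, weight) pairs for every "(text:weight)" group."""
    return [(text.strip(), float(w)) for text, w in WEIGHTED.findall(prompt)]


print(explicit_weights("A banana split with (chocolate fudge:1.2)"))
# prints [('chocolate fudge', 1.2)]
```

Anything outside a weighted group is simply left at the default weight of 1.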
makes sense yea
is there a community guide to prompting for SD? i need like a noob to expert guide, if there is such.
https://docs.google.com/presentation/d/1HEcE3qOAGVujcDaNQbiLXyx7zwKHQkXEILsYBhsot7A/edit?usp=sharing
thank you!
there is ALOT of content in there, is there a specific something i should look for? i couldn't find it using search
had to learn docker to get comfyui working with an RX590 to test it out
i don't think I'll be going back to automatic1111
i'm not sure if any of you remember the RAM issue I've been having where automatic would take up even an entire 32 gigs?... well, comfy hasn't gone above 3gb
should I risk it and try an SDXL checkpoint on an RX590?
are some samplers more prone to making mutants than others?
44 seconds doing 25 steps of 512x512 with an SDXL checkpoint on an RX590
1:53 at 768x768
and successfully completed 1024x1024 in 3:25
i guess that confirms.. 1024x1024 SDXL on an 8gb RX590, with 16GB Ram, is actually doable
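Quick arithmetic on the three timings above (44 s, 1:53 = 113 s, 3:25 = 205 s), comparing how much the pixel count grows against how much the render time grows. The numbers are just the ones reported in chat; `scaling_vs_base` is a made-up helper name.

```python
def scaling_vs_base(timings: dict[int, float], base_side: int) -> list[tuple[int, float, float]]:
    """For each square resolution, return (side, pixel ratio, time ratio) relative to the base."""
    base_pixels = base_side * base_side
    base_time = timings[base_side]
    rows = []
    for side, seconds in sorted(timings.items()):
        rows.append((side, (side * side) / base_pixels, seconds / base_time))
    return rows


# side length in pixels -> seconds per image, as reported above
reported = {512: 44, 768: 113, 1024: 205}
for side, pixel_ratio, time_ratio in scaling_vs_base(reported, 512):
    print(f"{side}x{side}: {pixel_ratio:.2f}x pixels, {time_ratio:.2f}x time")
```

In these three runs the time grew a bit faster than the pixel count (4.66x the time for 4x the pixels at 1024x1024).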
Hello dear friends, do you know of any open calls, contests or events to exhibit AI/Digital/NFTs art or about the metaverse? I recently found a contest for Vogue with AI, but the deadline was closed :/
with SDXL, does the quality of images, meaning accuracy, detail, etc. get better at larger sizes?
What do you think?
so, i'm pretty sure that was a memory leak I was dealing with earlier with the version of automatic1111 I was using
a dozen pictures before having to restart.. with comfyui, I've done like 100 and Python isn't using more than 3gb RAM
Can someone on local SD help me with one prompt as part of a project?
I just need it as an example.
yes
some samplers are probabilistic, compared to deterministic ones
One cat, making a pizza on the beach, zebras are attacking
hey guys
i was spitballing some ideas for myself for stable diffusion art projects
and i was thinking of making a wordless picture book
would my rx 7800 xt kill itself doing picture book sized pictures
640x640 is already kind of a lot for it
Hello!
------
| |
| |
------
:.|:;
What are your webui-user.bat Commandline args ?
hi everyone, I need to do inpainting on some frames of a video of my company (remove a logo). I don't care about temporal consistency, nothing fancy.
These frames are scattered through the video.
Instead of manually extracting frames, manually in-paint them and replace them in a copy of the original video, is there some extension of automatic webui or some comfyui workflow that allows me to do it on the fly, in-place with some playback control? Or do you know any faster way?
How to generate images plz help I am new
Is it possible to inpaint an outfit from an online shop onto a generation?
Or what would generally be the easiest solution to do that?
so, both SD2.1 and SDXL are fairly new?.. what exactly is the purpose of each one?
is there a button or something i can push to make a picture like this to generate all the different results i would get with different denoise or steps https://user-images.githubusercontent.com/121317726/210807006-77a192f1-5fef-41a2-8f05-067511e94129.jpg
idk how ppl make these and figured id throw this out here in case anyone just knew
Hey, this is the X/Y/Z Script under txt2img scripts at the bottom
THANK YOU!
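For anyone wondering what a grid like that boils down to: it is every combination of the values on each axis, one image per cell. A minimal sketch of that idea, not the actual X/Y/Z script; `parameter_grid` and the dictionary keys are made-up names.

```python
from itertools import product


def parameter_grid(steps_values, denoise_values):
    """Return one settings dict per (steps, denoise) combination, row by row."""
    return [
        {"steps": steps, "denoising_strength": denoise}
        for steps, denoise in product(steps_values, denoise_values)
    ]


grid = parameter_grid([10, 20, 30], [0.3, 0.5, 0.7])
print(len(grid))  # 3 step values x 3 denoise values = 9 cells
for cell in grid:
    print(cell)
```

Each dict would be one generation with everything else (prompt, seed) held fixed, which is what makes the cells comparable.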
hmm, all my SDXL stuff comes out looking like 1950s cartoons, despite specifically asking for realistic
well that was easy to solve, I was using a resolution that wasn't immediately understood by the model
Can someone help? I installed Automatic 1111 and it works fine, but it loads without showing me the whole UI. It's like it's zoomed in; I can't see all of the UI, and I always need to scroll to see other parts of it. Is there some setting to make it normal? Thanks.
sounds like your browser might be zoomed in
Good morning, everyone! How are we all this beautiful day?
Everything else is ok. Only when I enter automatic 1111 it shows bigger.
What's your browser ?
Chrome
Updated? And if you have an adblocker you need to whitelist the webui
I'd try another browser too
did you get an answer?
Hi Dear Friends. I got some SDXL Loras, it says Lora. I put them in models/lora but I can't see them in stable diffusion. Can anyone help me with that? is it in the right folder? or do i need some other extension? As Example: https://civitai.com/models/124016/sdxl-chalk-dust-style it says lora. is it not a lora?
#🤝|tech-support 😬😬😳
i may have to stay away from the SDXL stuff for a while, my GPU will do it.. but it doesn't like it
by that I mean.. it does take 3 1/2 minutes to do a 1024x1024
I'm curious, but how many people here support "open source" concepts?
I mean if you prefer stable diffusion over the alternatives I'd say there's a 80% chance you also support gimp, blender, etc.
I'd imagine some people just run it because it's free though, or can be ran from their PC without paying for tokens
something else I'm curious about.. is there a reason we have a starboard channel, and not a port channel?
I'm testing out more SDXL models, see if I can find some more that I like.. the image I got earlier was using the standard model
Discord down for like 10 mins for anyone else?
port ?
Yeah, the Discord status page shows an unresolved API error, but says a fix is being implemented.
yes, port and starboard
what is port ? what should be in there
port is left, starboard is right
okay so what do you vote in port? I'm sorry but I haven't seen a port channel anywhere else, just starboard for top images
It's a boat joke.
Heey i need help
I launched ST but i try to input a image of a girl (ai) to remove her clothes. I get runtime error all the times
Hi. I have a question - is there a good model for compositing a foreground and a background image together.
so if I have a fg image of a bottle, and I want to place it on a big image that I provide (not one that is generated from scratch)
it would composite them and add shadows and other pieces to make them fit better together
what happens if you just copy and paste the foreground onto the background and use it to generate something new based off of that?
Is there any possible way to run stable diffusion without a NVIDIA gpu?
ok, so after cloning the link, when I try to run webui-user.bat, it opens command prompt, activates the git pull function, and crashes. How do I fix this?
hmm, do I have to have a model installed for it to work?
are you able to open the webui-user.bat and run commands individually from a command prompt to see where it actually fails?
command prompt crashes immediately
no time to do anything
open the command prompt alone, if you're on windows you should be able to hit the windows key and type in cmd
where can i ask for help in warpfusion
dope thanks
Maybe a dumb question that can be found elsewhere… but what does putting a number less than one do when in parenthesis? Like (small dog:0.5), does that actually lower the “weight” of the word?
My mistake Mr. The Chad. Won’t happen again
@quick pecan 1 is 100%, 0.1 is 10%, so how much of an effect you want something to have is determined by that number
GUYS PALESTINIANS ARE GETTING MASSACRED, PLEASE DON'T LET WESTERN MEDIA FOOL YOU THEY'VE BEEN DYING AND SUFFERING FOR DECADES PLEASE,I STAND WITH PALESTINE
So what’s the difference between doing (the chad:0.5) and [the chad:1.5]? They’ll both lower the weight? Sorry if this sounds dumb, I’m just struggling with some basic prompts lol
one direction gives whatever you're applying it to more weight, the other gives it less
if I'm not mistaken, a low number like 0.1 will have a more subtle result
Seed variation affects the starting noise used in image generation.
Basically the diffusion process takes the noise and tries to find an image out of it.
How do both regular seed and variation seed relate to each other? What does the variation seed affect when I have already input a regular seed?
Variation seed is usually for fine tuning.
Seed: 803755656
Seed var: 3402361402
I rarely use it but that's its main purpose.
if you specify the weight using a number, (tokens:weight), then that number is what will be used. Meaning:
(dog:0.9) is roughly [dog], since [] divides the weight by 1.1 (about 0.91)
I'm not sure [tokens:weight] works.
As long as you don't add weights, you can play and combine [] and (). but if you specify weights, then the rest is discarded.
Source : https://github.com/AUTOMATIC1111/stable-diffusion-webui/wiki/Features#attentionemphasis
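A small worked example of the bracket arithmetic described on that wiki page: each pair of () multiplies attention by 1.1 and each pair of [] divides it by 1.1. `nested_weight` is a made-up helper for the arithmetic only; it does not parse prompts.

```python
def nested_weight(parens: int = 0, brackets: int = 0) -> float:
    """Effective weight for a token wrapped in `parens` pairs of () and `brackets` pairs of []."""
    return (1.1 ** parens) / (1.1 ** brackets)


print(round(nested_weight(parens=1), 3))    # (dog)
print(round(nested_weight(parens=2), 3))    # ((dog))
print(round(nested_weight(brackets=1), 3))  # [dog]
```

So a single [dog] lands at about 0.909, which is why it behaves almost the same as writing (dog:0.9).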
Oh okay I see
If you like an image but want small variations, freeze the main seed and shuffle the variation seed.
Then another question, if I change the initial seed but keep seed variation the same, what should be the expected behaviour?
seed variation is nice for small changes in a given image, yeah
Oh so it’s the other way around, I see
if you change the initial seed, then the variation will be done from the new seed, meaning a completely new image, close to the one from that new seed
but since you didn't process that seed yet, it's useless
I should keep the same initial seed and change the variation seed then?
If you want the same image yes.
yeah, variation is useless when you change the seed. better put variation back to 0 if you change seed
Variation seed is useless if you change the main seed.
Thank you, that’s extremely helpful
Is there a specific process to go about variant seeds? Or is any random number gonna work?
the lower the variation, the less changes happen
so once you are ok with the quantity of changes, modifying the variant seed gives you more "small variations" around your original picture
it's very useful in combination with features like "X/Y/Z plot"
Like, can I type a completely random number and expect it to work or should the variation seed number be within some specific subset of the initial’s seed character?
It doesn't matter.
you can type a completely random number as secondary seed yes
it's another random seed to go on top of your first seed, basically
It's a seed so the randomness is equivalent for every combination.
what matters is the amount of variation
Lower in which sense? Like for the seed: 803755656 how do the variation seeds come about?
Oh
it's a little like doing some "img2img" steps at the end of your image generation, this feature.
First, it processes your base seed, as is
then it makes some changes by adding some noise, using your second seed. the noise is random, as is the seed
but the changes are more or less significant based on the variation strength
So if you want the variation seed to have a stronger influence over the image you adjust its strength.
I’m trying to figure out the variation seed functionality in general first
Before proceeding to variation strength
variation seed is exactly like the seed in img2img
it's a random number
type 1 or 10000000, it will have the same strength of impact
the strength of that impact is set by the variation strength
but there is not a lot more to learn from the seed
Actually I wonder if the noise seed patterns are the same for variation and regular seeds. If you had the same seed for both variation and main would that even change anything? 🤔
it is the same, and it will change things because the base noise has already been altered quite a lot when the added noise comes into play
Makes sense I suppose.
As I understand it currently, the variation seed is just a member of the initial seed's family. Something like a subseed of the initial seed
But if I make the case of having both the initial seed and variation seed be the same number then what will happen?
Guiz answered that up earlier.
Yeah I read it, thanks for the explanation!
well, when a picture is made, the initial seed decides what "noise" is used as base
then the "steps" happen. it works with your tokens to make sense of the pictures, drawing what you asked using the weights in the model
then, it adds noise, using your variation seed. More or less noise is added, depending on the variation strength
then it does some more steps
So it just deepens the intensity of the noise in those regions if the seeds are the same.
Nice, I understand now
it depends how much the base noise and the resulting pictures differ. usually, quite a lot, noise is noise
I'm talking about the initial noise pattern.
yeah but I mean, there is nothing left, almost, from that base noise pattern, no ?
When? After the diffusion process?
yeah, once the steps have first processed and we start caring about that variation
Yeah I mean it won't matter at that point like you said.
there are other ways to go about that feature, that other tools tried. like adding a small noise to the base noise, before doing any steps
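A sketch of that "blend the noise before doing any steps" approach, using spherical interpolation (slerp) between the noise from the main seed and the noise from the variation seed. This is an assumption about how such a feature can be built, not a copy of any tool's code; the tiny 16-value "latent" and the helper names are made up, and the seeds are the two posted earlier.

```python
import math
import random


def noise(seed: int, n: int = 16) -> list[float]:
    """Gaussian noise that depends only on the seed (stand-in for a latent)."""
    rng = random.Random(seed)
    return [rng.gauss(0.0, 1.0) for _ in range(n)]


def slerp(t: float, a: list[float], b: list[float]) -> list[float]:
    """Spherical interpolation from a (t=0) to b (t=1)."""
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(x * x for x in b))
    dot = sum(x * y for x, y in zip(a, b)) / (norm_a * norm_b)
    dot = max(-1.0, min(1.0, dot))
    if abs(dot) > 0.9995:
        # Nearly parallel vectors: fall back to plain linear interpolation.
        return [(1 - t) * x + t * y for x, y in zip(a, b)]
    theta = math.acos(dot)
    s = math.sin(theta)
    wa = math.sin((1 - t) * theta) / s
    wb = math.sin(t * theta) / s
    return [wa * x + wb * y for x, y in zip(a, b)]


def starting_noise(seed: int, variation_seed: int, strength: float, n: int = 16) -> list[float]:
    """Noise handed to the sampler: mostly `seed`, nudged toward `variation_seed`."""
    return slerp(strength, noise(seed, n), noise(variation_seed, n))


base = noise(803755656)
blended = starting_noise(803755656, 3402361402, 0.1)
print(max(abs(x - y) for x, y in zip(base, blended)))  # small shift at low strength
```

Under this scheme, strength 0 gives exactly the main seed's noise, strength 1 gives the variation seed's noise, and using the same number for both seeds changes nothing, because the two noise patterns are identical.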
in comfyUI, adding the noise of different prompts in the middle of the diffusion process, can help compose quite nice pics
I mean that's basically the euler a sampler...
Any ancestral samplers for that matter.
learning about controlnet now 😄
at the mapping part of it
Are images generated in DreamStudio saved to a cloud? Or is it download only, no internal save?
depth maps are cooooool
Hey guys read this story..and I planned to make this continue, if you people wish. Please let me know in comments your thoughts and show some support
New here, not sure if this is the right spot. I have 2x A100 and an H100 on the way. Is NVLink supported in Stable Diffusion, and will I see an improvement on these cards?
Afaik 4090 is more efficient than a100, unless you've got free access somehow
Generally don't need all that VRAM for stable diffusion. As far as distributed inference across multiple GPU on the same machine, I've seen exactly one implementation and the author recommends against it. Dunno if there's anything newer.
I've actually done a lot of SDXL today with 8GB of VRAM on a 5 year old GPU
yea. if I generate something new it changes too much of the images. I want to constrain them more so it only focuses on better composition
Is https://github.com/CompVis/stable-diffusion.git the same as the 1111 version?
No
Its the core base without user interface or additional features
With image inpainting we can remove photobombers, can we do same for video?
Any suggested method?
Help, can I use my own civitai model in stability ai?
With sd2 they made a big deal about it not being trained on living artists. They dropped that with sdxl and started training on living artists again I'm assuming?
I'm using ComfyUI and the A100 is noticeably faster than my 4090
To be clear I bought the A100 I am not renting a server
@tender cove OR did you mean in terms of power draw?
which model does @radiant meadow use to generate the images it is creating??
SD 1.5?
SDXL 1.0?
Its using SDXL
Is ComfyUI better for gtx 1070ti
Yes. We have bots and mods that filter nsfw stuff. The bots can do it but the users are not allowed to prompt for it #✍🏼|rules-and-tos
Price, not power draw. My understanding is that a100 is about as capable as 3x 4090 when it comes to SD, but costs ~5x as much. As for power draw/tdp, a100 is ~1/2 a 4090, but some people in stable horde are underclocking/undervolting their GPUs pretty heavily for only ~10% performance reduction, though I'm not sure which GPUs or how well that transfers to the 4090.
Keep in mind that I don't actually have either GPU, so it's really just something to look into.
pretty sure it can, yeah
what determines VRAM usage?.. do higher resolution images require more?
seems simultaneous renders in batches will do it
at 512x512 SD1.5, it's like 900mb per batch size
Where can we have hidden text in images? Please ping on reply 😄
secret watermarks?
No like the latest announcement
aah
Yea
I just placed an order for a Sparkle A770
?
GPU
Oh
I should be able to do things much faster after I get that, and probably focus on more SDXL
I meant SDXL
yes, but SDXL models are trained on 1024x1024, vs standard 512x512 so they take much longer and require a lot more resources
I can do SDXL on an RX590 but it's 6-7 minutes per image
Thanks for your input. I have been picking up these cards for testing, as they're out of reach for most and no one who has them seems to share their real-world benchmarks
I'm also open to sharing access if anyone has a work load that can benefit from a 40g or 80g card
Running a workload on it directly
I can even spin up a VM if you need a closed environment
Join the Horde. Server link is in #1080946152318443610
... No it's not. Just the website is. 1 sec
Bleh. Send me a message if you want an invite
Done
is there an easy way to tell which prompts are not being understood?
pixel 8 pro can do what sd 0.1 did 10 years ago
XDD
i bought a p8p... and its so bad.
its like a pc but you remove 99% of the features and strength and make it pocketable.
I prefer tablets in general. Bigger screen is easier to use, almost as portable, and more surface area makes for less throttling so more responsive. I get the cheapest phones I can, and a decent tablet and I'm set.
less throttling?
i dont know if i should send the p8p back
i only bought it cause i wanted free pixel watch 2
More surface area, better heat dissipation, CPU can run fast for longer
I have a full Linux environment in termux on both my phone and my tablet.
😒
You can, or you flash Graphene OS on it to have the most private and secure phone xD
and get rid of all google features. no thx. im not dumb
Thats not dumb but I see why someone wouldn't do that of course 🙂
when i ll do illegal stuff ill do privacy otherwise irrelevant
why are toilets private if u aint doing anything illegal
curious, if I did something like (adjective noun:1.0) will this effectively group that as one prompt, but not add additional weight to it rather than processing the words separately?
that looks super affordable for 16gb graphics card
yeah, but I really had to work at convincing myself to go for more vram, because the 8gb A750 is only $190
i might get that card too im thinking been eyeing the 3060
but honestly all the games side by side run smooth enough in comparision
the 3060 usually like 10 fps higher but i just play at 60 fps anyway lol
Hey guys new here! What do you guys use SD for? I personally use it for video editing been wondering in what other fields people are using it?
I just generate images for fun. It's just a hobby for me.
I personally dont use SD (anymore) but something else but that may not matter for the question i guess.
I use visual generative AI for prompting ideas or references before i switch to main tools or when for fun i might simply photobash AI images with or without external images and assets or work personally on top of whatever i generated/photobashed
Sometimes i basically photoshop and blend me and my friends or celebrities with those images for fun as well
I stopped using SD since google collab banned web UI's on free accounts
prohibited, I should say*
my 1050ti too slow, too low vram too, not fun
end prohibition
Guys need help
I want to create a QR code with my own portrait in the background
Any ideas ????
does anyone know if comfy ui can work with 8gb ram?
Me with my MX130 : 
@jolly verge RAM or VRAM?
right now I'm doing SD1.5 stuff with controlnet and my system total is using 7.85G RAM and less than 6gb VRAM
Hi all! Is anyone here using an RTX 3060 12GB? I'm thinking of getting one for myself. But I can't find any info on what is the largest resolution it can generate images at in Stable Diffusion.
from what I understand the 3060 12gb is pretty solid at AI for the price, it was on my list of possibilities until I decided on the one I bought last night
@void valley also, it's not so much what resolution it can handle, but how long it's going to take, and how long you want to wait. I can do 1024x1024 on an RX590 easily, but it's going to take me 7 minutes per image
Quite honestly I just want to hit 1080p. I can wait. I am running cpu on one machine and a gtx 1650 on the other right now. I mostly just run them locally and control them from the phone while doing other things. So waiting is not a problem.
I also have an rx570 8gb. But as far as I checked it wasn't supported.
not officially, but there are solutions
I saw one guy setting it up under ubuntu. But quite honestly I am not that experienced with linux.
Do you run your 590 under windows?
and i'm not sure if the solutions work on Windows, maybe
if you can run docker containers on Windows then it should work
Yeah. But at that point I might as well just boot up linux from another ssd and learn that.
Whichever has easier to follow tutorials I guess.
And how well does your 590 work?
What speeds do you get on 512x512?
without LoRAs I can do 512x512 in like 44 seconds
since I am using LoRAs right now and doing 512x768 it's taking me about twice that per image
Yeah. Any chance you remember the it/s speeds on the 512x512 or the 512x768?
but a guy that goes by ulyssessrr has docker containers for a version of ROCm and torch that work with the rx590, which I use to run comfyui
My gtx 1650 gets me like 6sec/it on 512x512 and around 15sec/it on 512x768. I am wondering if it would even worth the hassle to set the rx570 up.
i'm doing about 3.90s/it
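For comparing those numbers: s/it and it/s are just reciprocals, and the time per image is s/it times the step count. A tiny sketch, assuming 25 steps per image (the step count mentioned earlier in the chat; the real count for these runs is not stated). The helper names are made up.

```python
def its_per_second(sec_per_it: float) -> float:
    """Convert seconds per iteration into iterations per second."""
    return 1.0 / sec_per_it


def seconds_per_image(sec_per_it: float, steps: int) -> float:
    """Rough sampling time for one image, ignoring model load and VAE decode."""
    return sec_per_it * steps


ASSUMED_STEPS = 25  # assumption, not stated for these runs

# Figures as reported in the messages above (resolutions and LoRA use differ).
for label, spi in [("rx590", 3.90), ("gtx 1650 at 512x512", 6.0), ("gtx 1650 at 512x768", 15.0)]:
    print(f"{label}: {its_per_second(spi):.3f} it/s, "
          f"~{seconds_per_image(spi, ASSUMED_STEPS):.0f} s per image")
```

This leaves out model loading and decoding, so real wall-clock times will be somewhat higher.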
it may be a pain in the ass figuring out how to use docker containers to do it, but if you attempt it, go for the torch docker and run that with comfyui, automatic1111 is unusable for me
Thank you.
and that is part of the reason why I didn't buy AMD again for a new GPU
hey guys, i am working on using stable diffusion sdxl model for inference purpose for generating images using some prompts, What is the hardware requirement for fine tuning sdxl? Thanks
Hi, I need help. When trying some prompts with words like kids or young, the Stability AI API shows an error like banned words or invalid prompts. Can I get a list of banned words so I can try some valid prompts?
Hello everyone, I am collecting feedback for a new AI model that is capable of recognizing AI generated images, it is open source and hosted on huggingface. Here is the link: https://huggingface.co/spaces/freecs/A.I.R.S
Hi guys, now "path to ckpt" no longer exists in stable diffusion automatic 1111... it has been replaced by path_to_model or model_link.
It doesn't work to use the path "/content/gdrive/MyDrive/Fast-Dreambooth/Sessions/"blabla"/"filename".ckpt... does anyone know how to do it now?
I'm using google colab
@wise stratus its been awhile 😔, entertain us pls
Anyone gotten SDXL ControlNet inpainting to work in OpenOutpaint? It doesn’t seem to be masking properly. It’s just generating a completely new pic over the original
i probably won't touch SDXL again until next week
it takes about as long to do an SDXL image on my GPU working properly as it did taking a standard image to finish using CPU only
I'll have an A770 next week
where do I go for help with a problem im currently experiencing with comfyUI?
trying to use the sdxl refiner but the node just gets bypassed
8gb gpu
SD1.5, easy for 8gb VRAM
So is it minimum 12gb for comfyui?
especially if you use comfyui which seems to use fewer system resources than automatic1111
nah, comfy can run under 12 too
no, comfyui seems to use fewer resources than automatic1111
it has less overhead than automatic, true, little lighter
Ah ok.
you can even do SDXL on 8gb with Comfy if you don't mind waiting 5+ minutes per image at 1024x1024
I love that comfy has become popular with time, it's one of the best tools around imo
yeah, it seems confusing at first, but there's so much more you can do with it without having to have someone else rework a UI for you first
I saw a video saying you can use a portrait image to make images that have poses. Can I use base Comfy? Or do I have to get Loras or other things?
are you talking about using controlnet to copy a pose?
comfy has everything you need as far as code goes for those things, but you may need models too. for what you describe, you may need an openpose preprocessor (to extract the pose from the base image) and a controlnet model using that pose if I understand well what you describe (not sure what that video was showing).
I used comfy for such convoluted things to be honest, almost anything is possible in that
like generating an image using 4 different models, merging the latents, ...
I think we can ? not sure, this is the general channel, it may be blocked, but it's mostly possible to post link in this discord yes
yeah, I set up controlnet in comfy.. just needs to load image node into a midas depth map node and feed that into an "apply controlnet" node, and that controls the output
Oh it’s not showing up
I need to figure out how to do backgrounds and foreground subjects separately in general, comfyui might be the easiest way to do it because I could probably use a depth map to detect the character which I could use as a mask for an img2img after generating a background
#🏞|general-with-images will let the preview go through I think, this channel is restricted to text mostly
yep, this uses controlnet mostly, so you'll need to download some controlnet models, but that's about it
piecing together nodes to do custom things automatically isn't really something other UIs can do, but comfy can.. in fact, I'm about to try to set that up right now
Ok thanks
What about certain features of an image? Like if I want the exact eyes of a character in one image to be the eyes of a different character? Is that possible? Or am I gonna have to have both images and just keep regenerating it?
you're talking photoshop territory there imo.
You can use masks, to let the AI alter only the eyes, but you won't just sum the eyes of another character from another picture, you'll generate new eyes
basically, if you apply a mask of an image over where a character is at, you can apply a background to said character with a separate set of prompts.. so what I am attempting to do is use a depth map to detect where the character actually is, convert that to a mask, and use that
Ah ok, thanks.
One more thing. I have both a desktop pc and a MacBook. Has anyone run comfy on a MacBook? Or is just not enough power to run it?
you should ask in #🤝|tech-support maybe.
I've just been back around here after a big break, not sure where we are at
When I left, it was working on macbook, not a power problem, but it did require to run a linux I believe... like I said, I'm really not up to date though
ok, thanks for the help!
example of controlnet (using a 4 leaf clover as base image) and 4 different prompts for 4 different elements, using different models too. Comfy is quite the best for such things
#🏞|general-with-images message
well, 5 prompts if we include the total picture one that is used for the last few steps
right now where I am at is just converting a depth to a mask so I can do everything around the foreground subjects
hey. It seems that prompts have to be literal. I mean, I wanted spiralling clouds around a beam
https://th.bing.com/th/id/OIG.Ye2m9FfeGF2pqT8VaXGV?pid=ImgGn
Howerver, I thought that the AI would place the spiralling clouds on the sky because it's obvious. No, it isn't obvious for the AI.
I tried to generate this scene in SDXL. I got the background and the beam, but all other elements were ignored.
whelp.. I got a workflow that automatically deletes and recreates backgrounds
generate the subject character first, that output image goes into a depth map, depth map gets converted to a mask, mask gets inverted then goes to a vae encoder for inpainting, which sets the latent image for a new ksampler node which has its own positive text prompt for the background
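The "depth map to mask, then invert" part of that workflow, sketched on a tiny hand-made depth map with plain lists. It assumes depth values in 0..1 where larger means closer to the camera (MiDaS-style), and the 0.5 threshold is arbitrary; the function names are made up and this is not what the comfy nodes run internally.

```python
def depth_to_mask(depth: list[list[float]], threshold: float = 0.5) -> list[list[int]]:
    """Mark pixels at or above the threshold as subject (1), everything else as 0."""
    return [[1 if d >= threshold else 0 for d in row] for row in depth]


def invert(mask: list[list[int]]) -> list[list[int]]:
    """Flip the mask so the background becomes the area to repaint."""
    return [[1 - m for m in row] for row in mask]


# Toy 3x3 depth map: the "subject" is the bright blob on the right.
depth = [
    [0.1, 0.2, 0.1],
    [0.2, 0.9, 0.8],
    [0.1, 0.7, 0.2],
]
subject = depth_to_mask(depth)
background = invert(subject)
print(subject)     # [[0, 0, 0], [0, 1, 1], [0, 1, 0]]
print(background)  # [[1, 1, 1], [1, 0, 0], [1, 0, 1]]
```

The inverted mask is what would go to the inpainting step, so only the background gets regenerated from the second prompt.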
i just wish there was a simpler way to straight up do the foreground and background separately without having to do subject detection
well, by definition, foreground and background is that, separating the subject from the rest.... hard to imagine one without the other
yeah, but it's difficult to actually describe multiple subjects in one prompt and not having terms get mixed
that's also a problem even without foreground/background. but there is an extension for that too !
this is meant to limit the impact of tokens to specific group of words
I think that the way to avoid confusion is to have some mathematical terms or physics included in the prompts. Like vectors, coordinates, etc.
you can also target the prompt to specific parts of the picture, with area conditioning
not sure if it's in the main branch now
not the best node for it in my experience some months ago, but it does illustrate the point quite well
https://civitai.com/models/24537/comfyui-visual-area-conditioning-latent-composition
i may try some of those.. IMO, how it is right now is a bit of a weak point with AI
right now, things like comfy UI letting you code even custom nodes quite easily, are tools to build dedicated pipelines for specific tasks
You can spend some hours on a workflow that will do wonders, and call it from the command line, integrating all this in other process for different purpose
such tools like comfyUI are the main glue that will put together more advanced, specialized and user friendly tools imo
what's the state of the field currently on LLMs, open source models in particular ?
working this sort of stuff out though.. the problem solving, this is actually fun to me
yep, I get that 🙂
I spent so many more hours making workflows, labelling, adding rerouting for clarity, grouping, making it more powerful and modular, .... than using the workflow once it's done
one strong thing that I loved in the "targeted conditioning" concept (that lets you set where a specific conditioning is applied) is that it also works with controlnet, meaning you can use multiple controlnets with different models on different parts of the picture if needed
it can also be quite the bother when scaling the picture, since the conditioning needs scaling too
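What "the conditioning needs scaling too" means in practice: if an area is given as a pixel rectangle, every number in it has to be multiplied by the same factor as the image. A minimal sketch; the (x, y, width, height) layout and the snapping to multiples of 8 (SD latents are 1/8 of the pixel size) are assumptions for illustration, and `scale_area` is a made-up helper, not a comfy node.

```python
def scale_area(area: tuple[int, int, int, int], factor: float, multiple: int = 8) -> tuple[int, ...]:
    """Scale an (x, y, width, height) rectangle and snap each value to a multiple of `multiple`."""
    return tuple(int(round(value * factor / multiple)) * multiple for value in area)


# Bottom half of a 512x512 picture, upscaled 2x to 1024x1024.
print(scale_area((0, 256, 512, 256), 2.0))  # (0, 512, 1024, 512)
```

With several areas in one workflow, each one needs the same treatment whenever the output resolution changes.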
right now my workflow looks like a spaghetti monster
and the output for "santa claus" in one prompt, and "the surface of mars" in another, comes out looking like the corpse of santa claus half buried in the sand on mars
i probably shouldn't even bother with this until I get my new GPU and can do renders a lot quicker, which lets me test results and adjust much quicker
I finally got comfy running on my macbook
how well does it run on that?
seems to run pretty well. It took like 30 seconds each image for a batch of 4
512x512 i'm assuming?
yeah, I can try with different one
i wonder... is it possible to have multiple checkpoints going at once?
that'd be cool for simultaneously comparing checkpoints
Hi guys, was hoping to get clarification on hosting my own stable diffusion server..
I'm new to developing with StableDiffusion and wanted clarification after trying to do my own research, feel free to flame me if I didn't RTFM. I want to deploy my own web application publicly which uses SD in the backend with an already-trained model downloaded from CivitAI.
What is the cheapest cloud server / service I can use to host a server containing my Stable Diffusion model that generates content based on prompts. I don't want to store any generated content other than user information for logging in and authentication purposes.
Has anyone used this service? https://console.cloud.google.com/marketplace/product/techlatest-public/techlatest-stable-diffusion?q=search&referrer=search&hl=en&project=ai-sandbox-398320
I'm wondering how this works vs doing it manually. Any and all advice appreciated.
what does this mean on the stable diffusion webui page "Your space is on error, check its status on hf.co"
that makes no sense to me
curious.. when you do prompts, is it best to do more important prompts first?
anyone?
i think it is not official sd webpage.
Stable diffusion webui, is local installation, wouldnt be surprised if hf.co is some scam page.
Hello peep!
is there a /describe feature here? if not where can i use it. I have seen it somewhere
i know midjourney has one but its paid
try one of the #1100170312106127410 channels, they have their own prompt triggers that work well
What do you guys think about this https://www.youtube.com/watch?v=aAdOCKn8f40 ! It's using SD to generate the images btw
Bing AI no longer allows the word 'demon' in prompts
or nuclear bomb
I wish the model Bing AI uses was downloadable because their prompt blocking is garbage
id like to buy a SD phone
wot why demons
thats for generating images. I want image to text
to describe the image
its dalle3, from OpenAI
if you happen to use A1111 that has built in option to interrogate images for text description
i'm personally never using a1111 again
hey everyone
would this be the chat that i could post a question?
oh i think i found it!
I'm actually surprised how much of a difference denoise makes, it's almost like having a completely different model
Hey all. this is a rando question - has anyone yet seen a consistent approach where a trained model of a realistic character is used to inpaint into realistic photographs? Basically to inject them into scenes. I've seen a few use cases but nothing solid. Wondering if it's too far a leap (especially if the photographic style/scene/lighting changes too much across photographs)
Thinking of using a posable model to create a depth map to match into the scene and inpaint in. but it's been tricky so far, not sure it's a rabbit hole worth exploring quite yet
why is that
it was nothing but problems, besides comfy is more versatile
oh wow, and i never had any issues with a1111
i also tried comfyui but couldn't settle with its interface
I like visual scripting, something that's cool about it is if something fails, you can actually see where the failure is immediately because you can see the lines of code being executed in order
which was great when I was making stuff in Unreal Engine
less syntax sensitive than traditional coding
stable diffusion v3 probably incoming : https://twitter.com/GozukaraFurkan/status/1713329091074502956
so what was the point of SD2.1 then?... it's here and almost immediately surpassed?
I can't seem to generate images
Hopefully less demanding than SDXL 
O shit nvm
unless SD3 is trained on higher resolution, and replaces both SDXL and SD2.1
what resolution is DALL-E 3 trained at?
seems to be 1024x1024, same as SDXL
as far as I remember SDXL was intended to be part of something better 
perhaps.. and if they do successfully clone DALL-E 3, I see it as being an SDXL replacement, not a standard SD replacement
so, maybe SDXL2.0? Would be nice if the improvements could also be applied to standard
sdxl 1.1
by the sounds of it, SDXL2.0.. I've seen what DALL-E 3 can do, it's a game changer for many, if stable diffusion can keep up, that'd be amazing
game is the same. Game changer is like when shaq started slam dunking so hard that it broke the hoops. game changer is like when google maps came out and reloading webpages was less important.
game is even worse with dall3 since its a restrict license from a service
dalle 3 is over hyped. its not that great. people can use sdxl to greater effect if they learn
"it understands prompts better" but thats all you can do is prompt it. sdxl has countless other ways of controlling and affecting an image since it's open. also, what's good is understanding it if it can't be used for anything except looking at for personla use only
bing is censoring just too much. I want to create explosions, war, blood but not real graphic content. It's fantasy art such as vampires, destruction caused by fireballs, magic, etc
https://sites.research.google/parti/ this research even blows dall-e 3 out. how can dall3 be a game changer or next level, when parti is just better like 6 months ago
@fervent thunder for sure... I actually got a temporary ban from Bing because I was asking it for body horror stuff
specifically, someone tearing out their own eyes, like in event horizon
I've been using DreamStudio to try to generate images of a scorpion and it's just not working, it keeps giving me some sort of shrimp/lobster looking thing, any tips?
Prompts I've tried:
A golden scorpion with eight legs and two big crab claws and a long stinger tail
A golden scorpion with a long, tall, tail with a stinger curved up with red eyes on a leaf with a dark green background
go higher weight on scorpion
golden (scorpion:1.3) ...(long stinger tail:1.1)
I'd get rid of "with" , "and" , "a" too
@pale latch the way the AI understands the prompts and how you can interact with it via ChatGPT is indeed mind blowing
But you dont have all the customization that Stable Diffusion or Firefly offer
But people might use more than just one of em like i di
di
do*
daboo dee daboo da
generating images is like gambling, just need to generate 1 more time till i can get the perfect image
the best strategy is to put all your AI points into luck. A lucky seed can drastically reduce the total count of images you need to make
that, or reducing your perception score, so you don't see the imperfections
That's a nice way to tell someone to gouge out their eyes 
What does "DM A" and "DM B" mean?
Hey all. Is anyone here interested in a smaller docker image for amd rocm pytorch? The official amd pytorch image i downloaded was 35-50GB. Now i was able to go down to a 10-15GB base image just by changing to fedora without anaconda. And i haven't done anything funky yet
Tested with stable diffusion webui and comfyui. Both work nicely so far with just a basic installation
Does the docker image work on windows?
can anyone tell me what this means: "Warning Image-2-Image: disabled" ... happens whenever i check the image 2 image box
If it runs inside WSL2 theoretically it should work, but i have no means to test it
ah alright
does anyone have a really long prompt for stable dif? I need to prove a point about them existing
I must take back my statement, it probably would not work as amd hasn't resolved wsl2 rocm as of yet
yo guys, i tried to use dreambooth on my mac and here's what i get, it doesn't work when creating the model
Duration: 00:00:00
Error completing request
Arguments: ('CCCarlitosStyleTTTattoo', 'anything-v3-fp16-pruned.safetensors', '', False, '', '', False, False, '') {}
Traceback (most recent call last):
File "/Users/carlos-albertorios-flores/stable-diffusion-webui/extensions/sd_dreambooth_extension/dreambooth/utils/utils.py", line 273, in f
res = func(*args, **kwargs)
File "/Users/carlos-albertorios-flores/stable-diffusion-webui/extensions/sd_dreambooth_extension/dreambooth/ui_functions.py", line 984, in create_model
result = extract_checkpoint(new_model_name=new_model_name,
File "/Users/carlos-albertorios-flores/stable-diffusion-webui/extensions/sd_dreambooth_extension/dreambooth/sd_to_diff.py", line 151, in extract_checkpoint
original_config_file = get_config_file(train_unfrozen, model_type)
File "/Users/carlos-albertorios-flores/stable-diffusion-webui/extensions/sd_dreambooth_extension/dreambooth/sd_to_diff.py", line 82, in get_config_file
model_version_name = model_versions[model_type]
KeyError: ''
Traceback (most recent call last):
File "/Users/carlos-albertorios-flores/stable-diffusion-webui/venv/lib/python3.10/site-packages/gradio/routes.py", line 422, in run_predict
output = await app.get_blocks().process_api(
File "/Users/carlos-albertorios-flores/stable-diffusion-webui/venv/lib/python3.10/site-packages/gradio/blocks.py", line 1326, in process_api
data = self.postprocess_data(fn_index, result["prediction"], state)
File "/Users/carlos-albertorios-flores/stable-diffusion-webui/venv/lib/python3.10/site-packages/gradio/blocks.py", line 1229, in postprocess_data
self.validate_outputs(fn_index, predictions) # type: ignore
File "/Users/carlos-albertorios-flores/stable-diffusion-webui/venv/lib/python3.10/site-packages/gradio/blocks.py", line 1204, in validate_outputs
raise ValueError(
ValueError: An event handler (f) didn't receive enough output values (needed: 10, received: 3).
Wanted outputs:
[dropdown, html, html, html, html, html, html, html, slider, html]
Received outputs:
[None, "", "<div class='error'>KeyError: ''</div>"]
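Reading the traceback above, the failure comes down to a dictionary lookup with an empty string as the key (`model_versions[model_type]` with `model_type == ''`). A minimal sketch of that situation, assuming a made-up table: the keys and values below are hypothetical stand-ins, since the extension's real `model_versions` contents are not shown in the log.

```python
# Hypothetical stand-in for the extension's lookup table; the real keys
# and values are not visible in the traceback.
model_versions = {"v1": "config-for-v1", "v2": "config-for-v2"}


def get_config_name(model_type: str) -> str:
    """Look up a config name, failing with a readable message on a bad key."""
    try:
        return model_versions[model_type]
    except KeyError:
        known = ", ".join(sorted(model_versions))
        raise ValueError(
            f"unknown model type {model_type!r}; expected one of: {known}"
        ) from None


# An empty model type reproduces the shape of the error in the log.
try:
    get_config_name("")
except ValueError as err:
    print(err)
```

If that reading is right, the model type field was empty when the model was created, so checking that a model type is actually selected in the UI would be the first thing to try.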
i think after I get my new GPU I'll probably just be doing SDXL exclusively, since according to benchmarks I'll be able to do 1024x1024 stuff faster than I can do SD1.5 512x512 now
also, something I've really begun to appreciate with comfyui is the ability to have a bunch of different text prompts placed off to the side somewhere that I can plug in and swap around as needed
Hi! I'd like to find an equivalent of " Stable Audio " that I could install on my PC. The same for " Suno AI " that allows people to generate songs (music + text + voice)
Not sure if this is the right place to ask. I'm trying Fooocus for the first time, instead of Automatic1111. Fooocus seems to only let me use 5 loras at once though. Is there a way to add more lines for loras?
Also just realizing that I don't know how to use embeddings in Fooocus. Or whatever the files that go into the "embeddings" folder are called.
guys I have a questiooon, can I change the name of the models and loras? Because ufff it's quite a mess and I can never remember the one I'm looking for, and I'm not sure if renaming can affect anything
Or if even I can use Subfolders?
why do all my outputs look so ugly
how do i fix this
like why is it so pixelated and stuff
also my computer doesnt seem to be using much resources at all
task manager tells me nothing is really doing any work
u have wrong vae
like if u have xl vae u need xl model
but if u mix them
kaput
oh
I mean that can be 1 of the reasons
alright, ill try finding another VAE and model
nope. accessing amd gpu through wsl2 i thought was going to be easy too when i was using AMD. Apparently microsoft has tried to work with them to get their drivers up to speed for WSL2 but amd's windows driver engineering team is kind of, well, you know
the VM needs the host's drivers to support pass through and AMD isn't there
Ah good to know thanks! Hopefully in the future it works better
Not till 8000 series probably
if i had amd and was trying to do this machine learning stuff, i'd switch to nvidia. in fact, that's exactly what happened. Bought amd gpus exclusively for nearly 20 years.
they were cheaper and could play games still. the limitations weren't that hard
True but rocm on windows will help SoonTM xD
are blocked images accessible to whoever moderates the AI?
"an angel impalling a dragon" was blocked. But "an angel slashing a dragon" was not.
impalement is too sexual
i'm using an RX590 with ROCm on Linux right now, it works fine
but AMD did drop official ROCm support on my card and it looks like they're trying to tighten up which GPUs are allowed to make use of it
Anyone got a prompt that creates a clean visual representation of a neural network? Or else a Lora or some embeddings? I'm lost and can't create one in SD (Automatic1111 webUI),.. any help appreciated, thanks. 🙂
I tried Img2img and it just copies the existing one I got from the interwebz,.. hardly original or accurate.
How i can generated images ?
@fervent thunder
Currently, there is a public bot on the server that generates images available as a research beta for SDXL, you can find the current status of the bot in #1047610792226340935. There are plenty of ways to use Stable Diffusion such as the official https://dreamstudio.ai/ website or by running Stable Diffusion locally using your own hardware - check out #1080946152318443610 for more details! You can also stop by #1025467151206854736 for any issues you experience while using DreamStudio or #🤝|tech-support for any problems you encounter while installing it locally!
I'm asking about this server, there are 1 - 10 bots
Been trying to find a good place to buy a used 3090, tried ebay got burned on a bad card (return accepted luckily so didn't lose any money but still) are there any reliable ways to buy a good used card or is it just take your chances on ebay and keep trying till you find a good one?
buy a new one, less problems
Sadly I need both kidneys
Usually adding more than 5 loras will make results bad (like in a1111 or comfyui) so I guess fooocus devs hide this.
I think in the “run_anime.bat” it has an example of using embeddings in it. Like (embedding:unaestheticXLv31:0.8)
Related question how much better is a 3090 than a 4080, in terms of training and generating large images (speed not as important to me) is the 3090 the clear winner?
(
hi, kind guys: why I can't access midjourney's channel?
It just can't click in, but it's ok for other channels
I wonder when will people make ai less chaotic
All I wish I had was some sort of pose curation AI, or at least a model that understood any type of pose and not just the typical basic poses.
Even throwing a bunch of ControlNets together with Depth, OpenPose, and Canny, it doesn't make it possible for current SD models to make the poses the characters are in if the poses and angles are so foreign to its training data.
Hello.. is there a mobile app for illusion diffusion ai?
What's the difference between Checkpoints and Loras?
hey do any of you know about dreambooth on mac ??
if i want to use comfyui with a v1.5 model, what node do i need to take ? "CLIP Text Encode (Prompt)" ?
if you have default Workflow. Try just load 1.5 model and it should work
i do need to organize comfy more, and make use of grouping so I can turn entire clusters of stuff on and off depending on my needs
guyss how can i train SD with my style on a mac, dreambooth doesnt seem to work..
Hi. Is there a support team here? I'm not sure if there is a network error or if my image outputs/prompts have caused me to be locked out?
Try probably ask in tech support or prompting help but be more specific.
Will do. Thank you.
hey people, i'm searching for a working discord bot to use with my stand alone auto1111 installation... there is literally zero bots that work out of the box or i'm crazy? everything either trying to download their own model or some shenanigans ...
Does someone know if it's possible to somehow detect faulty images, where body proportions are off or there are other errors in the image? Like, to generate a lot of content and then filter and sort it for the best results.
@violet fern there are some tools for fixing faces after generation, but, when it comes to faulty images, the go-to solution to them is inpainting
I mean more like a process where the AI detects IF it has errors, not one that improves them.
Hey I am brand new to all of this so I’m sure I’m missing a lot, I have everything loaded in but when trying to edit in image to image with inpaint to replace the area I painted I get basically nonsense instead of the AI looking at the rest of the picture and replacing the affected area with something that also makes sense, am I missing something obvious?
probably not as well as you can detect errors with a quick glance
So it is somehow possible?
the closest I know of is aesthetic scoring. it's not focused on defects but on global aesthetics. You'd need to train another type of detector specifically for that task for something more accurate
you can find extensions for automatic that do aesthetic scoring
Thanks thats what i was looking for!
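A rough sketch of the "generate a lot, then filter" idea, assuming some scorer already exists. `keep_best` is a hypothetical helper and the scores below are made-up numbers, not output from any real aesthetic-scoring extension:

```python
from typing import Callable, Iterable


def keep_best(
    paths: Iterable[str],
    score: Callable[[str], float],
    threshold: float,
) -> list[str]:
    """Return the paths scoring at or above threshold, best first."""
    scored = [(score(path), path) for path in paths]
    kept = [(s, path) for s, path in scored if s >= threshold]
    kept.sort(reverse=True)
    return [path for _, path in kept]


# Made-up scores standing in for whatever scorer is plugged in.
fake_scores = {"a.png": 6.1, "b.png": 3.2, "c.png": 7.4}
print(keep_best(fake_scores, fake_scores.__getitem__, threshold=5.0))
```

As noted above, a global aesthetic score will not reliably catch wrong proportions, so this only narrows the pile before a human look.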
is there an explanation for bizarre mistakes such as drawing the armor but without the body? I wanted an armored dragon flying, somehow the AI made the armor with no dragon. An empty armor flying. what?
@fervent thunder did you try generating the dragon first and then the armor?
inpainting
how do I do inpainting?
I have returned
Hi I was thinking about using midjourney or stable diffusion for comics and I was wondering which is better?
well stable diffusion gives you more control and there's no cost or limitations
Are there any promo codes available just now?
Hello! do you know what is the best way to train a art style to use it for games, comics, or other media? with Lora or checkpoint?
Hi guys, how do I generate a new image based on an already existing image? For example, how can I upload my own photo and generate it in a different art style?
I guess you use a model, or your own trained model and with controlnet with openpose, you draw the pose or use the photo reference
Is it not possible to just upload the image and then tell AI to recreate that image, but in a different art style?
Midjourney has it
Well, use controlnet and add the image reference, that´s how i learned it.
I don´t know if there´s other way
Never heard of controlnet, what it is?
Is it usable in here? Or just in the PC version of Stable Diffusion?
in the pc version
But I heard some websites offer controlnet for img to img
heh.. love jacking up skinny prompts, along with muscular prompts, and creating very buff skeletons
nice, my A770 will be here day after tomorrow
Someone know how to create a realistic Disney poster?
hey how do i make a negative prompt?
nice... just found out there's a node that automatically sends a finished image over to the input of my "inpainting" workspace... more automation, yay
The more I see DALL-E 3, the more I cannot wait for the day SD replaces CLIP and begins to close the gap. I mean, SD has the big advantage of being completely uncensored and open source yes, but closing the gap where prompt adherence is concerned would be the greatest thing that could possibly ever happen to SD.
Yeah, hopefully SD3 is released before year's end
reminder to not use bing create: yes it's great but limited, and you can't for example use "cigar" or anything similar :C
any tips on the dataset of pictures i need for making an embedding of myself. i started just asking my girlfriend to take a picture of me in 1:1 ratio every time we go out to have a diverse set of different backgrounds clothes and lighting in my data set, but not sure how many picture of close up etc i should do and how varied i should make facial expressions and angles.
The question is if it will come to D3 level
And if D3 in the meantime will get some updates as well
I downloaded stable diffusion and there's an error later that shows in the code that says "stable diffusion failed to load" what can be the solution?
Cuda memory falls short
"downloaded stable diffusion" doesn't mean much as I can probably nameoff the top of my head a dozen of stable diffusion interface/client.
Also a full log would probably help understand what's happening there.
Also, #🤝|tech-support might be better suited for that kind of discussion.
In edge is ok. 😉 thanks. But i wanted to work in chrome...cant understand why it shows the screen so big.
Then you would need to clear the chrome data, cache etc
Well the problem is that i did that already, still the same happening, and i just installed Fooocus to try it, and the same is happening using chrome. Cant understand whats happening.
Chrome is just a bad browser ^^
Firefox best 😛
Maybe reinstall chrome but save your passwords and stuff
yeah, I will try that. Thanks for the help, 😉
No Problem 🙂
Good! What are you up to?
I'm great because Dion released his Stable Audio GUI this morning!
anyone knows of any lora to put the kimetsu no yaiba uniforms in a character?
can someone generate this prompt cuz bing image generator doesn't work ? "four skeletons in a gas station smoking cigars in fisheye view"
hmm... I need to do more optimizations with ComfyUI... what I'd like to do is have a series of "preview image" nodes that populate sequentially, so I can keep my last 4 generated images on screen, or separate them all when I do a batch of four
hey, just curious, when people use all these ratios in their prompt, what are they doing?
eg. (ornamental engravings:1.2)
what does the ratio do?
That number applies weight to the text inside the parentheses. A 1.2 means they want the model to emphasize those tokens 1.2 times as much as the other tokens.
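To make the `(text:weight)` idea concrete, here is a simplified sketch that splits a prompt into chunks with their weights. It is an illustration only, not the parser any UI actually uses: it ignores nesting, square brackets and escaped parentheses, and `parse_prompt` is a hypothetical name.

```python
import re

# Matches "(some text:1.2)" with no nested parentheses.
WEIGHTED = re.compile(r"\(([^():]+):([0-9]*\.?[0-9]+)\)")


def parse_prompt(prompt: str) -> list[tuple[str, float]]:
    """Split a prompt into (text, weight) chunks; unweighted text gets 1.0."""
    parts: list[tuple[str, float]] = []
    pos = 0
    for match in WEIGHTED.finditer(prompt):
        plain = prompt[pos:match.start()].strip(" ,")
        if plain:
            parts.append((plain, 1.0))
        parts.append((match.group(1).strip(), float(match.group(2))))
        pos = match.end()
    tail = prompt[pos:].strip(" ,")
    if tail:
        parts.append((tail, 1.0))
    return parts


print(parse_prompt("A banana split with (chocolate fudge:1.2)"))
```

Everything outside the parentheses stays at the default weight of 1.0, which is why the weighted chunk stands out relative to the rest of the prompt.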
how to I img2img in a bot channel?
nice.. just polished up my auto background workflow
well, it seems the whole Clipdrop community got an "answer" of some sort. Clipdrop updated their pricing plans, and SDXL image generating, along with Stable Doodle and Background Replacer are crossed out in the Free plan meaning no more free generating with Clipdrop. These functions are now behind the paywall
I'm thinking I might even just create a custom comfyui node which consists of a group of nodes I have linked together that can perform a single, automated task
Good morning, everyone! How are we all this lovely day?
hello Sunny, im fine besides of pain in my right hand. How about you? 😄
what is the best stable diffusion model
and if i dont have the requirements can I still do it
What's your VRAM fam?
You can use options such as --medvram and --lowvram. Tiled diffusion extension can help as well
where do I check that
ok
in your task manager, go into "performance" (1), click on your graphics card in the list of devices (2) and look for "Dedicated GPU memory" at the bottom of the screen (I put a square around it)
(go look at the screenshot I put in #🤝|tech-support
ok
I have that and it's more than sufficient (except for SDXL which is the latest version)
is sdxl the best one?
Depends. If you can't run it easily it's not going to be the best for you. But yes it is better in general.
alr which one should I take then
My suggestion: Google how to install automatic1111 GUI -> look at civitai dot com -> search for checkpoints (also called models) -> find one whose art you like -> make sure it's compatible with SD 1.5 or 2.1 -> google how to place the file and how to generate images with automatic1111
oh ok
there's a lot to cover but for now try to get a model running
alr
whats the best stable diffusion model to download rn?
I believe the latest official one is SDXL 1.0
https://huggingface.co/stabilityai/stable-diffusion-xl-base-1.0
nice
downloading it now, how much vram is needed?
or does it like not matter so much?
I'll be honest, I'm coming back after a long break and haven't run it yet, I'm downloading it too currently, and I don't know the requirements, I have 24GB, but I think I heard it could also run on a lot less
holy monkeys 24??
vram
jesus christ
like gpu ram
I'm on 3090TI
damn
i got a 3070
SDXL is still the latest official model.
it only got 8 though
I'm running a modest GTX 1660
yeah, the range of VRAM is quite big on the 30XX series
3X AS MUCH
lmao
goodness
price too though
I've got a 3090.
I've only got 6 gigs of vram lmao
I got that one specifically for training
I hate prebuilts, they feel icky idk why
i got mine fo free though so i aint complaining
so I'll plug what I trained, long time
https://civitai.com/user/Guizmus
most of it being trained from art on this very discord
oh i guess if you're actually training models then 24 gigs might be reasonable
yuh my laptop broke so they gave me a voucher of £1200 so i got a desktop instead
but for generation that is waayy overkill
damn that's a steal
IKR
what do u even need 24 gb for anyways?
most things u can even do dont require that many gb's do they?
completely, yes. images taking 24GB VRAM are so big that the AI isn't capable of doing good things. it's very complex to keep the results good after a certain point
mhm
whats the difference between normal stable diffusion and sdxl?
the size, number of parameters. How it was trained... a lot internally
From the user's point of view: the quality of the results, and the requirements
will i have to download different models than the ones i was using before?
that was a very time inducing process
it's the main difference, yes. there is one model for SD 2.1, and one model for SDXL
hell, there are hundreds or thousands of models if you're into that
quick question, what is a pickletensor file?
Some models will require a second file with the same name that will determine some internal settings that need to be applied to use that model.
i think the last model i used was 1.4 so im hoping alot is different
ckpt files are also called like that sometime, to mark the difference with the relative security of safetensors
i just installed stable diffusion
my first pictures generated with model are so ugly !
i guess something is wrong
sorry I can't, but welcome around
Thank you .
What does "ugly" mean?
oh ok
does that mean this file is potentially unsafe? I mean I got it from civitai so idk
If it ends in .safetensors you are ok
civitAI runs a basic unpickle test, to protect against viruses. people can also report dangerous files. They do not guarantee anything though
This indication just means it's an old file usually, not necessarily dangerous
but if there is a safetensor alternative, take that one for sure
If it ends in .ckpt it's probably fine, but you should try to find a .safetensors version.
there are also spaces online that can convert a ckpt to safetensor, but this is usually more for the people that do make the models
it would require you to upload the model for that
yeah but it's a pickletensor
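The rule of thumb from the messages above can be sketched as a small check on the file extension. This is only a naming check under stated assumptions: it does not scan the file, `describe_model_file` is a hypothetical helper, and treating `.pt` as pickle-based alongside `.ckpt` is an assumption added here.

```python
from pathlib import Path

# Extensions assumed to be pickle-based checkpoints (.pt is an addition
# here; the chat itself only mentions .ckpt).
PICKLE_SUFFIXES = {".ckpt", ".pt"}


def describe_model_file(name: str) -> str:
    """Classify a model file by extension only; the file is never opened."""
    suffix = Path(name).suffix.lower()
    if suffix == ".safetensors":
        return "safetensors: preferred format"
    if suffix in PICKLE_SUFFIXES:
        return "pickle-based: probably fine, but look for a .safetensors version"
    return "unknown format"


for filename in ("model.safetensors", "old_model.ckpt"):
    print(filename, "->", describe_model_file(filename))
```

A file flagged as pickle-based is not automatically dangerous, it just cannot be assumed safe the way a `.safetensors` file can.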
Not sure if this is the place to ask but is there a list of "Negative prompts" that are used regularly to get good results?
whelp, UPS just dropped off my new GPU
has controlnet been removed from the bots?
if anyone can pm me , i need help with stable diffusion , would be much appreciated
Hello everyone, is there an option for inpainting available here?
Hi guys, is there a specific place thread to ask about comfyui?
Here is Ok, also in sdxl and in tech-support if it has to do with installing etc
Does CPU matter at all for stable diffusion
@fervent thunder not if you're using GPU to do all the work
i got a r7 5800x and was wondering if upgrading to a r9 7950x would change anything
for IT/s i mean
what's your GPU and are you using that for SD?
then I don't think a change in CPU is going to net you any noticeable difference
really only care about stable diffusion performance as games dont benefit from 16 cores vs 8 and since i'm playing on 1440p a faster cpu really doesn't do anything
looking for a reason if its even worth changing to am5 yet
1500€ for a new base is a lot of money compared to additionally getting the cheapest 4090
yeah i figured as much
i want to train loras
something thats a pain in the ass on my 1080ti, takes over 2 hours to do a proper one
gm frens ive arrived
alo
well now, seems I stepped in some shit... installing comfyui for intel arc
that might depend on if you use automatic1111, comfyui, etc
i dont know what any of that means
i got stable diffusion? installed
ok i think i found my senpai, Olivio Sarikas
stable diffusion is a command line tool, if you want a graphical interface, you use something like automatic1111 webui, or comfyui
hey can someone help me understand how to update controlnet
The extension can be updated in the extension tab.
The models need to be manually downloaded
Has anyone used animatediff, please let me know. Why I've installed animatediff like in the picture but comfyui still doesn't recognize it??
Hello)
nice.. think I got torch 2.0 working with intel arc optimizations now
wow... full 20-step 512x768 images in 9 seconds
4 seconds per with DDIM
vs over 40 seconds before.. so yeah, 10x speed upgrade 😄
try this on the #🎥|animation channel, there's a lot of people with animateDiff knowledge there, to me it seems like it's asking you for a node you don't have installed or it's deprecated, that's why it's in red, it doesn't find it. Are you using someone else's workflow? they are probably using another custom node, check the command window
dude... 1.30s/it for 896x1152 SDXL
Does SDXL + Controlnets (IP-Adapter, depth etc) work when using TensorRT in A1111?
Hello, I just installed SD last night on my 1080Ti
Is it okay to generate images of my favourite kpop idol?
It would just be for personal use, but the law is confusing
hello, any pointers on where I can find negative embeddings for sd2.1 ?
Best place to look is probably civitai
i checked but couldn't find for 2.x (most are for 1.x)
You can filter results for 2 only, but it's also newer so there won't be as much stuff for it over all
what are the like licensce terms of images you make?
Is there any way to enable NSFW images?
My friend wants to know.
Not on this discord.
No I mean on the platform of the photo generation
Who has an RTX3070 please respond.
i have @fleet wren
guys , is it possible to turn a picture of myself into an anime character ? i mean exactly the same picture
yes image to image
Can someone help with this?
NansException: A tensor with all NaNs was produced in Unet. This could be either because there's not enough precision to represent the picture, or because your video card does not support half type. Try setting the "Upcast cross attention layer to float32" option in Settings > Stable Diffusion or using the --no-half commandline argument to fix this. Use --disable-nan-check commandline argument to disable this check.
How do i use the --no-half commandline? Where do i enter it?
I feel stupid again, I have to reinstall Automatic 1111, but after a stroke I had a year ago things sometimes fall out of my memory. This time it was something about the argument line: I remember "--xformers --autolaunch" but then a huge blank. I have a rather okay Intel with a 3090, how should my "set COMMANDLINE_ARGS=" look?
webui-user.bat if its a1111
what's wrong with what you have now?
I do not know, I have a vague thought it was longer before. My logic is sometimes failing, so I ask because I feel insecure.
it's like long prompts, it's mostly useless to have tons of args if everythings working fine
Ty. Will run it as it is. I am on my first run of it and that is a thing I do remember, that that can take a long time before it start.
@stable pawn when i launch it, in the command console it says xformers not found, running without it
getting Intel Arc optimizations working on Arch Linux was actually extremely simple
guys does anyone know how to change camera position. Im trying to get a shot from underneath so you have a crazy perspective
could try low angle, maybe "shot from below"
you could try adding parentheses around the prompt to give it more weight, like ((something))
how long is a pic using sdxl gonna take on average to generate?
i got 8gb vram and it taking years
--xformers --no-half-vae would be enough
then you need --xformers --medvram-sdxl --no-half-vae
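A small sketch that turns the advice in these messages into the line that goes in webui-user.bat. The flags are the ones named in the chat; the 8 GB cutoff and the `suggest_args` helper are assumptions for illustration, not an official recommendation:

```python
def suggest_args(vram_gb: float, uses_sdxl: bool) -> str:
    """Build the COMMANDLINE_ARGS line following the advice in the chat."""
    flags = ["--xformers", "--no-half-vae"]
    # Assumed cutoff: cards with 8 GB or less get the SDXL memory saver.
    if uses_sdxl and vram_gb <= 8:
        flags.insert(1, "--medvram-sdxl")
    return "set COMMANDLINE_ARGS=" + " ".join(flags)


print(suggest_args(vram_gb=8, uses_sdxl=True))
print(suggest_args(vram_gb=24, uses_sdxl=True))
```

The printed line replaces the existing `set COMMANDLINE_ARGS=` line in webui-user.bat; the webui has to be restarted for it to take effect.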
ooo
right yeeee i got no half vae on but i thought xformers made it worse so i got rid of it
Thanks
Breh
xD xformers is the biggest performance boost you can get. and it doesnt change the quality
sowwy
Hi i need help I just installed stable diffusion with realistic vision but it only generates two or more people instead of one when i put a "complex prompt" it doesn't matter if I tell it that there is only one character alone.. I don't understand
Holy fuckin shit! I just tripled my generation speed!
Nvidia finally released their tensorRT extension, and i went from 7.36 IT's to 20!
is lora file supposed to end in safetensors?
yeah i downloaded something not sure what now, but ill get up to date soon, well maybe not so soon, but ill get up to speed
The constant generation of waifus on this discord is beyond unfathomable levels of cringe
yes
for some reason when i downloaded a lora file thing and i put it in the folder, it didnt appear
Hey everyone, does anyone know if I can use my own z-depth image in Control Net or other plugins? I'm looking to generate some textures on top of my 3D scene, but the z-depth produced by Stable isn't suitable for my needs
You can read through what they do and decide what you want and whatnot
https://github.com/AUTOMATIC1111/stable-diffusion-webui/wiki/Command-Line-Arguments-and-Settings/28f4dc4c6f4e6113cce908db6b66a2a20f164871 
In WebUI you just refresh the Lora tab. Make sure you're using an appropriate base version checkpoint.
any good content creators out there? to learn and keep up with SD?
Does anyone know a good model for making anime images more shonen-like (creating characters like ichigo n shit) and not hentai like every single anime checkpoint i find on civit?
Hi, would you know which is the best model driven by microscopic images? I would like to generate some thank you
Good day! Could anyone tell me what is the best way to start making AI generated portraits using real people images? How to make face super realistic?
man.. now I have to go through the task of rebuilding my SD workflow
who wanna start with dropshipping?
This still only works on Nvidia and CPUs right? Also I know at least my 3060 ti runs out of VRAM, so likely anything with 8GB of VRAM will not perform better. So if I had a 3070 ti or a 4060 ti (not to mention the slower bus speed that also makes things worse) it won't be any faster, right? Then for the 3080s, would I fill VRAM using a 3080 12gb, or would there be a possibility of better performance with a 3080 ti?
Does this logic and process even make sense?
@grave stag that would be false. Earlier this week I was using an RX590 AMD, now I'm using an Intel Arc series
Did they update it or something? I thought it used to only run on Nvidia? Or at least nobody would say to buy a GPU other than Nvidia for this. If I consider an upgrade from my 3060 ti (though I might stick with Nvidia) would an equivalent AMD card perform as well as one from Nvidia? I think AMD is cheaper on the used market? That and Arc might have the cheapest price for high VRAM cards but there is no upgrade from my 3060 ti that can be Intel.
The RTX3060, if it has at least 8gb is actually pretty good for SD, the 4060 didn't provide that much of an upgrade over the 3060... I bought an Intel card with 16gb VRAM and I'm liking it
I have the ti card, I could buy an Arc but then it'd be a downgrade for anything that doesn't need more than 8GB of VRAM (I think there is no Arc card better than a 3060 ti right?) So then that's AMD or Nvidia, and does it run better on Nvidia vs AMD?
I can't go from a 3060 ti to Arc, that's not really an upgrade.
Especially since I also use the PC for gaming.
Nice sale on frames at Michaels right now.
Not sure if anyone prints/frames any generations. 🙂
@grave stag how much VRAM do you have now? And where do you feel it's lacking in SD?
I make a few images (not that I know how to write image prompts well enough to get results I want and like) and I spike to 100% VRAM usage. I tried using this to learn by recreating the exact image shown in the examples, and I can't even match that or get the anime character right. That said, it takes like double digit minutes for like 4 images if I turn up the complexity and resolution any amount. It's annoying.
@grave stag Automatic1111 or Comfy?
What's the major difference between A1111 and Comfy?
I've been hearing Comfy is faster?
Comfy doesn't eat up a bunch of VRAM
I was doing 1024x1024 SDXL on an RX590 with 8GB VRAM
What does it eat then? There's gotta be a caveat in it somewhere
Automatic1111 I think? I just want to rip though images and make a million in like an hour (not a million for real lol just a lot) and then pick some for OCs for my RPs.
or it's just more efficient of a UI
Might just be easier to run than Automatic1111
Its more complicated to use though?
it just uses nodes, pretty easy to figure out, and more versatile
Huh... Might give it a shot on my laptop. My laptop has a 1660ti in it, and it's slow
I want speed
my Intel Arc GPU now does 512x768 images at 20 steps in 4 seconds 😛
When I move from this 3060 back to my laptop, I'm gonna be having one hell of a time
How much Vram?
Also, I know this is more of a question for tech support, but I really didn't get any definitive answers: does animatediff take a long time to render videos on a 3060, or is there something wrong with my setup?
I feel like it shouldn't be taking an hour and a half
with 100-130s/it
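Quick sanity check on those numbers, as a sketch. It only uses figures quoted in the chat (an hour and a half, 100-130 s/it, and the 20 steps in 4 seconds mentioned earlier); the helper names are made up, and the 4-second figure probably includes overhead beyond the sampling steps.

```python
# Sketch: convert between s/it and it/s, and see how many iterations
# fit into a render time at a given speed.

def iterations_in(total_seconds: float, seconds_per_it: float) -> int:
    """Whole iterations that fit into total_seconds at the given s/it."""
    return int(total_seconds // seconds_per_it)


def its_per_second(steps: int, seconds: float) -> float:
    """Average iterations per second for a finished run."""
    return steps / seconds


render_time = 90 * 60  # an hour and a half, in seconds

print(iterations_in(render_time, 100))  # at the fast end of 100-130 s/it
print(iterations_in(render_time, 130))  # at the slow end
print(its_per_second(20, 4))            # the "20 steps in 4 seconds" case
```

So the hour and a half is consistent with only a few dozen iterations at that speed, which suggests the problem is the per-iteration time rather than the length of the job.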
I should mention google isn't helping me, I keep getting answers on how to animate irl things
@grave stag A770 has 16GB VRAM
but it's not really the VRAM making it that much faster, it's a lot of the other stuff, like having dedicated AI cores
Same though I am moving from a different GPU to a different laptop but moving from my desktop to laptop on battery be a hell of a time.
Random question -- in auto1111, can the x/y/z plot be used to render across different negative prompts?
Nvidia has dedicated stuff for A.I. as well. You telling me Arc will do better than a 3060 ti at rendering?
they'd both be pretty close until you start doing stuff that actually uses more vram, like larger batch sizes
Larger batches as in 4 at a time?
Idk I might get back into this with the GPU I have then again it doesn't make sense to have a 3060 ti desktop and a 4060 laptop might as well be the same.
All I want is to learn how to use this to make OCs for RP
yeah, try ComfyUI first, it might be exactly what you need
How many batches is using a lot of Vram?
Is Img2Img possible using these bots? Or just on the website
Last time I tried I spent a few hours with help and I still couldn't get it to spit out the anime character I wanted or spit out the ref image, let alone make OCs that I like and that's the whole point of using this.
when I was testing it out, I think Comfy was using an additional 800-1000 MB per image added to the batch
So 1gb per image so 6 images and I'm out of ram?
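Here is a back-of-the-envelope sketch of that estimate. The 800-1000 MB per extra image comes from the message above; the 3000 MB base usage for the model plus the first image is a made-up placeholder, so measure your own before trusting the result.

```python
# Sketch: estimate the largest batch that fits in VRAM.
# Assumptions: base_usage_mb covers the loaded model plus the first image
# (hypothetical value below), and each additional image costs per_image_mb.

def max_batch_size(total_vram_mb: int, base_usage_mb: int, per_image_mb: int) -> int:
    """Largest batch size that fits, or 0 if even one image does not fit."""
    free = total_vram_mb - base_usage_mb
    if free < 0:
        return 0
    return 1 + free // per_image_mb


# 8 GB card, hypothetical 3000 MB base usage
print(max_batch_size(8192, 3000, 1000))  # pessimistic 1000 MB per extra image
print(max_batch_size(8192, 3000, 800))   # optimistic 800 MB per extra image
```

Real usage also depends on resolution and the checkpoint, so treat this as a starting guess only.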
How long you think it take to go from prompt to 10 good resolution images based on prompt?
i wish I had a tool that let me see how much VRAM I was using making these 1920x1080 SDXL images
Can't you just check task manager for Vram usage?
nah
there's a tool to check intel GPU usage but it doesn't seem to understand values for the Arc GPU
Its been a while since I've used this and I don't even have this set up on my laptop, how long you think it take to learn how to make OCs so I can stop paying people for refs?
@charred thistle think you can teach me how to use this again?
comfyui should start out with a basic setup needed to generate stuff... you need to have checkpoints installed in the models/checkpoints folder to load up.. one box for positive prompts, one for negative, the tool that processes it all in the middle and an image output
Also if I am upgrading from a 3060 ti for this, and just because it doesn't make sense to have a 3060 ti and a 4060 laptop, what GPUs should I be looking at? Will AMD perform as well as Nvidia?
it doesn't seem to be like there's much to be gained going from a 3060 to a 4060
AMD should get good performance if AMD supports ROCm with that card, which for new ones they will
if you have comfyui up and running, the github page for comfyanonymous/ComfyUI_examples has a bunch of example workflows for doing a bunch of tasks, like inpainting, upscaling, how to use Lora
I tried that, I said I'll spend some time use the models and recreate a ref image or two and man a few hours of help and 2 days later I still couldn't get it to match or make the anime character well, she looked cool just not quite oh that's so and so from x anime no doubt.
Again I have a 3060 ti, a very different GPU than a 3060, and I'm not asking whether to upgrade to the 4060. I know the VRAM would be low and the bus slow. I'm saying a 4060 laptop and a 3060 ti desktop are in a lot of ways the same. My laptop is pretty close to my desktop, so I should upgrade the desktop GPU so that I have a reason to use my desktop when I get home.
you would need to have everything exact if you want a match.. meaning the checkpoint, any loras used, both prompts, image size, cfg, number of steps, the seed, etc
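One way to keep track of all that is to write the settings down and diff them. This is just a sketch: the field list follows the message above, and every value in the example (checkpoint name, prompts, seed) is hypothetical.

```python
# Sketch: record the settings that must match to reproduce an image,
# then list which ones differ between a reference and your attempt.
from dataclasses import dataclass, fields, replace


@dataclass(frozen=True)
class GenSettings:
    checkpoint: str
    loras: tuple
    prompt: str
    negative_prompt: str
    width: int
    height: int
    cfg: float
    steps: int
    seed: int


def mismatches(ref: GenSettings, mine: GenSettings) -> list:
    """Names of the settings that differ between the two runs."""
    return [f.name for f in fields(ref) if getattr(ref, f.name) != getattr(mine, f.name)]


# Hypothetical reference settings copied from an example image's info card
reference = GenSettings(
    checkpoint="example_model_v1", loras=(), prompt="1girl, red hair",
    negative_prompt="blurry", width=512, height=768, cfg=7.0, steps=20, seed=12345,
)
# Same settings, but with a different cfg and a random seed
attempt = replace(reference, cfg=9.0, seed=-1)

print(mismatches(reference, attempt))
```

Even with every field matching, different hardware or UI versions can still produce small differences, so an empty list is necessary but may not be sufficient.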
2+ hours and occasional help on discord and it still didn't match. Not even a "that's close but x detail is different", just a different image that looks like a crappy off-brand version of the anime character.
If you want to help me out and help me learn how to recreate the images and make my own OCs that's great, but unless I find a teacher who can really sit down with me, not multitask and sometimes take hours to respond, this will just be too complicated for me to learn and dive into. Even though it seems so promising to be able to make my own OCs for my RPs and not just pay people on discord a small fee to make the better or more important ones.
I'm still fairly new myself.. about to try merging concepts from two different images and see what happens
You seemed knowledgeable about this with how much you know for GPU specs and what to run for this and such?
if u gonna buy a 4060 for 500$ thats a scam
Again nobody be reading I'm not saying do I buy a 4060 or I bought a 4060 card. I was saying the GPU I have in my desktop and the laptop I bought and why I'm thinking I should be upgrading my desktop GPU.
if u have a 3060 ti i think thats enough for now if u only gen images
Okay, so before I start the conversion to comfyui, is there anything I should know about it other than the fact it looks confusing
like how much faster is it compared to A1111, is it slower than A1111, is it bad, is it good
I mean a better card can speed things up and I'd keep this card for a while like maybe years down the road and it doesn't make sense to have a laptop basically as powerful as my desktop, if I can get a 3080 12gb at a good price I might upgrade to it. Thing is 3080 12gb be basically the same price as a 3080 ti that be just under a 3090 that be I wish I could say just under a 3090 ti but no that be way more expensive.
its faster than auto1111 but its more complicated but once u learn how to use it its very good
Oh yeah, I'm watching a video on it and it definitely looks confusing, but it looks like it's going to be easier on my 1660ti.
Is it lighter or heavier than A1111?
Idk I'll look for 3080 12gb deals. They might as well be 3080 tis in price though, which is strange. That said, 3080 tis are at least a little less than 3090s even if it's pretty close, and if I'd buy a 3080 ti I'd buy a 3090.
that should've been my first question lol
if i had like 400$ i would probably buy a used quadro rtx 16gb but thats me, idm used cards
lighter
but some extensions that are in auto1111 cant be used in comfy
That's fine, I don't have many extensions to begin with.
I just want to be able to animate.
and it not take an hour and a half.
I'm looking at used as well, how well do quadro do at gaming? They don't have all the RTX and DLSS and video upscaling stuff or streaming encoders that regular RTX cards have do they?
the quadro rtx 5000 16gb its similar in perf to a 3070
and yes they have rtx and dlss
Similar as in slightly better or slightly worse I have a 3060 ti that's not much of a jump. It also has DLSS rtx, video upscaling and encoders for streaming?
yea its probably very similar to a 3060 ti in gaming and yes they have all the encoders, it has less cores because its turing and 30 series is ampere
Bro are you serious, on eBay quadros go for more than a 3080 12gb, and with similar performance to a 3070? laugh laugh no way.
I see no reason to upgrade to a quadro.
you would only buy a quadro if u want lots of vram at a reasonable price, if u have 600$ or more just buy a used 3090
That's a bid with over 2 and a half days left, it will of course go higher and end up more than a 3080 12gb.
If I had $600+ I'd go 3090, if I had ~$550 I'd go 3080 ti, I just want to find some sub $500 maybe even sub $400 3080 12gb.
Those sold months ago and the market is getting worse. Sold listings are for checking what to price yours at, not what you can find one to buy at. Sort by lowest price on eBay and find me a quadro under $500, just find me one (and don't link a bid that has 1+ days left, the price will go way up before the bid ends).
yea if i had 500 id go for a RTX A4000 16G
have u tried a used 2080ti 11gb? that one i have seen recently as low as 300$
I have $400-$450 if that I'd like to go under $400 if I can. I thought since the price a few months ago was this price now I'd be able to get a 3080 12gb for this but I can't I can't seem to get them even though other cards like the 3060 ti seem to have gone down in used price by $50-100.
That's last gen so I'm not sure it even has DLSS, and if so not a great one. I think worse RTX too, and not as much of an upgrade as a newer 30 series card would be. I don't see why I would go for that card?
It's very much a good value at that price but I doubt it's better than my card if my card goes for $250 on eBay.
If it is better it won't be that much better.
I just want to get some card with more Vram that is actually better so it doesn't seem like my laptop and desktop have the same GPU and I'd have reason to switch to my desktop for use.
take a look at this offer for example he had 6 rtx a4000 for 400$ each only because they had damaged casing but they all gone now https://www.ebay.com/itm/374997507145
You said a 5000 is as good as a 3070 and this is a 4000, so it'd be worse than my card other than some A.I. stuff due to the VRAM.
the a4000 has the same specs as the 3070 ti, only difference is that the 3070 ti has gddr6x vram and the a4000 has gddr6
and the extra 8gb vram of course
I'm happy with my 16GB A770 I got for under $300
just sitting here doing 1920x1080 sized SDXL renders
Wait is the 5000 worse than the 4000? You said the 5000 is as good as the 3070?
Also what does the x mean for vram anyways?
slightly faster in vram intensive tasks
the 5000 is similar in gaming performance to a 3060 ti but overall the 3060 ti its better because its newer generation so more cuda cores, u would only buy the 5000 if u get it cheap at around 300$ and if u want it for mostly ai
the 4000 is better because its from the same generation as 3000 series but its more expensive so u would have to camp ebay until u can get one for like 400$
Okay this may be a really dumb question, but I have to ask... can I run comfyui off a flashdrive?
a 3.0 flashdrive in a 3.0 slot
it seems to be running, like loading in the cmd window, but it seems to be slow.
I'm curious if this is because this is a first time boot.
Okay! It works great actually.
with the heavy censorship how do I create art like this? https://scryfall.com/card/cmr/134/murder
the name is "murder" and art is a dead person with a sword
do I have to use local processing then?
Hi uhh
Can i get the invite link for stable dreamer bot
So i can add in my server
Hello!
Nice, I think I might have actually helped get a guide updated for using Intel Arc GPUs with SD. It seems to be much simplified: the Intel optimizations run on Python 3.11, and applying them really only entails a copy and paste into a command line. It should work for any Linux distro
Hi guys, is bombs or dynamites a forbidden word for the bot right now? Trying to make a Joker, but seems like there's never a bomb.
"bot" runs locally and I'm not sure if it having any censorship, but the way models are trained is they need to be fed images in order to understand certain things, and that doesn't sound like something a lot of people train for
Sorry, i meant the sdxl beta bot accessible through discord.
aah.. well then what I said is even more true, since newer models are likely going to have less polish to them
Can you trade time for higher quality images?
If I was willing to wait an hour for an image to generate, could I have an absurdly high quality one?
Nope
Quality is not defined through steps or time
Why do my images look better with more steps?
Some samplers need more steps than others. So with like 5 steps everything looks bad, but for example between 30 and 60 there won't be a big difference anymore
100 steps wont give you better quality
Steps are = how often should something change in the image
Good morning, everyone! How are we all today?
It's so repeatable, I'm sure it makes a difference
Hey there, I am good, I just started trying out SD a couple of days ago, and I'm having some fun. I am producing photorealistic pics of asian girls now.
Good to hear! If you need any help with prompting, I suggest checking out #📝|prompting-help
I can't think of any business use for SD or anything else I can use it for other than producing pics of asian girls. It's been an interesting way to harness the power of my GPU now that it's not mining ethereum.
I'd be very interested to know if anyone has used SD for their business.
Yep I got some help there which made a massive difference to the quality of my outputs.
yall checkout my merge plis
https://civitai.com/models/170390/sakuraii
hey to all. anyone using automatic ui on apple silicon? how is the performance? i did some digging and i saw people recommending diffusion bee and am trying it now, but i also wonder if there is any software to run sdxl on apple silicon?
I am so intrigued by this community and technology. Considering trying to run and install on a VM with NVIDIA Quadro M4000 -- good idea, bad idea? If reasonable, is it best to follow the automatic guide to set up?
I feel like you will make sussy stuff
Amongus 80085
Hi guys -- do you all have a recipe for installing stable diffusion on Ubuntu 22.04? I see a bunch online, prefer to use yours if there is one so I get it right?
depends on which UI you want
something fairly standard I guess -- didnt know there were different options
works for me
conda should be straightforward to install on ubuntu.
conda sorta like pip?
No it's more like docker.
You can make it have certain environments with specific packages.
ohhh -- ok
my ubuntu box is normally locked away from the web, but accessible on the interior network -- so I hooked it to the web and am doing the normal upgrades right now
so conda, then comfy?
Yeah
does comfy install all the background code needed and the models and such?
comfy you basically just clone the repo, activate the conda environment, then do pip3 install -r requirements.txt in the comfyui folder
then you run main.py and connect to it through a browser.
You can make it a server with --listen and connect from another computer
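The steps above can be sketched as a short list of commands. This only prints them rather than running anything. The conda environment name and Python version are assumptions for illustration, and depending on your GPU you may need to install a matching PyTorch build before the requirements step.

```python
# Sketch: the ComfyUI setup steps described above, as shell commands to run by hand.
# Assumptions: the env name "comfy" and Python 3.10 are placeholders, not requirements.

def setup_commands(env_name: str = "comfy", listen: bool = False) -> list:
    """Return the commands in the order they would be run."""
    run = "python main.py"
    if listen:
        # --listen makes the server reachable from other machines on the network
        run += " --listen"
    return [
        f"conda create -n {env_name} python=3.10",
        f"conda activate {env_name}",
        "git clone https://github.com/comfyanonymous/ComfyUI",
        "cd ComfyUI",
        "pip3 install -r requirements.txt",
        run,
    ]


for cmd in setup_commands(listen=True):
    print(cmd)
```

Once main.py is running, the console shows the address to open in a browser.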
was that directed at my question?
do any of you know any good ai tools for translation?
Hmm, I have my Intel card set up to use ComfyUI with the tutorial shown on the ComfyUI Github, benchmarking it to compare to an earlier benchmark I took of a simpler process to see if all the hassle of installing and using "intel for python" and other optimizations are worth it
tried to install roop for webui and now it doesn't work even without the roop extension
😦
roop?
does it start if you remove roop?
