#💬|general-chat
anyone tried llama-cpp btw? seems to be much smaller but i still have no idea if its even working, it just hangs there and sometimes writes some random words
i mean if you drop the framerate and step counts then it could be doable
bruh this for 2 models?
don't you have to have like, same # of VRAM for these?
2 models + the environment/UI itself, yep
💀
Is Instruct-pix-2-pix improving at all?
some samplers are better than others at certain prompts or concepts
takes trial and error. x/y plots are super helpful.
hey is there a prompt that would increase the resolution for the pics ? I have 512x512 pics but even when i use upscalers they seem to not be high resolution
ei! someone can help me plz?
i need to do this animation in a banana https://www.youtube.com/watch?v=j70IbMuiHjo
i filmed the video of the banana, and i want to do it in stable diffusion. How can i do that?
hi, can someone help me plz?
i want use stable diffusion at FastAPI.
How can I use stable diffusion in FastAPI?
may i ask if there is a free recharge of the credits in a month or so?
hello i can't install stable diffusion can someone help me ?
Do xformers and medvram affect result quality? It seems to me like images are less detailed now and colors often become pale and washed-out
Any idea how I could prompt in DreamStudio so that it creates the style of Midjourney? The art-focused, commercially used kind of style?
No, that doesn't affect the quality. Your issue is that the model you use most likely needs a VAE file for color correction
You can try fantasy style, game art, drawing, oil painting
anyone got any good tutorials on how to use LoRA, and the same for textual inversions?
how to start use the tool for free
why do TIs work in negative prompt but not LoRas?
that sounds good , which model do you think is best for it ?
Is there a help channel?
chatgpt's knowledge has a cutoff date. Also, it doesn't get "facts" right often. Especially about social media type events
It's impressive but it's really just a rube goldberg machine that adlibs out text
thanks
don't think general chat is the place for such a sensitive charged discussion
please take the conversation elsewhere
You are pathetic for a ranging on mods, just stop the topic and move on
if you dont feel like it, leave
This isn't the best place for emotionally charged/sensitive topics. Instead of feeding in to one another, it's best to calmly defuse. I am glad to ban or timeout those who cross lines and are harmful to one another. Typing that I lose credibility as a human before I can even reply to you is very disrespectful. Please drop it.
asking to not further a dangerous/harmful conversation is not out of bounds
this is not okay to say to someone either
so long as this conversation is dropped things are fine. Please be nice to one another.
You can try Dreamshaper
woah when did the server turn into a southpark episode? the only debate about hitler's death is from hero worshippers of him
and that's all i'll say about that
yeah. seems really fucked. that was pretty egregious
right
i seent it
@uncut junco , you have been timed out for furthering this very dangerous and sensitive topic after being warned. The other involved in the conversation has been banned based off the messages sent here.
Please take this conversation elsewhere. This conversation does not belong in #💬|general-chat, and I just ask that you please respect this. This channel is for AI and Stable Diffusion, not heavily charged conversation that can become explosive or reactionary.
In the meantime please do some things like eating your favorite food, generate some images, go for a walk, take a breather-- Fighting in a chat is not the way nor will it ever be the way.
As some people cannot reply quickly sometimes, it is harmful to dehumanize someone on that principle and to blatantly disrespect them and call them pathetic as well. I was taking the time to read the messages from both parties before acting and halt before coming to the actions now taken.
Give others the benefit of the doubt and a little time please. I apologize for any harm or misunderstanding.
https://discord.gg/2SzA8v5H?event=1094987731425308742
you could ask yourself here perhaps
I feel like the health of the community is shifting for the worse. Toxic trolls are reaching critical mass. Mods are playing delicate hands for VERY trolled out topics, such as "hitler isn't such a bad guy". I see it in the subreddit too. The toxic vocal minority is starting to dig their heels in. I don't like where it's going. I think this is a very serious issue that speaks towards toxic seeds being cultivated.
Trolls are reaching a critical mass, I can attest to this too. But I think @karmic brook was right to first try to stop the escalation. Action has been taken now, but when a situation like that happens, taking a rash or hasty decision isn't a good idea either. And when there are walls of text to go through, it takes time to decide for sure, especially when both parties keep pushing further and further. The timeout was necessary to analyze and act fairly. But yes of course, for sure, this isn't language or rhetoric that can stay around.
I feel like you may be right on some parts, but I don't want to generalize either. Even if I may not be fast enough and not have the immediate judgement on simple situations all the time, I need to keep on trying. So does every one of us.
I like how AI answered my question to this riot.
When someone offends another person, it can elicit a variety of emotional responses from those who witness it, including anger, sadness, and empathy. When a third party takes it personally and defends the offended person, it may be a sign of empathy or a strong sense of justice. Psychologists may view this as a positive behavior because it shows that the person is willing to stand up for others who are being mistreated or disrespected.
However, it is important to consider the context and motivation behind the third party's actions. If their defense of the other person is excessively aggressive or stems from a personal agenda rather than a desire to help, it may not be an appropriate or effective response. Additionally, some individuals may be more sensitive to perceived offenses and may overreact in situations where others would not feel offended. Overall, psychologists would likely view this situation as complex and multifaceted and would consider the individual personalities and motivations of all parties involved.
Best VAE for anime style?
It's more harmful to let someone believe some things, get affected by propaganda or bs media.
Some people NEED to be told they are saying dumb things, whether they like it or not.
How else would they know that if everyone is sitting there and playing along.
Idiots should have someone saying to them they are idiots or they'll never know.
I agree - that's not right place to discuss it, but also I feel like you're doing more harm than discussion itself did, but...maybe I'm missing something, since it's cleaned now.
#propaganda_needs_to_be_stopped or whatever.
I feel when someone is saying outright trolling comments about siding with Hitler it isn't my responsibility to tell them they should change their mind or thinking, especially when the premise is already trollish or negative, nor can I sway an opinion. Instead of letting the conversation go further, I ended up banning the user so that this sort of conversation/narrative couldn't go further.
It's important to keep the space here a comfortable one for users, so that's the action I took after assessing the situation.
makes sense 
Very cool approach. I can see in hindsight that reacting in the moment might've been narrow focused
Fruits got the juice
I'm on opposite side of barricades, I'm living in Ru and I've seen whole country go nuts in a finger snap...
Propaganda is really bad...nomatter how stupid it is , it just works if people keep repeating it over years...
I'm sorry for my country being responsible for all this crap...
these are only positive comments but like fruit said, maybe move this to #🌶|off-topic
It's sometimes scary how suggestible people in general can be. Not so much individuals, but seeds can be sown so easily
i dont mean, in general chat either lol.. words. so confusing
is step/s and it/s the same thing?
I'm really digging into archives surrounding illuminati diffusion and nothing i can find indicates that it was ever under a restrictive license. Rail M is what I can see mentioned on all remaining sources. The author purged everything.
I think it's really important to call these licensing blunders out and say "Hey! That's not cool!". I mean, the guy does great work but he was in here earlier blaming community members for deleting it. passive aggressively suggesting that anyone merging illuminati is hurting the wide community rather than his personal business deals around secret licenses
I watched all that arguing happen in the 2.1 chat. Just so much drama
One model among many.
i woke up to find it and my immediate thought "who the hell does he think he is?"
well, it was certainly ONE model. It's a gooder
Yeah it was a good one, and it allowed all the merges to get better
now its all wrapped up in the same BS that fantasy AI was lighting fire to and leaving on our doorstep
These models come out so fast its tough for either one of them to act like their model is that important. In a month nobody will be using either one
Philosophically, i'd compare it to someone branching the linux kernel, releasing it under the same license, then getting pissed when people migrate the changes to another branch
"those changes were licensed!"
but i do like both of the models. I use them and get good results. Do you believe all the deepfloyd release hype? Is it really imminent you think? Seems like it is more serious than the last few months of SOONing
yup
I really want to know how DeepFloyd will be released, as in what form. How we will use it
i'll believe it more next week when it happens
on hugging face and then ported to auto by day 2
yeah, that is what i was thinking too. Probably have a colab right away, but then someone will make it happen for auto. I really like the idea of being able to pull that model in to auto and use it like the others
same for SDXL
uhoh but now we have to worry that model authors dont want their wide public release to be refined at all 😮
it might become important for any company funding the development of public models to declare their position and philosophy around community refinement very clearly and openly. This illuminati bs where the author just suddenly declared it was never meant to be merged, after it was released with a permissive license, is a giant debacle
If they want to hold the IP of something for 100 years I don't see why others can't use it
Would it be possible to sue them?
derivative works will always be derivative works. no matter how they're made or what process is used to get there
okay I get it. yeah usually that will be in the user agreement
if they wanted to go after it, they would already have imo. Like, https://huggingface.co/nitrosocke/Nitro-Diffusion got really popular at one point, and was clearly targeted in part towards their art style.
it still seems to be
and yes, I do hear you and the regulation that is coming around it
no they won't
but I do think we are close to a paradigm change there
they will try to gather as much data as possible in the process and then try to monopolize
I think with this AI age, those guys with tons of graphic cards are the real players
I do think S.AI is working quite a lot on regulation, making sure this doesn't just get purely gutted by politicians, by making sure the necessary brakes are on, tuned differently for different locations.
At least it's what it seems from all the news they gave us about future version, and my personal hope
i'm seeing this stuff blow up faster than the dotcom bubble did in the 90s. i like to think i was young then but had a decent perspective over the rollout of tech. By "this stuff" i mean ML. If we consider the paradigm shifts that came out of that dotcom period, we'll probably see more from this
I don't hope so to be frank, but honestly? I'd be surprised if we don't get run over by this and need to change our systems quite a lot
oh. itll come in like a wrecking ball no doubt. it won't be a graceful period of change
if you've got an "oh shit handle" near by, hold onto it tight
you know those little handle grabs in some cars? thats what i call them
It's going to be more cyberpunk like
But that is not the matrix thing 20 years ago?
ever seen the movie "MindStorm" ?
Stranger Days another good one
and don't get me started on Total Recall
ya know I wish there was something we could do about people going into Voice chats and sitting there with mic muted.. that kind of defeats the purpose.. ya know
theres a bunch of other channels if you want to use one for yourself
no thats not my peeve, my peeve is they just go into the vc, mute themselves and just sit there. never replying to anyone that might want to actually vc, ya know what the channel is for. why do they do that. it's grrr. so... I don't know. just P*'s me off
but anyhow...
it's a public space so its equivalent to someone sitting down next to you at a really large table at the library. no big deal if they dont want to chat imo. youre free in either case to swap voice channels if their presence makes you uncomfortable
no it's more like someone sitting down next to you at a conference so people can network and then they just sit there and put tape on their mouth while you try to actually talk to them
so yeah I disagree
VC is supposed to be about interaction and ya know Talking on voice. thats the point i'm trying to make
yea i dont disagree. what about the people who want to simply listen in and be included? some dont have mics. i actually know someone who is physically mute irl who does the same thing
then to be transparent they should I don't know include that in their name somehow so others know , would be helpful, and it's so easy to do.
and going to VC with no mic is sorta like entering a race with no car
but I thought I would just mention it, don't wanna debate or argue the point. i've made my point so i'm gonna back out of the conversation. have a great day.
what do they hurt by just listening?
i default to mic muted. it mutes immediately when i'm done usually. otherwise i could be the dummy with a wide open mic when i'm afk. I'd think this is a common habit because nobody wants to be that dummy
unmute when i actually have something to say
there is a push to talk
Dead chat
vox is superior to use, plus it's something like 100 clicks to get to that setting vs muting the mic when unused
plus, i don't even see the harm
ever heard of something called manufactured drama?
yeah it's a term made up to try to get out of an argument and discredit those that are trying to make a point, and make them look like the ones who are the cause of the issue when they are just pointing out something. so yeah I may have heard of it.
still doesn't make sitting in a vc and not talking or responding to anyone right or even the correct course of action.
nice try tho

but anyhow, nothing I say is going to change anything, i'm just getting this off my chest so it doesn't bug me as much. ya know... venting.
but again i've made my point said what I had to say so. yeah. just gonna back out.
and I know my grammer sucks
😄
hey there, was quite busy and havnt checked SD since 2.1 release, is there some models with good reputation released since then?
can i get some technical help please? im receiving the following error and cant find the settings referenced in the error.
"NansException: A tensor with all NaNs was produced in Unet. This could be either because there's not enough precision to represent the picture, or because your video card does not support half type. Try setting the "Upcast cross attention layer to float32" option in Settings > Stable Diffusion or using the --no-half commandline argument to fix this. Use --disable-nan-check commandline argument to disable this check."
Wow, so much to know with this stuff. any particular direction one would advise I start learning if I am most interested in character creation type imagery?
LoRA, auto1111, controlnet
hello all, does anyone know what prompts to use so that the result generated image is going to include a specific text, like if i am making a logo and i wanted to have the name with the logo in the result?
You're way better off downloading something like gimp or some other traditional open source image editor to add text.
Making text and then generating new image based on that with controlnet would be another option
What's the general speed like these days since the first release?
I needed a 1070 TI to generate 20 images in a minute I think
I'm looking to generate 1 image per 1-2 minutes on a shitty laptop
Is this out of the question still?
with that card, not gonna happen
With a 1070 TI this was easy for it
erm, for my desktop
I was just wondering if there were any performance increases in the past few months 🙂
I basically want to just replace "xstarfish", a random background generator, with stable diffusion lol
a bad laptop generally doesnt have a decent gpu, so it will use the cpu. thats really slow. no improvements there
Good evening! Does anyone know how to fix this error?
RuntimeError: CUDA error: CUBLAS_STATUS_ALLOC_FAILED when calling cublasCreate(handle)
Time taken: 0.02sTorch active/reserved: 2505/2590 MiB, Sys VRAM: 4096/4096 MiB (100.0%)
so it will use the cpu
This kind of makes no sense. Is it because the lack of CUDA or similar support?
Which software is currently the best for video stylization with SD? (e.g. to make it look like japanese animation)
Does anyone know how to put a woman leaning on a car in stable diffusion. Having such a hard time. Especially since it keeps messing up the hands lmao
Damn I got it finally. Took 100 years 😛
over a thousand images made. Finally got what I wanted 😛
cool, thank you very much!
🙏 🙏
Hi all,I have a question about Image Upscaling.
I wrote a program in python, and use pyinstaller in order to package my program as .exe file
After I run the .exe program, I got the output error:
Traceback (most recent call last):
File "zoom_Img.py", line 7, in <module>
File "PyInstaller\loader\pyimod02_importers.py", line 352, in exec_module
File "stability_sdk\client.py", line 28, in <module>
File "PyInstaller\loader\pyimod02_importers.py", line 352, in exec_module
File "stability_sdk\interfaces\gooseai\generation\generation_pb2.py", line 16, in <module>
ModuleNotFoundError: No module named 'tensors_pb2'
[14548] Failed to execute script 'zoom_Img' due to unhandled exception!
How can I solve it?
Remark:
python version 3.10
pyinstaller version 5.9
run the following pyinstaller command:
pyinstaller --onefile --hidden-import=tensors_pb2 --hidden-import=stability_sdk zoom_Img.py
hi, can anyone show me how to use multiple control-nets so the hand most closely matches the sample? I tried at 512x512 using openpose and depth, but the hand result is always different from the sample.
1>original image https://upanh.tv/image/iXgTEj
2> image depth + openpose and results https://upanh.tv/image/iXRnbZ
https://upanh.tv/image/iXRBTA
https://upanh.tv/image/iXRzOD
3> my control-net settings: https://upanh.tv/image/iXRkga
Can someone explain how the base model matters when training a lora ?
why should I pick one or another ?
I don't find anything about this
Hey
Hey there 🙂
well that base model question is a good one 🙂
to answer you I need to specify what LoRA is, how it acts when you use it
LoRA is close to what a model is : it's part of a model. it's the weights that would have been modified in the model you trained it on if you had trained it using Dreambooth, almost
LoRAs on their own aren't usable : to use them, you need to apply them on to a model
they then make a "new model", adding their differences in weight inside your chosen model
so your choice of base model matters a lot : all that is not trained in your LoRA file will come directly from the base model you choose
best thing to do usually is
- either to use the same base model the LoRA was trained on
- or use a model that is close to the types of render you want (anime, photoreal, ...)
a LoRA is never permanent on a checkpoint : it loads in memory while you use it, but the model file doesn't change at all
so no, a LoRA can't break your checkpoint for good
but a badly trained lora can make worse pictures than not using it, sure
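A minimal sketch of what "applying a LoRA on to a model" means numerically, assuming the usual low-rank form where a LoRA stores two small matrices per layer whose product is the weight difference. The names `apply_lora`, `up`, `down` and `scale` are made up for this illustration, and real files also carry a per-layer alpha/rank scaling that is left out here. It shows the two points above: everything the LoRA doesn't touch comes straight from the base, and the base weights themselves are never modified.

```python
import copy


def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [
        [sum(a[i][k] * b[k][j] for k in range(len(b))) for j in range(len(b[0]))]
        for i in range(len(a))
    ]


def apply_lora(base_weight, lora_up, lora_down, scale=1.0):
    """Return a NEW matrix: base + scale * (up @ down). The base is not changed."""
    delta = matmul(lora_up, lora_down)
    return [
        [w + scale * d for w, d in zip(w_row, d_row)]
        for w_row, d_row in zip(base_weight, delta)
    ]


# Toy 2x2 layer standing in for one layer of a base checkpoint (made-up values).
base = [[1.0, 0.0], [0.0, 1.0]]
# Rank-1 LoRA: "up" is 2x1 and "down" is 1x2, so up @ down is a 2x2 difference.
up = [[1.0], [2.0]]
down = [[0.5, 0.0]]

base_before = copy.deepcopy(base)
patched = apply_lora(base, up, down, scale=1.0)   # the in-memory "new model"
disabled = apply_lora(base, up, down, scale=0.0)  # LoRA at weight 0 = plain base
```

The second column of `patched` is identical to the base because the toy LoRA has zeros there, which is the same reason a LoRA behaves differently depending on which base model it is loaded on.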
Does anyone know why my program is not updated automatically? It seems that everything is written everywhere.
@echo off
set PYTHON=
set GIT=
set VENV_DIR=
set COMMANDLINE_ARGS=--theme=dark --xformers
git-pull
callwebui.bat
but on startup it says:
fatal: not a git repository (or any of the parent directories): .git
you'll need to go into #🤝|tech-support for better answers on this but I see 4 problems in your case already :
1/ you didn't install using "git clone". instead you installed by downloading a zip folder. This will prevent you to update using "git pull", you'll need to either reinstall correctly, or continue downloading the zip folder to replace your old code.
2/ it's "git pull" and not "git-pull"
3/ it's "call webui.bat" and not "callwebui.bat"
4/ it's "set COMMANDLINE_ARGS=--theme dark --xformers" and not "set COMMANDLINE_ARGS=--theme=dark --xformers"
Thank you.
Point 4 was corrected, points 2 and 3 were the same for me. For some reason, this is a copy so wrote. I'll look into it first, thanks.
Can you suggest how to do this? "This will prevent you to update using "git pull", you'll need to either reinstall correctly, or continue downloading the zip folder to replace your old code." I don't understand at all.
launch.py: error: unrecognized arguments: --xformers and not set COMMANDLINE_ARGS=--theme=dark
Press any key to continue . . .
#🤝|tech-support really for those in the future please.
To do this, you need to reinstall. To go into a new directory, open a console and do git clone https://github.com/AUTOMATIC1111/stable-diffusion-webui
This will download the code again, in a way that will be possible to update easily in the future for you.
Then just move your models from your old install to the new install, and run the new webui-user.bat. wait for it to install, and you are good
did you really copy the full line, including when I wrote "and not" ??? x)
Your command line should be : set COMMANDLINE_ARGS=--theme dark --xformers
🙂
My mother told me in childhood that I should learn English, not listen to her. Now everything is so difficult to understand, but the main thing I understood is that I need to reinstall the program and reinstall all my 113 models.
Hello, can we use sd 2.1 base to generate icons? I tried to fine tune it with AWS services icons, but quality is poor. I used modified DreamBooth script to allow multiple subjects in one training
Where is technical support located, where should I write?
you're doing great. keep going, you'll do it !
🙂
I have a harder time training on 2.1 than on 1.5 personally but the project you describe seems possible yes.
What kind of poor quality ? You may want to "debug" your training now, understand what messed it up, and try again after updating your dataset and/or your captions. #🔧|finetune can be a good place to find tips on this too.
Thanks for the advice. Let me take a look at that channel. It is poor in that with few epochs it doesn't generate the details of the icons properly, and with 150+ epochs it gets darker and forgets every other subject the base model knows.
For context, I have about 30 icons in the dataset, each with 15 of 770x770 px images with the icon being translated around (to avoid duplication). No prior preservation. Using modified DreamBooth script
so, 30x15 = 450 total pictures ?
each of the 30 concepts has the same problem ?
you are describing 2 different epochs, low and very high. Did you look at your Loss curve on the 150+ training, to see what was the sweet spot you should stop on ? did you do more checkpoints in between to test ?
I wrote a guide on training, this part may be useful right now https://github.com/Guizmus/sd-training-intro#monitoring-the-loss-value
Yes around that. Yes all having similar problems. The loss curve is going down, but at epochs 110+ there were spikes going up. Average loss at end of epochs was about 0.0098. I guess it overfits. I didn't do checkpointing though. Good advice to start doing one. Let me take a look at your link
take your curve, and try to find the lowest point in loss before the spikes on 110
this is where it has the most chance to be good
in your next test, do multiple checkpoints, every 10 epochs or so, starting on epoch 80, and compare those
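A rough sketch of that advice in code, assuming you have the average loss per epoch in a list. The function names, the `spike_ratio` threshold and the loss values are all invented for illustration, and what counts as a "spike" is a judgment call you would normally make by eye on the curve.

```python
def best_epoch_before_spikes(losses, spike_ratio=1.5):
    """Return (epoch, loss) for the lowest loss seen before the first spike.

    Assumption: a "spike" is an epoch whose loss is more than spike_ratio
    times the previous epoch's loss. Epochs are numbered from 1.
    """
    end = len(losses)
    for i in range(1, len(losses)):
        if losses[i] > spike_ratio * losses[i - 1]:
            end = i  # only look at epochs before this one
            break
    best_index = min(range(end), key=lambda i: losses[i])
    return best_index + 1, losses[best_index]


def checkpoint_epochs(start, stop, every):
    """Epochs at which to save a checkpoint, e.g. every 10 starting at 80."""
    return list(range(start, stop + 1, every))


# Made-up per-epoch losses: steady decrease, then a spike at epoch 6.
toy_losses = [0.030, 0.022, 0.015, 0.012, 0.011, 0.025, 0.010, 0.040]
best = best_epoch_before_spikes(toy_losses)

# "every 10 epochs or so, starting on epoch 80" for a 150-epoch run.
schedule = checkpoint_epochs(80, 150, 10)
```

The lower loss at epoch 7 in the toy data is ignored on purpose, since it comes after the first spike. The saved checkpoints still need to be compared by generating images, the loss value only narrows down where to look.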
Thank you! Your github is very useful. I was looking for definition of GAS and batch size, and didn't find good explanation elsewhere.
What would be a good prompt for an icon of AWS Lambda when fine tuning it?
"AWS Lambda" I don't understand what that means
This icon overhere is called AWS Lambda https://upload.wikimedia.org/wikipedia/commons/thumb/5/5c/Amazon_Lambda_architecture_logo.svg/1200px-Amazon_Lambda_architecture_logo.svg.png
Should the caption be "a lambda icon" or "a lambda function" or something else? During fine tuning. Oh and what would be a good class prompt in case we can use prior?
well, the full captioning strategy of such a dataset isn't very easy...
1/ I think 15 lambdas is a LOT TOO MANY. I would use 3, max.
2/ I would do the same for each type of icon if they are this simple
3/ for each icon, I would caption it "icon $type $variation"
where $type would be "lambda" here
and $variation would be another token describing how this icon varies. here for example, "hollow" or something. no need to specify the orange for example, since this is not a variating part, and will be trained inside the "lambda" token because of it
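The "icon $type $variation" pattern could be generated with something like the sketch below. `icon_caption` is a hypothetical helper, and every entry other than "lambda" / "hollow" is a made-up example, not taken from the actual dataset.

```python
def icon_caption(icon_type, variation=None):
    """Build a caption of the form "icon $type $variation".

    The variation token is optional; fixed traits such as the orange color
    are deliberately left out so they get learned as part of the type token.
    """
    parts = ["icon", icon_type]
    if variation:
        parts.append(variation)
    return " ".join(parts)


# (type, variation) pairs; only ("lambda", "hollow") comes from the discussion.
dataset = [("lambda", "hollow"), ("lambda", "filled"), ("bucket", None)]
captions = [icon_caption(icon_type, variation) for icon_type, variation in dataset]
```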
class prompt on this... "image" ? I wouldn't use prior preservation on this at all tbh
I see, but if there is only 1 type of Lambda icon, all the 3 training samples will be all the same, just translated around? So far in my 770px, the icon is about 50px in size, so I just move it around for variation. Should my captions be "icon lambda right", "icon lambda left" ?
Below is my result with 12 epochs of ~450 images with 30 concepts, lr 2e-06, GAS 2, Batch size 5
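For reference, the arithmetic behind those settings, as a small sketch. It assumes GAS means gradient accumulation steps and that the trainer counts a final partial batch as a full step (some scripts drop it instead); `optimizer_steps` is a name invented here.

```python
import math


def optimizer_steps(num_images, batch_size, grad_accum_steps, epochs):
    """Return (effective batch size, weight updates per epoch, total updates)."""
    effective_batch = batch_size * grad_accum_steps
    # Assumption: a final partial batch still counts as one step.
    steps_per_epoch = math.ceil(num_images / effective_batch)
    return effective_batch, steps_per_epoch, steps_per_epoch * epochs


num_images = 30 * 15  # 30 icons x 15 translated copies each = 450
effective_batch, per_epoch, total = optimizer_steps(
    num_images, batch_size=5, grad_accum_steps=2, epochs=12
)
```

So 12 epochs at batch size 5 and GAS 2 is 540 weight updates, each averaged over 10 images.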
oops, can't attach image?
#🔧|finetune => we'll have images
we are in the only channel without images currently
SD private testing
hello
Hey there !
@vast ingot Thanks, they helped me in tech support. Only I lost the shortcut that launches the program and forgot what it's called, can you tell me where it is? The one with white letters on a black background 🙂
I remembered, this is a file in a folder.
it was me there too x)
The base program that starts the UI is called "webui-user.bat"
Thank you.
sorry i didn't answer I was on something else and didn't see it before you just pinged me
Thanks for your work. without you it would be very difficult for me.
Looks like deepfloyd IF is bigger than SDXL, which makes me question what the point of SDXL is?
i guess SDXL is being trained for aesthetics?
Hi guys. I have 2 questions:
- can I use SDXL with the API?
- I can't see the negative prompts example for REST API usage, just like you send text_prompts, how would I send a negative_text_prompt?
Hi guys, is GTX 1650 sufficient for stable diffusion usage?
self promo like that, without even a presentation and in the wrong channel, isn't welcome :/ try to explain what you do before promoting it like this, and head over #🌶|off-topic since this has nothing to do with SD
is there a reliable way to make sd not show any hands? v1.5, v2.1 and the new beta one just don't manage to make them look good. i tried negative prompts "hand, hands" as well as positive prompts "no hands, hands hidden" but they don't work
you need to trick the ai with:
hands behind back, or hands behind head, 😉
you tried amputee ?
i cannot get normal finger count myself
hi, i want to use stability ai to create a text to image app. i want to know how much credit the REST API costs per request
I have installed it on a computer with a 1650. Its not worth it. Painfully slow. You are much better off using a google colab.
the thing is I need to load every single thing again and again
in Google collab
How long does it take to generate a 512*512 image with 20 steps?
I mean for him
Guys can you upload your generated images to online stock websites?
Do you actually own the images?
I can't recall because I use SD on a different computer with a good graphics card, but I would estimate about 40 seconds for a single picture on the 1650. Lowering the resolution to something like 512x384 helps a lot, especially if you are running 4 pics. Must stick to 1.5 models, 2.1 768 models are too much for it
Copyright is a complex set of law and is often dealt with on a case by case basis. I'm in the camp that AI generated works may sometimes be a new creative work, and other times be derivative of copyrighted material. It really depends on the intentions and process of the author to whether it would qualify or not
An iconic photo we all know and love, maybe, is the XP background. Those rolling green hills. If you generated a ton of stock images to replicate the aesthetic of that original background and named them "XP Rolling Hills" then that i don't think would qualify for copyright protection and would be derivative work.
Another case, which i think has a lot more nuance and which courts will probably have to chew on through a couple of different circuits until something is figured out: what if you make a prompt that has "stock photograph" in it, fire off 1000 of those, curate the best 30% and publish? I'm not sure enough intention of work is there. What sort of creative act was done to make those images and how are they transformative and not derivative? Questions!
Courts will figure ones like that out.
i think though that documenting our creative processes will be something photographers and artists using AI tools will have to learn to do. It may be our key defense to why we own the work we make. It's all about that creative process and proving that you were indeed the one who had the intention and will to create the work.
I just wonder if I can upload something I generated and sell for money...
what are other good sites to see pictures with prompts for inspiration what works good like civit.ai?
I wonder why every time i want to do 1 character it always draws 3 or more, even using negative prompts
That's the thing. This area of IP law is complex. You need to understand a little bit about what it takes to create a new copyright
If you're cloning another photographers style and selling that image as "style of photographer, by me" it could be infringing because of that intended expression, literally admitting that it's derivative of that photographer's work. While a lot of people say style can't be copyrighted, and sure it can't, it's not that simple. Derivative works exist too.
https://en.wikipedia.org/wiki/Campbell_v._Acuff-Rose_Music,_Inc. << here's a sick case law study about derivative vs fair use argument. precedent is really important in this part of law. the case found that a parody is derivative but can also qualify for fair use if the artist provides enough transformative intent from the original song. Also 2 live crew are sick.
discord screws up the wiki link and you need that . on the end
or not wikipedia fixed the bug
I'm teaching a certain style to sd, but for some reason it sort of learned it but can't draw bodies or poses
what did i do wrong?
no idea how to train styles here sorry.
no prob
I do have a guide on training, not only styles
but I don't have time to answer questions right now
so I can link the guide and you can ask complementary questions here, i'll answer later if you want
perhaps you need a better regularization data set? i'm noticing when i'm training people, the regularization images should have many other similar people. similar only to a degree though.
I ask only because i'm trying to figure out styles myself. will come back to this later as i have to go to the world famous city of nanaimo, where they keep the nuclear wessles.
Guys, I need gpu guidance but is this the right section for that? I mean, as it pertains to stable diffusion.
Thanks, it's funny that i'm getting lots of heads drawn but not bodies
Hi, I want to start using SD locally but my internet is kinda bad so I wanted to know the download size, in some places it says 20 GB and in others 12 GB required.
More like 12 but if you dont have a model downloaded already it will download +5gb
Guys, will the 4070 12GB be good enough for model training and such?
OK, thx🙏
The problems start when i change the size of the canvas
Is there a good AI tool to download images of a subject? Like if I said download all sprites of RYU from street fighter that have at least 128x128 resolution, Is there something that could search the web and automatically download it for me?
Hey, it's my first time and I couldn't find the info in the Start Here or FAQ about what channel I can just play around in and start creating images.
I dont think you can do that anymore.
They removed the channel?
That I know of
yes
where do you create images then?
On your computer, or in the cloud.
I want to make a style LoRA based on my partner's comic. I've put a fair bit of effort into extracting high res crops from each panel. I think I can give it hundreds of examples of backgrounds, and I envision a future in which I could generate backgrounds in the same look.
But before I invest all the time in cleaning up training data, I'd like to know, what is the plausible way in which the style LoRA could be used to create new backgrounds for future comics in the correct perspective and angle? Could I give it a 3D image from sketchup? Could I give it a fairly sloppy sketch? Do you also type a verbal description of the background you want? (e.g. draw a loose sketch of a hallway, and then specify "Hospital hallway, green walls" as well?)
Is this a feasible thing to try to do, basically?
Hey everyone, I'm trying to figure something out for a new pc build and it's hurting my brain
is anyone able to provide some advice on using dual 3090s? I understand stable diffusion can only use one but I want the other for LLMs
or anything else
is anyone able to advise if it's a bad idea or not or things I should take into consideration? I'm trying to get my brain over the motherboard PCIE lanes I need right now
Thanks!
what's the use case?
img2img maybe controlnet https://www.youtube.com/watch?v=qju59fAbvPY
goofing around is the only definition I can give RN
I suppose it's making sure i have the platform set up correctly (hardware wise) to enable me to just freely explore
im not an expert, but 1 3090 is definitely enough for exploring with automatic1111
and LLMs running alongside it?
or do the LLMs only eat into GPU performance when spitting out responses
and not while idling
any issues with using the E-GPU route?
sorry to bombard you 😄 I appreciate you entertaining my idiotic questions
nah dude your questions are beyond my abilities 🤦♂️
no problem, thanks for replying anyway!
Guys, i'm trying to install stable diffusion but i'm getting an error, can someone help me out?
Interesting - what he's doing seems like what I'd want, feeding it a 3D image with img2img. To be clear, is he using a style LoRA there, to get that anime look to the image he generates? Am I using the right terminology?
He also warns at the end that he couldn't generate the exact same scene from another perspective, which is a bit of an issue for comic making. Is that just inherent to this, or is there another method (maybe even just feeding it the previous render, so as to say "make sure the details of the scene match this")
i am also struggling with the consistency issue, what i know currently is that editing the must-have details + prompts then img2img may work
also, it may depend on the setting/theme of your comic
the YT vid did not have a lora in the prompt, it's using an animated model
no, i would do photo, 50mm, cat (the fov thing idk)
the software cuts up natural sentences to find the embeddings, sentences do help if you specify 2 cats with different attributes tho
the best i can do is to give you this 😂 https://stable-diffusion-art.com/how-stable-diffusion-work/#Latent_diffusion_model go to the part about prompts
Good evening, everyone
Folks, does anyone know where I can find the option to check which model is being used?
1c0ed90d69
Hi i need some help
what's wrong?
whatever i prompt i cant get any good results in terms of quality
what's the quality issue? style? hands? face? background? camera effect?
the texture sucks
can you send a sample?
are you using automatic1111? and its base model?
youll wanna do it in #🏞|general-with-images
How important are "negative prompts"? I seldom use them and only do so if I see that my prompt generates unwanted results often or all the time, but I run into more and more folks who now claim negative prompts are almost essential for making good images. So what do negative prompts actually do, and what would be a good way to test the strength and usefulness of those prompts?
they are pretty important
try monochrome, greyscale, bad quality, lowres, blurry, text, or removing stuff from existing gens
I tested "bad quality" and found that the AI doesn't know what that means, since "quality" is unspecific, and if I specify "color photo" in my prompt I seldom need to use "drawing, grayscale, 3d" and so on in a negative prompt.
the prompts are "trained in" using tags, so the AI doesnt have to know at the start, the user who labelled the tags to train the model trained it in
if you are happy with your gens thats fine, but in my experience (realistic anime AI images), i cant live without them
I know AI is trained on tags, but what do the tags "quality" and "bad quality" stand for? My question is about a method to test negative prompts to see how they work.
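One way to test this, as a rough sketch: keep the prompt and seed fixed, then render a baseline with no negative prompt plus one image per single negative term, and compare them side by side. The helper below only builds that list of jobs. The dictionary keys and the example prompt are illustrative assumptions, not tied to any particular UI, and you would feed each job to whatever tool you generate with.

```python
from itertools import product


def build_ablation_jobs(prompt, negative_terms, seeds):
    """Return one job per (seed, variant).

    Variants are a baseline with an empty negative prompt, followed by
    one job per single negative term, so each term is tested in isolation.
    """
    variants = [""] + list(negative_terms)
    jobs = []
    for seed, negative in product(seeds, variants):
        jobs.append({
            "prompt": prompt,
            "negative_prompt": negative,  # "" marks the baseline image
            "seed": seed,                 # same seed across all variants
        })
    return jobs


jobs = build_ablation_jobs(
    "color photo of a dog on a beach",
    ["worst quality", "blurry", "monochrome"],
    seeds=[1, 2, 3],
)
for job in jobs[:4]:
    print(job)
```

Using several seeds matters because a term that seems to help on one seed may do nothing on the next.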
from my limited experience, it may be based on the # of upvotes on image boards
Can you give me an example of a prompt that always gives "bad quality" or "worst quality" in an image so I can confirm that the negative tag "worst quality" gives me better quality?
Something similar happens to me, it gives what i want at 512 but when I go to 1024 it just goes crazy
can anyone recommend a model to train with human poses?
not a model but controlnet
it depends on the models and the tags that the creators used in tagging them. i just read someone's experience in creating their own LORAs for example and they did add some pictures in there that they considered "low quality" and tagged them as such. not sure how effective it is but that's one such example.
Anyone who knows if you can use diffusers in auto1111 and where I can find the information to do it?
hi anyone there
will try "brutally mutilate my creation pls"
you might be favoring roko's basilisk if you start training ai to mutilate creations. careful here. 🤡
for me embedding worked better
omg maybe i made a mistake hahaha, I trained my style in embedding, it draws it well but for some reason can't draw bodies
Hi guys, I joined because I had a problem with some files on my pc

Umm is it okay if I say it here?

No go to a tech support discord server
Bruh, it's part of stable
I'm testing negative prompts but my English is lacking, so: is "unsharp" a real word or just a photoshop routine?
every time I try to run it I get an error for a "dll file"
Try using dull instead
@tawdry turtle Great idea, I look for words to test.
Which one? Just download that specific dll
I download it and another appears and another and another

I just need someone who has the stable and can give me the dll files
copy paste and fixed
window
Bruh window sucks, get Linux already

Nu
Okay makes sense. Yeah go to their GitHub and get the dlls from there
all? I want it to work, no explode 
I doubt it does anything tbh, I don't use it, trying to avoid words that don't mean something exact
Maybe it does for anime, idk
There seems to be very little discussion here
do you think there is going to be a way to have models with predictable output to certain prompts, besides finding out through trial and error?
How to generate images
hi anyone there
hello, can you guys help me with it: Couldn't find Lora with name cuteGirlMix4_v10
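That message usually means the name inside a `<lora:name:weight>` tag doesn't match any LoRA file the UI can see. A minimal sketch for checking this yourself, assuming the LoRA files live in one folder (in AUTOMATIC1111 that is `models/Lora` by default, as far as I know) and are matched by file name without the extension. The demo uses a throwaway temp folder and an empty placeholder file, both made up for illustration.

```python
import re
import tempfile
from pathlib import Path

# Matches <lora:name> and <lora:name:weight>; captures only the name.
LORA_TAG = re.compile(r"<lora:([^:>]+)(?::[^>]*)?>")
LORA_EXTENSIONS = {".safetensors", ".ckpt", ".pt"}


def missing_loras(prompt, lora_dir):
    """Return LoRA names used in the prompt that have no file in lora_dir."""
    requested = LORA_TAG.findall(prompt)
    root = Path(lora_dir)
    available = set()
    if root.is_dir():
        available = {
            path.stem
            for path in root.rglob("*")
            if path.suffix.lower() in LORA_EXTENSIONS
        }
    return [name for name in requested if name not in available]


# Demo with a temporary folder standing in for the real LoRA directory.
with tempfile.TemporaryDirectory() as tmp:
    (Path(tmp) / "cuteGirlMix4_v10.safetensors").touch()
    found = missing_loras("1girl <lora:cuteGirlMix4_v10:0.8>", tmp)
    missing = missing_loras("1girl <lora:someOtherLora:1>", tmp)

print(found)    # nothing missing when the file is present
print(missing)  # the name that has no matching file
```

If the name shows up as missing, the file is either not downloaded, in the wrong folder, or saved under a different name than the one in the prompt.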
gn
how the heck does anyone understand all this jargon XD
how do i increase the allocated memory
for stable diffusion
i downloaded it on my computer
and i put the url to run it online
Good answer. I... don't know if I'm up for that level of math knowledge. Documentation can be confusing in this subject. But I shall persevere. Everything moves so fast.
and i downloaded it by following royal skies download tutorial
I'm with you up to 'grid of pixels,' but nothing beyond that, lol. But i will look into these concepts.
hey, I'm not sure I understand how to properly caption images for LoRA training
I've seen a yt tutorial where the guy says that describing the picture helps the ai not focus its training on what's described
it's confusing, i thought that i had to actually caption what i'd like to be trained 🤔
not what i want to avoid being trained
when u guys switch models using the x/y/z plot script, does it keep displaying when switching models? Mine shows the progress images for the first model then stops until it's done, not sure if its just me or not
Oi guiz, not sure if this counts as spam or not. The account only has 4 messages in the server, and they're all this link.
once wouldn't, but they posted it 5 times and never spoke
I cleaned it
👍
Good morning, everyone!
How are we all this beautiful day?
my images look very uneven if I use inpainting. is there a way to process an image to combine all the elements in an 'even' way?
I feel good but tired, I looked into negative prompts last night and found much that was very interesting and a bit odd.
GM all.
I am not entirely sure what you mean by uneven, but, you can always use art programs like Gimp, Krita, etc, to put things together if you feel you need more control. There are some UI's that offer a lot of options, too, but it depends on your preference.
It is a fascinating subject, really. What did you find odd?
Good morning, Kenso!
More adventures in AI art!!
Dun dun DUNNNN!!!
lol
Has someone in the community deployed their own instance of SD please?
Looking for a bit of architectural help if thats possible
Haha (we all need some dramatic flair in the morning lol)
There's actually #1071944112917463200 and #📝|prompting-help and #1047197565365538826
I will not talk too much about the religion of AI prompts, but I see that many use "best quality" as a prompt at the same time as "worst quality" as a negative prompt, and I just wonder what folks want to achieve with (cute:3.5).
anyone know of a tool i can just drop in 50 images and have it create a video transitioning between the images?
Yeah, you can certainly use worst and best, just like you can use the words well and good. It feels like a game of word association tag. There's a lot of fun things you can do, and some do work really well across the board.
I mean...many art editing programs have this capability. If you are looking for something free, program wise, you can prolly use something like Davinci. I'd check #🎥|animation
Though
hi how do i fix an "in queue"? i put in the prompt and click start
nevermind, it works now
but i have another question, how do i generate high quality images from text to image? my generated image looks like it's from dalle mini
If you need help with prompting, I suggest checking out #📝|prompting-help and #1045349359044280360 as well as #1072220168534642768 and #1080946152318443610
There are plenty of examples on the server to look at, as well as community guides, etc
Hi Sunny, I mean sometimes the resolutions of the different parts are quite different... even if I do an overall upscale it's quite apparent.
So, you're preserving the outside of the mask, but what you're prompting isn't really coming out right?
(You can always post images and get help in #1034602544263090268 for reference)
it comes out right, but at a visibly higher resolution than the rest of the image. I'd like to generate the total image again but with everything 'interpreted together'
Well, there are a lot of ways to do things. Personally, I would lower the denoising strength, which controls how much your output image changes in img2img, and generate a few images very close to your base image. Use the seed you want, too, ofc. There are many different ways to upscale and combine upscalers to see what happens/get what you want. #1003034183716835418 is a channel dedicated to working with them, so you might find some useful information to help with your project
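To make the denoising advice a bit more concrete: in many img2img implementations (the diffusers img2img pipeline works roughly like this, as I understand it), the strength decides how many of the sampling steps are actually run on top of your base image. A small arithmetic sketch under that assumption; the function name is made up, and your UI may round differently or have a setting that overrides this.

```python
def effective_img2img_steps(sampling_steps, denoising_strength):
    """Approximate number of denoising steps applied to the base image.

    Assumes the common scheme where only the last
    int(steps * strength) steps are run. A strength of 0 leaves the
    image untouched; a strength of 1 is close to a fresh generation.
    """
    if not 0.0 <= denoising_strength <= 1.0:
        raise ValueError("denoising_strength must be between 0 and 1")
    return min(int(sampling_steps * denoising_strength), sampling_steps)


for strength in (0.25, 0.5, 0.75, 1.0):
    print(strength, effective_img2img_steps(40, strength))
```

So at a low strength the sampler only gets a handful of steps to change things, which is why the result stays close to the base image.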
I'd also post your settings/images
thanks @bleak matrix
Is it possible to train your own images with stable diffusion?
lets say you have 20k nice dslr photos of family and friends
is that a huge undertaking?
Np!
If you want to learn more about that, I suggest you start with #1072220168534642768 followed by #1080946152318443610
We have channels dedicated to these subjects as well, such as #🔧|finetune
Thanks will look around, have you ever done it?
kind of wondering about what the benefits would be as well
Yes
I mostly have family and friend photos - thought maybe if I trained it I could generate new art of them in a way.
but I have no clue :p
There's a lot of awesome things you can do, so it's def worth learning
I recommend you start with learning how to use SD, as this will help you fundamentally understand how to make things with your friends/fam in them
can someone please help me with stable diffusion? for some reason it wont let me launch it
You can post screenshots and get help in #🤝|tech-support
hey
its morning here
It’s evening here
Ever since I started messing around with SD it's all I can think about.
Preoccupied thoughts throughout the work day...while I'm eating...browsing the SD subreddit... any suggestions where to go from here? I currently use Automatic1111 GUI and am learning to navigate that. I guess the next step would be to find a "model" and try other renders from that?
Emad's AMA is starting shortly, see you in #1029055412764422214! We will pin questions you have for Emad in chat.
hi
Man, 2.1 kinda sucks
will there be an everyone ping when SDXL gets released? Or is there any other way I can get notified because I only really care about working locally
is stable diffusion xl open source
I also only care about working locally. SD is fun because it's a local model. I don't want to pay per use or subscribe to some online bot.
SDXL is live, but still not available for offline use yet 🥲
not that I could probably run it lol
Please ask your questions over in #1029055412764422214 where it will be pinned! ❤️
The websites are so goddamn slow now because of the SDXL ping lol
The SDXL model just vanished for me lol
its completely down now i think
I assume that when sdxl gets out of beta Stability will open source the model
yes, once it's out of beta there will be an open-source release, according to the announcement
so sad, I was just starting to have fun with it
it still can't do cube stacking tho
great that we can test out what they are working for so we don't get a 1 year long wait time
wow that is a significant improvement
model disappeared
hello everyone, cant see new model on dreamstudio
my gradio for 1111 is quite slow and take a lot of time to load, is there anyway to fix?
what did you do to thispersondoesnotexist
SDXL beta right? So we still have a chance of seeing better 'hands'? 😭
where do i download the new model? 😮
Hands have been easy since CNET version 1. You just need to work for it a little. You can do it!
I popped up too late to the AMA. 😂
I wanted to ask what the major roadblocks were to having a model that can generate HD sprites, but keep the style coherent, similar but not the same as the video models.
SDXL should be available on DS again 👍
We'll be posting a recording of the AMA soon! ^^
He said there will be pixel perfect precision soon, for what it's worth.
Nice ama
a pixel art model is not the issue, but coherence between frames.
nice.
Soon.
Anybody happen to know of someone having applied AI for organic synthesis or other chemistry?
yo, where can i download the new model SDXL ?
came late to the party want to know more about "IMMERSO"
only available on dreamstudio website?
beta testing release is only available on the API & DreamStudio: #📣|announcements message
oooh so it's not an open beta
i don't have or work with dreamstudio
i'll wait for the release then 🙂
its web based sign in with discord to play
I was hoping we would hear some news on deepfloyd
how can i leave feedback if i can't use it xD
i'm on Auto1111
you get some credits on sign up if you want to try it out!
Just sign in with your discord account.
Wait so just to be clear, SDXL is going from a paid credit system on DreamStudio, to an open-source model to be used on local web UIs?
(Sorry I know very little about Stability AI)
hey so im trying to get everything set up and it's giving me this message when i run webui-user.bat
venv "C:\Users\1anda\stable-diffusion-webui\venv\Scripts\Python.exe"
No Python at '"C:\Users\1anda\AppData\Local\Microsoft\WindowsApps\PythonSoftwareFoundation.Python.3.11_qbz5n2kfra8p0\python.exe'
Press any key to continue . . .
but my bat file looks like this
@echo off
set PYTHON="C:\Users\1anda\AppData\Local\Microsoft\WindowsApps\python.exe"
set GIT=
set VENV_DIR=
set COMMANDLINE_ARGS=
call webui.bat
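A guess at what is going on here, not a confirmed diagnosis: the `No Python at ...` line looks like it comes from the venv's own launcher, which remembers the Python it was created with in `venv\pyvenv.cfg` (the `home` entry), so changing `set PYTHON=` afterwards doesn't affect it. The sketch below parses that file's `key = value` format and checks whether the recorded folder still exists. The sample text and its path are invented for illustration.

```python
from pathlib import Path


def read_pyvenv_cfg(text):
    """Parse the simple 'key = value' lines of a pyvenv.cfg file."""
    settings = {}
    for line in text.splitlines():
        key, separator, value = line.partition("=")
        if separator:
            settings[key.strip().lower()] = value.strip()
    return settings


def venv_base_is_missing(cfg_text):
    """True if the 'home' folder recorded by the venv is absent."""
    home = read_pyvenv_cfg(cfg_text).get("home")
    return home is None or not Path(home).is_dir()


# Hypothetical file contents; a real one lives at <webui folder>\venv\pyvenv.cfg
sample = r"""home = C:\Users\example\no-such-python-dir
include-system-site-packages = false
version = 3.11.2
"""

print(read_pyvenv_cfg(sample)["home"])
print(venv_base_is_missing(sample))
```

If the recorded folder really is gone, the usual suggestion is to delete the `venv` folder and run `webui-user.bat` again so it gets rebuilt with the Python you point it at.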
i mean, doesn't matter much does it, i have all the tools i know how to work with on auto1111, like upscalers, img2img and inpaint... the works. plus i can generate as many images i want. what's the benefit of using dreamstudio? even "edit" says "soon" so i can't even edit the images, right?
hi happy
dunno how dreamstudio is better when it doesn't even have an editor
from testing, sdxl seems to be very similar to previous models in terms of fidelity and prompt adherence. Deepfloyd looks better though
its grayed out and says "soon"
Man AI is progressing so quickly. It's almost scary dude.
both. SDXL is available in dreamstudio + stability API currently, but will be released after more training.
SDXL open source model release when? Or are we clamming up like closed AI?
xD
If it happens we riot 😂
The plan is to open source SDXL
Good.
Crazy
From what i've seen, midjourney v5 does it just a bit better than SDXL
Do you know how much Vram we would need to run SDXL locally, if i may ask?
Mid journey forces you to use discord to generate images. I want to run locally with the ability to fine tune
I wanna make a dedicated AI rig. What are some of the most ideal cards to utilize, price to performance? And at that end, can you even multi-gpu with SD?
4090 is solid, but the a6000 is pretty great
been running with a 3090 for a couple months now. I like it a lot :)
from the ama earlier: ~2.5x of 2.1
I have a 3090 myself 😄 super quick with gens
I've been meaning to try it with alpaca, but my PC is goofy and hard-locks when I start the model for some reason lol
Ah! So it requires double the Vram? If so, thanks.
Same with any 2.1 SD model lol
RTX 3060 with 12GB is a deal
#1072220168534642768 (this server was previously called Stable Diffusion)
Does anyone have a couple of key bullet points from Emad's talk just now?
not yet its still in beta
in dreamstudio (paid) or free in pickapic.io (free but random, you dont know which model youre using)
its a russian roulette between dreamlike-photoreal-2.0, SDXL 2.2 beta, and SDXL 2.2.3 beta
what's the best upscaler ?
how long is a rope?
... depends on where you cut it.
just like upscalers, the best is depending on your needs.
i havent ever used anything other than the 4x ultrasharp model after discovering it
and that's probably great for the art you make, but nobody knows what ne3zy makes xD
Hey everyone, as someone somewhat new to SD I've been using automatic1111's implementation and getting cool results. What's the next step I can take to start exploring making more realistic and cohesive images? I see the images in POW and they look so much more cleaned up and dynamic. Any tips on achieving this level of control on my prompts?
well you can download ControlNet, that'll be a new universe for you to explore
Does someone have the recording of the AMA?
AMA Recording will be up on our YT channel 
raw art made from stable diffusion is still pretty limiting despite it being able to create some crazy good looking stuff so after you get a good image, you can always apply the usual postprocessing stuff that you'd do in photoshop to make an image look professionally done
does youtube have transcripts of videos available? would love to simply read the whole thing
in a channel we don't have access to 
channel is hidden now since ama is complete
when will SDXL be open source?
did u guys check out the new model?
maybe I'm bad at prompting but it seems just cartoony with a lot of things
the model currently on dreamstudio is native 512x512, but that could change with further revisions of the model.
is there any recording of today's AMA going around?
Has anyone used the easy negative embedding? I wanted to ask, I placed the files in a folder and then put that in the sd embeddings folder, is that going to be a problem?
Is there any video showing off SDXL? Some kind of "before/after" promo material or blog post?
hey, how do i create ai pictures on my own pc? can someone help me with this or do we just need to use the website?
google "voldy rentry" and follow a bunch of steps
paid is easier
local install is more versatile
it mostly depends on if you have more patience or more money
i have patience but i don't have money and the website censors almost all of it
thanks for the info
actually if I google I get a page whose URL is "voldyold" but just "voldy" is the right one, hosted on Rentry
noted: you should be good as long as you have a GPU with 4+GB of VRAM (ideally more)
XL version still in training is 1024 native multi-aspect and will likely feel much more varied
i use 1650 refresh it should be enough for minimum, thanks for help i'll ask questions to you if i see trouble
I thought Emad said some models would release this week
or there'd be some big news hasnt been any has there?
@hushed quarry hey, am i supposed to use all the models in step 3 or just one of them
what's the hold up on getting the AMA recording uploaded to the youtubes? We on dial up?
any longer and it will no longer be relevant, at the speed of AI progress
Thank you this is what I was hoping for!
Its on dreamstudio
I know sd is mostly gpu intensive
but does a better CPU help load times
well not the actual image generation
just launching the program
Would depend on your system...you could monitor your CPU usage when you launch SD and see if it spikes to near 100 percent usage.
Hi guys im new and i want to make pics, first of all will a 1660 ti do the job?
second where can i download the program and plugins or smth
it will, but it won't be fast
https://github.com/AUTOMATIC1111/stable-diffusion-webui
one of the most popular web UIs
Do you have details about the language model that SDXL is using? Is it OpenClip or something else?
Hello! Can anyone recommend a good video about Kohya GUI and training LoRAs? I have read the GitHub installation instructions, but I'm still a bit confused.
Hello guys, I'm wondering if anyone knows the name of the song in the video clip https://twitter.com/i/status/1646672027103707136.
Hello everyone! I'm wondering if anyone knows any tutorials/guides on installing stable diffusion from stabilityAI without any webUi, so basically generating the images from the terminal as shown on the github readme. I'm trying to install it on macOS (M1) and i'm having some trouble
use colab
is there like an art proof name for a background that would contain a placeholder character? I basically want to use ai to make backgrounds for my character designs.
no thats the gay ugly colab
but this is for webui
webui is easy
i specifically want to not use it, since i'll be generating images from a python script
is this one bad?
yes its stinky
the other colab is better
allowing other models to run on it
textual inversions
and loras
I mean I get that the performance will be worse, but regardless I need to run it locally
run it with colab or i will throw you into the waifu diffusion paradox

hey
Dreambooth on M1 anyone?
Nope but 🙂 Heya Wulf (y)
hey i need help setting up chilloutmix ai can someone help me thanks
Hey guys, has anyone tried to recreate Hassan Ragab's videos and succeeded? https://twitter.com/HsnRgb/status/1630990048685604864?lang=ar
is there a way to use prompt s/r on the negative prompt?
Having a really hard time applying architectural prompts with ControlNet (openpose). I still get humans as results with architecture in the background..
openpose isn't made for buildings at all, it forces a human into the frame
mlsd is the best in this category, but canny and depth can help too, as well as HED
I've tried depth at first and it kind of worked. Out of 300 frames only 70 had human-like features (e.g. real faces or legs). The problem with depth is that it tends to cut off the arms. His videos are tracked really well^^
Is there another website like civit.ai where you can browse models that isn't filled to the brim with NSFW?
hi
Hey there
what's happening ? what results do you get ?
you can't post pictures in general chat
Hi everyone,
I'm looking for AI artists for a paid gig, used to working with warp or controlnet, vid2vid etc
Does anyone know if using movie styles, such as those from Game of Thrones, for generating an avatar could be considered a copyright violation?
need a lawyer for that
Anyone have a suggestion for a easy and free background removal tool?
i can say that if an avatar looks too much like its from an anime then some companies will send dmca
ai gen or not
you would be on better footing if you paid an artist to make the avatar and used the ai gen as just Ref
thats for a lawyer to say
they can sue for any reason they want to pretty much
It's so complicated ! I don't think they have an answer 😄
if they feel you are diluting their ip sure
under current copyright law as a normie non lawyer understands it
if you want to have copyright of your content you need the final creation to have significant alteration by a human
the more humans you have involved the better
when it comes to copying characters or designs from movies there have been cases with companies using photographers and makeup artists
for copying a look
so its not unheard of
No I'm not looking for that, I just wanted to make sure if I'm able to sell it as a Movie package.
is that image just a text prompt?
that sounds like you are in a legal grey area
that doesn't sound like enough human input to me
i wouldnt do it unless you own the original images used
for this purpose
I have a question about image prompts, are they kept on the servers or deleted afterwards?
They are stored with the pictures generated, you can also recall the last prompt with a single click
In <PNG Info> you can recall a prompt from a picture and send it directly to txt2img
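For the curious, PNG Info works because the prompt and settings are saved as a text chunk inside the PNG file itself. As far as I know AUTOMATIC1111 writes them under the keyword `parameters`; treat that keyword as an assumption for other tools. Here is a small standard-library sketch that reads `tEXt` chunks from raw PNG bytes. The demo bytes are hand-built just to exercise the parser and are not a viewable image.

```python
import struct
import zlib

PNG_SIGNATURE = b"\x89PNG\r\n\x1a\n"


def read_png_text_chunks(data):
    """Return {keyword: text} for every tEXt chunk in raw PNG bytes."""
    if not data.startswith(PNG_SIGNATURE):
        raise ValueError("not a PNG file")
    texts = {}
    pos = len(PNG_SIGNATURE)
    while pos + 8 <= len(data):
        length, chunk_type = struct.unpack(">I4s", data[pos:pos + 8])
        body = data[pos + 8:pos + 8 + length]
        if chunk_type == b"tEXt":
            # tEXt layout: keyword, a zero byte, then Latin-1 text.
            keyword, _, value = body.partition(b"\x00")
            texts[keyword.decode("latin-1")] = value.decode("latin-1")
        if chunk_type == b"IEND":
            break
        pos += 12 + length  # 4 length + 4 type + data + 4 CRC
    return texts


def make_chunk(chunk_type, body):
    """Build one PNG chunk (only used to create the demo bytes)."""
    crc = zlib.crc32(chunk_type + body) & 0xFFFFFFFF
    return struct.pack(">I", len(body)) + chunk_type + body + struct.pack(">I", crc)


demo_png = (
    PNG_SIGNATURE
    + make_chunk(b"tEXt", b"parameters\x00a cat, photo\nSteps: 20, Seed: 1")
    + make_chunk(b"IEND", b"")
)
demo_text = read_png_text_chunks(demo_png)
print(demo_text["parameters"])
```

On a real file you would pass `open("image.png", "rb").read()` instead of the demo bytes. Prompts with characters outside Latin-1 may be stored in a different chunk type that this sketch does not read.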
Automatic1111 is producing generic tile style patterns every time I try to generate an image, no matter the prompt, no matter the model. What am I missing? Invokeai works fine
Has someone a short summary of all the important aspects for the prompts, which I can feed GPT-4 for prompt creation?
I use this one with GPT 3.5
#🏞|general-with-images message

hi, I want to ask if i use the photo generated by stable diffusion to train Lora (with similar face but not the same face) and then i use this lora to generate photo with weight 1.0 will i get a consistent face which is the mixture of the photo i used or how
for example, I use 10 Jisoo photos and 10 Lisa photos to train Lora, then use this Lora with weight 1.0, will I get a consistent face that is half Jisoo half lisa or will it just sometime look like Jisoo and sometime look like Lisa
Did you know that cows have spiritual powers and serve as messengers between the physical world and the spirit realm. In certain cultures, cow hide is used in ceremonial clothing and accessories because of its believed mystical properties. These beliefs lead to the idea that cows are secretly shamans, possessing wisdom beyond our understanding. 
You'll have more help on those in #🔧|finetune usually
is sdXL free from dataset copyright problem?
@vast ingot Thanks works great!
IM FINALLY unmuted..
if it helps, i used the prompt "shutterstock" and i got zero watermarks
So hello everyone! I need your help :/. You may have seen videos where an AI (they say it's Stability AI) creates pictures of anime girls. I want to know how to make that kind of stuff with Stability AI. Now you're thinking that I'm a simp. NO. I'm making a Ren'Py game and I need original pictures. So can somebody tell me how to use it?
Question: Can I ask questions about Stable Diffusion WebUI in this discord? If so, in which channel?
#🤝|tech-support is the best place for these questions ^^
Check out our #🍥|anime channel, they're very active and helpful!
@wild steppe Thanks <33
Hi guys, I'm totally new here. Trying to use stability.ai with my discord, but won't let me accept the terms of use. The button is grey. Why? 😦
Star Wars, world's longest trilogy.
Hi i have problem with installing 1111 . i did install python but i get these errors what should i do ?
ERROR: Could not find a version that satisfies the requirement torch==1.13.1+cu117 (from versions: 2.0.0, 2.0.0+cu117)
ERROR: No matching distribution found for torch==1.13.1+cu117
Is there any AI that can take a line drawing and colourise it?
controlnet can do that
did you install git?
yes
im not an expert but looks like something went wrong with your pytorch install
i'm gonna uninstall and install python and git again
Hey Everyone 👋
Let me introduce Agora, An Open-Source Multi Modality AI research Coalition to Advance Humanity!
what is torch ?
it's PyTorch, the python framework that handles the machine learning stuff
This is a great server already loving it
Hi! is there a way to color a lineart image in SD?
hi i need help installing stable diffusion
#1072220168534642768 or if your question isn't answered then #🤝|tech-support 
controlnet can do that
very awkward question, how do you pronounce Emad? i'm gonna be at the ML meetup next week and don't want to fumble his name if i ask a question
What is better to use?
It's like e-mod
what's the best model for creating a space helmet based on the marauders game
ngl, so far been using it for porn, please help me branch out
Hi guys. Do you have a success story regarding a locally installed ChatGPT-alternative which could be connected in some way to SD to generate prompts?
I am thinking about things like trained text-models, LoRAs or such stuff, to realize kind of a "virtual friend / partner" finally. I hope that sounds not too weird.
I take it the AMA is never getting uploaded?
do you have any idea how i have to use
torch-1.13.1+cu117-cp310-cp310-win_amd64.whl ?
i had problem with installing 1111 and when i googled it said i have to install that
but now after i download it it wont open bc its not exe
i dont have any idea what should i do
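A `.whl` file isn't a program you open. It's a package archive that gets installed with pip, e.g. `pip install` followed by the path to the file, run with the same Python the web UI uses. The file name also tells you which Python it was built for: `cp310` means CPython 3.10, so it won't install on 3.11. A small sketch that splits a wheel file name into its tags and compares the Python tag with the interpreter running the script. The helper names are made up for illustration.

```python
import sys


def parse_wheel_filename(filename):
    """Split a wheel file name into its standard dash-separated parts."""
    if not filename.endswith(".whl"):
        raise ValueError("not a wheel file name")
    parts = filename[:-len(".whl")].split("-")
    # name-version[-build]-python_tag-abi_tag-platform_tag
    if len(parts) not in (5, 6):
        raise ValueError("unexpected wheel file name layout")
    python_tag, abi_tag, platform_tag = parts[-3:]
    return {
        "name": parts[0],
        "version": parts[1],
        "python_tag": python_tag,
        "abi_tag": abi_tag,
        "platform_tag": platform_tag,
    }


def matches_running_python(python_tag):
    """True if the tag names the CPython version running this script."""
    expected = f"cp{sys.version_info.major}{sys.version_info.minor}"
    return expected in python_tag.split(".")


info = parse_wheel_filename("torch-1.13.1+cu117-cp310-cp310-win_amd64.whl")
print(info)
print("matches this Python:", matches_running_python(info["python_tag"]))
```

If the last line prints `False` when run with the Python the web UI uses, that mismatch is a likely reason pip can't find or install that torch version.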
I want to create some balenciaga meme type images. How could I approach the quality of what is seen in the memes?
Any progress on this? (Incorporating something like UV-coordinates into SD for the purposes of keeping a character consistent across frames/poses/images?)
Question! Is there a channel where I can ask about commissions?
This is sort of a central server! #1034941531762733167 has a list of other AI-related servers too
You can ask in relevant channels or in #1092446741984444416!
yooo much thanks
Anyone wanna watch the Coachella Livestreams? I'm watching them in the Harmonai discord server if anyone wants to join. Gorillaz is performing rn.
nothing to do with each other i think
unstable hate stability for taking nsfw images out of the training
i think they make all the sexy anime models
yeah 1.5 took time to build momentum but it is getting there. sdxl this week i hear. maybe
https://clipdrop.co/stable-diffusion this is a great preview but i can't log in right now
well 2.0 itself had wonky results for some cases.. okay a lot of cases. they investigated and realized 0.9 was a bad setting for the filter and it took out WAY too much
they set it to something more precise for 2.1, like probably 0.99
Any idea how I get real photorealistic images? For example, my prompt just now in SDXL beta was: Photorealistic dog sitting in an airplane. The output was more of an animated image instead of a photorealistic image. Does it randomly change to animated vs realistic or how does this work?
no no nothing standard. okay so SD was trained on a total set of billions of images. On a huge cluster of GPUs, like 1000s of them. For 2.0 and 2.1 they used a nsfw filter on the set before training it, so it culled any sort of image that had boobies or butts. none of this is standard cause they're writing the rule books as they go.
base 1.5 from runway ML, trained on stability's GPU cluster, is able to do porn before it is even refined at all. not bad either.
Ayo sup guys
probably not. they'd get investors either way. billionaires don't give a fuck
emad been saying kids and pornography together in the dataset presents huge problems. if the community wants to refine that sort of thing into a model later then its not like stability the company can stop them.
Hello, am I able to fine-tune the 2.2 XL model on my own data-set?
not yet, and only if you had a lot of gpu power. that kind of training power should be available in the cloud if you don't own it
I mean can I fine-tune the model they offer via API?
otherwise, I think they haven't released that model, right? Is there a plan to do so?
there is a plan to release it. sdxl is still in training, which is why it's only available through websites
@marble cedar you on a laptop 4070ti?
Did you do custom cudnn installs
Then your performance is far slower than it should be.
Check out this full convo I had with Dallas in the tech support chat: #🤝|tech-support message
It should significantly improve your numbers
Does a 4090 pull 600w when rendering images?
no
350 - 400 depending on the image size
Is it only benchmarks that can use 100% of the 4090's power?
No, typically image generation can fully utilize the 4090 - however you need to have the right cudnn files, python version and nvidia drivers to fully utilize the card.
I have used Dream Booth to train my girl friend and created a checkpoint based on chilloutmix. Shall I use this custom trained checkpoint as the checkpoint for my Textual Inversion? Or should I use a clean chilloutmix checkpoint for it?
what do you plan to train in this TI?
If you plan on training again on your GF to improve quality with double training, it can help a little yes, but it's rarely the best solution to go for.
In any case, to answer you :
the best quality you will have is by training your TI on the model you intend to use it on. It will work on different models, but it will perform best on the model it has been trained on
Hey has anyone tried SadTalker in automatic 1111 yet ?
yep, it only outputs 256x256 faces, stitched back on the original image they tend to look very low res compared to something like D-ID
dang lol
hoping it could do larger
I've seen other methods for animating AI character heads
for something usable you'd have to run the whole video through controlnet video2video or similar
and there are a looot of frames so
the problem with all these talking head research demos is they all use the vox dataset which is stuck at 256x256 grid
like wake up this isnt 2010
but man its hard to figure out
pretty much, the talking anime head thing only outputs 256 too
thin spline-whatever it was called as well
someone suggested live2d
Yes they are all based on thin plate spline model
live2d you'd have to cut out parts, make sure there are no holes and such, it's a bit more advanced
could be feasible with the segmentation models
I was looking at Thin Plate Spline Motion Model stuff but most of the tutorials need a video
I just wanted audio to head animation
they'd work on a phone recorded video tho 
For the strongest single image variation from face you have to go thin plate + dreambooth
however since its 256x you lose the skin
if you dont find a good face you go further and further into the anime realm and lose realism
in stable, if the unclip models were stronger, then you would have a free alternative to midjourney image variations
for now, you have to go dreambooth
I have tried to apply realistic skin patches using control net and a realism model to get back the skin, it is too slow and would need photo restoration skills
the posts on reddit are fun until you realize the noses arent the same
yeah also the small details like mouth movement tend to not upscale very well with controlnet
got any good videos on how to do it ?
getting back the mouth can be done in gigapixel upscaler
it is the only feature where it is ok to do so
all of this can of course be avoided with a large number of sample faces of the same subject, then lora to keep the skin
my problem is I have a single face
pretty long process for a funny meme
you are trying to control net a meme?
nah I'm just saying most people just want to do like a funny short video with a talking character, training loras, upscaling skin and such is a bit too long of a process for this
unless your use case is more professional
for me the faces are just not good enough
and having a realistic skin wont do, I want a concept
and I want extreme control over the face
hello
can i make images of my friend to prank him, how to do this?
^^
Can someone tell me if there is something wrong with Dreambooth? I have been trying to train over and over again today. It generates class images that look nothing like the training pictures and then generates samples that look nothing like what I am training. It's just wasting my time. I upgraded my graphics card and upgraded my Automatic1111 which made me end up having to install torch 2, etc.
Newbie here, hi y'all. I have a comic hero, flat art, and I'm trying to get Mage to render it looking photo-realistic with inpainting img->img. It keeps adding its own hero behind my hero so he gets extra cape and arms that look great while he's still flat. Is there a way to get around this? I tried on DreamStudio to 'copy' to a photo and it just kept adding its own badges and belt buckles. Imaginative one, that one.
hey guys im new to stable diffusion and im having trouble installing it and making it work, can someone please help me?
Good morning, everyone!
How are we all today?
doing bad cause im spending all my hours trying to figure out how to install stable diffusion
Please check out #1072220168534642768 and #🤝|tech-support There's also several videos on YT about installation as well!
yes but im having an error, it keeps saying it cannot launch python
I suggest using a combination of inpainting/negatives/prompting, etc. For prompt help, I suggest checking out #📝|prompting-help
You can post a screenshot of the error in #🤝|tech-support , and you can also search the server for the same issue using the words of your error to see how/if it has been solved
Ok im gonna post the error but when i post it can you please check it out if you dont mind, maybe u can help and just see what it is
if thats ok with you
I just woke up, so I am not really awake at this particular second. But someone should be able to help you in #🤝|tech-support , and if not, you can always check back a little bit later and ask again.
ok thanks
hey! fairly new to sd - can someone please briefly explain to me the difference between fp16/32 and full/trained/merged models? i do understand that embeddings and loras are "additions" to existing models but the others i do not get so much. pls help 🙂
hey guys, maybe someone knows what can be the reason that I don't have DPM++ SDE Karras?
It's the only sampling method I'm missing, and it's actually the one that I want to use 
there are full models that are trained on big datasets, there are also pruned versions of them - those are the same models but made compact so they take less space, and they provide almost the same quality as the full models. Merged models are the ones that aren't trained but made by mixing a few different models in one way or another.
I have no idea what fp16/32 is though
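To make "merged" a bit more concrete: the simplest kind of merge is a weighted sum, where every weight of the new model is a blend of the matching weights from the two source models. A rough sketch on toy data, assuming each checkpoint is just a dict of name -> list of numbers (real checkpoints hold tensors, and `merge_weighted_sum` is a name made up for this example):

```python
def merge_weighted_sum(model_a, model_b, alpha):
    """Blend two checkpoints key by key: (1 - alpha) * A + alpha * B."""
    merged = {}
    for key, weights_a in model_a.items():
        if key not in model_b:
            # Keys that only exist in A are copied over unchanged.
            merged[key] = list(weights_a)
            continue
        weights_b = model_b[key]
        merged[key] = [
            (1 - alpha) * a + alpha * b for a, b in zip(weights_a, weights_b)
        ]
    return merged


# Toy "checkpoints" with a single layer of three weights each.
model_a = {"layer.weight": [0.0, 1.0, 2.0]}
model_b = {"layer.weight": [2.0, 1.0, 0.0]}

print(merge_weighted_sum(model_a, model_b, 0.5))
```

With alpha at 0 you get model A back, with alpha at 1 you get model B, and anything in between is a mix, with no training involved.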
Hey
does stable diffusion have that image extender feature
can anyone tell me
like dalle where it extends an image
based on the previous image
yep
it is called outpainting
You can learn more in #1034602544263090268
ok thx
Konichiwa
Good morning!
thank you! fp16 and fp32 seem to be related to different kinds of mathematics in the algos used for... i dunno. i dont really care either but i read about the compatibility if you want to create merges. you cant merge fp16 with 32 - as i've understood it...
that's about it yes. fp32 is more precise, and can be converted into FP16 for a lighter file usually too.
they take less space, and the difference in precision is very minor in terms of picture changes
but to finetune the model, it's usually better to run on higher precision (fp32) if your hardware can manage it
thanks! 🙂
hi
i've seen this that precision is lost and that makes sense, but whats the real results? are generations a little fuzzier?
no, they feel the same quality to me, but you do see small difference. it's hard to qualify the result as "worse" visually though
I mean running the same parameters on just the 2 different models
I dont think i could go to civit and guess which images were 16 and which were 32. guessing 32 might be better for training?
the picture is just different in some small details
especially, yes
training benefits a lot more from higher precision
this makes sense. i might be understanding!
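For anyone who wants to see the fp16 vs fp32 difference in numbers, here is a small sketch using only Python's standard `struct` module, which can store a number as a 4-byte float (`f`) or a 2-byte half float (`e`). The example weight value is arbitrary and the `roundtrip` helper is just for illustration.

```python
import struct


def roundtrip(value, fmt):
    """Store a Python float in the given struct format and read it back."""
    return struct.unpack(fmt, struct.pack(fmt, value))[0]


weight = 0.1234567  # an arbitrary example weight

as_fp32 = roundtrip(weight, "<f")  # single precision, 4 bytes per number
as_fp16 = roundtrip(weight, "<e")  # half precision, 2 bytes per number

print("bytes per weight:", struct.calcsize("<f"), "vs", struct.calcsize("<e"))
print("fp32 rounding error:", abs(as_fp32 - weight))
print("fp16 rounding error:", abs(as_fp16 - weight))
```

The fp16 copy takes half the space and is off by a much larger (but still tiny) amount, which fits what was said above: the file is lighter, pictures barely change, and training is where the extra precision matters more.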
It depends on sampler you're using mostly...
Too high CFG scale could overbake, sometimes even a too long negative prompt makes it weird.
(at least negative embeddings, based on my experience)
keep experimenting. i was going to offer tips but they were immediately argued upon. There's a lot of toxic attitudes that want to argue every tip.
Remember this. nobody is an expert in this field. Not even the PHD doctors have it figured out and they're the most knowledgeable in the field!
Your most reliable form of discovery is going to be experimenting. try an xy grid generation with cfg and step counts if you want some sweet experiments that give a great lens on what works best. Z axis is good to use for different samplers
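The grid idea above can be sketched as a plain list of settings to render, one image per cell. This only builds the list of combinations; the values and sampler names are example choices, and actually generating the images (with a fixed seed so the cells are comparable) is left to whatever tool you use.

```python
from itertools import product

# Example axis values; pick whatever ranges you want to compare.
cfg_scales = [5, 7, 9, 12]                    # X axis
step_counts = [20, 30, 50]                    # Y axis
samplers = ["Euler a", "DPM++ 2M Karras"]     # Z axis


def build_grid(cfg_scales, step_counts, samplers):
    """List every (sampler, cfg, steps) combination, one image per cell."""
    return [
        {"sampler": s, "cfg_scale": c, "steps": n}
        for s, c, n in product(samplers, cfg_scales, step_counts)
    ]


grid = build_grid(cfg_scales, step_counts, samplers)
print(len(grid), "images to render with the same prompt and seed")
for cell in grid[:3]:
    print(cell)
```

The count grows fast (4 x 3 x 2 is already 24 images), so it helps to start with a few values per axis.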
someone saying you can't do that is probably just being negative and wrong.
hi is it necessary to get the engine list first before every request? can i hard code an engine id, is that a good idea?
Matrix is good way to do it, yea.
I was doing matrix for each new model first time to see how different parameters and samplers affect results
hello, i found this server bc i have some questions about stable diffusion, may i ask here or there is a channel for questions?
Hey, that depends on the Question.
If its technical stuff then in #🤝|tech-support
If its about prompting then #📝|prompting-help
If its a General question feel free to ask here
tysmmm
Hello
Hi
Is deepfloyd supposed to be released this week?
3D render of a paper dragon, studio style photography
What is the best way to achieve "similar image"? Like I upload an image of a woman in a modern living room using laptop on a couch with a small yellow dog, and I want the output to be a brand new image but with same elements and same description. Something like in Midjourney, using "describe" to describe a picture, then using the description as a prompt for a new image. Thanks!
idk how exactly describe works in mj, but you can get variations of same image with
- variation seed (extra button if you're using automatic1111)
- doing same seed, but changing options a bit
- doing img2img
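On the variation seed option: as far as I understand it, the idea is to blend the starting noise of your original seed with the noise of a second seed, so a small strength gives an image close to the original. The sketch below shows that idea with spherical interpolation on plain Python lists. It is an illustration under that assumption, not the webui's actual code, and the function names and seed values are made up.

```python
import math
import random


def noise_from_seed(seed, size):
    """Deterministic Gaussian noise for a seed (stand-in for latent noise)."""
    rng = random.Random(seed)
    return [rng.gauss(0.0, 1.0) for _ in range(size)]


def slerp(t, low, high):
    """Spherical interpolation between two noise vectors, t in [0, 1]."""
    norm_low = math.sqrt(sum(x * x for x in low))
    norm_high = math.sqrt(sum(x * x for x in high))
    dot = sum(a * b for a, b in zip(low, high)) / (norm_low * norm_high)
    dot = max(-1.0, min(1.0, dot))  # guard against rounding outside acos range
    omega = math.acos(dot)
    so = math.sin(omega)
    if abs(so) < 1e-8:
        # Vectors are (anti)parallel: fall back to a straight linear blend.
        return [(1 - t) * a + t * b for a, b in zip(low, high)]
    w_low = math.sin((1 - t) * omega) / so
    w_high = math.sin(t * omega) / so
    return [w_low * a + w_high * b for a, b in zip(low, high)]


base = noise_from_seed(1234, 8)       # noise of the image you liked
other = noise_from_seed(5678, 8)      # noise of the variation seed
slightly_different = slerp(0.1, base, other)  # strength 0.1: small change
print(slightly_different)
```

Strength 0 keeps the original noise and strength 1 is the same as just using the second seed.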
civi dead again
yea I wanted to find new models to play with
yep, same 😄

wot?
you're trolling
chill

keep your conspiracy theories over in #1008652627204132904 
kek means lol, its a laugh
kekekekek is laughter
tf about the frog, pepe the frog?
what?
You are either a child who is trolling, or terribly misinformed
I am genuinely not sure
ㅋㅋㅋ (keukeukeu) is the childish Korean equivalent of the English "haha". Since this is often used in StarCraft matches, Blizzard, StarCraft's developers, decided to reference it in World of Warcraft: when a player of the Horde faction types "lol" using the /say messaging command, members of the opposing faction see it as "kek".

Anybody knows what happened about civitai?
pepe the frog is an online meme. It was a meme before it was ever used by the right, and it is a meme now.
@vast ingot
probably adding new features \ changing existing ones \ maybe design changes...
In short - probably doing something with site itself
Thanks bro
yeaaaaaaaaaah politics is a topic that, if it's starting to create drama and trigger/troll people, isn't welcome in general.
we are an SD server, not a politics server. DMs if you need, but not in #💬|general-chat . #1008652627204132904 or #🌶|off-topic if you can manage to keep it civil, PMs or muting each other if you can't
good night Happy
Rest well!
dead again
ok since my home channel of anime didn't speak up, is there anyone in here who hasn't ever had a Nitro Subscription? I have a code for 3 months of free Nitro on Discord, all you gotta do is raise your hand
How do you generate photos of old historical figures, like Julius Caesar? It just shows him as a stone sculpture/bust for me
I'd take it 
just to be able to upload bigger videos lol
check DM
idk if SD itself has photos of him to work with.
You can try adding "sculpture", "bust" as negative prompt, if it doesn't work - probably will need actual training on photos if there are any
did you see it?
One of the last channels
https://discord.com/billing/promotions/P3rS6cCZuMDdemeePwxnQYen if it still works, whoever clicks first and can use it first, have at it
yea, I couldn't too
is there an AI tool to increase the quality of compressed videos?
why does this look legit
civi back up...but can't login...eeeeeeeeeeeeh
I'm talking about civi, it has nothing to do with discord

You can't scam me if I literally have no idea how to login to my own discord account
I NEED LORAS 
But civi won't let me log in, the button to auth with google doesn't work 
NOO ITS DOWN AGAIN 
we killed it
no, the nsfw anime models uploaded every single second killed it
not trying to diss sd models but legit 75% of them are just the same thing in either a different style or type of woman 
either that or furries
yea, but whatever, it's fine

That's what always happens when community is huge, even with programming
fr
stop posting messages. it lowers the chance of people seeing this
ill just bing it edit: couldnt find anything
the link to nitro was from a game pass ultimate code offer to whoever hasn't had a nitro before. It runs out or ran out on April 26th and I dunno if it's been used up or not at this point
you get these promos at random as an ultimate holder
yes
Instead of packing all your images together into a giant image most rigs can't even process, like https://github.com/LonicaMewinsky/sd-webui-keyframer does, you could just run through all the input images, average them together, then reuse that image's latent space for the whole animation. Even better, Latent Space Import/Export. Would it work as well as a LORA? I have no idea. For all I know, it could. Somebody should get on this.
Unless it's already a thing. In which case, download where?
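Just to illustrate the "average them together" step from the idea above, here is a toy sketch on tiny grayscale frames stored as lists of rows. It only covers the pixel-wise averaging; turning the averaged image into a latent and reusing it for an animation is not shown, and whether that would work as well as a LoRA is untested.

```python
def average_frames(frames):
    """Pixel-wise mean of equally sized grayscale frames (lists of rows)."""
    if not frames:
        raise ValueError("need at least one frame")
    height, width = len(frames[0]), len(frames[0][0])
    total = [[0.0] * width for _ in range(height)]
    for frame in frames:
        for y in range(height):
            for x in range(width):
                total[y][x] += frame[y][x]
    return [[value / len(frames) for value in row] for row in total]


# Two tiny 2x2 "frames" with pixel values in 0..255.
frames = [
    [[0, 100], [200, 50]],
    [[100, 100], [0, 150]],
]
print(average_frames(frames))
```

With real images the same loop would run per color channel, or you would use an image library instead of nested lists.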
is a1111 sd2?
(I'm looking at controlnet's instructions on stuff and it says "use this model.. or that one if you are at sd2")
no.
A1111 is a tool that uses SD.
SD is a technology, there is no "SD2" in that sense exactly
But "SD" also refers to the SD1.5 and SD2.0 or SD2.1 models
A1111 works with all those types of models
I'm not sure what they are referencing as SD2 in your video
there are 2 sets of models for controlnet though
controlnet is something that comes "on top" of your model
if you use a model 1.X, you need controlnet for 1.X
and same for 2.X
actually, I'm at 1.4 in a1111, so I'm just trying to update my model at the moment.
but i was reading controlnet's training page since I'm thinking about some of the issues I'm having with generation (a skeleton with just one muscle.. but the canny edge detection loses the distinction, screwing up my images)
then there are 2.X models, those have other requirements
controlnet training page ?
you are intending on training a new controlnet you mean ?
or just a tutorial on controlnet ?
(sorry, training is a keyword around here)
@vast ingot training.. wanted to, possibly, make it more capable at some tasks with a specifically-trained model
can i generate 2k image with stability api
i am trying to upscale an image but i get the same dimension size
Hey guys i was wondering if theres a way to train non square images yet?
turn on bucketing (lora), it can cause problems tho
Any tutorials on this?
hello is there a LoRA for making massive structures ? I'm trying to make giant pillars of stone but it's not working.
any sprite sheet models out there already? if not, I assume it's difficult to get it to work well enough?
havent seen loras for structures much, you are better off prompting it, and/or img2img
have seen sprite art, try prompting it and experiment with different models (some models can definitely do it)
did u guys check out the new model?