#💬|general-chat
I'm likely already running all that then. Unless you have to follow special install steps, which does not seem right.
I read and follow a lot of guides etc. Never heard of anything. Except those things I don't use called deformers or something?
I don't even know what they are or if I should use them. lol
the 'boost' was to download libraries from nvidia and drop them in the pytorch dependency folder, at the time automatic1111 (I think) didn't have the ability to distribute them. But a newer post said they're included now.
Interesting I have not updated in a month.
But likely will soon.
Month is like 25 years in AI time.
But so much was changing I was like... I don't want to update every few days. So waiting for a bit longer.
yup thats why all of these txt2video gizmos are dropping out right now and I'm like 🥱
I'm waiting a good 3 weeks for us to have the real thing
txt2video is going to be awesome in the future. Like next year.
But for now I don't like it.
Exactly!
lol
If someone makes an app that alerts user of every AI advancement or news. It may just be better for the app to send an alert when 20 minutes pass without an update.
Less alerts.
I just wait for aitrepreneur's videos although even he can't keep up
Yes I follow like 3 youtubers and he is one of them
I'm waiting for him to drop a vid on comfyai since I dont like olivio's vids
after testing the same prompt/seed/config on 30 different models I can say, without a doubt, that standard diffusion loves butts.
So those videos will likely be a lot of twerking.
Vanilla 1.5 is not that good of a model.
It kind of is.
But when you look at user models it's not even in the same like region. Let alone ball park.
1.5 vanilla is better than 2.0 and 2.1 vanilla.
2.x is still trash all these months later
Sure 2.0 and 2.1 have some exclusive like features or bonuses. But overall are worse.
how do you combine concepts
I don't even have the vanillas. Anymore. I used to have 1.4 and 1.5, not sure what happened to them.
2.1 that one user model is pretty good!
sadly theres no indication that 3.0 is gonna be any better. thank the big corporations and snowflakes for censoring AI
Probably deleted to make room
But it's not as good as 1.5 models
like im trying to make a pokemon turtle made of vines but I can't it just gives me regular turtles
Agreed but it's only a matter of time before something better comes. It has not even been a year since 1.0 released right?
damn has it really?
ur right. it was only a few months ago that I discovered mj and went into this rabbithole
I've come a long way in such a short time lol
Yes AI as we know it is not even 1 year old
fuck
All bad, especially teasers like Google's music AI
But I mean going public for MJ or SD. If I am not wrong it's like AUG or something like that?
I think MJ came out sooner.
Once we get true AI generated music I can say for sure it's a good time to be alive
but not music that sounds like stock music
Yes good AI music does exist now but it's closed to the public. That's my understanding
I mean AI that can copy an artist's style. That would be the shit
yeh but it's unknown whether it just does generic or if it copies styles
Also probably 2 text to image models that can surpass or match MJ/SD user models that are also closed to the public.
I mean, V5 is essentially closed behind a paywall rn
not that I care, MJ can go fuck itself
Yeah its good but what good is it without dreambooth?
No inpainting?
Its trash.
But so much wasted potential though
when music AI gets released, the lawsuits will start flying. the music industry does not like any form of competition
My viewpoint is MJ V5 is the best text-to-image AI on the market, paid or free. However, it has limitations that you don't have with SD. So SD overall I think is better and of course has more max potential.
well if its released locally then the genie will be out of the bottle for good lmao
ofc
It's the fact that it runs on discord that grinds my balls
If they let us have an interface like SD's, I'd be like $300 poorer rn cuz I'd be subbed since it came out
but fine, let em do it their way
MJ as far as we know will never be 100% private, allow gore/nudity etc, and it will never be free... and likely never have nearly the same level of customization. And likely won't be open source. All of these are major victory for SD.
ugh yes I forgot about that
no boobs, no ass, and I think they nerfed celebs as well
nty.
Oh and let's not forget that MJ is struggling to release inpainting, outpainting etc... don't even get me started on things like controlnet.
What is SDXL?
It's an upcoming vanilla SD model with more parameters.
theyre not struggling, they just dont give a shit.
So, uh, I tried that prompt test on 1.5. OMG. It's terribad. Like art seen on the back of a redneck truck bad.
MJ is like the Mac of computers.
Oh hell yeah we need that
If people wanna waste money on something so inferior then let them
I listened to some of the audio interview things. It seems like their model has issues with some of that tech. But I don't know how true that is.
I'm guessing MJv5 is an 8B param model
It's just that much better than anything else out today
Prompts generally don't translate well between models
I've seen a few messes, but none this bad.
MJ V5 I think has some of the best celebrity functionality of any model! Otherwise you're normally right, they try to hold back on some of that. But a lot of people are pointing out that MJ V5, at least the alpha, has been pretty amazing for that.
ah, wait, don't have a vae for it. let's see if that helps
its insane in the amount of detail it can generate
I wish someone could suck that model right out of their tentacles and find a way to export it to SD without the censors
can you imagine the utopia that would emerge
Yes I agree. I want it so badly. As long as it does not flop like 2.0 and 2.1. Here is a message I sent to my friend. This is NOT SUPPOSED to be serious. This is just me giving a rough opinion. lol
Here is how I see quality of AI models and what open source can do.
If vanilla SD 1.5 is == 100 in quality on a scale.
Vanilla SD 2.0 and 2.1 are both like 70 on the same scale.
User models for SD 1.5 are like 300 on the same scale.
Dalle 2 is like 150 on the same scale.
And MJ is like 325 on the same scale (edited)
So if Vanilla SDXL can be like 150 on the same scale. I'm sure it will easily be open sourced to like 350-400+ on the same scale.
Oh and to finish this scale off I would put user models for 2.0 and 2.1 at like 250ish but of course with some unique bonuses over SD 1.5 models.
And Craiyon is like 20 on the same scale. lol
in sheer quality yes
but theres more to it than that. controlnet alone makes MJ be like a -100
Absolutely. And I agree 100%
seriously I dont even bother generating without controlnet these days
good luck having those fools integrate that into a clunky discord prompt
Sometimes though certain generations are impossible with controlnet.
So it's all about what you want!
However that's where SD shines. You can fine tune things to what you want.
Do we know how many params they are targeting?
Are they going back to CLIP and porn?
We need clip and porn both in the base model imho
If you want everything without really having to change too many settings at all, MJ is likely the best overall model and system. But like we have all discussed, it has major issues that can make it pointless to use at times.
Unless openclip is a lot better
Sadly all of that goes over my head. But I do want NSFW in the base model, just like 1.5 etc. However, I don't know how important that really is. Like, can user models force that in if they need to? Or does the base model need some of it anyway? I agree that if it has no NSFW ability then SDXL will still have a purpose! But it won't surpass 1.5 overall in my mind. I may go a long time without generating NSFW content at all. But you need the ability to in order to have a truly great model.
To some point it's not important. To others it's very important. But not having any ability to do so is a problem when it comes to free expression of art.
Like I clearly still love MJ. And it has none of that. SO it's not like my use of text to image is dependent on it. But at times it's very good.
Base model is going to be 3x better with porn. Why? Because porn has all the best tags, all the highest quality, and makes up a massive percentage of the total image data on the internet
It's not about the porn, it's that making an SD base model with no porn is like making an Olympic athlete that can't eat protein. Just fundamentally really disadvantaged
can we just agree that censorship and cuckification are destructive not protective.
I am really confident that it would be best to keep it at 512!
yes the great thing is that having NSFW content in a model makes more people put effort into it. Even its PG content is likely going to get a buff too. PG content enhances NSFW, I don't know if you know what I mean but... people often want, you know, amazing looking things in their NSFW content. You can still use many of those models in non-adult ways.
for people with small vram yes
Big vram too imo
why
You want more parameters, not more resolution
why not both?
I can agree with that.
High-res fix will get you to 2048x2048 with no issues
yeah while the imgs look plastic af...
mj I can guarantee is at least 768
I suspect v5 is 1024+
That depends also on your settings of High-res fix too though.
Because they both take ram. If you want the thing to run in 24 GB of VRAM, you can do either 1024x1024 and 2B params, or you can do 512x512 and 8B params
oic
And I'm highly confident that 8B params is going to do better than 2B
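A rough back-of-the-envelope sketch of the tradeoff described above. It assumes fp16 weights (2 bytes per parameter) and an SD 1.x-style VAE that downsamples each side by 8. Neither number comes from the chat, and real VRAM use also includes activations, the text encoder, and the VAE, so treat the output as an order-of-magnitude guide only.

```python
# Rough estimate only: assumes fp16 weights and an 8x latent downsample (SD 1.x style).
BYTES_PER_PARAM_FP16 = 2
LATENT_DOWNSAMPLE = 8  # assumption: 512x512 pixels -> 64x64 latents


def weight_gb(num_params: float, bytes_per_param: int = BYTES_PER_PARAM_FP16) -> float:
    """Memory for the weights alone, in GB (1 GB = 1e9 bytes)."""
    return num_params * bytes_per_param / 1e9


def latent_positions(resolution: int, downsample: int = LATENT_DOWNSAMPLE) -> int:
    """Number of spatial positions the denoiser processes for a square image."""
    side = resolution // downsample
    return side * side


if __name__ == "__main__":
    # The two configurations mentioned in the chat.
    for params, res in [(2e9, 1024), (8e9, 512)]:
        print(
            f"{params / 1e9:.0f}B params @ {res}x{res}: "
            f"{weight_gb(params):.0f} GB weights, "
            f"{latent_positions(res)} latent positions"
        )
```

Under these assumptions, going from 512 to 1024 quadruples the latent positions, and going from 2B to 8B params quadruples the weight memory, which is roughly why the two are traded against each other.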
as a side note, until the late 80s, PG rating included nudity.
Why not 768 4B? the sweet spot lol
I agree with Sen that we need higher than 512 at some point. I can be fine with 512 with more parameters for now. We have seen how good 1.5 models can be. But I would like 768 or more with more parameters, and 1024 is likely all we need. But I'm fine with waiting a year or 2.
768 in theory is already spectacular if it wasn't cucked into oblivion
I personally would bet 512 and 8B would beat 768 and 4B.
Your 512 will have better hands, better food, better fur, better water physics, etc
I agree taek. But both would be my preference when possible.
I think it's mostly speculation. But rumor has it that MJ needs like over 40GB VRAM to run.
Is this the official stability ai discord btw?
It was very hard to find lol
u know how those mods are. always out of touch
Yes this is the official discord I believe. 99.9% sure.
MJ developers are sometimes a pain to listen to. I think you may be talking about SD. But I hate listening to the MJ development meeting things. I feel like they drink a lot of soy milk.
lol
r0fl
One time they went on for like an hour about how they are worried about people using the Bikini prompt. And are secretly trying to nerf basically good looking women from being generated.
It was more than that. But that was the gist of it.
because muh feelings
Yeah
because muh unrealistic beauty standards
all the while true artists get royally fucked
Lets create an ART AI. And then judge our customers and shame them.
And nerf our product.
well same thing happened to chatgpt sadly
and I dont think we'll ever get a decent chatgpt competitor.
as soon as an uncensored gpt-4 model comes out, it will get sued into oblivion as soon as 1 person learns how to make meth
I do agree and I'm completely fine if they want to "hide" or reduce those kind of images from the public view. Like filter them down on the top image boards, their home pages etc. But don't nerf the model itself.
that's why chatgpt still gives you the disney-approved answers on anything remotely creative
they care about their bottom line $
lawsuits dont make money, but they sure hurt freedom
also
👀 couldn't help but lurk: https://huggingface.co/OpenAssistant/oasst-sft-1-pythia-12b
have you seen the average twitter moron
there were people sabotaging sd cuz they were scared people would use it to generate kids
Twitter is trash. I had hope when Elon took over. But it's... I don't even want to talk about it.
yea I have no faith in a corporate-sponsored AI product. If it happens, it's going to be crowdbased
the internet is too controlled these days bro. 1984 in the flesh
I wish I was joking
1.5 might be the last taste of true AI freedom we got to lick off the floor lmao.
Honestly a lot of twitter hate is just degenerate furry artists who want their gig to not be stolen by AI. Not that I blame them. lol But I don't think for a split second that they really care about those kind of images being generated. Sure some do and those are legit concerns. However they don't have their priorities straight. They may say that is their reason. But they don't like AI art in general.
Most of them also don't know how AI art really works. And get image to image mixed up with text to image.
So their IQ is far too low to take seriously. But sadly they reeeeee loud enough that it still causes problems
Llama 65 is pretty good!
Yes it is.
I use Llama to remove stuff
One of if not the best text removers etc.
Or removing an extra limb etc.
Text removal!?
Like know how an AI may spit out garbage text that you did not even ask for
But the rest of the image is godly.
Llama can like remove stuff
Llama is an LLM though
I don't know what that means.
It's a language model not an image model
Oh that's completely different
Oh okay sorry.
Llama 65 is Facebook's GPT competitor that got leaked so the weights are freely available
Neat
Are you using llama.cpp? I have the 65B downloaded in another computer that I'll have access only in a few weeks, and I'm wondering how much memory I would need to convert it to the format llama.cpp uses
I hope this is not true. Even if it takes a few years I want something else to come along. I wonder how far 1.5 can be pushed though? When do we simply see no improvements? hmmm... Everytime I think we have peaked like five new things come out. So I'm done guessing.
🐮 COW FACT TIME 🐮
Did you know that some of the most experienced moderators on Discord are actually cows in disguise? That's right - these clever bovines have learned to navigate the internet and communicate with humans in order to keep their servers running smoothly.
Despite their large hooves, cows are surprisingly adept at typing and using computers. They've even developed their own secret language of moos and cowbells that they use to communicate with each other while monitoring Discord channels.
Of course, it's not always easy for cows to blend in with human moderators. They sometimes accidentally reveal their true identities by randomly mooing during voice chats, or by expressing a sudden interest in grazing on Discord's grassy green color scheme.
But for the most part, these bovine moderators are experts at keeping their cow-ness under wraps. So if you've ever been moderated by someone with a mysterious affinity for milk and grass, there's a good chance you were being watched over by a clever cow in disguise!
🐮 END OF COW FACT TIME 🐮
🤔
Alright? Is that a ChatGPT generation or something? lol
I would never
lol
I run it on my a6000 (48 GB VRAM)
10 t/s
the server hasn't been bought no, and population changes quite a lot, depending on the weeks. we do have a very active anime community that tends to be younger sometimes, but there is quite a lot of representation of all ages, from our last "census"
GTX 1650 here, so no way it's going to the GPU
But the Ryzen 7 5800H can do something, I hope 🙏
For what AI? It's nearly all GPU based.
My local stack for 1.5 is way better than anything I've seen published 😋
So there are certainly more improvements to be made
Nice Taek!
I'm hoping to release the code soon, still making a UI
I am getting kinda good results with 7B and 13B models on CPU, 7B can get around 4 t/s, with 13B it's around 3 t/s
Yeah no rush. If/when you release it I'm sure it will get a lot of use and then someone else will improve it further. lol Why I love opensource.
For what AI?
LLaMA, 7B and 13B
Have you tried alpaca?
Ah okay. I was like.... SD barely uses CPU at all. I misunderstood.
Not yet
I tried 7B and 13B alpaca
both online stanford and offline 4-bit version using llama.cpp/alpaca.cpp
I can't wait till we see something like KoboldAI but using something as powerful as ChatGPT. If that makes sense? I don't know if anyone understands what I'm even talking about. But for role play, adventure and even erotic stuff.
have you tried pygmalion for that?
No
it's made for the latter
Llama 65b is really good at erotica
ugh 👀
I may look into that too.
I'll make you a story if you ask nicely :p
No I'm good. lol
😂
What's the best way to get started with Llama 65b on my PC? Can I use it directly on my machine? Like SD? If so am I good with 12700K and 4080?
And 64GB RAM
Sorry
64GB regular ram
I want to know how much memory is needed to convert LLaMa 65B to 4-bit too
Because my laptop has 40 GB and thanks to Lenovo's buggy firmware, it doesn't work with 64
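For a rough sense of scale, here is a minimal sketch of how big the raw weights are at different precisions. It only covers the weights themselves: quantized formats typically store some extra scaling data, and the peak memory needed while converting is not something this estimates.

```python
def weights_gb(num_params: float, bits_per_weight: int) -> float:
    """Size of the raw weights in GB (1 GB = 1e9 bytes), ignoring format overhead."""
    return num_params * bits_per_weight / 8 / 1e9


if __name__ == "__main__":
    # 65B parameters at 16-bit and at 4-bit.
    for bits in (16, 4):
        print(f"65B at {bits}-bit: about {weights_gb(65e9, bits):.1f} GB")
```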
not to be a minimod, but I think we can bring this over to #🌶|off-topic 😓
Supposedly it took a stack of 6 A100's and 3 days of compute
Fair enough
Wait so you can't run it locally? Stack of 6 A100 sounds insane? Is that a one time compute? I'm lost. lol
Nvm I looked into it. Yes Llama 65b is not a regular user product.
Has the stability ai team made any comments on porn in SDXL?
Yes again I'm curious as well. 2.0 and 2.1 did not have that right?
If they did not I don't think SDXL likely would? However hopefully it does and they learned of their mistake.
I'm hoping they realized:
- We figured out how to make really good porn anyway
- It really stunted the quality of all their non-porn images
As long as users have the weights, they will fine tune porn into the model
well, the position on the last models (2.1) was to kick NSFW out of the base models, yeah, and I don't think this is going to change in the next models, but I didn't see anything specific on that for the XL version.
My guess is that, yes, they know we can train it in. it's what open source and responsible use is there for. and yeah, some will do as they like whatever the rules.
But as a leading AI thing currently (not even sure how to characterise all those blooming currently), it needs to set the bar quite high for the official model, in order to prevent very damaging litigation, and to be able to come into each and every country following local laws.
The only thing that SAI only tries to remove is hard porn, not general nsfw iirc
yeah but it has some impacts on quality of NSFW for sure too, even if that isn't the main target
but yeah, base model are called based for a reason, they get trained. they aren't MJ good in any subject, but they are good in everything and we can train in the rest
and it's hard to spit on free cookies tbh too, when there are big technological steps crossed each model
They don't want to walk back the decision even though it very clearly hurt the overall performance of 2.1?
2.1 is clearly inferior to 1.5
In outpaint in Auto 1111, I could overlap the dream only go one box beyond the original image, and it would generate just that one box new. Now it's regenerating the image where it's overlapped, but it's not generating anything beyond that. Anyone know about outpaint, what setting this could be?
nevermind. I had img2img selected rather than dream
how to get updates from a git repository to files in my pc in the terminal?
'git pull' or 'git clone'?
git pull doesn't do it
fatal: not a git repository (or any of the parent directories): .git
you need to be inside the right directory
git offamylawn is to chase away hoodlums
hu
using CD, move into the directory
im in there by using cd
ok, then maybe you didn't install using git clone ?
if you installed by downloading the code as a zip folder
my automatic1111 web-ui is crashing every time I switch the model :/
then you need to update by doing the same
by using git clone?
update by downloading a new zip
if you install using "git clone", that makes a git directory.
download the zip, overwrite your old files
the thing is I moved the whole web-ui folder from C drive to D drive by copy pasting
it used to work before by typing git pull i think
check if there isn't a ".git" folder in your old install ?
it's an hidden folder
it should be inside automatic if you did a git pull
but the error "not a git directory" seems to say that no, you don't have it
.git is there
ok then make sure you are inside that folder in your console prompt maybe ? and run "git status" to know more on your git state
(inside that folder = inside webui, not inside .git, that wasn't clear)
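The "not a git repository (or any of the parent directories)" error means git looked in the current directory and every parent and found no `.git`. A minimal sketch of that same upward search, useful for checking which folder your terminal is really in. This is a simplification (real git also honours settings such as the `GIT_DIR` environment variable), and `find_git_root` is a hypothetical helper name.

```python
from pathlib import Path
from typing import Optional


def find_git_root(start: str) -> Optional[Path]:
    """Return the nearest directory at or above `start` that contains `.git`, else None."""
    current = Path(start).resolve()
    for candidate in [current, *current.parents]:
        # `.git` is normally a folder, but can also be a file, so just test existence.
        if (candidate / ".git").exists():
            return candidate
    return None


if __name__ == "__main__":
    root = find_git_root(".")
    if root is None:
        print("not inside a git checkout; `git pull` will fail here")
    else:
        print(f"git checkout found at {root}; `git pull` should work from here")
```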
is there a faq somewhere that states how to update, automatic only lists how to install
can try: make new directory. git clone into it. copy your changed files over from your install dir. do a git pull to fix anything terribly wrong. then run and see if it works.
click in the address bar inside Explorer, in a window that is in the directory you want, and type "cmd" then Enter
#🤝|tech-support also, because we can't post pics here, and this isn't the right place anyway
if cmd, think you have to change drive before cding?
in windows, you may need to do "cd /d F:"
to change directory first
F or another letter
or just F:
then maybe even git pull works now ?
because before I only had to copy the address of the directory in Explorer
or use powershell (:
git pull worked after the cmd
nice
yeah, changing drive through CD in the basic command prompt in windows may require to use the "/d" option
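Background on why this happens: cmd.exe remembers a current directory per drive, so a plain `cd D:\some\path` updates D:'s remembered directory but leaves the prompt on C:. A minimal sketch that picks the right command for a given move. The helper name and the example paths are hypothetical, and the function only formats a string (it uses `ntpath`, so it behaves the same on any OS).

```python
import ntpath  # Windows path rules, usable on any OS


def cmd_cd_command(current_dir: str, target_dir: str) -> str:
    """Build the cmd.exe command that actually lands in `target_dir`.

    Hypothetical helper for illustration; it only formats a string.
    """
    current_drive = ntpath.splitdrive(current_dir)[0].upper()
    target_drive = ntpath.splitdrive(target_dir)[0].upper()
    if target_drive and target_drive != current_drive:
        # Different drive: plain `cd` would not switch the active drive.
        return f'cd /d "{target_dir}"'
    return f'cd "{target_dir}"'


if __name__ == "__main__":
    # Example paths are made up; substitute your own install location.
    print(cmd_cd_command(r"C:\Users\me", r"D:\stable-diffusion-webui"))
    print(cmd_cd_command(r"D:\stable-diffusion-webui", r"D:\stable-diffusion-webui\models"))
```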
apparently the terminal wasn't directing me to the directory using 'cd' and i didnt notice it
so = cd /d (directory name)?
yes
from what I see here
```
C:\Users\berti_o7k3ry8>cd /d F:
F:\>
```
sorry no, not directory name, but drive name
then cd path
my bad both at the same time work
```
F:\AI\Software\Python>
```
will manually updating SD break AUTOMATIC1111's ui?
it shouldn't, no. check on #🤝|tech-support in case you want some assist around it maybe
is there a controlnet that lets me isolate the background so I can just add a background to a subject? seems like something so simple I forgot how to do it
if you have no background at all currently, canny should detect the edges of your subject quite easily.
it may produce a "faded" background, something with very few edges, since there aren't any other edges in your base picture, no background
I managed to do it with the classic midas script. How would one tell the canny to just paint outside of the subject edges, instead of inside?
the problem I see is to keep the subject intact honestly
canny doesn't seem the best to just create things out of white background
scribble would be
but scribble will destroy your main subject
maybe using a mask on it ?
canny will invent things for the background too, but it will try to do so while keeping the same canny preprocessed image
so in your case, it would mean the background would just stay fuzzy, to prevent edges
I'm not sure there is a lot you can do to change canny on that side. the best would be to push thresholds to their maximum while still having the outlines of your object
but that would allow it to modify the subject quite a lot too since so few details would be set in stone
to me, it mostly depends on :
- do you want to do a mask to preserve the subject ?
- how important are the fine details of the subject ?
Nah forget about all that G
just use midas invert
on general-with-images. invert depthmask does wonders
Realistic_Vision understands 4bpp on the prompt, that's cool
you can't get a "general" opinion in a place dedicated to AI enthusiasts
(thats not why he's here. don't feed vermin)
queue: 14/14 | 71.6/699.4s
Dang, the wait times for ModelScope Text to Video are getting pretty long.
I wish my PC met the minimum requirements to run it locally, but I only have 8GB vram
So wait, Wally West and Aqualad are now black?
the results are pretty crummy from what ive seen
or at least nothing that conventional sd to video cant already do
Yeah, still fun to mess with though. It reminds me of early text to image
does anyone know if it does nsfw?
the dc movies have ruined dc characters....aquaman is that awful hawaiianm guy, can't get SD to give the original Aquaman or Aqualad
Is there a way to check what objects and styles a specific stable diffusion model knows?
anyone know how to get green eyes? I tried (green eyes) to different strengths, but it makes the clothes green
green irises or green pupils maybe
Huh, it seems that Hugging Face wait times are specific to my IP address. I had a 1000 second wait to use ModelScope Text to Video, but after turning on my VPN it's only a 92 second wait
eh, green iris and pupils also does green clothes
I guess I have to inpaint
it will not do green eyes
it at best makes aqua blue
Hi, I have a general question about hosting SD on my main computer but accessing it on my laptop via a1111. I know that gradio isn't right but I can't seem to get Spaces to work either
Apologies if this is the wrong place to ask
Gradio works for like a few hours and I want to set up something more long term
I am using the --listen tag in my .bat
my first few steps with AI, I'm excited about the results... https://prompthero.com/BalrogDx and the wolf is as cute as I described 🙂 if anyone is interested in these prompts, here you go!
Hello
Can I ask for some feedback on my project?
I can’t tell if people like it or not, is it too long maybe?
edit the clips and do 1 upload nobody is going to click on that 5x
but you'll be good
make it 1 project
What do you mean one project
well you need to edit for comedic timing first of all
My goal is to replicate a real talk show though so around 15min makes sense
for sure, but I'd start with 5 good minutes and start stretching little by little. That's how they do it in the paying world
Are you saying it’s not high enough quality yet for 15min
I'm not saying anything, just offering my criticism. I might be wrong
I’m feeling pretty discouraged at the lack of attention it’s getting
I thought it’d be more of a hit
It’s entirely AI
No one else is doing something like this to my knowledge
How long has it been since you uploaded?
Well there you go you're freaking out over an hour of not getting likes on the internet. You're gonna have to get over that pretty quick. Sorry to be the one to tell you.
But still it's time to grow up and stop worrying about that shit. You'll be fine. Go do things you like
This is what I like
Spent all day on this one
I feel like from a technical perspective it’s a notable achievement
Awesome, so trim it down to a concise, journalistic 5 mins and see what people think. Nobody will spend 15 mins listening to someone they don't know. You have to ease your way in
Entirely GPT-4 written, stable diffusion animations, all visuals are SD and Midjourney, ElevenLabs voice cloning, even the music is all AI
Depends if the topic is hyped and how many people you can reach with it
Fair
If you create the vid in a interesting manner to keep watching that helps as well
just use the stability.ai tag on twitter Xd you'll reach plenty of people ¯_(ツ)_/¯
Also anyone that knows how you can easily generate the exact same character but in different poses?
No @StabilityAi
ControlNet
It doesn't generate my character exactly the same, and I need to put denoising strength on 1 to get changes in pose, and then it doesn't even match.
I need to create an animation using my photos.
https://prompthero.com/BalrogDx does someone have tips for the snow fox in this gallery? Getting good-looking fluffy fur is hard. Sometimes it's like spikes, sometimes it looks too animated. The 2 pics in this gallery are the 2 of 30 that looked nice
if someone has experience or ideas for getting nice fluffy fur, I'd be glad to hear them 🙂
I'm working on "Tyrannosaurus Rex runs a kitchen in 1972"
do u have a gallery to look?
ahh ok
nice pic, love the black and white retro style. Maybe a tip: try to describe the shirt collar, or the neck with an additional scarf.
only an idea, because the gap from neck to shirt is too big, but I think it's hard to describe
Ebsynth
It will use your photos as reference points
And animate the characters inside
Totally agree. I'm running a new one that's like 20 mins. 30 steps, batch count 10
Well nevermind Grdaio failed
*Gradio
starting a new one soon
had to reboot
A dinosaur created a successful restaurant. 35 steps
is there a stable diffusion discord only for anime stuff? or can I post my stuff in #🏞|general-with-images ?
yes but you can post in #1072013871730131004 or in #🏞|general-with-images
You need to train AI on some character first, people do Lora's for it usually.
I only have one img of the character that I made with SD
hello frens, i ran into a problem with inpaint section
hello everyone, i have a question. Can i generate an image from a clip vector using stable diffusion, if yes how ? I tried to use ddim sampler but it generates some images full of artefacts
bing image creator seems like its dalle-3 or something. but more new ai stuffs with mj5. good time to drop a new stable mode lol
Idk if we'll get actual new model until lawsuits will be figured out and what data sources we're able to use and whatnot
But I'd take huge 768x768 or 1024x1024 model for sure...and wait for community to do more things with it
adobe got some model coming too
going to be a lot of options in time. chatgpt is showing them how profitable this stuff is. the models (and cash) will flow
Emad sure taking a while to drop new models, maybe hes just gonna drop distilled XL and blow the competition out of the water
yeah they been quiet
Hello, I am a high school student who has read a decent number of deep learning computer vision papers, and I would like to ask if someone could give me advice on how to start learning PyTorch so I can implement these papers, and code my own when I write my own research paper in the future. I am already proficient with Python
i saw a video on training a diffusion model from scratch
on youtube
try that
Well let me tell you somethin', brother! It's fantastic that you're already proficient in Python, dude. That's like having a headlock on the programming world, man! Now, to get started with PyTorch, you're gonna wanna check out their official website and documentation, brother. They've got tutorials and resources that'll help you body slam those deep learning concepts, just like I did to André the Giant back in the day, man!
Now, when you're ready to implement those computer vision papers, think of each one like a wrestling match, brother. You're gonna need to study your opponent – in this case, the paper – and understand their moves and techniques. Break it down step by step, and don't be afraid to ask for help from the community, man. Those folks are like your tag team partners, and they'll help you bring home the championship belt!
is there an implementation of multi-controlnet in python/jupyter or a pipeline in diffusers? (i ran it in auto1111, but can't for the life of me understand where to start at, if i want to implement it from a python script)
hey guys, where do i put the anything-v3-vae.pt file that comes with the anythingv3 model?
right next to the model, with the same name except .vae.pt at the end
if you want it used automatically
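A minimal sketch of the naming rule described above: take the checkpoint's file name, swap its extension for `.vae.pt`, and keep it in the same folder. The folder and file names below are placeholders, not a claim about where your install keeps its models.

```python
from pathlib import Path


def vae_path_for(checkpoint: str) -> Path:
    """Path the VAE should have so it sits next to `checkpoint` with a `.vae.pt` ending."""
    ckpt = Path(checkpoint)
    # Same folder, same base name, extension replaced by `.vae.pt`.
    return ckpt.with_name(ckpt.stem + ".vae.pt")


if __name__ == "__main__":
    # Hypothetical file names; substitute whatever your checkpoint is actually called.
    print(vae_path_for("models/Stable-diffusion/anything-v3.ckpt"))
```

Renaming or copying the downloaded VAE file to the printed path is then a manual step (or a `shutil.copy` call).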
How do I report users spamming DM's?
Open a #1010934719455707218 ticket and send us a screenshot please !
Does anyone know of a way to change the lighting of an image either from the prompt or in post? prompt would be best as I get a lot of warm and mostly cold lighting in mine, but I don't mind post either.
https://clipdrop.co/relight something like this?
it's sort of stable diffusion since Stability owns CD now
I see that now. I remember this from a while back
my thing is I train a lot of images and I don't want to go through all 100+ of them to tag them with cold_lighting, warm_lighting, and what kind of lighting is this, for each one.
lighting colour affects the faces mostly
automation would help yeah. i seen some 2.1 negative embeds that may help
Naw, it gets baked into the training lora
can't be removed it has to be prevented at the source
can tag it away but that is hideous as I tried it yesterday and too much for me
lost all my hard work too when the extension decided to zap it on me. I ticketed that issue.
Does anyone have information regarding the legal regulations pertaining to the use of stable diffusion? I am currently working on a project that involves the use of the Dreamlike Diffusion 1.0 model and I am unsure whether I can share it on YouTube or use it for commercial purposes. Could someone please guide me on this matter?
can't you preprocess the images before training?
Sure if I had something to do but 100+ images in PS is a long time
anything that uses novelai as part of its merged base, i would avoid for many many many legal reasons. these are trade secrets that were stolen from their servers and leaked to the internet
if all images were too cold, or too warm then easy peasy but you get glows, and too cold, or too warm and sometimes both in the same image
outside of that, the legalities fall under regular copyright usages. Fair use exists. be transformative and you're golden.
In Automatic1111 WebUi, is it possible to disable the (i) button? I have some LoRAs and saw that button, clicked on one that should make cute creatures (CuteChibi), and there was prompts and tags in there that I do not want my friends to see, like... I can not write any of that here.
I am not sure the LoRA creator made or used those tags; sometimes that list can be rather long, while other LoRAs have very short info.
if you've got photoshop, there may be a fancy batch script that allows for dynamic color grading you're looking for
lol. i've not run into this issue, but i considered it. "What if some of my tools i dont want to show on streams or to a friend?" and my solution was this. Make a 2nd copy of the folder and sanitize the shit out of it. with fire! Use that for guests.
Thank you
Good morning Flo, trolling again this wonderful evening, feeling still the need to answer on everything, have an opinion that is not relevant.
tbh, I am not sure what I am looking for. I mean trying to get everything to have 5500k studio lighting is impossible.
Hey everyone, just a quick question. I am using local stable diffusion and I want to use the interpolation feature, but I cannot use it. Can you help me with this?
that was a legitimate answer, and yeah i come here to chat and answer questions.
You shouldn't say good morning if you don't mean it
i sort of know what you're getting at. I'm just not sure of the proper tools to do it. I don't think SD has discovered much in the way of color grading yet. We only just got offset noise for better dynamic range
There's a lot of improvement to be had in this area
who thinks auto1111 is an absolute piece of crap when it comes to code cleanliness ?
it's an "absolute piece of crap" in "a very narrow focus consideration" ?
{} works only for novelais Website.
The other one is used in Automatic1111 webui
its a very feature rich and cutting edge tool set that started as a MVP and has been wrangled together rapidly as new technologies come out. the field is moving far too fast to keep the code clean. It's the bleeding edge on that repo.
If you want a professional product that paid engineers to take their time to design UX features, you're probably wanting to subscribe to creative cloud
a1111 uses it as deemphasis too, but i like the (token:0.9) syntax better. ~~~ waiit i'm wrong. a1 uses [] for deemphasis
the speed at which the whole AI field has expanded these last few months is incredible
I'm just trying to create an extension to process videos, and... there seem to be some really bad interactions between gradio video input and the extension system made in 1111. So bad that you simply can't put a css to the video with gr.Blocks(css=...) and consequently, if you put a video whose height is pretty huge, then the video control goes all the way instead of fitting in the video in itself...
maybe 2-3 hours I'm trying to fix that stupid crap, I'm so fed up
where do i make art?
it probably has nothing to do with the code cleanliness or skill level of the people who work on automatic1111's repo
where do i make art? and whats the command?
deflecting blame for the problem onto nearby bystanders never fixes the problem, but it's something frustrated people will do consistently, for so many problems
Aye. I know from training attempts it will cook the gradient into the face so you get funky faces with the style.
man with nft pfp asks how to make art, lel
Maybe. But I've worked on 1111 code before, and ... how did they decide that 1000+ line code files and functions with 30+ parameters were a good idea ?
if it's expressive, it's art. even NFTs.
I can kick your ass in making art in adobe and 3d, am asking about this server mfer
There is no bot for making art here. Check #1072220168534642768 or #1080946152318443610
now now. y'all can make art all day. calm down. this server doesn't have a generation bot anymore. #1072229020520947753
already is publicly available through a few services 😉
I just do (word:1.1), (two words:1.2), (word, another word:0.5) instead, easier to read than lots of different () {}
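To make the `(text:weight)` form concrete, here is a toy parser in Python. It is a sketch, not the webui's own parser: the function name is made up, and it ignores nesting, `[]` de-emphasis and escaped parentheses.

```python
import re

# Matches "(some text:1.2)"; anything outside such groups gets weight 1.0.
WEIGHTED = re.compile(r"\(([^():]+):([0-9]*\.?[0-9]+)\)")

def parse_weights(prompt: str) -> list[tuple[str, float]]:
    """Split a prompt into (text, weight) pairs."""
    parts = []
    pos = 0
    for m in WEIGHTED.finditer(prompt):
        plain = prompt[pos:m.start()].strip(" ,")
        if plain:
            parts.append((plain, 1.0))
        parts.append((m.group(1).strip(), float(m.group(2))))
        pos = m.end()
    tail = prompt[pos:].strip(" ,")
    if tail:
        parts.append((tail, 1.0))
    return parts

print(parse_weights("(word:1.1), (two words:1.2), (word, another word:0.5)"))
```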
Is there a bot where we can do prompts?
Ah alright. Thanks!
make a room for this specific question I can throw people at
I guess it's just the midjourney thing
They have rooms for art prompts, so i thought we had it somewhere as well
we had it too, but I don't find discord bots comfortable to use anyway
it's good to use someone else's gpu, I guess tho 😄
The best!
https://cdn.discordapp.com/attachments/1073167875700949062/1088140079593173052/image.png
Guys, please help. How can I create such an illustration in Stable? What prompts and settings to use?
Hello , how can i starts creating those AI made painting
Yes thank you kind user
guys quick question, when I use control net does the image i put on the IMG2IMG part matter?
or is it just the prompt that affects the end result?
/hello good afternoon someone please tell me how to create the images here, please
#1072229020520947753
also, im pretty sure there is not a bot anymore for creating images
Free webinar on how to integrate your own information into generative AI systems. Maybe you're an author who wants to bring their book to life, maybe you're in a company and want to write using your internal information. If you want AI to write on your content exclusively, this webinar is for you.
Where: 🫴 https://lu.ma/rg32qsbx
Does anyone have any update on 3.0?
Anything negative happen if you use different resolutions on dreambooth?
What is the best tool currently for outpainting? I havent caught up
Is it still InvokeAI?
the best one I've used is a Photoshop plugin but its not free
SD outpainting, invokeAI or SD + https://www.painthua.com/
invoke stays top tier though
Painthua is more comfortable to work with imo, but invoke is fast
probably cause it uses DDIM no matter what you choose...
Ty. I remember using painthua but I liked InvokeAI. Time to do a git pull on that bih
I wish invoke had the same button to remove an image I don't want from the generated choices, so I can choose between only the good images back to back
just this one thing makes painthua so much better
Did invoke add DPM++ samplers btw?
It's been some time since I used it
I'll lyk once I get it to work
Hey 👋 guys, Is anybody successfully feeding / training their own AI data set on the cloud for their own version of chat gpt, personalised with their data set? Like their own AI assistant
.
excuse me, how do i use my seed to recreate the same photo
i dont seem be able to enter the seed anywhere
There's a seed field if you're using automatic1111 webui
Sorry I’m super new to this. Is there a link to that? Or how do I get there
Thank you. Do I need to install anything else to use this?
Any model you want from https://civitai.com/
into \stable-diffusion-webui\models\Stable-diffusion folder,
vae mentioned in model description
into \stable-diffusion-webui\models\VAE folder
But instead you can just hope someone in dreambooth chat will help you, I'm just not familiar with it
Anything negative happen if you use different resolutions on dreambooth?
yeh it'll blow up
u gotta do one rez at a time
preferably stick to the rez of the model
@past edge bummer. Makes the process a lot of work. Thanks for the feedback
any ideas why my auto1111 suddenly looks like this? I tried a full clean reinstall and all #🏞|general-with-images message link is to gen chat w img
Hey guys, please does anyone know if there is a Jax implementation of SD v2.1? I can only see one for v1.5
Thank you Soulless
!infractions
Watching this tutorial for Controlnet and Blender. https://www.youtube.com/watch?v=ptEZQrKgHAg
How in Blender do I get the hands and feet to show as in this video? I see bones, so it's hard to tell the orientation of the hand.
It's yet another simply do video. It's very simple....as long as you skip all the steps
cant get any of it to work. When I export, I only get the hands and feet for the depth map. In the video, he gets the whole body
guys
Ok, now Im trying openpose...no good luck there either
I just released a prototype of my game that uses stable diffusion as a key element. I could use some testers..... free beer! https://aigameboy.com
youre right i should have said it better
what is this community's opinion about ai in general
it's terrific
I think it's about to change everything we do, and nobody really knows yet
in other words, it changes nothing
Hey if you had to prune your model libraries down to 5 models, which ones would you keep? What's your top 5 models?
wow, this controlnet openpose is garbage
Hey guys is there a place where I can advertise a freelance project for prompt wizards / advanced users ?
hit us up in #1010934719455707218 with what you want to post so we can better advise, but I think it could be #1045770510153302086 , depending on the content
and I see you've been here a small bit over the time but not a lot, so let me extend the welcomes from here 🙂
Hey @vast ingot I'm looking for something similar
I would recommend the same then, contacting us through the ticket system, it's the easiest to hit our whole team, and I personally can't say the best recourse on that one, so I prefer to check with my mates
👌
we get quite some of that recently, so having tickets for it also helps us see the need for dedicated channel for that maybe.
Tbh it's a complicated subject, but I think having a common place for it would be best for everyone
Sounds good, thanks
How can I install a model that isn’t on Hugging Face in a Google Colab running Stable Diffusion (automatic1111)?
you can usually connect your Google Drive to your Google Colab, and automatic installs in there. Run it at least once like that, and you'll have all the folders in your Google Drive
then you can just download the model at home, put it in the stable-diffusion-webui/models/Stable-diffusion folder in your Google Drive, wait for it to synchronise
and run your notebook again, it should start a lot faster since everything will already be downloaded, and you should also see your new model, if it's in the right folder
outside of that method, you could also download the model at home, and just upload it to google colab once you have automatic running in there
but that would take ages off your free time
Why?
why what ^^
Why would it take my free time?
lots of things ^^
well, google colab gives you some free time to use it each day
and uploading a 2GB+ model from your home connection, that can take a while
Can’t you pay for more?
yep sure
Is it worth it
depends how much you want to do SD honestly... 2h free per day more or less is a lot imo, but I wouldn't be satisfied with it anymore personally
Also
If a model on civitai says no selling content with it
Can I still do it anyway
Without consequences
I am not selling the model I’m making pictures with it and selling nsfw content
if it has "no selling" on it, it's not about selling the model but the pictures, so you would be infringing on the copyright of the model you are using I guess, and they could technically sue you if they discovered it
I would need to read the conditions of civitAI to be sure, I haven't
but the no selling part is about pictures, from what I see when I publish a model myself
Google Drive upload doesn't count towards Google Colab time as far as I'm aware.
well, unless you do it in Colab directly
Yes that's why it's the first thing i suggested, seems the best to prepare your setup
But i guess if you intend to sell, and pay for colab, the question is a little different
Hey, we at blend created a product which magically changes backgrounds using Stable Diffusion for product photography. Do check it out, we have released it on Product Hunt. Please help us get promoted
https://www.producthunt.com/posts/blend-15bb8bed-1ea7-4cc9-aba4-c42e9ce6e7f7
How can you find out what model was used?
If it isnt a very unique model with a super distinctive style
Hmmmm...
Where can I use the bot friends?
I can't find it.
What are NCNN and ONNX and how are they used in upscaling? I see extra folders in upscaling models for these files, but i'm not sure where you would use them.
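ONNX is an open model exchange format (run with engines such as ONNX Runtime), and NCNN is a high-performance neural network inference framework from Tencent, aimed mainly at mobile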
for mobile phones? do people render AI images on their mobile phones?
I know ONNX, but I'm not sure about NCNN
Where can I use dreambot?
Currently, there is no bot on the server that generates images. However, there are plenty of other ways such as the official https://beta.dreamstudio.ai/ website or by running Stable Diffusion locally using your own hardware! Check out #1080946152318443610 for more details! You can also stop by #1025467151206854736 for any issues you experience while using DreamStudio or #🤝|tech-support for any problems you encounter while installing it locally!
Bot is down now

and you use that on your phone? because that's what i got from the link you shared as the main purpose
Maybe the inclusion of ncnn is an oversight
OH, NCNN actually has Windows support
thank you. link doesn't work. But how or where is it used? is this for A1111? or another UI?
Since i reinstalled my A1111 on a new computer (using a 4080) a lot of my images come out desaturated. Any ideas why?
never heard that specifically. maybe try #🤝|tech-support
looks like it's a vae problem. Thank you
Can you run SD using ths PC ram and not the graphics card VRAM?
No, but you can run on CPU mode
I wonder why.. Interesting though
Because the GPU vram has cores that can process the denoising which normal RAM cant
whats the best config to train loras on faces ?
how many repeats/total steps/epochs ?
Fancy that! I thought all RAM was made equal
Yea, it's not specifically the RAM, more the GPU cores and architecture that process the graphical stuff. It needs its own RAM for the tasks. GPU vram is much, much faster than normal RAM
Fun fact: you can run Windows XP fully installed on GPU vram xD
🐮Fun fact: You can fit an entire herd of cows in GPU vram! Just imagine, no more crowded pastures or messy barns - simply store your cows in the digital realm. Plus, with virtual reality technology, you can even milk them without the mess! Just be sure to give your graphics card a break - all that mooing can be quite the strain.
there are some command line flags you can use to move some operations to the CPU, thus using less GPU vram. Slows things down dramatically though.
Lel I was actually thinking: so nowadays GPUs are essentially minicomputers inside computer that take care of mostly everything graphic
https://www.youtube.com/watch?v=mJGiLUm0WZI
Everything you need to know about Chat-GPT👆
What was the trick/prompt symbol for "switch between these 2 prompts"
It was like [dog,cat] but I think that was blending
a busy city street in a modern city|illustration|cinematic lighting
https://github.com/AUTOMATIC1111/stable-diffusion-webui/wiki/Features#prompt-matrix
Thanks :)
you're welcome, it does a little more than you wanted though, I can't find the exact one you'd want
looking for it
[cow|horse] in a field
https://github.com/AUTOMATIC1111/stable-diffusion-webui/wiki/Features#alternating-words
that is the one you want
the first I linked would try all combinations of the words
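Roughly what "try all combinations" means for the `|` prompt above, as a Python sketch. The function name is made up and the ordering of the results is an assumption; it only illustrates the idea described on the wiki page, with the first part always kept and every subset of the remaining parts appended.

```python
def prompt_matrix(prompt: str) -> list[str]:
    """Expand 'base|opt1|opt2' into one prompt per subset of the options."""
    base, *options = [part.strip() for part in prompt.split("|")]
    prompts = []
    for mask in range(2 ** len(options)):
        # Bit i of the mask decides whether option i is included.
        chosen = [opt for i, opt in enumerate(options) if mask & (1 << i)]
        prompts.append(", ".join([base] + chosen))
    return prompts

for p in prompt_matrix("a busy city street in a modern city|illustration|cinematic lighting"):
    print(p)
```

Two optional parts give four prompts, three give eight, and so on.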
this one will just use each word as an alternative
damn not even
lol
this changes each step
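A small Python sketch of "changes each step" for `[cow|horse]`. It is a simulation of the behaviour described on the wiki, not the webui's code; the function name and the zero-based step index are assumptions.

```python
import re

# Matches "[a|b]" or "[a|b|c]" groups with at least two options.
ALTERNATION = re.compile(r"\[([^\[\]|]+(?:\|[^\[\]|]+)+)\]")

def prompt_at_step(prompt: str, step: int) -> str:
    """Resolve each [a|b|...] group to one option for a zero-based step index."""
    def pick(match: re.Match) -> str:
        options = match.group(1).split("|")
        return options[step % len(options)]
    return ALTERNATION.sub(pick, prompt)

for step in range(4):
    print(step, prompt_at_step("[cow|horse] in a field", step))
```

So the sampler sees "cow" and "horse" on alternating steps, which is why the result ends up as a mix of the two rather than a switch between two separate images.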
Lol that's interesting. It's cool though thanks for the help!
Hey guys im new to this and using automatic1111 stable diffusion and when i want to start the webui-user.bat then i get an error "Torch is not able to use GPU; add --skip-torch-cuda-test to COMMANDLINE_ARGS variable to disable this check". Can anyone help me? 😦
Whats your GPU ? For technical help #🤝|tech-support
its a amd vega 56
And hi
tried to delete the venv folder and reinstalling it all but doesnt help
directml version? whats that 😅
Thats the Automatic1111 version for AMD gpus
I think I'm gonna go back to using Chatbots mostly
@tulip beacon
Here is a short help for AMD copied from @still glacier
AMD GPU users have the following options :
- install linux and run the standard Automatic1111's "Stable-Diffusion-Webui" ; https://github.com/AUTOMATIC1111/stable-diffusion-webui/wiki/Install-and-Run-on-AMD-GPUs
- keep windows and use the experimental DirectML repository https://github.com/lshqqytiger/stable-diffusion-webui-directml in the above mentioned guide
- use the easier to install but less feature complete "nod-ai Shark stable diffusion" : https://github.com/nod-ai/SHARK/
- use ComfyUI (linux only) : https://github.com/comfyanonymous/ComfyUI
- use your cpu only (very sloooooow render times)
- use online services
https://github.com/lshqqytiger/stable-diffusion-webui-directml is this one the right one then? i got windows
oh yea u sent it i will try that thx ❤️
DirectML is what enables AMD cards to work on Windows for ML purposes. It's the less good cousin of ROCm on Linux. If you've got an AMD card, even without ML considerations, use Linux. It's just infinitely better. Features like FSR 2 on every game were first on Linux
Why are images considered inappropriate if there's no nudity in them
Using a filtered model none the less
sexualization is one of those things that "you know it when you see it" and if you start to define it, you end up with people doing ridiculous japanese censor bars from people trying to skirt the rules
mod discretion is best practice here
That's highly subjective imo
I get what you mean, but it kind of ruins it for the people not trying to do that
there are ai communities that are all about nsfw imagery. this isn't the place for that
The model I'm using doesn't allow NSFW
the 2 ones I had a problem with, one of those had nudity and boobs on a young girl.
The other was not direct nudity like that but was more borderline like you were told by the other user, and I think they were right on this.
the rule 4 goes a little further than just nudity. sexual intent can also lead to that, or gore for example.
Mixing with younger subject, we are more proactive when getting close to the line, following a big increase in rightful reports, like the last few pins indicate there.
I am sorry I needed to stop some of those, I didn't want to just stop the party, but I need to act on those when they get reported, and they do, to be able to still have a party
I agree that some lines shouldn't even be tested or even approached
i have a little room for understanding here, since many users are 15-18 themselves, but it's important to set expectations here.
What is the problem here? :
Installing requirements for CodeFormer
Installing requirements for Web UI
Launching Web UI with arguments:
Traceback (most recent call last):
File "C:\z\AI\stable-diffusion-webui-directml\launch.py", line 377, in <module>
start()
File "C:\z\AI\stable-diffusion-webui-directml\launch.py", line 368, in start
import webui
File "C:\z\AI\stable-diffusion-webui-directml\webui.py", line 15, in <module>
from modules import paths, timer, import_hook, errors
File "C:\z\AI\stable-diffusion-webui-directml\modules\paths.py", line 26, in <module>
assert sd_path is not None, "Couldn't find Stable Diffusion in any of: " + str(possible_sd_paths)
AssertionError: Couldn't find Stable Diffusion in any of: ['C:\z\AI\stable-diffusion-webui-directml\repositories/stable-diffusion-stability-ai', '.', 'C:\z\AI']
your venv seems broken. 3 steps :
1/ delete venv folder inside AUTOMATIC
2/ restart automatic
3/ ask in #🤝|tech-support , they know quite a lot their stuff 😉
sorry I don't know that directml version, I may be wrong, I misread
it seems like amd users need to use directml version no?
yup. or install it with ROCm on linux. Guiz likely is right. Venv folder shenanigans. delete c:\z\AI\stable-diffusion-webui-directml\venv the entire folder. Run start bat again and it'll rebuild the venv fresh
Okay i will try that
Can you recommend like models or checkpoint how they are called that are working good, with no restrictions to experiment? im new to this and rly dont know, i got the 1.4 original version from hugging face but idk if its good
That's often not the case. Anything based on 1.5 will allow NSFW even if the author says it's not capable of it. There's also the case of the nsfw filter which blacks out images that meet a threshold detection. That won't catch everything that can subjectively look nsfw though.
#🤝|tech-support would help you with a better investigation hopefully.
dont seem to be an active channel.. so i hoped to find help here lol
but i asked there
gl
@astral goblet u got any recommendations?
delete the venv
it is the most active on the whole server usually, but yeah, not right now lol
not to fix my problem i mean which model xd
oh okay well i try to google it
there are so many models. 1.5 is an upgrade over 1.4 imo.
100% for me too
Runway ML did well there. I wonder what the drama with the release was all about. Nothing really was revealed about that. Looked like damage control went on behind closed doors though
Football edits with messi and ronaldo logo
found something here its a safetensors file but idk if i can use that on stable diffusion too? https://civitai.com/models/4823/deliberate
yup. All of the checkpoints on civit are for Stable diffusion
I'd like to tell but I can't really. nothing really exciting tbh, mostly some miscommunication leading to public confusion
sometimes for their checkpoints, it'll tell you an additional configuration file is needed. a .yaml file that goes in the same folder
yeah i'm sure it's mundane branding stuff and not as exciting as like, elon firing people, but i like to know things. inputttt. need inputtttt
and how do those models differ? some are trained more than others?
yeah! they're all refinements and merges of models people have made
so it seems i can cancel my midjourney now?
not only can people refine models with their own image sets, you can breed models too. like they're some kind of pokemon
sure! i never subbed i just made a couple accounts for trial access
yea i didnt know that i can host an text2img ai on my own
hurray technology!
wondering if there will be a way to host chat ai like gpt so there is no need for openai giving access to the service
yup!
base models, official ones, are trained "from scratch" using a giant dataset
most other models are refinements of those generic models in a given direction, like anime or photorealistic
stanford released alpaca! it can run on people's home pc's with 32gb of system ram
just the other day
there is already a way to run a GPT like at home, and fine tune it to your needs yeah
oh for real? so it does the same job as chatgpt?
it gets close! it performs very well for being a much smaller model
it's like DallE and MJ are "the same", it does great text too, but will be differently tuned
but alpaca is for image generation and not chat no?
oh i see
https://www.youtube.com/watch?v=BKb_AnREvvY found this do you mean that?
yeah thats right
but alpaca is waitlist i guess..
afaik you can get the model today. people have trained it using the data in the github. or it leaked. i haven't looked too into it. i only have 32gb of memory
in chatgpt i still got the problem that my history is missing.. hope they are fixing it fast
it's there for me right now, but it's missing 90% of the time for me too
i didn't think it was keeping my history at all then one day i used it and stuff from the very beginning was there. so it does keep it , even if its not accessible most times
just wanna get stable diff work but i dont find any solution..
is there a hardware reason you are using this directml version ?
I'd love to help you but that is the part I don't know about, never used that one
idk someone told me i need to use directml cause i have amd graphics
you are on AMD, ok
@vast ingot
so idk if i rly need to
cause the normal version didnt work too
the normal version won't work on AMD without adaptation yeah
SD came out on NVIDIA exclusively at first, and has been adapted since, so AMD support is lagging a little right now
so, you went with the directml version
downloaded it
you checked that you had both Git and Python 3.10.6 in the path ?
you can check by opening a cmd prompt window and typing : git --version and then python --version
Python 3.10.10
that seems to be wrong yeah, you should try to downgrade to 3.10.6 to be sure
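The same check can be done from Python itself. A minimal sketch: the function name and the returned keys are made up, and "3.10" as the expected minor version is taken from the advice in this chat rather than from any official requirement list.

```python
import shutil
import sys

def check_environment(expected=(3, 10)) -> dict:
    """Report the running Python version and whether git is on PATH."""
    return {
        "python_version": ".".join(map(str, sys.version_info[:3])),
        "is_expected_minor": sys.version_info[:2] == tuple(expected),
        "git_on_path": shutil.which("git") is not None,
    }

print(check_environment())
```

Run it with the same `python` that the webui launcher picks up, otherwise it reports on a different install.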
I'm following this part
so i remove python 3.10.10 and install 3.10.6?
yep
being the above version, in machine learning, isn't always best
it needs all dependencies to be working on the other version too, and can break things
like 3.11 for sure breaks things here
okay im installing it rn
you will need to delete the "venv" folder again too
ye doing it too
since python version is duplicated inside it
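Deleting the venv by hand in Explorer works fine; for anyone scripting it, here is a hedged Python sketch. The function name is made up, and the install path you pass in is your own.

```python
import shutil
from pathlib import Path

def remove_venv(install_dir: Path) -> bool:
    """Delete install_dir/venv if it exists; return True when something was removed."""
    venv = install_dir / "venv"
    if venv.is_dir():
        shutil.rmtree(venv)
        return True
    return False
```

Calling `remove_venv(Path(...))` with the webui folder and then starting webui-user.bat again should rebuild the venv with whichever Python is now first on the path.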
also, one thing that breaks : don't run the webui-user.bat as admin
brb
did you ping me ?
Well I guess you handled things :p
Sorry things are kinda hectic at the moment on my side. I don't do that much SD customer service at the moment :p
oh thats how u call it xd
@vast ingot still not working... still cant find stable diffusion..
what's the error message ?
Installing requirements for CodeFormer
Installing requirements for Web UI
Launching Web UI with arguments:
Traceback (most recent call last):
File "C:\z\AI\stable-diffusion-webui-directml\launch.py", line 377, in <module>
start()
File "C:\z\AI\stable-diffusion-webui-directml\launch.py", line 368, in start
import webui
File "C:\z\AI\stable-diffusion-webui-directml\webui.py", line 15, in <module>
from modules import paths, timer, import_hook, errors
File "C:\z\AI\stable-diffusion-webui-directml\modules\paths.py", line 26, in <module>
assert sd_path is not None, "Couldn't find Stable Diffusion in any of: " + str(possible_sd_paths)
AssertionError: Couldn't find Stable Diffusion in any of: ['C:\z\AI\stable-diffusion-webui-directml\repositories/stable-diffusion-stability-ai', '.', 'C:\z\AI']
using directml cause i got amd hardware
Perhaps something went wrong during the git clone process
you might want to git clone the repository again in a fresh folder.
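To see which of the folders from the error message actually contains a Stable Diffusion checkout, something like this Python sketch can help. It is not the webui's code: the function name is made up, and the marker file `ldm/models/diffusion/ddpm.py` is an assumption about what a valid checkout contains.

```python
import os

def find_sd_path(candidates, marker="ldm/models/diffusion/ddpm.py"):
    """Return the first candidate directory containing the marker file, else None."""
    for candidate in candidates:
        if os.path.exists(os.path.join(candidate, marker)):
            return os.path.abspath(candidate)
    return None

# Candidate folders as listed in the AssertionError above.
print(find_sd_path([
    r"C:\z\AI\stable-diffusion-webui-directml\repositories/stable-diffusion-stability-ai",
    ".",
    r"C:\z\AI",
]))
```

If this prints `None`, the `repositories` folder is likely missing or incomplete, which would fit a clone that did not finish.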
i will try that
3.11 will break things yeah, but 3.10.6 also really broke things for me. I haven't investigated further on this matter, but i think it had to do with the e-cores that my Alder Lake intel has. 3.10.9 got rid of the NaN errors that would randomly appear. nothing else other than updating 3.10.6 to a newer 3.10 version fixed this
i've not found any instances where 3.10.9 or 3.10.10 break anything
so .11 is bad and .10 is good?
3.10.9 is generally safe, we've seen instances of install getting borked with 3.10.10 however
installed .6 but doesnt change anything in my error then maybe i can install a newer version
yeah. they update both python versions too. neat development crew over there. it's been going for quite some time. i've never bothered to learn it
make sure to rebuild the venv after updating python
I don't think it's the kind of error that would get fixed by updating to python 3.10.9. But sure if you want to give it a go you can try after git cloning into a clean folder. Just make sure to delete the venv folder again if you choose to do so
$ cd C:\z\Stable Diffusion
bash: cd: too many arguments
what is the problem now?
in the git bash
don't use git bash for anything but git related commands
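For what it's worth, the "too many arguments" message most likely comes from the space in the folder name: without quotes the shell hands `cd` two arguments. Python's `shlex` splits text roughly the way a POSIX shell does, so it can show the effect. This is an illustration only, not a recommendation to use git bash here.

```python
import shlex

# Without quotes the path is split at the space (and the backslashes are
# treated as escape characters), so cd receives two arguments.
unquoted = shlex.split(r"cd C:\z\Stable Diffusion")

# With quotes and forward slashes the path stays one argument.
quoted = shlex.split('cd "C:/z/Stable Diffusion"')

print(unquoted)
print(quoted)
```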
"run it as an admin" is such common troubleshooting advice and i hate it. It's bad security policy and it always chews up folder permissions so that more problems crop up down the road. There's usually always a better way of fixing something imo. Running as an admin is like running while shooting a gun aimlessly
yup
but people are lazy and "but mah youtuber told me to do so !"
they don't know the risk.
yeah its a very lazy troubleshoot. if it's ever done, it should only be done to see if it does make it work, then turn it off right away before it fudges up folder perms or allows some remote code to execute
I think it comes out of cracked games culture a lot. People download a crack. nfo file says "run it as an admin! The virus scan is a false positive. TRUST ME!" and then it works so they learn that it works
and the cycle continues
deleted sd 1.4 so downloading again 🙂
4gb takes short time
Hey all. Sorry I'm really out of the loop, what's the state of the art with respect to making reasonable hands? Has anyone trained a model that inpaints fixed hands? That would be awesome. My workflow for generating art works well except that 90% of the time is fixing hands!
whats fixing hands?
Taking generated images that have gnarly weird hands and making them better
you'll learn soon enough that SD is really bad at generating accurate hands
oh okay
there are a lot of good quality memes about this problem
Yes I know this, it's why I am wondering what the state of the art is on "fixing" bad hands. I can create tons of images that have 99% of what I want, but then I have to spend ages manually fixing hands. Just wondering what solutions people are using for this.
I'd say take a look at controlnet for that
i've downloaded an extension called depth library. lets me take hand depth estimations and line them up over an image, then use that to control net new hands
My current approach is to manually paint in the general shape I want and then iteratively allow stable diffusion inpainting to create detail and lighting match for the scene. Except it takes many iterations and is really slow and painful.
A new release of pip available: 22.3.1 -> 23.0.1
is that important? @still glacier
nope you can ignore that
Thanks I'll read about this.
One limitation i've found with that, is that a lot of the depth detail is lost when you scale the image. they could do with better interpolation / scaling code here
it does help a lot though
still the same error... @still glacier
One of my problems is that I don't want to/can't afford to spend dozens/hundreds of hours constantly staying up to date on new features and their infinite myriad of ways of being used. I have a bespoke script-based workflow and it's really hard to fit new stuff in since these things tend to get tightly integrated with specific web based UIs.
there were no errors during the git clone ?
can you share the whole log when you get the error and not just the end ?
screenshots are ok
Also we might want to move to #🤝|tech-support
dont have nitro need to split the message
Creating venv in directory C:\z\StableDiffusion\stable-diffusion-webui-directml\venv using python "C:\Users\thoma\AppData\Local\Programs\Python\Python310\python.exe"
venv "C:\z\StableDiffusion\stable-diffusion-webui-directml\venv\Scripts\Python.exe"
Python 3.10.9 (tags/v3.10.9:1dd9be6, Dec 6 2022, 20:01:21) [MSC v.1934 64 bit (AMD64)]
Commit hash: f64ad25386ad77d1e07986a204dac923b9a46894
Installing torch and torchvision
Collecting torch==1.13.1
Using cached torch-1.13.1-cp310-cp310-win_amd64.whl (162.6 MB)
Collecting torchvision==0.14.1
Using cached torchvision-0.14.1-cp310-cp310-win_amd64.whl (1.1 MB)
Collecting torch-directml
Using cached torch_directml-0.1.13.1.dev230301-cp310-cp310-win_amd64.whl (7.4 MB)
Collecting typing-extensions
Using cached typing_extensions-4.5.0-py3-none-any.whl (27 kB)
Collecting pillow!=8.3.*,>=5.3.0
Using cached Pillow-9.4.0-cp310-cp310-win_amd64.whl (2.5 MB)
Collecting requests
Using cached requests-2.28.2-py3-none-any.whl (62 kB)
Collecting numpy
Using cached numpy-1.24.2-cp310-cp310-win_amd64.whl (14.8 MB)
Collecting idna<4,>=2.5
Using cached idna-3.4-py3-none-any.whl (61 kB)
Collecting urllib3<1.27,>=1.21.1
Using cached urllib3-1.26.15-py2.py3-none-any.whl (140 kB)
Collecting certifi>=2017.4.17
Using cached certifi-2022.12.7-py3-none-any.whl (155 kB)
Collecting charset-normalizer<4,>=2
Using cached charset_normalizer-3.1.0-cp310-cp310-win_amd64.whl (97 kB)
Installing collected packages: urllib3, typing-extensions, pillow, numpy, idna, charset-normalizer, certifi, torch, requests, torchvision, torch-directml
Successfully installed certifi-2022.12.7 charset-normalizer-3.1.0 idna-3.4 numpy-1.24.2 pillow-9.4.0 requests-2.28.2 torch-1.13.1 torch-directml-0.1.13.1.dev230301 torchvision-0.14.1 typing-extensions-4.5.0 urllib3-1.26.15
[notice] A new release of pip available: 22.3.1 -> 23.0.1
[notice] To update, run: C:\z\StableDiffusion\stable-diffusion-webui-directml\venv\Scripts\python.exe -m pip install --upgrade pip
Installing gfpgan
Installing clip
Installing open_clip
Cloning Taming Transformers into C:\z\StableDiffusion\stable-diffusion-webui-directml\repositories\taming-transformers...
Cloning CodeFormer into C:\z\StableDiffusion\stable-diffusion-webui-directml\repositories\CodeFormer...
Cloning BLIP into C:\z\StableDiffusion\stable-diffusion-webui-directml\repositories\BLIP...
Installing requirements for CodeFormer
Installing requirements for Web UI
Launching Web UI with arguments:
Traceback (most recent call last):
File "C:\z\StableDiffusion\stable-diffusion-webui-directml\launch.py", line 377, in <module>
start()
File "C:\z\StableDiffusion\stable-diffusion-webui-directml\launch.py", line 368, in start
import webui
File "C:\z\StableDiffusion\stable-diffusion-webui-directml\webui.py", line 15, in <module>
from modules import paths, timer, import_hook, errors
File "C:\z\StableDiffusion\stable-diffusion-webui-directml\modules\paths.py", line 26, in <module>
assert sd_path is not None, "Couldn't find Stable Diffusion in any of: " + str(possible_sd_paths)
AssertionError: Couldn't find Stable Diffusion in any of: ['C:\z\StableDiffusion\stable-diffusion-webui-directml\repositories/stable-diffusion-stability-ai', '.', 'C:\z\StableDiffusion']
Press any key . . .
yup. i fully understand the need for a consistent workflow. i'm still redeveloping mine to a point where i can start "working" uninterrupted. I keep getting hit with new tech and i'm like "ooooo"
but we hit limitations on our workflows, like you've run into with hands, and we redevelop them and integrate new systems. i would fully recommend you investigate controlnet in your workflow. it's such a powerful tool that it will likely be as default as something like negative prompts. Novel additions that are basically default SD now.
@still glacier just dont know what it means that it cant find stable diffusion
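That assertion only means that none of the listed folders looked like a Stable Diffusion checkout. Here is a minimal sketch of that kind of check, assuming the launcher looks for a marker file inside each candidate folder. The marker path `ldm/models/diffusion/ddpm.py` and the helper name `find_sd_path` are assumptions for illustration, not copied from the webui source.

```python
import os

# Hypothetical marker file: assumed to exist only inside a real Stable Diffusion checkout.
MARKER = os.path.join("ldm", "models", "diffusion", "ddpm.py")


def find_sd_path(candidates, marker=MARKER):
    """Return the first candidate directory that contains the marker file, else None."""
    for candidate in candidates:
        if os.path.exists(os.path.join(candidate, marker)):
            return os.path.abspath(candidate)
    return None


if __name__ == "__main__":
    # Same three locations the error message lists.
    root = r"C:\z\StableDiffusion\stable-diffusion-webui-directml"
    candidates = [
        os.path.join(root, "repositories", "stable-diffusion-stability-ai"),
        ".",
        os.path.dirname(root),
    ]
    found = find_sd_path(candidates)
    print(found or "Stable Diffusion repo not found in: " + str(candidates))
```

If this reports nothing found, the `repositories/stable-diffusion-stability-ai` folder is probably missing or empty. That would fit the log above, which shows clones for Taming Transformers, CodeFormer and BLIP but none for the Stable Diffusion repo itself.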
professional animation studios will use long outdated software because thats where their workflow is established and thats what works
no rockets as big as the Saturn V have launched since the apollo program. it's because all those old workflows pre computer aided design, were lost and the new tools were not re established until lately. Now we got Starship up on the same pad that Saturn V sat on. Nice to see
there is so much value in having a consistent workflow. i could draw 10k more examples
My specific use case is to generate characters for a game, where each character has 9 variations representing increased "level" of the character. The specific challenge is on getting 9 versions of the same character that look like they've age-progressed.
As a result I don't focus as much on having the most "beautiful" images possible since it's so much effort just to get age-progression that infinitely tweaking style just isn't something I have time for.
I have a process that works, it generates ~1,500 images overnight for me to sift through, and they're usually good enough that it only takes 30 minutes or so to sequence a few characters. But then -- hand fixing ends up taking another 2 - 3 hours per character. It's by far the most time consuming part of the workflow.
For example:
#🏞|general-with-images message
Almost all the hands in those images were gnarled lumps that had to be fixed "manually". I'm not an artist, I'm a software developer, and so I don't ever draw final pixels, I use a combination of rough manual sketching followed by inpainting. Although I'm actually starting to get reasonably good at just drawing hands outright I've done it so much now.
in a1111 Is there an easy way to find all your embeddings? I've installed a few but didn't write down all the key words.
they are in the embeddings folder, and you can easily access them by clicking the third button under the generate button then select textual inversions
But what about the trigger words?
its the name of the embedding
https://glaze.cs.uchicago.edu/
So anyone seen this? It does something to the images that doesn't just make it difficult to replicate but poisons the model's weighting system, causing it to corrupt the rest of the model.
with loras, there's something i'm annoyed by. I often check if an embedding is working by looking at the generation details and seeing that the embedding was used. Is there a way to enable this for loras? Also, when using loras, A1's ui puts the lora file name into the prompt, but do you also have to use the trigger word?
like lora:name:1 will often work for many of them i download, but if i train one its like i need to use a key token to make it work. it'd be nice to see in the generation details if it was being loaded at all
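One rough way to at least see which LoRA tags a prompt references is to parse the `<lora:name:weight>` tags out of the prompt text. This is a minimal sketch. The helper name `find_loras` and the example prompt are made up, and it only inspects the prompt string, so it cannot confirm the UI actually loaded the file or tell you whether a trigger word is needed.

```python
import re

# Matches tags of the form <lora:name:weight>; the weight part is treated as optional.
LORA_TAG = re.compile(r"<lora:([^:>]+)(?::([0-9]*\.?[0-9]+))?>")


def find_loras(prompt):
    """Return (name, weight) pairs for every LoRA tag found in the prompt."""
    return [
        (name, float(weight) if weight else 1.0)
        for name, weight in LORA_TAG.findall(prompt)
    ]


if __name__ == "__main__":
    # Hypothetical prompt: "mytoken" stands in for a trained trigger word.
    prompt = "portrait of a knight, mytoken, <lora:myStyle:0.8>, <lora:other:1>"
    for name, weight in find_loras(prompt):
        print(name, weight)
```

Running this over the prompt saved in the generation details would show which tags made it into the final prompt, which is a partial substitute for the missing "was it loaded" indicator.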
I've got access to Google's Bard. It's amazing ..... how bad it is
I have been focusing the past month on fantasy character portraits, but now that I have switched to landscapes of high-fantasy worlds, I am having difficulty with finding a good model/checkpoint or Lora Style. Not sure if anyone has any suggestions, here.
Ideally something where I wouldn't have to make very bloated prompts in order to achieve a somewhat consistent result.
bard was a rush/panic job.
Bard would have been cool a year ago, but OpenAI put the bar pretty high
So I was wondering how viable combining stable diffusion and 3d animation software to make 2d animations would be:
Taking something like Cascadeur (I think that’s the name, the 3d animation software that essentially uses AI to make a lot of the finer details of animation) and essentially make a 3d animation with simple mannequin models that are roughly the body shape of the 2d character you want, setting up the camera angles and color coding each model. Then using diffusion to essentially go through each frame and create the 2d animation, using the color coding to assign different “models” that would be trained on those specific characters. Making it so that you made the animation in 3d with only guidelines to convey different characters and set pieces, then diffusion put down the actual world, almost tracing over the models.
i dont even think it's that bad. it has a few obvious fail cases and theres lots of stuff for the antiwoke people to hate and meme about, as they usually do, but "how bad it is" seems a bit hyperbolic. I'll likely use it for a lot of different tasks when it comes out. As well as the other branded systems too. Brand loyalty is just stupid. Use the tools that work for the task instead.
Same way i use windows sometimes, linux other times, chrome, firefox, edge, i alternate. blind loyalty to one brand is just a bad habit
I asked it to write me a game in python using pygame, it didn't import pygame. Then used functions that don't exist. Nothing woke about syntax errors..
you'll want to watch corridor's RPS animation video. They break down the process very well. A lot of the same techniques can be reapplied to what you want to do
pretty hilarious i guess... again, there are fail cases. i never denied that. but here you are insisting it has nothing to do with anything woke. weird to fixate on that and nothing else that i brought up though
i mean, the antiwoke mafia are loud and proud about how angry they are with these new systems. I only brought it up to recognize that major criticism that goes on, not to accuse you of it
you're either a troll... or you're piping your responses through bard. either way you fail the turing test.
it'd be like not addressing the elephant in the room
I wonder if I can img2img those pics that were wrongfully removed
nope. it's just weird that you're suddenly this enraged about the antiwoke aspect being mentioned at all
arguing with mods and trying to skirt their enforcement probably isn't going to get you as far as you think it will
That's why I want to img2img them
I'll make clean versions and an actual NSFW version, lol
Everyone wins 🥳
it's probably just smarter to not test those rules instead. they described good reasoning for why it was removed and i think that you should go read those again and respect that decision, instead of casual accusations of wrongful action
That's too much effort just to throw your image into a discord channel where it'll be forgotten in 5 mins lol
ikr
it's not about posting at this point. it's about getting past the filter somehow. it's a puzzle. i get it. it's just... don't
filter is pretty strict btw, it seems to think it found boobies when dress is too white color or something, but eh, whatever, I don't really care if some images are ignored
That happens to me a lot too
Especially on #1072016199837290536 and #1072017962531307540
With AI doing better and better I'm sure they will eventually
And then they'll be filtered by... Humans! 🤪
humans are all out of jobs. get real
we've exited the information age and are entering the cat lifestyle age
heyyy
cat lifestyle age lmfao
that would be true if the people running this world were anything like cat owners but nope.
they will rather let us starve than give us handouts even if robots took all of our jerbs
how to use bot stable diffusion?
Where do the handouts come from?
profits from substituting humans with machines?
ok
Man MJ5 is spectacular, they just seem to take everything released for SD and make it better lol.... No controlnet though, that's a big downside right now.
@sweet turret
Currently, there is no bot on the server that generates images. However, there are plenty of other ways such as the official https://beta.dreamstudio.ai/ website or by running Stable Diffusion locally using your own hardware! Check out #1080946152318443610 for more details! You can also stop by #1025467151206854736 for any issues you experience while using DreamStudio or #🤝|tech-support for any problems you encounter while installing it locally!
@near silo
MJ V5 is fine, but it's not as good as people hype it up to be
What I do know for sure is that Adobe Firefly is trash currently lol
Though I can't say if I dislike Adobe or Midjourney more
On the contrary the results Im seeing in the MJ5 beta feed are out of this world.
I've been looking as well and meh, I don't see it
Like I see it, but it's not that impressive IMO. And most of the results can be easily mimicked or surpassed by stable diffusion
at last
someone who has a good eye like me
yeah me and this dude got into a flame war yesterday over which model is best
As much as I loathe MJ, that model is incomparable
the amount of fine detail is insane
It really wasn't a flame war, and It wasn't an argument either. You sent a very lackluster generation from MJv5, I along with the rest of the server all said that it wasn't impressive, then you sent another one to which I replied immediately after with a higher quality generation of the same prompt in stable diffusion
AI image generation is dying. Civitai is receiving DMCA notices over images generated with LORA's.
"dying" is a bit dramatic lol
Im trying to figure out controlnet to pose hands, get better hands and feet. Any tips on that? I tried some blender thing, but despite following the tutorial couldn't get it to work
I'm an SD Master, and it would take me a TON of time to get the type of output MJ5 produces.. alas it's always been this way, it's a game of catch-up. I still think SD is far more powerful over-all, but for quick beautiful renders nothing like MJ.
If all the LORAs and models have to be taken down, can't be distributed, that's very bad
People will just share them in other ways. Something as simple as that isn't going to stop a community so resourceful
any way they share that can be found - it will be taken down
It will be fine lol
tell me how to make good hands and feet with controlnet
oh master
There's a lot of info on this if you search Youtube, there are already control-net hand models. Most of the time you just need a template. Lots of Auto1111 tools for it.
openpose, for example, can do only basic poses. walking, running? not even that
I agree that it would take a lot of work, but it's still achievable.
I have no doubt in my mind that mid journey could produce some insane images, however I just haven't seen any
I have searched youtube. I tried the blender..,.cant make it work to even generate the depth map image
You're not doing it right then.
I followed it step by step. The simplest part doesn't come out right...
the open pose in a1111...that image is so simple. It can't do much re poses
It's really such a shame that midjourney doesn't benefit anybody
It is really good tech, but for the benefit of nobody
I used to support them until I realized they are as malicious as they are
What is a LORA?
You slap it on top of model and it teaches it how to do a new style/character
Yeah, I train LoRAs
how do I apply them? I downloaded one and put it in models/LORA, I am guessing the "trigger word" does something?
Trigger word is what you have to put in the prompt for the LoRA to activate
i see
Hop over to #🏞|general-with-images
Hey guys , is there something like rundiffusion which offers automatic1111 and invokeai but allows me to upload my custom model ?
is there a place where I can see prompts others have used and their corresponding images?
all over civit
right thanks
i dont like a lot about civit but its really a good resource if you don't mind all the smut there
whats up
So you're the real owner of this server, ye?
???
@delicate oxide
yoo no way
please do not repeatedly ping
any plans for a 1024x1024 model pretty please
https://www.youtube.com/watch?v=-tTTyodNqrI LADIES AND GENTLEFISH - MY NEW VIDEO IS OUT!
Please share if you enjoyed 🙂
also let me know what you think xD
anyone here who could maybe help me out with deforum?
hey guys I'm trying to install URPM, do i put vae-ft-mse file in the "VAE" folder?
i take issue with people calling themselves masters of an art that is barely a year old. mastery takes years to accomplish and usually is demonstrated with a magnum opus. Just a pick i gotta cope with over here. don't mind me
i don't like to see the inherent value of a word watered down
So I need some help with a prompt. I'm trying to make a prompt where there's one male and one female in the image. But no matter what I do it only puts 1 or the other. What do I do?
unfortunately, it's a natural language algorithm, not a technical implementation. it's easy to confuse the focus of an image by giving it too many subjects, like, more than one. Something i've done is instead of "man and woman" i describe them as a set like "couple portrait" "marriage photo", so that they're one subject. And since SD models have an inherent bias in them due to cultural forces, you don't have to stipulate man & woman
more to the point, i find one subject prompts are easier to work with
i saw some research being done on programming languages to create models. while it's work being done elsewhere, not on diffusion models, i think it could at some point apply and i can see a vision of function calls being available to generations
https://github.com/Extraltodeus/multi-subject-render an extension like this will help a lot, especially once you figure out how to do single subject prompts. This allows you to give multiple single subject prompts and then the script will composite it all together
it's stuff like that which i think could benefit from a programming language being inside the base model i think. extensions like that could just be something native to the model and so many other possibilities
I've put hundreds of hours into SD, since it first came out. I have no issue calling myself a master. Not to mention I've got an arts background as it is. I can create almost anything I want in it.
So if I'm using the couples portrait thing as a prompt for example would I not use the standard 1boy, 1girl thing?
i don't get the mj5 hype and you saying you can't get that, tells me you've peaked then. if thats mastery, you've capped your level. maybe i treat the word too literally and no one else does. i get that often
the "standard" 1boy 1girl thing is a booru tag thing and only works on the knowledge that was trained using that language. typically anime. Not to mention, the people who did that training really did not put nearly as much work into the 1boy side of the knowledge base. It's "mostly" a natural language model, and the reason keywords like that work is because they are heavily refined into the anime training
anime fans have been tagging images for decades and they're called "booru" tags.. its a .. its a dark space of research

Try a man and a woman ....
Or 1boy and 1girl holding Hands... Should work with anime models
holding_hands is the booru tag there too
oh, thinking about it too, when i was showing off stuff for my brother and his daughter, she wanted to see a tea party and that always had two subjects.
This seems fun, what's this server about?
Stable Diffusion...mostly?
You can utilize different extensions in order to achieve stuff like this. Something that you can use is zone focusing, or composition split.
What it does is it allows you to add multiple splits and different sections onto one image, while also allowing for multiple prompts in each one
So basically you could have one long composition and have it be the described background, then on the left side you can have it be a person, maybe a woman. And then on the right side You could have it be another person, like a man
So then it will make the background, the man, and the woman all together
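To make the layout idea concrete, here is a minimal sketch of how a canvas could be divided into side-by-side regions, each with its own prompt, plus one background prompt for the whole image. It only computes the pixel boxes. The names `Region` and `split_columns` are hypothetical, and how an actual extension applies each prompt to its region is not shown here.

```python
from dataclasses import dataclass


@dataclass
class Region:
    prompt: str
    box: tuple  # (left, top, right, bottom) in pixels


def split_columns(width, height, prompts, ratios=None):
    """Divide the canvas into vertical strips, one per prompt, sized by ratio."""
    ratios = ratios or [1] * len(prompts)
    if len(ratios) != len(prompts):
        raise ValueError("need exactly one ratio per prompt")
    total = sum(ratios)
    regions, left, running = [], 0, 0
    for index, (prompt, ratio) in enumerate(zip(prompts, ratios)):
        running += ratio
        # The last strip always ends at the canvas edge to avoid rounding gaps.
        is_last = index == len(prompts) - 1
        right = width if is_last else round(width * running / total)
        regions.append(Region(prompt, (left, 0, right, height)))
        left = right
    return regions


if __name__ == "__main__":
    background = Region("a garden at sunset", (0, 0, 768, 512))
    people = split_columns(768, 512, ["a woman", "a man"])
    for region in [background] + people:
        print(region.prompt, region.box)
```

The background region covers the full canvas, the woman gets the left half and the man the right half, which mirrors the description above.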
That's .. all gibberish to me
#imagine
well Id never heard of that before and it sounds pretty well explained
has anyone tried a Tesla M40 gpu? I cannot find many people talking about it for stable diffusion
I would assume that it's not particularly good. Tesla GPU's aren't known for being very good with AI
/subscribe
the m40 has 24gb ram which is nice, but is fairly slow overall compared to a more modern gpu.
from reddit:
I have a Tesla M40, and a 3080Ti. Using SD to produce a 512x512 image @ 100 steps, the M40 finishes in just under a minute. The 3080ti takes roughly 12 seconds.
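Putting rough numbers on that quote: a quick back-of-the-envelope calculation, assuming "just under a minute" is about 60 seconds and that both runs used the same settings (neither is stated exactly in the quote).

```python
def iterations_per_second(steps, seconds):
    """Sampling throughput in steps per second."""
    return steps / seconds


# Assumed timings taken from the quoted post: 100 steps at 512x512.
m40 = iterations_per_second(100, 60)       # "just under a minute", rounded to 60 s
rtx3080ti = iterations_per_second(100, 12)  # "roughly 12 seconds"

print(f"M40: {m40:.2f} it/s")
print(f"3080 Ti: {rtx3080ti:.2f} it/s")
print(f"speedup: {rtx3080ti / m40:.1f}x")
```

So on those figures the 3080 Ti is roughly five times faster per image, while the M40's advantage is the 24 GB of memory mentioned above.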
Why am I getting low quality renders with a custom uploaded mask? :/
My mask is properly black and white.
bru just get a 3090 lel. best bang for the buck
who gives a fuark about tensor management. its all about that sweet sweet vram
hmmm, where do I put pylint commands and lines to have it working and imported or is pip install wrapt typing-extensions tomlkit tomli platformdirs mccabe lazy-object-proxy isort dill astroid pylinto enough to get --upcastattn ?