#💬|general-chat
1 messages · Page 112 of 1
someone said:
4070 ti super is about 10-15% slower than a 4080S. They have basically the same feature set (they are built on the same die afterall), including the dual encoder than will probably help out your streaming quality. Is that extra bit of performance worth +$200 for you?
huh same price difference too
4070, 4080 is big boys in terms of memory speed
yeah 256-bit memory bus compared to 192-bit
for example
but the thing is, all three will probably run SD3
i think ill go for the 4070 ti
alright
if you use that to buy a better CPU to prevent bottlenecking and etc then its all worth it
not that I know what your setup will be
i will probably get a AMD Ryzen 7 7800X3D 4.2GHz
i feel like the more i look up parts, the more expensive stuff i pick lol
but yeah im at $3000 right now
but whatever, money is meant to be spent
but even i cant bring myself to buy a 4090, price is actually ridiculous
it was soooo worth it
i thought i'd feel stupid for it but i haven't for even a second
Great CPU!
Best for Gaming rn
🥱
helo
what is the most you guys would pay for a Checkpoint model?
really depends ...... 😄 just some generic one? Probably nothing because so much is on Civit.ai. If you will get a custom model for yourself it's something different of course
imo it looks pretty generic and he calls it a "mix" which makes me think its not even a trained one, but a merge. its not publicly available as he wants to charge for it. but for a custom model how much at most would you pay?
yeah mix sounds like some merge... But maybe he has a very special complex merge with 20 different loras that just work perfectly together. That can require some skill to create too.... Without knowing what exactly your Checkpint is... maybe 50 bucks ? Sounds like it can nothing new, just things that already exist a bit better
hm can it do something specific or just some general good stuff ?
200 seems like... a lot
They dont specify, it honestly just seems like any other random checkpoint i cant find for free
can you link to it? Then I could get some idea how worth it could be 😉
honestly bunch of NSFW anime 1.X finetunes could do these
i personally wouldnt, but it looked like a good model so i was gonna download it but then realized it was behind a paywall and was curious what others thought? ive never heard of others charging for a checkpoint
yo looks like any other anime checkpoint...
why the hell does it say "only 4 left" on the website, seems really scammy to me
so was curious if this was normal, and i was just overreacting, but yeah that was my thought it looked like every other anime checkpoint available for free
LMAO
ONLY 4 LEFT?
right? i thought that was so weird
that's rarted I'm sorry
and the comments "The original is from AsuraAI, published on 10/23. I dropped an idol. I guess all the other parrots are stolen too. Regrettable."
this doesent make it better either
it's even for SD1.5 there are soooooo many good anime models on 1.5
yes, literally hundreds
exactly, and right 11 souls bought it
more like 11 got robbed, yeah my reccomendation is to not buy it
just go to civitai and type in "anime"
no intention of doing it but wanted to know if charging for checkpoints was becoming the norm
I used to use morething like " orangeMix " or "orangeBlueMix"or something like that. That was really amazing and totally free. Alot of loras perfectly worked with that
i dont think its becoming the norm, its just people being tricked, sadly some always do
exactly
this has to some sort of merge at this point
imagine someone just sitting at their computer, then put in 2-3 models and clicked merge and put it up for $200
sounds like a good business model xD
even of his preview he just has the girls standing there. no complex or interesting poses or anything special. Mayby it sucks at anything than portraits ?
i mean shit they made $2000 bucks
My son 👨🏿🦲 convinced me to sell a model merge for $200! I think it's a great idea 💡😮
right i have a custom merge, im gonna go sell it for $199 so im the better bargain
sneaky sneaky ;D
with SD3 I probably won't need a finetune for photography 🤩
I do think that there will be a bunch of finetunes for SD3 that will be about even more complex actions and expressions and etc
.... I saw some rather bad image from SD3 already 😉 Think it's prompt understanding will be superior to anything else, But not necessarily how good the image will come out in the end 😉
from my understanding though they are cutting back on the NSFW side and anime side from SD3
from the early access or where
I'm looking at Lykon's images
he has the most up to date model of SD3
the early access has a bunch of random models with random samplers
yes, but son't know how early that is and how big of a jump the final release will be
sorry havent saved them ^^'' there where image of clowns in a bar and I saw one of a man and woman on a dinner party or something
sd3 has anime, just not good anime
Guys, SD3 invite news from Emad:
Loads of invites going out next week I believe, there are a lot of people on the waiting list
anyone got access to sd3 yet? or any idea when it will be available?
cutting back on NSFW. there was nothing NSFW than a general naked body without genitals. I hope they won't cut back on that. That was was made SD2.0 so unbelievable bad
oh snap
i know someone who does, sd3 access to more people soon
sweet
I have seen anime in SD3 and I think it looks good for a base model, still probably not perfect. No idea what exactly people want from an anime model.
i just signed up for the preview last night. probably at the end of the list:@(
The ones comfy and emad generated looked good to me
yea comfy is probably best example of anime
it is not good enough for an anime base
whats the word? is it better/equal to dalle3 or midjourney?
looks a lot like early sdxl models (wdxl and wdxl)
better for being Actually Possible to run locally
i love dalle3 but you have very little control and its content restrictions are ridiulous
dall3 and mid are run on h100s
somewhere between midjourney and DALLE3
when it comes to quality, for photography its really close to midjourney
it is better than any opensource model as of now
I saw a video on google imagen. Half the time it blocked the output because of giudeline violations. In the end it just was unusable. became pretty funny xD
we have yet to try prompting it differently cause almost nobody has access
but the ones lykon posted were good to me
you can 100% do photography
you have a link to them ?
also, SD3 max size will barely run on 12gb of vram afaik
no like I want to see if promping it stuff like "The photo is done with high dynamic range and has beautiful subsurface scattering" and other crap like that
Hi, guys. I am first here. Is there any announcement for making SD3 public?
whenever i ask for a photo from dalle3 or something realistic it gives me something that looks artistic and/or 3r rendered
Yeah this would be my dream, SD3 running on 12GB at like 2-3 images/minute
more people get access this week
sure wait
i dont know exact speeds but yeah around there
get ready for auto webui to not support sd3 btw
you know if the bigger and smaller models are compatible with the same loras ?
also it completely ignores specifics, like asking for a certain camera angle
either move to comfy or try and use the unusable (sd.next)
Thanks for respond 🙂 How about implementing in the diffusers?
from what i see, its not possible
diffusers implementation is when inference code avaiable
they have that midjourney-ish feel to them, but it's till not the exact
it still has that slight "its a base model" feel
imagene have one great lora for the small and one for te big one. you never can'#t use them together. will be very sad 😦
comfy already supports it btw, just has to commit it to main
which I am hoping will stay
is sd3 gonna have the ability to generate images without lighting and create depth/normal maps etc for them?
i dont think so
]damn
how can I make StableStudio run on my graphics card
gpu?
that would be perfect for material creation
oh wait stable studio
well it will have controlnets at launch
uhhhhhh
i dont think so
yes
are there controlnets for sd2 that can do this?
SD2?
sd2 sucks tbh
yeah i could never get anything but gibberish from it
literally wouldnt generate an image for me:S
lol what
id give it a detailed prompt and it would just give me some crappy loose representation of what i asked for or a random blob of colours
dunno why, ive seen what it can do
yeah they look great! to bad I can't find the bad ones anymore... but let's hope it's generall good performing and the bad one where just cherry picked 😉
@Dark can you give me a promt and settings/seed for something so i can see if i get the same result as you
uhhhhhhhhhhhhhhhhhhh
and which model
how to run comfyui after restart computer mac
prompt: Raw Candid Cinema, Hansome man on a beach, (remarkable color), (ultra realistic)
negative prompt: ugly, blurry, distorted
idk try this
help
try #🤝|tech-support
thats with the model sd_xl_base_1.0. do i need to be using something different?
i had better results in sd1:S
ah I think now I understand what your problem is 😄 Open your web browser and go to http://127.0.0.1:8188/
no help
Then I don't understand your problem 😦
do we run com terminal
💀
i have big problem with stable diffusion
all my images are blrury
they are fine on fooocus tho
i just installed comfyui , after restart macbook, how can i run comfyui? please help
go to the folder where you installed it and double clik "run_nvidia_gpu.bat" . Then a black window with some text will appear. Don't close it. Wait a while. Then there will be some text with http://127.0.0.1:8188/ . Copy that in your webbrowser and hit go
Bats dont work on mac
if nothing appears when going to http://127.0.0.1:8188/ , then your black windows hasn't finsihed starting yet
then it's probably some other file that is called run_nvidia_gpu . He should find it..
thanks bro
is it working?
I saw on a video that they click on main.py, instead of run_nvidia_gpu. So I'm not 100% sure what is thhe same thing to do on mac and windows
you need to know it 😄 otherwise just install it again in folder you remember 😉
in in comfyui folder
but i do not know where the run_nvidia is in comfyui folder
maybe you don't have the run_nvidia_gpu on mac. try main.py instead
I have windows, so I just can guess whhat to do on mac. But there are some videos on youtube
but after your installation you started it somehow
yes
last night i turn off laptop
and open this morning
no help anyway thanks bro
it works
thanks for the video you show me
👍
cool! glad to hear that !
if you prompt with the typical novel ai word salads, yeah, yeah it does.
When you're prompting the same language that the openclip layer understands, its powerful. You can approach SDXL this way too as it has dual clips. Eveyrone usually prompts to the one clip layer only though. The same one thats in the original sd1.
if you're prompting for porn then yeah it does suck. Unstable Diffusion took a lot of money from the community to build a nsfw 2.1 model, but never delivered. I think the resent for that event is misplaced though.
theres a reason why there are no popular sd2.x models
they forgot to train the clip properly
popularity contests. how highschool.
Openclip is very capable. People just speak the wrong dialect to it
popular/good
Well your naivity is on full display right now. Congrats putting yourself out there like that. Takes balls.
i dont doubt that openclip is good, its just that when you shove it in with almost no training, its not gonna do the best it can
thank you :3
i'll dial back the trying to convince you. Entrenched is entrenched is entrenched
if you want to believe that 2.1 is good, then believe that
if i want to believe that 2.1 is bad, ill believe that
its the basis of Stable Diffusion Video
Check out the #1045349359044280360 channel, there are many great generations in there. Flow wolf is correct here imo, many people just didn't give it a try simply because of the bad press it got.
yeah, sd2.1 couldve been great
but from nai leaking, the clip not being great and other factors; it didnt get much popularity
It is is what I'm saying lol, the generations in the channel prove that
the text encoder was yanked actually. But they used the same pretrained openclip model
ah
The unstable diffusion crowd ran their misinformation campaign hard. People got suckered. Nobody wants to admit that the conversation was steeped in misinformation and fud. The archive channel is beautiful
They were the leaders of the "SD2 is censored!" conversation
had a lot of crowd funding campaigns set up to collect funds and build their own community model that would never be censored
oh, i mainly heard the "sd2 sucks" from touhouai folks
i think it's nearly 2 years after 2 drop and they've only released a beta version that was actually someone else's model renamed
bunch of flimflammers
Yeah there is a lot of brigading and tribalism in the community, it's kinda crazy. For example, either Vlad themself or fanboys of Vlad were incredibly toxic to anyone not using vlads fork back in the day if anyone remembers. I'm not convinced that many of the comments and down votes werent made by either the same small group or even one person.
ohhh vlad
does his fork even work?
from what ive heard it breaks more often than auto
Vlad renamed it to SD next
Yeah it was a good fork from what I read of it. The tribalism of some folks though, very toxic lol
Most of what vlad improved in their branch, was just code from the dev branch of automatic1111
cant wait for sd3 to release and for auto/vlad to try implementing it
sdnext will probably just use diffusers
automatic1111 with fp8 turned on is actually more memory efficient than comfyui. you turn on comfyui's fp8 support and it breaks most nodes.
can only turn it on with a command line switch and theres a few options for it. consequently, the way that the forge fork has fp8 support is tied to the code they use from comfys memory management, and is less efficient than automatic1111 1.8's
Hmm come to think of it, I haven't kept up with stable cascade. Has a1 or comfy implemented it yet? Cascade and sd3 should both function a little bit differently behind the scenes from the previous models.
yea cascade works with comfy already
comfy had cascade day 1. nothing in auto yet
sd3 will work with comfy day 1
Ah
does fp8 work on ampere or is it ada only
looking forward to learn how a transformer-diffusion architecture is done
it works on ampere afaik, but in software. no engine on teh chip to support it
ah, so fp8 saving n loading?
inference too
neat
Transformer would be like GAN right? Ive always wondered if there was research being done to combine the two
no i think transformers are different from GANs. but the new arch is a hybrid diffusion/transformer architecture
I remember a crazy awesome post a while back on something based on GAN moving a Lions Head and making it yawn
Ah
oh draggan
Drag Gan. can only work on images made in a GAN
Comfy got a drag node recently actually thanks for reminding me of taht
HDiT seems neat, 1024x1024 images in pixel space with much less compute
Oooooh
another node for making a "trajectory" https://github.com/GraftingRayman/ComfyUI-Trajectory
i dont think i'll bother wiring that up hahaa. i'm going to wait for ui support on such systems
oh my god I remember that
holy crap
its been a long time
i dont think hdit made it into sd3, but maybe later like sd3.1 or some community extrra network
hdit was published by some stability folks
though probably after sd3 started training
so most likely sd3.5 or sd4 lol
emad said there'll be no sd4
likely 3 is the last
i think at some point though, community teams can just train their own base models. Which was kind of always the goal
i dont think we'll get to a point where a few people can retrain base models with a different arch
we're already seeing that start to trickle out. pony, playground, pixart
well emad claims that we have somewhat hit a point where ~90% of the work is already done by the AI, which can be improved with manual work
and that this multimodal diffusion transformer thing will be the new "Unet" as a standard
I can't believe only now have we seen a bunch of open models with nicely captioned datasets
SD3, Pixart-Sigma
as intel cranks out gaudi chips, cloud compute is going to get a hell of a lot cheaper
Thank f*ck
supply will exceed demand sooner than later
iirc pony is same as xl in arch
yes
yeah. but it's so heavily trained that it's essentially a new base
they trained on Clip a lot right? (score_1, score_2, etc)
they did a bunch of crap on that
too much clip training
playground 2.5 isn't a refined sdxl but uses the base architecture
I remember some people claiming Pony is something that spams nsfw and other claim that its one of the best models right now
I never tried it
the new proteus model is based on pony and its for photography mainly
for anime? its one of the good xl models
i respect what pony is. it's really well built. maybe a little over fit, but that works for toons.
I also think it's just a smut model
and has a different scheduler
yes
on civit, it's communiyt gallery is dozens of users posting 100 images an hour. ALL blurred out.
damn
its so interesting that Pony is a separate model, like if it was a base model
on civitai
what the fuck I am looking at finetunes of "Pony" and its a bunch of nsfw models
pony is another latent space as i like to think of it
yeah....
ahahaaha yeah. If you go onto the subreddit /r/stablediffusion an notice this , you will probably catch a lot of heat.
the proponents of it don't like to admit the primary use
yeah it is 1000% an nsfw model, however it gives really crisp illustrations of things at low steps. but its useless as a model for anything other than person subject and heavy weighting to beg it to make something civil
i would just point out too though that its based on a hacked model so at some point passing off anything incorporating it might come bite you. i will admit though that it is very good as a base model for sharp scene setting out to then go over with a proper model. they should just release a sfw model, theyre actually onto something with it.
its the quickest sdxl model for using illustrations to quickly build a shape of an image for a photograph with an action pose.
its quite an epic fail
very few images on #SD3 at the moment
At least SD3 weights will likely come next month
Next month for main release likely, API, testing etc very soon.
I actually believe that SD3 will come april, even if towards the end of the month
if the trained SD3 Turbo so quickly I wonder how fast they will train Controlnet and etc
Loads of invites going out next week I believe, there are a lot of people on the waiting list
:S looks like we gotta wait like 2 months 😭
I understand
some lucky people get to wait a week
Emad has been known to promise stuff earlier than when it would actually come out
But April seems realistic in my opinion
yeah I wonder if another 12 people get invited to not even show people their generations
lol
The thing that I dont understand is why the ones who have access to sd3 dont test it with prompts that are...better? I mean, text in images is great but we need some "graphic power" or something like that
theres at most a dozen non-s.ai people in the sd3 test
Thibaud did a great service to us by accepting a bunch of requests for a day or two
Thibaud for example was given a bunch of mediocre prompts that were like: "1girl, blue clothing, in restaurant, beautiful, intricate"
I hope to see a bunch of new testers actually test with CogVLM type of prompts and etc
theres someone with sd3 doing hatsune miku
and not just 'tags'
with natural language
epic
Ngl it's gonna be weird transitioning to an AI that can actually understand sentences after getting used to the usual 1.5 prompts
this
the thing is broh, 1.5 prompting might actually be superior
bc from what I've seen it's bullshit that the AI actually understands you
it was half trained on CogVLM and half on the original captions for the images
I feel like we're a long way before we can get exactly what we ask for in a sentence
sentences are too bloated, a combo of natural langauge and tagging is best
I think tokyo_jab on Reddit has the best outlook on this aspect tho, we're currently just having fun with the technology at its current limitations. Any day, something could come out suddenly and wipe all of the learned experiences into wasted efforts because it's so good and that's perfectly fine because the fun experience is the important part.
And this was coming from the man who spent several hundreds of hours trying to perfect the art of making videos with stable diffusion
a lot of people went straight into perfecting video, myself included
dont give sora praise until it comes out
seeing is believing. My biggest concern is... how da fuqq are you gonna finetune a video model
with video clips? how long? what quality? how much vram? It's kind of a mystery
i think it wont
OpenAI wants to release a video model but sora is too compute intensive and inefficient. It's only a test model
real
1 frame takes 30 seconds to generate
Another day another no sd3. Life is hard
motion loras for animate diff are trained with 2 second clips at 8fps
do they work?
I wanna see the best animatediff has to offer
they do work but they can train the motion loras because they are 1.5 which is easy to train
found a few for XL too on civitai so maybe we will get them for SD3 if its easy to train
i would search reddit for the one thats awesome, trained on motion of ants, but reddit is shitting the bed right now
combed the animation tag since the search wasn't working. i wanted to download that file so yeh. here it is
https://huggingface.co/PollyannaIn4D lots here nice
the motion module for xl is still very beta. it hardly works with any lora very well
oooOo. Stability staff said something that datavoid took very personally https://twitter.com/DataPlusEngine/status/1769159798388383902
MysticDaedra's Avatar
MysticDaedra
Holiday 2023: 5 lights
15 hours ago
Just to clarify: this doesn't incorporate the virus that is ponydiffusion score_9 etc.? That style of prompting hearkens back to the days of SD 1.5 and booru tagging, and has (in my opinion) no place in an SDXL descendant, nor does it have a place in a (hopefully) future of natural-language prompting.
EDIT: Also, I see a bunch of comments saying to use CLIP skip 2, but the description says to use CLIP skip -2? Which is it?
What does drink Lora dry even mean??
uh, doesn't even know what clip skip -2 vs 2 is and works at SAI? wtf?
2 for a1111, -2 for comfy
Ah right. Wasnt sure if that was part of the copy pasta
Also words are words, if he doesnt like words that describe things then he probably needs to find a new hobby
I mean you can put score_9 into a sentence, it just looks funny
Hi, does anyone happen to know if I am allowed to use this Stable Diffusion Model commercially (https://civitai.com/models/207992/stable-video-diffusion-svd) ? It might be using SVD 1.1, and SVD 1.1 used to have/and probably still has a non-commercial license, if I am not mistaken.
kind of what i was wondering
hard disagree. 1) it's his job. 2) people are emotional beings. especially creatives.
if talkin shit on reddit was a job ,redditors and twittards would be the best paid ppl on social media.
yeah ppl on reddit are hella negative
I once got top post on r/antiwork, even got gold
and 70% of the comments were just ppl posting the saddest insults. like why
reddit is a pile of doo doo
why are so many ppl so miserable on the internet. they say twitter is even worse
it's antiwork and they're a bunch of commies
what are u talking about, it was a lot worse back in the day
no it wasnt the actual subscribers
theres usually a bunch of contrarians / dissidents who visit those subs
who are there just to naysay
antiwork be like https://www.youtube.com/watch?v=fibDNwF8bjs
because most of us cant say anything bad irl so they let it explode on social media
I cant argue with this
yea and cant say anything or u will get fired
i guess i'm bias there. i worked in kitchens the last 10 years
did anyone notice that the assholeness dramatically rose during and post-covid
anything goes in the kitchen
anyone who works/has worked in the service industry knows
I think ppl are also mad all the time cuz their cookies are being shrunk yet cost more per bag
yeah. front lines you don't dare say "well fuck you too then!" but behind the doors where the ovens are, the teddy bears are gonna have their picnic nawimean?
because the economy took a hit,now everything is more expensive,less money = more angry ppl
it really is quite difficult to get the image u want
sometimes i think my partial aphantasia helps me here. i never have an actual image in mind that i want. my mind is pure prompting all the time. flashes of images but almost entirely an inner monolog of what i'm thinking about. one might think you can't talk that fast, but woah man. welcome to my head.
ive spent an hour or so comparing lcm to euler_ancentral, and to my big dissapointment it seems its not even worth it to try to use lcm
lcm is for the latent control model lora that allows you to make images at cfg 1 in very few steps
yeah i figured i could use it since my rig is old
it won't work if you don't have the lcm lora loaded, or a full lcm model
oops. atm machine moment
i do have the lcm lora, but its rly inferior on detail, and prompt, im using "holding a glass of water" and lcm fails maybe 18/20 iamges
it can be useful in specific cases... when you want something smooth and lacking in "assets"
not necessarily just detail because smooth uninteresting surfaces can technically be detailed.........
lcm you have to make sure to have a very low cfg. 1 or 2. and only 5-15 steps kind of range
and the lora has to be loaded
indeed, this rly sucks because i rly like the fast generation
ugh, now its like 7 seconds per image to, 35-40
lol i like to explore prompts in lcm then when i find one i like , dive into it with heavier samplers
where lcm failed 18 out of 20 images, euler only fails 1/4, huge difference
so the speed is pointless lol
#🏞|general-with-images message 15 seconds per batch
i ment more like, a hand holding a glass of water
hands are tougher. lets do it
and to be honest, ive never tried a realistic iamge be4
i only do comic style things
lcm is pretty good for comic style with the right checkpoints iirc
def is with cascade
yes i though so to, but, u can rly tell the difference = /
cascade got lcm? 😮
yeah you don't need a lora or anything
use lcm, normal or simple
we're not talking 1-2 step shit here
but using it like a normal sampler
ah!
hands holding cups is a hard one. i dont think thats a sampler problem . more of a prompt / base model thing
mostly yeah
best thing to do is upscale that part of the image, unsample it and resample with the expoential scheduler or maybe sgm
then downscale and patch back in
#🏞|general-with-images message neatest one i pulled off but only one got the hand and well.. yeh
gonna need to inpaint the hand with controlnet after i think.
can stable diffusion forge use 2 gpu's? i have two but it only seems to use one
Mold computing when
https://www.youtube.com/watch?v=5mIWo6dgTmI&ab_channel=Megaprojects
It's a model for cool pony pictures... 
stable swarm ui is pretty good for multi gpu. you can only generate one image on one card at a time, but if you queue up 10, it'll spool them out
respect what you've done with it , but yeh. Thats not what its popular for. Theres a particular community that has made it as big as it is
almost like it was made just for them
ahhh, dang wonder why it cant. but hey that seems cool as well
shared memory i think. do you have them nvlinked?
It is part of life that anything that can be used for NSFW will be used for NSFW, but if you look at all the model history it has always been a model for cool pony images.
oh sure oh sure yeah. rule 34 right? you know about that
(I am not going to say no to the popularity though)
i do not, i have never heard of that term before
off to goodle to learn!
i'm just sayiing. don't shy away from the reputation it's got. you're the designer of it after all. you built it for them
acting all coy like you weren't giving them exactly what they needed
you dont nnnvlink your nvlself? its a cool term we're alll nvsaying. im not having a nvvvvstroke 😄
and a great job you've done at it. it's really suceeded so much at becoming a new base
That's actually not (entirely?) true. I wanted cool ponies but there is not enough good pony images, so I added all the available ones, but then it was not enough so I added all wester/furry/anime stuff (and unfortunately it's still not enough).
nvlink is a little connector between two cards. lets them share memory.
then nope, i had my brother hook them up and he just put it in the slot and that was all, is it something i buy seperate or does it normally come with a card?
yea mostly supported by the top end cards like a100 i dont think it works on 4090s
if you cant find images in stable diffusion you have slow imagination
i really respect the work you've done. you've employed tools in a really great way and made something iconic. i'll leave it at that
i'll add one more comment. its better than anything i could do
here on PNY site they tell u which card supports nvlink https://www.pny.com/professional/software-solutions/about-nvidia-gpus/nvlink
ada cards have naught nvlink support. lame but.. i never really bothered. it's hacky low level dealing stuff
why is making caroon horses still a thing?
i remember total biscuit always showing off his SLI machines, and bitching about stutter all the time
And all of that is a side effect of me accidentally adding two zeros in a config file.
jk, but it is partially because Pony is such a high LR model, I am not sure it will be a thing on its own if not for this quirck.
genuine question
maybe because //gestures towards all the billions of people in the world
they could make it work on all cards but they wanna make u pay more for those ADA cards
i think the big reason they didn't put it on ada is they didn't want open ai to buy 4090s and nvlink them
want their whales buyin those a100s
i have 1 nvidia gpu and a mining gpu(i dont mine, just got it cause it was all i could afford with what i had left), so im not sure itll work :(
i mined like fuck. free money bro. but with like, my 3 gpus
if u rly wanna buy a cheaper card that supports nvlink u could try buying a RTX Quadro used on ebay
p sure sd3 will ship with small models for everyone, i think thats the task and purpose of it. let even trash gpus do AI
which card it is?
Nvidia p102-100 10GB
idk much about them though, itd be my brother who does
but he is sleeping
yea that card doesnt support nvlink only some RTX quadros and ADA's
dang, wouldve been cool
i plan on doing dual 4090's provided i dont have to sell 27 kidneys to buy just 1 
havent checked the market for them in a long while
democratizing generative ai is going to be a big deal.
I like to think of the HIFI audio scene in the 70s. Best speakers. Best amps. Best equipment. Even early casset players and 8track players were HIGH FIDELITY. American manufactueres were like "we'll never cheap out on quality. we're fucking america!"
then along came Sony with the ghetto blaster. Shit box audio that sucked built on microtransisters not tube amps. pieces of shit! Friggin import crap! But you could carry it around. So it took over the market. Then along came the walkman. Crappy audio through crap little head phones. Best selling audio device ever (then)
this is why i'm pretty sure big models will fall
i think you're right for another reason too though... the quality gap will shrink as designs improve
yeah that too
well if u rly wanna use nvlink u could try buyin 2 RTX Quadro 5000s with 16gb each and then nvlink them to get 32gb vram
u can get both for like 1k usd
if you wanna do X then you should buy Y
beat up king intel and drink his blood, making you stronger
they say he isnt real, but i was told if i buy 5 diferent gpus and align them correctly, the devil will be compelled to show itself
anyway, plenty you can do with low end gpus
i assumed that my loras wouldn't work on proteus because they don't work on pony, being that its such a new latent space.
But yeah. I see what was meant now. They work dry
pony isn't a new latent space - the thing with pony is that due to the massive amount of training done, the clip model and unet weights are so far from the sdxl base model that it renders some loras incompatible
because loras only modify certain layers of the model, and not the whole thing
lcm sampler requires use of the lcm lora or a model with it merged in, same as the sgm_uniform goes best with a lightning lora or model
I think the thing with hi-fi audio vs walkmans was that most of the audio difference was either imperceptible (for the same reason a lot of people cant tell the difference between a lossy and a lossless format), or that external factors (ie surrounding noise/portability) rendered the requirement moot
If you could get the 800M image model to make stuff that was 99% as good as the 8B model, yeah I can see the 800M being the more popular model
especially if the 800M model can be optimised to run on mobile phone hardware/cpu
say, i wanna try out making my own lora.
Are there some crucial things or tips I need to take account of before i attempt it?
Big props for pony v6: it seems to be the current meta on civitai, I noticed a lot of popular generators are using it lately.
How many compute-hours went toward training pony v6 on the 2.6M images (starting from the base SDXL checkpoint I assume)?
Dataset and captions are crucial to get correct
I feel like Id seen 30 days for 2 a100s bandied about but do not quote me on that
Do we know what image resolution SD3 is trained on? 1024 like SDXL or higher?
1024 iirc
Lykons been generating 1344x768 on his twitter which is also a sdxl resolution, so suspect its also a 1024 base res
Hi Guys,
My Name's Prince Dhankhar, I'm From India. I'm new to AI and willing to learn so much.
I'm a .net developer with over a decade of experience but I'm sure I can learn python and other required languages because if I don't start understanding and learning about AI now then I might become irrelevant in near future and my family's bread & butter depends on it.
Yesterday I made an investment on myself and my future endeavours and believe me it was not easy for me to manage that money. I bought a new machine with this configuration:
i9-14900k, rtx 4090, 64gb DDR5 (6000MHz, 32*2), 2tb Samsung 990 Pro, 8tb HDD.
My first step is to gather a list of things to install on my new machine. And then start prioritising from where shall I start.
I'll appreciate any suggestions, feedback & help you guys willing to offer, I'll be active and connected to guys. Please help me out guys.
Thanks for listening.
Start with some youtube tutorials chief. 2 ways to approach it - if you want to dive straight into the deepest end of the pool, install ComfyUI and find tutorials on youtube on how to make it work, and civitai.com for models (and their education hub is excellent as well).
Other approach is to go find Olivio Sarakas on youtube and his latest video on Krita, and set up Stable Diffusion through that is that is an easily understandable way to get into it
hi guys so in the end no one was accepted for sd3 preview? I'm still waiting but I guess they'll never accept me
Thanks Buddy, Deepest end is my choice, ComfyUI is on top of my list. So far I've decided to setup different Conda (Anaconda) envs so that It doesn't become messy because I wanted to try so many things. Watched few videos for conda and I guess I've gained all basic knowledge.
What'd you say am I on correct path?
I guess they are giving access to people with social influence. That's better for their marketing strategy.
Oops I forgot I'm on stability's channel and they are reading this. 😋
Hello! Not related to Stable Diffusion but maybe some of you could point me towards the right way: what do I use to train and generate speech? or even music covers with someone else's voice?
Sorry in advance if discussing other types of AI isn't allowed here.
tbh, I am learning how to do the comepetition abt Stable Diffusion on Kaggle
so I click the link and join the server
not other server here
if you want to know, DM me.
AI Hub if you focusing on speech / audio
Yes, def go in the comfyui direction because you can also easily play around with making your own nodes... Lots of room to tinker
Hi!! Found the server :D
Thanks a lot!
xtt
xtts
ur gnna need a gpu to push it tho
its very demanding
even my 3060 was dying with 12gb vram
so a 12GB 4070 will struggle too, right? 🥲
no
your chip is faster
and newer
Oh, got it! So it isn't bound by memory capacity, but by processing power
i think its more about teh cuda cores, and vram is more about how big resolution u can do
VRAM is the most important thing
once you have 12gb you're in good shape with sdxl
then at that point it's about the cuda cores
12gb 4070 should be fine
ye
nayone know how xy plot (auto1111) for different prompts?
Yeah it should have an option for that. I'd give you screenshots but I'm not at my cpu
I looked at the documentation for xyz and it has this though so I know it's a thing
Prompts from file or textbox
With this script it is possible to create a list of jobs which will be executed sequentially.
I'd suggest looking thru the drop-down for xyz and see if you can see prompt in there somewhere
Except for AMD. You could have 24gb vram but you can't run SD on windows yet since there is no Rocm support 
I'm hoping AMD support is coming in time for SD3 
SD3 access please! We want to test it in detail. We already have strong ToS in place with our users, and will provide critical usage feedback to your team directly on what we experience.
What would be good inpainting settings if I want to remove an element from the image and replace it with a continuation of the background as if it wasn't there?
try llama cleaner or abode genrative fill
well i guess i could use S/R as long as each prompt uses no comma's, as the comma is delimiter for running each separate prompt/ I guess comma's dont actually alter the meaning of the prompt in any semantic way
so maybe that might be an option, but not clean option, as i would have to remove the comma's in the process
do you guys know a checkpoint which is trained on male characters?
3x a80 for 3 months (but so much extra time experimenting and running infra tasks)
adobe gen fill is great for very specific things
doesent that cost tokens or sometrhing silly?
mlir and directml exist
in 5 or so days the new batch of people should get in
.
NansException: A tensor with all NaNs was produced in Unet. This could be either because there's not enough precision to represent the picture, or because your video card does not support half type. Try setting the "Upcast cross attention layer to float32" option in Settings > Stable Diffusion or using the --no-half commandline argument to fix this. Use --disable-nan-check commandline argument to disable this check.
I've been trying to generate images for days but it's impossible, what should I do? Could anyone help me please
I always get this message when I want to use different models
it was only select people who got invited in week 1 i reckon. i saw on twitter lots of people asking for them, but then dr furken asked and got one immediately. emad is bogarting the current preview weights and holding them close .
hello guys! Can I use stable diffusion through macbook if I start it on my pc?
can I somehow connect to it I mean since my pc has gpu
Yup, both comfy and a1111 have a —listen parameter which allows you to use it from another pc on the network
thank you!
Hello Guys, I m using SD for a long time now and never had any interest for deepfake until now. But yesterday i saw a vid on yt from a japanese guy showing how to do it with mov2mov and reactor and I was really surprise of the quality results he got with juste a video file and a single face pic of the person (which was an ai created face).
After digging a bit, i saw than the main soft used is DeepFaceLab. But it s far more complex to used (almost like learning SD for the first time). Do you know what are the differences in results between the two techs ? Thx 😉
So did any new info or interesting SD3 preview images pop up during the weekend?
Their "slowly starting to give out invites" approach really is slow apparently.
Hey everyone 🙂
Im new on this discord. I do create ai art, too. I do use a plattform online for that but i wanna try it on my local system, too. I did create many models or so called merges for specific characters like Captain Planet and more.
nothing today
also invites are coming next week
I hope youtubers get access so that we can see videos of SD3
such as mattvidpro
I use Auto1111 and want to move all my models onto a fast external SSD drive. So thats all of my Main SD Checkpoints, and My LORAS. What settings do I need to change to tell auto1111 where my Models and Loras are, and where are those settings.?
I’m crossing my fingers for an invite. I really enjoy stress-testing new models for precision.
is it weird that i still don’t have access to SD3
weird is relative
😭🤷♀️
is that a legit question or are you trolling lol
so the actual release is being targeted for april, but it sounds like invites are going out next week according to the message above
😄
i’m excited
same
are more than 20 users in the beta?
Sorry if you guys had this question a lot, but will I be able to download SD3 already with the paid formula of 20/month?
I've read a site that claims this, but SD3 is not in the list of core models, so I guess it's false information
no its not released yet. only in early invite preview
That's what I thought, too bad, would be more than willing to pay, can't stand to wait anymore 🤣 Thanks for answering!
that is not how open source works..
Depends on the license, I would't mind to pay for early access before the full release online
This is made by a private company, so in that regard you could also call it "not open source"
the weights aren't "open source". while early weights had permissive licensing, current and future stability weights will require license for commercial use
the training and inference code that stabiliyt provides is under the MIT license
Instead of A80, did you mean A40 or A800?
Either way that's a lot of compute time 😄
ugh, sorry a100 80gb
pony xl is another that has non commercial license for it. can't use it on websites without getting permission from the author.
Potentially a dumb question but is civitai the largest website for comparing SD checkpoints and generations?
By some margin
half content is nsfw
💀
more than half
half?
I understand that, it's the internet so I'd be surprised if it wasn't
bro its down bad tho 💀
yeah its the biggest
Would you say their NSFW filter is fairly good tho? If you're logged out you cannot see NSFW stuff anyway?
The problems theyve got is that the base licence under it says that deriverative licences cant be more restrictive…
the admins of the site hate the nsfw concerns, so they employed a filter which cuts basic portraits of a pretty girl because it's NSFW.
so if you turn it on, you're missing a LOT
they very obviously don't care that theres nsfw content on their page and they only employed it as aggressively as they did to give abig f u to authority and so they would rank in search results
To be fair they have to err way on the side of caution otherwise a) no VC funding, and b) they are prime targets for being shut down and made an example of
where?
who tf cares about funding theyre a file hosting website if anything
venture capital needs to go on rule34 more
RAIL-M doesn't say you can't restrict use
Because civitai is the largest site by a wide margin, is it fair to say the best prompts and workflow techniques can be found there (SFW or not)?
Look at the brouhaha around the nsfw when the company that was doing the onsite generation for them pulled out
As in, you can sort by likes, and see what the best models are, and the best generating techniques
the people that make loras generally have good workflows
CSAM requests were floodign the provider's api is why. They should've been blocking those before sending them to a 3rd party
that wasn't just a nsfw issue
csam is nsfl
It's the largest, but I've seen a few very good models move to different sites.. So there might be some gems elsewhere
Indeed, but theres a bloody fine line between a legitimate nsfw prompt and something that will generate csam, especially when different models interpret words differently
What would you consider to be very good? I like metrics so if the master generators are consistently producing well-liked images using the model, I consider that model to be good
To do proper csam filtering you have to do it at both the prompt level, and have ai detection on the generated pixels before committing them to an image file
e.g. Dreamshaper, Juggernaut, Pony
What is CSAM btw? Is it a word I would dare to look up? (I don't know the term for real)
Deliberate is one you wont find on Civitai, that used to be top 3
I think it has something to do with child <blank>
ok 🙂 Thnks
Child Sexual Assault Material.. very very disappointing that it exists
Child sexual abuse material, or easier to say as content sexualising a minor
The poor Californian Society for Addiction Medicine needs a new acronym lol
No more confusion when talking about cyberpunk 2077
oh no.. can't say it as an acronym. blocked by discord
Aren't diffusion models capable of generating CSAM regardless of whether or not they were trained on CSAM though?
Some might not find it funny that theres confusion around cyberpunk, but dark humor is like food. Not everyone gets it
It is the generations that are the problem. That's whats being acted on
If you dont mind, please steer the discussion away from such topics 
there is no war in ba sing se
In the same way a diffusion model can generate "a cyberpunk cityscape in the style of van gogh", it independently understands cyberpunk cityscapes and van gogh as separate concepts; obviously van gogh never dabbled in the cyberpunk genre
there was that time that he hung out with dr who and arguably dr who is the first cyberpunk scifi
I'm glad SD3 is SFW, so I can tell people I use it without them thinking I do some wrong stuff
nobody will think wrong stuff
(me trolling cyberpunk fans)
The base model yes, all custom tunes 
They can, but they won't be as good at it
People are gonna think you do wrong stuff with it anyway
i stopped talking about generative AI with the people i work with because the wider perception of the hobby from people who aren't in it, is, well, you knwo
I mean people trying wrong stuff is the reason they dont release SORA
Yeah, agreed. We've had plenty of discussion in here about how the world will never be ready for it, but it's coming anyway
i bought a 3060 just for ai
my cpu is 10 years old
💀
doesn't matter
i7-4770k and a 3060 gonna give you a sweet time on sd
i got a 4080 for it. first time i bought an nvidia card in over a decade
closer to 20 years maybe. i forget
3570
I went for a 4080, the best graphical game i play is League of Legends...
On AI related servers the discussion at least is humane, on other servers or websites etc. people are just straight up against anything AI cause media outlets make it worse than it is
i5 3570*
I can't justify a card, given that a used 3090 still goes for more than my car is worth!
sdxl juggernaught just cant do hands lmao
hahaha. same boat kind of. i play a lot of high demand games, but most of my gaming hours are sunk into oxygen not included
its very bad at hands
Me with 2 4090 
Well in this case, I would argue that this is one time where the fear probably isn't unjustified
got anymore of those 4090s?
Only for myself 
electric bill: ‼️
I used to play league on a 2007 macbook pro 17, and it would run fine lol
you have two space heaters
Would be fun if something like A100 was consumer grade 😋
Yeah summer in my room is kinda 
i'm damn near ready to drill an exhaust port through my office wall lol
this past winter was one of the coldest i've ever had . when it really dropped low i just ran a training script
Can I install it on my own Google Cloud VM server?
aahhaah
I'm considering getting 2 4090s at some point, is the power consumption that steep if I'm running it heavy to train models?
my brother had a dozen cards on a mining rig. he pumped the heat exhaust from it into his central air
I'll be waiting for 5090..
Chris Jones rendering a 3D video on a 2008 vintage PC & Surface Pro 4 in ~2 ½ months.

Dunno about training that much but generating an image can suck well over 400 Watts only on the GPU
does dual 4090s help training?
I sometimes feel so limited on my 4080
With the death of SLI having 2 GPUs isnt really worth
i've always felt i got a lot of leg room on my 4080. enough to learn.
doubt it
like 20 bucks a month
I guess I should have clarified; this is general purpose training—smaller LLMs and TTS models—I have not yet and don't intend to train SD for the forseeable future
I've used 2 T4s on Kaggle in this manner; with DDP you can split a tensor across 2 GPUs and double your batch size / overall training speed
Maybe it's better to go for cloud computing instead of buying 2x 4090?
LLMs are harder to train. i'm not really interested in them yet since most of what can be done with training is just chat bots that stroke one's ego
i've not seen anything exciting in the home LLM space
Mainly because civitai has so many checkpoints that I don't think I can do better than what the people there are already doing
those support sli/nvlink though
x2
check out the bounty board. can earn buzz on civit, if thats a thing you want
lots of things people want there
Interesting, had never looked there
The Buzz earned from a bounty can be withdrawn as USD or no? And if so, what is the conversion rate?
nope
not yet - they are looking at ways to withdraw buzz, with the rate of 1000 buzz to the dollar, and they take a cut but it hasn't been implemented
yeah. thats a whole legal can of worms and i bet their lawyer team is going to be like "bruh don't"
they can't even control bot voting on their site so i doubt they'll start trading their participation points for dollars
i'm waiting for deep learned game ai, that doesn't just stand there and watch as you murder their friends
thinking intelligenlty
to either try to escape from you
or emotionally blackmail / beg for sympathy and mercy
game ai is severely lacking. no better than DOOM (original)
that would be cool
given the recent things coming out (i.e Minecraft in Sora), that won't be far off
like you pull a gun on a gta npc and they say "please dont kill me"
npcs are fucking DUMMMMB
and they just freeze there
or if the npc is depressed etc and you try kill them he says "pull the trigger"
that would be cool and very realistic
but really fucked up
you get the point tho
just more realisitc npcs
no i dont' care about that. i just want them to act like people not npcs
yeah
thats what i mean
more emotion
they create their own understandinf of a situation based on their mental state and personaity
could be done with current game ai. but they'd be stock reactions and still very predictable behavior.
loaded up solo mode on battlefield last night and oh man. ooohhh man. npcs are soooo dumb. thats why i play on live servers. only way the game is exciting anymore
used to play a lot of solo battlefield for practice but modern battlefield npcs are just as dumb as battlefield 2 npcs
nothing has been achieved in this field. I'd say Black or White and maybe F.E.A.R are the only times any effort has been made to advance game AI
Are Midjourney outputs uploaded to Civitai or no?
Because I'm curious how MJ generations would stack up to SD in terms of views & likes
Reddit is for that
or twitter
Also S.T.A.L.K.E.R was very good at managing NPC behaviour. But at the moment it seems devs are waiting for ai tools, XBOX already has a partnernship with Inworld. 1 or 2 years and we'll probably see a triple a with decent ai
Any advice for SD txt2img usage from the command line, suitable for say, Colab/Kaggle notebook usage? Instead of opening & fiddling with a web UI, supply the prompt & gen settings as args and run cell(s)?
I hope the next stalker 2 has even better AI for the NPCs but idk if they would be able to achieve it, i mean they changed the engine and everything
Stalker anomaly AI was the best that i´ve seen, how the NPCs peak, throw grenades and try to get you from behind, its very well done
Well, anomaly is a mod of call of Pripyat i think so it must be the same there
yeah.. use the command line to fire up a nice UI that allows you to use a decent workflow
Or pythonic SD usage
you can use the api but fuck it's painful
Don't get me wrong I like me a good UI but in this case I'm curious for command-line or python only APIs
someone released a python sdk for a1111
Is it this one: https://github.com/mix1009/sdwebuiapi
but if you going down the route of firing up the webui and then setting it to api only, you'd be better off using an extension for krita and making images in there
1.2k ⭐s
I'm gonna assume so
The interest in not needing to fire up a WebUI is for one-off generations on free GPUs. If you generally have a good prompt template and generation settings already in place, you could generate what you need, download the outputs, and close the session without needing to click too much or consume too much GPU time (basically a hit-and-run). With Colab I believe you can even automate the download by putting files.download(filename) at the end of a cell; not sure if Kaggle has a good equivalent
i thought kaggle also used ipynb notebooks...
Yes but Colab has the special sauce as follows:
from google.colab import files
files.download('example.txt')
Even though Kaggle is owned by Google and presumably backed by Google GPUs, it doesn't support that google.colab import.
It looks like there is an API wiki page in A1111
There are other libraries for saving to cloud storage tho
Hello. I'm a PhD scholar trying to research usage of AI and other computational technologies in film restoration, upscaling, etc. Would appreciate any leads, resources or conversations with anyone who's involved in such practices. Please feel free to DM me!
600+gb of vram for fp16 (estimated)
awesome field to be interested in. I love when poeple do stuff like this . While it's kind of bad ai resotration, its also a LOT better than the oriignal footage https://www.youtube.com/watch?v=YTE0OTVOnZU
its only open sourced so elon can swing his dick at altman more
even though most leaderboard models beat it
its kind of a shit release and grok isn't really that impressive of a system
yeah its so epic
we need ggml implementation so I can load it onto my 30000 petabytes of RAM and run it at 300 seconds / token
you need to use --ckpt-dir "path/to/models" and --lora-dir ".."
in your webui-user.bat
which model do you use
sdxl
Just out of curiosity, why use the base model instead of one of the highly rated SDXL finetunes on civitai?
which
i am using that
ah ok
Then maybe try to reproduce the art style of the most liked image this month using JuggernautXL. It's SFW because I can see it logged out, and the OP posted their workflow in the comments: they used ControlNet but see if you can get that style without CN.
Or I suppose you could use CN with that image as the reference. I meant you won't have access to the OP's original CN references
ok ye this is gonna be hard.
Yeah maybe too tall an order, I see they used a lora
cfheck #🏞|general-with-images
could IP adapter the style of the image tho?
to clarify, pony is mostly pro commercial (you can do whatever with outputs, use the model as you wish, integrate it into any commerce process, etc), the only limitation is when it's used by platforms that allow paid inference, but if there are no paid tiers, go wild (i.e. something like AI Horde),.
Is any way to use stable diffusion. Free?
A while back I think I saw a prompt template of the form
left prompt COLUMN
mid prompt COLUMN
right prompt
which produced a landscape image using the 3 prompts left-to-right. I can't find this technique. What is this called? (it's not a prompt matrix)
Might've just answered my own question, this technique might be Regional Prompting except it uses BREAK syntax
ah and you can ADDROW or ADDCOL
ADDCOL?
ahh yeah just saw you said that too
stable cascade with that image as a clipvision input
theres also "ADDCOMM" if you want a common prompt
i realized i said "websites" generalized rather than "commercial saas websites" immediately after posting that, but figured no one would notice and would know what i meant.
Kind of stoked to see you're on top of it.
I guess my lack of legal knowledge leads me to confusion here. I looked up the license and it states this:
You may re-distribute the weights and use the model commercially and/or as a service. If you do, please be aware you have to include the same use restrictions as the ones in the license and share a copy of the CreativeML OpenRAIL-M to all your users (please read the license entirely and carefully) Please read the full license here
Doesn't this essentially just say that anyone is able to use it commercially? I guess there are exceptions based on your message that I replied to
"allowed to use it commercially" is not a restriction.
also, i don't think stability will enforce it that strictly.
their goal is to build an ecosystem of models and the rail m was just a template
perfect prompt example to generate everything ? who got an example
SD usually just reads my thoughts and i dont need to prompt anything, it gives me perfect images everytime of exactly what i want
yeah that neuralink clip model helps a lot
lora? i hardly know'a
guys how to define 2.5d style in prompt?, i guess the AI cant recognize dot in "2.5d". any guess?
lucky lol
google image 2.5d, youre not gonna get some good results
i doubt any model is really trained on it without any finetuning or a lora
Is SD3 Out to the public yet?
lol a few lonths
months*
more*
june iirc
same for llama 3 lol
nvm llama 3 in end of year iirc
im gonna jump out a window "AttributeError: module diffusers has no attribute StableCascadeUNet"
looks like SAI forgot to include a unet again
It will be 100% free right?
isnt cascade bettter at following instrucotons
#✨|sdxl message damn this was almost a year ago, time really does fly by fast lmao
Are there any sites other than RunDiffusion.com that generate more advanced Stable Diffusion stuff in the browser like AnimateDiff and Deforum? I don't mean running a web UI on your home computer, I mean the work is done in the cloud.
Like running Auto1111 and Comfy UI etc?
now someone should ask, does sd3 use a unet?
sure, check out runpod
you'll have to set it up with all the extensions you want if you're doing video
Thank you I'll check it out.
anyone know how I can train my own model
all the videos im looking at are a year old and are outdated
they probably still work tbh
they don't
by outdated I mean the videos tell me to do something in stable diffusion but their version has something mine doesn't cause i'm on the latest version
gm
hi.Can you help me
whoever coded this is so dumb man
💀
why is bro offloading to cpu
my god
deadass looks like he chatgpt all the code
he clears memory too
stupid
[notice] A new release of pip available: 22.2.1 -> 24.0
[notice] To update, run: C:\Users\kilin\stable-diffusion-webui\venv\Scripts\python.exe -m pip install --upgrade pip
what should i do here
nahh cascade is wild
hi.Can you help me
ye
ignore
Gn
the #sd3 hashtag on xitter is spammed with nft shit. lame
so yeah, do you guys mostly use image resolutions of 512 or 1024?
i like verticle orientations and i boost a little over recommended reses cause its funner. 640x768 and 960x1280 or just whatever else i'm feeling at the time. sometimes i roll with hires fix at 2x sometimes 1.5.
desktop backgrounds are fun to bake
I'm trying to run InstantID for the first time. Unfortunately I can't run it locally and I haven't found an easy way to do it on runpod either.
I have found the Hugginface Spaces Demo that is pretty cool and even has ControlNet.
Still I would like to know what am I missing out on by not running it locally?
Is it basically just being able to use Juggernaut (or any other model) instead of Yamer or am I missing something else?
i've been getting more into kohya's hires fix extension too. i haven't decided if i like it more or less than a 2nd pass
can plug it against any model
guys, where can I ask for help lora related?
It really depends. You're going to hear a lot of polarizing and contradictory information, no matter where you ask
Especially if it's about training loras
But I'd check reddit and civitai mostly, if I were you
its more about an approach then anything
im trying to create a lora for game assets
i'm back from my business trip, wanted to see what's changed in the month i've been away, hoping someone tells me that we finally have rocm support in windows so I can build my LORA without shutting down and goign into my linux boot
text gen does not (koboldcpp)
so idk about img gen
damn, i really need my lora model to get finished...guess i'm shutting down later to try again
oof
where can I generate an image?
Good afternoon, my name is Maxim. I'm from Russia. For a school project, I created a short cartoon using neural networks only. Please take a look and support.
Hello
I want to use stable diffusion
My pc is extremely bad
The gpu is nvidia geforce 940m
Is there a way for me to run stable diffusion?
nop
u have 2gb vram
i dont think its possible
maybe cpu?
but prob slow
cpu, you are looking at 5-6 minutes for a 512x512 image using turbo or sd1.5 - your best bet is runpod/vast.ai if you want to run your own, or use something like civitai or leonardo or happyaccidents if you want to just use an online generator without having to install anything
^^
I was using it for stable diffusion cascade
200 seconds
The person who setup the program I was using used chatgpt to write it 💀
Hi, is there any information about the release of creative upscale?
Good morning, everyone! How are we all today?
Still waiting for more SD3 news. 😇
There should be a lot of invites going out this week. I hope we see some more Sd3 examples
👀
hi guys, im completely new to ai in general, and am trying to build my own diffusion model, starting with an autoencoder
i haven't ventured intro transformers and attention yet, and i'd like to know how far i can get without attention
right now, i have a simple conv encoder/decoder that goes from 3x1024x1024 (0-1 range) to 3x64x64 (-4 to 4 range) and back up
it's a terrible model that performs worse than a five month old
any tips on how i can improve it?
thanks a lot in advance!
Does anyone know any replacement tool for midjourney's cref function in SD?
I used to use Lora, but in midjourney, it can create consistent characters from only 1 image and I'm very impressed.
Someone know anything about algorithm they are using?
Is stable diffussion using multiple checkpoints ?
Bro
Ain't no way
Stable cascade was running like ass because Its running on bf16
The guy who coded this 100% use chatgpt bro
💀☠️⚰️
My CPU dying
You need avx instruction set to have fast speeds with bf16
I made mango pudding yesterday, and it was good.
I should
lol
I could generate some puke instead…you’ll just have to trust that it’s a positive thing once you see it
Guys do you know release date of SD 3?
I think we all do at this point. It looks really slick
What do you think of it compared to midjourney?
I approach these models with general purpose expectations. Midjourney is really good at specific things, like generating high quality syntactically precise artwork. But it has its limits and those show through the data it’s been trained on, which is (still) mostly Greg rutrowski 🤷🏻♂️
There’s still a certain veneer to their models, which may or may not be a desirable thing from a business perspective I dunno
Agreed
what are embeddings used for ?
Embeddings are sort of like flags, pieces of code that are more stable through generation after generation.
can they be used for generating images more specifically ?
So if you have an embedding for a specific feature or quality, the embedding will allow it to retain some consistent presence over your generations
But it’s not perfect lol
ok ! I get it now, and if it's a PT file, will it work through SAFETENSOR files ?
hey i got a quick question,any website or file for styles or prompts? to get ideas from?
ok I'm in the tool
just idk where to upload the PT file
do I must upload it in google drive ?
and from there ot link it ?
to*
NameError Traceback (most recent call last)
<ipython-input-9-d5f90ec83e72> in <cell line: 2>()
1 #@title Convert the Embedding(s)
----> 2 process_pt_files(file_path, 'embedding', verbose=verbose)
NameError: name 'process_pt_files' is not defined
\
what should this mean / ?
The traceback stack needs telemetry in adjudicating the process pt files and the issue here is that it isn't defined
Therefore is #🏞|general-with-images
Also you dont need to convert embeddings. They will work as .pt but they can be unsafe.
If you download them from civitai your fine as they picklescan every upload I heard.
Hey folks
Hi, I have a question, how do I use stable diffusion to color my line art? I want to use stable diffusion for coloring and coloring only. I don't want it to change the style, change the eyes, or mess up the hands. Can someone help? I've been told controlnet helps, but it still ends up messing the hands.
Theres a controlnet called lineart and u can usually emphasize the "lines" basically if u use 2 controlnets instead of 1 if ur trying to keep exact lines
sd3 taking so long to release because sd3 = agi confirmed
Hi friends, nice to meet you all. How should I use AI to generate images?
agi = ?
Artificial General Intelligence



