#💬|general-chat
1 messages · Page 100 of 1
b r u h
dont project yourself here pls
real hackers worth fn jack shit don't waste their time on pissant BS like discord fights
discord sucks honestly
they're too busy doing important, consquential shit
imagine paying for nitros :S
hes like snowden but he can bench 500pounds each leg
yeah, i know about that stuff
nice man
no void pointering!
we in presence of a very dangerous individual
What you guys listening to rn? I'm listening to Oneohtrix Point Never.
I noticed as well everyone hold on to your lan cables
he might go hack you so you can unplug lmao
only danger you pose is to the taco bell toilets
i dont really listen to music...
ha im on wifi on my xbox nice try
you do have experience digging
bro u done no one gives a rat's a$$ about ur threads
silk threads?
my name is julien actually
there's a lesson i learned a long time ago
the most dangerous people don't talk much
because they don't want you to get your guard up before they strike
I was molded by the black hats
what a badass
nice bro you propably have a 4min ping lmao
im pogging right now
he did say he was about 40, probably 40 iq
ok bro
If you don't program in Turbo Pascal, can you really call yourself a programmer?
i feel like we should move the conversation to something else? like a different topic...
im no mode btw.
im actually enjoying the show rn
how do you do a inverted mask with comfy ui? I want to change everything not masked
yes put it all over my face cowboy
you finally done with your intellectual posturing
what happened here lol
you must be the most insecure person irl
where can i ask about tech help and such, in this server?
grew up in the hacking community myself... the talkers never did fn shit
thank you.
my friends were all serial killers
but he's behind 7 proxies!
they hacked up nerds in the woods
bro is this the SD discord or flarkin cartel wher am I here
EMBEDED FAIL LAUGH AT THIS USER!
Aww. Forgot this was the general chat channel.
The last three times I have randomly checked this channel there have been bizarre trolls. Insanity.
wtf is up with this chat today
same question
lmfao that too he has a black role hahahaha
can someone moderate this server lol
dont worry is just unglued who was been sniffin glue for the past hour
i just posted there and i hope things go well.
hopefully it passes fast lol
and one even bounces through vietnam! and there's a shell on his buddy's server in canada and i heard their police and very uncooperative when the FBI calls
I'll give you some more freebies: I used to moderate this server too.
so... ummm how is everyone today?
Apparently everyone in this channel is off their meds at the moment.
nah, just one guy
thats crazy.
um ok lol?
that makes eevryone else a little off too lol
almost 29/02
so does emad believe in midnight drops?
i would strip naked and run down the street screaming im jesus! im jesus! if the models dropped at midnight
i just got buried in the tech support channel, dang
looks like i picked a bad day to stop sniffing glue
u are suppose to eat it
Or a good day to pick it back up...
you are hilarious already
Hi guys, could someone help me answering my question in #🤝|tech-support? Pleaaase
was me getting an amd card a mistake if i love ai? im trying arch linux but this os is so hard and nothing seems to work the way i want it to.
yes
especially with 50 series on the way
dang
theyre gonna change the AI landscape forever
when
u can still use it
Q4 this year or at most, Q1 next year
oh wow
but for what?
long time
linux with some patches
for rocm compat
but honestly, I wish everyone was team cuda
im too cheap for cuda.
how much you think
if we're lucky, 1800msrp
if they go nuts, for 32gb version maybe 2500msrp
maybe 2600
its a crapshoot but prices will get leaked eventually
I think it'll be somewhere in the middle. I'd say 2000
damn
bot hasnt worked for a long time
Looks like it's been down since late January.
sadly im about to buy a 4070 cause I cant wait xD
it only looks like that because it has
Hopefully i can return it when 50 series comes out
Can somebody please tell me if there is no function for generating on board now?
#1047610792226340935 is search broken again??
can someone give me an image and caption with their lora training so I can see how detailed people are
no you dont return a year old card. sell it duh
hi, so, most ai generation online isnt free or limited, do if i use my own 4060 laptop i can use as much as i can and with good quality?, which one is worth
Do people still use dreambooth? or is it all training loras now
I use dreambooth
Thats what im uysing
Thats what I thought KohyaSs is
Oh well Im uysing kohya ss
now answer me a few questions
can I train mutiple characters on the same lora?
😠
hmmm I dont see people doing that with people. but with buildings and other objects yeah
well thre reason is
I wanna use lora to be able to make pictures with both OC characters in it
OC ?
yo guys
every time i create a image (using fooocus),
the character is looking at me/ the camera
i always write stuff like "looking at laptop, staring at laptop"
but it doesnt work..
anyone know why?
lora is made to make images with strong resemblances to the dataset it was trained on, you can use lora with a low value
Right ideally so I can use two characters
now one more questions what are Regularisation images meant to be
Its meant to be like close to but not exact images to train the model on those images compared to one another?
if you had a lora based of bin laden it would change everyone into bin laden, you can work around that with a mask
Whats that website you use to grab a whole bunch of images at once based on what you enter?
gooogle?
You can do regional Lora
Whats that mean captain?
In comfyui you can do work flows that allow you to only use a Lora in one place
Another approach is called attention couple
If you don't care about perfect likeness the easiest approach is a pair of ipadapter with attention masks
And start with a controlnet for the first 0.1 to 0.2
That sets the characters in the right spot forh the ipadapter to then take over
I wanna make a Disney film with Sora
Dont have that or I would use it
I only use easydiffusion
Comfyui is easy to install
execution.py how to enable
future boom😅
😮
no, like if I buy one tomorrow could I before Q4
i have a 5700xt but im gonna upgrade to 4070ti
Then a 50
wtf kind of entitlement comes up with returning it a year later
Does anyone know where we go to get help with the official stability AI API? It looks like the "platform" channel no longer exists?
it's been 84 years... SD3 still hasn't released
I just need SD3 in my life and I'll never be bored again
it's been up and down for about 3 weeks, i'd email them
It's been less than a week, and the reddit comment from one of the stability team said 2-4 weeks...
SD3 being released soon?
literally look up 1 comment chief
SD3 when and where
hope that user gens gathering starts soon 🙂
Came here to ask this. 😛
run as in generate with one, or run as in create a lora?
Been watchin this ’movie magic’ docu series from youtube. Guessing it's from early 00's somewhere. It’s portraing matte painters, cinematographs, cgi ppl and other envolved with movies making the visual possible. Super interesting, highly recommend! A bit sad to watch it though knowing all that is history within 5 years or less. Surely new kind of ways to work arise at the same but still.
When SD3?
i tot lora is another form of stable difussion
no.. lora is "LOw Rank Adaptation" - the adaptation being it adapts the model that it applies to
When SD3
When SD3 😛
Yes, when SD3?
Mid April
stop torturing us, give SD3 😄
What's gonna come first, sd3 or Half Life 3....
Hey, has anyone already asked when sd3 is coming? 🤭
3 questions partna's
when the heck is SD3 releasin?
where the heck is SD3?
what the frick happened to SD3? ⁉️
heyhey. uuuh. i dunno if this is the place to ask. but would anyone be willing ot help me get an good setup?
i know theres like extentions and ive seen some before, but its been a while
and frankly, im a little stupid. so can anyone here help me get set up? already got the SD set up an such. so i can like make good pictures and within my vram(i have six dedicated ram. and 22485 "total available"
SD3 is the first step to that hehd
Hey, do you already have the right commandline Args in your webui-user.bat for Max performance?
Also, do you need help with setting up extensions like Controlnet?
oh im getting help. but more is alwaysgood
they reccomended Forge for me.
also. never heard of controlnet. whats it?
It's an extension that allows you to get more control over an image in several ways. For example, you can add poses to an image, declare the general layout of the image, that kind of stuff
I m trying make pictures for a friend that look like her, I use IP-adapter in Forge. What is the best workflow? Do I take the seed of the result with the most similar face and change the prompt?
Hey, i need some recommendations on decent laptops to run SD smoothly without any hiccups
any nvidia gpu with 16gb vram and 32g ram
something like 3090
if you want to run sdxl
but sd3 will require like 24gb lol I would wait a bit
Is that just speculation or do you have a source?
im just guessing based on the fact the largest model was said to have 8b parameters
Ah fair
i think it's pretty safe to say i have no hope of running this on my machine lmao
Hey guys! What's been your go-to model as of late? Really been curious about what everyone's been using. Getting back into ai art after a bit and wondering. I found a new(ish?) model i'm really enjoying, astica.ai. Anyone else tried it out?
thanks, didn’t see that!
Thanks for the response. I'm learning with the web UI right now and my current laptop runs on 8gigs ram but a 4GB GTX from earlier 2021. Tried launching the UI and using models but the whole laptop freezes and then crashes. Even tried modifying the bat file to reduce the load but no luck as yet.
mhm on 4gb vram you can probably barely load 1.5
it’s really the limited factor in open source
are you using sdxl? i also have a 4gb GTX (1050) but sdxl takes several to finish one pic. Sd1.5 is good, less than a minute
Yeah def a good idea to sit tight a bit if you're an inference only person
For training 24gb vram is a no brainer but inference... 12 might be fine, might need (or really want) 16 who knows
Kinda like 10, vs 12... I was so glad I got the 12gb 3080 long before I discovered SD and not the 10
Most of what I was doing with SDXL got me to 10.7 or 11.5gb vram used
Sorry if not allowed, jst wondering if there's anyone here with experience using astica.ai. It's open source and seems great so far but i would love some pointers. nobody in my inner circle seems to have heard of or used it
never heard oif it
Anyone care to explain why this prompt "three characters, a boy and a girl and a dog, looking at a spooky american mansion, (Makoto Shinkai, Greg Rutkowski), (anime:2) japanese ink painting " usually only gives me a girl, a girl and a dog, or a boy etc... but always never the three subjects together like prompted?
https://chat.openai.com/g/g-0pA1w4IJm-the-memory Play The Memory
with this one you can run sdxl on your 4gb gpu: https://github.com/lllyasviel/Fooocus (but it takes 2 minutes per image)
yeah this is a limitation with current open source models. Stable Diffusion 3 will solve this problem
its not out yet sadly
if you have a rtx... gpu, gtx still needs 8gb for some reason
has anyone got private access through the waitlist yet?
nope
I heard from someone that someone heard from another guy that heard from a random person that they heard that today would be when people get access through the waitlist
guess not
or maybe in the evening in the american timezone 🤷♂️
THIS IS JUST GUESSING BTW
like I have no idea they tell us nothing
its been radio silence for like 2-3 days
isnt the goal of a waitlist to give people access gradually over time? I find it weird that literally no one got access. Normally at the start of a waitlist some people get access and then more over time. Its pointless to have a waitlist if you give access to people at a specific time at once
but I guess thats how they want to do it
not even the subreddit talks about SD3 rn
I thought people would be hyping but it just died like it wasn't even announced lmao
because there's no updates on it
true
idk i think they might do it gradually. With the last bot they also gave access first to only a "few" people, but idk.
Also most people likely aren't aware of the waitlist. Idk how many are actually on the waitlist. (the announcement didn't ping iirc)
(also for all we know the bot might already be online but hidden to people without access 🙃)
They started with early users, then beta users, etc.
I signed the waitlist the day it came out and only like 4 days later I noticed that I didn't receive the email 




I bet I'll be put so goddamn back
unless its completely random
did u just put your email wrong or what
yeah that's probably it
its just so goddamn unlucky
maybe I used up all my luck by getting the Tekken 8 Closed Network Test and Closed Beta Test
I don't understand, Did you receive the email when you registered on the waitlist?
I just got one that said I was in the waitlist
I signed up the day they opened the waiting list but I still haven't received any emails
u sure
Yes absolutely, I followed all the steps to the letter
weird
But did you receive this email after a few days? Just to understand
Oh no, I probably typed in my email incorrectly the first time and therefore I didn't receive and email at all. Around 4-5 days later I noticed and signed it again and THEN I got the email
though this worries me
maybe it was just a bug or something lol, I don't understand
yeah I signed up 22th, nothing, tried to sign up again on 25th and THEN I got an email instantly
no it was immediate
u both probably just put email wrong
yeah that's a high possibility
same i signed up again and got the email now
BRUH
dub looks like im getting into early testing
I'm starting to doubt myself both ways
I hope its today bc
yeah today but probably in american timezone + evening or something
that's my guess
im hoping emad didnt just hype up that stability ai crossover thing
what is your timezone
gmt
okay you should probably be fine
he was posting on his twitter about whatever crossover it was
yes morphaistudio
don't think that's it
unless they are trying to mislead and disappoint us lmao

yeah they have a waitlist too, this is worrying
then again, the images were generated with SD3, so it must mean it's FOR SD3
I hope it can do res higher than 1mp
it should be able to technically
aren't they staying with 1024px, at least when training
still no technical paper actually
damn
probably same dataset as sdxl if i had to guess but idk
even the tech paper release would satisfy me for now
I just want to see the more technical improvements, like how flow matching EXACTLY helps and stuff
you sure it's not just aspect ratios you mean?
some of the image quality emad posted werent exactly miles ahead of sdxl
let me check
"Sampling flexibility
Sora can sample widescreen 1920x1080p videos, vertical 1080x1920 videos and everything inbetween. This lets Sora create content for different devices directly at their native aspect ratios. It also lets us quickly prototype content at lower sizes before generating at full resolution—all with the same model."
and they can make images up to 2048x
so 8mp
or 4mp sorry
idk even 2048px from a single pass sounds good
well, if highresfix will work fine then it will be okay
especially now that it's smart, it will probably KNOW what is being upscaled
img2img will be so good with this
we will see
you know what would be surprising
its if they were switching between the model sizes for all the images they have been posting

does anyone have sd3 yet?
No
oh man, ive signed up 10 minutes after announcement only to get an email now after trying again.
Checking Discord every thirty minutes and waiting for the first SD3 images to start dropping...⏳
What the fuck so its more common than I thought then
Luckily I tried again and the email arrived
Let's hope so
maybe we got on the list & its just confirmations that started coming later/confirmation emails were setup later
Are we supposed to have received an email after signing up as confirmation?
Congratulations you've been added to the Stable Diffusion 3 early preview waitlist!
I got that when I signed up
A while ago
😦
I signed up approximately within an hour after the message on X, haven't received anything
No, I don't use gmail
can i sign up if i dont have twitter
"You'll notified by email with an invite to our Discord server when you've been granted access to the preview"
This sounds like you'll receive an email at the moment access is granted
yes (idk if it's still possible though) #📣|announcements
cool
i think we need faster billing support before sd3. my card just work on stripe 2 days ago but wont on stability website
well he message i replied to is gone now but ok
I made a mistake with my discord-id, signed up again and now I got the email
Mee too! 😢
it's fine we'll all get access anyways lol
but wen
what can be done with ai for fun, because i find myself to be bored alot is there a way i can use ai to cure my boredom?
Yes! (At work sorry later.)
Is no one posting here because SD3 dropped somewhere and I'm the only one who doesn't know?
i don't think there would be silence if SD3 dropped lmao
its just that the server is dead rn
we are just waiting for SD3 access from the waitlist tbh
this is probably a bad idea, but i'm bored outta my mind. so if anyone who doesn't have access to SD wants an image made, hmu ig
I keep checking wondering.,, damn all the silence
I want an image of an easter egg with a weird design dyed on it.
Are they going nuts over there at stability drinking gallons of spiked coffee frantically trying to fix some bug that causes "high quality photo" to spit out porn every time lol
something is releasing/started today, its being worked on i read just now
Got the impression something was coming out today, thought it'd be a lil earlier in the day
Do you have access to SD3?
Can you please generate:
biped cute shorthaired caracal walking from a local grocery shop holding two white plastic bags with food, in a cloudy day, russian winter town streets, February, typical soviet panel buildings at background, dramatic cinematic film still

not sd3, just what everyone has access to
The community would love that bug. Really... I wonder what. Maybe lots of articles, contacts, getting the bots ready for the eval comparison, checking repo dependencies on something other than Ubuntu. (The last one was a joke. They're not doing that.)
weird notion of what "the community" is. 😛
SD 1.5 community
i don't know what it is, but i cannot get that thing to be bipedal. and with controlnet it turns into a robot of some sorts?
"biped" should put the caracal on its two hind legs. It works great in DALL-E
Use the word anthropomorphic too
i even put the weight up, but it did not work
did that too
FenrisXL as a checkpoint
guys i'm a time traveler, sd3 comes out tmr
i don't think i can run that model lmao. it's twice the size of my vram
what makes SD3 better than the rest just the dataset it was trained on????
or is there more behind it
prompt understanding, text, hands, just to name a few
the rest? as in dalle 3 or sdxl? if you compare it to dalle 3 it's promising to be dalle3 that can run on consumer hardware
ive only used SD, midjourney, bing dalle
i've seen some impressive images from it posted on twitter, so i'd say the quality improved significantly as well
I think the quality is around SDXL base tbh
so its aesthetically decent
just not EYE POPPINGLY BEAUTIFUL like midjourney
its not like it won't be possible, finetunes are gonna bring it up to ideogram quality if not further
Sorry if this is a obvious question but can people contribute to the code base of SD?
I guess if you make a pull request on the repository then sure? 🤷
I'm a data scientist who works w/ torch often and would love to help out
https://twitter.com/Lykon4072/status/1762126186581242333 i find the dragon in particular very good
2.1 repository or SDXL repository
Oh shit sd15 only then?
I guess I need study the code base first
oh crap I forgot about that, yeah it looks pretty good
still, not desaturated enough 😉
Is there a "start here" in terms of knowing the technical structure ?
i can run any sd1.5 model, and only some sdxl models, albeit very slow
I think it's diffusers and you might find code examples there for multiple models or something
I don't know myself
You're welcome, good luck with that
wdym some?
like fp16 models only?
or just like stuff like lightning and turbo
that, yeah
ahhh
i mean, i can generate on regular sdxl, but my memory nearly runs out when it's going through the vae
no refiner?
how do i get stable if billing support wont respond to me
haven't tried yet, but i don't think so
3 gigs
really? my memory ran out completely when i ran it
like, it used all my vram, and 8 gigs of shared vram as well
Any minute now guys...
yeah
do we have confirmation that it's coming or are we just speculating?
any minute SD3 dropping? Oh finally, good to know!
well, the online early test version that is...
also what about the techinical paper?
what a perfect timing to release SD3 on april fools day Lol
nahhh wait
"not quite there"
on a post where he posted an image of a robot with 29th on it
I can wait but as a relative noob it really surprises me how ‘bad’ SD is in understanding prompts compared to midj/dalle
Shit I wonder if that means they are suddnely delaying it last second
the dataset was literally a tagging system lmao
"black dog", "in park"
what does that mean?
Nooooo... 
to be completely honest, i'd rather wait a bit longer than have an unfinished model
but we're testing it early, that's the point
we're testing an early model through a service to give them feedback before it's fully trained
they'll release the weights when it's finished
yeah that's the point, why the stalling... please just release it! Go!!!
yeah, fair point
hiii guys who knows how to hack fortnite ? i would love to make a present for my little cousin
most likely the delay is due to issue in the integration / infra, not related to the model
tf?? hahahahaha
told y'all
No SD3 today guys. 😦
so where is it..
it is Emad, get him guys! 😄
yes!!!
ofc not, it's obvious 😛
we are prob two weeks away....
„waiting for that to be out, long day“
so maybe something happens in the next 10 hours
hmm i hope so
would be fitting that user generating starts a week after announcement
23:59 in the most western American timezone 
just hope they think of us europeans
don't give me hope 
nahh
guys!! omg i just got the sd3 discord server invite🥺
i didn't say the time zone 🤥
and don't even mention the aisans bro its like almost morning for them when this will get released
if at all released
liar 😢
nuu >w<
gonna stay up and wait to see SD3 releasing at probably 5 a.m. just as preview for selected users.-.
no idea
SD3 has at least 3 parameters
impressive
?
we need more info about it pls🙏
let me guess 0.2342634 0.6239587 0.892374
the usual suspects
Maybe it's not pruned / lobotomized enough yet. 🤫
Id like the unfinished model now plz
Tbo I mostly can't wait for people to (complain about that)/(be confused by) the prompts they optimised for an older model no longer producing the exact same results 🙃.
Explaining that people "overfit" prompts for specific models and also things like regression to the mean never gets old 🥰
SD2.x moment lmao
not the same clip model guys aaah!!!!1!
also I wonder if the more literal and sensitive SD3 prompt structure would incentivize the usage of llms
What's the best models/loras for text in pictures?
you're best off just waiting for SD3
https://civitai.com/models/146548/bettertextredmond-improving-text-on-sdxl-concept-lora I think this is alright for SDXL, I didn' try that much
but yeah, just wait for SD3 at this point
its not gonna be slightly better, it's gonna be WAAAY better for text
like near paragraph length good
Mmh I'd just use controlnet, like a canny or depth and a model with the style that I need
Stability staff posting on twitter more boring SD3 pictures, honestly what's the point? It doesn't showcase the new model capabilities...
yeah they sometimes show regular images that are already possible with current models
we want more complex prompts
unless of course we get access from that waitlist thing, then we can post our own stuff
I'll watermark them with like "SD3 EARLY" of course
or just simply say it beforehand
what is this guy doing, like what's his job? https://twitter.com/andrekerygma
X
Do you use SD or SDXL?
Anyone know where SD3 is available once you're past the wait-list?
Wanna know if it's in dreamstudio
SDXL right now
a discord server
What’s the difference?
SD3 will be smarter
😮
it will know how to generate text and it will treat multiple subjects better
You have to use it through a discord bot? :(
Shoot
I have noticed that when I tell SD to do coloured eyebrows, it won’t do it. Would SDXL do it?
Oh you're speculating
Hopefully it'll be available in dreamstudio
possibly, no idea
SDXL is slightly smarter than SD1.5
😮
not really way better but
might as well wait for SD3 and the finetunes of SD3
have you seen the shirt text?
yes
so...?
Give sdxl a chance, it has 2 text encoders, some models follow the prompt better than others
yeah its a base model + it's half baked/not finished training
I wonder if SD can make a good Sonic
1mp is kinda bad
details are blurry
Why is SD max at 2048?
yeah the model is probably still 1024px
I always use highresfix anyway so I don't mind
and considering the VRAM issues that might arise thanks to the HUGE 8B model, it's understandable
Hopefully a 4080 is enough
they should have done a 13b but I can understand 8b for training purposees
the 1.58 Bit paper might help with that, or for even larger models, but they'd have to start training all over again, even if it was faster 😬
are you talking about the twitter pics? i think that's cause twitter compression is bad
can't diffusion transformers be quantized similar to llms?
perhaps
its interesting
yeah the compression is terrible, but I have noticed that images are sometimes rather blurry in a way
but, again, highresfix usually solves these problems
well that paper says you need to retrain from scratch, quantize methods are done post training
oh you mean that
oh yeah
I don't know about stuff like if int8 would work
or GPTQ 4-bit and stuff
I don't understand what would work
u get speedups because the multiplications turn into additions
int8 could be applied to the text encoder of pixart-a and deepfloyd to heavily decrease their vram requirements
to like ~8-12GB vram
we don't even know the text encoder or clip model for SD3
ik pixart-a has a LCM version
no technical paper STILL 
Is sd3 out yet
no, but you may enjoy the most boring and random pictures of SD3 posted by stability stuff on shitter 😄
like if I wanted to generate a sign saying "McDonalds" I can use controlnet to make it so the text isn't all fucky?
Yes, you can use controlnet or even ip adapter if you have a reference image, just don't set the strength too high or the model will struggle
Hi at all 🙂 in which cannel i can create ai images? 😁
i'm afraid you can't currently, as the bot is down.
Ah ok :/ but which channel is it when the bot works 😊
i'm not sure. i've only been active here after the bot got shut down temporarily
#1047610792226340935 will be green when the bot is available again
update, still red
Imagine/
I don't need to imagine, I just scroll up in here 😛
that much hope can kill a man...
I'm only interested in bot cuz of the prompts
I've scraped 300,000 prompts
But MOAR is better
think of the possibilities, or dont
they are soon
😆
i wonder if sd3 will benefit a lot on Ada cards because it has the transformer engine and it's a transformer model. not sure how that works
SD4 when
SD4 will reproduce the real world more accurate than the actual world is looking. 🚀
SD3 soon though
max steps by default is 150, what if I put in 10,000 steps? kinda wanna try it lol
9875 wasted steps lol
probably 9960 wasted, depending on sampler.. hell with the new lightning models, that is up to 9996 wasted steps!
I know youre right but why is it bad to have that many steps?
just wondering the reason
I never really tried
It's just wasted effort.. the algorithm is just adding a small amount of noise, then removing it over and over again at that point
Just wanna know if SD3 beats ideogram 1.0 tbh
yes
probably will
yes since its open source and we can finetune it
Waiting for SD3 is excruciating, even when it comes out, not sure if everyone on the waiting list would get access to the models immediately, or if the models would be up for download any time soon.
Lol so SD3 not happening today obv
I'm not fine tuning it, just hoping the base model beats it
I mean, ideogram 1.0 is really good
It's main problems are hands, faces, and artifacting. Everything else is awesome
I'm sure in the short term, there may be the requirement of Lora's or new training to undo some of the fine-tuning that SD3 may apply. I'm worried they'll butcher the hell out of it, leaving it down to a mixup or trade off between 1.5 and it's generative and adherence to prompts.
as far as weights, they're moot. ideogram only sells a service and doesn't provide weights
stability is in the weights game
Idm open source or not, I use either way
far as we know, ideogram is a room full of a million monkeys. we can't be sure
Also, I'm hoping SD3 early access is through dreamstudio anyways
what diff would that make? they'll almost certainly do the initial preview through a discord bot
I like web UIs over discord bots
not really consequential to the model weights thoughaint it
the preview is just going to be the preview. not the actual release. we might be a couple months out from that
Just my personal preference
I mean... It makes testing easier if I can just switch between tabs and access a web UI without signing into discord on everything I got
the testing is more for their purposes too. i think they'll be employing rlhf again.
how to use sd ? i'm a newbea
Be interesting to know if the bot would allow private channels and sessions for creation. If we're testing it, the implication is to abuse it and spot for discrepancy. Unless they mean looking for errors in generation, in that case, I'd assume the testing phase would be much quicker.
well do you haev a good enough computer or you want to run it as a service
i'm just download some ckpt files , install comfy-ui locally and run default prompt successfully but i'm very confused about the image
yeah , my computer configration can run most llms.
What models are you using, 1.5, XL, 2.1?
how much vram for sdxl is needed again
bmp 10Core 30Gpu 36G Ram
sorry , i don't know what model i'm using now,the model name is v2-1_768-ema-pruned.safetensors
VRAM ≠ RAM
Only generate 512x512, not higher using that model
Or at least that's what someone online said as the output got weird at higher resolutions using that model
OK,ehh , I will choose another model best suitable for me but first i should learn how to begin so where can i learn from
i have learned some prompt skills but the sd prompt is so different
Different to what?
in normal case ,we will using Role + detailed requirements + few case + some others to construct a prompt , but in sd i just see much more adjective
emad im waiting

SD3 will probably be the most ethical AI model ever made
so take a guess
Is there any introductory post?
"Ethical" according to whose subjective morality? >.<
A lot of the times, people assert "ethical" in their "AI safety" papers but have no real argument for particular content they wish to butcher in their models.
Like, names of people, personal identities and various particulars of the sort is certainly one thing, but purely fictional content that can be abstracted from these models have no "ethical" boundaries.
only n00bz care about base models
make it as censored as you want. community will expand its knowledge on anatomy farther than you could ever imagine
when a model is heavily censored there's hardly a way to fix it.
just look at 2.1
well if it trains like ass cough cough XL
2.1 had a crapload of issues with text encoder
we need a model that is as smooth to train as 1.5
and i have a feeling SD3 won't be any different or maybe even worst
until then, its all just recycled XL
yup. wasnt born yesterday. thts why I smh when I see ppl hyped
cant see how this will be any different
thankfully 1.5 kicks ass. I can be as patient as they want
what people still make nudes in sdxl ...
we won't have another great influx of users or hit of relevance unless a propietary model leaks again.
there's no way an open model does that on it's own
i have hope, stability learned from the previous releases
open or closed model has zero bearing on its performance
no they haven't, why can midjourney and other propietary model makers can train on anything but it's just SD the one that has to remove artist data and have a way to opt out?
depends,if its a closed wallet model vs an open wallet model
doesn it make sense that the corporate models are the ones that have the liberty of manupulating any data while the open source one is the one that has to comply?
i prefer SAI than the models that while damn good seem to produce their distinctive look that you are stuck with
it will prob be as censored as SDXL and its okay, those custom sd1.5 model will not go away
by performance you mean aesthetics? closed models will probably always be better than open cause closed models are backed by capital
resignation is not a solution
time will tell, money shifting, some of the winners now will be the first to fail
an overly censored model is a lobotomized model
When it comes to model training and construction—unless they are re-training the model without the questioned data, then trying to reduce its influence based on an arbitrary statistical threshold is only going to weaken the overall performance little by little
IMO
ill judge when its out, and well see if those removed artists really make a difference or not
for sure
does animatediff always generate super blurry gifs?
Why do you need to wait to know it's censored? We already know it is.
if u use a low resolution,yes
what resolution do you recommend, i've been using 512x512, does hi-res fix work?
yes u have to use high res fix, 1.5x or 2x depending if you have the vram for it
Would you use hi-res 2x for 1024x1024 or no hi res and generate at 1024x1024?
512x512 then hi-res 2x or 512x768 then hi-res 2x
"it will prob be as censored as SDXL and its okay"
the thing is that a lot of yall think this will be another sd2.0, judge when its out
people still to this day say that sdxl is censored. the unstable diffusion community is filled to the tits with misinformation. anything to keep the donations flowing in
even stable diffusion 2.1 the censorship was fully overblown. people just didn't know how to prompt for it
well there are things the base model cant produce sure, but where I'm struggling to understand is the notion that 1.5 offers something that sdxl doesnt, that makes no sense. what exactly cant be trained into sdxl?
if u have money u can train anything to it
yes, this
money, or will, or time
or the desire to learn how to train in the first place, not just think that prompt enigneering is the beginning and the end
2's biggest problem was that the architecture was locked in and difficult to train. i don't think that was intentional
What's the quality differences of sdxl vs hi-res sd1.5?
haven't really seen big noticeable diffs
only reason i like sd1.5 is the speed, helps me visualize a prompt way quicker. I wish sd3 supports smaller resolutions for faster inference
for animatediff, 1.5 works better the sdxl motion model is not very good
afaik, stable diffusion 3 will have different versions.. one with an 800m parameter count
same as 1.5
ok, maybe, but after a couple 5 sec clips of flickering video, that gets old and back to generating something worthwhile
do we know if the 800m model supports 1mp? or it's only 512x512?
Advantages in coherent composition
dont know. i imagine a transformers based model is going to have a wider resolution operating space too
will can only eat so much spaghetti
on 1.5? hard press on x to doubt that one
hi res sd1.5 is slower than sdxl normal res
no
sdxl has advantages in composition, yes
a lot of people sleep on prompting the seperate clip layers of xl. the openclip model is exceptional at catching prompts
ah misunderstood you thinking you meant the reverse
doesnt really matter when only thing u create are portraits or asian girls
heh
i'm creating artistic porn so for me it does matter 
I'm not sure those 2 words were ever meant to be used together
they were if your name is Sarah Lucas
Well, it is significantly more coherent than older diffusive keyword based models, so any ability to train easily makes for a significant hype around it. I do also hope it can train easily, but how butchered the model becomes before we get it is definitely a worry. The hype is more so the hope that we'll be able to easily train with it.
i don't expect anything, in fact im now hoping the opposite, maybe the model is bad and is the worst of them all, maybe that'll be the only way i could ever find SD3 as an advancement.
well if u create the hard type of stuff its better for u to stay with 1.5
Ahh, is there a dreambooth for sdxl? Or would you recommend just training a lora
lora is dreambooth
but yes you can do full checkpoint dreambooth training or lora training via kohya scripts
lora =/= dreambooth
Depends on your patience and resources
Lora is a lot faster and a lot smaller
If you want lots of models the disk space can add up
Granted that's cheau as shit now compared to a 4090 so that shouldn't matter too much if you got one
wtf is a cheau
it's literally called dreambooth lora, it's more or less a tweak to reduce the trainable paramaters and the resulting file size
whatever, it's not dreambooth 😄
thats because engrish from japanese transration
dreambooth on kohya is a lousy choice anyways, at least for 1.5
which one is higher quality and closer to the trained faces?
most say the full checkpoint training is, and you can also extract the lora from a full checkpoint, if you are ok with generating a bunch of 6G files until you are satisfied
personally, I'm always content with a lora, but seeing is believing
Celestial Hovering Entertainment and Utility device
Finetine yeah
I've gotten really good results training a Lora and a lycoris-ia3 on the same dataset
Then combining them
btw, the fact that you can extract the lora from a full dreambooth checkpoint is more evidence that it's in fact dreambooth, but I digress
hope SD3 includes a lactose intolerant mode
finetine?
It's not. It's just creating a lora from the difference of two models. Turns out that's really effective at separating what dream booth changed. Thing is, they're all just doing math to floating point weights
So training both a lora and a lycoris on the same dataset then combining them outdos the quality of a dreambooth checkpoint in your experience?
So btw, your biases are showing
we all have biases
Yes, like a sharp antler
Full checkpoints are typically the domain of finetunes on large datasets these days. I don't think many dreamboothed checkpoints are doing too well. Loras make more sense now a days
Extract a lora from juggernaut XL and it won't be nearly as good, since it changed a lot
hey guys i am trying to get stable diffusion running on a local VM. my problem is that i can't seem to figure out how to get a pass through GPU working on vmware workstation 17. is virtual box better?
VirtualBox does not support GPU passthrough
what are my options for VM software then?
Unfortunately that's all I know
im gonna train a lora to region lock ppl
Are you just trying to run it on Linux on a windows box? If so. Wsl2
yes, just trying to run some ML stuff on linux VM
on windows host
@acoustic solstice Nvidia Toolkit with WSL2 running on an nvidia GPU. If you do "WSL--install" and let that complete, it will install the latest Ubuntu env on your macine... THEN install/reinstall Docker Desktop. THen you will have Nvidia Toolkit for a Passthrough to containers... --- you can run BASH from Powershell, or be in the Ubuntu environment. You can then run "wsl --install any time to get into ubuntu... run nvidia-smi and it shows your you GPU passthrough system : https://i.imgur.com/UXXt3lg.png -- Watch this vid: https://www.youtube.com/watch?v=PB7zM3JrgkI
If you use animatediff on forge, does the length of the gif determine how much vram it uses? or is it resolution
Im using hi-res fix on a 512x768 video with 400 frames , says I have 14gb vram left but it just hard lags my PC and it becomes unusable
@young hatchawesome, thank you!!
too many frames
Ahh, I will try 300frames to see if it keeps lagging
still too many,u can probably push it to like 120 frames at like 8fps or 12fps
depends on your gpu
lol
no its just that animatediff wasnt meant to do that many fps default fps is 6 and u can prob push it to 12
yes,heavier with controlnet
ahh
Sora when
How do you make the generations less snappy? Like random things disappearing and appearingin the background. Are there any settings that fix that
thats just normal u either remove bg or you edit each frame with something like photoshop
but if u are doing a vid2vid workflow u can do it at like 15fps + controlnet,it depends on your gpu,whats yours?
4090
yea play with interpolation or just gen a 15fps vid then do interpolation in another program
Okay I will try that
If I want to generate images with text in them, are there specific loras that can do it better?
A lot of my text end up wonky or with missing/extra letters
yea harrlogos lora for XL
Can I mix that with anything else?
to be able to get the text first try
or is there a way to prompt it
yea u can prompt for whatever text u want
I got the 4080
When using controlnet on a batch images, how comes it preproceses both before and after the video?
sorry idk i dont use animatediff vid2vid
i thought the fps setting was just the playback speed of the file. if i put 15 fps with interpolate of 4, it plays ultra slowmo
I get that issue too
interpolate is supposed to raise the fps so 15fps with inter 4 supposed to be 60fps
but I also get just 15fps but .25 speed
it just adds frames
yeah on the animatediff doc
it says it adjusts your fps
so added frames don't slow down your video
its the context size that the model cares about. 16 context batch. the fps i'm certain is just for the post process when stitching the file
if I change context batch does it do anything
thats the batch size i'm pretty sure
hmm so if I did 32 it would need more vram but would generate faster?
For those interested in the academic discussion on generative AI in general, and in this case particularly in visual generative, Alan Warburton (https://alanwarburton.co.uk/) is hosting a webinar in a couple of hours as part of an AI webinar series at Umea university (https://www.umu.se/en/events/fraiday-the-wizard-of-ai_11886311/)
is it being recorded, thatll be 1am here 😦
does anyone know any method or extension that would allow me to inpaint in high resolution (>1024x1024) without leading to distortions? like something similar to hi res fix but for inpainting
It depends on the fps at which you decode the images back with ffmpeg
For that you need to use an sdxl model
Its trained on 1024
right
is it possible to merge 1.5 models with sdxl models and get decent results?
not a thing
you can use the result of a 1.5 model, and then inpaint that with a sdxl model sure, but mixing the models themselves is not a thing
rip
my sister in law has asked me to edit some of her wedding photos to hide her baby bump but theyre all in 4k
any good sdxl models you can recommend to do that lol
ay what WebUI are we all using?
the one that suits your needs. You re welcome

mostly it s a1111 (or forge), or comfyUI or fooocus.
Depends of what you want to achieve
Is she pregnant of her own free will?
lol ofc its my brothers kid
I was fishing for new ones to try because my A1111 bricked and I can't make it work anymore
Then why try hide the bump, pregnancy is a beautiful part of the human experience..
might want to check with #🤝|tech-support to fix it then
ig she wants some wedding photos where she looks slim lol
Nah I tried that already, and we got it working briefly but it went back to not working
Online generators are fine, but it's limiting after using a local SD Webui for so long
what a1111 extensions did you have?
in that case try installing forge from scratch in a different folder (its a1111 but actually fast)
tyty
once forge is installed, then copy your model files over
I'll have a look
do note that not all extensions work in forge
That's alright, I don't really use any
The CivitAI one for uploading images and stuff
And that's about it
Personally I also don't notice any significant speed improvement when switching to forge :p. Not sure why so far.
what gpu are you using?
2070s, so I should get "30~45%" inference speed improvement according to them
when testing my speed when from 8.9 to maybe 9.2 it/s on some simple 1.5 stuff, no lora, no lcm, no controlnet, etc
yeah, i think they padded the numbers a bit, since i'm getting about that much improved speed on my 1060 3gb
I did notice it used about 500mb less vram tho
checking back my notes from a few days ago
=================================================================
sdwebui 1.7 --xformers
Total progress: 100%|██████████████████████████████████████████████████████████████| 60/60 [00:06<00:00, 8.91it/s]
=================================================================
sdwebui 1.7 (dev branch) --xformers
Total progress: 100%|████████████████████████████████████████████████████████████| 240/240 [00:27<00:00, 9.14it/s]
=================================================================
sdwebui forge
Total progress: 100%|████████████████████████████████████████████████████████████| 240/240 [00:26<00:00, 9.36it/s]
=================================================================
sdwebui forge --xformers
Total progress: 100%|████████████████████████████████████████████████████████████| 240/240 [00:26<00:00, 9.74it/s]
But many people do report speed improvements. So it s worth a try and it s probably a "me" problem.
ive had good luck with it then.. had a 1070 that gets 1.3it/s go to 2.2it/s with forge, the rtx5000 on paperspace went from 7.7 to 13, and the a4000 went from 9 to 14.8
i also find forge seems to load models faster
all tests with a 1.5 checkpoint, 50 step euler a at 512x512 with no prompt and 7 cfg
although i havent tested
I d recommend setting higher steps value. It gives more time to SD to stretch its inference legs properly for this test.
i would like to know aprox how many images per minute or seconds per image it will be for sdxl @ 1024x1024 with a 4070. benchamrk i found only shows old version of stable
can i also use sdxl to generate 512x512 at faster speeds? ty
you can check the sdnext benchmark database for that https://vladmandic.github.io/sd-extension-system-info/pages/benchmark.html
i run batch count 4 and take the speed of the 4th image
it s not perfect but it should give you an idea of what to expect
dont see any using sdxl model
hmmm yeah but you re still gonna hit the VAE stage pretty early. Which could mess the numbers. It s like you re giving SD an hiit workout instead of a long run. Not sure if I m clear on this one.
speed difference between 1024x1024 on a 1.5 model and 1024x1024 on a sdxl should be neglegible
ty
I did try 100 steps initially, but found 50 gave pretty much the same result plus or minus .1 it/s
can sdxl generate 512z512 or only resize after
Ok then.
You did your homework
sdxl turbo is designed for 512x512 - full sdxl models are designed for 1024x1024
ok ty
copied off toms hardware, but without access to any cool gpus 😛
figured out i can just inpaint in 512 then go over the inpainted bits in segments with a low denoising strength to get better quality
I hate it when industry people post dates and then can't meet their deadlines. Thanks for generating hype and making us wait, Emad!
(to clarify - no date and then have it drop in our laps 1-2 weeks from now would've been preferable... afterwards words gets around fast)
we are in a such a bad situation, there's no point using SDXL anymore and we don't have SD3 yet 😄
Good god i'm so tired fighting that censor dog in Dall-E, just... just kill me please =((
We have Cascade... but I don't feel like trying to wrangle that strange beast when we have a potentially way better Model with different architecture around the corner.
Yeah, Cascade is dead on arrival since SD3 "soon" 🙂
Wdym? It's been less than a week, and iirc they said at least 2-4 weeks...
think they refer to user generating, inside discord
I just hope SD3 will have some controlnet or ip adapter weights released together with it, I don't care if they take one more month to release it
It was a really strange way to go about things... I mean the SD3 announcement came like 1-2 weeks after Cascade? (and tbh Cascade is like a talented, but strange alien cousin of SDXL)
There was an Emad Tweet hinting at yesterday... and then yesterday he was like "oops, lol, not there yet."
That would be amazing, but I don't think so.
I'm absolutely looking forward to see if we will be able to properly fine-tune it on consumer hardware though.
im interested in what the multi-modal aspect is for
cause with LLMs that usually means it is able to use images
maybe the technical paper could explain a lot more to us 
Probably language/image/ maybe motion
Did anyone get called from the wait-list? 
Still waiting for XL based inpainting models. I hope SD3 will have one.
There's a bunch now
I only found controlnet based or similar hacks that only works is you set inpainting resolution to same as the image. Well I do inpainting in 6k renders professionally and need a proper inpainting safetensor model for a1111.
There's a sdxl base one, a juggernaut one, a realvis one, I can't believe it's not photography one etc
Link?
where mah sd3
"not yet fully implemented"
I mean, read the description. It's not a real inpainting model.
Only for automatic1111, sounds like it's potentially working on a number of others such as comfy,foocus. It also says it's achieving results similar to 1.5 inpaint. Worth checking out imo if you really need it
no one inpaints XL with comfy
this is a bit of a dumb question but does anyone know of an a1111 extension to make notes somewhere in the UI
even if it's just the hackiest thing possible
i keep forgetting the valid resolutions for this SDXL model because they're specific strings of numbers i literally never use otherwise and i don't like having a separate notepad/obsidian file for it
How much do negative prompts like disfigured body, two heads, extra fingers etc. help?
I am pretty much getting the exact same number of weird proportion images as without them
i think theres an extension for a111 that has a bunch of default resolutions
I tried copying the list from pins in #📝|prompting-help
is there such a thing as too many negative prompts that result in worse results
perhaps aspect ratio helper will help
I think if you use the tabs vs the drop down at the top to select the model, you can input some metadata, like preview images etc, can't remember all of the choices
yea i think that was the name,there were 2, one was abandoned and the other one
yeah like the specific resolutions are
896 x 1152
832 x 1216
768 x 1344
640 x 1536
i usually shoot for the second one (1024x1024 is also an option so 832 is technically 'middle' option)
but there's nothing like, cementing those numbers in my head so i have to check my notes
i'll probably get used to it
i should probably just set my default res to be 832 x 1216 since i almost exclusively run this sdxl model
Surprised that people here really do think SD3 will surpass Ideogram 1.0
You'd think the open source model wouldn't surpass closed source because of it being artificially limited
you mean the other way around ?
How is an open source model artificially limited
Because
I mean, look at Gemma compared to Gemini
Or well, literally all of open source LLMs
Open source LLMs barely surpass gpt 3.5, let alone even get get NEAR to gpt 4
you’re assuming ideogram has the same amount of compute and funding than openai and google
what a stupid argument
Ideogram 1.0 is really good tho
I tried it, holds itself up well to the SD3 demo images
yeah and you can compare it to the early preview pics of sd3 and conclude sd3 is slightly better
even before rlhf
??? I tried it myself, I prefer Ideogram's images to the SD3 demo pics with the same prompts more than half of the time
Won't say it's objectively better
But damn, it's not bad
they are equal at best but I wouldn’t say it’s better
it’s also crippled by being closed source
censored, will be slow to implement in painting controlnet etc
open source is just much faster at this stuff
see what they did with 1.5
Well, I don't really care about half of that stuff
Sure, more control is better
But all I need is text to image, inpainting, and image/image and text to image
BUT
I don’t think they have all of that yet
Ideogram does a lot of composition that looks like a collage approach
