#š¬ļ½general-chat
1 messages Ā· Page 107 of 1
i believe that'll be possible. it uses transformers architecture
it could be achievable using multiple programs
and the fixed code being written out is probably only possible with the T5 model
as it will be a VERY long text
i dont. or at least my conscious mind locks me out of those memories
maybe up to like 30-40 words at best if you're lucky
have not seen yet example with such a long text
I have not found the shirt example
which has a much longer text
nice
https://twitter.com/Lykon4072/status/1766239022735569070/photo/1 lykon got a type
she looks like first albino gigner with those weird patches around eyes missing freckles, first ever gigner albino
yeah the sunglasses tan freckles... weird
or she ran out of makeup to make those fake freckles like some girls do thosedays, fake garbage
yeah it looks odd
anyone know if theres a nify website with premade poses for openposes?
are you suggesting aht a stable diffusion girl would do fake makeup freckles? Why i never!
https://twitter.com/EMostaque/status/1766247808670073244 emad hyping agi now. he got bored of sd3 and yeah. next week
Emad has ADHD....he's already forgotten us
wtf it links to this https://www.youtube.com/watch?v=iE39q-IKOzA
hope he follows through with that
is there a particular model I should use for an image with no people(or any living things, or robots or whatever) in it?
that guy has great videos
thats a prompt solve
kinda feel bad for them tbh, they just released this otherwise great model that has a few blatant issues, and it's beaten by a new sd model lmfao
models have billions of parameters. you don't need a specific model for most cases
alright guess I'll ask in prompting help then
don't matter. they aren't competing wiht stability. Won't ever release their weights and are keeping it proprietary.
survial of fittest
i mean... they kinda are tho?
in terms of best model they're beaten by sd3 hands down
business wise, not really
they know what they're doing by not releasing and keeping proprietary. attracting investors
other than artifacting, hands, and faces, they're the second best when sd3 releases in beta
cash heavy whales
its kinda weird how ideogram has weird faces
like celebrities just don't work well
you don't need ascii for that system. just give the model any cypher. that's all it's doing
thats the real research they revealed. Break jailbreaks using cyphers
I will have look into that later
I legit cannot tell if some of these comments in this channel are from bots or not
and as you suspected, it's not even going to last. it'll be covered in a week
if only gemini were patched as fast with its anti white garbage
its easier to assume people are bots. stick to that.
well, i came to conclusion maybe 15-25% of people are real people, rest are bots programmed by matrix
it wasn't anti white. oh geeze. the fragile white male reveals itself once more. smh
how it was not when it painted white historical figures as black and said anti white garbage?
all it did was salt prompts with some diversity. broken preprocessor was all
not that big fo a deal. the outrage is palpable.
im used to fact that coloured people have priority in name of diversity while hiring in uk so nothing would surprise me tbh
well, fact that it was just mistake is more surprising than what i was thinking
i leave simulation and they try kill me
"colored people" š¦
maybe i ask ai how fix my videocard problem)
Go for it
domo arigato Mister Roboto!
for helping me escape just when i needed toooo thankk youuu
chat gpt said if it doesnāt help to change the driver then itās faulty
i always said it's not driver, it's mostly car in f1, changing driver won;t solve construction issues
lol, good one
i need change car?
no, it was a joke
9/10 its a driver issue.
Yes as @vagrant fox suggests, a car change is better than a driver change
i need change card
then why Hamilton somehow started to drive like garbage when merc finally stopped to had superior car?
ikr
in f1 where most drivers are plus minus same level it's mostly car that makes difference
lol i dont know racecar drivers. i know i know. my names a palindrome but it doesn't lead my hobbies
f1 is not my hobby either, I liked Kimi when he was racing thats all f1 for me
I lived down the street from a stock car track. that was fun growing up
like nfs series?
must been nice
lovve nfs games but this trakc was a different energy. let me find a vid
when I was kid closest i got to nice engine sounds was putting plastic bottle between bike frame and wheel to make motocross sounds
so you win
i dont want car ? but if somebody give my sport car from nfs i want)
i don;t i would kill myself in lonely night trying to drive away from emotions
fold like pretzel on first turn
because i would drive too fast most likely and could not controll myself in fast car my bet
just trying to be realistic
with judgement
you test it in real life?
no, thats why im still here š
lol
give me 911 and ill fly off
hahaha that school bus
I have a fast electric bike, I broke my arm on it)
it's something magical about demolition that attracts all men
it was a good show. hit 2 pass racing is exciitng. modern gladiator chariot races
it reminds me of old game called Flatout
once in a while they would have tow races where ever car has to be towing something
i would like to participate in some wreck race in old beaten up car
could be plenty of fun as long as you don;t damage yourself more than a car
guys any idea when sd3 release?
ugh, control net image doubles my sampler time T_T
Hi, in Forge in Stable diffusion is there a way to save all Generation Data under a preset so that you can open a certain Generation Data with the click of a button?
and will sd3 be on HuggingFace or a paid Clipdrop version initially?
seen you asking few times, wish I could help but never used forge ui
clipdrop isn't stability anymore
Ohhh~
should be free
i'm guessing discord bot. but thats all we got. guesses
yess I just hope they make it free
and regarding local stable diffusion install, I have a basic Lenovo Thinkpad with 8gb ram and Intel i5... will it work?
Yeah, I try asking every couple of hours, incase anyone has any info. š Pretty sure Forge UI is derived from A1111, so I think A1111 solutions would work too.
i know auto can save all settings used in text file
then you can use option to load settings from file
thats one way yeah. if you have the meta generation data from another image. Paste it ALL into the positive prompt then click that lil arrow dooey
so ig a PC or a custom built setup is required maybe
anything with dedicated gpu that's have some vram and its not 15 years old
just something with a gpu that has it's own vram. some laptops do
Yeah, I have seen that too, ideally I'd be looking for this: "You can save generation data from an image under nameXYZ. Then via dropdown or another method you select the saved generation data, so that it will all be pasted into positive prompt. Then you can press the little arrow thingy to apply all generation data correctly." My explantion wasn't optimal, do you understand what I mean? lol
so far you can do that manually and settings file is named same as image/animation generated corresponding to it
then you would have to manually choose this file to load settings
https://github.com/SenshiSentou/sd-webui-state-manager this one work? i should try it. seems like a good idea
https://github.com/harukei-tech/sd-webui-extended-style-saver another good looking one
why does generating images change so much between generations? sometimes 40sec, sometimes 55sec?
hard to say. could be a few things
I'll look into those, thanks!
think i like the look of the config presets one best but all 3 look nice
and for some reason, my computer uses 1.3gig vram at idle, i wonder whats taxing it
i have to assume its chrome
it goes down to 0,6 when i close chrome
how many tabs u use im watching now some video and it uses 0.8 gb vram
even wen i closed all tabs but one, it still said i had 15 open
all tho im using brave based on chromium not chrome itself
discord too
ugh this is so frustrating
u could turn off hardware acceleration in discord
and change chrome to brave or something light based on chromium so you could just transfer all your passwords and bookmarks
if you fighting for every inch of vram
that's a bit crazy
dont give that tool to game devs or we will have unrefined garbage mdoels everywhere, lol
one tab using 1 gig vram, thats a lot for me, when i only got 8 to work with =/
I can see EA or ubisoft shareholders meeting "If we use this tool and get rid of humans, we could push profits 15% up next year"
Oh itās too late, what do you think Bethesda is building into the next Elder Scrolls game? āA magic portal DLC that conjures any object you so desire! Only $texasā
seems way too much than it should be unless chrome is really so bad, it's been few years since i touched chrome itself
considering starfield it will be empty af lol
ill try to stop hardwhere accel
EA will get acquired by epic or microsoft if tehy cut design and inspiration out of their games even further
Establish bland identity = great way to get scooped up by corporate entertainment
if not golden goose (pack opening in fifa) they would die already considering how bad game lineup they have lately, ah and i forgot sims and 10202101212 DLC\s for every new sim game
i have tried to make f1 car and turned out to be so funny as model
like car ends at front wheels
try it out
i mean, fortnight and pop music are manufactured and churned out, but it's still creative humans doing that churning.
i don't believe AI arts will achieve true creativity that strikes chords with people, for quite a long while.
well considering people eat garbage that sounds like robot (autotune music) i dont think we far off
I think it depends on how rich the text encoders can get with their data, tbh. And that could improve more quickly than humanly imagined.
that'll be very "retro" before you know it. music shifts and changes with the times
yes but how many people still listens to old music, even in new generations, ofc there is big part thats shaped by marketing machines and new trends, but thats not all population
Sort of how the entire 80s aesthetic has been culturally digested into a weird shiny pink and blue neon vaporwave arpeggiated base
human creativity will be needed for quite a while in some capacity. once they find a person that has that edge people want, they'll train it and mass produce with AI. But the core of it is the person
hard to relate to a machine
machine does not understand what makes us tickle, its just going from point a to b, there is no inteligence in AI, not yet
yeah its pretty dumb ngl
self conscious and self learning ai will be real ai
i'm impressed by it like when i see an ant towing something huge. i'm like "HOLY LOOK AT IT!"
now its just a program made to replicate
right now i'd call it encoded intelligence
but i think any self conscious ai will came to conclusions that humans are not worth keeping on planet, specially where ai could deploy/manage robots to upkeep infrastructure so ai has access to energy and could keep existing
concious self aware machine intelligence isn't "artificial" and the whole AI name is kind of wrong. intelligence is real intelligence if it comes from a machine. Engineered intelligence? sure i guess. artificial is just.. its a dated term
depending on the way you look at it, has been made artificially in lab so could keep it's name, or like you view it could be renamed
i think it'll be emergent over time from a networked internet of encoded intelligence models just going at it
one day it'll just be like "hey guys i'm here"
When we study consciousness in a living brain, we find ourselves studying correlations between different hubs of activity.
So if we ever design consciousnessā¦. Itāll be emergent
those mods for CP are amazing
hi guys, sorry im new in this ai generative, but where can i use all this models? for example in the forum of midjourney i can blend images in the forum, but i want taht images blend with each other in movement...
cyberpunk red engine is pre good
What happens once weāre playing around with real-time NeRF environments? I mean, is that the next Minecraft?
anything called "the next minecraft" for hype, isn't the next minecraft
By definition, yeah sure
the next minecraft will be like the next among us or flappy bird. it'll come out of no where
Precedent-setting nonetheless.
Well, I would imagine NeRF tech will play a huge part in it
i think theres already nerf rendering in ue5
I mean the entire engine but thatās interesting
nerf's are engineered too. hand coded algorithms.
Right, but once we can start generating them with enough temporal consistencies, we can effectively offer them as a real time space
oh wait i'm confused there. gausian splatting is the engineered rendering
Right.
neurons crossed
Does anyone know how to replicate A1111's break function in sdxl with comfyui
you can use a custom node that breaks the clip up that way, but it won't always work with other custom nodes
https://www.reddit.com/r/comfyui/comments/15dmden/how_do_i_replicate_the_break_prompt_feature_of/ oh wait theres base support now. just gotta wire it i guess
I think the proposed solution there is comfy_ui Cutoff and it doesn't work with sdxl I believe
you need a super specific work flow since sdxl has two clip layers that need dealing with
https://youtu.be/V-mugKDQDlg?si=8q8dL7lDQDMcJ9cB damn you Amazon prime for making me want to renew
just gotta wire it up with a node that exposes both clip layers simple
the trailer should've been ron pearlman, an old soldier in the waste lands in a bar, reflecting back on his long past. "war. war never changes". thats the only time you see him. The intro to the show and the trailer
WHAT IFā¦.what ifā¦they did and theyāre keeping it a secret for the premiere
Sigh
if he's not in the show at least once...
I know right
i'm gonna ... i dont even know. it'll be atomic
i'll probably like the borderlands movie though. all it needs is production value and i'm easily impressed. i don't think the borderlands core lore is that sacred to begin with
I hope the Brotherhood of Steel character doesnāt turn out like another Finn
i like to speculate that i'll hate things., but even borderlands i'm expecting i'll love
i loved the original mario bros movie even. a little confused when i walked out but overall loved
Borderlands peaked with #2. It needs to be quippyā¦needs to have plenty of Rick and Morty-isms.
Yeah I did too actually lol
lil bob-omb wearing reaboks. whaaat
The fungus was legit disgusting though
Like awful looking throughout the whole film
yoshi too. fuck cinematic yoshi. i want to dump him off a sky platform
@karmic cedarindeed. It looks promising, but no mention of the New California Republic and the Brotherhood's motivations aren't clear yet. In the game they were.
I missed the new Mario, I figured I knew exactly what Iād be getting into if I did go lol
so, correct me if im wrong, using a controlnet pose doesent really improve drawing hands right? just the placement of things?
controlnet models can understand the openpose hands some. but goo dprompting, hires fixing, and dialing in denoising and cfg. that'll fix hands. adetailer has a lot of ways of doing a hand polishing pass too
openpose hands are part of the spec. they're different sort of lines
Itās like setting an impression in the sand and letting the water fill it. The fingers arenāt guaranteed to appear as exactly five
Ahh thatās good to know.
it was good. i liked it a lot
sonic too! i live beside the highway they filmed some of it on. so fun
hmmm i am using bad hands embedding but, it almost like it has 0 effect, maybe im doing something wrong
Sonic was a movie that knew what it was which is always good
watching sonic zoom past my house everyday. wowweee (no it was just crew following a truck lol. the rest was cg)
growing up, i always knew i lived in green hill zone
fkn knew i did
Sega IP has been more media-tested than Nintendoās so they knew what to do with Sonic
pitch him as ugly first to rile everyone up
no they actually followed through on that one
No, they faded them alllll out
The first shots had them all in full glam
All over his face
It was a more subtle version of the sonic thing
leto should've been in the a2a scene if you ask me. fkn leto
conolly never deserved that!
I tend to agree. I think he do well with the right direction, but he has proven himself to be a bit more rigid in more recent memory imo
Like his blade runner part was just cooked up in a drive over
yeah theres a couple times. i chalk that up to good directing too.
Itās a vibe. Making art is all about vibe
good ai product will be up to the directors (bringing it back on topic)
ai is like jared leto
AI is our multiverse
i never use embeddings. i dont know how they were trained. sometimes i've made my own but, i mean, they're just powerful prompts if you think about it
i do like style embeddings though
Iām curious to see if Cascadeās architecture starts to incorporate controlnets in more unique ways than what weāve seen so far
i want to see a thermal controlmap. like, hot bodies are swaety or you can make plasma/fire , ice blocks, wahtever
thermal maps
I could see them building alpha into it as a map
Maybe
At this rate thoughā¦I suppose it was more of an architectural experiment
i've thought ofa lot of abstract conrolnet ideas but i have no idea how to implement or train yet. still studying and practicing. trying to catch a handle. i think i'm missing a lot of fundamentals
yeah im starting to think its not such a good idea to use prompt embeddings
that just changed the timeline. we're fine now. thanks!
#1072236442463518882-with-images now has the context you need for this
i liked edward's batman too i didn't think i would but asa vengeful batman he was pre good
he made it work
i'm still team jacob though for obvious reasons don't think i'm crossin over
bro stable diffusion 3 looking extremely promising
And we will all be able to try it soon TM
TM? Does it come out tomorrow?
Possibly, but very unlikely
So in other words, you have no indication or knowledge of when it is coming out
That's what soon TM means, "soon trademark"
60% of the time, hes right all the time
tommorrow, or the next day. let it ride
wooo go stable diffusion!
my head hurt when reading soft-inpainting help for Automatic, can someone eli5
Are there any Models or LORAs y'all could suggest for SD that do DND Battle Maps well?
I love how the 4080 can generate the images in just a few seconds š
ęä¹ēØ
hi where to generate the images in the discord ?
The depth of field is actually subtle in this one
I'm optimistic about narrow aperture images
nsfw question but, is stablediffusion capable of minimally altering a reference image only applying a lora layer above it? like if I already had an image of a naked girl but wanted to add bukkake to it via AI
Hey Guys,
I've been using Dreambooth on runpod quite a bit last year (about 12 months ago) and I'm just getting back into it.
Obviously some things have changed and I find myself getting quite frustrated with the changes they made to the notebook.
It used to be that you just create a folder named "training_samples" and put your images in there.
Then they changed it to the imgur method which seemed more complicated but also worked.
Now they have a new cell that just spits out a bunch of errors.
One of the errors tells you to just put the images into the folder that the cell created, which doesn't work because after doing that when i run the training i get an error: images/folder not found
If I don't run the "error cell" at all and manually create the folder "training_images" and put the images there I get the same or similar error when running the training cell
Has anyone here recently used the notebook and can explain how you made it work?
I would also like to know:
Can I use Dreambooth to train a SDXL model or does it only work with 1.5 models?
Can I use a model from civitAI that was trained on 1.5? (or in general does Dreambooth work with safetensors or only ckpt? )
Thank You
Anyone here have experience with DINet and know the rules of thumb for getting good output? I've learnt that the face must face forwards, no turning to the left or right, and that the face must be in a high res video, and that it cannot be a close up... but damn it's picky about what it'll work with. It's so good when it works though, I'm hoping people can tell me what to do...
Why
Just why
Why did you use an NSFW example
why did you say that? Why aoespace? Why, tell me, why? You were my hero and u come out and say this u doodoo mouth
poop mouth.
but yeah, this is the weekend. iq drops, cringe intensifies and eventually you have someone ranting about censorship and freedom of speech when grown ass people dont want to have ick conversions.
I am ashamed of you aospace. I've known you for so long and did not think you the kind of person you turned out to be.
When does sd3 come out guyz
itāll probably be this coming week sometime.
is embedding lora or checkpoint?
what do you mean ? Are you asking if embeddings are lora or checkpoints ?
If so the the answer is neither
neither
yes, in which folder should i put it
Embeddings are the results of textual inversion
embeddings, not in your model folder
sorry my english is not the best
np
its a fair question though
embeddings are what we had before Loras.
They're fast to train and easy to use but less effective.
i found one on civitai but I'm not sure where to put it
Good evening, Gentlemen
stable-diffusion-webui\embeddings
for more technical details on those https://www.youtube.com/watch?v=dVjMiJsuR5o
thank you so much
are some models just not able to generate animals alone?
or are they just trained so hard on some things they just cant?
no matter what i type in the prompt, a girl shows up
some models can be overtained for sure.
lol doesent matter how many brackets i use
use negative prompt ?
yeah, ill try to remove it now
no I meant, try using it if you're not :p
stupid sugestion maybe, but try to restart
try asking in #šļ½prompting-help then, and give more informations about your prompt and setup there.
I am thinking out loud, but have can you use SD without a UI ?
ill try another model
it could be that the AI is remebering a aerlier prompt
now i got the most cursed picture ever, im not going to be able to sleep
trying it now
same problem, it seems to be aproblem with the model, no matter what i type, same thing shows up
we can move to prompting help perhaps?
If I get a really beautiful picture with some bad detail (usually eyes or hands), should I Inpaint before or after hi-res fix/upscaling?
Try using parenthesis and a colon with a weight, eg (animal:1.5)
sure can
That's not how stable diffusion works. It has no context so no it can't remember previous prompts.
thanks, but it seems to not work, its always a girl in the generated image
oh wait, first cat
maybe it does work
i see this speculation a lot. many people seem to assume it and confidentlly believe they're right. it must be because chatgpt does it.
yeah that makes sense
people would associate it with SD cause they are both AI
one guy tried convincing me because both prompts look exactly the same, one with "green eyes" and one without. but it was the lora they were using.
LMAO
and chat gpt only does it because the coded interface loops the conversation back into the model
not because the ai model is learning as it goes
that would be sick though
give it a few months ...
long as we can reset it to base weights
i don't know if training / inference have blended together yet on any model. not enough information to be sure though.
mischief loves a vacum
Now should i have an emma watson'esq receptionist for my website or a nicole kidman. hmm, maybe a blend of both
dont use real identities. that will get your business in trouble.
Meanwhile Iām over here touting my YouTube account, ālobabobloblawās libibibibraryā
Real names be damned! lol
you have to pay to use emma watson's or nicole kidman's images in your commercial endeavors. this isn't an ai safety thing. it's just a fact of society.
Eventually itās going to be settled thatāunless the rights have been given to the AI model to generate the contentāitās no bueno to violate.
I like to think of these larger discussions as diffusions of a different magnitude. We are diffusing how to diffuse. lol
How many µg?
lol no idea
hollywood's idea of how ai progresses. i honestly think this movie still holds up https://youtu.be/GJ1KcXeNstM
naw. old laws still apply. if you generate a derivative work, current laws cover that
or models that are trained specifically to represent a copyrighted concept will be considered derivative eventually. acourt will hold that up. old laws still apply
Well, the laws need to be engineered into the frameworks of these models quickly or else a cultural epidemic is at hand.
epidemic is an over statement. some lawyers will get paid and some gung ho entrepreneurs will fail hard
If I were to fear something in the future, it would be a group of people who decide to snap-judge how to deal with a problem as complex as syntactic copyrights
new laws that accelerate prosecution should exist, but only where actual assault and crimes exist. like ai extortion or revenge porn, or just nerds being sadistic towards celebrities like the fappening
Agreed
Oh i am not. its a local install of sd and im just playing around.
generate all you want on your local machine. you were talking about your website
yeah, i was kidding about the webside side. no way any of my 'playing' is ready for [prime time yet
its the publishing part thats troublesome
definitly
There will be sweeping legislations soonā¦not probably until later in the year iirc but Iām doing my best to prime myself for what could happen.
in canada the laws are coming into play retroactively. so if you've already assaulted/extorted/harassed people with ai images, you're in deep water
the thing with legislation is its country specific. surte the US could extradite you but if my dis-information site is domiciled in the democratic republic of the Congo, what are you going to do?
It makes sense to start with those who are the worst offenders as well as targeting the models themselves.
i'm not sure of the bill. i'm not geared for cspan today. its saturday and i got yardwork i'm gearing up for.
one of these days i'm going to deep dive on the measures happening in our parliament
True.
report how much money your site makes to the congo authorities
make your life difficult
The MPAA/RIAA/ICE have tactics. KimDotCom thought he was legit. He had to sell out ninja video though and then still faced charges anyways. (anyone remember that nonsense? lol)
Oh sure.
theres all sorts of ways to skirt enforcement. but the enforcers play the game too
From the moment I started prompting with AI, Iāve done so with a mindset towards humor, empathy and the values that I hold true.
Hey im new, im playing around with stable diffusion and sometimes encounter a problem. Right now im trying a bunch of different Checkpoints + the cyberpunk lucy lora with a weight of 0.7/0.8 And its overcooking the colors alot and sometimes just putting weird textures into the picture / into her face. Why does that happen?
The MPAA, RIAA have been trying to stay relevent for decades. the quicker someone wises up that the gamne has changed the better, Old world moodels where I go sit in a cinema and watch a movie are long gone. the lat movie i watched, i asked for a refund from Odeon stating I was not entertained. their response was they offer a seat not entertainment. IK now entertain myself with streaming services. Where most of teh big films come out on after a few months
Ive stuck with sarcasm and humor
i mean... they won. Streaming services has piracy at an all time low.
they're still relevant. don't be silly.
Yep, and as the Vision Pro experience becomes more accessible to the mainstream (read: less $$$) then the theater will officially become virtual, your friends will sit next to you and all be in the center of the theater, etc.
so much money. Huge armies of lawyers. When the motion picture and recording industry start wayning, then you can call them old and tired. they're fully operational right now
nLike a zoom gallery view
Silence is golden. And thereās a lot of smelting happening right now.
they are only relevent in the US. not really here. here we have our own idiots BBFC
theaters already make less money than home streaming. spielberg said this a decade ago
Im not sure they won. plex is still; going, there are a millionn torfrent sites ands i can walk through downtown Bankok and get any movie I want. they might have reduced piracy in the 'first world' where we have money, and given up with the 'third worls' where they dont have any money
well, sure, they won't ever irradicate piracy.
but they are making more money now than ever before
Using Fooocus with Juggernaut XL v9 + realism preset. Works really well, possibly better than realisticStockPhoto v20.
even North Koreans have bootleg copies of movies lmao I don't think streaming services give a damn about places that aren't big money
Itās also a form of cultural transference and thereās abstract value in that
that said its relatively difficult for me to find torrents on netflix stuff
compared to say, anime
I canāt remember the last time I torrented something. š³
torrent sites are a pain in the ass. just pay for usenet access instead.
so 3090ti is worse than 3090
is that youtube hype or based?
agreed. i just use them for ubuntu downloads occasionaly wne a new release is out
Hi, I have just integrated Stable Cascade into Forge (link: https://github.com/blue-pen5805/sdweb-easy-stablecascade-diffusers). Should/must I also use Stable Cascade as checkpoint? (link: https://civitai.com/models/306055/stable-cascade)
that extension downloads it aoutomatically to it's own cache folder somewhere in appdata. I wouldn't call it "integrated"
is it possible for stable diffusion to minimally alter a reference image but apply some lora effect over it?
like if I have an existing image of a vampire and I want to apply a bloodstained lora
Wait so the checkpoint is basically also downloaded automatically?
Are generations faster with the igpu disabled?
Might be a VAE issue or might have the CFG value up too high.
what's the easiest way to install chatgpt4 locally? I'm tired of bing being annoying
and I don't know what's the best way to do this locally
for that extension yeah, it downloads into appdata
why would they be? no.
Sorry if this is a dumb question but when rendering img2img with a 1920x1080 batch what should I resize the outputs too? I have an I7 w/ 3090TI and my ETA was over 3 days. I interrupted that and then went to 1280x720 and it dropped to about 18hrs give or take. I am attempting to convert a video to AI and the original video is 1920x1080 30fps.
drop the video's frame rate to a quarter of what it is. 8fps should be sufficient. after that, interpolate new frames into it to bring it up to 30fps again
a 3090 should be able to handle FHD frames
Noted, Thank you. Installed Stable Diffusion Auto 1111 Awhile ago. I've heard the first generation would take awhile. However I sure didn't expect it to be this long.
Least it's a one time thing.
dont know how long so i can't say if its normal or not. if you're here complaining about it taking along time i'm going to bet its installed wrong. first generation shouldn't take THAT long
sometimes things need to download but the console shows that
Should I do that prior to sequencing the video or after generating the AI and exporting it with 8 fps? The video is 4:17 with over 7k sequence images.
My Instagram name is the same as this here, if anyone is interested in collaborating DM on Instagram. š
7000 images is going to take a LONG time no matter what. you've set yourself up for days worth of rendering
I know haha. That is why I am wondering what the best solution would be. The tutorials on yt isn't as helpful as I expected so I'm digging around š
you could look at LCM too, to reduce the steps. Make sure your VAE stays cached in memory. and cut the frames to 25%!
if you don't know where to cut frames out, i'd suggest you work on images more. You might need to learn more about video editing basics before you delve into creating long form ai videos
tutorials won't help you with a project of this scope
yeah it seems so. Just learning everything as I go š I just wanted to attempt a short video conversion with video to video and have it animated a certain way. I did leave my PC on for the night and it render about 4k frames and the animation was decent. Just took way to long for what I was wanting and learning trying to see what i can do to make the process faster.
Help
7000 frames is what ? 234 seconds? thats nearly 4 min of video bro
I understand that. I Was just wondering what are some skills/tips to reduce to load time. I think you gave a great example of reducing fps. Originally it was a 60fps video but I cut that down to 30 in editing.
LCM loras too. look into those
chop the video to 8fps and generate at that, then the final product interpolate frames up to 60
So i don't know how true this is
But apparently the text encoders for SD3 are also getting censored
made up. clip layers are frozen
Awesome tip. I will give that a try! Much appreciated flow!
Which basically means that no amount of fine-tuning could make SD3 create anything "immoral"
I might be missing something, but Stable Cascade doesn't seem to appear for me as available checkpoint model (reference image in #šļ½general-with-images )
you're just hearing rabble rousers trying to cause rage donations to unstability diffusion . they'll promise to make a new clip encoder but then never will
outrage donations are pioneered by Trump Inc
its just more americana dumbness
No I'm not going to believe that it's not true because i don't understand how any of this actually works so I'm open to all outcomes
you'll believe it's true though lol.
You're open to being conned
wanna buy a bridge?
Why are you so defensive about this?
the extension doesn't integrate cascade at all. it only exists in that tab
And what does trump have to do with any of this?
outrage donations. people donate to his campaign to MAGA. same outrage nonsense that unstability discord crew drum up
Oh ok, I am kinda confused now, how can I integrate it then? š
the clip layers are frozen. clip license even prevents further training!! open clip doesn't. t5 i don't think can be refined legally though
and you can't censor models after they're made. lobotomizing knowledge from a model once trained , would be a very difficult engineering task. And you could just add it back anyways since the base models are available
in forge? you don't
Guys hi I'm new here where I can try to generate images?
Not here for now, try hugginface
not here. sorry. bots were only available back during the sdxl preview. may be avialable soon for sd3 again
But I apply for stabble.ai waitlist
then you'll be emailed
Huggingfaces I have before I joined here
I've been using civitai's on-site generator, is hugginface better?
But you are saying is that it's still possible?
They email me to join here ..I tough I generate here images
Wait so the github thingy (https://github.com/blue-pen5805/sdweb-easy-stablecascade-diffusers ) doesn't really work?
with a lot of research potentially but that'd be dumb and they woudln't waste on that
it works. only in the tab though
you got an email? no one else has afaik
Only to join discord
I think I might be mixing stuff up, but what Stable Diffusion checkpoint should I use when I am in the Stable Cascade tab?
the cascade tab doesn't care about anything auto1111 is doing. that selected checkpoint doesn't matter. it uses diffusers to load its own.
its not integrated at all. it's just a tab slapped onto a1111
I won't underestimate them and will think of it as a possible outcome, thx for the explanation.
Whats the best model on huggingfaces I did try lots but I didn't like any
foolish. you want to be outraged. you need it. ugh.
the porn cult. seriously. you guys need to keep it in your pants. don't fap in public ffs
Oh ok, I get it now thanks š
Ok guys DM me with best model on huggingfaces I want to try to see if I like š thank you
Does anyone know how or even if it is possible to create videos like this (https://www.youtube.com/shorts/MeGa6FnoP3E) with Stable Diffusion?
animatediff. its not easy to use. takes some practice. start with 2 second clips to figure out how to prompt it and dial it's settings in.
I will try that thanks š
WOOOOO GO STABLE DIFFUSION
when sd3
yo guys how do you download stable cascade?
Here are the models. https://huggingface.co/stabilityai/stable-cascade/tree/main
yeah i saw that, and which one should i download?
stage_b.safetensors?
If you use comfyui I recommend the comfyui_checkpoints
i use a1111
And that's where my knowledge ends.
Is it even compatible? Haven't used that for a long while.
A friend of mine has been relaying some of my prompts to test them in SD3 and... I got to say, I really hope it has a lot longer to come along
At least from what I've seen, it looks like SDXL V2. It's uhh.... Yeah
Ok, I have read a bit on the internet. My new dangerous half knowledge says, that it is not officially integrated. But it is in the sd.next branch.
i used it with latest turbo models
oh ok
so in the page, should i go ahead and download a particular model?
I'm sure SD3 will be interesting, but it honestly just feels like it's kind of already outdated to me
Ideogram and what Pixart are cooking up seem way more impressive IMO
It depends on your hardware and in what software you want to run them in.
I do think that SD3 looks good, however they've kind of been touting it as being really good at listening to proms, and pretty much all of the examples I've tested have had major major issues with listening to what I'm asking for
With the T5 or without?
No idea, the images are being demoed by somebody else.
All I know is that even without t5, the results that I'm seeing are equivalent to SDXL's prompt adherence at best
I think the T5 will have a huge impact. On my wallet and the coherence
I think the visual quality is fine. SD base models have never really been made for their quality out of the box. They are meant to be trained/fixed for different things
I mean, in their demo papers it seemed like T5 hurt the performance
it had more spelling errors, and seemed to generally not get the context of what was being asked for right IMO
IDK. I feel like SD3 needs to have a really impressive trick up its seeleve, cause I'm really just not feeling it from what I have seen
Time will tell.
Nope. Bots are down.
Ahh, too bad. Thanks.
You can check #1047610792226340935
I did, but since the message was pinned on the 6th of February, I wasn't sure if it was still down.
Do you know any free bots that do similar things?
not really
I mean for coherence
I personally don't know that, sorry. A big percentage of the community is generating images on their own hardware.
Agree, it's good, we'll probably move to SD3 when it's time, but it doesn't look revolutionary
yeah, it seems as good as it should be
far less impressive compared to SDXL over 1.5 IMO
I've just seen some demo images from it using my prompts, and they get like a 3/10 for prompt adherance
Still not back?
training SDXL is kinda my job, but I still use a couple 1.5 models from time to time for things that SDXL has not been trained for
kohya, and some others
yeah
One Trainer would be an option
Flexible Text Encoders
By removing the memory-intensive 4.7B parameter T5 text encoder for inference, SD3ās memory requirements can be significantly decreased with only small performance loss. Removing this text encoder does not affect visual aesthetics (win rate w/o T5: 50%) and results only in slightly reduced text adherence (win rate 46%) as seen in the above image under the āPerformanceā section. However, we recommend including T5 for using SD3ās full power in generating written text, since we observe larger performance drops in typography generation without it (win rate 38%).
OneTrainer is one, yeah
btw, they limited the token limit of t5 to 77 to match clip
I give it one day and the limit is non existent anymore
is there a source for this claim?
I expect most people to just not bother with it
the paper
It has a gui. I use it, but only because I can train cascade with it.
And easier concept management
page 21 at the top
https://arxiv.org/pdf/2403.03206.pdf
bump
its closed sourced
you cant
any local sdxl with comfyui guides for me to learn? just chug me a document link and i can start reading
i managed to make this not stable
i have an i9-12, rtx 3080, 32gb ram. and i run a1111. what do you think would be the best for this spec?
Hey guys when can we expect SD3
I'm itching. Checking google everyday like a spastic. Can someone put me out of my misery
and its not like you could run it lol
tru
here are some new posts
these are just portraits again though :|
I do like how the ginger woman with the glasses isn't looking at the camera
sad day, had to change vram to low on comfy = (
trying to use sdxl with shit computer
trying to use dreamshaper xl turbo, but yeah, its a huge struggle
Nobody has tried SD3 in preview yet, right?
Nobody has access
Who knows when it will happen š¦
WE NEED SD3
š
I just finished my 1-step-llama and already beg for sd3, I'm stupid
I am on the waiting list
I think i probably gave wrong discord name anywya but whatever
nah nobody got access its okay
š„ŗ
Also we need a plugin that recalculates the ligh but not chnage the shape or textures. so we cna mix and match different images.
Did you give id or username
Username
I gave username but it doesn't specify


gm
im trying to make music videos
sigh
so close yt so far
this tech is so amazing tho, soon we cna be like a one person production house
I think they should give us access and let us start tunin'
They can't. Emad needs to hype up investors for another week
We'll hype them up for him if he gives us access š
SD3
3rd time my ticket is closed with NO FOLLOW UP
Would you follow up this time?
I just opened the forth.
that's not true
did he mean that particular LLaMA model which named after GPT.
GPT 4 is pretty much closed source. All LLM that are open to the public is either Llama, Mistral, Alpaca, Vicuna or its finetunes
i assume
how many models have yall downloaded
š emad is making M"EMAD"
I need help with Lora creation, and a111 or some other webui
announcing stable diffusion 3 will be the last image generation model from stabiliyt, before the preview invites are even sent out.
awesome way to take all the wind out of the sails
I think the point was it'll be good enough to focus on a video model that can probably make still if you need it to
whatever the intenntions, the timing was bad
Yeah, it was a shortsighted pr stunt
i dont think it was a pr stunt. just a slip of information that devalues what sd3 is supposed to be
who are you addressing this to?
now the conversation will be "this is the last so we should just refine sd15 more"
The whole point of EMad's announcements is to generate hype to get venture capital flowing in.
It's not for us
i have 231 checkpoints
VCs wiill see that they're giving up on sd3 before its even out
it was a dumb thing to say
They didn't say they were giving up on
essentially
In the same post where they announced sd3 they said it'll have an upgraded version that they're already working on
"we'll never release anohter image model" says a lot
Eh, we don't really need another image model. 3D and video models are more important. I can already create anything I imagine with sdxl
It means either the thing is so damn good you wont need anything else ever again, or we are moving to txt2video or some other ai field
if it can't do handstands, its not complete
sdxl has plenty of limitations
it wont be "so good" and will have limits
Everything has limits but it'll be better to let sd3 marinate with years of plugins and custom nodes
SDXL on release vs 6 months later was a huge difference
eh. community models are actually less knowledgable imo
Constantly releasing new models just seperates the development base
sad day to see the end on its way
we'll all move on to more capable base models withotu stability in a year
where did emad say we'll never release another image model
It was a dumb thing to say, which makes me believe he's doing what every other AI company is doing, acting like they're on the verge of a new breakthrough to get the investors lining up
OpenAI did the same thing with Sora even though everyone knows it's not really financially plausible to market it to the public
do you have them on your ssd? or hdd?
SSD
about to buy another 4tb cuz i'm outta space now
dang baller
is there a way to put them on the HDD?
yeah, just slower
is it worth the tradeoff?
to me no
i'd rather throw down another 300-400 bucks and keep the SSD speed
it only takes a couple seconds to load a model on a nvme ssd
with a HDD you're talking a minute or more
so you don't get to take advantage of having a lot of models
i flip through them all the time
get a workflow or some parameters i like with a good prompt, then start cycling through models i know are good at certain concepts
yeah
i just started so im using auto1111
forge when on my phone
highly rec trying forge
for most people it's way way faster
it's basically the same thing as a1111 except at least twice as fast for me (and many others)
oh this is auto 1111 same but faster?
oh yea
would I just download this into the same folder as my stable diffusion?
auto1111*
i'd make a new one
you can point it at your old a1111 folder
that's what im still doing with forge and comfyui
Il having trouble installing stable diffusion on LM studio. Please help
afaik LM Studio is for language models and Stable Diffusion is a image generation model, so it can't run on LM Studio sadly
Hey there, that was my mistake this time, outerside wrote a lengthy reply a few days ago and since you didn't answer I closed it after some time thinking you read it. I'll see if we can recover the answer from him
Hi chat! Does anyone get early access to SD3 yet? I joined the wait list two weeks ago, but nothing happened since then.
i think only model devs have access right now
Well, hope they release it sooner.
Whatās you guys opinion about; when stable diffusion 3 comes out, will it be the perfect image generating model, will it be better than mid-journey or will it outperforms mid-journey?
it wont do handstands or have dance moves
how do you guys remember what lora does what loool
do you have a notepad that tells you what lora does what and how to activate it?
Is it true SD3 wont have horny shit
who knows, are you asking if it'll have nsfw content? maybe rephrase the question
I supose there will be a way to train your own checkpoint and then include nsfw content ?
There is no perfect image generating model and this one is no exception
Better than MJ in which way? In terms of customization? Yes. Image quality? Not so sure
The achilles heel remains as always, ease of use and setup as well as spec requirements
i would appriciate if someone could help me compare images in #šļ½general-with-images
having upscale doubts
Maybe we'll have to ease more into doing it on the cloud...
Alien concept for SD users, I know.
anybody got an idea how i can roll back my automatic 1111 version? this new version is horrible
git reset --hard {commit ID}
git pull
But i am not a professional in this!
So be careful and save all the stuff you could lost
what was the id of the version before this
like the feburatu version
a551a43164b8baf1b5652a9ee73081cca54c612b i think
Sorry, on the Version i can't help you. Go through all the commits and check what fits for you. I am a ComfyUi user.
alright, thanks for the help
ill be back after i save my models
when will sd3 come out?
before sd4
ok, today was the day of the week when I laughed the most sincerely
I just got banned from /r/StableDiffusion
"Hello, You have been permanently banned from participating in r/StableDiffusion because your comment violates this community's rules. You won't be able to post or comment, but you can still view and subscribe to it."
I always wanted to know how my grandparents survived the dictatorship, this subreddit showed me
Thank for you
Morning folks. Does anyone pls know any good tutorial on how to set the right parameters for Dreambooth SDXL on Diffusers "train_dreambooth_lora_sdxl.py"? My training is working but the results are not great š
hi guys , i newbie so i have a question . How can i generate videos by text and video embedding. Are there any models that support that?
I used the git log command to find the version I was looking for once. I just looked at the dates and estimated around the time I installed the version I liked.
6e6cc2922d39fff4029d47c316c22a1c152680ce
using
git reset --hard {commit ID}
git pull right?
if you git pull you'll update again to the latest version :p
damn so its just git reset to roll back?
I think I just used git checkout 6e6cc2922d39fff4029d47c316c22a1c152680ce
so i paste git checkout 6e6cc2922d39fff4029d47c316c22a1c152680ce ?
I'm pretty sure thats what I did...I read a bunch of tutorials and just hoped for the best lol
whatever I did worked š
good luck š
bad news
it didnt change
it still is on the new buggy and slow version
I'm looking at my notes from when I did it
git reset --hard {commit ID}
should work, (it will delete any modifications you have done to code)
yeah I had one more step after that
git pull
git reset --hard <commit-id>
oh no
my performance addons
so
git checkout 6e6cc2922d39fff4029d47c316c22a1c152680ce
no.... it wouldn't make sense to reset to a certain commit and then update
then git reset --hard <git checkout 6e6cc2922d39fff4029d47c316c22a1c152680ce>
those are not in the repositories. they should be fine.
oops
alright gonna try it
it's "only" gonna revert modifications of files listed in the original repository
git reset --hard <6e6cc2922d39fff4029d47c316c22a1c152680ce>
any added files (models, extensions, outputs, config files) should stay there
no <>
now it doesnt open
oh wait nvm mb lmao
new problem
raise RuntimeError(
RuntimeError: Torch is not able to use GPU; add --skip-torch-cuda-test to COMMANDLINE_ARGS variable to disable this check
delete your venv
shouldnt i add --skip-torch-cuda-test ?
alright
You might have new dependencies now. So, you are basically installing it new with the old commit
Why is it horrible?
buggy slow and clinky
just pull an older version. That should be fully straightforward when using git
Truly a Garbage advice
Thanks
Forgot to say you could create an extra environment for that and install the older requirements.txt š
If u need another Garbage advice
thats what im trying to do lol
yeah its still opening up the new version
now i cant generate images
Temporal Coherence. SD will kick SORAs ass. change my mind
i guess sd3 early access hasnt rolled out yet?
Hey, anyone knows what happened to stableboost.ai? The site seems to be down at least since yesterday
I need help/a tutorial on how to train/finetune stable diffusion models.
I have SD WebUI installed an running.
My goal is to create a Minecraft texturepack that replicates the style of an existing texturepack.
So from what I gather I would need to create an embedding?
The tutorials that I have found are either not detailed enough, or apparently for an older version since some features apparently no longer exist (image pre-processing for example).
u need to create a lora and i can make that for u
dm me ill tell u everything
Hello, I wanted to know if some people would recognize a model used in some AI pics but I donāt know where I can post it ?
emad is edging us
What will come first, GPT-5 or SD3?
SD3
i'll buy an a6000 before i do that
Blue sky
GPT-5 is going to get pushed as ultrapremium and will ultimately be so expensive that only certain folks in society will have regular access to it, just like how only some folks are willing to pay for GPT-4, etc.
Affordable access to AI will eventually paint society into different clusters based on access to intellectual capability, etc. hopefully folks will take it upon themselves to hone their own skills in light of this long-term development
Reality diffuses just as we diffuse our own thoughtsā¦and OpenAIās models all diffuse into an economic reality
rly? seems a bit much, u think people with the most money will get the most premium ai and this, will increase their wealth that way?
Well, paradoxically I donāt think their wealth is going to increase, itās just going to stay the same
And fundamentally, intellectual expression itself will become aligned with how deep your pockets go
The concern is that intellectual expression, when from a human, begins to lose its value as it becomes less constructive / generative and more conversational / aesthetic
And that comes from an awareness of the tax these systems place on society and the world at largeāthe carbon footprints, etc.
š
I don't know what to believe anymore
will there even be an early test anymore
they are worried about the model being "not ready"
slight overreaction
I just hope they at least release it at some point
Oh they will
it's 100% guaranteed to be released
its just that the testing phase is what we are excited about
there's this waitlist that we got no (solid) news about
besides Emad promising us stuff
hi people, i have all kinds of graphic elements and linedrawings in multiple styles in a presentation.
if I wanted to re-generate them all to be in one style with one consistent prompt, what tool would I use for that?
I just did š
This is not true. Open Source is nearly as good as commercial solutions. There was a leaked message paper from google in which they complain how hard it is to provide something above open source level of quality: https://www.theguardian.com/technology/2023/may/05/google-engineer-open-source-technology-ai-openai-chatgpt [Google engineer warns it could lose out to open-source technology in AI race - Commonly available software poses threat to tech company and OpenAIās ChatGPT, leaked document says]
How does a platform like Sora factor into that?
Head Start. That's it. Things are popping up in GenAI fast. Open Source catches up at some point - and lead in many other fronts.
Where does legislation factor into that phenomenon?
ā¦when precedent isā¦set?
If so, what precedent?
very basic, also the sword is quite bad... at this point i can surely say that SD3 is nowhere near Dall-E 3 level of characters interaction.
The model has still not finished training, but I get your point.
I'm also afraid of stuff like character interactions
emotions, facial expressions
I want to know how good it is at those
those are bad, it can't do things like different eye color and blinking, etc, Dall-E 3 can do it all.
(or how good it will be once its fully trained)
Do you know how much is left in SD3 training?
the problem is that Lykon trains it like SDXL, but it needs completely different kind of training, thus is why SD3 is bad, sorry but it's hard truth.
i stopped waiting for it, i know it's bad, they can't fool experts with those portrait images, Lol
my guess is that SD3 is around 3/4 of training š¤·āāļø
no idea on exact numbers
yeah, it need like months of training to be at some decent level, right now you may just stop waiting guys, it's not ready))
Hello friends. Since I started trying to use the XL models, I haven't been able to achieve good results. Usually, when the image finishes generating, it has poor quality. I believe I might be doing something wrong when starting to use the XL models and would appreciate help. How exactly should I use the XL models? Do I need to first load the SDXL Turbo and then use the model I downloaded from Civitai as a refiner? Should I merge the base model with the one I downloaded? Or is it some configuration that I may not have set correctly?
Are you using the base sdxl or sdxl turbo? Turbos intent was to generate images super fast
If you want actual quality images, use the base model or a fine-tuned version off civitai
But using the RealVisXL as an example, what exactly is the correct way to use these models? Do I first load the base SDXL and then use RealVisXL as a refiner?
This is the part that confuses me the most
I'm not familiar with that model but I'm fairly certain you'd want to use it as your actual model. The civitai post of the model should say whether or not you need to use a refiner or not
i'm new on this, so i'm trying to learn :/
When SAI first introduced sdxl, they stated that it's fine to not use a refiner further down the line. So if a model is fine as is, you wouldn't use a refiner. If you do need one then check what the model creator suggests for one
hmm okay, thanks
the devs are only demonstrating precedent imo
What is the compromise with these turbo models?
vanilla models are never as interesting as the human ecosystem that builds up around them
They claim to produce high detail images in low amount of steps
yeah but I only care about fixing issues present with current models, which sd3 doesnt seem to address
what are you noticing? I havenāt really looked at any of the images, still waiting to make my own once it drops.
to look at an image is to look at a mosaic of qualia*
ahā¦.fidelity
gotcha. that likely boils down to text encoder precision. if the model isnāt able to generate a sword with the context that a sword is a single blade, then it doesnāt have enough context of swords being a single blade.
now, a control net could be thought of as a mental heuristicāāokay, i need to make sure X does Yā
so thatās where those would come in, i suppose.
iām being super reductionist
but i find that metaphors work well to explain large concepts like these
obviously I could just say something like
the model should be able to recognise the points its wrong and iteratively refine them
for sure
but idk if that kind of architecture exists
even models like gemini / claude would struggle with finding things wrong
i think control nets attempt to offer that ability, as do preprocessing stages in some architectures
and in general weāre seeing a lot of whitepapers advance the science of building in more contextual awareness, so thatās an inevitability regardless.
I mean SD3's better prompt adherence might mean nothing to us if we can't run T5 efficiently
the paper states that it mostly affects Text coherence and prompt adherence slightly, but I am not so sure anymore
Still limited to 77 tokens :/
ugh.
5 times slower than SDXL for SD3 8B on a 4090 with the same 50 steps š
SDXL base*
My GPU is ready
wait wait wait
sd3 is 5 times slower than xl??????
guess im not gonna be using that lol
is there anything that resembles a general concensus about mixing and matching loras, i mean like if you had one trained on sdxl 1 and in the same prompt a different one that was based on 1.5. ive always tried to match this for that, out of some ocd level need for uniformity who knows. same with control nets, like maybe you use XL Canny .f16 safetensors but you decide to use a 1.5 based OPENPOSE controlnet. im sure there a lot of reasons people would want to do this, mine is coming from being as economical as i can in terms of resources and the fact that have been working with deforum for about a year on 12gb VRAM rtx 3060 ran locally. i almost always have use-cases for multiple controlnets but trying to use an XL model and even 1 XL controlnet is already pushing me to the edge.... appreciate any feedback, thoughts on this. happy sunday everyone, hope errbody havin a good day.
yikes that a scary thought
little bit less excited now than i was 2 min ago about 3.0 lol
it translates to more carbon emissions. š¦
more regulation, selectivity.
itās a sign of the times
Are you serious Marko
I have around 7 seconds for 50 steps at 1024x1024 in DPM++2M in SDXL in a 4090
And stability reported that SD3 8B takes 34 seconds at 1024x1024 with 50 steps for SD3 8B on a 4090
So around 4~5 times slower
they already working on SD3 turbo
Just my 2 cents, but I think SDXL Lightining is much better than SDXL turbo
At the same number of steps
yep, lighting is the best sdxl
SDXL turbo looks pretty soft, but Lightning looks pretty sharp still
sd3 is a huge step forward, but alas we will never reach Dall-E 3 level
they will finetune the 2B model to look as good as the 8B model if not better
you didnt specify what "not great" means, but this is probably more indicative of something with your data set, captioning, etc vs a training paramater if training is not failing
quality > quantity
True
@marble lintel if you look at the diffusers dreambooth script, it is not very good
if 2B is still better than SDXL in prompt adherence 
and have it generate in higher quality overnight
it depends on how much better the quality is
what is 2b
why not?
sincere question, I've never used dalle
sd3 8b is on par with dall3
nope
The diffusers dreambooth script uses the same caption for all images, the AUTO1111 dreambooth allows you to use a different caption for each image
i wish i could post some sd3 gens (not mine, but not public)
i saw it all, sd3 can't do character interaction at all
Bro if i am getting under 2 it/s on SDXL
oh lykon's images?
yeah
That would be so low on SD3
What GPU?
4070 ti
yeah that doesn't sound right, 2it/s seems a bit low for 4070 Ti
character interaction? oh, pr0n
Maybe you are using fp32 for SDXL and not fp16?
oh they want that
š
But yeah, the AUTO1111 dreambooth script is better than the diffusers dreambooth script
2 people playing chess, that's the goal
prob pretty close to right, a 12gb card hitting like 2it/s, same ram but slower card and im pulling 1.4it's
sd3 can do that
no, for example try in Dall-E something simple like: female elf with the sword attacked by the huge creepy muscle orc, epic battle, castle, rain
I'd have to try ,but I'd be surprised if XL couldnt already do that
it probably can
hmm, with 4090 SDXL 1024x1024 I get around 7it/s
So around 7~8 seconds for 50 steps
the sdxl base? sorry
I use fp16 + pytorch .compile (no xformers)
I'll take that hint!
I noticed xformers does not always give me the same image across different seeds
well, Lykon is welcome to prove it xD
i dont want to say lykon doesnt know how to prompt, but lykon doesnt know how to prompt correctly
yeah, xformers speeds it up slightly, but even with same seed 2 runs look slightly different
slightly, like a few details are different
anbody tried the SUPIR upscaler yet? im just bout to dig into SEC_courses vid on that and check it out
I don't know if he's prompting wrong, but I wish we could have at least 2 or 3 people posting so that we can have a better idea
i would post a link to another server but i think im not allowed to do that
(comfy and some other sd devs active in there)
You mean he prompts in the old way?