#💬|general-chat
1 messages · Page 106 of 1
14 of what? iterations/s or what
At 1024x1024 res
So either they are lying or I am missing out on a lot of performance
No no at 512 512
gotcha, so about 6-7 it/s
No, I mean for me 50 steps of SDXL base + DPM++2M + 1024x1024 is 7~8 seconds on a 4090
ah
that was supposed to be the discord server, but as jayjay is saying, today might also be a day for that
Ooof that's really really bad
i thought that was normal until i read about other ppl's perf
Doesn't sound right
u run command no half or some shit in your bat file?
a1111
ohh
just checked, dpmpp 2m on forge i get about 6.5-7 it/s
What gpu?
4090 gigabyte oc
not really if you do just images, imagine hurt while making 9k frames of animation
but still you dont need t5 for every generation
I am curious why they used T5, I'd probably have used a multi-lingual model like BGE-M3. Although the M3 just came out last month
most likely because they work on sd3 longer than a month, trained this shit for long time
Yeah fair
isnt it a lot smaller?
Embedding models are typically smaller than auto-regressive text models
https://towardsdatascience.com/openai-vs-open-source-multilingual-embedding-models-e5ccb7c90f05
Third party benchmarks show that BGE-M3 is better performing than even closed source OpenAI embedding models
i have a few questions regarding that
are they defining the generation time in the way people often do with sd15 or sdxl... 20 steps with the fastest sampler?
that's q #1
q #2 is something i'll have to answer myself... is the improvement in image quality and prompt adherence sufficient to exceed the value of a batch size of 4?
1024x1024 50 steps, no sampler mentioned
Stability claims 34 seconds for 50 steps on 4090 for SD3 8B
my guess is it'll save time because of that
oh shit
I slept on a driver update
Could that really be it though
i doubt it
You're using windows?
i am
Wow
i dont think thats it, i have the one from 22 feb
I'm genuinely surprised. Not being sarcastic or anything
yeah some of us like to play games so linux is not option
... I dual booth Windows + Linux
i'm on windows myself but that's because i have so much work stuff i need throughout the day
not everyone has a degree in cs just to use linux ;(
I tried installing linux once and it bricked my entire pc had to factory reset
it's pretty easy these days tbh
wtf
i'm waiting for the next amd chip to come out
GL running most of games on linux, LTT tried that not that long ago and experience suck harder than sasha grey
then i'm building another system, moving my 4090 over there, putting the 3080 12gb back in this one (it's just sitting on the shelf collecting dust atm)
putting linux on the 4090 and leaving windows on this one
u think 5000 series will be any good?
i haven't run SD on linux but i've heard it's great
what does using --xformers do?
5090 rumoured to be 70% faster than 4090
or does nvidia not really care about consumer line anymore
i dont see any difference with it
i've heard all kinds of conflicting things about the 5090
it uses xformers
that it'll be 24gb, 32gb, or 48gb
48gb i doubt
Optimized CUDA kernels for self attention
that is a huge leap
that it'll be an incremental increase in speed, and that it'll be almost twice as fast
or even more
if they give only 24 gb on 5090 then fuck nvidia
one rumor was 50k cuda cores (4090 has 16k or so, 4080 10k or so, etc.)
its new samsung memory on 5090
yeah
if the ram bumps to 32gb or higher i'm going to get one immediately
it won't be 50k cuda cores nvidia wont shoot themselves in leg, if 5090 has anything like 25-30k cuda cores its upgrade worth getting still
or 5080 if 90 costs like $3000
90 will be 2-2.5k usd for sure
wont be that expensive
cause in my country you pay way above msrp due to the insanely high vat
Can someone assist me here my images keep coming out really pixelated for some reason after installing SDXL
it will be 2k at reelease i bet esily considering how prices are hiking
show the image and ur settings
change the vae
that sucks...
they wont give you new type of memory and boost cuda cores 50% for same price mark my words
like we are talking 55-60% vat
wow.
the 50k would be more than tripling if true
4080 2500
where u live north korea?
check your dms I sent SS @fervent thunder
serbia
yeah that's crazy
k
it wont be true they wont give u so many cuda cores they still want to sell proffesional grade gpus
yeah they need real competition
thats why they always limit vram and cuda cores
they would if they had it
but they don't right now so your perspective is the one i share
i wouldn't be at all surprised if they drop another fn 24gb on us, 48gb is a total no
no, you wont get 50k cuda cores gpu for 2k usd in 5000 series even if amd or intel are more competetive
best case scenario is 32gb with maybe 20k cuda cores and faster vram imo
it will have more than plain 20k, 2090 had 4352 CUDA cores, 3090 had 10496, 4090 had 16,384
they promised the biggest generational leap yet
implying bigger than 4090 vs 3090
i would expect something like 22-27k
yes there is rumour that it will be 70% faster than 4000 series card
that is what the trend has been
roughly 33% price increase
for a massive perf increase
also GDDR7 memory, that will be costing them for cutting edge new type of memory
Tbh i might just go 5080 if it follows the same increase from 4080
Its more than enough for me
for 165fps @3440x1440p
thats why i got 4080super month ago instead 90 for now, but lets see how it goes if i generate through ai enough income i might jump onto 5090, got atx 3.0 1500w psu so i am ready
well for gaming at 2k my 3080 was enough
damn 1500w
im not upograding cards for gaming anymore just for ai
shits gonna cost $4000
bro have u tried a qd oled monitor
if ur investing into top tier hardware
u gotta try the top tier monitors
im never going back to lcd
im on samsung odyssey g7 27 inch 2k 240 hz
alienware have top tier monitors now for panoramic view
i think there is a 240hz model as well but i could be wrong
there is also 21:9 one
that i have
but 165hz
the screen looks amazing though
it is an even bigger wow factor than going from 60 to 240hz
i have a 360hz qd oled
its alright
wouldnt recommend it
i think leap from 240 to 360 is less noticable than from 144 to 240
also i am getting older im not playing counter strike like before so dont need that kind of refreshrate, ill mark 240 as my standard to live on
360 wont make anyone a better player than 240
i think ill swap to 4k monitor when cards can esily hold 144 ultra settings in most aaa games, for now on im sticking to 2k for few more years
the other features about these 360hz monitors are more appealing
like the black frame insertion
or qd oled tech itself
there is a 4k 240hz qd oled already i believe
or one coming soon
well top tier players will feel difference LTT took shroud to test it
but how many players are like shroud, 1 in 50 million?
i tested it myself
it is very subtle though
its nothing like 144 vs 240
240 makes 144 feel slow as fuck
but 240 still feels ok compared to 360, 360 is jsut a lot smoother
man, i have a neo g9 (max refresh 240) and i cant' really tell
but i also don't play competitive shooters
but if ur buying one of those zowie 360hz monitors
the bigger difference will be the BFI tech
once it hits like 40-50 fps i'm usually happy enough
well i feelt ddifference 144 vs 240 in counter strike but it was at times i did play shit ton of it at not bad level with doing a lot of flicks (quickscope kills with awp)
I mean u dont need to play competitive games to tell the difference
But for singleplayer games 130-140fps is the highest i'd take
before i start upping settings
depends on game i can play RPG with 60 fps, but racing games etc i need at least 100+ to feel game properly
like cyberpunk, alan wake 2 ill happily play at 60 fps with maxed everything
as long as fps dont suck like on switch or consoles overall lol
zelda with drops to 18 fps lol
i mean it is switch
ps4/xbox sucked a lot too
visually 30 fps is fine but when you are the one playing it
the input lag is really horrible
ps5 is bit better finally got rid of HDD and got frame generation and upscaling but still sucks big time to be on that hardware
80+ fps is where it starts to feel normal for me
that's 33 ms between frames vs 25... 50 fps = 20 ms between frames
like i dont have input lag
that's an important line to me cuz of having done a bunch of audio work in the past
ever since i started with the vive vr headset, i get motion sickness when my frames don't match my screens refresh rate. i hate it so much. 100 fov (depending on the engine) helps me a lot. I played games for 30 years without motion sickness and ever since VR started triggering it for me, regular gaming does too! hates it so much
gotta eat ginger and friggin wear c-bands to get a good fps session in
i dont have a headset but i turned on gsync a while ago
didnt feel much of a difference
if you put a sound on a delay line and the echo is around that 20ms or less point, it sounsd like chorus, not an echo
until one day i turned it off
holy shit was it disgusting
immediately turned it back on
my brain at least seems to do something similar with images
fuck vr im waiting for proper ar glasses with small form factor so i can have virtual monitors floating around my room while still using my main monitor for low imput lag etc, something like apple vision pro but refined
lol once you're immersed in that beautiful smooth refresh, going back is rough
especially on oled with the low response
vr is immersive but i had cold sweat after 1 round in fake counter strike when i used stick to mvoe around, i had to play games stationary or teleporting
i mean sure, but thats more AR.
VR is cool in a few cases. I sure havnen't bought a newer headset since the vive though
yeah ar is for me more usable than vr with it's motion sickness
Flight sims , elite dangerous, oh man. so good. Being able to look around in the cockpit to pin your target, whiel using other controls to fly that way, so fun
racing games in vr make me puke almsot instantly. since eveyrthing moves but you don't feel it
i press gas and go 5m and i gotta decouple and run to the toilet
i had to adjust
i can't
i dont know what it is about racing games. flight sims are fine. same idea aren't they?
must be the peripheral detail
in flight sim u dont have many obcjects close to you that you are passing
you just surf mostly in sky or look at city from 4000 meters
they would have to pay me to drive in vr
i do love the seperation of where you're looking and where you're aiming. i figure with the vive, part of it is screen resolution
what i've always wanted is a VR racing booth that can actually convince me i'm driving my usual route to work or the grocery store
i've done these routes so many times i've got the gear changes timed up perfectly on every turn, and have most of the traffic lights memorized...
i'd love to be able to do shit like head for a traffic light at 70mph without slowing cuz i'm confident i timed the change to green perfectly
knowing that if i didn't... just reload unlike life
whats the point of playing game to do same stuff u do irl
^^"actions without consequences day"
i would do cannon ball race if i had stamina to handle vr
The irresistible urge to dash into the street, toggle sandevistan, and shoot every driver in the head
and then reload the save like nothing happened
yup lol
yep
so terapeutic
but really, i'd love to see just how fast i could actually get to work
and they say games create killers
if i'm not afraid of a reckless driving arrest
if not games i would most likely be serial killer by now
cyberpunk is my gta at this point
i just like the first person view and graphics
timing up blasting through red lights based on a quick glance of cross traffic in both directions
shortcuts through parks, sidewalks
i need to check how it goes year ago there were still plenty of bugs i could not handle i pushed game for story was so good
cyberpunk is in a really good state rn
plus, phantom liberty is as long as the base game story
if not longer
they revamped the whole class trees
last time i fired it up i felt instantly bored again
its a completely different game from before 2.0 update
but does npc's still teleport and act like they have palpitation attack or finally they are not? in some moments it was like horro game, npcs teleporting in your face lol
I have not seen that happen
They just take long to notice me
But I do have a stealth build
with like 70% slower detection rate
do the cops still appear and disappear like ghosts?
i just go and blast everything or hack them through cameras
i will give it a go but they can forget me buying 24 gbp dlc for shitshow they did. waiting for -50% for sure
the only thing they suck at is finding you if you hide
i mean phantom lib is really good
it's longer than the main story
and can even act as an alternative to it
main story was not as long, comparing main story + side quests and making it as equally long would be nice
like elden ring dlc
if u compare it to all side quests + main
its not comparable
there are too many side quests
c'mon there can never be too many side quewsts 😄
btw Outer Worlds was nice rpg i played lately, for sure waiting for second part
yeah i used to like that stuff more espec during the lockdowns a few years ago
now i just don't have time
sorry if everyone is ask this, but any word when we can test sd3? i assume it is here with some bots or will be
nope
no official date
time for patience something we ai people dont have haha
we've already been waiting what, 2 weeks? that's at least 6 months in AI years
exactly
2 days ago Emad said tommorrow or the next day. That seems official. He's kind of #1 in charge dude
hello
Emad:
Discord access to do final tunes of model, API access to stress test, model weights drop. May put out inference code shortly too, tidying.
Working on a plan to have the various controlnets etc done
on the subreddit 2 hours ago
Can I make a sequential generation in a111 with different promts? 2 promts at a time
link to the subreddit please
he keeps giving random dates and not meeting them, he should stop doing that XD
it's been 84 years
You can easily use it as a bash script, and rename the file after you watermark it.
Then in your bash script, check if the file is renamed, and add watermark and rename if not
hi
Hello 🙂
no you're a dicordo!
Can anyone teach me to how to create an instagram post on discord ?
i dont think that's a thing
ones owned by facebook, ones owned by discord. entirely different companies
you would create those on instagram
just create ai image with square aspect ratio
yes, because it has more parameters
i see
sdxl is supposed to be 1024x resolutions. hard to get quality if you're not using the base attention it was trained for
yea
sdxl fills more memory too. You might just be blowing out your vram and thats why its "Waaay" faster. If you're not using system memory, it should be pretty comparable if not a little slower
how much vram do you have
12gb
so a 30 series card?
4070 ti
shoudlnt' really be "waaaay" faster for sd15.
oh nevermind
I just realized I kept the same upscale multiplier
but changed res from 512x512 to 1024x1024
far from same res lol
page not found
what happened to the same res tests being faster?
\
I did 512x512 upscaled 2x
and then did 1024x1024x upscaled 2x
forgot to turn off the upscale
I feel like this is how most help desk troubleshooting goes. Never believe the user. Users always lie
or they don;t know they lie lol
not lying just overlooked the setting
forgive them father for they know not what they do
different things flow
it's just the number of parameters, the sdxl 1.0 has almost 10b parameters https://stability.ai/news/stable-diffusion-sdxl-1-announcement
thats what he meant
while the stable diffusion 1.5 has only ~1B parameters I think, so it's ~10x smaller
600m
boasting a 3.5B parameter base model and a 6.6B parameter model
Can anyone go to the following page on Instagram: https://www.instagram.com/chatgptricks/ and see the page. They have 1.1 million followers. Can anyone tell me what are the secrets to their success ?
follow bots considering amount of engagement
like 98% of big acc's
20k likes with 1.1m followers? yikes
half of likes most likely bought as well
what are follow bots ? and how can you buy likes ?[
yes, this sums up to ~10B
there is plenty of services that offer that, most shit in web is fake and have pumped numbers, viewbots on twitch, follow bots on instagram, fake garbage everywhere
no wonder most of traffic in web is bots
you guys use the refiner?
bots
do you know where i can acquire the service of follow bots ?
these are messages for the off topic channel
its against terms of service. Tha'ts all dark web stuff. I won't help people be shady
google does not hurt but i dont recommend unless u aiming to resell your page to someone clueless
please move your instagram talk to off topic
Not a lot of other topics going on in general-chat this morning. I dont think its a big harm
+1
please don't minimod
lol
`Its the evening here. And i like the chat. bots bots bot everywhere
speciallyt when you jump out of blue without engaging with us for last few hours
we won;t, google will
never said i would. keep up
take your minimodding elsewhere please. you're not actually a mod
google, Ai. the local library (maybe) would all have info
actually that's why I came here too, the original paper mentions the base + refiner, so you need to do txt2img and then img2img, but many finetunes seem to completely ignore the refiner, so....do most people just use the base and not the refiner?
Even the https://huggingface.co/ByteDance/SDXL-Lightning just does not mention the refiner at all. Same goes for https://civitai.com/models/112902?modelVersionId=126688 do they just ignore the refiner?
peeps with Nvida 4090's what are your its per sec onn SD generation locally?
Yeah almost nobody uses the refiner
I used it for like the first 1-2 weeks
then just turned to finetuned models
it just takes up way too much ram too
why use chatgpt to make prompts? what benefit would that have over "one button prompts"
well i feed small example like 4 prompts (giving full prompt shape with all stuff) then ask for specific animation with prompts every x frames
well try to make 300 line prompt to keep cohherent animation, im too lazy
im not saying to use it for single image
i find it blows out community refined models. the details it "refines" are no more or less different from what the base model creates. If you give it more than 5 steps it makes the images haggard
I think the refiner architecture experiment failed. SDXL is just one model. The refiner was just a sunk cost thing they wanted to make work
haggard?
old military slang. The boys in the fox holes look haggard
we wern't haggard, we were bedraggled 😛
oh, I see, thanks.
Hard to say why they did not mention it in the paper, but I guess they used refiner to just queeze better images for the paper? and the improvement of using refiner does not outperform the community finetunes, do I understand it right?
or to use a better turn of phrase minging
you do your metaphors and i'll do mine.
i dont know why they released it. I know it was the first verison of sdxl before they changed to a double clip architectuer, and then they kept it around anyways.
I think base model even works better without it if you use loras
double clip architecture? I must have completely missed that one
sdxl has two clip layers. teh one from sd15 and the one from sd2
oh, right, I see, I completely forgot it's in the paper
if i understand things right, they're basing a lot of their choices on human preference ratings. But that's honeslty just a popularity contest not objective research. What the refiner provides, a lora easily could too. Much lighter weight
yes, that's true.
Now, when I'm looking at the chart in the paper (oh, I can't paste images here), they have 36.93% wins with just the base, 48.44% winds with refiner, but only 7.91% wins with SD 1.5, so when I look at the size of the model, the SDXL base is 4.6x better in this popularity context while being ~3x larger, but with the refiner, the total is again 3x larger than the SDXL base, but the relative improvement in that popularity contest is just 30% more
so even when looking at their numbers, it's clear that if you search for tradeoff between number of parameters and quality, ignoring refiner is rational thing to do, because you just get too small quality improvement for large increase in compute complexity
theres a few of these human preference charts in sd3 paper too. i get it. it's really hard to evaluate models. It just seems to me that this is a really loose measure to base all evaluations on. It seems like fuzzy data to me
those seems like cherrypicked numbers anyways
I don't think it's cherrypicked, but I agree the evaluation is really fuzzy
so will the discord bot will turn on again with SD 3 ?
so just now I am reading the SDXL turbo paper, and....they also compare it to the SDXL base, without refiner.
who knows)) all they do is giving us empty promises on reddit and showing the most boring portrait pictures on shitter, nothing really happening... no one got access to SD3.
there is idea, rent out 1000s of gpus and make your own model
since you cant wait few weeks for free one

it's just for the easy access that discord bot gives us
i was responding to meka not you
really good results without having to mess with the settings... will we ever get that back ?
it's like saying i dont want to let machine know what i want it should read in my mind, settings are for fine tunning, whats the issue?
that for dumb people like me messing with the settings will results in artefacts and not good quality images
i'm not saying it isn't good
i'm saying that is isn't as magical as discord simplicity, you say something, and you have it
it was
and thats a good thing
until they removed it
midjourney got only the newbie part
so we are limited with it
"few weeks" soon turning into a month 😄
before there wasn't any limitations
i don't want SD3 already... keep to it))
like the dumb people could create beautiful things on discord
Is it worth it going for high steps?
nothing like crying about free stuff needing more time to make it work like it should, you would also be one of those guys that rather have rushed game release and play with shit ton of bugs, right?
and the smart people could create even more beautiful things
What is the difference really when you go to like 50 60
so why did they stop and told us nothin
it dependso n complexity of art style you are trying to replicate i believe, anime and smooth stuff needs less steps than photorealism i think
stop what? i always had to fine tune SD vs having to just spit out simple prompt in MJ
So it does result in more detail?
yes
Is there such a thing as too many steps
That break the image
Like over 100?
200?
Nvm it goes up to 150
i suppose both ends of spectrum could be damaging
Im gonna play around with 50
the discord bot
Some Loras seem worse with high steps
the discord bot permitted to do the same thing as MJ... ending in better results and for free
recommend them option to donate to upkeep free hardware they offering for people that cant afford their hardware and then spam images there
Which bot?
if every person like u donated fiver a month maybe they keep it up 24/7
yep
Even in prompt simplicity?
my local install can go up to 999 I just foundf out by asking for 2000 steps
Like writing 4 words and getting something really cool?
I thought midjourney itself was a discord bot
yeah
Then there must be a setting to increase the limit
What was it called
DreamBot
And now its disabled..?
yes
Theyre making it paid or..?
didn;t told about what?
"it's in maintenance" "we close it forever"
Mine is set in the Ksampler
the closing of Dreambot
im ComfyUI
it was a SD bot
They probably saw how good it was
And decided to make it paid somewhere else
If it was made by a third party
well whats the difference? don't expect free hardware to last forever, they will bring it back after sd 3 release most likely to gather more information about usage but what is the issue for u to run it locally or in google collab?
simplicity
SD bot only difference I'm guessing they mean it was all set up
the bot was finetuned
Vs doing it urself now to be local
No
and just 4 words you could have something really good
Difference is
following this logic sd3 will be paid too, yikes, maybe they decided is not worth sharing resources with people spamming mostly dumb shit?
In midjourney, if it was like this, you could type literally anything without much thought or finetuning
And get awesome looking images
Otherwise the image u saw in SD bot could be reproduce locally iirc, yea idk MJ
i mean it wasn't awesome but for simplicity there wasn't any major artefacts
my profile picture was made with this bot
in just one fricking line of prompt
you can do the same with sd anywhere its not that bot was magical
All sd bot did was do the work for u
yes
U can indeed do it urself
I don't wanna pass 3 hours finetuning my bot just to remove an incorrect shadow on my drawing
like the point with this bot was simplicity
if they closed it because of the money, ok i totally understand that
but then tell us something else than "the bot is down until we are ready to return"
so wait is Matendo having a problem because previous versions of SD are unavailable or something?
no
because he cant use free sd bot on discord
get SD on your pc and run it that way
i can't find this "magic tuning" anywhere
and i don't wanna pass 3 hours finetuning my models to just have a banal drawing
the staff won't say anything about the future of the bot
is it dead forever ?
we don't know
it takes so much TIME for BAD RESULTS
its not like they put magical lora into bot
quite
i tested prompts that didn't make any sense
like "wearing glasses" and in the negative prompt "glasses"
then you either badly constructed prompts or used wrong models/loras for what you wanted to achieve
the bot didn't gave me glitched glasses like the local version did
im also having prompt issues
that's the point
yikes, no shit it was wrong resoult if u ask it to make glasses and say not to draw glasses
what is your logic
you're there complaining "i have prompt issus"
it was a test
to establish the limits of the bot
Local version gave me glitched glasses, the bot gave me one image with glasses or no glasses at all
there was no "prompt issues" with the bot
glitched glasses is more realistic resoult since you asked for two in same time in my opinion
i dont understand somebody can use for free sd3 now?
bot just didnt do what u asked for
no
yes
im having trouble making anything manly haha
because the best version is the one that didn't have any artefacts
you said in prompt wearing glasses and said in negative prompt glasses
what else it should produce?
xDDDD
thats..almost scary
its work now for all?
now ask him how many times he tried to produce this image before getting this quality
no
it should ignore the mind of the creator and do one image or another
never giving me something glitchy
only for money?
never giving me artefacts
or you shoulkd learn how to use prompts instead of confusing program intentionally
omg it can do single person portrait!!!
Lmao
yeah its magical
😄
they had a technology to make even the dumbest persons use this AI
I guess its just to show that its a good base model when it comes to aesthetics
they didn't sell it
we want more prompt adherence posts though :(
it's nowhere now even on clipdrop
what have they done with this magical technology
even with really bad and dumb prompts you would have something that's not glitchy
Are there some extensions you "must have"?
dumbest people will keep generating waifus and memes only anyway, who cares about them, if you confusing prompts intentionally don\t be surprised sd don't know what you want
bot did
then in logicall light bot didnt read your propmpts correctly and was confused and spit out 2 images instead of one
that reads prompt and negative prompt
yeah but the image was still coherent
rope,facefusion control net
but was not what u asked for
what's rope?
like two twins one with glasses, the other not
for that, something like what foocus provides by default is what midjourney does. A gpt2 engine that extends prompts consistently so same seed makes the same results.
there's also "CADS" extensions for a1111 ui that shakes up the attention so that it doesn't fall into cliche patterns
and both of them was not image you asked for still
Well midjourney is still an option matendo
change face and adapt it for picture
i'm looking towards it yeah
I should try that
on a1111 i use dynamic prompts for the same gpt2 prompt extending
i think i shall move back onto midjourney
Is that an extension
mj bot vs sdxl?
SDXL bot
dynamic prompts is an extension. helps a lot. wildcards and procedural prompting
I suppose but I usually just generate anime which doesn't really need exact faces?
I used to just manually ask the chat gpt bot lol
I'm searching for an extension that makes it easier viewing prompts
watch out on MJ. One user can crash the whole system
you can put some face in anime
very fragile
I suppose
I'll check it out later I think
I dont get it
MJ?
midjourny. you'll risk your whole organization getting bannt from the service forever if you use it to much and bring the whole server down
there's safety guards for that? and midjourney surely doesn't build a bridge with just one pillar lol
their bridge fell last week. and they blamed 2 stability accounts
"effective immeidately we're banning ALL stability related accounts"
maybe but unless their hardware exploded then no real damage in a sense?
you'd think someone like MJ would have safeguards from one or two accounts bringing down all their servers, but it turns out they don't. By their own admission
I'm just saying, you're paying for a service that is going to blame their users for it failing
Guess so but local is the way to go
Yep
I got a question in #🏞|general-with-images, if anyone's willing to answer with some tips
what your queastion need extension for anime?
I want a way to visualize prompts similar to how tagcompletion does it
but live
#📣|announcements dead channel 😦
when will SD3 be released?
I'm not sure if it's allowed to talk about NovelAi, but I used Stable Diffusion for a year, and when it comes to inpainting in anime, it always takes many attempts and edits to remove or add something. I tried NAI, and in a few attempts, if not the first, it does what I want without going through a thousand configurations. And I'm someone looking for the result to be 100% anime, like a screenshot, not artistic. I noticed that inpainting is a completely different world between these two.
Is there a place for the script kiddies playing with the API & stiching videos together?
today according to emad's reddit comments
next week probably though
i don't think stability holds weekend office hours and they're closed in UK already
hi guys i encountered problem how i fix it i press any key but its didnt happen anything, thank you for now
RuntimeError: Torch is not able to use GPU; add --skip-torch-cuda-test to COMMANDLINE_ARGS variable to disable this check Press any key to continue . . .
thanks!
i woudln't thank me yet
but if he said it, he should do it. hehe
Still looking for AI Horde related img2img help . Anyone familiar with it who might be able to help ?
It's emad. He says a lot
i'm feeling next wednesday. That'll probably be when bot access opens. just a strong feeling i got. probably wrong
could be longer
assuming ur talking about https://stablehorde.net/ this ai horde try asking in their community discord
thats an old domain. They renamed to https://aihorde.net/ because they do LLMs too
ohh ty
yea they should have a discord there
i can never get the images just right you know, always something wrong with it
I left their discord bcus didn't feel comfortable staying
Unfortunately idk if anyone would know sometimes ppl from horde look here but no guarantee
well i mean.... that community are going to be the users of horde. they'll be most familiar with helping. Here you won't find horde users most likely.
When you're up shit creek, try not to throw your paddle to the shore.
they are scared to let people try it on discord, and you asking to release something!? Forget about it!! Lmao
never
scared? i see you keep crying first about they shutting down bot, now that they need to take time to finetune and make sure new model is as good as it can be, get a grip
release it, i can finetune it myself
do your own model then
I think everyone seems to be overreacting to each other’s messages
The whole difference between them is described as follows: MJ is a street artist who draws for you for a little money. Fast, quite high quality, but it cannot be said that you can interfere with the process.
SD is not an artist, but a very smart set of tools. He won’t draw anything for you, but if he has the skill, effort and desire, he will draw exactly what you want.
an over reacting is bad mmkay
The human is the input. The idea is the seed.
cause when people over react, reactions go overboard mmkay
It’s true!
personally, i just think people need to stop blaming external factors for their own issues
Absolutely they do
and stop thinking that world is there to serve them
Most folks take all of this stuff for granted as it is.
why can't the govenrment make laws to force stability to release it today?
fucking bs
lol, they’re making plenty of laws. And most likely, none of them serve you all that well
It saddens me to think that they would enforce neutered models with handicapped creative flexibility on account of something like an aesthetic / style copyright. Because as more individuals weigh in on whether or not their work should be included, models will continue to transcend the individual such that the major powers will target style itself.
Ah, but that’s not the argument.
and thats more attitude we need to see
imo, datasets should be culled.
they aren't taking the weights after they're trained and lobotomizing them. The models are fully developed and un neutared
the dataset is the genes, not the nutsac
My view is, if there wasn't Stability AI releasing models (neutered they may be, we'd likely have no open models at all, and just DALL-E/Gemini/MidJourney
So I'm just grateful they are willing to open source what they have
datasets that just leave it all in there, are tainted. the kind of taint after a long 5 day hike where you've had no showers.
you HAVE to cull datasets
And if no NSFW by default is the price we have to pay for open models, then such is life
as long as anatomy isn't terrible or blood isn't culled as well I honestly don't care
I am grateful for what is, instead of what may be
People still think SDXL is incapable of nsfw, because they can't use prompts from 1.5 porno models on sdxl base
Just train it in there. Don't be a newb
ikr
This is a matter of defining what style represents. Let’s say you have 51% artists of a certain range of style chime in and say that they’d rather their work not be included, while the other 49% does. Well, that’s enough data to invoke the style regardless.
removing nfsw from image generator is still less harmful than biased and broken gpt's like gemini
if SD3 is only as censored as SDXL then we should be fine
The artists who were "removed" are all alive today. They're ALLLL derivative styles.
As is your comment
@alex just so you know, I did discuss this with Stability staff in person at NeurIPS in 2023 Dec
just prompt for an older artist. 51% of artists weren't removed
oh and what also makes me happier is that artists opted out and yet we still have good quality visuals
They suggested that the CLIP model was just as important for style as the image dataset for training the diffusion model
you just called my comment derivative? What does that even mean? Do you have a point?
most modern artist's are like SD themselves, getting inspired and create remix of whats been out there anyway
Which is why they went with CLIP + Open CLIP in stable diffusion
I heard it is censored and some content is prohibited there
So, with that in mind—does that resolve the inherent problem that these artists have? I fear it does not.
prohibited? you heard someone blowing smoke up your ass
So going off what I heard in my discussion
They seem to suggest, a style can be not in the image training data, but if it was in the training data for CLIP, it can still help the full model reproduce a style
why agression
I just wish fewer people would try to copy other artists and actually try to be unique artists themselves
it protects stability legally because they took reasonable measures to prevent their names being in the dataset.
its a figure of speech. They were putting you on. Trying to lightyou up
My 2 cents--- better to have a neutered model than no model at all
Stability could get knocked down much easier than OpenAI in a NYT-style lawsuit, so I’m just curious ☺️
Always going to be lazy copycats so matter what
Sure.
neutared is the wrong word. They're not taking the finished weights and editing them. That would be quite a feat
yeah
sadly u missing point that 99% of those artists are copacats and remixers themselves
.... I do generative model research as part of my job
No, but to suck the data from a training set is to make Swiss cheese.
It is literally my full time job
I mean I guess some arstyles must have become oversaturated over a long time?
idk what to say
In any case, I anticipate slightly less robust models going forward—but we’ll all be able to work around them with existing tools.
dunning kruger has demonstrated itself to me time and time again. many people who have a job, don't know what they're doing in that field still
calling models neutared tells me clearly that someone has peter principled their way into a position
oof
on no... they showed a knight running from a dragon and it looks just like a Cascade... can't properly handle sword in hands, etc, we're doomed!!!
You are very bite-y.
ikr, he's a fiesty one
sdxl can generate faces at a distance without distortion?
It's a starting point, but artists that stand out have created their own styles
But regardless
I'm kind of excited to see if v-prediction will improve the deep fried look stable diffusion images often have
look at the sword... https://twitter.com/Lykon4072/status/1766218735944118470/photo/2
and what stops other people manually recreating them? it's not like someone has monopoly for art
Me too!
And if zero-SNR will improve the ability for SD3 to generate very bright & very dark images
2.1 has vprediction
not an issue you can do inpaiting and revisions of this hand
I wonder if SD3 will have alpha support? Anyone know?
why 2 meter distance problem for ai)
@alex I doubt it will out of the box
Reading the alpha diffusion paper, it suggests they need to re-train (a LoRA).
I agree
Would be nice to have
Ahhh, that makes sense.
#🏞|general-with-images message looking at sword
after 3 days im starting to just barely understand what im trying to do
But a problem is, a LoRA can't be easily re-used if the model is changed from epsilon prediction (SDXL, and SD1.x) to v-prediction
Even if the model architechture is the same
sd3 weights were probably being built before that research was integrated. we'll likely see a new set of weights sooner than later
i'm happy with it. i'll take those weights
I recently made a battle between sd and mj who can better draw a magical forest with a lake, no one won)
now i see why... it needs more training, Lol
So in that sense, I think switching to v-prediction when they switch to diffusion transformer (from diffusion conv in SDXL/SD1.x) is the right decision
If they are going to break all loras, then now is the time
Might as well get better training stability while you're at it
prompt issue more than a model issue
no, don't release SD3 model in this sorry state!!!
I’m really hoping we start seeing some other competitors in Stability’s league. Open source should not feel like a single destination point…
same prompt
prompt for old lady of the lake paintings. or "Legend" style magical forest scenes
i'm really against the release right now!
@alex if you're willing to tolerate less realistic images, you can try Playground 2.5
that has no value
seems like you can't make your mind up, first you rush to release now you are disgusted by current state, yikes
obvious troll is obvious
@karmic cedar
all people asking for quick release should be duped with access to 1.5 model
HAHAHAH
The problem with playground v2.5, is the images are more obviously diffusion generated
Thanks! I forgot about this one!
they'd fall for it 100%
Super super high bokeh
placebo would work on them for sure
Like you know how sometimes you can just tell an image is diffusion generated? Playground cranks it up to an 11
sd like a genie if you say the right word it will make your wish come true
Absolutely, yeah. It’s just where their code is at
This is what I worry about SD3. Will we be able to prompt it to not have bokeh?
SDXL does have more boken than most images, Playground 2.5 is crazy at adding bokeh
I want narrow aperture images while it still looking good
I wonder…if StarCoder 2 will come in handy for anyone building AI models rn 🤔
thank god for that dragon pic, it really opened my eyes! Lmao
Emad implied they know we hate bokeh
Yeah, then the waxy images playground 2.5 generates is probably not for you
Easy to get no bokeh shots. People always obsess over blurred backgrounds. That's a prompting issue and they see what they want
good, thanks
SD3 isn't ready guys, not even close...
Bokeh looks good and people often promote those images
trollololololol
I think the bokeh in Playground v2.5 (and to a lesser extent SDXL) comes from the RLHF step
its trying to make that detail from a very small pixel density
its like a background face
or a hand
Bokeh is lazy and ruins most images iwo
doubt. SDXL had the bokeh artifact on the bots before RLHF happened.
Bokeh is just aesthetically popular
People see images with bokeh, and they like it, and then the model generates pictures with more bokeh
Basically turns every image into a 1-3 subject image
selection bias is selection bias
With complementary lighting around them
the only upside is that it makes the background issues harder to notice, thanks to it all being blurred out lol
you believe something, then only see evidence towards that
Yup so it's a good way to cover up model problems
🤷♂️
we're talking about models with billions of parameters. It isn't stuck on bokeh.
Distant large objects look totally fucked cuz they're rendered small due to distance? Just blur it!
yeah
whew flowwolf, you sure are a feisty one
Negative prompts are gonna be so powerful
eh
you can prompt whatever you want removed and it will be more precise about it
I try to not engage with anti social behavior online
show andtell xl is like bot?
the fuck?
^ like that
lol
yikes
So if we get more bokeh, and more delay on the release, that seems like the best compromise
snowflake alert
"i got no good arguments so i'll call you antisocial"
smh
swear words hurted feels
We are like wizards in training in a giant abstract wizardry school so everyone chill with their magic
@trail lion I'm not against bokeh per-se, but if people keep voting on images with high bokeh we're going to end up with more bokeh
(Also I talk like a goofball so feel free to direct your criticisms to that)
sdxl leaned to bokeh on the preview already, before RLHF had any sort of influence.
It's just popular in the data set and people tend to prompt towards photos more than other styles
your assumptions are not predicated by fact
we all are goofballs trying to sound smart from time to time, feel like at home brother
I try my hardest not to comment about how people say things, and instead focus on what they're saying. While 'm not perfect here, i believe resorting to name claling and personal accusations is just... well its a huge cop out
I just keep telling myself…”the text encoder provides the context”
"Prompting is King" is one of my mantras
Good one
wasn't worth it. higher upload sizes was convenient. about all really. i can live without mojers
i got nitro because of free trial only, 10 usd a month for animated background and few emojis does not do it for me knowing people in different parts of world have nitro for 4 usd
but i bought avatar effect and miniprofile effect for permanent ownership just to support discord after years of usage
Makes sense
My steam profile roars so i'm satiated
Oh mine is ancient. Im happy
wanna flex steam profiles?!
Haha “lidox”
not much to flex at mine
https://steamcommunity.com/id/zieloneciastko/
AHAH
You’ll see such treasures as…FTL, Half Life 2, Deus Ex
i win. 19 years service. 18.. pfffyftt
Yep, I was a little late to the party
ngl. Trusting steam wiht my CC back then was tough
i wanted physical discs for years
same, i used to play at my friends pc since he had ethernet not me, i joined world wide web party super late
Holy games Batman
i promised myself as a kid i would never want a game again. i'm about there i think
Anyone here an FTL fan?
thank steam for all shelf space u have now for useless stuff u collect instead of games lol
idk what Lykon means by this build doesn't enjoy this style of prompting
Can I upload custom models in the Stable API?
I know that it's partially because of CogVLM, but did he also mean that it will improve over time?
“That’s just like, your opinion, AI”
@untold herald together.ai has dedicated GPU instance hosting
I'm not sure what their price is, I'd guess around $3.5 per hour for a 4090 class machine (or L40/A6000 Ada)
Maybe $4.5
expensive
Yeah...
https://puzl.cloud/gpu-cloud A100 for 1.6 usd per hour
first google search
yikes 4090 for 4 usd per hour sounds like scam comparing to it
but i guess u pay for ready solution
I'm not sure if the puzl instances are dedicated
But the prices you found on puzl are incredible
well if you rent whole gpu then isn;t it fully dedicated to you?
GPU maybe, CPU maybe not
who would care about cpu while working on gpu anyway
CPU -> GPU throughput can be important for some tasks
But even for server GPUs, power throttling is something that certain vendors do (although it is uncommon)
i think a100 at 65% of its power would still eat 4090 all day long anyway
but im just guessing and know fuck all about dedicated servers
depends on work load, if it is fp32/fp16 and if you use tensor cores or not
you would use tensor cores for ai generation/ model creation i assume, why would you not?
Not necessarily no
You need reduced precision to use tensor cores
*typically
At least recently Nvidia has been heavily marketing L40S (basically a 4090 with double the tensorcore speed for certain precisions) as a competitor to the A100, and they suggest in some work loads the L40S can beat the A100
a100 has 6.9k cuda cores only lol so i suppose if you not use tensor cores and just vram and cuda it might underperform in certain scenarios comparing to 4090
Although 4090 can beat H100 in fp32
CUDA fp32
Pretty nuts
^ in pure compute task with no tensorcore usage, 4090 can exceed H100 in throughput
going to sleep now, hopefully waking up to sd3 beta
sweet dreams!
Night!
so it seems we are getting Comfyui support instantly
the devs are literally using it right now in comfyui
probably in a secret branch only they have access to
comfydev is an employee
exactly
secret branch sounds like cool band name lol
Comfyui users: "Mark... This is good news"
Forge and Foocus users:
that'd be agood one to run during training. i been on ONI lately
Yep yep, there’s also Stable Swarm which I haven’t tried yet
quality has massively improved, the eyes are no longer completely incorrect 💀
and then the 3rd group. People who aren't tied to one UI because misplaced brand loyalty
the ginger?
yes
ugh gonna die waiting
might've just been a ginger thing
lmao
okay it wasn't that bad lol https://twitter.com/Lykon4072/status/1760688059744686104
i bet prompt was "ginger teen on crack"
ayygh. no no. that eye is way low on one side. the forehead is abnormally huge. and also, red hair. that's just a personal problem though not an objective human deformity issue (or is it?)
point is, it's bad
^ this guy ex gfs
im not into gingers they crazy
yeah. yeah she was bro. way off the charts
the hype is chilling me
you guys overhype it for yourselves don't expect massive revolution
models trained on base sd3 and loras for it will be worth more hype
the hype is killing me
it'll change everything. images will never be the same!
holgraphic generation WOOO WOOOO
true, but I'll still have fun prompting stupid stuff with the base model in the meantime
I prepared a big prompt list
we're going to have the FULL holodeck any day now
just for the testing phase
you want weird science? You got it
anything is possible with stablediffusionbo.com
lol I want to prompt it to debug python code but write the fixed code on a sign held by Squidward
