#🏞|general-with-images
1 messages · Page 98 of 1
wait, you got access to it?
We all have access to it in the bots bro
i wouldn't get attached to it, apparently the thing on clipdrop is going to be the release
wish I realized that they updated the bots sooner lmao
the bot version is a older research model
Wait, so we are gonna get the worse version from clipdrop?:/
well. for now
the version on there sucks ass in comparison
Why not just give us the one on the bot and not compromise with the bad quality of the clip drop one
the one on clipdrop is nothing like the model they used on the website, it's more similar to the early beta version.
i don't know honestly i said a couple weeks ago it was good enough
i have no idea what they're testing / waiting for
yeah try to get a woman with freckles on clipdrop when it comes back, lol
i was talking about this: https://stability.ai/blog/sdxl-09-stable-diffusion
Discover SDXL 0.9, Stability AI's cutting-edge release in the Stable Diffusion suite. Unleashing remarkable image and composition precision, this upgrade revolutionizes generative AI imagery. From hyper-realistic media production to design and industrial advancements, explore the limitless possibili
yeah, that one is blegh
nowhere near as good as the one in this server
genuinely upsetting to see
the images they made with SDXL0.9 ARE WAY better than the ones from clipdrop, and the bot is somewhere in between
SDXL 0.9 is the one on clipdrop :/
and we can't tune the cfg on the bot or the clipdrop, so, for me, i find it very difficult to gradually tweak the prompt without going too far or not changing much at all
If we get images as high quality of these with little control, imagine what we can get when we can really rip into it and mold it how we need
i refuse to believe it is, the one on clipdrop looks like the early beta, not even remotely close to SDXL0.9
Some of these images genuinely blow MJ out of the water, and I am so here for it
Clipdrop on their own site say its SDXL 0.9, and they ARE Stability AI
they also said there was an issue with their inference, holdup il show you an example
If we get the one on clipdrop, then they massively disappointed after showing what's on the bots currently, cause its leagues better
but clipdrops whole site is lies and shitty experiences, so IDK
me and @oak osprey did a comparison, whatever they did on their website was messed up
the bot made the same broken shit by the way
multiple times
without a style tag i couldn't get a comparable result, and with the style tags i couldn't really change the result much, even with very different aspect
yeah, right? there is absolutely NO chance that what clipdrop and the bot used is SDXL0.9(the model they showed on their website).
not even close
i added just one term 'nebula' and it totally changed the output and then it was difficult to get the phoenix to show up as anything but an actual nebula, like a telescope would see
no seriously, the clipdrop one is terrible compared to the one in this server
clipdrop is worse yes
Trying to generate tigers was all deformed on the site, and washed out, and blurry and the upscaler is terrible
but both made bad stuff for me with fantasy prompts
clipdrop reminds me of Bing lmao
not in a good way
the same eyes tho
both are incomparable to SDXL0.9, idk what happened there
@smoky oak nothing tops that UncropXL picture from the family photo where it made the BOY pregnant
lmfao
did i have a stroke
i didn't ask for stephen merchant
where are you even getting these "incomparable" claims? imcomparable to what?
@smoky oak that's what SAI staff say but they don't really tell us much more so to me it's like blowing💨
the model they used on: https://stability.ai/blog/sdxl-09-stable-diffusion
Discover SDXL 0.9, Stability AI's cutting-edge release in the Stable Diffusion suite. Unleashing remarkable image and composition precision, this upgrade revolutionizes generative AI imagery. From hyper-realistic media production to design and industrial advancements, explore the limitless possibili
the one on the right
all of the announcements behave as if clipdrop is cemented
looks exactly like images I have gotten out of the server bot lol
oh you can likely get those results from clipdrop if you click Generate 10 times but then all your credits are used
infact, i would say my wolf looks even better than theirs
your wolf is theirs 😄
it'd be kinda funny if they got the checkpoints all mixed up and cant tell which is which
Their example of an SDXL 0.9 wolf
idk man, if what they used on clipdrop is 0.9, im disappointed
yours looks like a dog
the SDXL bot uses 4 models, one of which being 0.9(atleast what one of the devs said)
It was prompted as a wolfdog/direwolf, to be fair
so, yeah that makes sense that the good images you got from the bot were 0.9
Do bear in mind that the 0.9 model is a research release intended for researchers/devs to start mucking with the architecture, not a final ready-to-go model for normal users
the bot seems to use two models now but idk
also, their example for this image is pretty bad on the site, especially when I saw people in here generating images like this:
normal users pay for api access to it soon
i can make WAY better cosmic jars using my 1.5 finetune, holdup let me make some
lets see you
he can
his model can also do elder magic, and sdxl cannot
sdxl can't do zippers either but i don't think any models outside of controlnet can
they look so weird. it's a very distinct pattern, almost like you can use it to identify AI images
bro
I mean, its cool, but I would not say better personally
also, jar in jar
did you ask for that? lol
i wish that canvas look were not there
now generate one ontop of a table in a library
sdxl starts to fall apart when you get deep into composition like that too
it can't just do one jar, and it can't do a jar in a specific scene, where SDXL can
or at least not reliably
is it already 100% confirmed that it wont run on 10XX cards
yeah see, it doesn't work
what you are making is dope, but not what is being asked for
those are books next to it =/
When you ask SDXL for a galaxy inside a jar on a table inside a library, it delivers just that
an actual library, not a couple books lol
I asked for bokeh, cause thats how cameras work lol
ask for flat zoom, or narrow aperture, or whatever fstop value you want, it don't do it
.
That one is a little better
same prompt as you mentioned
right side = narrow aperture = no bokeh, just full scene in focus, and sdxl won't do it
Yeah, that one looks a bit better
the one cityscape i was able to get in focus took a lot of prompt mangling and it had the same destroyed looking buildings once they were 'sharp'
it at least can kinda sorta pass for a library
that's kinda overly harsh
but yeah, my goal is not to outdo SDXL, i was just saying that the version of SDXL on this server can't reach the same lever of detail as 1.5 finetunes, but im sure the one on their presentation sure can
I mean, it beats the shit out of a lot of 1.5 finetunes, but its also a general purpose base model that can do most things better than some of the best single subject focused finetunes
I get what you are saying, but its a little crazy to me
It beats 1.5 handily in realism
thats for sure
@oak ospreyI do get what you mean with it not being able to not have background blur, but I don't mind, as thats my preference for shooting, and is the most accurate for dimly lit scenes
I am sure that is something we will be able to do when we have better control over it
but yeah, i can guarantee you, that after i will get my hands on the base SDXL1.0 model, i would make a model that demolishes MJ
maybe, i'm just saying it's clever, because it matches human aesthetic preferences and covers up a shitton of flaws
As a quick sidenote, to remind of where we came from, this is the difference between 1.5 base and SDXL 0.9 base lmao
Exact same prompt, so yeah
hell, it already demolishes MJ in realism (not too hard to do TBH, I can do thate ven with a 1.5 model haha)
also, here you go, in a library.
I can only imagine how amazing this model will get, cause fuck 1.5 and 2.1 base are terrible without finetuning lol
the noise in the wolf's fur pissses me off. i know i can't get rid of it either, but i wish they could
Where? thats what they look like when photographed
I get what you are doing, but its pretty clear it is struggling hard to put just universe in the jar, letalone just one jar
man, this "that's how cameras work" and "that's how they look when photographed" i live in tofino where wolves are everywhere
i don't need a photo of a wolf
So you want it to look like real life? Thats kinda hard when all we can feed it are photos... not real life lmao
here is how a photo looks though. there is none of the uniform pattern in the fur like AI does
yeah, when you get close enought o see every hair 
you don't see it? i see it when i'm not even zoomed in
i've been trying to eliminate it for a while now
real photograph of a wolf at a comparable resolution
that's a tiny res
like yeah, I get what you mean
1024x1024 is the SDXL output
yeah, but the head is only like 512x512 of the image
added 'house' to the prompt, this is a great prompt
hm, ok, fair
and that image is is 500x600
I get what you are saying tho
but the moire from the training data is likely a factor in that as well
894x596 was my other img tho
which has more detail than the 1024x1024 wolf face 😦
it just feels to me like a really good 2.1 finetune
can you send the whole image?
MJ would beg to be able to make images like that
it IS
lol
i could use UncropXL 🤣
oh wait it's down
fuck
i was so ready to have a good laugh
yeah, thats all that res in this section of the face lmao
like come on man
the FBI showed up at their door because their tool made a little boy pregnant
i'm talking about the pattern tho

i mean, make a wolf even closer up in sdxl
fill dat frame, bb
not sure how to haha
i was super impressed with the tigers i made but i still saw that criss cross applesauce in the edges of the fur
i wish it could be fixed, because it's distressing and makes it feel like a waste of time to try and fine-tune the details
like, if i knew i could fix them with VAE training, that's what i'll do. but seems to be a question still even for SAI how to do it
also, y'all should see that AI_sponge thing, shit's wild
you know another thing, the text embeds for 2.1
they mega improve fine details. and HOW?
how do i get that merged into the fucking text encoder itself?
Man I hate when your generate prompts get deleted
that's with SDXL?
that's actually infuriating yeah
tried to gen an image, bot ate it
2.1
the skin looks a little too fake for SDXL tbh but the prompts people use to get these 2.1 images leave it with little other room than plastic unreal engine crap
still has massive artifacts for compression, but thats a 2.1 fault as a whole
yeah
that's because they're using text embeds on base 2.1-v
they're not using embeds on a fine-tune
the whole image looks like somebody ran it through a severe denoiser or upscaler or something
2.1 is like the crazy uncle of 1.5, i swear
stuff like this is why I don't do 2.1
hires fix
compression and distortions, and artifacts
oh man, is that what it does on 2.1?
no idea, i don't use it
it looks like they did something else to this image
its all... smeary
maybe that IS the TI
like this
that's the prompt editing i think tbh
it made a little village =]
ah maybe
it is so odd how prompt editing works
OH YES
you even showed it
the cartoony people it makes
i don't have text embeds on my bot, but i used the same prompt to show that image to you earlier
i didn't use prompt editing
Also, this section for the same part on the SDXL image is only 340x216 lol
So yeah, 894x596 is a wee bit more res to work with lol
well over 4x the pixels
its 7.25x the resoltuion of the SDXL image
dont get all nerdy on us now
I math when I math! >:C
math is scary
bro what are you talking about, we talked about nuclear energy n shit a few days ago
oh lawd
realism
dude this is gotta be like top 5 AI hands
how do i make a photo of someone sticking their tongue out
on sdxl i tried and i could not
*Cthulhuritis
i tried to make a woman stick her tongue out and then i tried to get LSD blotter on her hand and instead, she became LSD blotter art
does anyone have dreambooth discord link?
would you drink this
you mean EveryDream2?
no, dreambooth(the tool to finetune/train)

you're on it
maybe you mean the huggingface discord. otherwise i have no idea
@smoky oak fucking ranciDXL #1100170365604483202 message
that's actually impressively bad.
i want to take a look at their server
Dreambooth is a Google AI technique that allows you to train a stable diffusion model using your own pictures. This Imagen-based technology makes it possible for you to insert any subject you want into a stable diffusion model. I made a similar video in November showing you how to use the Dreambooth extension in automatic1111 but since then a lo...
you aren't aware of dreambooth?
dreambooth is a research paper
WHAT HAPPENED LMAO
can the server please fix this already, FFS
Every time, makes a mod ticket you can't access
oh nooo fuck man i don't even want to play this game anymore
brought it up to mods several times now
some sick fuck is trying to generate porn in bot 1
very realistic, much wow
tried to report it, but it made a dead channel
looks like regularization data from a good fine-tune
it sucks the image is so weird because the fur looks awesome lmao
100% would pet and die
how are we supposed to be able to report peoples gross gens in bot channels?
still has an odd distribution but could just be freshly washed
hamburger button -> apps
oh, they brought that back?
yes
no I asked the mods, and they said that they just dropped it for the ⚠️ react
I asked as soon as I saw it gone
I used it all the time
The SDXL bots feel a little like they are uhhh... goofed today lmao

I swear, if they release the shitty clipdrop model instead of the amazing one in the SDXL bot, I am gonna be so mad lmao
ok, this one looks a little better
my theory is that the clipdrop one is without the refiner model
they're trying to eliminate the need for the 2nd model
Well they REALLY need it lmao
if thats the case, then SDXL will be DOA
jaochim said that it has a discord server
it'll be great for my discord bot
Yeah, but most people will blow it off like 2.1
i can see bluewillow charging for it and calling it v5
people will just go to kandinsky 
my attempts at a wolf
walph
we are testing a lot of things, some are not great but the data is useful
ah, so THATS why the images look way worse today, at least thats reassuring haha
still crossing my fingers that we don't end up with that clip drop base model as the actual release
yea haha, we debated on pulling some of the worst things but left them because the comparisons are valuable and then we can be done once and for all
they are running different inference settings to the bot. Dont go off that for what can do solo
i'm not going to use it anymore tbh that image of the wolf with its skin missing i received earler was a no-no
some of the options are just actually broken and i've been saying there's no use but The Decision Maker ™️ really wants to inhale data
Yeah, still gonna be massively disappointing if we get the clipdrop version, and not the excellent SDXL bot one
fair enough
They are the same model underneath (most of the time), sampling settings is very important though as the community knows by now haha. Plus we are actively working to improve it further before 1.0 🙂
remember that half the point of the bot is just to determine which of our options is best
this is literally... how we decide what to go with here
so, yeah, you'll get the best lol
Fair enough, though if we are losing the refiner model, and thats why the one in the bot looked so damn good, then thats gonna be a massive amount of bad press when you dangle something exceptional, and deliver something passable
Just my feedback
if we end up with that exceptional model that was in yesterday along with being able to use it on actually accessible hardware, then SDXL is gonna be fucking incredibly amazing
@proud dagger would we have the option to finetune on sdxl1.0 which is releasing in mid-july?
holding out for that, cause I am exceedingly hyped after testing for myself, though that clipdrop model is extremely concerning haha
yes once you have it locally you can train all ya want
that is amazing
and thats saying that tools will be available for finetuning on day one then?
if so, massive W for SDXL
Our internal tools for training are or will be public (not sure if they're in sgm currently or not) and we have contact with devs of some public training tools to make sure those are ready, so yes
Oh we def are aware. The refiner has very nice detailing ability but we would like to achieve near if not the same performance out of the single model. The bot constantly runs with it varied between on and off. Right now though when approaching this research release we wanted to push it as far as we could and those two together pushed it much higher for the time being (just like how people love high-res fix and things of the sort)
Interesting... Thats interesting, and also concerning, but I guess only time will tell
still waiting for deepfloyd stage 3 and tuning tools for that
Ideally though a single unet can achieve the same if not greater performance, we are already considering improvements from here
oh yeah, so what was the hyped release date for today?
We have some really good leads for bringing it down to just one model with the full quality. This is still very much an active work-in-progress in that regard rn
thats on DF team 🤷
it looks undercooked, like it didn't finish diffusing
The number fairy Emad was counting to yesterday's announcement
Is that projected for release in Mid July, or is that for a later relase suggestion?
Yep was open sign up for research access starting yesterday along with model details
yeah idk why the excessive hype machine stuff
95% of SD reddit was swearing SDXL was out today, now they are all gonna be pissed lmao
you all do it, it's not just emad
||Emad is kinda a goon for hype and frankly stupid claims and stuff||
Its how the other SDs went too lol, research access for a tad, api access monday, then open wide soon after
better look at the pricing there 😛
Yeah, is it a reddit level cost? lmao
$200 for 5000 gens or so? i forget. i thought it was rather steep
after the full release next month, will it be 2 .ckpt files, or an entirely new format?
it's basically $0.50 per image
safetensors ideally
You know, even if we do end up with the massively underwhelming clipdrop model, at least SAI seems to give a shit this time about finetuning, rather than just leaving SDXL to rot like 2.1
wow, that is certifiable yikes lmao
Turn my PC into a webserver and let people gen SDXL images for $0.10 an image lol
and requirements stay the same? there was a lot of confusion about the requirements
or higher VRAM
The requirements are just fully made up at this point, no actual data on them
5000 credits for $200, .5 credits per image and .2 credits to upscale it
The other thing that happened yesterday is our research team gave I think three separate talks? at CVPR 2023
Currently we got it running on 8gbs locally (20xx cards). I see @proud dagger coming in with details haha
from some people saying 2048x2048 just fine on 8GB, to then being corrected saying "it got bigger"
(about all the research papers they're publishing rn)
no matter the case, it doesn't do 2048x2048 'just fine', deformations and dupes
man, i will make such great models with the new model
We have yet to see that, though we know the model can gen well under its base res, which is exciting
i thought it did 2048x based on the post from emad
naw, they've told us repeatedly in the sdxl channel
we can get some 1920x1080s out of it nicely but yea, its a 1024 model haha, all the same issues going up above
VRAM usage is 8 to 12 GiB to run in full or near full at basically any res.
It can go lower with offloading as usual
Basically if you have a modern nvidia card 12GB+, you're 100% golden. 8GiB+ or a modern AMD card, you're probably fine to run it well.
interesting, I have a 10GB card, which is reassuring
4GiB+, you're dependent on how well offloading implementations work
ok, so 4GB is still potential, that is also reassuring
2.1 seems to work on 1024x1024 too
idk about AMD, they are kinda trailing behind Nvidia at this point
hell, I am willing to offload for the level of quality that SDXL bot offers
massively so
10-series or otherwise very small, uh... in current revision it's not going to go well, but there might be community improvements to get it working if enough people are wanting it
been there, did that =/
(Remember that when SDv1 dropped it required a 3090... and then it required 12gigs... and now it doesn't even need 4)
Very true
I swear running SDv1 feels like can it run doom now haha
you been workin on that a while eh
also, been curious about the prospect of FP8 support?
LLM models are down to 4bit quantized with very little degredation now, any sights on that sort of thing at the moment?
Nows your time to shine @proud dagger haha
doesn't output a proper image natively but with highres fix, attention scaling, or other tricks, it can
They can't even do that at 32bit lmao
owo.
It's been very much looked into internally, we've just been busy on other tasks
they can but you need to know what you're doing to ask for stuff
I've been wanting to hardfocus on it but it was eating too much of my time
vicuna 30b at 4bit quantized only sees like 5% coherency drop from 32 bit IIRC
I want SDXL-4bit
measured how 😛
i love this https://www.youtube.com/watch?v=nTfqQv28Yx8
Thanks for watching!
Socials
(AI Sponge)
Discord: https://discord.gg/aisponge
Donations: https://streamelements.com/ai_sponge-le8fr/tip
I have allowed the owners of AI Sponge (Parody) access to my live streams for this purpose of hosting!
====================================
Hope You Come Back Soon!
Should be possible
That one score that measures the coherency, I always forget the name of it
as soon as i can get away with borrowing some h100s im going to see if i can just shove the model into the new fp8 tensor cores
ended up making a car thingie
Cause man they were really struggling at launch
btw re LLM quantizating: we're seeing the first half-decent 2 bit quantizations rn.
I wish SD was as easy to quantize as LLMs are >.>
they got curves ok... all models are beautiful
H100 was getting its ass handed to it by A100's for a lot of tasks
My friend is a head Ai dev for unity, and they kept them all on A100's instead of upgrading, cause they cited big performance drops in what they needed
Now what those specific tasks were, IDK lol
but he said the A100 as like 40-80% faster
but I heard they have been fixed with drivers now
not all are. well, not for all use cases. look at Bark by suno
they trained a language model directly on audio
is BARK considered an LLM?
yes
hmm, interesting
but its not capable of conversation... I feel like there should be a distinction there
they ensure their models are compatible with released embeds
oh you can flip a flag on their inference code and it starts hallucinating like LLaMA
it's an autocomplete style transformer
ohhh, neat, didn't know that
WHY DID YOU PUT THE BOOKS IN THE JAR
basically it gets a history prompt, which is a numpy array of waveform data, WAV audio. and it continues the voice semantics and follows the prompt as guidance for how to form the audio
bad TE moment haha
I more meant that LLM code is essentially just one matrix multiply operation, repeated with different weights 50 times.
Whereas SD takes in text, spins it, flips it, bops it, puts it in a unet, grows it, shrinks it, flips it norms bops it drops it, takes the result, does a backflip, does the macarena, and then 20 gigs later there's your image
so much more code to write
spin it, flip it, bop it, pull it, shake it, shrink it, grow it, kick it, punch it, suplex it, do the nae-nae and boom, It's finished.
yeah, SD text reminds me of my handwriting
how are textual inversions capable of fixing fine details that this fine-tune hasn't improved yet? i can't seem to hit the quality of a textual inversion even with extensive fine-tuning of the TE and the unet separately/together
what the hell is happening
you'd think those strong genes would carry over
but the children have no mustaches
tragic
Someone know a website that good with creating bedrooms?
@cyan snow early stages of youthelder sybdrome
Just starting to learn SD. Been practicing with Frank Frazetta and John William Waterhouse

soon....
Provided to YouTube by The Orchard Enterprises
It's in the Way That You Use It (1999 Remaster) · Eric Clapton · Robbie Robertson
August
℗ 1986 EPC Enterprises, LLP. Under exclusive license to Surfdog Records
Released on: 1986-11-24
Producer: Eric Clapton
Producer, Studio Producer: Tom Dowd
Recording Engineer: Magic Moreno
Recording Engineer...
owo?
👍 💯

Nice name, u interested in horror?
sigourney will take that thing down
don't tempt fate, bucko 
queen of diamonds is still figuring her damn shit out
the ones with 'photo'
seem to be the last to improve
portrait works great, photo not so much
For me, photo is more "scene" prompt while portrait is a "area" prompt 🤔🤣
Yeah, its half of my brand haha
Ever tried one of my terror checkpoint or Nightmar shaper?
Because if you love horror I'm already your friend :D
I have never tried a horror checkpoint, might need to soon
Here a brief example of terror vs nightmare
closer
Nah
u sent that already
alllso this discord is pretty tight on the 'nsfw' rules and the amount of skin in that example is 
I forgor
he forgor
I:m in too many servers go remember
These do literally nothing to show off the important parts of the models lmao
he sent it in 100 servers
this how i make the jars in some of my photos
1 man 1 jar
make him emaciated
hell no
@sick pagodaGonna download and try your model, see if it's good or not
also, can anyone get something similar to this out of SDXL?
we already tried with a different concept
Also try the neg prompt suggested if you don't have 1 already
What model are you downloading?
bro wtf, there is not even a human in that photo
not even 1 jar
1 MAN 1 JAR!
great result tho
your sense of humor is even more fucked up than me, and i joked about nuclear reactors
i remember generating nuclear stations exploding from sd
was that the prompt for that gen? there is not even a jar
two jarheads
check it
should include it
in png info
what in the HP Lovecraft fuck did you train on
your model is fucked, it didn't follow your prompt
what was my prompt
@sick pagodaOh wait, just a sec
is your model 1.5 or 2.1?
those models can probably be repaired by reconnecting the trained unet to CLIP L
and throwing away the broken CLIP
@smoky oak ahahahahahahahahaha it gets way better when i put midjourney into the negatives
Your model is looking a lot better as of late
isnt midjourney a paid website
Midjourney is a pretty meh overpriced service
thanks i really started spending a lot more money on training it ahahaha
the elongated neck lady is the 'photo' prompt starting to become tall aspect friendly
neck man
@smoky oak i thought you'd appreciate that excluding the midjourney image attributes allows me to clean up the output and make people look more realistic
it does haha
this is because every incoherent image caption i have from midjourney i just replace with the term "midjourney"
so any random MJ image becomes a viable negative prompt
you can also throw it in as a positive prompt though
that makes the output wild
also, can anyone get something like this out of SDXL?
150steps looks pretty realistic
probably, but it would take a lot of work. similarly, SDXL images, you cannot easily reproduce in 2.1 or 1.5
its just three totally different styles of prompting and they are incompatible. it's not to say you can't do the same stuff, but if you're wanting that, just stick with the model that does it well
that Sigourney Weaver image in the grand scheme of things has taken me about 3 weeks of fine-tuning on 2.1 and, i haven't gotten to play with that checkpoint in a very flexible way yet, but it's still not as easy to get those realistic people results as it is in SDXL
So I've been trying to get Stable Diffusion to run locally using the AUTOMATION1111 WebUI and v1-5-pruned-emaonly.safetensors but the results so far have been absolutely terrible. I'm running it on my M1 Macbook Pro with 16GB ram. Is this expected output on a device like mine...? Default settings
@chilly zenith try a model like Artius v2.1
I've also noticed that it's quite hard to make something "similar" to someone elses art because it's not only about the image itself, but the style the person who made the first one has. Like, similar how? Color? details? realism? or all of it? etc, etc :P
That model is trash lol
All right that makes sense lol
@smoky oak it's bad but he should be able to reproduce the reference images
Thanks for the suggestion
you need to try finetuned models, those will give much better results
base 1.5 is mostly useful for making awful looking stuff to laugh at and scare your friends with.
Hahah well I'm very glad to hear this. Thought I was going crazy with those amazing pictures you guys have been sharing here
And then my handicapped tiger drawn by a 4yo
i think we've all been there
apple sucks
yeah i mean, MJ can't compete with this stuff
A little reachy, but I get what you mean haha
lol
"lol"
MJ can compete with that. FOR MONEY
eh
you're going to be pouring credits into MJ's wallet to reproduce any images you want, but if you're open to accepting stuff you didn't expect, well. then it's cheaper.
the efficacy of paying for MJ is dependent entirely on expectations of the user
oh, no, I'm also starting to notice the ai placing light sources in the image! 😱
hello Moes!
I love it when women walks my direction…less so wielding knives...again :P
what the fuck
AI WOULD NEVER DO THAT
he is hunting the elder magic virus
he has a team of elder-youths on his side (not pictured)
I saw so many weapons before I finally learned why they always had one fused to their hands :P
@stone cipher i miss u

that was fast
if i'd known i would have tried sooner
@smoky oak peter is still alive
that's sweet
nooo! I was just about to goto work and now I will have to fight of the urge for ice cream as well!
the puerto rican day parade
I thought of you and the faces at a distance for some unrelated reason ;P
@smoky oak you know classifier free guidance?
and how to help it along while training
sir why do you need to see under my robes before i get into the theatre. why do you assume i am hiding other people under there, and snacks, and drinks? it is impossible
Unexpected error: Processing could not begin, you may need to refresh the tab or restart the service. 0it [00:00, ?it/s]
Total progress: 0it [00:00, ?it/s]
Total progress: 0it [00:00, ?it/s]
I just restarted stablediffusion in both browser and console
what's the point of XY script existing if it just gives us more things to debug while we are debugging
no time fo that shiz
oh wow though, night and day, this new 2m sde thing is better than sliced bread
its as fast as 2m but as effective as sde, idk how thats possible
anyone ever seen a sweater sold like that?
i guess it just assumed to make a side by side profile lmfao
Anyone knows how to solve this? It happens to me when I want to use the GPU for Roop.
This problem does not happen when I use the CPU.
I installed the toolkit and steps from github to use the GPU, but still the problem
@weak sageHow nice to see thou, how are thou doing?
i am well, thank thee
Hmmm, I need a GPU that supports a PNG with like, 50 times as many pixels
whatever you are running right now can do that
are you talking about img2img tiles?
but like if i could gen at 10x resolution it could fix all these annoying face and hand issues
That would make them worse actually
just have to get better at prompting, and use models better suited models for stuff like that
and also inpainting
you what, type "good hands" into the text box and magic happens? lol, best i can do is controlnet inpaint global harmonious and reroll 100 times, get one passable gen
I use models that do hands just fine, personally
Roop???
"mJ cAn Do hAnDs"
lmao
man it's like they don't fine-tune their text embeds at all
that halo effect
fuuuck that
i would rather use filtered LAION data for training
Generated it for my dnd character
slick
@smoky oak downloading only high-res laion images with camera models in their exif data
it's not bad
no wonder we can't do buildings
I just tried out Artiusv21, and holy crap I can actually generate 1280x1280 images.
Watermelon.
Power.
\
It'd be hard to tell it's fake.
edited
or try deformed geometry
bro, i don't need to, there is no need to get rid of something that's not there. thats not how negative prompts work.
why would i add a pretty good detail to the negative prompt?
not negative
why only 27 hires steps tho
id always use 0
why would you try copying my settings without my permission?
do i need your permissions
🗿
its open source
i don't look in your shit
but you can
i keep thinking masterchief is the doom slayer
Wasn't expecting it to turn this prompt into a human, but turned out neat
I don't think so, unless it's prebaked
no =]
fuck no, i made my own model
ever heard of finetuning, well i suppose not, you didn't even know what is CLiP a day ago -_-
i still dont
cuz nobody explained it
and i never intepretated it as important
more like third party tool
BRO, with out CLiP diffusion isn't possible, especially SDXL
that's a key component
give me 1 reason to be able to know about it for no reason
then why did i never need it before
it's encoded into the model checkpoint
so why do i need to know about it then
it's the component of the model that makes connections between text and images
what does this do
uh, maybe because you installed the AI locally
and
all i do is install the folder tho
after i click webui-user.bat it installs everything automatically
and? i'm losing my brain cells by each passing moment
do you really not care about the coding and the technology behind the AI? stable diffusion is ment for people that know what they're doing
I just love when SD decides to nuke your whole PC and you have to deal with 5fps until you save everything on your PC to restart it
no reason to gatekeep, even if someone doesn't know all the specifics or exactly what everything is doesn't mean they can't use the program
1 little missclick with a VRAM OOM, and your whole PC is toast until you restart
i just like making images that are unrealistic but looking realistic
sometimes shit takes time to learn and people'll take a while to learn it, or they won't, but they'll still have fun with what they have. and that's fine
Nep is kinda a community troll in a lot of ways, to be fair. They ask for help, then disregard that help, and then blame you when things go wrong. It happens nearly daily, and it stems from them having a lack of interest/understanding of the tech being used
no
ohh
that makes a lot more sense
Most people who have been here for a while can attest
bro im not a 20 years experienced dude
how old are you?
16
oh, same age as me
It took over 5 hours and 9 people to help them install Kohya SS, and they still never listened and never got it working right
(I was one of the people)
I was expecting you to be like 13
i got it working after time
i think fp16 was the problem
had to use fb16
nod
They give off that vibe.
Really though, I do think that sort of thing is warranted with Nep, when it comes to not caring about how things work, and just "gimmie thiss, gimmie that" and "tell me how to do it, I don't care about why" and stuff
Can't even google anything for themselves :/
out of curiosity, what are your like? computer specs. sometimes really old gpus cause some weird ass problems if you've been dealing w/ that a lot
They have a 3060
huh
i used to use ai with a igpu on my old laptop
you mean bf16
yes
on laptop or desktop? big difference
desktop
CHRIST
that's a good amount of vram for sd
how did you build a pc yourself and you're. struggling with sd
@spring sailSeriously, they always have problems with everything and never learn, it really is just a losy cause at this point
I help people all the time when I can, but I have never once ever been able to successfully help them, cause they never listen
cuz sd isnt physical bruh
Nep is an enigma lol
an what
before i learnt python i was making custom watercooling loops, that's not surprising
an sigma
maybe I'm just a little over experienced for a sixteen year old in comparison to you or smth but I'm baffled (I started messing w/ like, vqgan+clip type stuff when I was 12 on really slow google colabs)
idk I feel sd is a bit simpler than python
bruh
not harrassing you though nep
sd is python
💀
made with*
yes, that's more correct
Same here. Nep is just a very immature person, and I don't mean that in an insultiling way, just a direct observation
mostly the direct observation hits the most
I do wish nep luck in learning SD better
Same here
eh. not necessarily, NONE of SD is possible without the diffusers python module.
if they aren't a troll I'll try n offer advice when I can
But yeah, anyways, hope that little bit of information helps explain why Tdg may have responded that way. Generally speaking, they are a really nice person (Tgd)
working w/o guis is always funny, I fuck up on that a lot myself
n yeah! that makes sense
@spring sailDo you mind if I DM you?
I thought nep was getting ruthlessly bullied for struggling a little LOL
uhh, sure, what about?
A small continuation of this convo, little more details in private
sure sure, I typically keep my dms closed but luckily for you I actually have them open on this server so I can access the bot dming me
sd isnt my only interest tho
my perspective
@smoky oak did you see my comment about my inference code bug
my discord bot was overriding my cfg to 10 all the time
sooooo
thats why i had incredibly crispy crap for so long
you can uae tensorflow instead but it sucks
use
that wasn't my point, i meant that SD is impossible without python modules
Oh, you can do it in c or c++ but itd also suck
huh, but as far as i know, stable diffusion is meant to be used with python, even more so, pytorch
theyre just weights, a vae, and text embedder
yes, but none of them can function to create images without the modules. at least as far as i know.
you can use it with jax or onnx or tensorflow etc
those are all libraries, stable diffusion isn't able to run on it's own
i certainly didnt get far with them when i was just starting out but thats because you have to convert the ckpt first too
it works fine lol
pytorch has more developers using it
its easier by far
you can't just put a diffusion model on a PC out of the box and expect it to create images
microsoft and google keep trying to compete and fall flat
i dont get why that is even a point to make
no one is dumb enough to think that
are they?
you'd be surprised, but yeah, i understand what you're saying
that won't end well
makes sense
it assumed a ckpt is a tensorflow file
GPT-4 assumed that? or GPT-4 LLaMa?
when it wouldnt load, i asked and it was like try pytorch. but i didnt know how good that was, i only knew of tf
gpt4
weird, i thought it already had way more than enough parameters to be as capable as us at coding.
stable diffusion is newer
pytorch wasnt as big
a lot has changed
the huggingface modules used to have totally different parameter names and loading models through transformers was hard to figure out too
still, isn't GPT-4 trained on more recent data than chatGPT?
nay
by chatGPT i mean GPT-3.5
2021
it's good for generating prompts, and it's local
probably the best local LLM model
it doesnt know how to do anything correctly with it in code
i use it for prompting, so that doesn't really matter for me. but i bet if you finetune it specifically for coding it might be able to code atleast decently.
chances are slim of that
fine tuning a llm needs very high quality data it hasnt seen before or you risk mode collapse
this is in general a problem for humanity now because llm are flooding the internet with text
hoovering up data for training like that is over
amazing how text llms can require so much ram, when people use to say stuff like you can fit all of a massive library of books onto a dvd
and yet images take up a lot more disk space in comparison
Weirdly enough, while you can fit all of wikipedia's text in so many megabytes, the text used to train LLMs is huge - terabytes of plaintext
They're not trained on a library of books, they're trained on all text ever basically, cause they need to know about everything ever
a chatbot needs to be able to answer list every key event in japan in the year 1602 perfectly (or any other obscure overly-specific question like that)
whereas with SD, it's not really a problem rn that you can't say give me a dog that's not too fluffy because the bot's too dumb to know what not means and just gives an extra fluffy dog instead
SD just decided that it wasn't TOO fluffy. it's subjective, see
FANASY WAAAAARLD
I see the prompting as a sort of, "here is a card with the word dog on it, now, show me a picture of a dog." That's why I it feels like writing full sentences to the ai doesn't do anything because what would a "card with the word has" have a corresponding image to the ai's "library of image references?" There's like billions of different possibilities.
A dog can look very different from image to image, but if someone "has something" then it can either be worn, held, dropped, at home, etc and that's what I think is why the ai draws weird stuff hovering around, having it fused to other prompt words, and more. There's just not enough fine-tuning for normal sentences yet. :P
In a1111, When making non square images, I often I get the repeated patterns. I thought there used to be a way of generating at 512x512 for some cycles, then scaling it and continuing... ?
(actually it looks like the thing crashed.. but n/m that)
Highres. fix might be the thing you're thinking of?
@wispy nest that's it. thanks. (I think with my window size it didn't show the width/height scaling sliders)
danke!
:))
You can start with square then outpaint to the sides to



