#🏞|general-with-images
1 messages · Page 90 of 1
it's just a really good movie ahahaha
i kept thinking "i need to train a model on this thing"
It's such a damn good movie lmao
It has no business being so damn good, but it reallyyy is
Disney has fallen off hard, and it shows. I'd personally have to say that DreamWorks is better than Disney at the moment
they're all pretty awful
i liked Moana but i saw that one later than everyone else did
that one made me cry, too
so i wonder if i can fine-tune BLIP easily
i can manually caption a few good images to tune it on and maybe then it can know the character's names
Pixar's elementals looks stunning, but God you can just tell it's gonna be a nothing burger of a movie.
It's probably just gonna be a cutdown version of Zootopia, but with elements lmao
Emoji Movie 2?
I do really like Zootopia tho, I like the creative vision, and the execution of how they handled the sensitive matters of it. It's also a gorgeous movie haha
i liked Sausage Party 😁
Never watched it lol
I like adult movies, but it's just too damn much lmao
Actually, I take that back
I DID see it, in the must cursed place possible lmao
Lnao
My after school program teacher put it on for us in 7th grade lmao
Or 8th, I don't remember
I just remember us all talking her into it, and she was like "I will get in so much trouble if they find out"
They never did hehehehehe
UNTIL NOW 
It's been long enough haha I want that money then
"Ms. Kandinsky, Ms. Kandinsky. Can you paint us a story?"
@smoky oak you're accidentally about to live a plot from an episode of It's Always Sunny in Philadelphia
Damn. I don't even remember her name now lol
just call her Moneybags because we're you're going to make a lot of money on this case
and as your legal representative, well, i just take a tiny cut of 65%
this is a good deal. you should take it!
speaking of good deals, i downloaded NewsRadio, apparently it's where Joe Rogan started out, trying to be a comedian, but failed at it
@smoky oak have you ever tried the term ethnographic in your prompts
it's used by anthropologists when labelling their datasets, eg. ethnographic images are pictures of various cultures
Thanks for adding less than nothing to the conversation lol
We're not even in the park yet. And already getting price gouged
$80 for an electric wheelchair for my grandfather
will there be live music?
trying to do all of the lyrics to the song Space Oddity in image prompt form makes a football player in the space capsule
i know the quality is awful for this, since it's native 768x and not 1024x, but i removed text from my negatives lmaoooo
Not sure if there will be live music lol
john bon jovi stars in Hewy Lewis and The Secret Garden, final directors cut, VHS box cover art
OLIIN JOOVY IN
is that Mark Hamil?
SD thinks road lines look like how the local road contractors think they should look!
Lol
Oh interesting, new stylized realism model released, looks really good... Maybe my dreams face been fulfilled
Dreams face lmao
*dreams have
what the hell you sayin lmao
Ohhhh, is there an easy to use version now?
I would love to bring my ahem generations into 3D
As in, you don't have to make your own interface to use it now
Automatic1111 limitations mean this won't work on an 8GB system 😐
what the fuck
it holds the model pipeline around from the first stage
so the 2D model becomes a burden and the 3D stage will run out of memory
that is so strange. you would think they'd run model.to('cpu') after generating in stage 1
we had collab before
moves the weights out of GPU VRAM and into CPU memory
@smoky oak ^ getting trolled by 8K+ UHD+ again
an example of it making a SHARP branded LCD that contains the requested landscape
Seth Rogen lmfao
oh i put an A in Rogen
Do we have a free AI tool to generate skybox, based on prompt \ image? (image would be preferable or a way to get style from an image)
basically background thing you can see in a games - sky, sun, void below and shit
ohhhh
@hasty nova might have a workflow
Let me see if I can find the LoRA
he can make tiled 3D textures
What "use Karras" means in this thing?
There's no resolution option 😦
I was using this thingy for Shap-e before
https://colab.research.google.com/drive/1XvXBALiOwAT5-OaAD7AygqBXFqTijrVf?usp=sharing#scrollTo=RybJQ3160yFe
Somewhere was an instruction how to install it locally too, but 1050ti said it won't do it 😦
it sucks
whats with these teams putting so much effort in and getting like overburnt and pixelated stuff out
I made this to use with the panorama viewer extension for Automatic1111's webui. Trained at 512 by 1024. This is a first attempt. Works pretty well...
Found it!
I'm starting to think it's your prompting, cause I don't get such problems
its not my image, its a demo image
looks fine when zoomed out
Positive Prompt: (hyperrealism:1.2), masterpiece wide-shot photo BREAK A steampunk enchantress, clad in brass-accented Victorian garb, adjusts her monocle, reflecting the glow of the cogwork city around her. BREAK (8K UHD:1.2), (photorealistic:1.2)
Negative prompt: (cinematic-bloom:1.38),(haze:1.35),(fused-knees:1.32),(fused-legs:1.31),(monochrome, grayscale:1.3), (unfocused,blurry:1.3),(DOF:1.2),(tilt focus:1.2),(nude breasts, nipples:1.1),(wide-shot:0.5),(simple surroundings:1.39),(empty background:1.395),(worst quality:1.39)(low quality:1.39)(normal quality:1.3), bad hands, deformed fingers, author signature, text, watermark, (cropped::head:1.35),(close shot:1.3),(key lighting:1.2),(brightly lit:1.3),(bad-hands-5:0.9),(EasyNegative:1.0)
Steps: 50, Sampler: DPM++ 2M Karras, CFG scale: 3, Seed: 3860982947, Size: 512x910, Model: RunDiffusion-FX_Photorealistic, Denoising strength: 0.5, Version: v1.2.1, Hires upscale: 1.3, Hires steps: 20, Hires upscaler: 4x-UltraSharp
omg it needs cfg 3
and thats with hires fix apparently
"We focused on lower CFG support" what the hell lmao
Oh God, that is such an overworked image
the use of BREAK is pretty interesting, i don't think i'd seen that before but it seems to help make the prompt splitting more "deterministic"
it's actually both. an extension makes it work differently
but it's built into the prompt management code in A1111's base sd-webui
what i'm not sure of is whether the extension disables the built in method
i don't really care though lmao
The extension version is exceptional
how do you not know how many images were used in a training run. i keep training logs and when i add or remove images, i do so by keeping track
I dont have that option 😦
For what?
3D skybox gen
click the React button and type 'warning'
There is a website I know of that can do that, called something like Blockade Labs
@oblique galleon you might want to delete your NSFW prompts before you are banned permanently
oh wait, you can't 😄
🇫
Also you can't react in those channels it seems, I found another one of that persons nsfw prompts and there is no option to react
@oblique galleon you should be ashamed of yourself
What in the actual hell happened here
someone has been using the sdxl bots for nsfw stuff
Gross lmao
Does it even work?
i wouldn't know, i do not use A1111
Oh weird, that's an interesting error
seems like your connection failed and it partially downloaded
see it stopped at 38%?
hmmm weird
sucks that it doesn't, ya know. retry. that's been a concept since the 1980s
That could happen, thunderstorm outside...
Starlink? 
nope
downloads die like that on my 4090 / Starlink all the time
I just realised it was download, so sloow o_O
Starlink
When is the new AMD 7950 coming out 
i don't know or care, my 5800X3D is perfection
wonder how much VRAM it takes to feed a 4k sample to SD during training
Should I get that 🤔 and the 7950 🤔
jeez it can't download the thing
and the W7900
Team AMD system, $8000
probably superior to a Team Green/Blue system at $8000 tbh
the W7900 will have 64G, the Ryzen CPU will have more cores and memory channels and ECC support
the 4090 is the current SOTA for NVIDIA at 24GB and the i9 is just way more expensive at equal feature set
you can rebalance the use of the $8k to try for an A6000 GPU but then like me, your A6000 system will have a subpar CPU. for the same budget, that is.
my Xeon CPU bottlenecks the A6000 and A100
NVIDIA A100-SXM4-80GB (80G), on a Intel(R) Xeon(R) Gold 5317 CPU @ 3.00GHz with 88G RAM
not a recommended combo
get rekt lmfao
Bro, I am
having fun?
that's what matters
fuckin sweet that your gamps got you tickets
LMAO gramps
He's insane lmao
He always wins things
ALWAYS
This is what... Our 4th time going to Disneyland? Lol
We have been to Knott's 5 times, six flags 3 times
He's won several cruises, tons of money, over $4000 in gift cards, I swear he cheats the system lmao
he's a grandpa
he made it this far by using cheat mode for sure
oh dude i wonder if he's the single player in this simulation game
what if we're all in one big instance of Roy
@smoky oak for when you get home https://github.com/facebookresearch/audiocraft
Interesting
I look forward to what crazy textures I'll be able to get out of AI generated audio when it's more capable
Although I will say, as somebody who does much more abstract and complex sound design, but it's going to take a long time before tools develop to help with my workflow haha
But I am excited for AI generated singing and basic instruments
Those would actually be very useful for my workflow, as I don't have access to real instruments and real singing, and I can't easily recreate them
oh it's also good for making little samples
you don't generally need to use it to make a whole thing
you're asking for it by hyping it up so much, by the way
😄
downplay how good it is and it'll go much better
let people discover their preferences on their own. be less prescriptive
Now I am in testing phase
I have downloaded 3 datasets
So first I would train each of them by training 1k images
I'll believe it when I see it haha
Sure
go ahed
Thank you
it's a lot of images to look through but it really helped me understand the direction the model is taking
I am training faces
i trained faces too but you don't want it to lose the ability to make eg. a gecko
Oops lol
mine was yesterday in a different channel
I just don't imagine AI making reliable complex sounds anytime soon.
Mainly because you cannot easily describe them in order to categorize them
those look pretty cool
whaaat? i had to describe how a fart sounds to my doctor on the phone and i said "it sounds like sneezing, kind of" and they knew exactly what i meant
I mean, you had something to reference at least haha
have you ever seen that biopic parody, Dewey Cox: The Walk Hard Story?
he is shouting out directions to his audio engineer during recording and says, "play me that sound back, the one that smells like velvet pancakes"
and they were like "what the hell does that mean"
Extended scene from Walk Hard: The Dewey Cox Story (2007), parodying Brian Wilson circa the SMiLE sessions era. Hilarious but also brilliant song - something this film does so well. Not surprising, as Wilson's collaborator Van Dyke Parks arranged and co-wrote the song. Performed, of course, by John C. Reilly.
I didn't see this on Youtube or els...
My audio synesthesia is sight based haha
I could describe sounds that way, but it wouldn't mean anything to anybody else
it'd mean something to me
my synaesthesia is super advanced, ever since that vial of acid
Well, sight, and physical sensation based
lol the followup scene to that song's recording https://www.youtube.com/watch?v=f_1mxNtLCK0
I WANT AN ARMY OF DIGERIDOOS
feels like listening to my friend prompting my discord bot
"open your mind, and learn to play the fuckin theramin"
yeah this is awesome
❤️🔥
kinda reminds me of hogwarts
hogwarts if voldemort won
nice
@oak osprey yooooo
I fucking love when you find people you fit in with
There was a group of satanists here at the park. I guess Disney defaming clothing and satanic imagery are allowed here after all haha
They were covered in pentagrams, 666, all sorts of stuff
He had this hoodie lol
I'm surprised that's allowed in the park
did your grandpa hang out with them too
He was ahead, so I was able to say hi lol
lmao
i am happy you had that moment
i guess it's too far to go to hang out with them regularly
They seem to be getting more frequent
Oh, they were a group of older people, I just complimented them
Super occult group, black lace clothing, more of a dark goth aesthetic, leviathan rings, the works
"my other car is a sacrificial altar" bumper sticker?
Just complimented the hoodie, he told me where to buy it lol
It looks nice!
"we silk screen these in our basement" 
Showed them my black craft hoodie, which one of the guys said he had the same one lol
you should make your own with an AI image
I reallyyy want to
no one will ever have one
teespring makes good quality stuff
just order it, homie
you got this
teespring lets you sell them
so you can even refer people to your shop
you should have some business cards
super goth ones
But there is not much content to train them on, and any companies that do satanic imagery are typically smaller and more community driven groups, which I don't wanna hinder their work. You know?
It's a respect thing, I love the community
you're small, and a community
i'm not saying you go and full page advert on the same thing they have a rinky dink 3" ad on. i'm saying just have the option to sell one of your articles of clothing to someone else if they really wanted to
i really doubt you would be making some outsized impact on it? not like Hot Topic and Spencer's did
appropriating art styles to capitalise on
Hmmm... True I suppose
And I would be making designs that they don't offer, so yeah, it is different... Hmmm
was thinking you could have a shop where you even sell your packages to train LoRAs etc
I wish I had some more occult IRL friends
the teespring / shopify shops are so low overhead, cost-wise
they only really cost anything if they're doing something, i think
i had a teespring one for my YT channel
I'll have to look into that
just now btw
that's super impressive
between a 4x pixel count delta there's like, almost no increase in use
how
🤯
my 4090 can make 8k images?
native?
god i can't wait to make super deformed shit
at 2048x
That's huge! Or well... Small!
That is phenomenal news
Bro, you just made my day lmao
the thing is this isn't in A1111 or Diffusers yet. they tell me they're looking to add it before release but they don't know if they will have time to. i'm like what the fuck lol please do it, all this effort will be wasted
people will hate SDXL if that doesn't work on day 1
#1080946152318443610 pit bull terrier
@reef dust try #1080261341362786384 or similar bot channels with /dream before hitting space, and waiting. a prompt box will appear! then, type that
Exactly, that's why SD 2.1 failed so bad
they say they don't want a repeat of that 🤣
people will hate anyways because haters like to hate and ainters like to aint
i am looking forward to the fine-tuning guides
it's the over-statement of abilities and hype that makes people hate it. being humble makes people appreciate it more. people will still hate, but, it invites positive criticism rather than being torn down
it's going to land without pornography capabilities. how do you think its going to go over? brace yourself
also when stating issues that are observed, and being shown cherry-picked images from some unavailable internal model that don't have the issues
it's like "fine, give me access to that" but 
Nobody said that was the case. It can do weapons. Which no other SD model has been capable of, due to them selectively pruning that out of the models
base 2.1 can do weaponry lmfao
oooO i like weapons a lot. i agree there are big deals with it. but there will be a very vocal backlash at launch
@oak osprey obrigado
i am weird because i write a discord bot to use SD through. and so, i'm used to that concept. but my bot has features this one does not. and i have grown to rely on those. and the SDXL bot feels very restrictive and limiting - and im not even talking about filtered concepts
You can get a good looking handgun out of base 2.1?
to make one perfectly might take several tries. i had arnold schwarzenegger holding a bazooka pretty easily
well, "arnold" 🤣
"bazooka"
"holding"
you know what i mean
I'd love to see it lol
I have never found a non specialized model that can do any form of weapons, 1.5 or 2.1
i'll try again because i deleted all the threads on my server that had those things
"ITS NOT A TUMOAHOR"
something helpful to note is that stability trained 2.x on a frozen text encoder
LAION made the TE and they just built a unet for it
it makes me so curious who made the TE for SDXL
base 2.1 with skip clip and SNR fixes lul
nice
GPT3.5 came up with these prompts
Bladed, Pointed, Deadly, Metallic, Sharp, Lethal, Powerful, Modernized, Tactical, Firearm.
love to see it
how would it work

you can fire your handle at them
Ah, those don't look too terrible
Compared to what I usually see at least
SdxL obviously does a lot better, but that's a little promising
i wish i could easily turn off the SNR fixes and see what nonsense DDIM and 2.1 come up with, raw dog
it'd be very bad
God, the idea of SDXL running on an 8GB GPU at that high res is... I don't wanna say lmao
it feels dirty to say that base 2.1 is actually a good model once you fix the noise schedule. it makes such good outputs at 1024x1024
it says Ruger
i asked it to
LMAO
Ok, don't get too ahead of yourself haha
well i am not asking it to make people
2.1 could have been so much better man
wouldn't want my theory to fall apart
It's really depressing
it's just a challenge waiting to be solved
if everything were so easy in life, it would be very boring
Imagine if 2.1 had the suspected improvements it theoretically could do
go play with base 2.1 on my discord bot that uses the latest diffusers git branch with DDIM
it's pretty great
Maybe I'll try it sometime
personally i appreciate the noise it adds to images, after i've looked at so many real photos on Kodak film during my own training sessions. it's just how things tend to look. it's no wonder it picked that up
you have to train on purely synthetic data to remove that, and then skin starts to look plastic
a short blanket for a tall person vibes
apparently i should start merging checkpoints, and freezing some layers of the unet
I don't mind grain, like at all
I mind artifacts, distortions, inconsistencies, malformations, and severely incorrect camera effects
i tend to notice only the most obvious camera issues
eg. shadows being weird, reflections being broken
the thing that pisses me off is how the faces are distorted
I notice more subtle ones. But I have been doing photography seriously for years
can your best model / workflow make a crowd of people at a concert?
i don't know if it's even possible
i know you could likely CTU it with massive tiles and get a good crowd in each 1024x1024 tile
im just wondering how hard it actually is, and what it takes to achieve, with the current "best models"
it might not even be worth me pursuing it
she got accepted into Ruger University
lmao, their name is the Ruger Ruggers
im guessing their mascot is a carpeted handgun
oh snap
restored the unet from base 2.1 on my burned checkpoint. it's back!
It's like Dj Khaled
Is first name is Khaled, and so is his last name lmao
Khaled Khaled
Kinda a powerove IMO
power move?
100%
by the way, watch Huggbees' video, "Debunking the Insane Clown Posse"
laughed my ass off the whole time
@smoky oak ahh this is a breakthrough
the unet being trained for too long causes this. but the other image is combining an earlier unet with the new text encoder
@split rover check that out
ah neat! love experimentation like that 😄
what the heck made the clouds like that
looks like a phoenix blew a wicket faht
this is neat, I was able to get it to run locally. Made an ambient horror game track thing I guess?
i get Diablo vibes
my friend prompted flask of pew pew and it made some kind of grenade labeled pew peww
welcome back kotter 
i need to make this into an a1111 ckpt probably
Rick Moranis as Harry Potter 
That's pretty low resolution
yeah i know that's why i want to upscale it, how can i do it?
https://github.com/ai-forever/Real-ESRGAN
this project has examples on how to do it in a simple python script
if you don't want to do that and just want quick results and don't mind paying a few bucks, try Topaz gigapixel
thanks
when you try upscaling, but the settings are very very wrong 😄
Ive been saturating myself in Diablo 4 for the past week. Havent done any gens at all since
@smoky oak in his car doing gens on the roll
lookin' all cool, n shit
@smoky oak i challenge your amber heard lora
oh oops i put amber head
Why she so l o o o o n g
some of the hellish gens from the overnight batch
danny devito as frank in it's always sunny in philadelphia, 1944, noire, hitchcock, UHD, 8K, dark+++
random charlie
What about charlie noon, or charlie dusk? Or his rebel brother, charlie midnight?
the nightman cometh gave some Always Sunny themed photos where if you squint you see everyone from the show
this is from the actual episode i'm watching
it's a film noire episode
it wasn't the clams!
lol this mix is crazy. the quality is terrible but at the images are great
what is teh prompt?
danny devito, dirty clothes, black tshirt, overgrown curly grey hair, rage expression, (masterpiece, best quality, high quality, highres, ultra-detailed)
these are from a more 'realistic' mix, but they are very tame in comparison
🤣
im dying
@sterile temple i prompted the widest thing, the tiniest thing, and the longest thing
i dont get the last two but i like em
that expression, 'oh brrrrother'
separate prompts?
yes
new twitch emoji proposal
oh have you discovered having it generate motivational posters / memes yet?
i haven't tried that yet
i'm not sure what that is motivating 😄
motivational poster meme with text that says, "LIVE YOUR BEST LIVES INSIDE ALL OF THE DREAMS WHEN WITHIN YOUR WHERE, THANKS YOU -- WAYNE GRETZKY"
i didn't think it would get so much of it
or i would have tried to make it more coherent
SDXL omg lmao
waynne greamsy
wayne greety lmao i love it
damn, SDXL is pretty good
I'm getting these weird discolorations (using img2img), why?
@smoky oak apparently the faces at a distance thing are just a limitation of all of SD's architecture

I figured as much, it seems to have problems with far away subjects of anything where large details need to be scaled to smaller pixel representations
Though, higher resolution generations, and high res fix give them more pixels to work with, which is why high res and img2img fix faces and other things so much, IMO
even bluewillow
yeah i was noticing that a whiiile back, but i had issues with faces even up close back then. the samplers being fixed has improved a lot
i guess i should try training it on super high res stuff
just get good 1920x1080 right out of the box 
you know, the images from bluewillow scream Real-ESRGAN when you upscale
they have 1003x1003 dimension
that's what happens when you feed 768x768 to the x4 upscaler, oddly enough

lmao and they want you to pay
that's new
#anime #ai #hollywood #aigenerated
man, those are really not good lmao
they look like the first off the press gens, one button one prompt, zero heart put into them. eular a 30 steps
lmaoooo brutal
I think I gagged a little, and not in the good way
I am genuinely getting bored with keeping up with each new mildly improved realism model that comes out every day now ._.
just use mine 
I will ||dis||respectfully pass-
pass gas maybe
Yeah, your model did give me indigestion, after all
we always hate the things we can not have
oh I could have it, I just choose not to
🥳
That was the random emote I chose, and I am not taking it back
bro built like a kanye west music video
lmao bro just has broad shoulders. like me, i'm a 50 wide
oh hes a cool guy, but whoever you generated, he has serial killer eyes
they tried to get me to play football in texas but i'm not built for that
ohhhhhHHHHHH
tat's the Hitchcock prompt keyword
Ah lol
he's actually a good guy
Makes sense, cause it doesn't look even close to Seann Scott lol
Thats.... better
oh i know, i discussed that with the stability folks
that's intentional to avoid overfitting ahahaha
you can fine-tune it on who you want, without burning
I am in a way too cursed mood right now, oh my god
should i give him weed? Sean William Pot
i almost just said 3 cancellation worthy things in the matter of about 5 brain cycles, jesus christ
that jelly 💥
cancel culture only happens to rich people
hmmm... Thats kinda like... the exact opposite of the real world- lmao
mmmmm yes
Man with a challenge for walking through doors at day
and a double wide airplane runway at night
dude in the background looks like he's sneezing
LOLE
ok, its coming
imagine how much it would have sucked to take pictures of pretty flowers in the black and white era days
they prolly still loved it because they didn't have any better
truuue
plus everything was black and white, so looked totally normal 😄
They would have had a lot more control over the way they rendered tone with cell filters
and you can see it is early in flowering, and the volume of that plant
if they could capture it well
I have messed with digital level cell filters, and you can do some dope stuff
i think a lot of the images we think of as iconic of the era are pretty cherry-picked. i was looking at a lot of museum image API outputs over the last 2 weeks and a lot of photos are just trash. they're hard to do
especially historically when film and outdoor light come into play
that stuff is easy in modern era with raw video that can be colour-graded later
man his glasses must be super powerful
That is specifically why I bought a DSLR with extremely good Dynamic range
my photography style and editing lends to very high dynamic range tones. Whereas people who take high contrast and more grunge photos would do just fine on a low DR DSLR
a random MtG card appears!
Makes me think of that one joke card with the impossibly long name lol
this one lmao
is that a dog made out books with a snake tongue?
would like to just note that that is a real card lol
you can use it in real games, its not just some one off shitpost lol
There is also this one
Asmoranomardicadaistinaculdacar
longest single word card name
not AI, i dont even look at it anymore. i stopped even using my eyes when i go outside. i fell down the stairs 5 times and i dont even have stairs at home. i have not even been home in 3 weeks. i cant find it, but if someone would build me a VR headset with realtime AI video feed i could probably find my way home eventually
I am not trying to learn how to say that card name lol
you guys ever hear an eta for XL release?
they learnt their lesson on promising dates
Wait a sec-
What happened to your account?
why are you brand new?
Im an AI version of myself
respect

I left a bunch of discords I wasnt active in the other day and accidently left this one too
so now I look like a noob
all good, I like flying under the radar, although I guess Im not doing that either lol
that big green flower next to your name = 🔦
Chaz.... Why are you lying? lmao
Just say you don't wanna say, no shame in that lol
but seriously that XL I thought was coming out in like Feb or something.
#youtried
The untrained crap version, yeah
yeah i was playing with some credits on the site, it seems a pretty solid model
stand together with your fellow newbs chaz. one of us one of us
BLIP is the best, BLIP never lies. "A hand, holding a ball of ice" LOL no it wasn't but oh well
but the very well tuned final version, that is behind lock and key, and man is it pretty from a distence
pretend my phonetic ass spelled "Distance" right lmao
||why did chaz gloss over me calling him out for lying lmao||
I am actually not sure if he is the real chaz, or an impersonator lmao
you guys ever hear of BlueWillow? know who its by?
well from what i gather, the new model still has the same issues with diffusion latent space being so small, but, they're using things like controlnet internally to work around it
its pretty trash, i went back to test it with some of the validation prompts i use on my 2.1 training
and it also has a ton more parameters as well, which helps
can't do basic details at a distance
i wont waste my time with it then
apparently that does not help
it's great for certain things but it's hard to know what those are and now they only give you 10 a day
sooo lame
i think paying certain amount gives you 50 and then some more.. but they have a new "v4 model" that they have a comparison channel for
I am stupid, and I apologize @wispy nest
I hadnt heard of it before so I thought maybe it was the new hotness or something
I read your account creation date as June 10th, not Jan 10th lmao
I was like, he made a whole new account, whats up with that? lmao
damn a 768x768 image costs 15 tokens
lol nope
jun and jan look very similar far away on a small monitor lol
true
that is disgusting
oh! @wispy nest
Did you see the announcement about VRAM constraints for SDXL?
no what are they?
||please pseudo, I wanna share this if you don't mind||
yeah they seem to make 512x512 eg. a 2.1-base model by default, upscale 2x via Real-ESRGAN
you need like 48 or something? lol
fucking incredible, thats what they are lmao
good or bad incredible
8GB vram - 2048x2048 generations
oh noice
good incredible
my 3080 will handle that fine then
thats better than 1.5 lmao
mcmonkey says they're going back and forth between 6gb and 12gb internally
either way, so damn promising
you think we'll see it this summer?
I'd think so
maybe late August, early September, thats my guess, based off their training checkpoint progression
they have a lot more work to do for this release because it will come with documentation beyond a few simple examples, but also fine-tuning tools and other stuff
they are aiming to include the code required for sdxl to work into diffusers and probably Automatic, since it's already being used for internal testing
I am gonna retract that statement that almost left the little text box on my discord lmao
they used a custom k-diffusers library to bootstrap with
so at least i will be good to go 😄
you know what sucks about DeepFloyd though is the license they released it under
you can't really access it anywhere but the huggingface hub, and even then it requires an API key and it's a pain
you can download it and run it, but it's not as plug and play as literally every 1.5 and 2.x model
I could give two shits about deep floyd
Like, cool tech, but I want SDXL owo
no, i'm just thinking they'll do that with it, too
aiui it's the deepfloyd team that does the dataset work
i don't know how much they decide about how it happens. it's not clear if DF-IF's license is just SAI's new thing, or something the DF team wanted.
I will stably diffuse a nuke on their HQ if they do that
😐
like, LLaMA is technically open, but it's a hot potato and releasing fine-tuned models is legally dubious
you can easily release LoRAs and delta weights but not the whooole kit and kaboodle
well go on then, give
still. the progress that LLaMA prompted is great
not my kaboooddlleeee 
delta weights are a wicked good solution
thats an interesting new pfp lol
it's Oprah
- delta is an awesome word and 2) they're smaller files maybe
god, I am so doped up on nerve meds right now, I am loopy, and influenced
the fuck, that's not oprah
Once again, I choose to retract my statement before I send it lmao
dignity, lively
@smoky oak black and white Oprah Winfrey, hitchcock, detective, 1930s, go
how about... no :p
opah winfee
can't seem to put 1930s style on such a modern woman
@smoky oak do you ever have to put kermit the frog into your negatives?
never once lmao
some kind of AI cryptoid that shows up from time to time like a horror scene
dark kermit
looks like Alf actually
dude lmao
uhhhhhh black and white art gets you into some truly wild territory
black and white ALF (Alien Life Form), hitchcock, detective, 1930s++
what are people going to do without reddit?
join discords?
can you 'post' to discord?
isnt that what were doing right now? 🤔
just a more real time vibe
idk i never use reddit myself
I've gone off it
i mean I glance at it and am subbed, but i never interact on it
i dont even know what all this fuss is about
reddit is asking for people to pay a lot for api access, which means a lot of the popular 3rd party apps can't afford to operate anymore
and the native app is garbage apparently
Damn. I used to sub that and unsubbed a while back. I just followed a link to get this (was just there a day or so ago).
That is going to hurt as a lot of links to info is to that subreddit
it's going to be interesting to see what happens
people share stuff on discord, but it just get's lost in the chat history
I messaged the mods and told them that this hurts a lot, and I used to released on there TIs, and models, before the bad came, and this just severed the links to some valuable info to shove it behind a locked door.
not sure if they will come back in a couple of days or not
some are going offline for 2 days, some are permanent
mods also rely on 3rd party apps to keep on top of things
I hate reddit, but all that info is now just blown away with the sands of time.
one app developer I saw said it would cost approx 200k a month in api fees. another one said they calculated the average user makes 100 api calls a day, they would need to charge $3 a month just to cover the cost
Your modmail to r/StableDiffusion will not be read lol. We're receiving more than 1 modmail per minute and almost all of them are users that just don't know how to read the info message
(so we're just bulk deleting them)
What's even going on on reddit? What is it all about?
Heyas, long long long time no see. How have ya been?
(But I read your discord message here and I'm a subreddit admin so lol)
they are going to start charging a lot for api access
well...
a lot of people use 3rd party apps to access reddit
I use relay for reddit on android
I just use my browser
kinda depends on how/whether reddit responds
the goal is dark until reddit agrees to reverse the change, but... if they don't, we haven't made a formal plan
You know how they will respond, but I hope I am wrong.
still cheaper than twatter, but noone complaining there 
I think plenty of people have complained about twitter lol
never was a big deal
not after people got banned, not when Elon started throwing propaganda nonsense all over the place, not with API prices
“Reddit iterated that the price would be A) reasonable and based in reality, and B) they would not operate like Twitter. Twitter's pricing was publicly ridiculed for its obscene price of $42,000 for 50 million tweets. Reddit's is still $12,000. For reference, I pay Imgur (a site similar to Reddit in user base and media) $166 for the same 50 million API calls.”
Love the new twitter over the old one but hardly use it due to all the baggage associated with it. Don't need that level of toxic on me.
I can't access my old twitter account anymore, it doesn't say I'm banned or anything - it just doesn't let me in anymore lol
Can't search without logging in now too, which sucks
I only ever use Twitter if I need to contact a company but that doesn't really work since 2020. Basically, Twitter is just for clout chasers and old washed up has-beens' trying to stay relevant.
Lots of news goes on twitter, it has huge community of people, so usually news comes with fact-checking \ geo locating too, that's it's advantage I guess...but that's not really something twitter itself did 
Well, for the most part, that Twitter community is about as toxic as one can get if you dare to just be you. Break away from the pack and face it hard.
I don't use it as social media, don't post , don't talk, don't care 😄
Same
Anyone know about how many reg images should one use for a lora?
I love how a plain 3090 can do 1k of them in about 10-15m
"" geo locating ""
Hehehehe
first time training a LORA ( LoCoN )
overcooked freckles
it... won
the rlhf is not being done correctly lol
got people voting on broken images
my images keep getting rotated/flipped while using IMG2IMG - whats the cause of this?
for midjourney how many dollars per sentence drawn

not anymore. all that "fact checking" stuff is out the window. elon neutared all those capabilities
couple months ago he started calling the CBC a state owned propaganda machine. elon turned twitter into a post truth society
oookay mr mcmonkeys 😛 the idea is what i was focused on, but technically they can render a whole series of fonts using MJ just once and reuse them
i'd make fonts/<typeface>/<letter>.png and make some kind of stitching script that could generate banners using a given prompt
do you guys like CodeFormer?
I know it's used as propaganda tool now by Elon, but people still trying to post info...
Yea, lots of people got banned , lots of people got their channels marked as "sensetive" and thrown out of search engine, but people still trying.
Idk what is CBC tho.
canadian broadcasting corp
i will never forget elons infectious laugh when the german reporter asked him where tesla will get fresh water from
and he was like, look all around you
and then, the plant closed because no water
marvelous self pwn, sir
Hey, I'm looking for tips on how to create a Lora.
For example, I made this grid for different looks for a "6 year old" fantasy character. But the character's age clearly varies a lot, anything from maybe 10 to 21+.
Would a LORA help me, and if so, how would you recommend I make one?
civitai got ddosed
don't believe everything you read
it loads fine for me
it was slow/down intermittently the past hour
it just came back
but it was down or slow past 30 minutes
even if it was, why do you believe it was this person, and not someone else
maybe they are just taking credit
its just the timing right before it started
everything on the internet is constantly being ddosed, I wouldn't pay any attention
unless the site actually goes down
reddit was entirely broken for hours
this morning
yeah everything is
reddit wasnt ddos i think
prob just overload from the api changes and people rushing to change visibility settings
but yeah hypixel got ddosed 2 weeks back
and 3 weeks back discord was
it's been a long time since LOIC was usable to knock most AAA sites off the internet trivially
CloudFlare etc provides pretty good resistence to this kind of stuff these days
civit and hypixel and discord have cloudflare
yet they got ddosed
lots of recent ddos attacks
Anyway, looking for tips on LORA training.
I'm looking at this guide now https://github.com/AUTOMATIC1111/stable-diffusion-webui/wiki/Textual-Inversion
caching servers got hammered hard as their entire front page for most of their user base went private
server algorithms like "wtf we do now?"
@distant panther example: it lets you more easily try a bunch of different words to help you get the effect you want
e.g. here I'm trying different words for the hair color to see which word gives me the color I was looking for
That's super useful. I'll try it out!
controlnet is good
Hi guys, would you advise me what is an efficient way to extend an image? I had tried in the past with inpaint and others, without much success.
Dude, its destiny
How did you extended your image?
In this outpainting tutorial for Stable diffusion and ControlNet, I'll show you how to easily push the boundaries of Stable diffusion and outpaint or expand your image.
FREE Prompt styles here:
https://www.patreon.com/posts/sebs-hilis-79649068
Support me on Patreon to get access to unique perks! https://www.patreon.com/sebastiankamph
Chat wi...
There's no benefit in this tech tho...like...you can expland image using just img2img or outpaint tools
I mean, no benefit from controlnet in a way he's doing outpaint
its just something cool to do
which model do you suggest I use as a base for training realistic faces (non-waifu), I want to train it at 768x768.
JEEZ, it work so well... !
Howdy dody
@oak osprey @dense tapir I wish I was more electronically capable, cause I would totally upgrade my GPU to 20 GB VRAM, especially cause VRAM is stupie cheap now lol
I forgot the reddit blackout starts today lol
I was confused why my feed suddenly got a lottttttt smaller lol

i would never do that to a GPU lmao
and i'm "electronically capable" with a soldering gun and wick
no idea if its firmware will work with more GPU memory but i assume Chinese engineers found out?
Somebpdy just gave a 2080ti 44GB VRAM
with great claims comes great responsibility... to fuckin' prove it 
Lots of people have been giving GPU's more VRAM
just merely having it doesn't mean it can access it all with equal penalty
i'm curious what performance is like
I really wish we could upgrade a GPU with more ram as we can on a motherboard.
having the memory is better than not having it though even if performance spikes miserably when accessing those ranges
They replaced all off the 11 1GB chips with 4GB chips
So it's 11 4GB VRAM chips
They didn't add any. They just replaced them
that'll work just fine
they migt have to tell the GPU firmware? or is it more like CPU memory where it self-reports? 😮
Did you know AMD is supported on ROCm but the problem is Pytorch doesn't support any new version of ROCm?
No idea, I know on the 16GB 3070 or whatever, they needed to flash firmware to get the bandwidth higher
Therefore the 7k cards are having all kinds of issues
well i was thinking they had to add a daughter card to it with some kind of jumper pins to connect it to the main GPU PCB
replacing them will work dandy
Oh snap
Reports and testing are saying it detects all 44GB just fine, and maintains 616GB/s consistently through capacity
They do say that it boots into windows just fine, but does like to crash games and benchmarks

They don't know how to handle weird GPU VRAN like that
windows doesn't count lol try linux
Oh! It's the same user that did the 16GB 3070 and 2070 mods
when linux drivers crash you can trace back to where it occurred unless it's the GPU actually disappearing off the bus
that's an internal issue to the device then
Games and programs are already at a disadvantage for being supported on Linux, so I wouldn't push it by using an absolutely hacked GPU on top lol
well i am a kernel developer and i've been playing games and stuff on here since 2005
i haven't used windows
correction, i ran windows longhorn back when it was in beta because it was really cool
They also state that the reason they were able to do 44GB was cause the 2080ti had the same reference PCB as the 48GB capable RTX Titan
oooh
So, after reading more, it seems I was wrong, they used 2GB chips
So the PCB has 12 chip spots on the front, and 12 on the back, and it usually had 11 of the front used, and none of the back
They used 11 front and 11 back with 2GB models to get 44
*moduels
Hello
How block nude in the prompt?
Many prompts don’t have nude, after posting showing nudes 😟
some prompts add nipples and etc
how to not show nudity in prompts?
It's probably the model you are using
What model?
You should be able to prompt specific clothing, and add negatives like "nude, naked, shirtless, bare chest, boobs, nipples, NSFW" stuff like that
usually just "nsfw" negative solves an issue...kinda
it removes niples, but you can get nipless tit lol
Yeah, it's likely you're just using a model that's too sexually aligned
i am having problems with launching SD can someone help me?
Thx @smoky oak and @wild sorrel
I am fine with through hole soldering but SMD stuff I never was able to handle that stuff, and reflow is black magic to me as to how it doesn't short out the connections due to how small the space between pads are.
@smoky oak I am trying to figure something out about training with something I never used. Of course all my links about it are now blacked out so have a clue about this? I am taking that to mean if I have 1 epoch and set it say 25 I will get four saves for that 1 epoch?
If you have 10 epochs, that's saying how many epochs I between it will save
If it's 10/1, it will save every epoch, 10/2 will save every 2 epochs, 10/5 will save every 5 epochs, and so on
So first would be 10 saves, then 5 saves, then 2 saves
shit
This is what I hate about lora stuff as it has no way to save on XX steps only epochs
@smoky oak so i'm discovering that stopping the unet training about 50% of the way through is amazing results. freezing it completely, at least for 2.1, isn't a good idea. that preserves all of the initial artifacts
You can save on xx steps, at least in kohya
I can images but how would I the model too?
I just don't see any point in it, personally
I suppose on slower training hardware, it does make sense
partially-trained unet vs frozen unet
how would I though?
see the jpeg artifacts in the yellow coral at the top left
that's because the unet is from base 2.1 and the text encoder has about 9600 fine-tuning steps on it
i can use a unet from step 4200 with the text encoder from 9600
the unet converged on a pretty clear image at some point, it's just, determining when that happened, and freezing the training at that checkpoint for that component
the text encoder seems like it can take a fair pounding for a very long time
I am gonna be leaving for Mexico for a week, so I really hope I can think of a way to get LoRA training working again
i hope you do, too. is cool seeing the weird shit you make that i never expected you to
you're just full of surprises
you're not the only one having the issue but you didn't update it, maybe something in your venv updated somehow
Maybe, IDK, but I even did a whole new separate install, and it's also not working
well a new install would have new requirements too
erm, dependencies
no issue reports that sound similar, yet?
4200 steps -> 9600 TE/4200 Unet -> 9600 TE/2.1 unet
interesting that reverting all the way back to 2.1's unet reintroduces stars
there's a lot of extra "noise" but a lot of that is also just dirty-looking details
i don't think it's very accurate to project stars onto that background, to be honest. the sunlight would likely wash them out
you guys think XL will run in auto when it released if its like 8 gig or whatever?
Ok, so Kohya SS isn't made by Kohya, it's his code made into a GUI by somebody. The creator said that it could be cause Kohya operates differently with torch 2, so maybe my SD update where it installed torch 2 also affected kohya, and that could be what messed everything up
It's likely, but as is, there is no support for the way they run it
But I don't need them shunning the biggest part of the SD user base
I was messing around with it on Dreamstudio it seems like it could be pretty powerful
That's nothing compared to actual SDXL
SDXL should be able to do native 1024x1024 gens like nothing, and the version that we can use isn't using any of the new data set@wispy nest
Have you seen the 50% trained benchmarks for SDXL? They are astonishing
it's just kind of annoying that the bot is tuned to give shitty outputs regularly just to see what happens. though it would also make sense to do that to keep the true abilities under wraps for longer
im just glad it's using controlnet internally now and forever so that i don't have to deal with that project lead anymore
same checkpoint, earlier unet on right
the base 2.1 unet makes it look like a Windows 98 desktop wallpaper quality, pretty bad
@wispy nest these are images from SDXL 50% trained
Oh great, it download them all in shit ass quality
that grape cluster thing has a blocky background gradient
Uhhggggg, I hate ingur
It's imgur
They look way better
still look good here
Just a sec



