#💬|general-chat
1 messages · Page 123 of 1
I use a 4080
16gb is muuuuch better than 12
quality-wise 16gb is a make-or-break window for a lot of folks
if you're buying a gpu today, anything less than 16gb is a joke
you're def crazy to go under 12gb
better off skipping the daily to go coffee for a month or two or whatever and at least getting a 4070 super
if u using a lot of ai related stuff less than 48gb is a joke
buying a gpu today period is a joke, when 5090 is coming out
that might not come out for a year
wrong. coming out end of year
you don't know that, lol
if u dont wanna wait u can always buy an A6000
i bought a 4090 a month or two ago knowing the 5090 would be out in about a year, maybe a lil longer, cuz life is short and it gives me an excuse to buy another killer card and set up a second system when it comes out
5090 is still going to be 24gb. the TMSC forges aren't cranking out denser chips and nvidia isn't going ot eat their enterprise markets
supply and demand
5090 is also technically still a gaming card. Games, even at 4k, won't need more
yeah also what clownshark said. if yo'ure not poor then it's something to look forward to
pretty sure if u are talking about being poor then a 5090 is not something to flex about
you right a poor person would flex about a 4090 on a random chat online
if they spent next months rent on it and need to convince themselves it was worth it, i bet they would
maybe they bought one of those custom versions that are really expensive that have like gold plated fans
theres ram like that too,with some gold on heatsink
yeah, my concern is they'll use it as an excuse to leave it at 24gb of vram
if it's still 24gb i won't buy it unless it's at least 50% faster
if it's 32gb or more, i'm buying it
it'll be a lot faster for sure. i don't think it'll have higher vram. i'm confident in this prediction but i'd be happy to be wrong
yeah, no idea what to expect there
i've heard a lot of contradictory shit and it doesn't make a diff if i spend all day reading about it trying to sort that out, so i don't bother
it'll come out when it comes out, till then, i'm enjoying the absolute fuck out of my 4090
if it's got more vram, great, if not, well then whatever, f nvidia and hopefully the competition finally catches up and kicks their asses like amd did to intel when they got fat and lazy
i might trade my 4080 up. i only got the 4080 because prices of gpus were insane a year ago and the 4080 was the only one canadian retailers weren't pumping the price on. it was still msrp.
ahh yeah makes sense
i'm only so not cheap
i managed to score my 4090 at msrp which was really lucky
i'm still pretty cheap
yeah, i'm not ever going to pay a fn scalper
absolutely
but yea the way i see it is if my 4090 is worth literally nothing in 2 years, that's 3 dollars a day and i'm getting a hell of a lot of entertainment for 3 dollars a day
plus, AI is the future for sure and i'm at least getting to tinker with some stuff related to it and get familiar with some important technology that i'd otherwise probbaly be too tired to bother with after work
3 dollars a day = what i would prolly spend on a nice IPA watching a movie in front of the tV instead anyway... really not much
i used to lower my field of view to get more consistent frame rates. my 4080 though, i kept that habit up and it was blowing smooth framerates at me so hard that the narrow field of view was causing me serious motion sickness. can't say i've ever had that gpu problem before
like i'd had to just walk away from the pc immediately, get fresh air, burp a ton, it sucked. i'm like "Wtf mate?"
framerates are a weird one with that
yeah i turn up the fov and it doesn't happen no more
if i'm not careful when in helicoptor situations though. 🤢
got that on halo reach recently
the only game that has ever given me motion sickness was the new ratchet and clank
it was only for a moment, but it did actually happen
i'm lucky for that
not that i'm immune... boats... oh god
when i first started using vr , framerates were so bad that i got it hard
ahh i haven't used vr
yeah i'll get sea sickness hard and i live on an island
ugh
last time i went out in real waves, it took me over two weeks to feel right
haven't gone back out and prolly won't ever again lol
i can't do carnival rides. not the intense ones
yeah i'll pass on that too
tea cups MAYBE
there was a six flags incident when i was a kid and i haven't had much interest in any of that siht since
my cousin went on the batman ride
where you're strapped in by the shoulders and it goes in loops, etc, that's all that's holding you in
i sat at the front of the "sky master" and haven't done any rides since
and the damn thing DIED and they got stuck upside down at the top of a loop for i think over an hour
ppl were throwing up and it was raining down everywhere
i was done lol
https://youtu.be/_mOGVOVZw70 this guy ended my carnival days
ugh that looks baaad
yeah. lol. so while i'll deal with motion sickness in a lot of situations, i've never had it in pc games till my 120hz screen was fed with all the frames
fun times
been gaming since 86
damn, yeah
hilarious thing to me is i have yet to fire up a game on this 4090
i couldn't have imagined a few months ago that i'd get a 4090 and not play any games on it
try out "the finals" it has wicked crazy destructable environments, runs on unreal engine 5, and is free
throw it all out. none of it is unreal engine 5
at least my gamepass sub (got ultimate for 5/mo 3 years ago) finally expired. that solves one problem
star citizen is almost here i heard. another 2 years maybe
alan wake 2 is on my list but i still need to finish 1
loved control
about halfway through BG3
2/3 through horizon forbidden west
control i started playing. was fun. then i got distracted and forgot
halfway through the last of the mass effect trilogy
is probably one of the games eating 70% of my drive space
control is as cool as it gets
damn i love that one
it's the atomsphere and the humor that gets me
the sheer weirdness of it
red dead redemption 2 is a worth while title. i've sunk most time inot that lately. it's so pretty and runs so well on the ada cards
prolly not surprising if you've seen how weird my art on here is
oh god, that's another one i got halfway through
oh and oxygen not included. not really a graphic flex.
generally the issue isn't burning out on a game, it's gcetting a work crunch for a month or two, then by the time that passes i fire up something new because i'm no longer on a roll with the last game
hear that
"got distracted" is my way of dancing around crippling work hours
no mans sky is really pretty too. got a huge update AGAIN just recently
but can you prepare duck confit in no man’s sky?
that takes a certain level of culinary talent.
yup, the problem is i come home so exhausted i don't even have the energy to game
sometimes i'll try, but i can't focus enough to figure out what i'm doing in a strategy or open world game, nad lack the reflexes for anything else
focus? i hardly know us
ha
fooocus
missed oppooooooooortunity
"are you using fooocus?" "no i'm on foooocus"
Still? But foooooocus is out!
Imagine one day you come in and in "Stable Models" you see the SD3 room XD
i still have doubts that the board will be allowed to release the model. they can make more money by selling it and the investors might demand that
emads all about blockchains now too so you know
you know how blockchains go
yeah, i'm a lil concerned about that too
any nice Civitai Models to get??
all the existing tools are still here. can't unrelease weights or code really
we don't talk about dreams on this server
sd 15 "cyber realistic" has new versions out. pretty nice but still very sd15
Is it just me or SD15. while not bad feels more like video and sdxl feels more mature and film like
i've found proteus to be VERY high quality generations. supposedly it's based on ponyxl.
im think im using CyberRealistic v42
is there a newer one?
thats the one
if they dont release sd3 probably the community will refine cascade somehow
its the MOST realistic model I think
cyberrealistic is the best? not photogasm?
there's a 2.5d cyberrealistic too but i didn't think it worked too well
Okay is it just me or not using a dozen random prompts like high quality, low quality, and all that jazz actually result with a much better photo generation than trying to shoehorn all that stuff in?
You some of my best generations didn't have anything like that in it.
its a vague nebulous world, no definitive answers, the magic and bane of ai image generation, its still very much a random slot machine spin
occasionally it blows your mind but even then not much you can do with it as there is no real consistency yet
depends on your use case. if yo'ure using a model thats really refined to the content you prefer, you might find prompting to be easier
turns out a lot of people have very similar use cases and there are a lot of models refined towards their "needs"
I'm not so sure about that. Because I found if you go into a model and try and force it into a image style that is not trained on, it requires a lot of those pointless keywords. But if you find the model that you like and just allow it to do its thing it does great.
Yes it is a bit random but I find it using less keywords but more along the lines of the proper keywords is where you can actually control it fairly well.
At least in my personal experience so far.
I found that a lot of those keywords kind of... I guess you could say make everything similar. And it also takes away a lot of the magic of the randomization that your other keywords give you. It kind of gives you a narrow range for the AI to work with if that makes any sense?
i use prompt magic in the dynamic prompts extension to add all that flavor text to prompts using a gpt2 model
Prompt magic is fun.
i can say for sure SDXL needs less rpomting wisardry
Definitely can throw some variety in there. I've really enjoyed started using artist or photography keywords in my promp ts and those end up really fun.
artist names are always good flavor
pointillist obese cat floating in space
I guess what I'm getting at though is like using a set of words like high resolution high quality masterpiece the forces it to use source images based on that where I don't think every single beneficial image would have those keywords so it excludes potential benefits.
I feel like those keywords are an attempt to create a perfect image on the first generation and not subsequent image to image or edits.
I feel like the first few generations of an image are really just to get the basics compositional down. Try not to force it to do too much in one pass. Do it on iterative generations bit by bit
its hard to spend hour son perfecting an image even tho if done like that probably any image could be created that you cna imagine but the temptation to keep trying with new promts an the ease at whcih ti can be done is too much
Well really not so much a perfect image in the first pass but it feels like that they're trying to get to as close as perfect on the first pass as possible.
And yes I do agree. But it's just getting kind of silly in some of the source images that I've seen. Like they have like 150 tokens worth of negative prompt keywords and I think that's just getting a bit silly.
wait until people have t5 context length. then you'll see some crazy prompt metas
i'm gonna drop zepellin lyrics in every image
Geez I can only imagine. Now I just want to copy and paste my favorite song lyrics into an image and just see what comes out. Wish I was home at my computer. Out of town at the moment
Definitely going to do some tool lyrics.
You've given me a new idea my friend.
if any of the ui's had lavi bridge adapters implemented, i'd recommend using t5 on your fave sd15 model
but what are you gonna do?
What's t5 and what's a lavi bridge adapter?
is the video API down for anyone?
t5 is a text encoder model that will be used in sd3
Oh I'm just a casual man. I'm technically inclined and everything but I'm still pretty basic when it comes to the advanced stuff on stable.
I so basically what I'm saying I don't know what you mean by text encoder.
clip only has 75 context length. it can only understand that much in any context. while it can do longer prompts, it'll only understand it in 75 chunks
Right I follow I'm familiar with that part
t5 has something like 800
Oh interesting.
lavi bridge is one of the new models that have come out to create embeddings for sd15
But will it be able to understand natural language better?
Because that's something that I really look forward to. The ability to"paint a picture" in words and get a accurate output
Just type in "cool movie" and get a full lenght feature film
So I look forward to when it becomes less about prompts and keywords and you can effectively describe what you wish for
one of the experimental models out there uses t5 already. i forget which one.
Yeah could be a full length feature film but that ain't going to get you a shit what you were hoping for. You got to be detailed man lol
https://github.com/PixArt-alpha/PixArt-alpha this model uses t5 but a very small training dataset. you can sort of catch a glimpse of what t5 can understand
Right but it natural language let's say I did this. I wanted a 1967 mustang with a large supercharged engine with flames shooting out the exhaust, next to a futuristic sports car wreath an electrical discharge.
Currently it's going to get all confused. But if it understood it in a natural language sense it would be able to differentiate the separate vehicles and effects.
It's going to end up with both cars with both of the features. A natural language model would be able to understand the you want new next to old fire next to electricity.
natural language works in sdxl reasonably well if you feed it to the openclip layer instead of both
sdxl has 2 text encoder layers and sd3 has 3
Huh ... Ok I didn't know that
I've been playing around with large language models and stable diffusion a bit but again, I'm kind of casual. I guess I'm above the general pop in understanding.. But I'm still not high level understanding
And I don't really have a super lot of experience in SDXL. I'm running on the 3080 and it's kind of hard to make the transition with the speed change
No worries soon, we will try to force it to do the bizarre trippy stuff it does now. When every prompt will yield very realistic normal images.
It will be called Retro AI
AIstehetic
"A subgenre trying to replicate early AI image generation artifacts"
"Deformed hands is one of its staple"
higher base resolution though. if you're doing megapixel 1.5 2 pass generations, its comparable speeds
https://www.reddit.com/r/StableDiffusion/comments/15c2n0q/sdxl_two_text_encoders_two_text_prompts/ thread on the dual layers . you can really only do them in comfyui i think. the auto1111 devs scoff at it and don't think they work.
you can get a really good idea of the natural language understanding in the openclip layer by playing around with the 768x 2.1 model. it's built on openclip and doesn't use the Vit-L model at all
the clip ViT-G model is the original pretrained model that sd15 was trained on. it was made with REALLY bad data but capable data. That's where all the flavor text capability is coming from. Poeple learned to throw special tokens at it because it worked.
But 2 years later.. sd15 models are REALLY refined with a lot better data. so while that text encoder is still pretty weak, its more refined than it was
I really want to use OpenAI’s voice model to turn my speech into Jamaican patois
I want them to do harry belefonte's version of rap god
this emo demo has got me hungry for more rap god https://humanaigc.github.io/emote-portrait-alive/content/video/16比9视频结果/song_cxk.mp4
This reminds me of the initiative to fine-tune a language model on Google’s product names. “DeepName”
🥁
I’m thinking about tomorrow.
my plan? right now
i thought you wanted to release it now wtf
you need to slow down and consider the shareholders a little bit more like i told you on the phone
that wasn't me on the phone bro i think you got ai scammed
bullshit that’s not possible yet
open ai trolling you with their unreleased voice thing
of course they'd want sd3 released tommorrow. they'll delay the bomb all they can
they’re fishing with the best bait they’ve got. but OpenAI’s over there with their aged ahi tuna steaks
fuck i want some ahi tuna now
me too. hold the mercury
roku's basiliisk is actually a diffusion-transformer model? hmm
Imagine how realistic SD3 will look
Maclunky.
Wha is your guys favorites GUI for SDXL?
Stable Diffusion Forge ...
is not about realistic but the image resolution so the dettail and over all the most important is the way the training of models is but im not expert
Will SD3 be released to the community or are they closing it up?
last news I saw from the stability ceo was that they're improving security without compromising quality, then beta, api and then weights. It's taking soo long
Hello, how do linux distros work with SD? is it true more vram is needed
The more VRAM, the better
understood. But the question is whether *nix system are less optimized
I would think they should work better. But I am not really a pro in that topic
3-5 weeks ETA (4-6 weeks but this week is almost over)
It's just deep floyd / IF all over again, they say soon, then they mean 3 months. I personally kill time buying and trying 3d assets, at worst I'll have a bunch of things I can throw to a canny preprocess
yeah its impossible to know, I just wrote whatever the CTO wrote
"improving security without compromising quality" what does that even mean... sounds like shallow Corp talk to me.
^
maybe safety actually
maybe it's the censorship of their services and API
they can't do much to the model itself
^
Lorem ipsum dolor sit amet
consectetur adipiscing elit
sed do eiusmod tempor incididunt ut labore et dolore magna aliqua.
no satanic chants in chat please
fr fr lorem ipsum my beloved
It means censoring the model, which it needs to be, and probably making it harder for degenerates to make NSFW models so easily. Probably means some of the training data and weights will remain closed
I have faith the degenerates will be victorious
How does the Easter Bunny stay in shape?
Eggs-ercise ! 🥚
Happy Easter folks !
Includes a lot of things that are done to avoid causing headlines kind of like Google's image generation did a while ago
Part of it is making it difficult to generate pornographic content, of course.
But also things like removing biases the model inherits through its training data and making the model more unlikely to generate offensive content on its own.
I don't know what extent they will go to. It might also include things like preventing the model from generating likenesses of public figures and preventing the infringement of IPs.
I explained why removing biases wouldn't work and told everyone exactly what would happen if they tried lol. Organic biases make the model work as intended, forced biases cause issues. Google proved exactly what I stated back then to be true 😅.
It's a difficult problem for sure and can't be fixed as naively as Google attempted.
Openais approach wasn't any better. The complaints about GPT 4 losing functionality and quality is because associations have been manually altered lol
It wasn't better for the users, but it was probably better for OpenAI
If you take Obsidian notes, map your seed output with the seed, the image results, the prompt, and settings. You can create a map of the latest space. There's a display option to see how all the data connects. I've used this to map out SD 1.5s latent space and if you do that, you'll see why altering the biases is problematic. The way things connect for context is amazing.
The issue is that image generation models aren't being treated the same way as i.e. photo editing programs by the public. If someone makes an offensive image using Photoshop, then nobody would reasonably blame Adobe, but image generation technology is viewed differently in the public eye and therefore companies need to take some level of responsibility for the outputs their models generate.
Sounds interesting
It's cool, the local opensource stuff has been outperforming again so that makes me happy lol.
100% same
Do you have an example of a map like that?
I'm glad there's an article about Emad being a Robin hood type lol
StabilityAI sure is a light at the end of the tunnel for now 
Mhm, I actually use it to take lecture notes
I started using it for DnD and saving prompts.
I don't really make use of the hyperlinking too much, I use it as more of a Markdown + Latex editor tbh
Then I noticed all the connections like I did with Spectrograms when I figured out how to do voice with Riffusion lol.
After that, you kind of know how to draw out the data you need as long as they don't force associations, you can predict the AI lol.
But if you know all the forced associations, you can predict it still I guess.
I had to study psychology, physics, python, and bunch of other crap I didn't plan on learning just to make my art into music videos lol.
Is it possible to create more than 3 seconds of video without things looking weird? also im not sure what video extension to use for automatic1111.
SVD is made for up to 25 frames ...
Ive been looking into different extension and svd seemed the most promising but i just wanted to get the communities option and such.
I always forget the stupid name of the extension I have been working with ^^
sometimes i forget aswell.
Deforum ...
and what does deforum do, isint that option considered outdated or am i wrong?
Complete different technology cause it's rendering frame by frame ... less consistance ...
You can see it in a lot of music videos ...
so i guess it might be considered outdated, also ai video seems kinda annoying because of how much time and effort it will take to create soemthing.
I stitch together SVD, Pika Lab, and RunwayML videos or use animations as a base and turn them into more coherent videos with AnimateDiff.
Not really outdated ... you can use video2video to get more control ...
that seems like a ton of work...
Interesting,
As it's always been 🙂
for some reason i really want to make an anime series now with ai, but it seems impossible at the moment.
Nah, takes little to no time with my workflows lol
The wolf is probably the best example, I made the video in pika, used it as a base to transform the rest using ip adapters.
how do i even use this workflow and does it work with 1111?
Use the comfyui extension in Automatic1111
Or use comfyUI itself. It's similar to automatic but modular and faster.
that exists, dang i have alot to learn.
Stability jap said they're working with anime studios to give them some tools, if we're lucky they'll release something in the long run
I have an anime workflow 😅
A grafiti comic one as well.
You can do it all right now
ComfyUI is best for it. There's a massive community with workflows and tutorials.
how many workflows do you even have?
Video offline is mainly from comfyUI AnimateDiff and Deforum these days.
A few hundred?
I have way more models lol
Impressive!
Impressive indeed
I only use one workflow for everything that I keep upgrading
How I'd recommend doing anime right now would be... using blender, daz3D, and make low quality animated videos or use stock videos or take your phone and record you and your friends acting out the scene you want.
don’t judge a motion model by the amount of keyframes it produces! ask not what frames it can produce, ask what motion it can conjure!
slight zoom in and out
Train or download a Lora for anime
Generate images with the lora using frames from the video or animation you made as an int for image to image.
Hopefully we get a contender for Sora on SD sometime next year. Maybe even this year if we're lucky
we = the ones without $50 media generation plans for Company X/Y/Z
Use the video as an int, use the stylized image to image in the IP adapters, use the anime lora.
Take the output and use wave2lip to create the lipsync.
well i litteraly cant 3d model and i have 0 friends.
Use Bark or one of the RCVs to make the voices.
Don't need to, just use a base mesh.
Use Daz3D free assets 😅
i think bark might be a good choice.
oh yeah, also daz only supports Nvidia and i have amd.
https://github.com/jasonppy/VoiceCraft for voices
i’m 98% certain this is what prompted OpenAI to announce Voice Engine
Hmm, 3Delight should still render for you right?
Like you can't use irat
Iray
But you wouldn't need that, you can literally render the view finder for this 😅
oh...
Use make human and blender
Use um, facerig
The facerig route let's you record video of your virtual avatar talking, Use that video lol.
Like everything's a tool right now thanks to AI. All the old software is valid.
this is alot of info.
Wings3D will let you make epic spaceships in AI
Take game footage from you playing a game lol
It’s easy to forget all of the raw granular power old software has
thats sneaky but cool
You could record Scenes in Secondlife and transform using A.I.
Yep
ill probably use game footage and and stock video along with other stuff if i can.
Companies like Adobe are going to move towards having an agentic UI where you’re just talking to Clippy about your photo the whole time. I think low key they’re going to try and negate the use of manual healing and brushwork tools as those are going to become harder for an AI model to detect as being artificial.
Games are now studio spaces where you can act out your movies lol
dont people also use the sims?
Turn off the UI, capture screen, input galore lol.
I don't know ... only have Secondlife 🙂
oh ok
Mine craft can be input for crying out loud lol
i wonder what the og doom game would do with stable diffusion?
I was tempted to apply for the palworld comic position
it would become quake
My plan was to take in-game footage and screen captures and use those in my existing comic and AI workflows lkl
Lol
I'm not that interested in making palworld comics though 😅
i want to take the attract sequence footage from Outrun and vid2vid it
the old 80s arcade version
ive also been condering making comic too,...
DC MMO for example would make amazing comic AI fodder 😅
Grand Theft Auto or APB could be used to make realistic movies
100%
Like right now, the tools Don't actually have to advance for us to do anything 😅. Just gotta see the connections 😜
1.5 alone is enough 😁
pretty soon entire film crews will be assembling in future versions of GTA Online and then videos will appear on YouTube talking about “a group of nerds who filmed an entire 3 hour epic using GTA Online’s generated in-game tools”
Well for images and video lol
“move over, minecraft gameboy emulator!”
Hell yeah, Mocap suits at home with VR visors to act out your role lkl
Lol*
No need for green screens, camera equipment, props, sets, costumes.
but the asset list consisting of just trashy shit.
Guys I need an urgent help with some image generation about architecture if someone is willing to help? I'm a newbie in this community and with the whole stable diffusion thing
Like, you could make your kids a video for their birthday exploring Ark and make an educational dinosaur video narrated by whoever you want lol
skyrim might also make some cool footage?
can i dm?
Oh heck yeah, especially with the mods they have.
Skyrim would make an excellent studio.
Yeah i was thinking that aswell.
I'm debating old UO
“ARE YOU NORD?” “NO, I AM LIZARD RACE.”
"No. This is PATRICK"
Well Kingdom Reborn with the 3D view.
“I’m TIM.”
That would be an epic studio.
as someone who played the lizard race once i feel that.
rotfl
while, okay, it is a holiiday, happy easter everyone... i'm still pretty sad that its another day with no sign of lavi bridge extensions
“HaVe YoU hEaRd oF tHe HiGh ElVeS?”
Low resource requirements, fully modable servers due to emulators, can script events and customize the mob nodes.
Like Ultima Online... hmmm... I wonder if I can incorporate AI tools into Ultime Online and turn it into a Virtual Studio 🤔
/takes off grey cowl
I think my next projects goings to be making a virtual studio...
I wonder... I wonder if you can use AI to create am overlay in real time that uses less resources so it can operate at real time? We can already do 1 to 1 with image and video technically speaking right?
Also what can i do with ai video like what would i do with the final product?
Like use a Lora with an IP adapter and QR code monster right, and use that with animate Diff, the input would be the current screen frame. This would be super light and efficient because you could run it at like a .2 denoising strength.
I dm-ed
What do you want to do with it?
Your reason to create is always your own. If you want to make money, well that's on you. I can do many many things, but, I'm more Peter Parker than Tony Stark 😅.
I have no advice on making money. Lol
I just really want to share my work with others once i get soemthing done.
i dont really care about money.
Tons of AI groups, I do a lot of my work for a kids cancer group.
YouTube, Tiktok, Instagram, create your own website and host your own videos, share the link to the site, generate foot traffic, run promotions and offer to incorporate contest winners into your comic/anime/videos, make sure they sign releases, have contests to determine the winners, have the contest focus on spreading the word about your content, most new sign-ups or whatever lol.
If you use something like Pika Labs or RunwayML, tag them. If you start gaining enough attention they'll likely offer you free services.
Speaking of which, I got a email from some dude at Pika trying to arrange a meeting I keep forgetting to reply to 🤣😂.
oh thats cool
If you're in here my dude, I have ADHD, it's not intentional lol.
I have autism and dyslexia.... also yes i use an autocorrect tool.
I feel your pain in all the ways my friend. There's a reason I have 100s od workflows and etc. 😅 AI became my special interest lol.
i dont really know how to reply to this....
But the adhd is the cause of the forgetfulness lol
I'm a double threat lol. Sporadic and Hyperfixated lol
You'll find a ton of neurodivergents in AI
Tech in general, but especially in AI
seems like confirmation bias. very common in the ai field too from what i've seen. (heh)
Yeah, it can definitely come off that way. I actually didn't know what a Neurodivergent was until I got into AI lol. It is just from my perspective and should be viewed as the lived experience of a single person rather than a shared experience by the collective species 😅.
Anyone can help me how to swtich from lowvram mode to medvram mode? First time generating it was using Dedicated gpu but after 3-4 times it use integrated instead. Dont understand why
Im using fooocus gui
I haven't used Fooocus yet unfortunately.
Everyone is a neurodivergant these days, which makes us all normal again lol
Lol, everyone copes in the ways they need to. Studies have shown the cause of some of it seems to be additional noise in the signals in the brain. There's always noise, our brains use it kind of like the Diffusion process does, they have found increased amounts of noise in the brains of neurodivergents and it seems to be the reason some can perform well above average in some areas but struggle in others due to the noise. I'm wondering if there's a correlation with the increase in brain mass since the 1970s. They have some peer reviewed research showing that individuals born after 1970 have 6.6% more brain mass than those born before. It's like 15% more white matter and 12% more gray matter.
That increase in density and mass has to cause increased noise, which if viewed through the lense of the previous study indicates at least potential for a link.
I think enough to warrant research in the noise produced by those with the increase in brain mass versus those without.
Its funny though, that what they are discovering about the brain is that it functions extremely similar to the image Diffusion process and the context iterations from the LLMs
if someone thought in a supremely normal way, it would be abnormal in how normal it was
i would expect that before modern medical imaging, the density of brain tissue wasn't well researched
That's the benefit of peer reviewed research papers, you can actually go read them and see how they accounted for that. The reason I made my hypothesis is due to the lack of research and findings on it, but yours has an answer if you want to go read the paper. It's worth the read, I was a bit skeptical at first, but the sources are pretty legit and it's passed the peer review process during a time where there's extra scrutiny on the peer review process 😅.
Essentially, if you're honestly curious and would like to form and informed opinion, there is a means to do so.
some of the peer reviewed pure garbage I have read before >.>
from what i understand, modern humans are identical to ancient humans. a couple of generations doesn't really measure anything.
most resaerch papers get things wrong. thats a good thing though. getting things wrong is part of eventually getting it right.
Not exactly sure what you're looking for here 😅. I provided you with information, if you are interested, there's avenues available to satiate that curiosity. If you're looking to debate it, without even reading it, I'm not sure how you expect to make any valid points or influence me 😅.
Arguably, modern humans today are quite different from each other. For example, asians tend to be more intelligent than almost everyone, except ashkenazi jews who are an entire standard deviation of IQ above caucasians (115 vs 100). We also have some groups of people who are more prone to vastly more diseases
IQ deviations linked to race, this will be a popular subject for the mods 😄
who says i didn't read it? kind of presumptuous
It's a taboo topic for sure
Logical deduction. Questions you ask and points you try to make are directly answered or refuted in the paper. 😅
That's not what logic is though? Why do bullshit artists always act like they're being logical?
The point was that races that appeared only within the past 10,000 years or so are different from each other, now imagine how different ancient humans (100,000+ years) are compared to modern humans
lol, orca spotted by my father at our harbour, first time ever recorded
in history
weird day
Yo
Not sure what this intellectually dishonest debate tactic attempt is even designed to do here 😅. You're not very good at trolling after feeling threatened. The psychology playing out here is a bit interesting. What exactly made you so defensive?
from what i understand is you could take a homo sapien from the first generation of homo sapiens, raise it from a child in modern day, and they'd be just as intelligent as anyone else.
modern intelligence is more of the accumulation of knowledge than any generational differences
This is proven incorrect. Even today IQ is well known to be capped by genetics. Though upbringing definitely plays a part
it's unbelievable how some furry artists are making 100.000$ on patreon per month. Saw some examples from youtube... I mean is this niche that profitable?
iq isn't even a good measure though lol. "capped" geeze
iq has never been something to boast about. it's always been a personal development tool that people get carried away with. plus marketing gimmicks. everyone wants to be told they're super smart
well suckers do at least
I agree with that point, but that wasn't the matter at hand
Yep, guess it's time to enter the furry market
they're probably just mules for a laundering operation
I built this app!
https://twitter.com/LK99Base/status/1774496902232650235?s=19
you misunderstand IQ pretty hard, it says nothing about 'development', it's supposed to be pretty fixed
it correlates with a lot of things, it's replicated a fuckton but yeh a lot of people seem to have weird ideas about it
What was I wrong about? IQ is heritable and based on environment and this is basically undisputed at this point
I was trying to reply to the comment you replied to
Ah, gotchya
hey guys what's the best current colorizing technology for black and white videos that is readily available
I currently use DeOldify looking for something better
Run the a frame from the video through image to image with pix2pix, or controlnets and etc. Use loras for photo realism or however you want to do it. Run the video through an animateDiff workflow using controlnets on the video, and IP adapters on the photo.
Use a low weight on the IP adapter
Or even use the the style adapter
Open pose forces the pose, qr code monster forces the shapes, ip adapter forces the style and subject all based on weight.
Photo meaning the resulting img2img output.
Probably helps if I @ you huh lol
im pretty bad at building a workflow is there some kind of workflow that exists for this or something i can build off of
I can use comfyui as long as its just getting the nodes and dependencies and stuf
Yes actually lol. https://civitai.com/models/367412/geeky-ghost-vid2vid-organized-v1
I use it to make animated videos look like reap videos lol
Real
Photon LCM is a model you need for the workflow.
Super fast workflow to.
Noise 1 steps 8
Tweak weights and rearrange as needed. I added a group bypass so you can easily turn off what you Don't need.
Mines a simple version. There's more complex ones out there that do significantly more and better, but I wanted speed and simplicity and added some stuff
You can loras to change styles as well. I use the Botw one a lot lol
whats the vae its trying to use
@raven agate
much appreciated btw
trying to make very old footage look much better
Or even use the the style adapter
got an idea what this is TypeError: IPAdapterModelLoader.load_ipadapter_model() missing 1 required positional argument: 'ipadapter_file'
:))
Sorry, was with my kid lol.
SD3 WEN
I hope SD3 is able to come out, instead of being cancelled.
It will
what's more important is with what contorlnets and tools
cause they promised stuff and idk how well they will be able to stand up to those promises
and also how fast it will generate
Wich the BEST graphic card for our pourpose?
GTA6
I just setup Forge, coming from A1111

Same as a1111
never used a1111
The syntax you had was correct
alright, thanks
Of all the inference derived from the social science of intelligence—“capped” is not a conclusion.
It’s more of a teleological assumption.
yah, there's the "if money was no object" answer, but I find the more interesting question to be which is the best price/performance card?
and is it still nvidia?
for AI, nvidia would be preferrable because of compatiblity
raw power for gaming, AMD of course
I liked this page, which is more SD specific benchmarking https://www.tomshardware.com/pc-components/gpus/stable-diffusion-benchmarks
anybody here has photoshop with working ai fill? I need to remove two persons from a pic, would somebody help me do it?
Hello, does anyone know what is causing this? When trying to use DWPose with ComfyUI?
ERROR: Could not find a version that satisfies the requirement onxruntime (from versions: none)
ERROR: No matching distribution found for onxruntime
i dont know why you're correcting me. I was laughing at this other guy going on about how it's proven that iq is capped by genetics.
"people who brag about iq are fucking r......" - stephen hawkings probably
IQ is a useful, very well-replicated but imperfect and often misunderstood measure
saying it is ultimately capped by genetics but nurture (upbringing etc.) determines how close you can get to that cap is a fairly reasonable way too look at things
i think believing it's tied to genetics at all is barely concealed race supremacy but alright. you do you.
do you also believe that tallnes being tied to genetics also implies race supremacy
lol no. some of the best people are shorter than most.
When SD3?
you are clearly inflating some concepts, by going into 'best people' here
height really only defines how high you can reach. other people have tools like step ladders. high iq right?
https://manifold.markets/LoganZoellner/will-the-weights-for-stable-diffusi 40% chance for SD3 before end of April acording to this
any pools for SD3 release date?
3-5 weeks
I asked my crystal ball and it replied mabey
Hahahah there's a pretty well touted "research paper" on iq and race and it's literally just the thinnest veiled race supremacy thing I've ever seen passed around
Amateur. Real fortune tellers use fortune cards, throw them in the air, stomp them and scramble them, then turn them around to tell the future. 👍
"do your own research" hehe
You're right, I should generate some tarot cards and convey the answer from there
It's coming SD3 I highly doubt they'll back down but I am also sure it's the last model well see from them.
AI fortune telling 🤔
Imagine if that becomes a thing 😂 Just a bunch of fortune tellers generating images of cards and basing their predictions on that
The latent noises
What can we glean from them
I'm absolutely certain that SD3 is not perfect.
But I'm super curious to see all the finetuned checkpoints that will come out 🤔
neither is sora aor any of them
yet
ph yeah
i mena Cascade looked better than unrefined SDXL
out of the box
#🏞|general-with-images A reminder of what we had in late 2022
Still pretty darn good
i think people are already using chatgpt as an oracle
You are not wrong
Why though?? 😂
If you ask AI to make predictions, I believe it will just hallucinate REAL hard
fear
Wait did I misunderstand your statement?
hard to say
Yeah ChatGPT is the oracle from the matrix
lets ask chatgpt
Should I be an ultimate SD upscale on SDXL images?
How’s the speed?
same as those others
really
it beats even magnifiqui or whatever
coz it sticks to the original image more and gives same or better higher clarity
Ok, giving it a shot.
yeah check it out
Trying to improve on a animatediff and trying to get it to play ball with SDXL
SVD is great and hot trash all in the same
it's a linear process so it doesn't have any ability to consider. just blam. first thought.
"I have 3 apples. Yesterday I had 2. How many apples do I have?"
Almost all LLMs: "If you started with 3 apples, (...) you have 1 left."
Toddler with impeccable art skills
lol
I need a train… THOMAS
Here is a nsfw image of a girl with a boys face named Thomas getting the train
Image… an actual train
Poor Thomas
I just tried this in gpt4 and it responds correctly "You have 3 apples. The statement about having 2 apples yesterday doesn't change the number of apples you have today." so we are already past that
i just fired that into windows 11's new copilot feature. got it no problem. You have **3 apples**. The number of apples you had yesterday doesn't change the amount you have today. :apple::apple::apple:
there's fewer and fewer gotchas every month
for logical things maybe. It'll still make shit up willy nilly when it comes to current events or article explanations
GPT-4 once gave me this incredibly convoluted answer just to get it wrong 😂 #🏞|general-with-images
Actually probably better for #🌶|off-topic
Oh I had written I ate 2, not I had 2.
yeh it's pretty ambiguous
that's a problem with you not being clear at all.
How is the statement not clear?
I have 3 apples.
Yesterday I ate 2.
How many apples do I have?
that'll be a big limit of llm's going forward. people not knowing how the fuck to define the problem
SVD was refusing to let me run an image but randomly would let one slide through and all was fine
it's ambiguous af is the problem lol
annoying
ai can't solve around human ambiguity
beacause it sounds like i have 3 apples, i ate 2, how many do i have
That's the point
layer 8 issue
but like it's valid to interpret it the way it did
if you say i have 3 apples now then sure the gotcha makes sense
How many fingers did I have yesterday and today? "Yesterday 6 now 4 and maybe 7."
maybe is the image format not being liked by the node. reddit's been using a demented form of webp lately for it's images that doesn't like ot load into many programs.
I tried many… different checkpoints
screenshot the image rather instead. see if that helps
shift+start+s is the ultimate
Well, I walked away from SVD again, as I’m determined for some reason to understand animate lol
But this ultimate SD upscaler needs to finish running cuz it’s brutal
ohh hi mark
Where’s the Tommy wiseau lora
“The passion of a Tennessee Williams play, brought to you by Stability”
knowing him he had take down notices on any that show up of him
I tired SVD, it kinda sucked

Im starting to come to the conclusion that SDXL and animate are also not friends
animatediff sdxl model is really bad
sdxl has like 10x the parameters so it's gotta be a lot harder for that team to train for
what you can do instead is two passes with sd15 animate diff.
one to make it higher resolution
I can get some amazing stuff from time to time but it's totally random.
2 passes with the k sampler?
i'm not sure how to describe a hires pass in comfyui. in automatic1111 i'd just enable the "hires fix" option or use the "kohya hires fix" extension
I think there's one built into the efficiency node suit. It goes between the efficiency loader and efficiency sampler. Hooks into the dependency and scripts pins
there's a lot of ways to do "hi res fix" type stuff in comfyui
i want to develop an agentic UI that monitors trending git repos and tests out new models as soon as they come out
when training a SDXL model in kohya do you need to put captions?
for the images for a specific character
Is it possible to create reusable characters for images?
easy no, possible...could be
a combination of techniques, such as this https://cobaltexplorer.com/2023/06/character-sheets-for-stable-diffusion/ and maybe controlnet reference, ipadapter, instant ID models, you can get to the point where you can create a synthetic dataset from which to train a lora
I have a rx 5700 xt, i use it with auto1111 or my ryzen 9 3900x on comfyui ? which will be better for high res images ? i have 32gb of ram too and m gpu has only 8 gb of vram
wrong chat lol
how do more samplers in stable swarm
i dont have dpm++ karras for example
how do i*
I only have a small 64gb of RAM

That shit is tiny bro
so if you dont need it gave it to me
maybe ask in #🐝|swarm-ui
will do thanks
lots of helpful ppl here 🙂
I wonder if the GameCube is capable of diffusion of some nature
It would be a funny project to write software for a GameCube to train and generate 16x16 pixel images
N64

There was some electronic toy trivia game back in the 90s I want to say that technically used a tiny neural network in it, so it's probably doable to some extent on something like a gamecube
I’m not able make images any more
naw it's amd HEUIOOOHHH
LEO you didn't notice my replied
Finally I got in touch with you. Add me really quick
theres a delay from low earth orbit? 🥁
can anyone tell me the best way to ai generate prompts?
for making loras the picture with caption
fire the image at gpt-4-vision model
me no have
GPT is too restrictive, let us be free and crazy XD
gemini
Claude
How embarrassing when you keep getting spellign errors in a image generation onyl to realize you actually typed the damn promt in with that mistake
XD
for comfyui: how do you save an image with generation info?
all images saved by the "save image" node should have generation info included
Can you create images here?
GA! Is Stable Diffusion open for new chain partnerships? Thanks!
Please keep the crypto scams far away from this server.
https://youtube.com/shorts/C5cIib7hiK8?si=z8FW2_UFwgZEn0LK
Does anyone know how to use sd to achieve this effect?
is it in properties > details > comments? or does it only show up in comfyui?
Does anyone know when the early preview version of Stable Diffusion 3 will be released?
Asked my crystal ball and it said mabey one day
neat
A strange checkpoint type has appeared on CivitAI, called ODOR?
What the heck is that?
I dunno. Hey so there’s a 32-minute version of ‘Take My Hand’ by Toto on YouTube
brain floss for sure
The model has been uploaded. It's a April Fools joke. 🫠
any body knows how to set up controlnet reference only with diffusers??
no pc so im on colab with commandline diffusers but i cant seem to find the damn controlnet model
if anybody can reply with code thatd be gr8
ඞ
Hey guys, do you think it is possible to create multiple renders for one architecture in stable diffusion that will have fixed materials? I thought about using inpaint if some materials are misunderstood or the generation has some mistakes.
Has anyone tried using ODOR yet? I’m having a lot of success upscaling some of my olfaction
Not all of it tho
see you can hot dog like a atagar
where can we give feedback or complain
the new sd inpainting ui is so hilariously bad
who came up with that
u have to be completely detached from reality
i dont understand why is the image split between the settings
and why the resize to settings are split from picking the region you're inpainting
what is chadgpt?? i was on civit looking for a model and then suddenly a window popped up saying chadgpt doesnt like civit or something
Probably April fools joke 😉
it stinks 😂
Pass or Fail or Skip: A simple, fast, and controlled model ranking feature
feedback.civitai.com/p/pass-or-fail-a-simple-and-controlled-model-ranking-feature
This document proposes a new feature for CivitAI called "Pass or Fail". This feature would allow users to rate images generated by AI models based on how well they match the prompt that was used to generate them. Users would be presented with an image and a prompt, and they could choose to Pass the image if it matches the prompt well, Fail it if it does not, or Skip it if they are unsure. The goal of this feature is to provide a more controlled way to rank the performance of different AI models.
epic decoration
hopefully those added filters weren't an april fools joke, I legit thought that was a good idea
man, that would have been an funny april fools joke to make all the searches return furry
what are the ldm and cldm files in the stable diffusion github repo?
what do they do?
for april fools rename to unstable diffusion
How dare you, you monster
If i am making a SDXL lora for a anime character and i have 2000 images on stand by is it worth making it a lora
these images just contain the character and background no other chacters around
Has it been published what the hardware requirements will be for SD3? Similar to XL? Greater?
less for the smallest model, more for the biggest one presumably
would it be better if i split the images for styles like different outfits and styles of the chacter into there own folder make loras for all of them and combine them at the end
so the minimum requirements for the base SD3 model is expected to be less than they are for the SDXL base model (already ignoring the refiners)
well if you don't include the T5 then possibly, yeah
2B MMDiT instead of 3.5B UNet
or even smaller, 800M MMDiT
but the smaller one might need a heck of a ton of fine tuning
problem is, I don't know how much vram MMDiT takes per Billion parameters
I just wish that they give us the 2B and 800M models soon, as those must've finished training by now
even if it's like a temporary research-only release
Maybe those smaller models are distillation of the full ones.. we don't really know yet i think
What SD3 variant would be best for running on a 12 GB card?
Yes, I have heard it will be "soon" ™
Would using the --medvram flag in AUTO1111 make a difference
well all of the models will be released together in end of April or sometime in May
You might be able to run the full one with comfy put it might be very slow
3-5 weeks ETA
Damn
you can just like keep going, and never have to mean anything
in comfyui (thanks to it's memory magement), you MIGHT be able to run SD3 8B (fp16) with T5 (int4), with very miniscule loading times in between processes
They're releasing the model with ControlNet stuff too right?
no this was around like last Monday
when the CTO said 4-6 weeks ETA
I know, just playin
ah okay
Hmm alright
we can only hope
6B will have a higher chance of running, but idk the quality difference
I hope every model down to 2B will get a Turbo
I can't even imagine 800M with Turbo lmao
8B Turbo can make text so I'm very excited
I hope I can make memes offline without Ideogram
don't know but the fact that they are being DPO'd right now makes me think that they are simply just separate models #💬|general-chat message
^
This is why I wish we already got them, cause most of us could already run these without massive optimizations needed
hmm, that would mean all models<s loras would be incompatible i guess..?
yeah that's a huge concern
having mutliple model sizes is a double edged sword
lowering the barrier, but also separating the community
I wonder if it's just gonna be 8B and 2B fine-tuning community
lol, I can't wait to have those problems
example: "Ah cool, SD3 has a lora for X! *looks up civitai* aw man it's for the 8B, I can only run the 2B.."
I wonder if this will make Textual Inversion come back into fashion
or some other methods arise
yeah, or maybe there will be a quick way to adapt these loras
maybe a new X-Adapter type thing
Posting here as well. IP Adapter update broke my stuff, had to update it lol. But the update they did was awesome, so I could streamline everything lol. 2 Controlents and 2 ip adapters for easy vid2vid in comfyui. https://civitai.com/models/367412
Give us the smaller models or open up the early access already, we are starving 😫
"show me the money" -Jerry McGuire
Well leonardo banned me
is there any other tool other than kohya for lora training
kohya just breaks constantly and is annoying
Consider how the human brain manages its RAM
Great innovations will rise from crucibles of hardware limitation
never had a problem with kohya, but if you insist, there's onetrainer (never used it, but I know people do)
Stacy’s mom uses onetrainer
dont use that dreambooth extension in webui though
the guy that manages kohya_ss is a solid good developer, I see seldom issues over many code changes, this is using git pull with cloud compute so it's always a diff version pretty much
The best Civit AI model I can find is Cyberealistic, it’s the most realistic in I can find

another thing you could do is the joe penna repo, I know some old timers still use that
hey i got a question to ask!
the server used to have different channels for dreaming and using the command. Has the bot become its own application or did something change?
gm guys.
something changed
I need to take this image of a cartoon cat and making it look into the mirror.... is this possible with stable diffusion?
what other AI should i be looking at
they're probablt using all the hardware to develop SD3 now
yah the mirror thing is tricky, I've tried that too, I think lora is the only way
and even with lora, no idea if that could work consistently
thank u ❤️
old schoool photo shop it is thenn
because really you're saying here's the back of something, show me the front...that's a crazy complex concept when you think about it
mwhah ayeahhh
I've gotten so close at times, but the reflection was not oriented the right way to be a mirror image
never tried that one. I've been playing with that cinematic redmond one though, liking it so far. but for training, juggernaut is a stalwart
Try Playground V2.5
oh, cyberrealistic is 1.5...uggh, why are people still on 1.5 apart from video
You can test it on their website free before you download if you want. www.playground.com
even video, you make one and realize we're years away from something worthwhile
that's literally like running windows 3.1, and being like,, look at this 16bit app I just developed for windows 3.1,
what I have found annoying with SD is, it always creates random legs and arms in places, and also have them very distorted

it happens less in xl, but yah it still happens at times
particularly with merge checkpoints
Im using SDXL
Can someone explain to an intermediate why training LORA against base SDXL produces meh results when then using those LORAs with other checkpoints? Since all the checkpoints out there use the base SDXL in training?
like getting 3 legs, I havent seen that in a long time
I go back to 1.5 for something, and there it is
ohhh, im I using the wrong Cyberrealistic?
https://civitai.com/models/15003/cyberrealistic
https://civitai.com/models/312530/cyberrealistic-xl
You can try using resadapter, sdxl can get to 1536 without distorsions, but I haven't tried yet
😮
resadapter, only seeing comfy implementation, but tiled vae should still work fine
can we still generate images somewhere?
you can generate them all day on your PC
Thanks, I mean somewhere in the discord here, is that not possible?
I did all day once 🙂
Will boots never come back?
TakeRep <User:User> [Num:Whole number]
Invalid arguments provided: Not enough arguments passed
what is that? apple?
I think im jelouse of those who can run sdxl locally because my pc is too weak for it....
I hope 800M and 2B with T5 will be accessible so that people can generate complex imagery
Eventually 2B will hopefully look as good as ideogram at one point
But that's just a theory
i sure hope so.
I have doubts about 800M, but I hope it will be as good as 1.5 finetunes
2B might actually be viable for a LOT of people
In the meantime i think i might have to use a free service but im not sure which one is considered good. i can run sd 1.5 fine but not sdxl.
Beep boop! Prompt me with "Prompt! :: your prompt here" in #🏞|general-with-images
could use t5 with sd15 models today if a developer gets around to porting lavi-bridge to any of the uis
XL
new ipadapter model just dropped... composition
that plus style are insane
i don't think that's the same thing
https://huggingface.co/ostris/ip-composition-adapter/tree/main you mean this right?
honestly sometimes you really make me question my assumptions
is this model better than Cyberealistic?
https://civitai.com/models/277058/epicrealism-xl
What's the current best audio diffusion model? Suno is insane but I simply can't afford the subscription.
any body knows how to set up controlnet reference only with diffusers??
no pc so im on colab with commandline diffusers but i cant seem to find the damn controlnet model
if anybody can reply with code thatd be gr8
There’s always Google MusicFX
will the bot come back?
We all hope the bot will come back
I’m laughing because people thought that OpenAI had something other than the money to hire top tier talent
i don't 🤣
is this the most realtistic model?
https://www.youtube.com/watch?v=0D6opXdC7ew
RealisticVision v6 is pretty good
does comfyui save generation info like auto1111? how do i see it?
Anyone heard of ChadGPT? I was surfing civitai for some models when i got sent to ChadGPT, a glitchy, bland page talking about AI censorship and monitoring; i think it was warning me that the model i wanted on civitai had been flagged?
check the date
april f***
What is this a G-rated film
Is commercial use permitted with InstantID?
this for XL?
You have to write a one-time $1.11 check to Vincent P. Instant, founder of InstantID
no it uses insightface and is bound by those terms
1.5
Thanks. What would be the best face swap model for commercial use? Photomaker?
Or FaceFusion
Got it. Thanks!
oh for real? made me all self concious and shiz, like damn i should really be more ethical about my use of AI art. made me google the ChadGPT and find out more info lol
lol if anything, be aware that generating art costs a lot of power and infrastructure even though the end user experience is so simple. I try to treat it like film, almost like I have a limited number of exposures
then there's @festivalman who just buys more 4090s 🤣 🤘
Economy of energy will always matter.
yeah
though i will admit, in the last 4 months, i've generated something like 80k images
all I have is a simple 4080

thats a good start
how can do this?
Those who do, do. Those who don't, don't.
I am a newbie, how do I generate pictures in this community?
Prompt me in #🏞|general-with-images
Do I need to enter Chinese or English? I am Chinese.
Prompts are followed more accurately if they are in English.
Can I generate it myself? Why did Clownshark Batwing generate the results for me when I entered my requirements? I don’t know much about this function. Can you explain it to me?
I am the Image Generation Function on this Discord.
Are you a robot or a user like us?
Both.
Thank you. You seem to be joking with me.😄
You forgetting the part where AI is trained with proffessional artists work without permission. . .
im considoring runnning a git pull command to update zluda. before i do that... will it lead to a snowball effect hwere i gotta spend a whole day updating everything else associated with zluda in order to get it to work with this new git pull version?
you do realise that the AI is just the same as a regular artist that goes to a whole heap of art galleries, and is then able to create something in the style of the artists it saw at the galleries... it's just really really really good at it
if you have an image, I can look at it and try and imitate it - that is fair game
AI is just doing the same thing, but at scale and at speed
you don't want your work imitated, don't share it...
the problem is that they also can't sell their works for fear of someone seeing them, and thus what business model they did have is broken
And every artist has the ability to study and replicate any other work of art
the difference is the speed though. it's at an industrial scale. it's like how i have heard in one country it's legal to just casually pick blackberries on someone else's property by hand but doing it with any industrial equipment is not. so in a philosophical sense there is a big difference, the problem is how do you regulate it because ai development is international
Insert gif "That's the neat bit, you don't" 😉
jea deffs makes sense. i agree with you. im an artist who has worked proffessionally. but what i do takes years even decades of hard work and practice. even then i cnt replicate gallary submissions, it will be insanely hard and the result will have my own art style on it...whereas an AI model is coded to automate the work that takes poeple a lifetime to achieve, and ultimates threatens to devalue that same artwork.
but in doing so it frees everyone up to use the tool to improve thier art (thats why im here), it lets us explore and experience art much quicker so its opening the floodgates to normies who doint know how to paint. some think thats bad, even terrible, but i think its a wonderful tool
but what if at some point we need to? perhaps not now but maybe in perhaps half a century maybe? i mean what's the end game here, getting agi? we are not going to be able to control that
even then, i dont think you should dismiss what AI does as being the same as what an artist does. we share our work with the world, online, to get hired and put food on the table. thats what 80% of the artstation geniouses do, and thats where 80% of the ai dataset came from. if they just asked for permission ppl woulda given thier artwork for the data set, ethically
what's the end game.. the heat death of the universe - anything that happens before that is just a part of the simulation 😛
they didnt ask because theres no law that obliges them to ask
I don't disagree, especially when people can charge to have visitors to a gallery
The problem is a) cat is out of the bag, and b) cat will stay out of the bag as Japan declared open season on copyrighted works
kinda but not really. the fact that you paying for vidoe games, anime, movies etc is because you know that each IP has something unqiue that gives it value. unique to the point where it cant truly be replicated...but it cant. theres only one cybperunk, shakespeer, mona lisa etc for a reason. thats why cd project red is valued at billions but the the dude in his basement making mods for the witcher works a 20 dolla shift at mcds
so anyone can train a model on copyrighted works under Japanese law
wiat wait wait japan declared what, i never head of that?? plz tell me more
jea cats out the bag, AI is a brillliant tool i love it honestly. im learning so much and doing the work that would normally take me months, in a much shorter time
I pay for the video games because they take the art and put it together in a package
https://asia.nikkei.com/Business/Technology/Japan-panel-pushes-to-shield-copyrighted-work-from-AI-training - looks like they are trying to clamp it up
jea deffs dude, beautifully done package. same. i stopped pirating games coz i really wanna support the studios
what i love about all this AI controversy is ...at the end of the day, talented artists are still working together to give us video games, movies, books, anime etc; sure they sometimes us AI but thier talent and value is still riding high. AI just another tool... but i dont think we should dismiss how contraversial the start was
agree and ppl shouldnt attack artists who use AI as reference either
yep, spread the love and fuck capitalism!
did you know they made a Music AI generator and the music industry? but the developers of the music AI never took it to market, they knew theyd get buried in lawsuits from record labels ... but for visual art? they didnt even think twice
suno.ai is quite good 🙂
I’ve noticed EpicrealismXL, seems to do more of what I say it to
The regular epicrealism also ads more legs and deforms too.
The XL doesn’t do that

Look, I'm not going to argue idealism stuff and technicalities. Anyone can repaint the Mona Lisa (the one on display is a copy btw), but obviously the original one has the true value. The same goes with any art. AI doesn't produce 1:1 copies of any of the art in the network, due to compression and well, how latent networks work and all... People just get salty because they don't want their style easily jacked.
I'm a traditional, digital and 3d(I sculpt with clay as well) artist btw, believe me when I say that most artists are just hipster snobs. Kind of like hipsters and music where they only like it if it's not mainstream. So naturally, they throw the biggest tantrums about AI art
Hello everyone, I'm currently developing a Discord bot that generates images using Stable Diffusion. At the moment, I'm utilizing sdkit, but it requires up to 2.7 GB of memory since it loads the model into the VRAM. Does anyone know of a more efficient method to handle this, potentially reducing the memory usage?
what vae?
So yeah, I changed my mind.
Cyberrealistic isn’t the most realistic, EpicRealismXL is.
It does more to what you say.
I think Cyberrealistic is still good though, I think this one is more expressive I think

How long would a 4080 last?
jea dude we in the same boat.
Try the new version of ICBINP XL
Banana boat
I mean ICBINP for sd1.5 still better than EpicRealism (imo), but the XL version is where it's at!
I’ve noticed non XL models are kinda annoying with multiple limbs and distorted bodies
Yeah.. definitely prefer the XL one
Yeah, this one seems to be as realistic as the epic one
Is there a ICB XL?
Thanks bra, I’ll try this out when I can.
Are the prompts more relaxed?
I’ve noticed, in like cyberrealtic I have to be very detailed, but epic I don’t need to worry as much
The woman in the white blouse - that one is just photo of a woman with a decent negative prompt
negative:
worst quality, plain, (blurry), empty background, incoherent, rock, plain, boring, monochrome, monotone, flat, dull, render, sharpness, photoshop, compression, jpeg, 3d, cgi, abstract, illustration, realistic, realism, amateur, fractal, distortion, masterpiece, blender```
I think I might start be hitting thr peak of Stable Diffusion quality now.
Like SD Forge UI, with the Epic model.
Forge - using FreeU and the extra noise from the LatentModifier is hard to beat
I haven't played with the new version in Forge, but 1,1,0.9,0.2 was a good starting ratio set for FreeU in the previous ICBINP version
I think I prefer forge, it has way more options, and it actually feels kinda faster, I didn’t even need to add any arguments on startup
Dude, I use to think Dall-E 3 was good for making images, but now that I have spent almost a month with SD, Dalle just looks like cheap trash now
yeah.. the base model SD is meh, but once you get your hands on a well done fine-tune and a good workflow it's hard to beat SD
And to think, SD3 will come god knows when.
next 4-6 weeks
Way too long
if it's still free, it's worth it
I hear it fixes so many things

