#🍥|anime
1 messages · Page 173 of 1
no way of getting that merged
yeay i know
and generally, anime sdxl models are... not really well trained
as in, not leveraging the potential sdxl has
is that even possible to acheive that in sdxl or do you feel stability ai should release something else beyond sdxl 1.0?
i've been playing a lot with pixart as of late, and while the base model is exceptionally poor at anime, it's comprehensional levels are off the chart
i hope you keep enough time for sdxl too lol
this one done in pixart
i mean, the image quality is not great, that's something that's easy to see
nice but also its more towards painterly effect
but the image composition -> good luck getting that on sd1.5 or sdxdl
i've tried making such images in both, but can't get that "joy of life" feeling
sd hasn't been able to achieve midjourney like oil painting
something like this impossible with sd
they can get close but not quite
pixart is awesome tho, i love my crappy birds
the best i've seen ppl do with sd for oil painting effect is smudge some areas, but overrall image composition is not right
and it does painterly REALLY good
and i mean, REALLY good -> this is just the base model
this looks brilliant
yeah, the compositions the model makes are out of this world
just grabbed a random non-character prompt from civitai, a single generation, and stuff like this just comes rolling out
i wonder how accessible that will be for general users
if you have 20+ gb ram & 6gb vram you can run the model on comfyui
bit heavny on ram
hey, be glad it can be offloaded!
but i should upgrade my 16gb to 32 anyway
the OG spec had the t5 encoder in vram too
im glad its not eating up vram too much
t5 encoder is what pixart uses to tokenize your prompt, sd1.5/sdxl use clip
I think for what it is, bpn does pretty good 🤷♂️
ah tis Clara, she survived the merger
actually, i was working on that prompt in bpn
and threw the prompt in pixart, and it... just was better
daaaamn she aiming for the guinness record. Look at all them fingers
with sdxl there is lot less of those anomaly
i guess it depends on what you're after, i didnt try to match the style but the 'feeling' i get from the image is similar
she took it back a bit far, overcorrecting
but be honest... her face looks so good
I'm not gonna complain to her about her new longfingers robot hands
yes I worked awhile to get that face
consistent through all my models now
i know
image quality worse, composition/color -> it's not a contest
thats the point im making
you can pretty much look at sdxl and sd1.5 faces and can tell which is which
5 fingers again...malformed, a little large....maybe she....a ladyboy?
sdxl doesn't do it better as such, just different
there are styles SD 1.5 is better at and those SDXL surpass it by far at
sdxl has distinctly different texture , but what i dont like is the gloss
you dont see it?
well I just don't understand how gloss is on an entire image
I dont know what you mean
Im probably seeing it without knowing what I'm looking at
there is clear metalic or plastic feel to it but some recent models have improvved
oh, well that's cause I put 2.5d in the prompt
I think it's a matter of expectation. the models themselves are not fundamentally different in capabilities, it's the training and merging that's been done, it's no different than changing checkpoints and expecting it to be the same. You also have to change your prompts and weights to fit the model
time to test this merge-not-yet-merge on fallen angels
if you have tried bpn you wont see it, and some other recent models that ive tried fixed that too
mind you that's in last 1 mo or so
My models tend to stand apart in versatility, but theyre not newbie friendly
I was struggling for a bit moving to SDXL because i had a "perfect" 1.5 model that I spent months on, but once I learned to prompt better for the models I was merging and training on, the quality keeps improving without changing the model i'm using
@nova remnant i know what you mean, it's really noticable in the early sdxl models
im hopeful sdxl can get lot better
heck .. do you see when you can render sdxl images with 512x768 WITHOUT weird glitches that becomes accessble for low power pcs too
and the traiing mechanism can be much better to enocurage devs to get involved
different variants of Clara
Best anime tbh
tbh, the more i try to replicate what other people post, the more i'm impressed with this sdxl model
the prompting style is completely different from what i'm used to with 1.5 but the more i figure it out the better it gets
nice cinematic vibe
it gets boring to watch a character just stand lol
need more poses 🙂
dynamic pose adds some interesting pose
sure but right now I'm checking to see how this merge recipe does
far from done but sure. Right now my old model from April is up on civit
trying...but civit is so slow
dark gemini?
no, this is another one
that one is very versatile and can do just about anything (DG3), this one I'm only taking blocks towards building a cyberpunk/magepunk style
like a world where magic is used to power technology and they live in a cyberpunkish dystopia
ok cool, gemini is what i will get since its got more broad use case
not a big fan of cyberpunk style
sure, it has some learning curve but people who reviewed it did just fine with it so
I guess it's good
arent you going to prune that model
aww she almost had it
there's no point to prune fp16
it ends up with same size
its on the file menu on the side
its 5+gb
isn't this the right link https://civitai.com/models/6209/dark-gemini-v3
DARK GEMINI Version 3.0 (I recommend using Waifu Diffusion 1.4 VAE kl-anime 2.) ( Not far down you'll find information regarding multidiffusion ups...
she doesnt look ok
I told you, the file menu
there's an fp16 file there
only way there isn't, is if you're missing it
her resulting trauma has made her put her hair in a suspicious way
oh yeah i see it now
but even the smallest size is 3+ gb lol
anyway i'll try it out
yeah, I can't get it any smaller
though I havent tried since April but Im not about to try
your model cover images look good
thanks
my problem is that I don't feel there is much I can do to improve the model further, all I could do are LORAs but I just get annoyed trying to make it work on 8gb
so meh
brb
gonna configure quest 3 in a bit
I got them to give me 20 dollar off on the quest 3 and 70 dollar off on the elite strap
almost getting it for free 😛 the strap
otherwise I wouldnt have bought it
https://store.steampowered.com/app/479010/Kodon/ cool, vr sculpting
https://civitai.com/models/226443 pixart fine tuned model, pretty sick imagery
Fascinatio.Redmond is here! I'm grateful for the GPU time from Redmond.AI that allowed me to finish this model! This is a generalist model fine-tun...
its 1.15gb
@vital raptor you were able to run pixart on a1111 before?
it escalated
I have not tried Pixart yet
someone put a fine tuned model up on civitai
and its only 1.15gb
and i checked the hugging fac page, but looks like they dont have vae for it yet
to run on a1111
but soon to come i hope
No special tricks on that one just a fairly simple prompt. The model is Soushiki which you can find on Civitai. You can put that image into PNG info to get my prompt and all of my generation parameters.
Excuse me, where did I arrive at?
welcome to the other side of the black hole!
Well time to open another portal to home
turn back! the environment isn't friendly
Looks like a Class Y to me, very unsuitable for life of any kind. 😄
Top left is 👌🏻
It's been a while since I've added an image to my hall of fame folder
thanks 🙂
If you have a civitai account, you should feature that one in your image showcase
sure, i will upload the entire set
needs more goggles to innovate
did you know they are working on age reversal technique that is getting closer to reality? by 2030 this will have likely chance of going mainstream
@untold glacier this model looks good from the author of bluepencil and more anime focused https://civitai.com/models/221379/kawaiipencil-xl?modelVersionId=249687
Kawaii is the best. This model can generate cute girls. Need anything else?
But this is the cover art?? Lol.
I'm getting
vibes
blue pencil already makes cute girls, what's the difference?
with metadata
this one is more tuned for anime i think
blue pencil is bit more versatile
Hmm. I'll have to try it out. I need to test the latest Hiwamari and Anime Illust anyway
i tried the anime illustration one if i have it right, the tone was bit washed out unless im confusing it
hmm i like this kawaii pencil
very good form and image tone
btw im generating images at 512x768 then upscaling by 2x with hires fix.. and yes thats with SDXL
i wonder why some of the authors are releasing lcm specific model
you can pretty much run lcm lora with every models
at least that's what i do
Is there a way to use prompts to pose characters in SDXL? I can just do basic stuff but not drastic side poses for example.
describe it in spoken language
some poses require controlnet
Maybe I need to be more detailed is what I want out of it
detail lora for sdxl https://civitai.com/models/122359/detail-tweaker-xl
Detail tweaker for SDXL. Works with weights [-3, 3] Use positive weight to increase details and negative weight to reduce details. Good weight depe...
hmm guess since it's a anime model it's wanting to put in a anime character
like this
@vital raptor you may want to take a look at https://civitai.com/models/229002/icbinp-xl?modelVersionId=258447
The long awaited followup to ICBINP, this model is merged to try and get the most realistic images possible. It's the ICBINP you all know and love,...
Yeah, the naming is a problem. The gap between what it's purported to do what what it actually does is quite large.
There are a ton of SDXL controlnet models on the download page. How do I know what to get?
each models are specific to what they do
have a look at what those models do https://stable-diffusion-art.com/controlnet/
I think I have new favorites when it comes to creativity and pristine quality when it comes to SDXL anime models. This is good stuff.
Himawarimix V2. Fantastic quality of gens. Very sharp and precise.
KawaiiPencil. Really great aesthetic and creativity with this one. Great at following the prompt too.
btw this TI makes a decent amount of difference for anime https://civitai.com/models/119032?modelVersionId=245812
いろいろあるので好みのやつを使ってください Of course, since there are various options, please use your preferred one. 強弱や組み合わせもありです It can be strong or weak or a combin...
Yeah, the relatively new hk1 is fantastic.
A needed part of everyone's workflow.
yeah i like it, been using it quite a lot
I try not to use any negatives
For proper anime style, it's often needed with many models to get that extra mile.
dont think use of negative prompts with SDXL is completely meant to be excluded, just that they vary a lot from sd1.5 practice
it's not required, but I prefer to not use them if I don't need too
the more I can get a model to do without the negatives the better
you can have a visible diffence from using them
something positive prompting alone cant achieve
yeah, but the same can be said about just changing your prompts, everything changed has an affect
and if you don't need it, then why bother
positive and negative have different influence on the output
This is a good way of thinking for prompting in general. The more you can do with the least amount of tokens, the less potential there is for something to go wrong.
you do need it depending what you want to settle with
well then I guess I haven't needed it because i'm satisfied with my results 🤷♂️
yeah its taste dependant not meant to be generally excluded
I have tried them, I just found I didn't need them
i have tried them too they are nice
with distinct visible difference 🙂
you can run a plot xyz on them to see the changes
its just a negative ti it cant change style specially in XL
was just doing that actually
This is incorrect.
not just style, but the composition of the image
oh yea ? post 2 images with different style here
you can test it locally to see the changes
Textural inversions are just another form of network extension. Just because they aren't as powerful as something like a hypernetwork or a LoRA, doesn't mean they don't affect the output.
sorry i dont use XL i dont like downloading 6gb checkpoints just to change style like in early 1.5 days when you had 4gb ckpts each with a different artist
sdxl models are far more versatile than sd1.5 btw
not with style i like fuzichoco , ramdayo styles
yeah thats the thing, they are not strictly narrowed down to a set style
unless you train them in special cases
😔
Versatility doesn't equal style reproduction, lol. No one said it did.
super basic prompt, feel free to suggest a different one and i can run it again
far right image is no negatives
i dont care about versatility either i just like different styles without changing 6gb checkpoints every time
you really dont see the subtle difference in them?
@iron star I don't change checkpoints
thats cool
I didn't say I can't see a difference, I said I don't care about them
what is "wrong" with the far right image that needs correcting that these negatives corrected?
exactly, it's just taste
i also dont like anime checkpoints that cant prompt anime characters that i like,since theres no loras for XL i really dont use it
doesnt matter its your choice not to use them and some prefer to use them, there is obvious difference that matters to some
what im saying is negatives are not meant to be generally excluded
Allow me to help you. Base gen, and then with sky. Take a look at the differences between the two.
all i said was that i try not to use negatives, i didnt say everyone else had too
There's a clear difference in end quality.
i like the one with teeth
yeah same thing i was saying..
There's less blur and more detail in that one. (Edit: Blur not blue)
But granted, this isn't showing a style change.
you can see the difference easily with a portrait render
since we talkin about styles take this chinese style miku 💅
but i have to agree with sdxl its not necessary to use rigorous negative prompts like 1.5
the real problem is when the TI starts restricting or preventing something you do want in the image and you fight with it for hours before realizing it was the embedding
That's user error though, not a fault of the network extension.
and I'm not saying it does that exactly, but I just have a problem using something that i don't know what is in it
yeah, that applies to sd1.5 models as well and that's important
if I'm trying to generate something specific, I like to know what's affecting it
maybe i'll add one after to try and improve it, but i don't use it by default
💅
soon 
sure that's a good practice to have good control over the intended image, but aside that when you know what you want to achieve or prefer with a partcular TI then its fine to use it, and they do make a difference in a good way knowing you apply a ti properly
Indeed. It is hard to see how what you're changing affects the end result. That's the biggest downside to AI image generation. Things like CFG, prompting, network extensions, steps, and samplers are all amorphous ideas until one spends time with them and tests them thoroughly. Only then can one understand how they affect the end result.
It's like seeing storms and understanding that what makes them is a combination of pressure, wind, water, and heat/cold, and how they all interact. It's quite difficult, and very easy to get wrong.
btw @vital raptor im not saying using bunch of TI w/o knowing what they do is a good practice incase you got me wrong
yeah, that's why I try to replicate some things people post in here, it gives me something to try and achieve with my model and my oiwn prompting to see if I can make something similar. Practice with the tools is worth it
I think we were just arguing the same thing, use them if you want them, they can help certain things, but they're not required
Yep. This is one of the best ways to learn the ins and outs, replicating images by prompting/changing settings. Taking apart others' prompts. Testing and verification--always with more than one seed and usually at least 4, etc.
the wording "they are not required" is not justifying the use case
that said, I have seen models that almost require a negative TI almost always just to get a good render, but I consider that a bad model
you might wanna claim whole SD is a bad project on itself
just for allowing the use of negative prompts
This
except that I'm not saying it's bad to use negative prompts
We're going in circles here torqx. I think we all understand that.
yes
idk what happened here tbh
the thing i was trying to rectify is that you first claimed they make no visible diference as well as claiming they are plain not required 🙂
I never claimed they make no visible difference, that's where you're misunderstanding something
Also, they are NOT required. Python is REQUIRED, negatives are not
...........
what?
that doesn't saying anything about the visible differences and I posted an image that shows the differences
I don't see the problem
then why say negative are not required lol
What I said was that I don't need to change my images
because they're NOT required
they're optional and up to the person
how do you define require and not required... something by choice or something that makes applicable differnce ?
ok thats the thing i was saying too, taste dependent
when you say not required that implies that has no effect
no, it implies I can make the choice
so call it preferential
because the effect may not be what i want
because it is required to those who want it
semantics
Goodness torqx. Is this really worth the argument? Let's make some waifus, and leave the bickering alone, shall we?
I saw your uploads in Civitai
Nice stuff
lol im not arguing just clearing the confusion
Love the softness to the images.
Are you prompting something special for that or is it just the model?
just regular prompts ... minimal negative too lol
What's your CFG? Is it pretty low?
the last few images were experiemental with bit anatomical negative but they werent necessary with sdxl
yeah 2
Ah, that explains it. Great cohesion for that low of a CFG though.
generating all of the images with LCM LoRA
take em all
I guess the model is a bit light and soft overall.
most of them have cfg 2 except those that i posted before lcm thingy came out
This one?
yep
Ah. Didn't realize it was there. I'm viewing the image on civitai in a squished window, so it thinks that it's on mobile--not showing me the deets by default. Cool beans.
lol yeah im on pc so might have been different gui on phone
Really wish I could get LCM to work with anything I try, really.
Yeah. You and Eface using it up a storm and things looking great, but I've tried it many times with many models and even SD implementations (Comfy and Auto) and it never works like I want it to.
same, I haven't really had any models it doesnt work with
exactly
some models don't looks as good, some look better, but they all seem to function
you should be able to get it working, there are two versions sd1.5 and sdxl .. and you should use LCM sampler with it, the lora weight also matters inbetween 0.6-0.8 should be ok
I define function as a good end result, heh. In this instance, I never got that.
fair enough
Not 1?
1 could be bit high
Aaarrgh
and if you were using other samplers instead of LCM then 1 is fried
I thought that was what it was supposed to be at
Nah, I was using LCM sampler.
ahh ok
so you used LCM sampler, sampling steps at 8, cfg 2 and you didnt get decent output?
Yes, but if the LoRA wasn't supposed to be at a 1 weight, then that was probably my issue.
try with 0.6 weight
First I gotta get this gen right
oooh epic
1 weight works with my model, just a little more contrast, most of mine are 0.8 this one is 0.9
you got this uncanny cool touch with most of your images
it's like anything else, tweaking to fit your model and wants
Lol. Thanks?
lol yeah didnt want to flatter you too much but i saw your images
I hear ya, but that only works if you understand what you're working with. LCM is still new to me.
Oh the civitai ones?
yeah
A lot of those are rather simple prompts, surprisingly. Some just testing out artist name recognition with BPN, haha.
i was wondering to myself how you came up with the concepts cause those are not directly relatable with text prompts
BPN is a good model to test LCM with btw, it handles it really well
It's the cohesion. It's top of the line, model-wise. That's what LCM lacks the most.
I've been told by Eface that the best way to use it is with FreeU, to get what you lose with LCM back.
btw @untold glacier LCM possibly adds a bit more cohesion to the render
I find this hard to believe, based on what I've heard of the testing of others, and what I've seen.
this true without freeu i get 💩 results
i can't give technical evidence but i was surprised to see how a lot of the images on sd1.5 came out surprisingly well after using LCM LoRA
i see many of devs are recently uploading versions of their models with LCM implemented
i find that strange cause ive run almost all of the sd 1.5 models in my collection with lcm lora with great results
now only running sdxl
take a look at dreamshaper, they have a model up just for lcm
and some other devs are doing that too
blue pencil did it too
im sure they may have technical reasons but as a user all i had to do was apply the lora
I think it's a performance thing, at least for the first gen it's faster if it doesn't have to load a lora
could be
but even then for initial speed boost thats kinda redundant to download a 6 gig file
I've tried to merge it with my model just for less hassle, but my model won't merge with some things for some reason
but yeah, downloading another full model doesn't make sense
yeah cause lcm lora is lot smaller in size
Not at all what I asked for but I like it.
came out beautiful
No fat people allowed on that plane. You see the size of that seat?
lol im more absorbed by the scenic view
and her sitting there looks very high tech and simple
FINALLY
she is sitting outside on the wing?
Issue with the non straight wing, but still, it's what I wanted.
"There's something on the wing!"
this one is pleasing to the eyes
and mind
IKR? I'd fly on that plane in an instant
but watching her sitting on the wing triggers anxiety
lol you should work on the first image
altho its perfect as it is
how the heck did you even prompt it, are you using controlnet? 🙂
heheheheheheh
no
It took 30 minutes to prompt correctly.
wow damn
These days, I usually don't spend so much time on one thing
POW being a thing of the past and all...
well this is impressive
But that's the plane she takes to school every morning.
my adetailer extension is gltiching out
FINALLY GOT LCM WORKING! KOHYA DEEPSHRINK AS WELL!!
FEAST THINE EYES ON THIS 3456x3456 IMAGE
i remember that film lol
🤣
They aren't broken for me? 🤷🏻♂️
Dunno. Have you tried restarting Comfy?
Now if only I had access to multidiffusion, these tiling artifacts would be a thing of the past
Concatenated:
flat colors. a stunning cinematic 2d artwork. high contrast. black and white. hazy. chromatic aberration. surreal. dark and shadow
simple anime artwork. top down view of a girl with a beautiful face lying on her back with her arms spread on a cold stone floor. view from directly above. (album art)
Neg:
(photo. photograph:1.3). 3d. deformed. bad quality. amateur. clone. repeating
But it's a multi step process: the later upscale prompt is changed from "simple" to "detailed"
I also add the neg embed in at the upscale.
Wait...
That's after I changed it. No "even bangs".
Don't use a VAE? Lol
Try super low CFGs.
<2
"Edging"???

Less VRAM usage.
because your computer is a potato? 😄
I use it myself, and turn off the hardware acceleration to ensure it doesn't try and use my GPU. If you aren't doing that, you should.
I used edge once... to download chrome
In fact, for any program that you have open that could use it, like Spotify, you should turn off hardware acceleration.
discord can use a bit of vram as well, depending on what channels you have showing
Oh yeah, definitely that one. That's a big one.
Enjoy this 4096x4096 image.
chatting in this channel while watching a youtube video at 1440p resolution uses about ~1.5gb of vram for me
That's horrifying.
but I don't usually use more than about 20-22GB of vram for SD, so it doesn't bother me
I also spent an absurd amount of money on my computer
yeah
got it a couple months ago to replace my 3080TI
1024x1024 using hires fix x2 and LCM is about ~8-9 seconds
1024x1024 on sdxl with no upscaling or lcm is about 5 seconds, 2.2s with LCM
when I have it working I can use TensorRT and LCM at the same time which makes a base 1024x1024 gen about 1-1.5s
TensorRT should work on a 3070
a
But that's a pain to set up
yeah, it's a pain
i just use xformers
yes
It does. Not gonna be a ton for 3070, but you'll see an increase for sure.
I get almost twice the speed
That's because you have a 4090 lol
but i have the vram for that
with wut
I don't think a lower card could do TRT and LCM at the same time, it takes a lot of vram
ohh
I don't know. I've only ever used it on Auto
tensorrt can make it faster but more vram?
From memory, it's not too much--it's just an added unet on top of everything else, which is under 400 mb I believe.
cus when i use high res 4x it still on like a 512 base i dont even use half my vram
so im willing to trade vram for speed
rtx 6000 ada
i use xformers and pretty unscientific
but ive seen 20 it/s
you should be getting well over 35it/s with that card
yea i dont do much sd
more llm stuff
so im pretty sure i dont rly know how to optimize sd well
but yeah if you want more speed, look into LCM and maybe TensorRT
Aight, I'm heading out for the night. G'night folks.
LCM is the easiest
yeah, i'm off to work, later
what about the x formers?
large language model
no its my personal rig
really good
like a bit less than 4090 but much better power consumption
cus the slower memory, less core voltage but more cuda cores
yes but games dont care about that
nah
not for a single card
7950x
with 96gb ram
uwu
thats not a sd workload
maybe sdxl with 4x upscale something dumb
but ive been happy with sd 1.5 models
with a heavy upscale
seed hunting is quite quick
networking job
and not expensive girlfriend
tech jobs that require a degree yes
yes but tech and degree
and throw in some certs
anyways i will sleep, its monday now
i will look into the lcm and trt and tensorrt
wow get some sleep, im in ca so its only 12
hi
you can
but you're gonna need an anime sdxl version... the stock sdxl makes these way too rarely
😄
yep i'm being random artsey
artsy?
it's that thyme again!
ha ha , all my saved styles with no extra prompt
hmmm something wrong with that leg?
I see you and raise
better yet
cat herding?
was suppose to be... Battle between Vampire cats and Human bunny monsters in the heavens
but noooo
gives up
best is to do it in stages
make one, then img2img, and crop the original into the picture back up and redo again and again
or if you're using a1111 then img2img in an area far away from the 1st successful catfighter
feelin kinda cozy rn
@lyric zealot get one of these too 
surely i can get that
@gritty spruce @wooden coral @dull jackal scammer there
thanks!
f ya wanna do something funny then add semi transparent balls/swimming aids to the pictures
doesn't matter if it is a dragon/cat/mouse etc... just make them wear it
@gritty spruce @wooden coral
thank you
lol
and can't post the next one cause well that shirt went umm.. gone
ok just got the hulk. hmmmm ok
pink fox girl miku 😄
another one @gritty spruce
also @wooden coral or @finite herald or whoever is around 😄
lol
btw you can just report the messages, no need to ping us all each time 
you can react with ⚠️ to the message or right click->apps-> report to staff
oh, didn't know worked like this
t-rex arms -rawr
lol, this one is cute
How strange. The "Add Detail" LoRA doesn't appear to be on Civitai anymore. Dunno why. I'll upload mine to a google drive and get you a download link.
Wait, it was just a different name than I remembered. Here you go: https://civitai.com/models/122359/detail-tweaker-xl
Detail tweaker for SDXL. Works with weights [-3, 3] Use positive weight to increase details and negative weight to reduce details. Good weight depe...
It's not a huge effect though, but it does add some detail.
If you're referring to the link that I sent above, that's the only one I've ever used. I don't have any further info on others.
hmmm anyone know a good cheap joystick for space combat games and such?
Elite Dangerous, Star Wars Squadrons, No Mans Sky
etc
for use in vr
I need to make a bigger play area for VR stuff too, there isn't much room in here...I can't take half a step without reaching the end zone
def gonna get an apartnent with 3 rooms next and dedicate one to hobby room and create a 3x3m sandbox. Maybe take some kind of track and attach it to something. Or no maybe just get a KatVR if I get that invested
@lyric zealot You're now being used as part of Discord advertising lol

they should pay me for that 
when did EA stop saying "EA. IT'S IN THE GAME" or any other variation?
Yep
Apple? Permitting sideloading? That'd be a cold day in hell
Well good luck to ya then
Sorry... but I got a chuckle outta that
Apple really loves their walled environments
Yeah
ok
I was reading on the stable diffusion ai site about what each SDXL controlnet does, and it says he couldn't get the openpose one to work. But I saw in this chat that people use it. So, which openpose is best for SDXL? I have a 3060

well it's quiet again 😛
Was there a p i n g?
/ping
-sneaks over and saves-
darn no info? what? prompt please!
tries to figure it out
oh come on, you know i cheat at this...
i did post some actual SD images the other day...
gotta keep people guessin' 😛
These are Dalle3
where are the Dalle2 imgs tho
hmm lemme see...
punk rocker wearing black top , skirt and silk hosiery, long black-pink hair , sharp focus, epic detail, elegantly drawn with colored pencils , art by Stan Lee
there ya go
thats one way to come off the page
took out the artist, added pov chest, and replaced pencil with pen
Tomboy Supremacy
needs more muscles
ok.. .replaced rocker with nerd so it's punk nerd, and for some reason it comes out ..ummm.. I can't post these
😄
try "colored pencil drawing"
no more punk
cry about it 😢
😐 the dalle model 😐
oh
give it a try in SD though! I like trying to get as close as I can.
Love these
she can kick me anytime
she innocent, didnt do a thing,
dale is a fantastic model. too bad can only be used online, and so, the mixes are limited
Dale?
dall-e
oh
yeah dall-e is alright, dall-e's strength is responsiveness to prompts
but it's not so good at making fantastic art, like breathtaking art
i would love to model a comfy workflow around it
god damn it, my gens are semi-nsfw, dunno if I can even show em
wha?
Guys
Got any new places where I can generate SD images for free?
And I mean the ones with good customization like controlnet and Loras
what do you use in your prompt to get the little speech bubble emojis?
I haven't been able to get my model to do that
i guess my model can't do it, i get a heart but no speech bubble 😛
hmm heart in a speech bubble worked, but it's adding random hearts everywhere.. i think i fried the text encoder in this merge
8/8 [00:01<00:00, 5.69it/s]
SDXL is slow 😄
512x512 SD1.5 is a bit faster 30/30 [00:01<00:00, 32.99it/s]
a little faster with TRT 30/30 [00:00<00:00, 42.64it/s]
I think I can go faster with a different sampler, but it usually looks bad
lol, PLMS sampler 30/30 [00:00<00:00, 52.37it/s]
Lewd
I love these
What part of it sucks? Each model is different but if you want something specific you might need a custom merge or training
There’s just not very many and most are based on the same one or two base models
A 4090 would be cheaper. But also renting a service for a few hours is cheap
training a checkpoint with a 4090
I did
😮
Takes about an hour or two depending on what you’re training
lol
I was training a face, i used 15 images
To be fair, most information and tutorials are mostly wrong or very specific to something most people aren’t training
to get a good checkpoint like counterfit or neko u need a lot of images that wont do it on a 4090
You can, it’ll just take a while
yea like 6 months
In most cases you don’t need to train an entire model though
Something like counterfeit already knows anime, so just train enough to change the style
Or merge models. The pictures I posted earlier are from a model I merged including counterfeit and kawaiipencil
to change the style you would need more images unless u want it to come out in same style
A style shouldn’t take more than 30-50 images, but it depends on the style too
Although if it’s drastically different from the base model used it may take more
Back to work..
ill be patiently waitin for that xl anime checkpoint release
you can crowdfund the cloudpower to train your model, $300 should do it
getting the dataset and labeling it is 90% of the effort
how long would it take to tag if i have a varied set of 200k curated images
Does anyone know how to merge this model with the face shape of another model? This model is beautiful but the face shape is too cute, I want to merge it with a model with a more mature face shape.
Recommended Negative Prompt : (worst quality, low quality:1.8),(nose, nostrils:1.2),
does it not change the face if you prompt it for mature/adult or a certain age?
if not, you can try this method to extract a lora then merge it with a different model https://youtu.be/CnoXyIcXQV4?si=esxQvin-zZiSeyfF
the video is a little old, but the method is still valid
Merged multiple style loras for that gen. Kinda love how it turned out
Try MBW with these starting values:
Character Transfer
0,0,0,0,0,0,0,1,0,0,0,0,0,0,0,0,1,1,1,0,1,0,0,0,0
Expect to need to tweak the values until you get it working well. You want MaidMix as model a and your mature face model as model b.
and they use a A100 for that speed
seems like pretty much the same as the existing LCM
high end scaling, but the comparison is scalable
i think i will take a break from lcm lora uses
specially after having some quality loss in images
i'm playing with some new merges, doing all my testing with LCM on so I know exactly how it looks
what kind of loss
quality
ok, but what do you mean by quality? is it leaving noise in the image, low resolution, lack of detail..
not as crisp
compared to using say 30 steps or more with dpm samplers
the lcm image tone misses some detail
kind of like bit washed out
in general you could say they both deliver a level of aesthetics but in comparison lcm lacks a bit
i can't even replicate my above image without LCM, it changes a lot having 30 steps and a higher CFG
but, the above image is what I was trying to get from the model, so I'm good with that
not asking you to stop using lcm lol
just saying that has some loss
and also wont handle complex prompt too well
yeah, i was just testing to see what difference it made. It's pretty drastic
I think LCM kinda sucks
nah not suck
kohya deepshrink is much better
its going to evolve
suck -> not worth using
lcm is very fresh
you can get decent images with lcm but im making a comparison
the detail are bit restrictive with complex visuals
but hopefully that will change in future
I think it's more the prompt coherence when it comes to complexity, if the model can do complex imagery with a simple prompt, it will
it adds some coherence but the overall image tone misses some detail
but they are ok for general viewing purpose, again im making a comparison
not using any lora or detail, just basic generic prompt, the image has some depth to it
btw i love it that i can render images at 512x768 on sdxl and hires it by 2x
what's the prompt for those?
this models isn't very creative
is that with lcm lora?
yeah
familiar
but that's not why
content came out ok, but texture is bit flushed out
at least not completely, that model is supposed to look like that
but, it's just a 'style' at this point, it's not very creative or detailed
like, the image i posted earlier with the underwater city or whatever, that was my BPN based model, this is with my new anime model, same seed/prompt/etc
massive difference in 'filler' in the image
image looks cool but texture is missing depth
you can tell by the buildings on the forefront and back
well this model is specifically for lineart, it knows almost nothing else at this point
i've mostly been making stupid cutsie stuff with it
ahh ok, thats also relatable to lcm
because that's what it's for
This is nice. I particularly love the lack of tiling issues.
it's not tiled
rendered at 1920x800 with hires x2
sorry, read that wrong, thought you meant like tiling artifacts
You misunderstand me. When an image is generated beyond what is best for a model, you can get "tiling" or "repetition" issues where some "ideas" are repeated throughout the image to make up for the lack of data.
Such as the repeating elements in the BPN gen.
The fact that there's none with your recent image is impressive.
i hadnt even paid attention to that actually
that's the first oversized render i did with it, just to compare
i was surprised with that too before when i was rendering images at 512x2048 on sdxl, i think most sdxl can do well w/o needing deep shrink
yeah, it's really not tiling elements in the image much... interesting
even stretching it wider, one of the models i used must have had some massive source material
looks cool
stupid UI wont let me go over 2048 😄
what happens if you try to enlarge it more from Extras tab?
that just changes the resolution, it won't add more to the image
A spirited girl with fiery red hair races through a field of sunflowers. She holds a basket of sunflowers in her hand, her smile as bright as the sun itself.
ahh right
editing the ui config to change the slider maximums...
8192x1024 is too big for my vram 😩
7280x1024 works though, now we can see the tiling from the model 😄
lol
lol, looking at your images it's like she was happy and enjoying life, then her sunflower farmer partner dumped her and she burned it all down
are these blocks specific to certains aspects of a model? If so is there a list somewhere describing them?
I've been trying to wrap my head around how the blocks affect things but finding details is proving difficult
There are several guides out there that give a general idea of the things that each block generally effects most prominently. The problem with those guides is that all of the blocks are interconnected and can vary quite a bit between some models. There really is no replacement for exploring each block independently to reach your goal. To do so, lock the seed and set a prompt that is pretty reliably produces good images. Generate an image on model A, then model B, then begin runtime merging test each layer at 1 with all other layers at 0 and take notes on what changed from the baseline model A image. Then you can use your notes to make an educated guess at how you should merge the models to get your desired outcome. A good rule of thumb for your final merge is that you generally want to try to avoid very sharp differences between any given block and the ones next to it. The values should transition somewhat smoothly in most cases.
I found part of my problem was that the supermerger extension is broken.. the Add difference mode doesn't work at all
probably a large part of my confusion when I was trying to figure it all out
Oh, yeah I basically never do normal weighted sum or add difference merging any more. I pretty much only use MBW.
which mode though? weighted sum?
MBW is only weighted sum AFAIK.
i mean, if not weighted sum

