#✨|sdxl

1 messages · Page 78 of 1

trim orbit
#

i'll offer less advice. there always seems ot be others who say it's bad

high skiff
#

No, it wasn't a waste of my time, it was a waste of time for everybody who took the time to help me

rustic garnet
#

uuh, I didn't want to make a "I told you so" out of it X_x

high skiff
#

It's very unfortunate that it worked out this way, I was really hoping that I would get some good captions from this, but this is honestly worse than swinv2 tagger that runs over a hundred images in the time that it takes this to process one

rustic garnet
#

I mean, it definitely shifts a caption more into the visual style of the image

#

although the captions are bad. You can try that if you use them as prompt

trim orbit
lilac raven
#

Should the LoRA be applied to the Refiner pass?

crisp owl
#

no

trim orbit
#

whose right. this experienced fellow saying he has success, or this other experienced fellow walking sytan thoruhg it to failure

high skiff
#

I'm not sure why he says it's so good either, it's very slow, and I think the results speak for themselves, and they're not very kind

rustic garnet
#

"and they're not very kind" <-----

#

Clip Interrogator always insults me. Thats why I hate it 😜

trim orbit
#

theres just always such a cross fire of information. especially in the cases where people have success and then others say "no thats wrong". hard to tell whats accurate so i guess why bother. i'll just keep experimenting on my own and keep it to myself.

rustic garnet
#

I mean, thats fine, and the way to go anyways. Cause whats theoretical right and what works in practice might be very different things

#

I know I can be a smartass quite often 💩 But I don't want to tell you what to do or blame anyone for anything. I just thought it could be interesting for anyone if I explain why clip interrogation gives so strange results back 🤷 oh, and English is not my mother tongue, so I just sometimes say garbage cause I'm not a native speaker

trim orbit
#

i also hate when you see people like caith having success with an appraoch, and someone demonstrates one bad result and dunks on the entire situation. i'm not into the single result conclusions ever. caith seems to have experience with it that works is all i've noticed. that seems factual to me

#

a single bad result doesn't seem to determine its not worth it

#

might indicate a bad configuration

rustic garnet
#

I mean, you will never get good captions with clip interrogation

#

they will always look horrible

#

the question is if you can train with them

#

maybe caith has good success training with horrible captions

trim orbit
#

how do you think captioning is done for the base model. i really don't follow your jedi mind tricks here

rustic garnet
#

they use the LAION captions

trim orbit
#

laion doesn't hand caption the set

rustic garnet
#

they used the alt tag in the images

#

however, pseudo once said they are using BLIP now

#

so maybe the images are BLIP captioned

high skiff
#

I believe that was what I saw as well

#

I am just really confused, because I thought a very good clip model would have produced much better captions, because I use a very lightweight and fast auto tagger (Swinv2) and it does very good in fractions of a second per image

rustic garnet
#

but BLIP is also sometimes really bad ;_; It gives me a lot of advertisement stuff very often. Like whenever there is an image of a blond guy with green clothes I get "Zelda - The windwaker"

#

maybe they are even better in clip space

high skiff
#

I need to see if I can bother pseudo and figure out what captioner he uses, because his captions were really good last I remember

rustic garnet
#

but just one thing

#

this was an old DnD character of mine:

#

when I use CLIP interrogation I get back:

high skiff
#

I used AI to generate my really strong and beefy black tiger to back seat,

rustic garnet
#

arafed image of a man in a dark coat holding a wand, bussiere rutkowski andreas rocha, blacksmith, with a full black beard, thin young male alchemist, mace and shield, sqare-jawed in medieval clothing, lord of cinder, realistic fantasy illustration, scientist, fantasy alchemist laboratory, high quality dnd illustration, hecate, brown cloak

#

which is REALLY bad

high skiff
#

No way in hell voice typing just said two back seat instead of tabaxi lmao

rustic garnet
#

however, if I use that and add the word "fat" to it and use CLIP Interrogation for the negative prompt, too

#

I get

#

which in my opinion is pretty close

#

for a prompt that is soo totally off

zinc lynx
#

i see

rustic garnet
#

so Caith is definitely right when he says it works

#

it works if you want to recreate an image

#

the question is just if you want to train on these strange captions

#

I would intuitively say that its better to train on simple but precise captions, so that the model learns how to make the images I want from the natural captions I provide

trim orbit
#

the theory is is that the model it's trained on is best for captioning, since it speaks the same dialect. thats why they're potentially good captions because they tie to what the latent spaces know more accurately. despite looking inaccurate for human purposes

#

research required

rustic garnet
#

the funny thing though is that the captions SDXL is trained on are the much simpler ones. Its never trained on these clip captions

#

the text encoder, however, is

rustic garnet
#

which seem to have a similar effect

trim orbit
rustic garnet
#

but just think about this: if you want to train a model on a celibrity, you do not want to add a description of the celibrity to the training prompts

#

you want that the model just learns from the name of the celibrity how to draw him

trim orbit
#

when i'm training people, i leave the descriptoin of the person very sparse. "name class" thats it

#

everything else gets captioned

rustic garnet
#

however, if I want to draw an celibrity without training the model, then using clip interrogation makes sense to get as close as posible

#

yeah, but I think this is not just true for people, but for other training concepts as well

#

but training is such a huge topic. Hard to say what works best and what not

#

and it seems to be different for each concept you train on anyways

lilac raven
#

I was looking at all the checkpoints saved of the LoRA I tried to make, and one of them had 0.0253 loss. I think that one actually works (Yes, it's intended to be super pixelated stuff, it's trained on upscaled 64x96 sprites)

#

And it doesn't need that crazy weight 4 to have some effect

rustic garnet
#

and you trained it on cpu? crazy ^^

lilac raven
#

I'm glad I bought the extended warranty for my laptop

zinc lynx
#

cpu trained on a laptop

glad grove
#

cpu training on a dual core (apu) laptop

polar jacinth
#

how does it work?

zinc lynx
#

id suggest trying out more capable workflow

#

ive enjoyed using Sytans

#

there are good tutorials on youtube as well

trim orbit
#

each node is a little script/process. they wire together like a sequence. queing up a prompt makes it all execute that sequence

#

you'll want to set your width and height to 1024x1024 at least. other workflows like sytan's preconfigure all of that

zinc lynx
#

Since we have released stable diffusion SDXL to the world, I might as well show you how to get the most from the models as this is the same workflow I use on a daily basis at stability.ai. In this video I show you some of the basics on how to get the model from the models to generate your best AI artwork from our models. You will need some of ...

▶ Play video
#

i liked this video when i started out

#

could also start at episode 1

rustic garnet
#

yeah, width and height have to be 1024 to work well with sdxl

urban fjord
#

With LoRA you can do 512x512 but it's not really worth it.

trim orbit
somber hill
#

is this a good youtube title : Become A Master Of Kohya SS For LoRA Training From 0 To Hero - How To Install - Find Best Checkpoint

urban fjord
#

Sytan's Corgi with and without LoRA at 512x512. But given that my 512x512 generation speed with SDXL is much slower than 1.5 there doesn't seem to be any real use to generating in this resolution.

soft bone
#

What would be the effect of using twice as many reg imgs as training imgs for a face? vs. using the same amount.

urban fjord
somber hill
#

fully edited

#

alternative title Become A Master Of SDXL Training With Kohya SS LoRAs - Combine Power Of Automatic1111 & SDXL LoRAs

somber hill
#

editing took my 3 days :/

midnight shuttle
#

90 minutes tutorial? How long if it was a written web page instead?

somber hill
#

by the way making written page is worthless. chatgpt will grab it make money, google will grab it make money, you get nothing 🙂

midnight shuttle
#

So that's why I have to watch a 90 minute video instead of read 5 pages in 10 minutes? For money? 😕

somber hill
#

this is also future be prepared to it. web pages will die

#

or will get behind a paywall

midnight shuttle
#

It's not just you. It's everyone.

#

But following videos often takes much more time and effort than simply reading an instruction.

autumn forum
#

hey, i like videos because i dont like reading.

somber hill
autumn forum
#

id rather watch a video on a paper than reading one. lmao. call me lazy but having add/adhd i cant read for shit lol Edit: unless its technical. i can read technical stuff

somber hill
#

preparing very detailed and good graphical reading really hard

upbeat summit
#

testing @delicate kelp's upcoming new SDXL model ProtoVision XL

somber hill
#

video much easier

autumn forum
heady vale
upbeat summit
autumn forum
#

i wish i knew the secrets to making good models. lol. but i have a feeling some of the best models are the ones of merged a bunch into 1 good model

somber hill
#

ofc for a good model you would need a lot of images and computation power

autumn forum
#

anyone wanna do a fun game of vote :P, which one is better

short marsh
#

Made a setup to generate supporting terms from the initial prompts with GPT

somber hill
#

by the way i spent huge time and put very detailed chapters to my all videos with corrected subtitles

urban fjord
soft bone
#

Regularization is confusing. @somber hill I saw you asked about this in kohya issues so maybe you have an answer.

If my img folder "20_ronald" has the same # of imgs as my reg folder "1_man", then the reg images are repeated once PER training image so its actually the same amount of images in terms of steps.

BUT if i put way more images into reg folder and keep "1_man" title, training still goes for the same amount of steps. So theoretically if I put 400 images into reg rather than 20, there would be one unique image per training image, rather than 20 images repeated 20 times.
Correct?

autumn forum
#

lmao 1 is base sdxl

somber hill
#

so if you have 10 repeating and 20 images it will fetch 200 class images from folder

#

cache them and use them

#

no matter how many epochs you train

#

i hope he improves this logic because this is very restrictive

upbeat summit
heady vale
soft bone
#

And would that have any benefit over just matching the training img count and letting them repeat?

autumn forum
#

interesting, thanks for the input friends, E- would be my MoviestillXL Lora and CineJugg Merge

polar jacinth
#

someone send me the workflow?

soft bone
urban fjord
#

I don't use reg images and I never have any issues. But I'm not in the habit of making LoRA models of my face so maybe it's needed there.

soft bone
#

It helps a ton with detail coherence and variety in outfits & lighting & environment

autumn forum
#

photo B was williamegg Lora

upbeat summit
polar jacinth
somber hill
#

you need to have multiplication of those

#

but still this isnt a good logic

#

what if i am gonna do 200 epochs training? in each epoch i should be able to set a unique class image for each image

#

but it is impossible atm

#

only way is setting repeat 200. but then you cant have frequent checkpoint saving

#

to compare later

short marsh
urban fjord
soft bone
buoyant axle
#

as i am still wondering how "good" captions look like (especially for person/faces)...is this caption completely bad or just the ones following after "a woman sitting under a palm tree with her hand on her head"? what would be a good caption for that image in your opinion?

somber hill
#

now this can be used if so

#

but still

#

it may not work

#

if repeating logic is like this

#

it repeats same image

#

then moves next image

#

do you know logic?

somber hill
#

so it will always use same class images

#

thus reduced generalization with prior loss

soft bone
#

even for 1 epoch this is the case. I just dont know why i should put hundreds in reg vs. the same amnt as training imgs

#

it trains for the same steps

somber hill
#

i dont think we are on the same page

halcyon tiger
#

Hello, i just installed comfyUI with sdxl and i want to know how i can upscale an image with sdlx

#

or do i need to download a whole new model for that?

somber hill
#

@soft bone the logic is using as many as possible different class images at every step

#

so that we keep generalization of model maximum

sweet bane
soft bone
#

assuming my training folder is 16 imgs and 20 repeats

#

I ask because I know for a fact that most people are using reg folders with the same amount of imgs as img folder. Rentry tutorials even explicitly say to do this. So I guess they're all doing it wrong

west breach
#

I used the VIT-H-14 to caption my images and it put this 🛸🌈👩🾠in my caption file

urban fjord
#

I find that you don't really need to do much to get generalization with a LoRA.
Comparision with a single-image LoRA without any reg-images and no LoRA.

somber hill
soft bone
somber hill
#

if you have 320 it will have 20 class image for each image

#

if you have 3200 it will still use 20 class image for each image

#

this is my best realims level but it is with sd 1.5 realistic vision v 2

obsidian rock
#

Is there a good dataset for a character (e.g. a celebrity) with good captions out there I can use? Or does anyone have one they can send me? I am new to Lora training and want to have a solid first set to experiment with

polar jacinth
#

Please guys, can someone send me a workflow, realistic, I really need it

somber hill
# polar jacinth Please guys, can someone send me a workflow, realistic, I really need it

Dreambooth is the best training method for Stable Diffusion. In this tutorial, I show how to install the Dreambooth extension of Automatic1111 Web UI from scratch. Additionally, I demonstrate my months of work on the realism workflow, which enables you to produce studio-quality images of yourself through #Dreambooth training. Furthermore, I shar...

▶ Play video
urban fjord
urban fjord
#

Sure you will get better results if you have several high res images of your subject, but you can get far with just one low res image too.

soft bone
#

@somber hill This logic is a difficult discussion over text lol.

Here's what I know. I know for a fact that most people are using "reg" folders with the same amount of images as their "img" folder, regardless of repeats or epochs. Rentry tutorials even explicitly say to do this. You're saying you don't do this, that you instead have hundreds in your "reg" folder and less than that in your "img" folder.

Since everyone is doing it differently, you should make a video on it

somber hill
#

with kohya no matter how many you put it will use repeating count per training

#

with dreambooth extension it will use up to 100 totally up to you to set

#

with other scripts i dont know their working logic

upbeat summit
soft bone
#

when it comes to regularization images, if using hundreds is better than using 15 (or however many training imgs they have), people would like to know, because they're all using 15, because that's what they were told to use.

civic sigil
#

Huh, I always heard to use hundreds / thousands of reg images for prior preservation, or something like 15x the number of training images

soft bone
urban fjord
#

Test things and not just take everything as gospel. Training SDXL and 1.5 also seems to be different so what's work for one might not work for the other.

soft bone
west breach
#

Are you supposed to put the trigger word in the reg image captions too? That's what I've been doing

upbeat summit
glad grove
#

i wanna smoke now

soft bone
somber hill
#

i covered this in my new coming tutorial hopefully

gloomy barn
#

facial restore is not good for side view

ionic gulch
urban fjord
#

Facial restore is not really needed, just use inpainting instead.

vast narwhal
zinc lynx
#

i just had in n out for dinner

upbeat summit
zinc lynx
#

poster vibe

#

do you guys like having a do it all workflow or multiple smaller workflows for specialized tasks

urban fjord
#

Multiple. One for text2txt and one for img2img in Auto1111.

vast narwhal
zinc lynx
#

is your txt2txt to get better prompting

#

favorite libation while experimenting?

rigid laurel
zinc lynx
vast narwhal
#

or the reroute

civic sigil
soft bone
urban fjord
#

The link SECources linked explained it. If you use more reg images than training images then they're not being used.

#

But the best way to avoid overtraining is to train for fewer steps.

civic sigil
#

What I'm confused is why not just use your own reg images and train them normally alongside your training images? Kohya already lets you use multiple folders with different number of repeats so you could just do it that way

soft bone
soft bone
civic sigil
#

I honestly feel like Kohya is overengineered in general

urban fjord
#

Yes, if you creates repeats of your training images it will use more of the reg images, I don't see the issue. If you do not want this than do not repeat your training images.

soft bone
soft bone
urban fjord
#

If you have more reg images than training images you can use repeat for the training images to use more of the reg images.

gloomy needle
soft bone
gloomy needle
#

If you only lower steps, itll be undertrained

#

Naturally existence of reg is for model to retain some concept after training

soft bone
#

I personally have no problems with overfitting or regularization, I'm just getting pissed that the guides and tutorials have apparently been wrong, and maybe I can clear it up for readers.

gloomy needle
#

What is the point reg images > train images?

soft bone
#

cuz training images get repeated

urban fjord
soft bone
urban fjord
#

I use zero reg images per training image and that works for me, I'm just stating what was explained to SECources by Kohya.

civic sigil
#

Why do a small amount of reg images repeating instead of a wide variety of images if you have the choice

gloomy needle
#

Thats the limit of model

urban fjord
#

You need to collect those reg images or generate them and I'm lazy. So I choose zero.

gloomy needle
#

it will forget sth after train

#

You cant fit everything in into 800m parameters

soft bone
#

In fact, the single best face lora I have seen in this channel used "20_img", "1_reg", same number of images in each.

gloomy needle
urban fjord
#

Did you see a comparision with and without the 20 repeats?

soft bone
civic sigil
#

Unless you're lazy or have some high quality reg images rather than auto generated ones

urban fjord
#

I'm saying that based on the explanation from Kohya-ss then 1 repeat and 20 epoch should be the same as 20 repeats and 1 epoch.

ionic gulch
soft bone
gloomy needle
urban fjord
#

Yes in that case you need to use repeats but if you have the same number of images in each then it doesn't seem like you need any repeats.

native moon
#

@visual glade hey what do u think about this? just a thought i had that probably others also had. but i just wanted to know if it would be theoretically possible🤔 splitting models into many parts "We are able to split models in half and generate the first part on one PC, then transfer the activation and generate the rest of the model on another PC. If we only have one PC, we could offload the unused part into system RAM. However, let's imagine that our model is so large that this approach is not feasible, or we simply don't have enough space in system RAM. Instead, we could read the current model part from storage. This would slow the process down , but at least the model would be able to run regardless. In the future, models will become much larger, necessitating more system/V RAM. If we consider NVMe storage, the model part switching could become quite fast. With PCIe Gen 4, we can read at 7GB/s, meaning we can transfer almost 24 GB of data into VRAM in just 3 seconds. With PCIe Gen 5, we will achieve double that speed. Additionally, NVMe drives are much easier to upgrade than GPU / VRAM. And if the CPU becomes a bottleneck, we might be able to utilize direct storage technology to load the model even faster into VRAM, bypassing the CPU. what do u think about that?"

soft bone
urban fjord
#

If you're getting great results with using the same number of reg images then it's just a waste of time to do 1000s.

soft bone
#

so i try anything that might improve quality

upbeat summit
vale eagle
#

in 1 repeat

soft bone
#

so that shouldnt do anything

#

I'm using real photos in my reg folder, with random filenames (dont know if thats bad)

urban fjord
#

It is the folder names that matters with Kohya-ss and not the filenames.

soft bone
urban fjord
#

Yes, I'm not even preparing my folders in Kohya-ss as it's simpler to do it manually.

civic sigil
soft bone
#

i just wonder why they set things up the way they do if it really means nothing. why append the class token to the img folder name? i want answers agony

urban fjord
#

The folder names matter as that's the caption for the images you're using.

#

Unless you're using caption files then I think the folder names get prepended to each caption.

civic sigil
#

Well I'm using the full tinetune which behaves differently I think, for some dumb reason

#

Idk what the dev was thinking

urban fjord
#

If your folder name is "1_photo of" and the caption for the image file is batman I think when training it is using the caption "photo of batman"

soft bone
#

i dont think thats the case

hexed hatch
#

how would I go about disabling the upscaling temporarily in comfyui? im using a template

soft bone
civic sigil
#

I would expect it would replace some sort of keyword like [foldername] in your captions

vast narwhal
urban fjord
#

No, if you are not using caption files then it uses folder names and not filenames. I'm doing this all the time and it never uses the filenames.

hexed hatch
vale eagle
vast narwhal
urban fjord
#

I'm pretty sure the folder names is added to the start.

hexed hatch
soft bone
#

i wonder what that does to quality

civic sigil
urban fjord
zinc lynx
hexed hatch
soft bone
civic sigil
#

Personally I think they should just take images and captions and num of repeats and leave the preprocessing up to you

vast narwhal
trim orbit
#

kohya requires the folder be set up as #_subject class as part of it's configuration. each data set folder can then have individual configurations too. its quite flexible. if you're using text captions, you still need it to be set up that way afaik

urban fjord
#

Alright, now I'm actually a bit unsure how caption files and folder names interact.

civic sigil
#

In any case is it even better to put an actiavtion word at the beginning of the prompt? I am struggling to decide if I should since the artists name is fairly long but I still want to use it as my activation words to give the model a head start. Won't it be a bad thing to have so many tokens before my subject in a prompt?

trim orbit
#

i'm not sure what happens in the code or if you need those tokens in your captioning, but i always caption with the "subject class" leading the file

#

never seems to be an issue

trim orbit
#

what matters most is that your token seems to be the main subject of the prompt.

civic sigil
#

If you are training an artists style it is best to use their name as the activation word if you want to give the model a head start though

ionic gulch
#

another try with img2img comfy workflow:
https://i.imgur.com/iBoFKN4.jpg
"highly detailed dslr frontal closeup head and shoulder portrait of a beautiful woman in front of a pyramid hit by lightning, cloudy skies, rain, enclosed by trees on both side in front"

soft bone
urban fjord
#

Kohya-ss lists the folder name as class tokens when training so it is at least using that for training.

soft bone
#

I think I've at least deduced that image filenames never matter in any circumstance

#

training will either use txt file or folder name. never filename

urban fjord
#

Yes that's right, filename is never used apart from connecting images to it's caption files.

civic sigil
#

It was convenient when filenames were used as captions though

#

They should make that an option again

soft bone
#

yes. i still do it by habit. i miss dreambooth

slender coral
trim orbit
#

who told you you needed to use 0.9 refiner?

#

must be the vae issue which was resolved and the current releases use the better vae

autumn forum
trim orbit
#

somehow morphed into a conspiracy about only using 0.9 refiner

slender coral
trim orbit
#

they have the proper vae baked in now

#

since nobody was using the refiner in automatic then, he must've meant in comfy. which you can just wire the vae from the base model to where you need it.

chrome flicker
#

anyone know how to train a sdxl refiner model using custom data?

slender coral
trim orbit
#

people building a huge mountain of this tiniest problem. like 2min into release the channel was brigaded by side servers that have so much spite for stability. the probelm was fixed and dealt with so fast. somehow it's still a perceived issue though.

trim orbit
chrome flicker
trim orbit
#

i only use auto now and then these days and prefer comfy

slender coral
trim orbit
trim orbit
slender coral
trim orbit
#

hmm. i dont know about that. there's lots of projects that tie into comfy

chrome flicker
trim orbit
#

maybe you just prefer the pain in the ass you know and not the one you have to learn

slender coral
trim orbit
#

looks comparable to me

slender coral
trim orbit
#

oh... okay. i haven't found an extension or system i haven't been able to get on comfy in some form

#

but i guess its not comparable

#

was wrong mb

#

🙄

slender coral
#

Yes your bad 😄

autumn forum
#

flowwolf you still got me blocked? 😛

slender coral
slender coral
autumn forum
# slender coral Y'all have issues with him? 😄

nah we got in an "argument" in the past. and he threatenedt to block me and i said do it lol. but i dont hold grudges so if he unblocks me thats cool too. he can just be a little be umm idk hes just flowwolf

slender coral
#

One last question, if I clone the new auto repo, I can just toss the sdxl checkpoint in the models and have at it?

trim orbit
#

this is like showing off discord community numbers for how good a video game is

#

irrelevant metrics

slender coral
trim orbit
slender coral
#

I would suggest actually developing something on a platform before talking...

slender coral
trim orbit
#

dial it back roddy rod

slender coral
crisp owl
#

So many arguments in here lately

visual glade
#

the fact that comfy has more functionality than a1111 with less contributors and commits means it has a better architecture

glad grove
#

or sd animation?

visual glade
#

hasn't someone done that already?

glad grove
#

the sd animation or sadtalker not yet

soft bone
#

image vieweragony

zinc lynx
#

🍿

west breach
#

love how easy it is to code up a custom node for comfy

soft bone
#

same seed, 16 reg images vs. 30 reg images. only 16 training images. It has a big effect so I'm gonna try hundreds

vast narwhal
west breach
trim orbit
#

whenever people talk about popularity contests in open sourced software mattering, i always reflect back on how popular openssl was when heartbleed dropped

west breach
heady vale
autumn forum
heady vale
glad grove
#

big chonker

trim orbit
zinc lynx
#

maybe im missing something but at some point having more contributors seems like a hindrance but im not a developer

#

just thinking of the too many cooks metaphor

trim orbit
#

its a double edged sword

vast narwhal
boreal bough
#

never tried it... so I must have missed that

urban fjord
#

Wasn't that just in the dreambooth extension?

boreal bough
#

other trainers that used kohya in background still support this, so I just assumed that it was a command line thing, and never questioned it

#

but could be they reintroduced that capability

boreal bough
upbeat summit
boreal bough
#

umm...
captions in the regularization folders work well
but without it also works

both options have their positives and negatives

soft bone
#

he said he's been putting instance token in reg captions

#

the same token as training imgs

civic sigil
#

Why would you do that

boreal bough
#

at least for big datasets you really profit from captions in regularization folder.
for smaller datasets, it might have a reverse effect though - as it will reinforce concepts that it doesnt have enough images to properly do.

civic sigil
#

I think they did too much abstraction for training, it should really just take images and captions

#

People dont seem to understand the basics

soft bone
#

I'm using ground truth reg with no captions for small training datasets. working well

boreal bough
boreal bough
soft bone
#

it definitely helps for faces on 15-25 imgs. made broken details coherent too

civic sigil
#

I mean images in general, reg or training

boreal bough
#

yeah no. never put instance token in regularization images

#

I misread that earlier, sry

soft bone
#

no worries lol i could tell you meant captions.

west breach
#

I was doing that, putting the instance token in the reg images. maybe why my model is hit and miss

urban fjord
#

You're watering down the concept you're trying to learn by captioning reg images and training images the same.

civic sigil
#

I dont understand why there arent reg folders for people to just download, why is everyone making their own?

#

It would be nice if we could all share a high quality captioned 10k images or so

soft bone
#

theres also face detection ai datasets but youd have to crop and resize them all

urban fjord
#

I do see that my single image face Lora has some issues, but main culprit is probably lack of training images and I don't care enough about face LoRA to really test things out. See if you can recognize this.

midnight shuttle
#

90% of the work in AI is creating the data set.

civic sigil
#

With real captions and stuff

soft bone
civic sigil
#

Yeah it would be nice to have folders with art, anime, etc as well but maybe thats asking too much

soft bone
#

You could absolutely sell that if you made it yourself

boreal bough
#

for smaller training attempts, this is totally overkill though

urban fjord
#

If you're going to teach the model completely new things then training images and ground truth is the same thing.

short marsh
urban fjord
#

If you're not doing faces, try out smaller more concise datasets without reg images and it might work a lot better than you think.

trim orbit
upbeat summit
trim orbit
spice skiff
slender coral
#

Thoughts on why I would be getting some pretty bad results?

autumn forum
raw prairie
# upbeat summit

Why does it look like a Far cry 3 screenshot when i close my eyes a bit?

fathom lagoon
slender coral
fathom lagoon
#

I used to date a girl with hands like that

slender coral
#

Is there some magical negatives I'm missing?

glad grove
zinc lynx
autumn forum
#

also 1024x1024

slender coral
fathom lagoon
#

Been having really good luck with widescreen output ratios too

autumn forum
slender coral
autumn forum
#

my step count

fathom lagoon
slender coral
#

Wanted to follow this but they didn't cover the compy:https://aituts.com/sdxl/

On July 22, 2033, StabilityAI released the highly anticipated SDXL v1.0, just a week after the release of the SDXL testing version, v0.9. The 2 most popular ways to run SDXL locally (on your own computer) are: In this guide, we'll show you how to use the SDXL v1.0 base and refiner models with AUTOMATIC1111's ... Read more

fathom lagoon
#

1344x768 for these gens

slender coral
autumn forum
#

are you looking for something like this?

slender coral
#

worse:

slender coral
autumn forum
#

prompt:"primitive dirty cavewoman portrait, by daniel f Gerhartz, sitting on a rock"
Negative: bad image, blurry, ugly face, Black and white image, bright image

fathom lagoon
#

try some frank frazetta in there too

hardy cipher
fathom lagoon
hardy cipher
#

lol

glad grove
azure oxide
#

member when ppl said 12 steps was good enough

#

now 30 or 60 is the recommended amount?? lmao

hardy cipher
#

when did people say 12 was good?

#

it really depends on what you're going for I think

glad grove
#

cant wait for SDXXXL 180 steps

hardy cipher
#

I want to go the full 10,000 steps like the max in the advanced sampler

boreal bough
#

age of the chonkers

hardy cipher
#

big boi

#

phat

soft bone
#

@boreal bough Training on base XL and inference on finetune, vs training and inference on finetune?

slender coral
#

LAST question for the day, suggestions on one person in a portriat?

hardy cipher
#

what sort of suggestions are you looking for?

slender coral
#

Not multiple faces

hardy cipher
#

I mean, put portrait in the positive? "multiple people" in the negative. you can go on. "single person" "solo person" etc until it works I guess. there isn't a set formula

slender coral
#

Was hoping there was a term you all figured out for these concept art type moodboards.

hardy cipher
#

there isn't a universal term for anything really. there are so many variables involved

hardy cipher
#

#boorulife

slender coral
hardy cipher
#

are there?

boreal bough
#

clip only finetune on all booru tags would be funny XD

#

bit of an expensive experiment... but funny nonetheless

slender coral
hardy cipher
#

someone told me that the booru thing is a rabbit hole. haven't really went down that yet. but maybe one of these days

boreal bough
hardy cipher
#

it'll be totally different things each time.

soft bone
hardy cipher
#

latent space is like a trillion dimensions

#

approximately

boreal bough
#

makes a lot more sense now

#

for sdxl, odds are high we won't be having many 'true' finetunes - as making them will be a significant investment, that won't make much sense for any single person

#

what most are doing right now, is just making a lora and merging it XD

#

which... yeah. they missed the point

#

the only time I'd recommend merging a lora, is for using that as a training base as you mentioned, which then creates ideal support for dual loading those loras, both at 1 strength

soft bone
hardy cipher
#

so if you finetune you can essentially subract the original model and turn the finetuning data into a lora right?

boreal bough
boreal bough
hardy cipher
#

ahh. well I always used the dreambooth method, so I guess that's a bit different. and never really on any major concepts, just people mostly

soft bone
#

I miss training styles. it was so much easier

soft zealot
#

Is it just me or is the re already more "custom" models on Civitai for SDXL than there is for 2.1 ?

hardy cipher
#

I never exactly had anything against 2.1, but it didn't have all the cool stuff that 1.5 had

soft bone
#

as a downloader of every single 2.1 finetune ever made

soft zealot
hardy cipher
#

so many 1.5 accessories

molten gull
#

is there a node in comfyUI that allows me to "pull" text from a textfile into a comfyUI-node ?

#

and one that allows me to "push" a message out of comfyUI into a textfile ?

hardy cipher
#

yes

#

saveTextToFile _O

#

"Load Text File"

#

not sure where those came from, but I'm sure you can figure it out if you search around a bit

boreal bough
#

while my high effort lora isn't idiotproof yet XD with longer more complex prompting it already matches some of the 'finetunes'

#

it also learned all 100+ concepts

#

but I will make the damn thing work with 5 word prompts with 0 negatives 🤣

hardy cipher
#

orly?

#

I prefer the long prompt models. it's cool to be able to type in 3 words and make magic, but no control then

crisp owl
#

I feel the same

boreal bough
hardy cipher
#

nice. well that's ideal then

#

I like to start off with something short, then build

#

deliberate was good for that

#

but a few words would probably get your prompt in a white room or something

boreal bough
#

trying to use random civitai prompts XD but oh god those prompts are all horny

hardy cipher
#

there are some absolutely ridiculous loras on there

#

in good and bad ways

#

the things people take the time to make. I'm both impressed and appalled

boreal bough
#

tracer cosplay in a forest
also 'girl' no longer generates 4~10year olds x_x
will still need about 10h longer trainining until all the age groups finally converge

soft zealot
#

"a portrait of a pretty young French Girl" with analog Film,3D model & enhance filters applied.

Not sure which order these are in (until I look back lol) but these are *and should be in this order but who knows)

SDXL1 vanilla
Runsdiffusion_XL_Beta
dreamshaperXL10_alpha

*the model names are inthe filenames if viewing in a browser

hardy cipher
#

nice. that's a lot of words there

glad grove
hardy cipher
#

well you know I won't be able to stop myself now

glad grove
boreal bough
# soft zealot "a portrait of a pretty young French Girl" with analog Film,3D model & enhance f...

running base only+lora / one single sampler node only (no upscale shenanigans)
so I had to merge your prompt a bit XD (on my typical seed of '2', so I'm not cherrypicking)

Positive: a portrait of a pretty young french girl, intricate detail, modern, 16k, digital art, artstation, cinematic lighting, vivid, professional 3d model analog film photo, a portrait of a pretty young french girl, faded film, desaturated, 35mm photo, grainy, vignette, vintage, Kodachrome, Lomography, stained, highly detailed, found footage . octane render, highly detailed, volumetric, dramatic lighting
Negative: ugly, deformed, noisy, low poly, blurry, painting, painting, drawing, illustration, glitch, deformed, mutated, cross-eyed, ugly, disfigured, ugly,deformed, misshapen

hardy cipher
#

everyone needs the lora that made this

glad grove
boreal bough
#

oh god I was not prepared for that... wood

soft zealot
boreal bough
#

XD

hardy cipher
#

lots of steroid members

boreal bough
#

offtopic. but if you ever do nsfw stuff training - 'naked' is the right word you'll usually want. 'nude' has a different meaning (just cause I keep seeing this in prompts) - unless you completely retrain the clip model, which yeah. good luck XD

hardy cipher
glad grove
hardy cipher
#

some of the nsfw lora trainers on civitai should feel bad about themselves for what they've created

boreal bough
hardy cipher
glad grove
#

i also found one but its just looks like some manequins with bones and ketchup

hardy cipher
#

I wonder if any of the loras or models have some messed up easter eggs

glad grove
#

dont type this prompt at 3am gone wrong

soft zealot
hardy cipher
#

oh yeah, I like to forget I can do that

soft zealot
boreal bough
#

before/after lora
so yeah. my female body lora is not amused by big muscle guys 🤣
not sure what I was expecting... but... FOR SCIENCE

hardy cipher
#

meanwhile I'm making weird crap like this catlurk

brittle crater
#

Thats so freaking cool

hardy cipher
soft zealot
hardy cipher
#

I make some wacky spaghetti abominations

brittle crater
#

Hey, question. Is it because I have only 8gb of vram that sdxl takes 15-20 minutes to generate images? I would have to have a better gpu if I wanted it to be faster?

#

Lol minutes not seconds

boreal bough
brittle crater
#

Really?

soft zealot
brittle crater
#

Nope I have a gtx 1080

boreal bough
soft zealot
boreal bough
#

unless you're on A1111, in which case that makes sense. move to comfy for quick generation

brittle crater
#

Oh I am in A1111 actually

hardy cipher
#

I have 6gb vram, lol. but I get around 1.75 iterations per second. slow, but not that slow

brittle crater
#

Oh man thank god I asked

#

I thought it was normal

glad grove
#

normal for a1111 code

boreal bough
#

the time you're seeing is for cpu only

#

you dont need any gpu at all for that speed XD

hardy cipher
#

comfy isn't as hard as people make it out to be

#

you can literally just load workflows from images you drag in

gaunt urchin
brittle crater
boreal bough
#

1st no lora, last with lora fully loaded. (inbetween are partially loaded)
vertical is just 4 different seeds
(prompt asked for a 'young french girl' - which is why my endresult looks younger, as my age groups work better than base XL)

#

not an ideal prompt - but this is more of a real use case by random people

hardy cipher
soft zealot
#

Im sorry but I cant see how @brittle crater is generating a (lets assume) 1024 x 1024 image using an SDXL model in only 15-20 seconds on a 1080.

Im running a 1080Ti 11Gb in comfyui using the defauly nvidia_gpu start bat and E2E time using my standard 3 pass method (no idea how A1111 does it) takes to complete a "standard" 1024x1024 image is ~80 seconds

hardy cipher
#

well he did say minutes

crisp owl
#

corrected to minutes

soft zealot
hardy cipher
#

that's what it says

boreal bough
soft zealot
#

@brittle crater I apologise. My 59 year old eyes havent had enough caffeine this morning lol

hardy cipher
#

I could do a 12 step image in 20 seconds. maybe

soft zealot
soft zealot
hardy cipher
#

40/40 [01:17<00:00, 1.93s/it]

soft zealot
#

which is about right IMHO for the generation of card I'm running

hardy cipher
#

can't really complain with 6gb of vram I guess

fathom lagoon
hardy cipher
#

only thing that takes forever is loading models since I have them stored on an external drive due to space issues. but I'm going to clear out a bunch of space and at least have the models I'm loading on my ssd

soft zealot
soft zealot
hardy cipher
#

is there a way to have it store more than a couple models in my vram? because it seems to only store base and refiner normally

#

so if I switch the base model it always takes FOREVER. and I have 64gb of ram so I think it'll be alright to store multiple in there

soft zealot
brittle crater
soft zealot
hardy cipher
#

it's this guy again

soft zealot
#

thats whilst running base sampler

boreal bough
#

lol. trying out random dreamshaper prompts on my lora is funny XD

hardy cipher
#

gpu temp 37 degrees? are you in a freezer?

boreal bough
soft zealot
soft zealot
fathom lagoon
#

im guessing she lkes butterflys

hardy cipher
#

nice. well mine goes up to 85 C sometimes, 🔥

fathom lagoon
soft zealot
#

@hardy cipher Idle temps

hardy cipher
#

well I'm on a laptop, so even with a cooling pad and everything it cooks

#

I don't think mine ever gets down that low

#

one of the fans stopped working and I had to remove like 50 screws to get to the stupid thing

#

but somehow I managed to get it working again. I mean, it still wanted to work

soft zealot
#

random observation. Why does task manager show GPU temps but not CPU temps? (cant say Ive noticed before)

hardy cipher
#

there's the other program that shows all the temps. I forget what it's called though

hardy cipher
#

I don't have that one. I just use task manager because I always forget the name of the program I'm talking about

nimble ermine
#

Hey guys! I want to install SDLX 1.0 on Automatic 1111. I see there is a normal and a VAE version. Which one should I get? Do i need to install a VAE seperately if i get the VAE version?

soft zealot
nimble ermine
#

Hey thnx!

hardy cipher
#

that desktop has some things going on

soft zealot
#

says the man that uses the standard one and then loads VAEs seperately lol

hardy cipher
#

I'm going to try that psyanimated model on this goofy setup I have

vocal stream
#

it/s

hardy cipher
#

you

fathom lagoon
hardy cipher
#

where did you find that picture of me?

fathom lagoon
#

lol i knew it

boreal gorge
#

😂

hardy cipher
#

doin the dew

hardy cipher
#

yes, I'm concerned

gaunt urchin
fathom lagoon
boreal bough
#

what choices led to this prompt + setting combo 🤣

dawn hinge
hardy cipher
upbeat summit
hardy cipher
#

that's not just a dump of descriptive words

boreal bough
upbeat summit
hardy cipher
#

warhammer 40k

#

lol

upbeat summit
#

I made this last night, but the prompt above is not mine (but this uses also 60 steps sde karras)

boreal bough
#

his result on dreamshaper

hardy cipher
#

I need to have a prompting tool that makes prompts like that

boreal bough
#

but it was so oddly specific XD

#

I also tried it, with exact same settings

#

for science XD

hardy cipher
#

I'd say it's an ai generated prompt. but it probably isn't

boreal bough
upbeat summit
hardy cipher
#

man, I was making some comic abominations earlier. wasn't going for realistic like that though

boreal bough
#

lol.
base/lora
Warhammer 40000, wh40k, sister of battle

upbeat summit
boreal bough
#

base / lora
photo of a sister of battle in wall street, Warhammer 40000, wh40k, sister of battle

hardy cipher
#

there's a lot going on here

#

superbat

#

man

upbeat summit
#

good lineart quality and flashy colors... I like it

hardy cipher
#

I think that's the same seed number

upbeat summit
hardy cipher
#

then I added the ussr propaganda poster lora. that's what was missing

upbeat summit
hardy cipher
west breach
fathom lagoon
upbeat summit
fathom lagoon
soft zealot
boreal bough
#

ah, you're going all in on the artstyle? not asking for art while also not asking for art 🤣

fathom lagoon
boreal bough
#
P: wide shot, 1girl, beautiful french woman, as supergirl, super hero,, grey and blue digital camouflage pleated skirt. On a busy street in front of the eiffel tower. Skinny, flat chest, thigh gap,toned athletic body. Seductive,submissive,innocent. health. warhammer 40000, wh40k, sister of battle. by Frank Frazetta, intricate detail, modern, 16k, digital art, artstation, cinematic lighting, vivid, dystopian style comic wide shot, 1girl, beautiful french woman, as supergirl, super hero,, grey and blue digital camouflage pleated skirt. On a busy street in front of the eiffel tower. Skinny, flat chest, thigh gap,toned athletic body. Seductive,submissive,innocent. health. warhammer 40000, wh40k, sister of battle. by Frank Frazetta
 . graphic illustration, comic art, graphic novel art, vibrant, highly detailed . bleak, post-apocalyptic, somber, dramatic, highly detailed

N: ugly, deformed, noisy, blurry, low contrast, cheerful, optimistic, vibrant, colorful, photograph, deformed, glitch, noisy, realistic, stock photo, ugly, deformed, misshapen
#

XD

#

merged all your prompts! XD

#

sry. my lora still kills the artstyle

soft zealot
boreal bough
#

first was without.2nd with lora

soft zealot
boreal bough
#

damn those shoes it picked up are cool though

hardy cipher
#

damn, that is an impressive node explosion, winston

upbeat summit
#

testing with SoCalGuitarist's new model (ProtoVision XL)

#

that blue and green does not fit really well together

hardy cipher
#

I mean, she pulls it off pretty well

upbeat summit
#

with the cape of capes

hardy cipher
#

hmm. I wonder what "CR load lora" is. I'm missing crucial nodes from winston's workflow

soft zealot
#

its ComfyToll

soft zealot
#

ComfyRoll

upbeat summit
#

great nodes

soft zealot
#

it has an on/off switch

hardy cipher
#

ahh

#

I have on/off switchs from some node pack

#

well they have boolean inputs

#

that I think are backwards for some ungodly reason

upbeat summit
#

the switchers are pretty great from the CR package

boreal bough
soft zealot
upbeat summit
#

thanks, random seed

soft zealot
#

hmm think i need to drop the gen size a bit more

upbeat summit
#

"holding a pen" gave me at least 75% of the time a good hand pose with the correct amount of fingers

soft zealot
fathom lagoon
#

ok

soft zealot
#

with spaghetti turned off lol

fathom lagoon
#

ok im using auto 1111

#

seems like a cool ui

hardy cipher
#

linear paths are the way

soft zealot
#

Full spaghetti

hardy cipher
#

it's even sagging like spaghetti noodles

fathom lagoon
#

just for more controll?

soft zealot
#

Straght spaghetti

hardy cipher
#

eastwood420: more control, more efficient, less junk code

soft zealot
#

no spaghetti

soft zealot
hardy cipher
#

true. but I wish I'd started with it tbh

glad grove
#

no spaghetti

hardy cipher
#

damn, son

upbeat summit
hardy cipher
#

I knew nothing about anything when I started

soft zealot
fathom lagoon
#

ok, maybe ill switch over. I got Auto down pretty well

hardy cipher
#

but I like exploring and experimenting

#

eastwood, just give it a try if nothing else. you can drag images in and they'll autoload the workflow

#

even the a1111 images

fathom lagoon
#

it on hugging face?

hardy cipher
#

I got it from github

soft zealot
# fathom lagoon it on hugging face?

https://github.com/comfyanonymous/ComfyUI

And I recommend starting out with @high skiff s work flow for SDXL before trying to understand others

https://github.com/SytanSD/Sytan-SDXL-ComfyUI

GitHub

A powerful and modular stable diffusion GUI with a graph/nodes interface. - GitHub - comfyanonymous/ComfyUI: A powerful and modular stable diffusion GUI with a graph/nodes interface.

GitHub

A hub dedicated to development and upkeep of the Sytan SDXL workflow for ComfyUI - GitHub - SytanSD/Sytan-SDXL-ComfyUI: A hub dedicated to development and upkeep of the Sytan SDXL workflow for ComfyUI

high skiff
#

pingly dingly

soft zealot
#

oooh ello

west breach
dusk mica
#

What are you guys s/it in sdxl? Mine are 1.46 on rtx 2070

hardy cipher
#

1.75-2 it/s normally

fathom lagoon
#

2.19

hardy cipher
#

6gb vram, barely making it over here

fathom lagoon
#

is lower beter?

hardy cipher
#

yes, ideally you want to approach 0 it/s

short marsh
#

3.23 it/s 3080ti

hardy cipher
#

not bad, bud

boreal bough
#

comfy says 3.7

fathom lagoon
#

sometimes im seeing as low as 1.3 others as high as 2.19

short marsh
#

turning off the step previews made it hit 3.6

vocal stream
clever verge
#

2.15 it/s with a batch size of 4

nimble ermine
#

Guys I cant seem to make SDLX 1.0 run in automatic 1111. I have the model placed in the correct folder but everytime i try to switch to it in Auto1111 the process fails. I have updated Auto1111 to the newest version. Do you know what might the probelm be?

hardy cipher
#

you shouldn't use a1111

vocal stream
#

it mostly works on the dev branch with a1111 but on master you mostly oom

nimble ermine
#

why is that?

vocal stream
#

they just havent optimized it for sdxl as much as comfy have yet

nimble ermine
#

Ok i see .. I ll try that if nothing else works. Can i run ControlNet with Comfy?

vocal stream
#

there's no proper Controlnets for SDXL in general yet

nimble ermine
#

Ok thanks for the info

upbeat summit
vivid silo
soft zealot
vivid silo
#

Thank you very much

clever verge
#

Isn't it builtin now?

indigo carbon
#

alright, I managed to implement AIT to my workflow. takes ~18sec per image with this

boreal bough
#

without/with lora
(not saying one is better, just interesting) XD

heady vale
indigo carbon
west breach
#

I see the model database has moved and there are some new upscale models https://openmodeldb.info/?sort=date-desc

wind storm
upbeat summit
wind storm
#

Thanks!!

#

Will keep making similar stuff on the regular, feel free to subscribe.

upbeat summit
#

fits the trip into nature very well 🙂

wind storm
#

Got inspired when boating around at our summer house

#

A lot of beavers sneaking around

upbeat summit
#

liked and subscribed 😉

west breach
#

Doesn't SDXL know umbrellas are supposed to float mid air near the character??

upbeat summit
#

yeah, already got lots of floating umbrellas

#

but holding things isn't easy

ionic dragon
#

@west breach can you help me test my lora, i am finding hardtime trying to test, i am not understanding whether its the lora or the sd model generating selena gomez

rapid jasper
west breach
west breach
ionic dragon
ionic dragon
west breach
# ionic dragon yeah

the refiner is not trained, so it can override your lora, especially with faces

ionic dragon
west breach
#

don't think you can train the refiner, at least I haven't seen anyone talk about doing it

west breach
ionic dragon
#

so i can test them

west breach
west breach
ionic dragon
ionic dragon
#

how does it perform?

ionic dragon
#

dual characters

west breach
ionic dragon
# west breach

can you please try with same seed and not using the lora

west breach
ionic dragon
west breach
ionic dragon
#

oh ok

tribal knot
west breach
brazen patrol
ionic dragon
#

@soft zealot sorry for asking, your current workflow is very complex for me, is there any workflow which is bit simpler?

tribal knot
#

I'm new to this, does anyone have a good workflow for upscaling images on comfyui? for example, here's a small render I did that I want to blow up and maybe change some details along the way

soft zealot
ionic dragon
soft zealot
tribal knot
ionic dragon
#

wait, lemme try again with your workflow

#

it has those presets

#

so its great

soft zealot
#

geeeeez make your bloody mind up willyou

soft zealot
tribal knot
upbeat summit
tribal knot
soft zealot
# tribal knot got this error

thats an Efficiency Nodes Node.

Apparently there have been "issues" with differing versions floating around causing headaches for some people depending on there thehy installed them from and when

#

of course that presu

tribal knot
#

i downloaded the direct link from the github page like 10 mins ago and already had Deerfu

#

okay so it seems the import failed, i'll take my ass to tech support if i can't find a way to fix it

soft zealot
#

I recall someone syaing they ended upo gtabbing it from Civitai and that worked

#

This is both the blessing and the curse of ComfyUI (and many other Github based projects)

zinc lynx
#

wouldnt be so bad if there wasnt so many places to download things from

#

wasn't or weren't?

#

grammar is not my thing

crimson roost
#

Why is it that text encoding takes so much longer than actual image generation? Whenever I make a new prompt, it takes 10 minutes before it starts generating, and then subsequent generations each take less than a minute

#

Anything I can do to speed that part up?

rapid jasper
elfin cobalt
#

Also, is it 10 minutes again if you change the prompt afterwards?

#

Also,

#

Who needs prompts?

#

Traditional art, watercolor

ionic dragon
crimson roost
crimson roost
elfin cobalt
#

Not VRAM, RAM. Though also VRAM.

crimson roost
#

16 ran, 8 vram

elfin cobalt
#

Hmm~

crimson roost
#

Page file is set to a pretty high number too. Running on an sad

#

Ssd

elfin cobalt
#

The default behaviour of ComfyUI is to leave the text encoders in main memory and run it on the CPU, I believe.

#

That works fine, so long as it fits in main memory.

#

Which... is unlikely. ComfyUI uses 24GB of memory to me.

#

So there's your explanation. It's swapping out, and it's really, really slow to run anything where you need to swap in and out while it's running.

#

You need more memory. I'd recommend at least 32, but 64 will save you future woes. Fortunately main memory is cheap right now.

#

Alternately you could buy a 4090 and run with --gpu-only, but I don't know that I'd recommend that as the cheaper option. vv

crimson roost
#

Thank you! Good to know I can maybe make it better without spending a million dollars on a gpu

#

Is there nothing else I can do in the meantime? Are there pruned models with smaller text encoders? Kind of confused when it comes to the more technical workings of SD

soft zealot
boreal bough
#

without/with lora

#
P: style of Casey Baugh, an award-winning photography, an impressive close-up of the face of a skinny Gothic woman that captivates the viewer. Her pale, porcelain-like skin appears almost supernatural, while her dark eyes exude depth and penetration. The fine details of her face are accentuated by the interplay of light and shadow, highlighting her distinctive features. The dark lips lend a mysterious sensuality to her expression. This close-up captures the intensity and captivating allure of Gothic aesthetics in a single moment RAW photo, Fuji X-T20, (high quality,:1.2)

N: bad anatomy, bad hands, multiple eyebrow, (cropped), extra limb, missing limbs, deformed hands, long neck, long body, (bad hands), signature, username, artist name, conjoined fingers, deformed fingers, ugly eyes, imperfect eyes, skewed eyes, unnatural face, unnatural body, error, jpeg artifacts, painting by bad-artist, (worst quality:1.5), (low quality:1.5), (normal quality:1.5), lowres, RAW photo, Fuji X-T20, (high quality,:1.2), jpeg artifacts, painting by bad-artist, (worst quality:1.5), (low quality:1.5), (normal quality:1.5), lowres

civitai prompt are getting more and more absurd XD but hey! it works?

boreal bough
#

that was a whole story 🥲

soft zealot
soft zealot
boreal bough
#

and one more with negatives as well

#

a 70s grainy photograph of a portrait of a pretty young Swedish woman in a meadow in bloom, Desaturated, bleached colors, Hippie woodstock style, Marygolds, daysies, grass, Kodak, polaroid

sleek sky
#

First try with sdxl.. lots to learn but having fun!

soft zealot
steady grove
#

prompting both encoders seperately i view analogously to having more traction like having four wheel drive instead of rear or front . Or another is like it gives better focus on latent space like glasses instead of monocles

real hearth
#

Hi, is there a specific channel for support or helping others fix issues with the latest model?

#

I heard the model requires 8GB of VRAM which I do but it refuses to work

soft zealot
#

define "refuses to work"?

Thats like going to a GP and saying "I cont feel well"

Which UI are you using ?
What eeror mesasage (if any)) are you getting
describe the symptoms

tribal knot
real hearth
#

Yeah I'll admit I was being a bit barebones but I didn't know if this was the right channel

#

This is the error I'm getting

soft zealot
#

I mean there is a tech support channel but..............

soft zealot
real hearth
#

Nope but I have a feeling I should?

soft zealot
#

probably help 🙂

#

this is what I had in tehre when I was using A1111 both for my 6Gb 980Ti and the 11Gb 1080Ti

set COMMANDLINE_ARGS=--medvram --xformers --no-half --no-half-vae

real hearth
#

It actually worked! Thank so you much!

#

First creation ever

#

Oh yeah, gotta apply the xformers too

#

@soft zealot thanks again!

trim orbit
real hearth
#

Btw do we know what's the difference between the two available models?

upbeat summit
real hearth
#

Is the one I'm using yeah 👍🏻

#

what's vae btw

#

Sorry, I'm just been out of the image generation community for months

upbeat summit
#

all good 🙂 simply said the VAE decodes your image from the latent space into a RGB image.

real hearth
#

Basically it helps with images being better I guess?

autumn forum
#

It turns what sdxl makes and reads into what we can see and read

untold totem
#

So I was futzing around with the Sytan comfyui setup for SDXL, and the base pass is always set up to go to step 20 then goes to the refiner afterwards.

Is there a reason why its hard coded to go to step 20? Like, say I did 50 total steps, there would be way more refiner steps than if i did 30 total steps

#

I’ve written a little node chain to make the base pass a percentage of total steps, which seems more logical to me, but I just wanted to make sure there was a ‘reason’ it was hard coded that way?

crisp owl
indigo carbon
crisp owl
#

YMMV

soft zealot
#

@untold totem feel free to delve into my workflow (attached) and see how I did it, just dont ask for help lol

untold totem
#

Just wanted a sanity check haha

indigo carbon
#

idk about y'all's workflows, I managed to implement AIT and I'm very happy with this

boreal bough
crisp owl
#

ait?

soft zealot
indigo carbon
soft zealot
boreal bough
# soft zealot

want the current version of my finetune lora? am curious to see how that works out - as it is mostly refiner compatible - due to not focusing on a single face

soft zealot
crisp owl
#

huh, never seen or heard

soft zealot
indigo carbon
#

whatever. I'm getting 14it/s with SDXL on1024x1024 and I'm hella happy

boreal bough
#

short prompts arent working as intended yet - but since you mostly do longer prompts it should be good to go

crisp owl
#

How does it work, just a different processing method on your end, or does it send out information to process with Facebooks hardware?

indigo carbon
#

heck, it took me 13 seconds to generate this

crisp owl
#

Hmm, interesting, I'll have to download it and check that out. Perhaps take a look at the backend for curiosity sake

soft zealot
#

random question, is there a way to refresh the cahced LORA (7 other model) information/lists in COmfyUI without restartiong the server?

soft zealot
#

Hardware requirements:

NVIDIA: AIT is only tested on SM80+ GPUs (Ampere etc). Not all kernels work with old SM75/SM70 (T4/V100) GPUs.

indigo carbon
#

also, AIT isn't even implemented in the refiner stage, it can get even faster

#

anyways, back to some gens

trim orbit
#

love the volcano in a lavalamp motif

soft zealot
ashen oracle
#

@indigo carbon are you sharing your workflow?

indigo carbon
#

yeah, metadata

ashen oracle
#

should have checked first eh

boreal bough
soft zealot
boreal bough
#

it does modify clip though, and actually listens a lot more to what is asked - so keep that in mind

ionic dragon
#

@soft zealot after generating too many images, where do you thing sdxl is not good at?

#

i can try to train a lora

indigo carbon
trim orbit
#

why?!

indigo carbon