LoRA_Easy_Training_Scripts | 東方Project AI | Page 2

humble axle Mar 20, 2023, 3:27 PM

#

you should consider that "don't train the conv part to learn denoise"

normal charm Mar 20, 2023, 3:27 PM

#

More precise?

humble axle Mar 20, 2023, 3:27 PM

#

and actually generate image with different resolution is totally different task

#

so you should use the resolution that your base model trained for

#

if your base model is something merged and you want to train lora on it
I will recommend you to disable the conv layer with conv_dim=0

vivid python Mar 20, 2023, 3:28 PM

#

I'll throw in some parameters that I used for training a locon style lora of tab_head, just to give another setup.

4/1 linear
8/1 conv
5e-4
1200 steps
768x
Cosine
Warmup ratio of 0.05
Batch size 2

Got pretty much the entire style.

humble axle Mar 20, 2023, 3:28 PM

#

BTW

#

even with conv_dim=0
loha/locon will train more layer then kohya_ss' lora

ivory crescent Mar 20, 2023, 3:29 PM

#

Clyde censored my hands images Agony

normal charm Mar 20, 2023, 3:29 PM

#

humble axle if your base model is something merged and you want to train lora on it I will r...

I always train with nai, do i need to change it for that too?

humble axle Mar 20, 2023, 3:29 PM

#

basically only need to change the settings about locon/loha

vivid python Mar 20, 2023, 3:29 PM

#

ivory crescent Clyde censored my hands images <:Agony:996544918610784296>

Oof

humble axle Mar 20, 2023, 3:29 PM

#

res and other is ok

ivory crescent Mar 20, 2023, 3:30 PM

#

Agony

vivid python Mar 20, 2023, 3:30 PM

#

Hands be too sexy man

shut siren Mar 20, 2023, 3:31 PM

#

Interesting, I haven’t had issues with using high alpha in my training runs

#

High alpha meaning alpha=dim or basically no scaling

normal charm Mar 20, 2023, 3:31 PM

#

vivid python I'll throw in some parameters that I used for training a locon style lora of tab...

This does ofc vary across datasets and styles, needless to say. Im guessing this is best used a reference?

vivid python Mar 20, 2023, 3:32 PM

#

Reference yes, each dataset is different

shut siren Mar 20, 2023, 3:32 PM

#

When I tried low alpha in the past, it required giga cranking up the LR to learn anything

ivory crescent Mar 20, 2023, 3:32 PM

#

literally had to make imgur album because I couldn't post them https://imgur.com/a/ZUf57UA

normal charm Mar 20, 2023, 3:32 PM

#

All the math i learned in school and shit still make me feel smooth brained

vivid python Mar 20, 2023, 3:32 PM

#

But seems like training at 16/8 or 8/16 then resizing using the dynamic resizing might be better

normal charm Mar 20, 2023, 3:32 PM

#

Thanks us education system

vivid python Mar 20, 2023, 3:32 PM

#

Those are dims btw

vivid python Mar 20, 2023, 3:33 PM

#

ivory crescent literally had to make imgur album because I couldn't post them https://imgur.com...

#

Lel

#

Hands too sexy for imgur

ivory crescent Mar 20, 2023, 3:33 PM

#

DrollHell

normal charm Mar 20, 2023, 3:33 PM

#

vivid python But seems like training at 16/8 or 8/16 then resizing using the dynamic resizing...

Dynamic resizing?

vivid python Mar 20, 2023, 3:33 PM

#

Yeah

#

The lora resize script can resize locon

#

And it has a few dynamic resizing modes

normal charm Mar 20, 2023, 3:34 PM

#

Oh that

#

It only lowers the size correct?

vivid python Mar 20, 2023, 3:34 PM

#

Yes

#

But usually it can lower the size without breaking things

shut siren Mar 20, 2023, 3:35 PM

#

Maybe I need to try low alpha again

#

But it always seemed undercooked when I tried it

vivid python Mar 20, 2023, 3:35 PM

#

I've generally had success at low alphas

shut siren Mar 20, 2023, 3:35 PM

#

End up having to use giga learning rate to compensate

vivid python Mar 20, 2023, 3:35 PM

#

Almost all of my lora are either dim16 with an alpha of 8, or dim 8 with an alpha of 1

normal charm Mar 20, 2023, 3:35 PM

#

Wait
I shouldve asked this earlier
But are we talking about the base dim/alpha or just the dim/alpha for lycoris training

vivid python Mar 20, 2023, 3:36 PM

#

Both

ivory crescent Mar 20, 2023, 3:36 PM

#

what warmup ratio you use?

shut siren Mar 20, 2023, 3:36 PM

#

Back when sdscripts first implemented alpha and the default became 1, I was wondering why none of my loras were learning anything lmao

vivid python Mar 20, 2023, 3:36 PM

#

vivid python Almost all of my lora are either dim16 with an alpha of 8, or dim 8 with an alph...

This one is about lora specifically though

humble axle Mar 20, 2023, 3:36 PM

#

I don't change the ratio(?

#

I always use 100 warm up step

normal charm Mar 20, 2023, 3:37 PM

#

vivid python Almost all of my lora are either dim16 with an alpha of 8, or dim 8 with an alph...

Thats where i get further confused, so when is it necessarily to raise the dim past 16?

vivid python Mar 20, 2023, 3:37 PM

#

normal charm Thats where i get further confused, so when is it necessarily to raise the dim p...

Generally... never

#

Some styles may need more

ivory crescent Mar 20, 2023, 3:37 PM

#

it's ratio in derrian's argslist DrollFrozen

vivid python Mar 20, 2023, 3:37 PM

#

Which is why I set default to 32

ivory crescent Mar 20, 2023, 3:37 PM

#

but probably could calculate it for 100 steps

humble axle Mar 20, 2023, 3:37 PM

#

yeah

vivid python Mar 20, 2023, 3:37 PM

#

ivory crescent it's ratio in derrian's argslist <:DrollFrozen:1059076895951552513>

Yeah, sd-scripts doesn't have ratio

#

Just steps directly

shut siren Mar 20, 2023, 3:38 PM

#

I’m dying from seeing loras I want to try out on civitai and then they’re like 256dim or something

#

dejj

vivid python Mar 20, 2023, 3:38 PM

#

shut siren I’m dying from seeing loras I want to try out on civitai and then they’re like 2...

Resize them down

#

We have the means

normal charm Mar 20, 2023, 3:38 PM

#

Oh another thing
Would lycoris models have anything to do with why additional networks extension isnt loading them for me LucaWat

shut siren Mar 20, 2023, 3:39 PM

#

What settings do you prefer for dynamic resize?

vivid python Mar 20, 2023, 3:39 PM

#

Yeah, provided it was a loha it doesn't load at all

vivid python Mar 20, 2023, 3:39 PM

#

shut siren What settings do you prefer for dynamic resize?

"It depends"

normal charm Mar 20, 2023, 3:39 PM

#

vivid python Yeah, provided it was a loha it doesn't load at all

It was DrollFrozen

ivory crescent Mar 20, 2023, 3:39 PM

#

I guess also 512 res with that batch size

#

or A100 KoiSee

vivid python Mar 20, 2023, 3:39 PM

#

normal charm It was<:DrollFrozen:1059076895951552513>

You need to use the locon webui extension to load loha

#

And you can only load it using the built in method

#

Not additional networks

normal charm Mar 20, 2023, 3:40 PM

#

I…thought i already installed it?

vivid python Mar 20, 2023, 3:40 PM

#

Loha weren't integrated into additional networks though

#

Only the webui way, you know, the lora:name:1 thing

humble axle Mar 20, 2023, 3:41 PM

#

ivory crescent I guess also 512 res with that batch size

actually you can use 4090 with bs12~16 on 640*640

#

with gradient checkpoint

ivory crescent Mar 20, 2023, 3:42 PM

#

I would have to use cache latents too probably

#

I like random crop NotLikeKogasa

normal charm Mar 20, 2023, 3:42 PM

#

vivid python You need to use the locon webui extension to load loha

#

That not it?

vivid python Mar 20, 2023, 3:42 PM

#

And it's up to date?

normal charm Mar 20, 2023, 3:42 PM

#

I just updated an hour ago

ivory crescent Mar 20, 2023, 3:42 PM

#

I "only" have 3090, but probably only thing that matters is VRAM

vivid python Mar 20, 2023, 3:43 PM

#

normal charm I just updated an hour ago

Then it should load loha fine, provided you use the prompt box for it

normal charm Mar 20, 2023, 3:43 PM

#

Yeah its only the prompt box that understands

#

Additional networks is just borked

vivid python Mar 20, 2023, 3:43 PM

#

It's not broken

#

It just doesn't support loha

normal charm Mar 20, 2023, 3:44 PM

#

Can it support loha?

vivid python Mar 20, 2023, 3:44 PM

#

Nope

#

Not without a rewrite

shut siren Mar 20, 2023, 3:44 PM

#

do yall adjust learning rate for batch size?

normal charm Mar 20, 2023, 3:44 PM

#

Penthsive

vivid python Mar 20, 2023, 3:44 PM

#

shut siren do yall adjust learning rate for batch size?

I never change my batch size from 2

normal charm Mar 20, 2023, 3:44 PM

#

Not for batch size, not me

vivid python Mar 20, 2023, 3:44 PM

#

So not really

normal charm Mar 20, 2023, 3:45 PM

#

I used to do batch size 3, but since using locon or loha i had to lower it to 2

ivory crescent Mar 20, 2023, 3:45 PM

#

what buckets do you use?

shut siren Mar 20, 2023, 3:45 PM

#

im a batch size 1 user lol

humble axle Mar 20, 2023, 3:45 PM

#

default in kohya

#

I m bs1 also (in llama
but I use grad acc step 64 (X

ivory crescent Mar 20, 2023, 3:46 PM

#

I remember showing Salt my batch size when making softprompt for LLM, and he though it was for WD Agony

shut siren Mar 20, 2023, 3:46 PM

#

gonna try dim 8/4 with 1 alpha for both

#

and see what happens

#

considering i was alr using lr 4e-4 with alpha=dim this might be undertrained

ivory crescent Mar 20, 2023, 3:47 PM

#

batch size 400 and 64 grad steps Agony

shut siren Mar 20, 2023, 3:47 PM

#

is there a way to tell if your lora/locon underflows with high alpha?

humble axle Mar 20, 2023, 3:47 PM

#

ivory crescent I remember showing Salt my batch size when making softprompt for LLM, and he tho...

XD

humble axle Mar 20, 2023, 3:47 PM

#

shut siren is there a way to tell if your lora/locon underflows with high alpha?

no

#

but you can check the weight of lora_up

#

for loha I need to check(

shut siren Mar 20, 2023, 3:48 PM

#

cuz my results have always been usable with alpha=dim

humble axle Mar 20, 2023, 3:48 PM

#

just check if any weight is super small

#

and has lot of zero

shut siren Mar 20, 2023, 3:48 PM

#

but i dont know if there is any problem with it lol

humble axle Mar 20, 2023, 3:48 PM

#

and in sometimes

#

my friend actually meet something like:
"All zero in the weight"

#

super underflow

normal charm Mar 20, 2023, 3:50 PM

#

I never touch weight decay
Idk if i should

shut siren Mar 20, 2023, 3:50 PM

#

i nudged it up to 0.1

normal charm Mar 20, 2023, 3:50 PM

#

Thats the default no?

humble axle Mar 20, 2023, 3:50 PM

#

You should use weight decay if you are using small dataset

shut siren Mar 20, 2023, 3:50 PM

#

default is 0.01 iirc

humble axle Mar 20, 2023, 3:50 PM

#

and actually amount<10k is small dataset

#

(some people will say <100K is small BTW XD)

ivory crescent Mar 20, 2023, 3:51 PM

#

humble axle and actually amount<10k is small dataset

me when my subject has 12 images

normal charm Mar 20, 2023, 3:51 PM

#

10k? Images?

shut siren Mar 20, 2023, 3:51 PM

#

some of my subject datasets are like 20-30 lol

humble axle Mar 20, 2023, 3:51 PM

#

I also has trained locon with 5 img total

shut siren Mar 20, 2023, 3:51 PM

#

Agony

humble axle Mar 20, 2023, 3:51 PM

#

and just do some crop aug on it

#

and get good result

#

super good result

shut siren Mar 20, 2023, 3:52 PM

#

ive been using random crop

normal charm Mar 20, 2023, 3:52 PM

#

Who the hell has a more than 10K dataset

humble axle Mar 20, 2023, 3:52 PM

#

shut siren Mar 20, 2023, 3:52 PM

#

to try to get a pseudo larger dataset lol

humble axle Mar 20, 2023, 3:52 PM

#

normal charm Who the hell has a more than 10K dataset

me

ivory crescent Mar 20, 2023, 3:52 PM

#

normal charm Who the hell has a more than 10K dataset

me

normal charm Mar 20, 2023, 3:52 PM

#

humble axle Mar 20, 2023, 3:52 PM

#

umamusume 60k dataset
after repeat for balance the amount of each characters

#

the size become 350k

normal charm Mar 20, 2023, 3:53 PM

#

Beefy ass pcs u got

ivory crescent Mar 20, 2023, 3:53 PM

#

what does pc have to do with dataset size lol

#

maybe training time chenThink

normal charm Mar 20, 2023, 3:53 PM

#

If i train on large sizes i get prone to crash after awhile

#

Too large sizes

ivory crescent Mar 20, 2023, 3:54 PM

#

@humble axle I'm about to test your settings but I don't think batch size 8 is good for 38 images chenThink

shut siren Mar 20, 2023, 3:55 PM

#

is there any drawback to high alpha besides the risk of underflowing?

humble axle Mar 20, 2023, 3:55 PM

#

ivory crescent <@267129877889679370> I'm about to test your settings but I don't think batch si...

I use batch size 8 for 22 img (after aug) before

#

just add 10x repeat on it XD

humble axle Mar 20, 2023, 3:56 PM

#

shut siren is there any drawback to high alpha besides the risk of underflowing?

higher grad, more unstability?
I don't know

#

just a scale for output

ivory crescent Mar 20, 2023, 3:56 PM

#

but warmup still 100 steps? it will be like 1 epoch 💀

#

also adamW8bit right?

vivid python Mar 20, 2023, 3:58 PM

#

normal charm Thats the default no?

0.1 is the default on my scripts, because I found it worked better than the normal default

shut siren Mar 20, 2023, 3:58 PM

#

normal default is 0.01 iirc

normal charm Mar 20, 2023, 3:59 PM

#

Noted

humble axle Mar 20, 2023, 4:00 PM

#

ivory crescent but warmup still 100 steps? it will be like 1 epoch 💀

10

ivory crescent Mar 20, 2023, 4:00 PM

#

how many steps do you do for small datasets?

humble axle Mar 20, 2023, 4:01 PM

#

lower than 1k but I forgot

#

maybe 300~600

#

or if you can read the metadata from pt file(

ivory crescent Mar 20, 2023, 4:02 PM

#

I, in fact, know how to read

humble axle Mar 20, 2023, 4:02 PM

#

good

#

https://civitai.com/models/14878/loconlora-yog-sothoth-depersonalization

[LoCon/LoRA] Yog-Sothoth (Depersonalization) | Stable Diffusion LoC...

Yog Sothoth Trained with LoRA and LoCon, Detail for locon: KohakuBlueleaf/LoCon: LoRA for convolution network ( github.com ) Extension for using lo...

#

here is it

ivory crescent Mar 20, 2023, 4:02 PM

#

yes, being able to read is good

humble axle Mar 20, 2023, 4:02 PM

#

train with 5img (22 after aug)

#

and get super good result

#

I also using this as style model BTW

#

the light/shadow is good for me

ivory crescent Mar 20, 2023, 4:03 PM

#

by aug you mean cropping, modyfing dataset by yourself

#

or flip/crop args?

humble axle Mar 20, 2023, 4:03 PM

#

crop + flip

#

I manually modify it

ivory crescent Mar 20, 2023, 4:03 PM

#

yea, I do that too for less than 30 images or even more

shut siren Mar 20, 2023, 4:04 PM

#

alpha=1 seems kinda undertrained if i keep the rest of the settings the same

#

not getting the outfit details as precisely

humble axle Mar 20, 2023, 4:05 PM

#

try little bit higher lr

shut siren Mar 20, 2023, 4:05 PM

#

im already at 4e-4 which seems kinda high

humble axle Mar 20, 2023, 4:05 PM

#

not

#

with alpha=1 and cosine scheduler you can just push to 8e-4 (

#

basically depends on your dataset

shut siren Mar 20, 2023, 4:05 PM

#

any change to the text encoder lr?

humble axle Mar 20, 2023, 4:05 PM

#

WAIT

#

you have different LR for unet/te?

shut siren Mar 20, 2023, 4:06 PM

#

yeah

#

te lower

humble axle Mar 20, 2023, 4:06 PM

#

just use both 4e-4

shut siren Mar 20, 2023, 4:06 PM

#

than unet

humble axle Mar 20, 2023, 4:06 PM

#

just use both 4e-4

shut siren Mar 20, 2023, 4:06 PM

#

you think maybe the TE is undertrained?

ivory crescent Mar 20, 2023, 4:07 PM

#

I still don't know what TE does in single subject LoRAs chenThink

#

I only noticed it when I did like 8 characters

shut siren Mar 20, 2023, 4:07 PM

#

ive been using a lower TE lr than unet just because those were the defaults ive seen

ivory crescent Mar 20, 2023, 4:07 PM

#

When I had to increase TE

shut siren Mar 20, 2023, 4:07 PM

#

not because of any theoretical reason

normal charm Mar 20, 2023, 4:07 PM

#

Never bothered messing with either of those

#

AYAYAYA

shut siren Mar 20, 2023, 4:08 PM

#

you just used the same for both?

humble axle Mar 20, 2023, 4:08 PM

#

yes

#

I just cannot understand why people use different(

shut siren Mar 20, 2023, 4:08 PM

#

i think the thought is the TE doesnt need as much training for this

humble axle Mar 20, 2023, 4:08 PM

#

when you are doing some f/t like this

#

actually te learn much than unet

#

undertrained is almost always = te undertrained

shut siren Mar 20, 2023, 4:09 PM

#

i heard TE can blow up if the LR is too high

humble axle Mar 20, 2023, 4:09 PM

#

yeah it will

#

but you have alpha=1

#

no need to afraid that

#

alpha=1 for dim=8 means 1/8 grad(

#

(BTW actually all the thing will blow up if LR is too high)

shut siren Mar 20, 2023, 4:10 PM

#

the old colab i used months ago for DB did this method where it would train just the TE first, and then freeze the TE and train unet after that

humble axle Mar 20, 2023, 4:10 PM

#

(UNet is not more stable, just in the past you only train the transformer block in it)

#

Oh I cannot say UNet is not more stable

#

It is more stable (In some other experiments)

#

but it will blow up

humble axle Mar 20, 2023, 4:11 PM

#

shut siren the old colab i used months ago for DB did this method where it would train just...

ummm

#

betwee make sense and not make sense(

normal charm Mar 20, 2023, 4:13 PM

#

Surely theres a dictionary for all the training terminology there is, right?

#

deadsmile

humble axle Mar 20, 2023, 4:13 PM

#

:deadsmile:

#

oh I need to use another one

shut siren Mar 20, 2023, 4:14 PM

#

major differences between adamw and adamw8bit?

humble axle Mar 20, 2023, 4:14 PM

#

8bit is way more smaller

shut siren Mar 20, 2023, 4:14 PM

#

is there a difference in speed or quality?

humble axle Mar 20, 2023, 4:15 PM

#

speed... not that significant
quality is also not that siginificant

humble axle Mar 20, 2023, 4:15 PM

#

humble axle https://civitai.com/models/14878/loconlora-yog-sothoth-depersonalization

trained with adamw8bit

shut siren Mar 20, 2023, 4:17 PM

#

ive been avoiding increasing batch size because my datasets are giga small

humble axle Mar 20, 2023, 4:17 PM

#

no need

#

pro-tip

#

Ideal situation is:
batch size = dataset size

shut siren Mar 20, 2023, 4:18 PM

#

also because it sometimes gives weird step numbers

#

if an aspect ratio bucket has a # of images that isnt a multiple of batch size

humble axle Mar 20, 2023, 4:19 PM

#

oh right

ivory crescent Mar 20, 2023, 4:19 PM

#

batch size 16k lets go

humble axle Mar 20, 2023, 4:19 PM

#

the step number...

shut siren Mar 20, 2023, 4:19 PM

#

bleh the results from my unet/te both 4e-4 run still seem worse than my old alpha=dim locon

humble axle Mar 20, 2023, 4:20 PM

#

use higer lr

#

maybe your dataset need higher lr

#

I use 5e-4 for dataset with trigger word only(?
(remove all tags about the trigger word)

#

If you use tag/caption + trigger word

#

you will need higher lr

shut siren Mar 20, 2023, 4:22 PM

#

oh yeah this is trigger word + tags

normal charm Mar 20, 2023, 4:22 PM

#

Thats all my datasets ever are

humble axle Mar 20, 2023, 4:23 PM

#

trigger word + relative tags need higher lr

shut siren Mar 20, 2023, 4:23 PM

#

when i tried trigger word only the first time i trained lora, it exploded

#

even at 1e-4 lol

normal charm Mar 20, 2023, 4:23 PM

#

humble axle trigger word + relative tags need higher lr

Base lr or te

humble axle Mar 20, 2023, 4:23 PM

#

all

#

te is important than unet(

#

(Just considering about TI/HYN, actually modify things for TE)

#

(And anime diffusion also just change the cond layer only and get totally different style)

#

oh wait

#

HYN is for unet transformer

#

I'm wrong sorry

#

(or something also for TE?)

normal charm Mar 20, 2023, 4:25 PM

#

The only lr ive ever seen anyone use for te is 5e-5
Or 1e-5 (mightve been unet, unsure)

#

Most times no one has it set

shut siren Mar 20, 2023, 4:26 PM

#

maybe i just go back to what i was doing before at alpha=dim, since i found LR that works for that lol

humble axle Mar 20, 2023, 4:26 PM

#

if you are always using same dim

#

considering about use that alpha always

#

fixed alpha ratio should not work(

shut siren Mar 20, 2023, 4:27 PM

#

the alpha itself matters more than the alpha relative to the dim?

#

and yeah i basically always use the same dim size

#

8 linear 4 conv for locon

#

4/2 for loha

humble axle Mar 20, 2023, 4:30 PM

#

yes

humble axle Mar 20, 2023, 4:30 PM

#

shut siren the alpha itself matters more than the alpha relative to the dim?

ratio = scale
but fixed scale means you need to tune lr for different dim(size)

#

if you fixed alpha

#

means your ratio is related to your size

#

which is good for NN

shut siren Mar 20, 2023, 4:31 PM

#

the loras i have on civitai are 128dim/128alpha lol

humble axle Mar 20, 2023, 4:31 PM

#

LOL

shut siren Mar 20, 2023, 4:31 PM

#

since i know the old behavior of sdscripts before 0.4.0 was basically alpha=dim

#

cuz there was no scaling

#

hmmm interesting that the details im getting on this character's thighhighs are wayyy better when the character is standing than sitting lol

#

i guess its just somewhat unreliable with the sitting pose

humble axle Mar 20, 2023, 4:35 PM

#

XD

shut siren Mar 20, 2023, 4:36 PM

#

maybe alpha=1 just needs 8e-4

#

double the LR of alpha=8

humble axle Mar 20, 2023, 4:36 PM

#

make sense

shut siren Mar 20, 2023, 4:36 PM

#

8e-4 text encoder makes me uneasy lol

humble axle Mar 20, 2023, 4:36 PM

#

XD

shut siren Mar 20, 2023, 4:37 PM

#

thats more than an order of magnitude higher than my previous

normal charm Mar 20, 2023, 4:38 PM

#

Trying another training using scraps of everything we just talked about
I wont say i know what im doing now, but at least i know…somewhat more than before?

shut siren Mar 20, 2023, 4:41 PM

#

i feel like ppl use widely different settings and still manage to arrive at a usable result

normal charm Mar 20, 2023, 4:41 PM

#

Well yes civitai is proof of that

#

Its more a matter of efficiency than usability

#

Well

shut siren Mar 20, 2023, 4:41 PM

#

im just trying to figure out if there are better settings than what im using that will more consistently give a usable result

normal charm Mar 20, 2023, 4:41 PM

#

Those are kinda the same thing

#

Somewhat

#

MilimThink

shut siren Mar 20, 2023, 4:41 PM

#

like my old 128dim/128alpha loras

#

were usable

#

i made some pretty cool gens with them

humble axle Mar 20, 2023, 4:42 PM

#

shut siren i feel like ppl use widely different settings and still manage to arrive at a us...

just remember

#

all the setting is depend on your dataset your task your model... depends on all the thing

shut siren Mar 20, 2023, 4:42 PM

#

but now im a low dim believer

humble axle Mar 20, 2023, 4:42 PM

#

I'm 1dim believer (X

shut siren Mar 20, 2023, 4:42 PM

#

save disk space

normal charm Mar 20, 2023, 4:42 PM

#

Theres an extension that helps view the details of other models people have trained (not additional net)

shut siren Mar 20, 2023, 4:43 PM

#

theres some 256dim users on civitai

humble axle Mar 20, 2023, 4:43 PM

#

I also recevied an issue about blow up LoHA

#

and that guy just use

#

384dim loha

#

I don't know why

shut siren Mar 20, 2023, 4:43 PM

#

surely more dim = better result

humble axle Mar 20, 2023, 4:43 PM

#

higher than the dim of some layer in the UNet

shut siren Mar 20, 2023, 4:43 PM

#

moklueless

humble axle Mar 20, 2023, 4:44 PM

#

so we should use 100B stable diffusion(O

normal charm Mar 20, 2023, 4:44 PM

#

Highest ive ever gone was 128
I was under the impression low dims were for characters and higher were for styles

humble axle Mar 20, 2023, 4:44 PM

#

I actually use 4dim for style XD

#

but since I use 1dim for character
so higher for style may be correct?

shut siren Mar 20, 2023, 4:45 PM

#

damn 1dim

humble axle Mar 20, 2023, 4:45 PM

#

Like this
4dim loha

#

Fuzi style

shut siren Mar 20, 2023, 4:45 PM

#

i mean i guess ppl make TIs for characters and those are even smaller than 1dim loras

humble axle Mar 20, 2023, 4:45 PM

#

Yeah

normal charm Mar 20, 2023, 4:45 PM

#

I figured higher dim did something like account for more of the details in the image or something

#

Hence better for styles

normal charm Mar 20, 2023, 4:46 PM

#

shut siren i mean i guess ppl make TIs for characters and those are even smaller than 1dim ...

That would be true except everyones switched to loras now

#

Or at least as far as ive seen

shut siren Mar 20, 2023, 4:46 PM

#

you use 1dim for linear and conv?

normal charm Mar 20, 2023, 4:47 PM

#

I didn’t knoe what to set it to at first, so i tried keeping them close to the base alpha and dim i had set

#

Like if i had 8/16 locon would be 9/18 or something

shut siren Mar 20, 2023, 4:48 PM

#

you were using alpha higher than dim?

normal charm Mar 20, 2023, 4:48 PM

#

Rarely

#

I only ever kept the alpha to between 8 and 16, and MAYBE 32

#

On occasions where things werent working

shut siren Mar 20, 2023, 4:51 PM

#

damn its competent when the char is standing, but cant do this when the char is sitting lol

normal charm Mar 20, 2023, 4:55 PM

#

Such perfect ai symmetry makes me cry of joy

proper ember Mar 20, 2023, 5:41 PM

#

shut siren damn its competent when the char is standing, but cant do this when the char is ...

That character has nice thighhighs

#

What's her name?

normal charm Mar 20, 2023, 5:59 PM

#

Target concept:

lycoris_princess_kantai_collection_drawn_by_tomamatto__58bf2337bfa020b99b73fe609abf7bc7.png

#

Result:

27413-843882437-_best_quality_highly_detailed_illustration_Intricate_promotional_art_8k_wallpaper_lycoris_princess_abyssal_ship_1girl_lo.png

#

so what went wrong now

📎 config-1679333789.495957.json

#

@humble axle

humble axle Mar 20, 2023, 6:02 PM

#

If you can get good result with normal lora
Just continue using it
If you get bad result with all the things

You need to adjust your dataset

#

And I need yo sleep sry(

normal charm Mar 20, 2023, 6:03 PM

#

deadsmile

shut siren Mar 20, 2023, 6:43 PM

#

Agony

#

I expanded some of my datasets from 20 to 30 images, but I don’t feel that it’s necessarily an improvement for all of them

vivid python Mar 20, 2023, 6:53 PM

#

I usually do 60-100 images for a dataset

#

50-150 is what I consider good enough

normal charm Mar 20, 2023, 6:59 PM

#

Im trying to train 24 img dataset

#

Its missed twice Agony

proper ember Mar 20, 2023, 7:04 PM

#

I would consider remaining with old version of both scripts personally with Linux

#

Wasn't a good idea updating both to latest version

#

I too had the same bad accuracy problem as his

normal charm Mar 20, 2023, 7:36 PM

#

3rd miss

#

deadsmile

normal charm Mar 20, 2023, 8:20 PM

#

4th miss

#

Agony

normal charm Mar 20, 2023, 9:11 PM

#

5th miss
But this time, it retained more details after i changed the base alpha from 1 to 4

#

All the others had it at one

#

So so far alpha=1 aint lookin good

#

Does that need higher steps or lr?

vivid python Mar 20, 2023, 9:23 PM

#

long story short, setting alpha to 1 only really works at low dims

#

higher than 8 set it to half

normal charm Mar 20, 2023, 9:24 PM

#

But it was at 8 lol

#

8/1

#

Are we talking lower than that even?

shut siren Mar 20, 2023, 9:39 PM

#

Low alpha needs higher LR

#

When they introduced alpha and it dropped my 128/128 loras (at the time) to 128/1, it essentially learned nothing on the same settings

ivory crescent Mar 20, 2023, 11:10 PM

#

chenThink

#

trying to use gradient checkpointing

#

no way this is working, I have batch size 8 at 768 res

vivid python Mar 21, 2023, 12:11 AM

#

higher batch size isn't necessarily better, not at our scale

shut siren Mar 21, 2023, 2:39 AM

#

my friend asked me to try to train with a single image - and its a fkin discord emoji

#

pretty sure this will result in absolute junk

vivid python Mar 21, 2023, 3:25 AM

#

i'd be surprised if it works

shut siren Mar 21, 2023, 3:30 AM

#

it made some absolute junk but we got a few laughs

#

doesnt help that the emoji is 64x64

normal charm Mar 21, 2023, 5:25 AM

#

Two additional misses in the past few hours since last update

#

8th attempt in progress

normal charm Mar 21, 2023, 6:13 AM

#

8th attempt ended up retaining the most details with these parameters:
4/8 linear
8/1 conv
5e-4 lr
0.05 warmup
1200 steps
This is when trained as a locon
The outputs still fuck up body anatomy fairly bad, but it showed signs of getting the concept right, so this is next checkpoint i think

#

Next ill try doing loha and see how that fares

ivory crescent Mar 21, 2023, 11:53 AM

#

@humble axle tested your settings with higher/lower TE/LE and 512/768 and they work pretty well, will have to make LoRA to compare the results but at least they work, so thanks for the help

#

KomachiLove

humble axle Mar 21, 2023, 11:57 AM

#

emoji_catlight

normal charm Mar 22, 2023, 2:41 PM

#

Tried training my 9th attempt with the same settings as my 8th, but as a loha this time. Result was another complete miss

#

Loha needs more focus or something on it right?

vivid python Mar 22, 2023, 5:23 PM

#

Loha usually take more to train, because they compress their dims

normal charm Mar 22, 2023, 5:35 PM

#

vivid python Loha usually take more to train, because they compress their dims

Does that equate to setting it for more steps then? Ive been doing 1500 so far to keep it safe

vivid python Mar 22, 2023, 5:38 PM

#

Or higher lr

worn locust Mar 22, 2023, 5:45 PM

#

1500 is definitely too low

normal charm Mar 22, 2023, 5:49 PM

#

How much then

vivid python Mar 22, 2023, 5:49 PM

#

For loha, either a high lr or like 3k steps

#

The way I train in general is start at 1e-3 for 800 steps

#

Figure out if it's good enough, if not then figure out what needs to be changed

#

If it's baked but didn't learn anything, lower lr increase steps

#

If it learned most things but not everything and isn't baked, increase steps a bit

#

If it learned nothing and wasn't baked, increase lr

#

Though I never had to do that one

#

I usually end up with something like 5e-4 for 1600 steps

#

Also. I generally keep TE at 1e-4 in pretty much every case

vivid python Mar 22, 2023, 5:53 PM

#

worn locust 1500 is definitely too low

I just realized your name is 5e-4, that's perfect

worn locust Mar 22, 2023, 5:55 PM

#

That's solid advice

vivid python Mar 22, 2023, 5:56 PM

#

I learned it from a dude with a masters in datascience

#

I picked his brain a lot

#

At least I'm pretty sure he said masters

#

It was a few months ago at this point

normal charm Mar 23, 2023, 6:05 AM

#

vivid python If it learned nothing and wasn't baked, increase lr

This is what i had to fall back to and that didnt help, so im thinking the issue is something else

vivid python Mar 23, 2023, 6:06 AM

#

Might be the dataset

#

Ah right, I should be asleep

#

I've gotta wake up early

normal charm Mar 23, 2023, 6:07 AM

#

Np

#

Ill just refer back to old trainings and improve from there ig

normal charm Mar 23, 2023, 11:22 AM

#

@vivid python im the only one who frequents this place bc of endless questions and issues Agony

anyway, i updated to v6

vivid python Mar 23, 2023, 12:12 PM

#

That's the first time I've seen the venv creation just straight up fail

#

I literally just updated my install of sd-scripts to torch 2.1.0 as well

#

So I know it exists

#

The v6 installer is meant to be in its own folder

#

Because it installs everything of course

#

But the torch_update.bat is just supposed to nuke the venv then reinstall everything with installing the new torch

normal charm Mar 23, 2023, 12:54 PM

#

vivid python The v6 installer is meant to be in its own folder

I did do this, on desktop

#

Had to download and install an older python version tho, but i digress, i suppose

vivid python Mar 23, 2023, 1:56 PM

#

I only tested everything on 3.10.6

#

That's why

normal charm Mar 23, 2023, 2:30 PM

#

vivid python That's why

So how do i fix the venv issue?

normal charm Mar 23, 2023, 6:19 PM

#

sterile bolt Mar 23, 2023, 8:04 PM

#

yo, somebody training locon/lyco in LoRA_Easy_Training_Scripts local repo? how can i understand what exactly trains right now? and if anyone can, please send me configuration file for analysis, thanks in advance

normal charm Mar 23, 2023, 8:15 PM

#

Id tell u if i could train rn but

#

AbbyShrug

sterile bolt Mar 23, 2023, 8:16 PM

#

I'll wait

#

does this mean i'm training lycoris?

digital kite Mar 23, 2023, 10:07 PM

#

sterile bolt does this mean i'm training lycoris?

lycoris with lora algo is locon

normal charm Mar 24, 2023, 1:54 AM

#

normal charm So how do i fix the venv issue?

@vivid python

vivid python Mar 24, 2023, 1:54 AM

#

not sure

#

the torch_update.bat should just re-create the venv

normal charm Mar 24, 2023, 1:58 AM

#

deadsmile

normal charm Mar 24, 2023, 2:25 AM

#

Welp
Guess im disabled now

worn locust Mar 24, 2023, 3:27 AM

#

https://tenor.com/view/homer-simpson-stupid-you-qualify-as-disabled-gif-17539790

Tenor

normal charm Mar 24, 2023, 3:30 AM

#

normal charm Mar 24, 2023, 7:36 AM

#

Cant reinstall v5 either

#

Same issue

#

When did the program suddenly decide i dont have xformers installed

#

facepalm

normal charm Mar 24, 2023, 3:50 PM

#

@vivid python idk what happened, but as stated above, i cant seem to reinstall v5 either
It all seems tied to a torch error

vivid python Mar 24, 2023, 3:50 PM

#

v5 shouldn't have any issues with installation

#

the only difference in v6 is the option for the new torch version

#

and proper checks on python version

normal charm Mar 24, 2023, 3:53 PM

#

vivid python and proper checks on python version

Running the kohya v4 installer also gives the same error about not finding a satisfactory requirement

vivid python Mar 24, 2023, 3:53 PM

#

v4 cannot work

#

v5 was when I completely changed the code

normal charm Mar 24, 2023, 3:54 PM

#

Guess i can delete that then
But that still doesnt explain the v6 and v5 issue

vivid python Mar 24, 2023, 3:54 PM

#

it doesn't you're right

#

but it's likely an issue with your computer

#

rather than my scripts

#

as I haven't had any other issues with it, nor any other reports of issues

normal charm Mar 24, 2023, 3:55 PM

#

That makes it even more obsure

#

Since everything i need should already be installed

vivid python Mar 24, 2023, 3:56 PM

#

which is only python 3.10.6

#

and git

normal charm Mar 24, 2023, 3:57 PM

#

Yep
I have git and 3.10.6 installed

vivid python Mar 24, 2023, 4:01 PM

#

i'm running through the installer again

#

so far no problems

normal charm Mar 24, 2023, 4:04 PM

#

vivid python so far no problems

Tried redownloading the py file
No dice

#

vivid python Mar 24, 2023, 4:05 PM

#

how do you have 3.10.6 installed?

normal charm Mar 24, 2023, 4:05 PM

#

Wdym by how

vivid python Mar 24, 2023, 4:05 PM

#

you can get it either through the app store or from the website

#

the app store version is dogshit

normal charm Mar 24, 2023, 4:06 PM

#

Oh no i went directly to the site

vivid python Mar 24, 2023, 4:06 PM

#

and I'm assuming you added it to path?

#

do you have any other versions of python?

normal charm Mar 24, 2023, 4:08 PM

#

vivid python do you have any other versions of python?

I checked the add to path option in the installer im certain
And before this version, i actually had 10.9 or something installed, with which training sessions never encountered issues.

vivid python Mar 24, 2023, 4:08 PM

#

is it still installed?

normal charm Mar 24, 2023, 4:09 PM

#

I only downloaded 10.6 after this error started appearing
I assumed installing 10.6 would overwrite any and all 10.9 stuff

vivid python Mar 24, 2023, 4:09 PM

#

not at all

normal charm Mar 24, 2023, 4:09 PM

#

Should I uninstall and reinstall then?

vivid python Mar 24, 2023, 4:09 PM

#

just uninstall python 3.10.9

normal charm Mar 24, 2023, 4:11 PM

#

Wait i also have a 3.11 too…?

#

Wtf

vivid python Mar 24, 2023, 4:11 PM

#

that's 100% the issue then

#

3.11 doesn't work at all

#

that I know for sure

normal charm Mar 24, 2023, 4:12 PM

#

Yeah theres actually 3 different python versions here now
Im not sure how it was working before

vivid python Mar 24, 2023, 4:14 PM

#

some things seem to want to install 3.11 for some reason

#

i've had it happen too

normal charm Mar 24, 2023, 4:19 PM

#

vivid python i've had it happen too

Wow. That didn’t fix it ._.

vivid python Mar 24, 2023, 4:19 PM

#

might be better to just uninstall all python versions then reinstall then

normal charm Mar 24, 2023, 4:32 PM

#

vivid python might be better to just uninstall all python versions then reinstall then

Managed to collect torch, so far so good

vivid python Mar 24, 2023, 4:33 PM

#

I don't know how your python got fucked up, but I'm also not surprised

#

python is honestly really shitty

normal charm Mar 24, 2023, 4:34 PM

#

Its a miracle (though i was unaware), that it even worked in the past at all

normal charm Mar 24, 2023, 5:01 PM

#

vivid python python is honestly really shitty

almost had it DrollFrozen

vivid python Mar 24, 2023, 5:04 PM

#

what does it say above?

normal charm Mar 24, 2023, 5:04 PM

#

vivid python what does it say above?

Just a repeat of the same distribution line

vivid python Mar 24, 2023, 5:07 PM

#

looks like diffusers it throwing a fit

normal charm Mar 24, 2023, 5:09 PM

#

Do i use one of the update bats?

normal charm Mar 24, 2023, 5:27 PM

#

#

I would like to state for the record that I do try what I can to fix stuff before I go pinging you, which is why ur not getting a ping every 3-5 minutes @vivid python

vivid python Mar 24, 2023, 5:32 PM

#

Oh I get it, I not annoyed or anything, I just don't have notifications on

#

Thay being said

#

Try running torch_update.bat

#

And installing 1.12.1

#

The original torch and see if it runs

normal charm Mar 24, 2023, 5:51 PM

#

vivid python The original torch and see if it runs

it did

#

just keep using it with 1.12.1 then?

vivid python Mar 24, 2023, 6:04 PM

#

Yeah, seems like torch 2 isn't working for you which is odd

#

I haven't had somebody have issues with it

normal charm Mar 24, 2023, 6:15 PM

#

vivid python I haven't had somebody have issues with it

Just remember me as the guy who is problem prone

#

momi

vivid python Mar 24, 2023, 6:16 PM

#

guess so

#

and it sucks too

#

because torch 2.1.0 made my bake times go from 2.5 hours to 1.5 hours

#

and i'm on a 3060

normal charm Mar 24, 2023, 6:26 PM

#

vivid python and i'm on a 3060

3070
Ive heard these arent the best with training

vivid python Mar 24, 2023, 6:27 PM

#

they are alright, less vram than me

normal charm Mar 24, 2023, 6:27 PM

#

Mhm
Maybe i shouldnt be too surprised

quiet notch Mar 27, 2023, 4:55 AM

#

I was trying to get this trainer up and running, but I was only able to get it working on torch 1.12.1, not on >2.0. I have a GTX1070, so I'm just randomly assuming it's not supported.

#

Oh, I remember my issue now

#

It was something about bitbytes saying that there was no kernel available for execution for my device

#

That said, I don't know if I can ever actually run with torch >2.0 cirnoHelpImDyingInside

#

that speedup sounds nice

#

it looks like there may be solution for me to try in here...

quiet notch Mar 27, 2023, 5:33 AM

#

nope, still wasn't able to get it up and running

#

There's that dreaded RuntimeError: CUDA error: no kernel image is available for execution on the device

#

Running 1070 on win10

shut siren Mar 27, 2023, 6:10 AM

#

try turning off 8bit adam

#

or errr

#

using regular adamw instead of adamw8bit

#

i know adamw8bit isnt compatible with GTX 1080 and will throw that error

#

@quiet notch

quiet notch Mar 27, 2023, 6:11 AM

#

i will give that a try

#

hopefully ram isn't an issue...

shut siren Mar 27, 2023, 6:25 AM

#

does it run?

quiet notch Mar 27, 2023, 6:26 AM

#

nope

#

same issue

#

#

switched over as you said

#

i noticed that from earlier messages, the easy_training folder is supposed to have a venv?

#

i'm not running in venv (didn't do an earlier step correctly perhaps?), could that be part of the issue?

#

@shut siren i might have to sleep soon so we can resume this some other time (unless you want me to troubleshoot rn fast)

#

i am using loha settings, btw

shut siren Mar 27, 2023, 6:35 AM

#

well "no kernel image is available for execution on your device" implies that something is incompatible with your hardware

quiet notch Mar 27, 2023, 6:36 AM

#

gtx 1070 8gb

#

cirnoSaddest

#

guess i'll be having an early christmas soon

shut siren Mar 27, 2023, 6:36 AM

#

have you tried using base sdscripts

quiet notch Mar 27, 2023, 6:37 AM

#

i have yet to touch it directly

shut siren Mar 27, 2023, 6:37 AM

#

it may be torch 2.0 as you mentioned before?

#

i know that bitsandbytes doesnt like GTX 1080

#

but you shouldnt need bitsandbytes unless youre trying to run adamw8bit

#

so theres definitely something else

#

i had someone in another discord also run into that kernel image error, but it specifically indicated bitsandbytes, and he was using sdscripts with the bmaltais wrapper on GTX 1080 - fixed immediately when he switched to regular adamw

quiet notch Mar 27, 2023, 6:39 AM

#

cirnoNotLikeThis

shut siren Mar 27, 2023, 6:40 AM

#

you got things working on older torch version?

quiet notch Mar 27, 2023, 6:40 AM

#

yeah, it works on 1.12.1

shut siren Mar 27, 2023, 6:40 AM

#

it might just be that torch 2.0 doesnt like your hardware

quiet notch Mar 27, 2023, 7:01 AM

#

Do you know if gradient acc steps is properly supported? I was getting this warning...

#

None of the inputs have requires_grad=True. Gradients will be None

#

not too sure if this is specific to the easy training scripts or lora in general

normal charm Mar 27, 2023, 8:00 AM

#

Im pretty sure 2.0 didnt work for me either, and im on 3070

#

I was just instructed to use 1.12

vivid python Mar 27, 2023, 11:08 AM

#

quiet notch gtx 1070 8gb

10 series cards requires a specific patch, I'm gonna assume you applied it, I had believed that it should just work, but perhaps the main.py file changed for torch2.1.0, which actually makes sense.

#

Perhaps it might actually be that the dlls need to be updated to support torch 2.1.0, which means, likely, that until a new version gets built, 10 series cards cannot use torch 2

quiet notch Mar 27, 2023, 1:05 PM

#

I did hear something about building from scratch

normal charm Mar 27, 2023, 8:51 PM

#

Im surprised i never asked this question before but if epoch count is ignored when max steps is set, are the number of repeats also ignored?

quiet notch Mar 27, 2023, 8:58 PM

#

normal charm Im surprised i never asked this question before but if epoch count is ignored wh...

From what I understand, number of repeats merely multiplies you inputs. It shouldn't have any effect on what you set your max steps to be (assuming max steps is the point at which the model is forcefully stopped)

#

Speaking of multiplying inputs, I noticed that when the repeat config multiplies the input, the "epoch" then becomes based on that multiplied dataset rather than the original.

#

if I understood my readings correctly, epoch is when the AI looks over your images once and has updated parameters. But with the repeat variable, when you look over all of your data once * repeat, it only counts as 1 epoch instead of what the repeat is.

#

i'm even more confused when i take into account gradient acc steps and the config parameter epoch, because the shown epoch during training can be higher than what you originally intended for it to be, and then on top of that, since input is multiplied, the real epoch is actually the shown epoch multiplied by repeats?

#

cirnoConfused

normal charm Mar 27, 2023, 9:06 PM

#

https://tenor.com/view/math-confused-gif-21641306

Tenor

quiet notch Mar 27, 2023, 9:06 PM

#

normal charm Im surprised i never asked this question before but if epoch count is ignored wh...

so to properly answer this question, i dont think it should because num repeats doesn't effect the point at which you stop, assuming max steps is the training cutoff

normal charm Mar 27, 2023, 9:07 PM

#

That was my understanding of max steps at least

#

So what it sounds like is it just affects how many images are processed during training

#

However long its specified to go

quiet notch Mar 27, 2023, 9:13 PM

#

though...

#

assuming max steps refers to optimization steps (the step in which parameters are updated)

#

hm

#

nevermind

#

i was thinking that if a "step" referred to an image training iteration, then if you set repeats to like 10 and max steps to 10, then you would only go through 1 image out of your entire dataset

normal charm Mar 28, 2023, 5:33 AM

#

okay so ik for a FACT this sample folder did not exist before, despite the date modified column saying its been here since a few weeks ago
i have really really wanted a an output folder for samples so i could better track the training progress, but from, what I can tell, theres no path argument or whatever that can be assigned for that

#

#

Theres the option to tell it how often to dropout samples, but apparently the output directory for the lora isnt enough for that (according to the error i get when using that function)

Theres no way im that blind bc i constantly check this folder after training things, downloading things, etc

#

No way i wouldnt have noticed this, but if it can actually be done then where do i specify where it drops the samples?

worn locust Mar 28, 2023, 5:41 AM

#

you just got mandela effect'd by derrian

vivid python Mar 28, 2023, 5:43 AM

#

nani

#

I don't think there is an option to set where samples go

#

at least, there isn't an arg for it

#

yeah, I don't see one

#

it is supposed to output to the outputs folder

#

I can only see the problem being that there wasn't a proper txt file being pointed to for samples

#

but dunno

#

anyways, gonna sleep now

#

if you still have questions

#

I'll answer in the morning

normal charm Mar 28, 2023, 7:25 AM

#

The samples i found in that folder were from a lora i trained awhile back
And i have only ever trained loras using ur script, at least, as far as i can remember

#

Which makes this all the more confusing

#

Since any time i turn the samples arg on, it just gives me path errors during training

vivid python Mar 28, 2023, 1:07 PM

#

Not sure then, there's no way to change the folder for samples, but the path error can be it looking for the txt file and not able to find it

normal charm Mar 28, 2023, 1:35 PM

#

vivid python Not sure then, there's no way to change the folder for samples, but the path err...

Wait what txt file

vivid python Mar 28, 2023, 1:36 PM

#

The text file that has all your prompts

#

It doesn't just pull random prompts from the dataset

normal charm Mar 28, 2023, 1:37 PM

#

U mean the sample prompt txt?

#

Is that what it needs? I do remember i used it for that lora

normal charm Mar 28, 2023, 2:19 PM

#

@vivid python

vivid python Mar 28, 2023, 2:23 PM

#

normal charm Is that what it needs? I do remember i used it for that lora

Yep, it's required

normal charm Mar 28, 2023, 2:25 PM

#

vivid python Yep, it's required

I see. Also, ‘nother quick question: how high do u reckon my lr should be if i train at, say, 5 dim?

vivid python Mar 28, 2023, 2:27 PM

#

Depends on the dataset

#

I usually follow

#

The idea of 1e-3 to see how it bakes really quickly

#

Then adjust based on those outputs

#

Dim doesn't affect lr a ton, from what I know

normal charm Mar 28, 2023, 2:30 PM

#

Ok, thank u

sterile bolt Mar 28, 2023, 7:04 PM

#

so i use 41 pic dataset with 5 repeats for this settings to lora training (local LoRA_Easy_Training_Scripts):
self.optimizer_type: str = "AdamW8bit" self.scheduler: str = "cosine_with_restarts" self.cosine_restarts: Union[int, None] = 3 self.learning_rate: Union[float, None] = 1e-4 self.unet_lr: Union[float, None] = 1e-4 self.text_encoder_lr: Union[float, None] = 5e-5 self.net_dim: int = 128 self.alpha: float = 128 self.train_resolution: int = 768 self.batch_size: int = 6 self.num_epochs: int = 12

then i trying to start train LyCORIS with same dataset and same count of repeats when add next settings:
i changed value self.net_dim: int = from 128 to 16
i changed value self.alpha: float = from 128 to 1
self.lyco: bool = True self.locon_dim: Union[int, None] = 8 self.locon_alpha: Union[int, None] = 1 self.locon: bool = True
add self.network_args: Union[dict[str:str], None] = {
algo": "lora"
"conv_dim": "8"
"conv_alpha": "1"
"disable_conv_cp": "True"
}`

as a result, i get an output image worse than lora. im dumb?

SPOILER_xyz_grid-0046-2581195198-indoors20portrait20facing20viewer20sheffield20_azur20lane_20standing201girl20solo20maid20headdress20hair20over20one20eye20maid20yello.png

vivid python Mar 28, 2023, 7:14 PM

#

sterile bolt so i use 41 pic dataset with 5 repeats for this settings to lora training (local...

base dim 128?

sterile bolt Mar 28, 2023, 7:14 PM

#

for lora 128, for lycoris 16

vivid python Mar 28, 2023, 7:15 PM

#

eh, lora don't need to be dim128

#

https://github.com/derrian-distro/LoRA_Easy_Training_Scripts

GitHub

GitHub - derrian-distro/LoRA_Easy_Training_Scripts: A set of two tr...

A set of two training scripts written in python for use in Kohya's SD-Scripts repository. - GitHub - derrian-distro/LoRA_Easy_Training_Scripts: A set of two training scripts written in pyth...

#

also update pushed

vivid python Mar 28, 2023, 7:15 PM

#

sterile bolt so i use 41 pic dataset with 5 repeats for this settings to lora training (local...

also, 16/8 is better than 16/1

#

and locon is just not better for characters because of style bleed

#

and lower dims require higher steps

#

or higher lr

sterile bolt Mar 28, 2023, 7:17 PM

#

vivid python eh, lora don't need to be dim128

i always used in lora 128/128 and i like output results, as in preview above

vivid python Mar 28, 2023, 7:18 PM

#

I usually never use dim128 lora

#

either I resize them

sterile bolt Mar 28, 2023, 7:19 PM

#

can i increase number of steps by increasing repeats of dataset? like from 5 to 20

vivid python Mar 28, 2023, 7:19 PM

#

or just don't use them

vivid python Mar 28, 2023, 7:19 PM

#

sterile bolt can i increase number of steps by increasing repeats of dataset? like from 5 to ...

more epochs

sterile bolt Mar 28, 2023, 7:19 PM

#

or how i can try increase amount of steps

vivid python Mar 28, 2023, 7:19 PM

#

better that way

sterile bolt Mar 28, 2023, 7:22 PM

#

value self.net_dim value self.alpha
this is need to change when training lycoris or will be used only conv values?

vivid python Mar 28, 2023, 7:23 PM

#

it will use both

sterile bolt Mar 28, 2023, 7:24 PM

#

NotLikeThis

#

which one is better trying to training - lycoris or locon?

vivid python Mar 28, 2023, 7:28 PM

#

depends

#

actually, it doesn't

#

lycoris is locon

#

they are the same

sterile bolt Mar 28, 2023, 7:29 PM

#

self.lyco: bool = True # turn on if you want to use the new locon architecture

#

so, different architectures

#

which one is better for now

vivid python Mar 28, 2023, 7:31 PM

#

between LoRA, LoCon, and LoHa

#

LoHa is trash

#

the other two have their uses

#

neither one is better than the other

#

LoCon learn styles much better

#

because they train on the whole model

#

LoRA are better for characters, because they don't learn style easily

#

and you can certainly get exactly everything right about a character at dim16

#

just, don't do 16/1

#

that just doesn't work

#

16/8 works

#

8/1 works

#

16/1 doesn't

sterile bolt Mar 28, 2023, 7:34 PM

#

Sheffield has a lot of details, so i want to try to do it as accurately as possible, lora can't convey all the details

normal charm Mar 28, 2023, 7:34 PM

#

vivid python LoHa is trash

After extensive trying
I concur

#

DrollFrozen

vivid python Mar 28, 2023, 7:35 PM

#

sterile bolt Sheffield has a lot of details, so i want to try to do it as accurately as possi...

I managed to get Shun + Shun small into dim16, who has just as many small details

sterile bolt Mar 28, 2023, 7:35 PM

#

lora?

vivid python Mar 28, 2023, 7:35 PM

#

Hatakaze has a really complex pattern on her kimono, dim16 got that as well

#

lora

sterile bolt Mar 28, 2023, 7:36 PM

#

i cant reach all details on dim 128

#

https://tenor.com/view/homelander-speech-bubble-the-boys-upset-crying-gif-26284753

Tenor

vivid python Mar 28, 2023, 7:36 PM

#

train better?

sterile bolt Mar 28, 2023, 7:37 PM

#

what i need to change? all my settings are above

vivid python Mar 28, 2023, 7:37 PM

#

eh, all of it

#

high batch size lends itself to learning less often

#

1e-4 works well at 16/8, if you set that for all and go like 3k steps

sterile bolt Mar 28, 2023, 7:38 PM

#

vivid python high batch size lends itself to learning less often

is it good or bad

vivid python Mar 28, 2023, 7:38 PM

#

bad usually

#

the way batch size works is that it merges that many images into one latent

#

then trains on that latent

#

means it learns less small details usually

#

I train at batch 2

sterile bolt Mar 28, 2023, 7:40 PM

#

then why i bought 3090

vivid python Mar 28, 2023, 7:40 PM

#

speed

sterile bolt Mar 28, 2023, 7:40 PM

#

nah

vivid python Mar 28, 2023, 7:40 PM

#

training takes a 5th the time it does on my 3060

#

regardless, you do you

sterile bolt Mar 28, 2023, 7:40 PM

#

vivid python 1e-4 works well at 16/8, if you set that for all and go like 3k steps

this settings for lora, right?

vivid python Mar 28, 2023, 7:41 PM

#

I've stopped giving concrete advice because people don't usually actually follow it

sterile bolt Mar 28, 2023, 7:41 PM

#

and how u calculate ur amount of steps?

vivid python Mar 28, 2023, 7:41 PM

#

I don't

#

I just set steps

sterile bolt Mar 28, 2023, 7:42 PM

#

but how it works

vivid python Mar 28, 2023, 7:42 PM

#

I use the variable to set max train steps

#

kohya handles the rest

#

I pretty much never use repeats

#

unless the dataset is really small

sterile bolt Mar 28, 2023, 7:42 PM

#

41 is small?

vivid python Mar 28, 2023, 7:43 PM

#

3 repeats probably

#

epochs save instantly

#

so I see no reason to just use more epochs

sterile bolt Mar 28, 2023, 7:43 PM

#

k i will try ur settings rn

vivid python Mar 28, 2023, 7:43 PM

#

you usually need to bake a bunch of attempts and tweak settings

#

this will be true regardless of dim

sterile bolt Mar 28, 2023, 7:44 PM

#

but i won't get 12 epochs in 8 minutes due to batch size change cry

vivid python Mar 28, 2023, 7:44 PM

#

speed is the enemy of accuracy, I've found

#

anyways

#

not gonna be looking at this chat for the next whole day

#

gonna be away from computer

sterile bolt Mar 28, 2023, 7:46 PM

#

which scheduler do you use?

kindred belfry Mar 28, 2023, 8:41 PM

#

vivid python LoHa is trash

Or you probably don't know the right hyperparameter to use?

#

To claim something it is probably better to show some evidence as what I have always been doing Drool
(Providing dataset composition, hyperparameters, trained networks, xyz grids etc

vivid python Mar 28, 2023, 8:47 PM

#

sterile bolt which scheduler do you use?

Usually cosine or cosine with restarts

vivid python Mar 28, 2023, 8:48 PM

#

kindred belfry To claim something it is probably better to show some evidence as what I have al...

Not worth it, I don't plan on spending the hours to train loha when locon have gotten better results

#

I also don't plan on getting into this

kindred belfry Mar 28, 2023, 8:49 PM

#

That's your choice of not switching to loha, and I understand it. In the end they are not that different, but claiming loha is trash is totally misleading.

vivid python Mar 28, 2023, 8:49 PM

#

Loha don't improve over locon

kindred belfry Mar 28, 2023, 8:50 PM

#

Could be. It depends on many factors. I don't claim any of them to be an improvement in the end.

vivid python Mar 28, 2023, 8:50 PM

#

The problem is how much must be done to make them work

#

And even then, they usually end up having a larger file size from my experience

kindred belfry Mar 28, 2023, 8:51 PM

#

It always works for me

#

Maybe for some reason it doesn't work well with your configuration

#

For some others they find loha to be better

vivid python Mar 28, 2023, 8:52 PM

#

I've had them work, but I've only seen them worth messing with when making something huge

sterile bolt Mar 28, 2023, 8:52 PM

#

vivid python Usually cosine or cosine with restarts

im using cosine with 3 restarts, so, idk, your settings (16 dim, 8 alpha, 1e-4, 2 batch size, 3 repeats and 3k steps) dont seems to be good (lora with your settings from right)

SPOILER_xyz_grid-0054-2675608733-indoors20upper20body20facing20viewer20sheffield20_azur20lane_20standing201girl20solo20maid20headdress20hair20over20one20eye20maid20yel.png

kindred belfry Mar 28, 2023, 8:52 PM

#

Personally I think they are on par. We really need to see the dataset, the training hyperparameters, and the results to be able to say something

vivid python Mar 28, 2023, 8:52 PM

#

Like the entirety of umamusume

vivid python Mar 28, 2023, 8:53 PM

#

sterile bolt im using cosine with 3 restarts, so, idk, your settings (16 dim, 8 alpha, 1e-4, ...

Meh, you do you

#

I'm not gonna help

sterile bolt Mar 28, 2023, 8:54 PM

#

CracksInTheWall

vivid python Mar 28, 2023, 8:55 PM

#

I don't help people anymore

#

Because I've continually gotten "fuck you" from people

sterile bolt Mar 28, 2023, 8:55 PM

#

but i dont say that

vivid python Mar 28, 2023, 8:56 PM

#

Sure, you probably don't, but I've just stopped wanting to help people because of how often it does happen

#

That being said, I'm also phone posting, which makes it annoying to type

sterile bolt Mar 28, 2023, 8:58 PM

#

https://tenor.com/view/kiss-a-homie-kiss-smack-gif-17593938

Tenor

vivid python Mar 28, 2023, 8:58 PM

#

In general

vivid python Mar 28, 2023, 8:59 PM

#

sterile bolt https://tenor.com/view/kiss-a-homie-kiss-smack-gif-17593938

I have literally no idea what that means in this case

sterile bolt Mar 28, 2023, 9:02 PM

#

it means thanks for help anyway

vivid python Mar 28, 2023, 9:08 PM

#

I see

#

BTW, just as a thing to mention

#

Could be the dataset

normal charm Mar 28, 2023, 9:37 PM

#

vivid python I don't help people anymore

Dw about that with me, i already feel like enough of a burden if i ask for help more than 3 times

vivid python Mar 28, 2023, 9:38 PM

#

normal charm Dw about that with me, i already feel like enough of a burden if i ask for help ...

Your questions are very basic, doesn't usually take much time to answer

#

Also technical questions are usually less of an issue

#

It's more or less actual training questions that I struggle to find the will to help with now

worn locust Mar 28, 2023, 9:57 PM

#

sterile bolt so i use 41 pic dataset with 5 repeats for this settings to lora training (local...

If you go from dim 128 to dim 16 and don't increase the learning rate, then obviously the result will be trash
I always do 5e-4, try that

vivid python Mar 28, 2023, 10:05 PM

#

worn locust If you go from dim 128 to dim 16 and don't increase the learning rate, then obvi...

you can do 1e-4, but you need a ton of steps

worn locust Mar 28, 2023, 10:13 PM

#

yee

shut siren Mar 29, 2023, 12:16 AM

#

You’re going to need a much higher LR going from 128/128 to 16/1

sterile bolt Mar 29, 2023, 6:24 AM

#

worn locust If you go from dim 128 to dim 16 and don't increase the learning rate, then obvi...

i will try, thanks, you set 5e-4 for all settings? (lr, unet, text encoder)

worn locust Mar 29, 2023, 7:14 PM

#

sterile bolt i will try, thanks, you set 5e-4 for all settings? (lr, unet, text encoder)

text encoder 1e-4

worn locust Mar 29, 2023, 7:46 PM

#

So what's min_snr_gamma

vivid python Mar 29, 2023, 9:05 PM

#

a thing

#

no clue what it actually does

#

but apparently it improves training

normal charm Mar 29, 2023, 9:30 PM

#

xyz chart time

shut siren Mar 30, 2023, 12:59 AM

#

it adjusts for the fact that loss is inversely proportional to noise timestep

#

kinda normalizes it

#

https://arxiv.org/abs/2303.09556

arXiv.org

Efficient Diffusion Training via Min-SNR Weighting Strategy

Denoising diffusion models have been a mainstream approach for image
generation, however, training these models often suffers from slow convergence.
In this paper, we discovered that the slow convergence is partly due to
conflicting optimization directions between timesteps. To address this issue,
we treat the diffusion training as a multi-task ...

worn locust Mar 30, 2023, 1:45 AM

#

pogchimp

normal charm Mar 30, 2023, 2:13 AM

#

8426holyskull

normal charm Mar 30, 2023, 3:20 PM

#

Anyone know how to format the negative prompt in the sample prompt txt file?

quiet notch Mar 30, 2023, 3:27 PM

#

Does anyone know if the warmup_lr_ratio is based on optimization steps or epochs? Or maybe even total image iterations...?

vivid python Mar 30, 2023, 3:29 PM

#

normal charm Anyone know how to format the negative prompt in the sample prompt txt file?

Do your normal prompt first then, on the same line --n then your negative prompt, might actually be only one dash though

vivid python Mar 30, 2023, 3:30 PM

#

quiet notch Does anyone know if the `warmup_lr_ratio` is based on optimization steps or epoc...

It's based on whatever you use to calculate your steps, if you give a step count it will use that, if you give it epochs then it will Calc the step count and use that

#

Basically it's the ratio of total amount of steps

vivid python Mar 31, 2023, 1:12 PM

#

Has anybody managed to get loha working? I want to replicate a training setup to see if I can provide a decent starting point for loha, because they are pretty different from training locon or lora

normal charm Mar 31, 2023, 1:22 PM

#

Everytime i trained loha, it took way too many times to get it right

vivid python Mar 31, 2023, 1:27 PM

#

Seems really finicky

#

But I was thinking

#

What if we can get more accurate styles out of roughly the same space as locon

#

I say this because, unlike locon, which don't work with cp decompression

#

Loha seem to work well with it

#

So we might be able to reduce the size of loha to match that of dim16 dim8 locon

#

With better results than dim16 conv dim8 locon

#

But I'm only thinking about this in a purely hypothetical context

#

Because I haven't managed to bake a loha that I was entirely happy with

normal charm Mar 31, 2023, 1:38 PM

#

Ive only managed to once I believe

#

And I haven’t been able to reproduce the results

#

The thing that sucks about training imo is that even if one models training settings worked well, it cant’t be reproduced to work for another

#

At least in my experience

vivid python Mar 31, 2023, 1:41 PM

#

Unfortunate

normal charm Mar 31, 2023, 1:41 PM

#

Even similar dataset sizes don’t promise anything

vivid python Mar 31, 2023, 1:41 PM

#

It might not be possible to have good defaults then

normal charm Mar 31, 2023, 1:42 PM

#

I would agree with that assessment, yes.
More likely, it’s better to have defaults to work from rather than work with.

vivid python Mar 31, 2023, 1:43 PM

#

Well yes, but I don't think I've had an instance of loha working without 20 or so bakes

#

Which means I have no clue what would make sense

normal charm Mar 31, 2023, 1:49 PM

#

Again, loha takes work to get right. I already stuggle enough with normal locon training, cant get anything without training it at least 5 times. Analyzing how loha works is not my best interest currently

#

For that, I’ll wait for more knowledgeable people to take the reigns

kindred belfry Mar 31, 2023, 2:03 PM

#

vivid python Has anybody managed to get loha working? I want to replicate a training setup to...

I suppose you are asking someone else than me because I did the entire series of my experiments in loha. Still drop the message here just in case. You can otherwise check the lohas posted on civitai. (Like the umamusume ones of mht, or probably the one trained by the person that asks about supports of loha to be added in comfui

vivid python Mar 31, 2023, 2:48 PM

#

I mean, if you managed to consistently get good results I'll look though your setup

#

I should mention though, most of the loha on civitai are trained poorly, so I don't plan on using them as basis

kindred belfry Mar 31, 2023, 3:04 PM

#

I don't know what you mean by a good model. You can try mine anyway https://civitai.com/models/21305/tenten-character-lohafullckpt

Tenten-character-LoHa/FullCkpt 転生王女と天才令嬢の魔法革命 | Stable Diffusion Lo...

All the intermediate checkpoints can be found in https://huggingface.co/alea31415/tenten-characters The base model is ACertainty Trained at clip sk...

vivid python Mar 31, 2023, 4:14 PM

#

Well I have some questions, primarily, why train on ACertainty?

#

Actually, also why clip skip 1?

kindred belfry Mar 31, 2023, 4:22 PM

#

No difference of clip 1 and clip 2 in my experiment. 金Goldkoron#9929 from Mynefactory said he found the model to be better if trained on clip skip 1. As sd is initially trained on clip skip 1 I see no true reason to train on clip skip 2.

vivid python Mar 31, 2023, 4:23 PM

#

But why train on ACertainty?

#

Over nai

kindred belfry Mar 31, 2023, 4:23 PM

#

Training on acertainty is basically as good as nai

#

And I don't want to say I train on nai. That's all.

vivid python Mar 31, 2023, 4:24 PM

#

But mixes have a track record of destroying lora

kindred belfry Mar 31, 2023, 4:24 PM

#

See my experiment for that

vivid python Mar 31, 2023, 4:24 PM

#

Especially anythingv3

kindred belfry Mar 31, 2023, 4:24 PM

#

Acertainty is basically like nai in terms of how the trained Lora performs

#

Acertainty is not mix

vivid python Mar 31, 2023, 4:25 PM

#

Also, while that looks fine, it's not exactly the use case most will use loha for

kindred belfry Mar 31, 2023, 4:25 PM

#

I cannot help in that case

#

I only train model for multiple concepts at a time

vivid python Mar 31, 2023, 4:26 PM

#

People will likely use loha to train one character or one style

#

So I need to have a setup that is decent in that case

kindred belfry Mar 31, 2023, 4:26 PM

#

Though my task is supposedly harder so I don't see why it would not work for them if it works for me

vivid python Mar 31, 2023, 4:27 PM

#

I think it's a result of you having more data actually

kindred belfry Mar 31, 2023, 4:27 PM

#

Oh yeah. Probably.

vivid python Mar 31, 2023, 4:27 PM

#

Because I've had very poor results with smaller concepts

kindred belfry Mar 31, 2023, 4:28 PM

#

It has been long time that I don't train anymore on dataset of 30 images

#

The last time I did it saw probably in November

vivid python Mar 31, 2023, 4:28 PM

#

Perhaps I should make it very clear in my scripts that loha should only be used when you are training a bunch of concepts at once

#

I don't mean a small dataset btw

kindred belfry Mar 31, 2023, 4:28 PM

#

Or sufficient images for one concept

vivid python Mar 31, 2023, 4:28 PM

#

I mean a small amount of different concepts

kindred belfry Mar 31, 2023, 4:28 PM

#

I don't think that matters

vivid python Mar 31, 2023, 4:29 PM

#

I think it does

kindred belfry Mar 31, 2023, 4:29 PM

#

On the other extreme you also have loha of blueleaf

#

Wait

#

Trained on 5 images

vivid python Mar 31, 2023, 4:29 PM

#

Because with only one concept, for example, it seems to not actually fill everything

#

I had this issue when I tried to train a loha on unicorn

kindred belfry Mar 31, 2023, 4:30 PM

#

Alright it was locon, then I don't know

#

What do you mean by fill everything

vivid python Mar 31, 2023, 4:31 PM

#

Seems like it just has some data that doesn't really get trained

#

In the case of unicorn, her China dress would periodically be the incorrect color

#

Which was more common than I wanted

#

Which meant that her dress wasn't learned as much as it should have at the amount of steps I'm used to baking

#

This was true for all three of her outfits

kindred belfry Mar 31, 2023, 4:33 PM

#

Could be.

#

I cannot say how you compromise training speed and quality with all the hyperparameters and captioning technique

#

This is too complex to investigate

#

I probably train much longer than most people anyway

vivid python Mar 31, 2023, 4:35 PM

#

3k steps is usually the furthest I go

#

Unless I have a particularly large dataset

#

500+ images

kindred belfry Mar 31, 2023, 4:36 PM

#

Like the one above is 40k steps with batch 8

vivid python Mar 31, 2023, 4:36 PM

#

I use batch 2

#

I don't really have the vram to go higher

kindred belfry Mar 31, 2023, 4:36 PM

#

So just not the same scale lol

vivid python Mar 31, 2023, 4:36 PM

#

Not at all

#

So long story short

#

Loha requires a ton more training

#

Unlikely that people will use it given that it seems to be a huge increase in time spent training

kindred belfry Mar 31, 2023, 4:38 PM

#

Cannot say

vivid python Mar 31, 2023, 4:40 PM

#

Judging by the fact that I haven't gotten it to work at lower step count, and you have at a factor of over 10x I'm going to say it's likely this is the case

kindred belfry Mar 31, 2023, 4:40 PM

#

In the end it may just depends on the habit and the use case of each user

vivid python Mar 31, 2023, 4:41 PM

#

I can't see most making use of loha because of the step counts

#

As most come from training lora

#

Which usually take 20 minutes to train something decent, if you don't mind screwed up backgrounds, eyes, and hands

#

(That's the old, and honestly very bad, dim128 training)

kindred belfry Mar 31, 2023, 4:47 PM

#

An interesting fact is that the picture of anisphia is probably seen like 40000 times when I trained that loha. I never know how many is enough. I just train for sufficiently long time and check if I have some good results.

#

If it's overbaked I can always use intermediate checkpoint but I never find the final checkpoint to be really unusable.

#

On the other hand for the mother of anisphia it is around 5000 times.

kindred belfry Mar 31, 2023, 8:06 PM

#

@vivid python https://www.canva.com/design/DAFeAteHW18
I have no idea whether this is true but someone finds that you have less bleeding with loha

Canva

EDG

edg's tutorials

Check out this Presentation designed by EDG.

vivid python Mar 31, 2023, 9:13 PM

#

at least in the case of Unicorn, that was not the case it had just about the same amount of bleeding if not more, but that might have been related to dataset

kindred belfry Mar 31, 2023, 9:49 PM

#

I don't know. This is just what I mean each person would find it more or less useful depending on their dataset and what they want to achieve.

normal charm Apr 3, 2023, 8:05 PM

#

1 character
4 concepts
111 images
28-32 images for each concept

What settings would anyone recommend for that
~~yeah ik “it depends on the dataset” but i havent gotten it right still so i need help~~

#

I have so much conflicting knowledge about training shit that i cant even say ik what im doing anymore

vivid python Apr 3, 2023, 8:23 PM

#

normal charm 1 character 4 concepts 111 images 28-32 images for each concept What settings w...

all 4 are roughly equal in size?

normal charm Apr 3, 2023, 8:49 PM

#

vivid python all 4 are roughly equal in size?

If u mean the amount of images then yes

vivid python Apr 3, 2023, 8:55 PM

#

normal charm If u mean the amount of images then yes

alright, give each 2 repeats, and do dim32 to start with, you can dynamically resize down later, alpha16 works well probably, 5e-4 unet, 1e-4 TE for 1600 steps

normal charm Apr 3, 2023, 8:56 PM

#

vivid python alright, give each 2 repeats, and do dim32 to start with, you can dynamically re...

Alright ill give it a go after current training finishes (i doubt itlll work but just in case)

vivid python Apr 3, 2023, 8:56 PM

#

👍

worn locust Apr 3, 2023, 10:41 PM

#

oh wow https://github.com/kohya-ss/sd-scripts/commit/83c7e03d050fc25f47a591c4ddfe28abdabc7ae7

GitHub

Fix network_weights not working in train_network · kohya-ss/sd-scri...

#

was it bad this whole time just cause it wasn't working right? lol

vivid python Apr 3, 2023, 11:08 PM

#

network_weights was created for loading hypernets

#

so it's still pretty much not useful

normal charm Apr 4, 2023, 2:25 AM

#

vivid python 👍

Tried those settings, nothing changed much.
To be specific about the issue, it keeps merging the first two outfits of the character, while mostly missing the third (the most complex one maybe). Theres a swimsuit outfit also, and go figure, it manages to get that working fine for the most part, and that has less images to use then any of the others.

The first two outfits are similar, so its hard to differentiate them too much using the available tags. The third is completely different, but its missed the concept and ignores the activation token each time ive trained it

vivid python Apr 4, 2023, 2:26 AM

#

normal charm Tried those settings, nothing changed much. To be specific about the issue, it k...

not sure then, that usually works for me

#

it worked for the three outfit unicorn Loha I made, though it did have some bleeding, which I believe was more or less a tagging issue

normal charm Apr 4, 2023, 2:29 AM

#

shut siren Apr 4, 2023, 3:39 AM

#

normal charm Tried those settings, nothing changed much. To be specific about the issue, it k...

had an experience like this when i tried a lora with 3 diff outfits

#

got the swimsuit but couldnt differentiate the main and alt costumes

normal charm Apr 4, 2023, 3:57 AM

#

shut siren got the swimsuit but couldnt differentiate the main and alt costumes

Were the others similar in appearance?

shut siren Apr 4, 2023, 3:58 AM

#

not really but some of the tags might have been similar

#

based on straight up booru tagging

normal charm Apr 4, 2023, 4:08 AM

#

It is very annoying when you train something 10 times in a row, and it only nails the bikini each time flawlessly, is all imma say

shut siren Apr 4, 2023, 4:09 AM

#

its ok

#

proompter

normal charm Apr 4, 2023, 4:11 AM

#

Hahaa

vivid python Apr 4, 2023, 4:44 PM

#

https://github.com/kohya-ss/sd-scripts#change-history

GitHub

GitHub - kohya-ss/sd-scripts

Contribute to kohya-ss/sd-scripts development by creating an account on GitHub.

#

AHHHHHHHH

#

that is all

normal charm Apr 4, 2023, 7:41 PM

#

I should be saying that

#

CracksInTheWall

vivid python Apr 4, 2023, 8:44 PM

#

I have a lot of work ahead of me

shut siren Apr 5, 2023, 1:50 AM

#

hmm lycoris added a new algorithm as well

#

not sure of the details

normal charm Apr 5, 2023, 2:50 AM

#

groan

shut siren Apr 5, 2023, 3:57 AM

#

surely thats not a bad thing

normal charm Apr 5, 2023, 4:17 AM

#

Maybe not
If you dont already struggle enough with the systems we do have already ~~like me~~

#

Dont mind me, tho, im just a walking skill issue

#

haha

vivid python Apr 5, 2023, 4:28 AM

#

all I know from what I read, it's basically LoHa but different

normal charm Apr 5, 2023, 4:29 AM

#

Different in that its easier to work with?

#

Right? Right?

#

yanfeismug2

vivid python Apr 5, 2023, 4:29 AM

#

NO FUCKING CLUE

normal charm Apr 5, 2023, 4:29 AM

#

Agony

#

The best kind of clue

vivid python Apr 5, 2023, 4:30 AM

#

not like it matters anyways

#

sd-scripts just broke compatibility with it anyways

normal charm Apr 5, 2023, 4:30 AM

#

That sounds terrific lol

vivid python Apr 5, 2023, 4:32 AM

#

yeah, the block weight thing breaks compatibility with LyCORIS

#

granted it's up to Kohaku to update to sd-scripts

#

not the other way around

shut siren Apr 5, 2023, 4:37 AM

#

it says its like 300kb files

vivid python Apr 5, 2023, 5:44 AM

#

300kb lora huh? I'll let other people use it

#

I'm not even gonna try and touch it

#

Had enough trying to get loha to work

#

Don't want to tinker with this

worn locust Apr 5, 2023, 6:16 AM

#

vivid python I have a lot of work ahead of me

I'll just wait for 4chan to find the perfect ratio

worn locust Apr 5, 2023, 6:16 AM

#

shut siren hmm lycoris added a new algorithm as well

WHAT

#

#

MomijiWide

kindred belfry Apr 5, 2023, 8:17 AM

#

vivid python sd-scripts just broke compatibility with it anyways

I don't think this is the case. I can run with both sd-scripts and lycoris at the most recent versions.

#

It is just that lycoris does not support blockwise learning rate for the moment. I don't know if kohaku plans to add it any soon.

#

As for IA3, I cannot say what its use case will be. For now it trains the same part as lora, in terms of how it trains it is similar to a mini hypernetwork, and in terms of result it is more like ti but for the style.

#

Its small size also indicates it's probably not as good as other methods in general, but if your style is simple enough it should be fine. It trains faster for the same number of steps (like half of time of loha), but it is unclear to me whether we would need more or fewer steps to get something that is ok to the user.

vivid python Apr 5, 2023, 1:03 PM

#

kindred belfry I don't think this is the case. I can run with both sd-scripts and lycoris at th...

I didn't test it yet myself, was just going off of what kohya said

vivid python Apr 5, 2023, 1:13 PM

#

kindred belfry As for IA3, I cannot say what its use case will be. For now it trains the same p...

Yeah, definitely not gonna even try to use this. I'll just implement it and let others fuck with it. Not worth my time, not gonna learn how to train it.

kindred belfry Apr 5, 2023, 4:50 PM

#

I don't think you have anything to add in your script for ia3. The user only need to specify it in the network argument part, so in the end you probably don't need to touch the lycoris part in your update.

shut siren Apr 5, 2023, 4:50 PM

#

The documentation says it’s less transferable between models so that probably limits use cases

kindred belfry Apr 5, 2023, 4:55 PM

#

In fact the block wise thing that kohya implements is also just another network argument so I am not sure if the easy learning script really needs to be modified for that.

#

And yes I guess ia3 at it's current stage may just become an argument that no one uses lol

vivid python Apr 5, 2023, 4:59 PM

#

kindred belfry I don't think you have anything to add in your script for ia3. The user only nee...

the popups needed to be modified to allow for it

kindred belfry Apr 5, 2023, 4:59 PM

#

I see I never used the popup version because I only work remotely

vivid python Apr 5, 2023, 4:59 PM

#

a majority of my users use the popups

#

so I need to make sure it is possible to use with them

quiet notch Apr 5, 2023, 5:26 PM

#

I only use popups to generate the configure file, then I just use the configure file

#

cirnoBigBrain

vivid python Apr 5, 2023, 5:46 PM

#

so do I, but that's still using the popups lel

normal charm Apr 5, 2023, 8:38 PM

#

Should i update ur script? Havent done it in awhile @vivid python

vivid python Apr 5, 2023, 8:39 PM

#

you can, lots of smaller updates happened probably

#

soon there will be the update for sd-scripts

#

which introduced block weight training

#

but I need to set up some stuff for it

#

namely, a proper popup for it

normal charm Apr 5, 2023, 9:02 PM

#

Ill just wait til can is a must

#

I dont wanna suddenly throw something into disarray

#

Like last time deadsmile

cold orchid Apr 6, 2023, 3:53 AM

#

quiet notch I only use popups to generate the configure file, then I just use the configure ...

Ye but u can just edit argslist

#

I just backup argslist for my own brain dead reasons, and just edit that for each

quiet notch Apr 6, 2023, 5:33 AM

#

i dont even know what argslist even is

#

cirnoPoiPwease

vivid python Apr 6, 2023, 12:54 PM

#

It's the python file that has all the args

vivid python Apr 8, 2023, 5:11 AM

#

Finally got around to adding all of the block weight training stuff that kohya introduced

#

It took way too long to make sure everything was working as expected

#

but anyways

#

it's done

#

people can update through the update.bat if they already have it

#

or use the v6 installer

#

links just to make it easier to get to:
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts
https://github.com/derrian-distro/LoRA_Easy_Training_Scripts/releases/tag/installers-v6

GitHub

GitHub - derrian-distro/LoRA_Easy_Training_Scripts: A set of two tr...

A set of two training scripts written in python for use in Kohya's SD-Scripts repository. - GitHub - derrian-distro/LoRA_Easy_Training_Scripts: A set of two training scripts written in pyth...

GitHub

Release installers v6 · derrian-distro/LoRA_Easy_Training_Scripts

Complete re-write of the installer to be a python script. does everything the previous installers did as well as allows installation of torch 2.0.0 or 2.1.0 as well as triton for those versions. Ad...

shut siren Apr 8, 2023, 5:13 AM

#

theres a LoKr module in lycoris now

#

kekega

vivid python Apr 8, 2023, 5:18 AM

#

yeah

#

seems like kohaku suggests using D-Adaption for it because it's a bit finicky to train

normal charm Apr 8, 2023, 5:31 AM

#

Until theres another game breaking way to train stuff i aint changing shit

#

That aside, i barely even understand what block merging is, even having looked at that one rentry page for it

vivid python Apr 8, 2023, 5:35 AM

#

uh...

#

this is that game breaking way

normal charm Apr 8, 2023, 5:35 AM

#

Awesome

#

More struggling

#

Love to hear it

#

MordredBored

vivid python Apr 8, 2023, 5:36 AM

#

block weight training will allow you a ton of control

#

... or you could completely ignore it and continue like you have

#

this update doesn't change the ability to train like before

#

just adds a new way

normal charm Apr 8, 2023, 5:37 AM

#

I could ignore it
But if i start seeing everyone switching up and talking about “oh yeah i used block merging for this” then fomo will get me

vivid python Apr 8, 2023, 5:37 AM

#

oof

normal charm Apr 8, 2023, 5:37 AM

#

As i am not immune to that

vivid python Apr 8, 2023, 5:38 AM

#

I can't say many will

#

its very complex

#

like very complex

#

like 125 values complex

normal charm Apr 8, 2023, 5:38 AM

#

That would explain how i still didnt really understand it

vivid python Apr 8, 2023, 5:38 AM

#

well, techincally its only 25

#

but you can set the weight, dims and alphas

#

per layer

#

which means 25 per, or 125 different possible inputs

#

anyways

#

sleep time for me

ancient lynx Apr 8, 2023, 1:40 PM

#

Now we just need auto mbw for lora training troll_handsome

cyan orbit Apr 8, 2023, 4:02 PM

#

Weighted captions is apparently a thing now, kohya just added

shut siren Apr 8, 2023, 4:36 PM

#

Yeah one of the guys in unstable was working on it for a while

vivid python Apr 8, 2023, 5:39 PM

#

I'll add it once it's out of dev branch, I'm a bit burnt out after this update

#

So I'm gonna step away from it for a few days

worn locust Apr 8, 2023, 8:03 PM

#

vivid python which means 25 per, or 125 different possible inputs

That's an unreasonable amount, surely we can figure out an optimal setup with math or something

vivid python Apr 8, 2023, 8:04 PM

#

well, there are presets for the weights

worn locust Apr 8, 2023, 8:04 PM

#

cyan orbit Weighted captions is apparently a thing now, kohya just added

god DAMN it

vivid python Apr 8, 2023, 8:04 PM

#

but literally only that

worn locust Apr 8, 2023, 8:04 PM

#

GOD damn

normal charm Apr 8, 2023, 8:49 PM

#

Too much tech for brein

#

agony

normal charm Apr 8, 2023, 11:43 PM

#

So no new args besides the huggingface ones?

vivid python Apr 9, 2023, 12:16 AM

#

And all of the stuff for block weights

#

But that goes into the network_args

shut siren Apr 9, 2023, 12:16 AM

#

ill let someone test out the block weights lol

normal charm Apr 9, 2023, 8:55 AM

#

@vivid python eh?

#

i used the update file, but ig it didnt fully work?

#LoRA_Easy_Training_Scripts