LoRA_Easy_Training_Scripts | 東方Project AI | Page 3

normal charm Apr 9, 2023, 9:05 AM

#

wait nvm, false alarm, it works now. dunno what happened tho

vivid python Apr 14, 2023, 9:27 PM

#

updated my scripts to support dylora https://github.com/derrian-distro/LoRA_Easy_Training_Scripts

GitHub

GitHub - derrian-distro/LoRA_Easy_Training_Scripts: A set of two tr...

A set of two training scripts written in python for use in Kohya's SD-Scripts repository. - GitHub - derrian-distro/LoRA_Easy_Training_Scripts: A set of two training scripts written in pyth...

normal charm Apr 15, 2023, 1:41 AM

#

Dyfuckingwhat

#

vivid python Apr 15, 2023, 11:12 AM

#

Dylora

vivid python Apr 15, 2023, 1:32 PM

#

From what I know, it's a way to make low dim lora work better? I haven't thoroughly tested it

quiet notch Apr 15, 2023, 5:49 PM

#

https://arxiv.org/abs/2210.07558

arXiv.org

DyLoRA: Parameter Efficient Tuning of Pre-trained Models using Dyna...

With the ever-growing size of pre-trained models (PMs), fine-tuning them has
become more expensive and resource-hungry. As a remedy, low-rank adapters
(LoRA) keep the main pre-trained weights of the model frozen and just introduce
some learnable truncated SVD modules (so-called LoRA blocks) to the model.
While LoRA blocks are parameter efficient...

#

They state it can train x7 faster than lora...?!

#

cirnoShookWoke

bleak minnow Apr 15, 2023, 5:51 PM

#

👀

quiet notch Apr 15, 2023, 5:51 PM

#

without compromising performance

#

cirnoCursedShook

bleak minnow Apr 15, 2023, 5:51 PM

#

https://tenor.com/view/cigar-smoke-funny-gif-25177516

Tenor

#

new toy to play with

#

nice

quiet notch Apr 15, 2023, 5:51 PM

#

cirnoWobble

worn locust Apr 15, 2023, 7:30 PM

#

quiet notch They state it can train x7 faster than lora...?!

More like 7x slower AYAYAYA

bleak minnow Apr 16, 2023, 12:54 AM

#

worn locust More like 7x slower <:AYAYAYA:978879333248667718>

AunnThink

worn locust Apr 16, 2023, 1:16 AM

#

I tried reading the paper but it sounds like dylora is gonna be useless

#

If it was 7x faster that would be epic but it wasn't when derrian tested it

shut siren Apr 16, 2023, 1:36 AM

#

are the results any different for dylora

#

or is it another dejj like ia3 and lokr

normal charm Apr 16, 2023, 2:17 AM

#

I due prefer a good speed

vivid python Apr 16, 2023, 4:50 AM

#

worn locust If it was 7x faster that would be epic but it wasn't when derrian tested it

actually, I didn't really test it entirely, just tested to make sure it actually trained, it's possible that it actually is, I just haven't had the time to test it myself

#

though, I did find that it trains about as fast in terms of iteration speed, just ran out of vram that first time

#

so it didn't count

normal charm Apr 16, 2023, 5:37 AM

#

So the jury is still out

worn locust Apr 16, 2023, 5:47 AM

#

shut siren Apr 16, 2023, 7:42 AM

#

what kind of settings were tested for dylora?

#

based on the paper, it seems like the purpose of dylora is that you can do inference at different ranks

#

cmonbrug

#

my dylora seems super undertrained for the same settings as locon

vivid python Apr 16, 2023, 11:31 AM

#

Which version of dylora did you use? Kohaku's is different from kohya's

#

And because of that dylora is not going to be able to be used depending on the mode

#

If kohya's, then you have to use additional networks

quiet notch Apr 16, 2023, 6:05 PM

#

worn locust If it was 7x faster that would be epic but it wasn't when derrian tested it

not a x7 faster, but a x7 faster to get to optimzal results. did you cut training time by 7?

quiet notch Apr 16, 2023, 6:29 PM

#

oh man... so many new implementations of lora training while i was busy

#

well, mainly ia3 and lokr and dylora

#

cirnoNotLikeThis

#

and then this block weight training thing

#

i haven't even looked into what optimizers to, besides that adam8 is the "best"

worn locust Apr 16, 2023, 6:31 PM

#

quiet notch not a x7 faster, but a x7 faster to get to optimzal results. did you cut trainin...

I should've considered this more seriously

worn locust Apr 16, 2023, 6:32 PM

#

quiet notch well, mainly ia3 and lokr and dylora

You should test dylora out of all the new things

quiet notch Apr 16, 2023, 6:32 PM

#

i'm doing that right now cirnoSugoiWow

#

setting up a json, but i won't be able to train until a bit later

worn locust Apr 16, 2023, 6:33 PM

#

quiet notch and then this block weight training thing

I'm waiting for someone to figure out an optimal setup and roll with that

shut siren Apr 16, 2023, 6:33 PM

#

i tried kohaku's dylora

worn locust Apr 16, 2023, 6:33 PM

#

quiet notch i haven't even looked into what optimizers to, besides that adam8 is the "best"

adamw8bit I guess

shut siren Apr 16, 2023, 6:33 PM

#

4e-4 unet with 5e-5 text encoder learned like basically nothing

#

at ~900 steps

#

what dims were ppl testing on dylora

quiet notch Apr 16, 2023, 6:34 PM

#

batch size?

shut siren Apr 16, 2023, 6:34 PM

#

supposedly the idea is that you can do inference at a diff rank than its trained at based on the paper?

#

im always batch 1

#

basically stochastic lol

quiet notch Apr 16, 2023, 6:34 PM

#

👌

#

i missed the "inference" part on the paper

#

wait, what do you mean by inference?

shut siren Apr 16, 2023, 6:36 PM

#

like, generating images

quiet notch Apr 16, 2023, 6:37 PM

#

oh, that's interesting... i'm reading now that it's adaptive at inference time

#

i was under the impression that training is adaptive in determining rank (dim)

#

hence, was confused why you would want to pick a dim, since dylora would optimize the dim anyways

shut siren Apr 16, 2023, 6:38 PM

#

from my understanding of the paper, the supposed benefit of dylora is to avoid having to do multiple training runs at different rank

#

to find the optimal rank

#

since you can just select the rank used at inference

#

now, im not sure what the dim settings on dylora do

#

maybe its the maximum rank?

quiet notch Apr 16, 2023, 6:42 PM

#

it might not be used at all?

#

i'll have to "read" the paper again

shut siren Apr 16, 2023, 6:44 PM

#

rip neither kohya's or kohaku's repos having english documentation for how to use kek

#

ok im just gonna run kohya's documentation through deepL lol

#

"Features of DyLoRA in this Repository
After training, DyLoRA model files are compatible with LoRA. LoRAs of multiple dims below a specified dim(rank) can be extracted from the model file."

#

so i think the rank specified for dylora is like the max rank

#

it will simultaneously train for all ranks below that

#

"According to the paper, higher ranks of LoRA are not necessarily better, but it is necessary to find the appropriate rank depending on the model, dataset, task, etc. Using DyLoRA, LoRA is trained simultaneously at various ranks below a specified dim(rank). This saves time in learning and searching for the optimal rank for each."

#

"Also, specify a unit for --network_args, for example --network_args "unit=4", where unit is a unit to divide ranks. For example, --network_dim=16 --network_args "unit=4" where unit is a divisible value of network_dim (network_dim is a multiple of unit)."

#

so you can specify how to divide them

#

based on this i think you can do like dim16 with training also at dim12/8/4 if you set unit=4

#

in kohaku's its called block size iirc

#

"For example, training with dim=16 and unit=4 (see below) will train and extract LoRA for 4, 8, 12, and 16 ranks. By generating images with each of the extracted models and comparing them, the LoRA with the best rank can be selected."

#

basically dylora is to avoid having to retrain multiple times to find the ideal dim size

quiet notch Apr 16, 2023, 7:05 PM

#

interesting... cirnoThinkHmm

#

i wonder how increasing dim effects training time

#

cause lately i've been trying to train style at 8 dim, and with default unit 4, then dylora would only train 4, 8, which doesn't seem like it would be an improvement

#

i haven't experimented with dim/alpha at all tbh, so i don't know too much about how they effect results

#

but i guess with dylora and extraction, it would be easier to extract lower rank dims and compare them

#

i know there's already comparison grids of dim/alpha, but it's a different kind of learning if you do it yourself with your own dataset that you're familiar with

shut siren Apr 16, 2023, 7:16 PM

#

not sure why with kohaku's it seems to need either a higher LR or more steps

#

than locon

quiet notch Apr 16, 2023, 8:03 PM

#

currently training dylora, and the samplers per epoch look terrible

#

cirnoHelpImDyingInside

shut siren Apr 16, 2023, 10:18 PM

#

i didnt have any turn out well

quiet notch Apr 17, 2023, 11:55 PM

#

i'm thinking about trying this dylora with dadaptation

quiet notch Apr 18, 2023, 2:04 AM

#

quiet notch currently training dylora, and the samplers per epoch look terrible

ha, i'm an idiot. i put too many 0's on my unet_lr, so I was training x10 less than usual

#

cirnoLaugh

#

cirnoHelpImDyingInside

vivid python Apr 18, 2023, 2:44 AM

#

oof

bleak minnow Apr 19, 2023, 1:22 AM

#

good

errant wraith Apr 21, 2023, 2:33 AM

#

any guide on how to use these scripts?

#

or even link a message in a convo of someone explaining it

#

feeling really stupid rn

vivid python Apr 21, 2023, 3:12 AM

#

you just need to follow the popups

#

once they are installed using the installer

#

you can run them by running the run_popup.bat

#

once loaded it will ask you a bunch of questions sequentially

#

if you know what settings you want, it's pretty quick

#

if not, then it might be a bit confusing

#

I'm working on an overhaul of the UI right now, as in, I'm making a whole UI right now

#

what are you having an issue with in particular?

quiet notch Apr 21, 2023, 10:15 PM

#

Honestly, using the json file with notepad is my UI and honestly that's all I need.

#

The arglist.py (i think) is a good reference as well, albeit a bit hidden.

marsh basin Apr 21, 2023, 10:26 PM

#

Hey, could someone help me out with the following or give me tips how I can succesfully make a lora out of these images:

#

I know these are quite limited, but I cant figure out how to do this properly. I am getting mixed results with Kohya, would your easy training script help? Like normalizing.

#

Going to check it out rn though

vivid python Apr 22, 2023, 3:51 AM

#

the easy training scripts also uses kohya on the back end

#

so it's likely you won't get better results if you were getting bad results before

#

that being said, that is far outside of what I normally train, so I can't really help you

quiet notch Apr 22, 2023, 5:56 AM

#

just some quality of life features i'd like to see implemented.

if the output folder doesn't exist, just create it
allow the provided json name to be used as the name of the output folder, log prefix, and output name (togglable functionality)
maybe have the same functionality with im/reg folder path, but enforce suffixes to keep naming ordering consistent (togglable functionality)
let custom schedulers take the "num_warmup_steps" and "num_training_steps" as arguments for kwargs (my custom schedulers are a similar implementation of built-in schedulers)

#

my workflow is currently as follows

generate a template json config script through json
edit the json config. i find myself redundantly editing the output folder, output name, and log prefix to the same name
copy json file to create variants, usually adjusting one hyperparameter, but also changing the output folder, output name, and log prefix
create output folders
run multiple json training
go away for a long time and hope training didn't stop because i made a typo or forget to create a folder or something

#

it's very fiddly but very powerful, i like it

#

it's just... after doing 50+ trainings... it kinda gets to you

#

this is probably super extra and probably not needed on main branch, but if in the json file name, i put something like e12, it would know that this json is meant to be ran for 12 epochs, and will run for 12 epochs regardless of what's in the file itself

#

that's probably something i would have to do for my own personal workflow, but just something cool to bring up, i guess

#

cirnoShrug

#

other examples could be Ux# to multiply lr_unet by #, or Tx# to multiply lr_textencoder. all separated by spaces

errant wraith Apr 22, 2023, 7:10 AM

#

vivid python you can run them by running the `run_popup.bat`

ah, when i tried running that after installing with v5 quite simply nothing happened

vivid python Apr 22, 2023, 7:14 AM

#

errant wraith ah, when i tried running that after installing with v5 quite simply nothing happ...

Sometimes it takes a while for everything to initialize, this is seemingly an issue with tkinter, not sure if I can change it

vivid python Apr 22, 2023, 7:21 AM

#

quiet notch just some quality of life features i'd like to see implemented. - if the output ...

Creating an output folder Is doable. Id have to rewrite some of my json code but allowing the use of the json name is possible, not entirely sure what you mean by enforcing suffix. Pretty sure custom schedulers already have that ability in the scripts but I don't use custom schedulers nor anybody else I've talked to, so I didn't feel the need to care about its implementation, either way, that one would probably be really annoying to account for. To be entirely honest, seems like you just kinda go... way too far per bake?

#

That being said, development time is currently being spent on making a UI

#

Oh, and about the name being used for arguments, that kinda defeats the purpose of the json files in the first place. And it would be a lot of work making a parser for a system very few would use

flat coral Apr 22, 2023, 7:28 AM

#

pressing enter helps sometimes. it could have just paused on its own. happened a few times for me

vivid python Apr 22, 2023, 7:31 AM

#

Oh very true, that's unfortunately a quirk of command line

flat coral Apr 22, 2023, 7:32 AM

#

well somehow my output with easy training seems a bit different from just doing it through powershell. probably just me

#

i do like the easy features and .json

#

a custom ui/gui would be ideal and great

errant wraith Apr 22, 2023, 9:37 AM

#

vivid python Sometimes it takes a while for everything to initialize, this is seemingly an is...

uh so what now

vivid python Apr 22, 2023, 10:57 AM

#

errant wraith uh so what now

A popup should have appeared, sometimes it doesn't appear on top of everything else so just alt + tab to find it

vivid python Apr 22, 2023, 11:41 AM

#

flat coral well somehow my output with easy training seems a bit different from just doing ...

There might be a real reason for that, by default weight decay is something like 0.01, I have it set to 0.1 by default on my scripts because I've found it generally produces better results

errant wraith Apr 22, 2023, 1:32 PM

#

vivid python A popup should have appeared, sometimes it doesn't appear on top of everything e...

ah i see, I didn't expect one that doesn't show up on the taskbar

#

ty

vivid python Apr 22, 2023, 3:14 PM

#

Yeah, quirk of tkinter was hoping to not be using it anymore at this point but that's not how it panned out. Good thay it's working for you now though

worn locust Apr 22, 2023, 4:44 PM

#

vivid python There might be a real reason for that, by default weight decay is something like...

What does weight decay do

vivid python Apr 22, 2023, 5:08 PM

#

It basically is the rate in which a model forgets something, so a higher weight decay can help in reducing bad training early on

#

Or, If too high, it could completely not learn anything

#

Granted, 0.1 is give or take a good spot to be in

#

If you have a really high lr, the weight decay can actually fix some of the issues that comes with that

#

Usually

flat coral Apr 22, 2023, 5:44 PM

#

vivid python There might be a real reason for that, by default weight decay is something like...

That makes sense. Ya. I have my LR low. As in for character 1-1.4e-4 max for characters. (Lion too)

#

0.1 does sound a bit too much in some cases

worn locust Apr 22, 2023, 5:59 PM

#

I'm trying to add dadaptation as an option to my colab but I got this error

Setting different lr values in different parameter groups is only supported for values of 0
I have these settings ```toml
[additional_network_arguments]
unet_lr = 1.0
text_encoder_lr = 0.5
network_dim = 16
network_alpha = 16
network_module = "networks.lora"

[optimizer_arguments]
learning_rate = 1.0
lr_scheduler = "constant_with_warmup"
lr_warmup_steps = 35
optimizer_type = "DAdaptation"
optimizer_args = [ "decouple=True", "weight_decay=0.02",]```

#

I don't know what a parameter group is in this context

#

I think the parameters groups in question are unet and text encoder
But other people can set them differently just fine

#

I had to pip install dadaptation manually, maybe that's why? But what else can I do

vivid python Apr 22, 2023, 7:01 PM

#

D-adapt can't have seperate unet and te

#

It's based on adam not adamw, so it's all on one lr

vivid python Apr 22, 2023, 7:03 PM

#

flat coral 0.1 does sound a bit too much in some cases

0.1 is not really too much, in 99% of cases, even 1e-4 (which isn't really that low) works fine, as that's what I used to train at

worn locust Apr 22, 2023, 8:12 PM

#

vivid python It's based on adam not adamw, so it's all on one lr

i thought decouple would change that

vivid python Apr 22, 2023, 9:56 PM

#

worn locust i thought decouple would change that

decouple decouples weight decay

#

that's it

worn locust Apr 22, 2023, 10:18 PM

#

True

normal charm Apr 23, 2023, 1:09 PM

#

Should i run the update again

#

LucaWat

#

Also is there anywhere i can read about the schedulers?

errant wraith May 16, 2023, 3:19 PM

#

so i want to train only the out/up layers, but when i put a weight of 0 for the middle layer, it just asks me to cancel, .1 says not an integer, it only accepted 1

errant wraith May 16, 2023, 3:56 PM

#

oh i should try updating..

merry hearth May 16, 2023, 4:11 PM

#

I have a question that I train lora on colab and the result is also very good (probably) but the size of that lora file is only about 10-20 mb (or I have trained too few images) and is there a way can anyone help it improve because when exporting lora file about 10 files, 1 file seems to be fine

vivid python May 17, 2023, 2:39 PM

#

errant wraith so i want to train only the out/up layers, but when i put a weight of 0 for the ...

That sounds odd... let me double check the code

#

Ok yep, that was my mistake, I forgot to set it's mode to float I'll fix it soon

vivid python May 17, 2023, 2:41 PM

#

merry hearth I have a question that I train lora on colab and the result is also very good (p...

What exactly do you mean?

merry hearth May 17, 2023, 2:44 PM

#

vivid python What exactly do you mean?

i don't know if my lora is really ok, and i also tried many versions and the result is very different and sometimes error and my model seems the file size is very small compared to some other lora (the lora which is 80 -150 mb in size)

#

vivid python May 17, 2023, 2:46 PM

#

All of my lora are either 16-ish mb or 30-40mb depending on if I'm using a lora or locon

#

So it's not an issue, just means you are using a smaller dim size

#

It might just be your training parameters causing problems here.

#

Because a small dim size won't break them

merry hearth May 17, 2023, 2:51 PM

#

you are used colab to train not

vivid python May 17, 2023, 5:23 PM

#

I don't use colab to train, no. I'm the one who makes the easy training scripts

#

@errant wraith I updated the scripts, it should be fine now

normal charm May 17, 2023, 9:37 PM

#

I havent run the update bat in so long

#

DrollHell

#

And im still scared too

fiery tendon May 17, 2023, 9:41 PM

#

Wouldn't #1092821901430227085 be a better channel for this?

normal charm May 17, 2023, 10:04 PM

#

It was maybe made before that

#

If u mean this entire thread

fiery tendon May 17, 2023, 10:29 PM

#

Like why not move there?

errant wraith May 18, 2023, 12:11 AM

#

vivid python <@226568326514606094> I updated the scripts, it should be fine now

Sick, thanks drakkon

vivid python May 18, 2023, 2:29 AM

#

fiery tendon Like why not move there?

this is among the oldest posts probably, the "guides and resources" section didn't exist when this was created. I saw no reason to move over as usually there isn't much talking that happens here, that being said once the UI is done I'll probably create a new thread over there

magic magnet May 19, 2023, 4:29 AM

#

Discord doesn't have a 'move thread to another forum' functionality sadly

#

so yeah right now moving this thread would entail shenanigans and cause more confusion probably

vivid python May 22, 2023, 6:36 AM

#

alright, I'm gonna be making a new thread over in guides and resources. because I just finished a complete rewrite

#

I added a UI

#

#1110094921316171816

#LoRA_Easy_Training_Scripts