#🔧｜finetune | Stable Diffusion | Page 8

split acorn Jan 2, 2023, 8:00 AM

#

a rare token is like "olis" or "hta" or "sks"

frank urchin Jan 2, 2023, 8:00 AM

#

i see what you mean yeah

split acorn Jan 2, 2023, 8:00 AM

#

You could try it! might work well. I'm not sure

frank urchin Jan 2, 2023, 8:00 AM

#

correct me if this is dumb but could i like shorten princess to prncs or something 😭

split acorn Jan 2, 2023, 8:01 AM

#

The longer the instance token the more you have to train

#

You could. But honestly, I think that's something you worry about later once you get the hang of it alicatPog

#

If you're training on Any3, I'd recommend "hta"

frank urchin Jan 2, 2023, 8:02 AM

#

im using elysium

#

is that bad 😭

split acorn Jan 2, 2023, 8:02 AM

#

nah, that's fine

frank urchin Jan 2, 2023, 8:02 AM

#

ok so

#

princess tutu is fine for now then?

#

this isnt serious or anything

#

i just like learning new stuff so its all just for fun

split acorn Jan 2, 2023, 8:03 AM

#

mmm, if you're learning you could try that, but the next time you try it, try it with "olis" or "sks" or "hta" and you might have better results

#

alicatLove

frank urchin Jan 2, 2023, 8:04 AM

#

ok tysm

#

what else should i add? (if anything)

white current Jan 2, 2023, 8:05 AM

#

instance prompt:

[filewords] or princess tutu, [filewords]

frank urchin Jan 2, 2023, 8:05 AM

#

wdym by filewords?

#

OH

white current Jan 2, 2023, 8:06 AM

#

just type [filewords]

frank urchin Jan 2, 2023, 8:06 AM

#

youre not telling me to replace that are you

#

LMAO

white current Jan 2, 2023, 8:06 AM

#

no

frank urchin Jan 2, 2023, 8:06 AM

#

im too tired for this 💀

#

ty guys

#

ok whats next!

white current Jan 2, 2023, 8:06 AM

#

for class prompt: [filewords]

#

i use sth different but eh need to experiment

frank urchin Jan 2, 2023, 8:07 AM

#

okey

#

anything else?

white current Jan 2, 2023, 8:07 AM

#

alright done

#

now make sure you have selected the model you made

#

on the leftmost

#

section

frank urchin Jan 2, 2023, 8:08 AM

#

#

look good?

#

great model name i know

#

😌

stone garden Jan 2, 2023, 8:08 AM

#

it's better than mine, mine crashed the whole webui!

frank urchin Jan 2, 2023, 8:09 AM

#

LMAOOO

white current Jan 2, 2023, 8:10 AM

#

frank urchin LMAOOO

press train

#

and

frank urchin Jan 2, 2023, 8:10 AM

#

WOOO

white current Jan 2, 2023, 8:10 AM

#

now u have to wait a bit

#

only a bit

#

3060 ti might have low vram

#

but that processor is mmmmmmmm

stone garden Jan 2, 2023, 8:11 AM

#

enough time to pray to the art gods!

white current Jan 2, 2023, 8:11 AM

#

it nom nom iterations

frank urchin Jan 2, 2023, 8:11 AM

#

8gb isnt thatttttt low

#

its just low for ai 😌

white current Jan 2, 2023, 8:11 AM

#

frank urchin 8gb isnt thatttttt low

it is for AI

#

fr

frank urchin Jan 2, 2023, 8:11 AM

#

i heard peoples 4090s dying from stable diffusion

#

thanks to overheating

#

and having to RMA them

stone garden Jan 2, 2023, 8:12 AM

#

just be careful with the temperatures of your card if you don't already have a hand on it (not literally)

split acorn Jan 2, 2023, 8:12 AM

#

I think you can only db with 8gb by doing like 256x256 but I could be wrong

frank urchin Jan 2, 2023, 8:12 AM

#

i have a very well cooled pc!

stone garden Jan 2, 2023, 8:12 AM

#

as long as it doesn't turn int o *has!

frank urchin Jan 2, 2023, 8:12 AM

#

yall i think it failed

white current Jan 2, 2023, 8:12 AM

#

frank urchin and having to RMA them

skill issue on NVIDIA's part

stone garden Jan 2, 2023, 8:13 AM

#

and then had :P

white current Jan 2, 2023, 8:13 AM

#

frank urchin yall i think it failed

what

vale egret Jan 2, 2023, 8:13 AM

#

My 4090 never seems to go above 65deg. Is that because it only has 3/4 of the power cables plugged in?

frank urchin Jan 2, 2023, 8:13 AM

#

white current skill issue on NVIDIA's part

yeah 4090 and 3090 have awful heating issues

white current Jan 2, 2023, 8:13 AM

#

vale egret My 4090 never seems to go above 65deg. Is that because it only has 3/4 of the po...

MonkaS

frank urchin Jan 2, 2023, 8:13 AM

#

vale egret My 4090 never seems to go above 65deg. Is that because it only has 3/4 of the po...

LMFAOOO

vale egret Jan 2, 2023, 8:13 AM

#

What?

frank urchin Jan 2, 2023, 8:13 AM

#

frank urchin yall i think it failed

any ideas 💀

white current Jan 2, 2023, 8:13 AM

#

plug that 4th one, you arent using it full capacity...

white current Jan 2, 2023, 8:13 AM

#

frank urchin any ideas 💀

uh

#

Lake.exe has stopped working
Please reboot

vale egret Jan 2, 2023, 8:14 AM

#

Do i need full capacity? It seems to be like 95% the way there with 75% the power. Looks like good efficiency to me

white current Jan 2, 2023, 8:14 AM

#

vale egret Do i need full capacity? It seems to be like 95% the way there with 75% the powe...

u dont

#

dam

#

i gotta get rtx 4090

#

but how

#

am broke

stone garden Jan 2, 2023, 8:15 AM

#

just save for a A100 instead, not that much of a price difference!

vale egret Jan 2, 2023, 8:15 AM

#

Sell your ai pictures on patreon

frank urchin Jan 2, 2023, 8:15 AM

#

oh heres what happened

#

that sucks

stone garden Jan 2, 2023, 8:15 AM

#

yeah, didn't you need about 12 gig for training? or did they fix that?

frank urchin Jan 2, 2023, 8:15 AM

#

LORA

vale egret Jan 2, 2023, 8:16 AM

#

frank urchin oh heres what happened

Time to close every other app on the computer

frank urchin Jan 2, 2023, 8:16 AM

#

makes 7gb work fine

#

LMAO

#

white current Jan 2, 2023, 8:16 AM

#

That explains why

#

W11

#

sucks

#

hm

vale egret Jan 2, 2023, 8:17 AM

#

Firefox uses gpu

white current Jan 2, 2023, 8:17 AM

#

fr

frank urchin Jan 2, 2023, 8:17 AM

#

true

#

ok i closed like 5 tabs

#

i will try again

#

and it failed again

#

is it using LORA?

#

i feel like its not

#

this should work fine

stone garden Jan 2, 2023, 8:18 AM

#

if nothing else, then I'm sure there's people here who can help make the embedding if you ask them nicely! :D

white current Jan 2, 2023, 8:18 AM

#

frank urchin is it using LORA?

does it say injecting lora

#

in the cmd

frank urchin Jan 2, 2023, 8:19 AM

#

yeah it does

frank urchin Jan 2, 2023, 8:19 AM

#

stone garden if nothing else, then I'm sure there's people here who can help make the embeddi...

true true

vale egret Jan 2, 2023, 8:19 AM

#

Embedding? I thought kole was training a model

frank urchin Jan 2, 2023, 8:19 AM

#

this is a model

#

i dont really need a whole model thats just what got suggested to me

#

💀

vale egret Jan 2, 2023, 8:20 AM

#

What are you trying to train?

stone garden Jan 2, 2023, 8:20 AM

#

@split acorn Found the speed issue. I set the batch size to 1 and now I'm getting over 4it/sec

white current Jan 2, 2023, 8:20 AM

#

frank urchin i dont really need a whole model thats just what got suggested to me

afaik, it is faster and higher quality

#

it would take like

#

2000/4 = 500 seconds to train

frank urchin Jan 2, 2023, 8:21 AM

#

vale egret What are you trying to train?

i just wanna be able to have SD make a certain character

frank urchin Jan 2, 2023, 8:21 AM

#

white current afaik, it is faster and higher quality

yeah makes sense

stone garden Jan 2, 2023, 8:21 AM

#

I'd also recommend an embedding if it's just for one character, that way they might use it on all kinds of models :D

frank urchin Jan 2, 2023, 8:22 AM

#

thats what i was thinking as well

vale egret Jan 2, 2023, 8:22 AM

#

You can definitely train a character using an embedding, it is somewhat lower quality

#

But at least you can do it with lower vram

frank urchin Jan 2, 2023, 8:23 AM

#

yeah this doesnt seem to be going well

stone garden Jan 2, 2023, 8:23 AM

#

stone garden I'd also recommend an embedding if it's just for one character, that way they mi...

Have you had good luck there? I trained an embedding against the generic 1.5 model and then tried to use it with a custom model and the likeness no longer worked

#

and you might need to tweak the images, and settings if your results aren't to your liking so better start slow with something simple as an embedding! :)

frank urchin Jan 2, 2023, 8:23 AM

#

do i need to start over to do an embedding?

vale egret Jan 2, 2023, 8:25 AM

#

stone garden Have you had good luck there? I trained an embedding against the generic 1.5 mod...

Embeddings only generally work on the model used for training and on models which are 40+% merges. Other than that, the results can be very wrong

stone garden Jan 2, 2023, 8:25 AM

#

stone garden Have you had good luck there? I trained an embedding against the generic 1.5 mod...

works extremely well for me, but as you said, changing the model which creates different stuff will change the likeness as well. Depends on it's too much or too little. And also how. But you can, if you use something like the webui increase the strength a little more with the use of ()

stone garden Jan 2, 2023, 8:26 AM

#

stone garden works extremely well for me, but as you said, changing the model which creates d...

Yeah tried that. Unfortunately it doesn't help much on some of these more complex merges.

vale egret Jan 2, 2023, 8:26 AM

#

I accidentally used one of my anime embeddings in SD 1.5 and it gave me Japanese people with massive bug eyes

stone garden Jan 2, 2023, 8:26 AM

#

frank urchin do i need to start over to do an embedding?

you don't need to start over, or rather, it depends on what you mean, but you got all the settings, and the images already so there's not much for you to do if you try again, etc

frank urchin Jan 2, 2023, 8:27 AM

#

ok thats good at least LOL

vale egret Jan 2, 2023, 8:27 AM

#

No need for regularization with an embedding

stone garden Jan 2, 2023, 8:27 AM

#

stone garden Yeah tried that. Unfortunately it doesn't help much on some of these more comple...

yeah, merges are very finicky, results can be almost anything! But I myself mostly use merges AS the embedding as I'm after a style and not a character when merging. But we're all using it differently.

round hare Jan 2, 2023, 8:28 AM

#

Thanks, i't a really good idea, but unfortunately, I can't put this kind of expression in automatic1111. I only able to fill the box with one number

frank urchin Jan 2, 2023, 8:28 AM

#

ok so where would i go from here then to make an embedding?

stone garden Jan 2, 2023, 8:28 AM

#

stone garden yeah, merges are very finicky, results can be almost anything! But I myself most...

I really just want to put a character into some of these really good models

vale egret Jan 2, 2023, 8:29 AM

#

frank urchin ok so where would i go from here then to make an embedding?

I have no clue what that ui is. Embeddings are available on the train tab of the UI when extensions are disabled

stone garden Jan 2, 2023, 8:29 AM

#

stone garden I really just want to put a character into some of these really good models

depends on how well know the character is, how much detail and likeness you want. There's also a lot of small things you can try, like having the embedding prompt word later in the prompt sentence, or earlier, or twice, etc :)

frank urchin Jan 2, 2023, 8:30 AM

#

vale egret I have no clue what that ui is. Embeddings are available on the train tab of the...

LMAO OH YEAH

stone garden Jan 2, 2023, 8:30 AM

#

stone garden depends on how well know the character is, how much detail and likeness you want...

I'll do some more testing. Thanks!

frank urchin Jan 2, 2023, 8:30 AM

#

forgot that was a default thing my bad 😭

#

i was in dreambooth

stone garden Jan 2, 2023, 8:31 AM

#

stone garden I'll do some more testing. Thanks!

sorry for being vague, but as I don't know what character you want and in what level of detail, it's the best I can give you :)

frank urchin Jan 2, 2023, 8:31 AM

#

should i change this?

stone garden Jan 2, 2023, 8:32 AM

#

stone garden sorry for being vague, but as I don't know what character you want and in what l...

Not vague at all. You helped!

#

you can always play around and test what'll happen, if it works or not. But I fear the vram needs to be at a certain number to even be able to run :/

stone garden Jan 2, 2023, 8:33 AM

#

stone garden Not vague at all. You helped!

you must be hallucinating, I never help people. I mostly stumble over things until I either get kicked out, or someone gives me another drink... wait, this isn't the bar. W-Where am I?!

frank urchin Jan 2, 2023, 8:34 AM

#

stone garden you can always play around and test what'll happen, if it works or not. But I fe...

worth giving it a go! any settings i should change here?

#

(got this btw dw)

stone garden Jan 2, 2023, 8:35 AM

#

don't worry about the settings, try and get it to work first. You can always redo it if it turns out bad. And also ask people for more help. There's no limits... other than the electrical bill, but I never pay my bills so

frank urchin Jan 2, 2023, 8:36 AM

#

LMAO

#

i turned 18 a couple weeks ago

#

and literally 3 days after i turned 18 my dad started making me pay rent 😭

split acorn Jan 2, 2023, 8:37 AM

#

frank urchin worth giving it a go! any settings i should change here?

These are the settings I used:
https://docs.google.com/spreadsheets/d/1rGy5Jb63LdFMfzqN_7Y-X6E5bnsYlqRt7zD61tCcGNs/edit?usp=sharing

frank urchin Jan 2, 2023, 8:37 AM

#

ty!

split acorn Jan 2, 2023, 8:37 AM

#

The Prompt Template had a file that just said [filewords] I think?

#

just going to double check that

white current Jan 2, 2023, 8:38 AM

#

@split acorn finetuning on stable tuner doesnt work cause vram lul

#

guess back to lora

stone garden Jan 2, 2023, 8:38 AM

#

frank urchin and literally 3 days after i turned 18 my dad started making me pay rent 😭

welcome to hell! having to pay your bills, why can't everything be free. And also owned by me? >:(

frank urchin Jan 2, 2023, 8:39 AM

#

LMAOO

#

Yeah it's not fun

white current Jan 2, 2023, 8:39 AM

#

frank urchin and literally 3 days after i turned 18 my dad started making me pay rent 😭

Im so glad i have amazing parents... Alhamdulillah.

frank urchin Jan 2, 2023, 8:39 AM

#

I'm currently in debt to my dad!

#

Isn't that fun

split acorn Jan 2, 2023, 8:39 AM

#

oh yeah, Stable Tuner is only for 24 GB

#

EveryDream I think is also 24 GB? alicatHm2

stone garden Jan 2, 2023, 8:40 AM

#

frank urchin I'm currently in debt to my dad!

just say you're going to ask your parents for help

white current Jan 2, 2023, 8:40 AM

#

frank urchin I'm currently in debt to my dad!

what sort of parents...

western society ffs...

frank urchin Jan 2, 2023, 8:41 AM

#

stone garden just say you're going to ask your parents for help

wdym?

white current Jan 2, 2023, 8:41 AM

#

@split acorn also i did a clever trick with .txt file prompts

frank urchin Jan 2, 2023, 8:41 AM

#

white current what sort of parents... western society ffs...

yeah it sucks

#

for my 18th birthday my grandparents gave me $1000 to use for buying a car

#

and my dads already taken a quarter of it

#

for rent

white current Jan 2, 2023, 8:41 AM

#

frank urchin and my dads already taken a quarter of it

...

stone garden Jan 2, 2023, 8:42 AM

#

frank urchin wdym?

Just joking that you can ask your parents for help, but as your dad is the one who you need to pay it'll turn around as a weird scene :P

white current Jan 2, 2023, 8:42 AM

#

frank urchin and my dads already taken a quarter of it

man if i had money i would send over to you

frank urchin Jan 2, 2023, 8:42 AM

#

oh yeah 😭

split acorn Jan 2, 2023, 8:42 AM

#

split acorn just going to double check that

Right, there was a txt file with [name], [filewords] and a text file with just [filewords] that I tested (there might be a better way, but hey, it works)

frank urchin Jan 2, 2023, 8:42 AM

#

white current man if i had money i would send over to you

hey its not too bad coolguy

#

at least they agreed on not kicking me out

#

they havent decided what they'll do when i run out of money to pay them tho

#

so thats worrying 💀

white current Jan 2, 2023, 8:43 AM

#

what the f...

#

thats how they treat their kids...

frank urchin Jan 2, 2023, 8:43 AM

#

yeah its definitely different

#

its weird being THIS cautious about spending money

#

bc im literally this close to being broke bc of my own parents

#

😭

#

the best part is, is both my mom and dad are very against "normal" jobs

#

and would literally shame me if i got a typical job

stone garden Jan 2, 2023, 8:45 AM

#

@stone garden The secret sauce was just to add the embedding name at the front of the prompt as well as around midway and even with the custom model it looks quite a bit like the source images

frank urchin Jan 2, 2023, 8:46 AM

#

so this is literally gonna talk days right

stone garden Jan 2, 2023, 8:47 AM

#

if you have it save after X images, then you can always check the quality before continuing, I think. Not sure about that now that I think about it :O

frank urchin Jan 2, 2023, 8:47 AM

#

its at 500 😭

#

ill just check on it tmr and we'll see how it goes

#

thank you all so much for the help!!

#

ima head to bed now

#

currently 2:48 am here

#

PinkHeart

regal harbor Jan 2, 2023, 8:58 AM

#

I have 300 carefully editing and captioned images, some are closeups of faces, some are single people, some are 2 people together. I'm training on a 1060 6gb, so it's slow.

Could someone tell me an ideal LR to not overtrain, but to maximize my time?

little hollow Jan 2, 2023, 10:00 AM

#

frank urchin so this is literally gonna talk days right

just put it at 10k, and stop anytime

#

let it train as much as possible

split acorn Jan 2, 2023, 10:01 AM

#

10k takes about 40 mins with a 3090 (and GA Steps of 1)

#

Yosh what Taken said

#

10k should be enough to give you good results

little hollow Jan 2, 2023, 10:02 AM

#

regal harbor I have 300 carefully editing and captioned images, some are closeups of faces, s...

there is no ideal
its like shooting in the darkness
low lr might never find a solution not because it is slow, but because it will move in the wrong direction
too high might overshoot as well, so changing your lr is useless

#

you need to find the ideal lr to grad to batch -> this is literally throwing rocks and trying to hear if it hits something

#

takes time

split acorn Jan 2, 2023, 10:03 AM

#

yeppp

little hollow Jan 2, 2023, 10:03 AM

#

certain styles have a "standard" sort of in different models

in 2.1 anime is between 0.1 to 0.0005

#

realistic is ~ 0.05 to 0.001

split acorn Jan 2, 2023, 10:03 AM

#

has anyone had luck training on the new Waifu Diffusion 1.4 epoch 1? Results so far are really bad alicatNF4

#

via dreambooth

#

I'll try going up to 1800 steps, 600 and 1200 were pretty yikers (nope 1800 is also yikes... will probably wait for the full release in 5 days and then fiddle around trying to make it work after)

little hollow Jan 2, 2023, 10:10 AM

#

Can anyone donate his favorite ti templetes?

regal harbor Jan 2, 2023, 10:25 AM

#

little hollow realistic is ~ 0.05 to 0.001

oh, so I've been too low

regal harbor Jan 2, 2023, 10:27 AM

#

little hollow there is no ideal its like shooting in the darkness low lr might never find a so...

gonna try this now; I'm training realistic... but 200 images takes forever, like, 10 steps on each image will take me hours

split acorn Jan 2, 2023, 10:54 AM

#

little hollow Can anyone donate his favorite ti templetes?

I've been liking the [name], [filewords] template personally

little hollow Jan 2, 2023, 10:55 AM

#

split acorn I've been liking the `[name], [filewords]` template personally

i tried that with
filewords, name
name, filewords
name
filewords

#

this can copy style and the drawing style altogether

#

but... it doesn't produce high quality

#

just mid

#

i didn't try just name, filewords yet

split acorn Jan 2, 2023, 10:56 AM

#

Yeah, I could see that being mid quality

regal harbor Jan 2, 2023, 10:56 AM

#

I'm confused

I train one model, then I train another model, but it seems to continue from the last step of the first model (the Lora Model). I don't understand what's going on there

split acorn Jan 2, 2023, 10:56 AM

#

you might have better luck with just [name], [filewords]

#

I've only done this for anime stuff though

regal harbor Jan 2, 2023, 10:57 AM

#

what exactly does Lora Model mean here?

split acorn Jan 2, 2023, 10:57 AM

#

split acorn I've only done this for anime stuff though

Not sure how well it works with non-anime stuff

#

Danbooru tags make training via filewords a breeze. Also Shuffle Tags works really well, imo alicatUwU

little hollow Jan 2, 2023, 10:58 AM

#

yeah, sadly not for 2.1

split acorn Jan 2, 2023, 10:59 AM

#

danbooru tags + 2.1 for training? or

split acorn Jan 2, 2023, 11:00 AM

#

little hollow yeah, sadly not for 2.1

? curious on what comment that's in response to, because I've been struggling with 2.1

#

mind you, I haven't done too many models yet, but... the old settings that worked really well just... aren't

#

alicatHm

little hollow Jan 2, 2023, 11:03 AM

#

the danbooru tangs on 2.1 are useless

#

tags*

#

@split acorn

#

you either need to use clip 2.2
or do it by hand, clip 1 works semi well
danbooru like crap

split acorn Jan 2, 2023, 11:04 AM

#

This includes models like WD 1.4 epoch 1 (which was trained on 2.1)?

little hollow Jan 2, 2023, 11:04 AM

#

this is for example using clip 1

little hollow Jan 2, 2023, 11:05 AM

#

split acorn This includes models like WD 1.4 epoch 1 (which was trained on 2.1)?

dunno, didn't touch wd 1.4 training

#

2.1 has horrible hands and eyes
look at this pic for example above -->> there are no hands, and the eyes are red

#

even though the entire embedding is B&W or yellow tint

split acorn Jan 2, 2023, 11:06 AM

#

yeahhhh, my Clip 1 results are horrible too

little hollow Jan 2, 2023, 11:06 AM

#

eyes in anime usually went above the 0.995 filter --> meaning that eyes are suxual and weren't included in the model

split acorn Jan 2, 2023, 11:06 AM

#

So I should be training using like Clip 2? and then change the settings to Clip 2 when using it?

#

for better results or

little hollow Jan 2, 2023, 11:07 AM

#

split acorn yeahhhh, my Clip 1 results are horrible too

do 8 to 20 words, and use the

#

filter out bad words and such

#

its really easy, you can even filter in the entire dataset or replace words or certain tags

#

usefull as hell, i removed "a drawing of" and "a pencil drawing of" into nothing

#

basically erasing it all in 1 click

little hollow Jan 2, 2023, 11:08 AM

#

split acorn So I should be training using like Clip 2? and then change the settings to Clip ...

yeah clip 2 is better, but no versions can run locally yet

#

everyone gets errors

split acorn Jan 2, 2023, 11:08 AM

#

Run 2.1 models via Clip 2 without getting errors?

little hollow Jan 2, 2023, 11:09 AM

#

clip 2 doesn't work locally

#

no idea why

split acorn Jan 2, 2023, 11:09 AM

#

ahhh gotcha

#

going to give that a try just to see what happens

little hollow Jan 2, 2023, 11:09 AM

#

lots of people tried to make it work, but it just wont cooperate

#

you can use the collab for clip 2, but 30 secs per image

#

sometimes a full minute

split acorn Jan 2, 2023, 11:10 AM

#

it runs, just going to double check that it's doing something

little hollow Jan 2, 2023, 11:10 AM

#

split acorn it runs, just going to double check that it's doing something

wait what? did you manage to make clip 2 run locally?

split acorn Jan 2, 2023, 11:11 AM

#

ooo the results are better but still kinda poor

#

I probably overtrained it a lot

#

going to try a diff model

#

OHH

#

I think I know what's happening

#

Training dreambooth on 2.1 results in models with the same infistructure of 1.5, because it doesn't create a yaml?

#

oh it does make a yaml

#

mmm

#

will post example pictures in a sec

#

Really poor quality though

#

but like, it still works

#

I didn't train it via clip skip 2 though

#

Oh that's not the same seed

#

one sec, deleting and trying again

#

Clip Skip 1:

#

Clip Skip 2:

#

super poor quality but no error message

#

and a little bit of a difference

#

but huge though

#

clip skip 1 being better

#

I'm doing something wrong though. I think the settings definitely need to be different

little hollow Jan 2, 2023, 11:24 AM

#

Clip skip in 2.1 doesnt do anything

split acorn Jan 2, 2023, 11:25 AM

#

It did do something, but I'd agree that it looks broken

#

or at least it's not working as intended

little hollow Jan 2, 2023, 11:25 AM

#

Because it touches floats but it just means that the floats round up to a different close number

split acorn Jan 2, 2023, 11:25 AM

#

ahhh

little hollow Jan 2, 2023, 11:25 AM

#

So basically it doesnt do anything

split acorn Jan 2, 2023, 11:25 AM

#

that would explain the subtle difference

little hollow Jan 2, 2023, 11:26 AM

#

Say thqnks to sinister for explaining that

split acorn Jan 2, 2023, 11:26 AM

#

alicatLove tyty

#

Sweet! So I'll ignore 2.1 training for now BonGoat

little hollow Jan 2, 2023, 11:28 AM

#

Dont ignore it, 2.1 gives off much higher quality results

split acorn Jan 2, 2023, 11:29 AM

#

I could try with the base 2.1 model alicatHm2

little hollow Jan 2, 2023, 11:59 AM

#

2.1 base is gonna give you different results

split acorn Jan 2, 2023, 12:17 PM

#

That's what I'm looking for, different results, because the WD 1.4 epoch 1 results were super bad GoatUppies

#

(2.1 would use different prompting and would need me to redo my filewords, so I'll save that for a future project)

little hollow Jan 2, 2023, 3:13 PM

#

split acorn That's what I'm looking for, different results, because the WD 1.4 epoch 1 resul...

Usually youd wait for 2k 1k steps on 1 grad or
10 grad for 100 steps on 2.1

#

Epoch 1 almost always will be useless, also you can accelerate your training by using good "Init words"
When you are creating the embedding

#

Good init is ~1 to 3 words

split acorn Jan 2, 2023, 3:30 PM

#

Is that true for dreambooth too?

#

I'll give embeddings a try

native lodge Jan 2, 2023, 4:25 PM

#

frank urchin ok so where would i go from here then to make an embedding?

Hello! I have the same error

frank urchin Jan 2, 2023, 4:25 PM

#

native lodge Hello! I have the same error

Don't do it in dreambooth

#

Do it in the training menu

native lodge Jan 2, 2023, 4:27 PM

#

frank urchin Don't do it in dreambooth

Is there any instruction?

frank urchin Jan 2, 2023, 4:27 PM

#

Im only training for my first time so I probably shouldn't give advice haha

#

OMG

#

it worked!

#

i was able to train on a 3060 ti!!

split acorn Jan 2, 2023, 4:32 PM

#

also, fyi it should be only "princess tutu, [filewords]"

frank urchin Jan 2, 2023, 4:32 PM

#

#

iidk how much its gonna suck

split acorn Jan 2, 2023, 4:32 PM

#

yay GoatUppies

frank urchin Jan 2, 2023, 4:32 PM

#

but hey im happy it worked

#

do you have to restart webui for it to load the new embeddings?

split acorn Jan 2, 2023, 4:33 PM

#

nah

#

just type the name of the embedding and it should auto activate. For the "-18000", "-17000" you'll need to click and drag them to your "embeddings" folder

#

Example:
E:\Programs\AI\Auto1111\stable-diffusion-webui\embeddings

frank urchin Jan 2, 2023, 4:34 PM

#

yep!

#

done that

#

time to see how bad it is 💀

split acorn Jan 2, 2023, 4:35 PM

#

now just type the name of the pt to test it out

#

for example "princess_tutu-18000"

#

or just "princess_tutu" for the default

frank urchin Jan 2, 2023, 4:39 PM

#

ngl this aint working 💀

#

its like

#

not using my embedding 😭

native lodge Jan 2, 2023, 4:43 PM

#

frank urchin Do it in the training menu

how to use it?

frank urchin Jan 2, 2023, 4:43 PM

#

native lodge how to use it?

#🔧｜finetune message

split acorn Jan 2, 2023, 4:44 PM

#

frank urchin ngl this aint working 💀

mmm one sec, I'll give you an example

frank urchin Jan 2, 2023, 4:44 PM

#

split acorn mmm one sec, I'll give you an example

ive used embeddings before

#

im not sure why its not working

#

ive got it in the prompt and stuff

#

i tried a few prompts

split acorn Jan 2, 2023, 4:45 PM

#

and the name of the embedding in the embedding folder is "princess_tutu"?

frank urchin Jan 2, 2023, 4:45 PM

#

yep

#

i put 2 in there just in case

#

with different names

#

and tried both

#

split acorn Jan 2, 2023, 4:46 PM

#

weird!

#

That should be working

native lodge Jan 2, 2023, 4:47 PM

#

can someone just give me an example how to?

split acorn Jan 2, 2023, 4:47 PM

#

do you have a screen shot of this area?

frank urchin Jan 2, 2023, 4:48 PM

#

native lodge Jan 2, 2023, 4:48 PM

#

something for noobs

split acorn Jan 2, 2023, 4:49 PM

#

frank urchin

Oh so it's working, it's just working poorly / bad?

#

Also just use princess tutu, [filewords] next time for your instance prompt. The instance prompt you used is super wrong alicatLove

native lodge Jan 2, 2023, 4:49 PM

#

https://youtu.be/HahKXY7AQ8c
It worked for me, but the interface has updated and I don't understand where to click now

YouTube

Aitrepreneur

DREAMBOOTH LOCAL Training Inside Stable Diffusion! CPU OPTION For F...

Dreambooth local training has finally been implemented into Automatic 1111's Stable Diffusion repository, meaning that you can now use this amazing Google’s AI technology to train a stable diffusion model with your own images. You can train a character, an object, a style, or anything you want! There is also a new option that allows you to use D...

▶ Play video

frank urchin Jan 2, 2023, 4:50 PM

#

split acorn Also just use `princess tutu, [filewords]` next time for your instance prompt. T...

oh ok ty 😭

native lodge Jan 2, 2023, 4:50 PM

#

Or can I just bring back the old interface?

#

Just like in this video

frank urchin Jan 2, 2023, 4:51 PM

#

you'd have to downgrade versions

#

which idk if you can do

native lodge Jan 2, 2023, 4:51 PM

#

frank urchin which idk if you can do

what is this?

frank urchin Jan 2, 2023, 4:51 PM

#

native lodge what is this?

downgrading?

native lodge Jan 2, 2023, 4:52 PM

#

idk

frank urchin Jan 2, 2023, 4:52 PM

#

im not sure what youre asking

split acorn Jan 2, 2023, 4:59 PM

#

I

#

It's really not that bad once you get used to it

#

the new DreamBooth UI

storm parrot Jan 2, 2023, 7:02 PM

#

hi all!
I really need expert advice about few questions.
Trained casual artstyle for icons of in-game resources.
Used dreamboot from Thelastben and model 2.1 768
The dataset consisted of 100 images. Iterated the training several times. As a result, it turned out about 30k steps
The result is disappointing, with many images having severely distorted proportions. Also, the model practically stopped responding to the CFG scale values...

Who has already encountered training 2.1, what is the optimal UNet_Training_Steps and UNet_Learning_Rate for a dataset of 200 images?

frank ibex Jan 2, 2023, 7:05 PM

#

What learning rates have you tried?

storm parrot Jan 2, 2023, 7:12 PM

#

frank ibex What learning rates have you tried?

it was 2e-6 learning rate

sleek yoke Jan 2, 2023, 7:29 PM

#

Hi all. Happy new year!! May I ask whether there is some fine-tuning examples for stable diffusion, especially for inpainting model?

honest nexus Jan 2, 2023, 11:07 PM

#

is there any other google colab for textual inversion training? I found only this: https://colab.research.google.com/github/huggingface/notebooks/blob/main/diffusers/sd_textual_inversion_training.ipynb but I don't like it because you can't stop and change learning rate when you're training

Google Colaboratory

oak spear Jan 3, 2023, 1:59 AM

#

Did anyone have any semblance of success training a Textual Inversion with Anything V3 through Automatic Web UI? Because so far it’s failing really badly for me.

tropic quail Jan 3, 2023, 3:18 AM

#

I trained my own model, but it doesnt really follow prompts, seems to just churn out random images in the style of the instance images I used to train it

#

any idea what could be wrong?

amber musk Jan 3, 2023, 5:58 AM

#

Hello, I found a Ti colab for lower GPU user. But i dont know why I got error: RuntimeError: indices should be either on cpu or on the same device as the indexed tensor (cpu) The output can be seen in txt file and
The colab is here: https://colab.research.google.com/drive/11Z1k5rb_Rx-gHQBZA0A40je_tg00nsX8?usp=sharing

📎 actual_training_output.txt

#

This one is for Textual Inversion

real dust Jan 3, 2023, 7:39 AM

#

is there a useful guide somewhere to explain a little better how to decide how to configure ti/hypernetwork training? using automatic1111 there are a lot of options such as reversing images, deepbooru for anime tags vs BLIP interrogator (or both) for automatic image tagging, what prompt templates are recommended - the guides I've found have been very light on detail. Since training takes 8+ hours it takes a long time to conduct experiments.

white current Jan 3, 2023, 12:27 PM

#

Collected 9K images for my diffusion project
gotta collect 18K more
mmm

obsidian idol Jan 3, 2023, 2:34 PM

#

Last night's training was with 1300 images. V2 training has been .. interesting

#

white current Jan 3, 2023, 2:35 PM

#

obsidian idol Last night's training was with 1300 images. V2 training has been .. interesting

What do you train on?

#

What are your settings if ok to ask

obsidian idol Jan 3, 2023, 2:37 PM

#

I'm a newb I'm terrible person to ask for good advice. But webui embed in this case.

#

5950x/3090

#

In terms of introducing many unique subjects with few shared classifications, I've had much more success with dreambooth.

white current Jan 3, 2023, 2:40 PM

#

obsidian idol In terms of introducing many unique subjects with few shared classifications, I'...

I am stuck between DB and finetuning so idk lol

#

some say its same but programs like StableTuner differentiates it

split acorn Jan 3, 2023, 2:41 PM

#

They only differentiate it to make life easier

#

DB for training on a token and Finetuning training on captions

#

They don't need the same settings, so it just disables the settings it doesn't need

white current Jan 3, 2023, 2:43 PM

#

whats the difference between tokens and captions

split acorn Jan 3, 2023, 2:43 PM

#

Have been having good luck with 2.1 training on TI and Hypernetworks but I think I'm doing something wrong with Dreambooth 2.1

#

Token is just the instance token. People use a rare token like "sks". Captions just means the filewords, the words that describe the picture

#

so you train on those words

#

is how I understand it

obsidian idol Jan 3, 2023, 2:52 PM

#

In 1.4, i used the same steps count and samples for embed vs. hypernetwork vs. dreambooth. For my scenario, embed and hypernetwork were similar, and dreambooth was exceptional. ~1000 artistically drawn pokemon with species and types. My measure of success generally was applying typing to different species. "dragontype eevee" for example. an attempt was made at "gmax style."

#

#

"dugtrio GMAX"

little hollow Jan 3, 2023, 3:23 PM

#

obsidian idol Last night's training was with 1300 images. V2 training has been .. interesting

Too much pictures, the ai just goes haywire, better split them into styles

#

For example: bi pedal, quad pedal, wtf this isn't legs pedal etc...

white current Jan 3, 2023, 3:46 PM

#

@little hollow Wait, so more pictures aren't always good? I am probably going to train a sci fi diffusion soon, with a dataset im collecting which has over 26K images

#

Well, sci fi is a really broad topic so i think its fine?

obsidian idol Jan 3, 2023, 3:53 PM

#

I think the issue @little hollow is pointing out is with the singular uniqueness of the samples and names.

#

For my case, I separate them into "styles" by identifying the types (firetype, poisontype), so the result is a dozen or so styles with 60-90 samples respectfully.

white current Jan 3, 2023, 4:13 PM

#

obsidian idol For my case, I separate them into "styles" by identifying the types (firetype, p...

i see

#

i first write a script so it makes a textfile with the filename and the filename written in it, then i append the clip interrogation prompt to the txt file

#

or vice versa

obsidian idol Jan 3, 2023, 4:15 PM

#

BLIP + manual species/type data works well for me. "a cartoon character holding a stuffed animal in its arms and smiling at the camera with a smile on its face, Slowbro, WaterType, PsychicType"

#

white current Jan 3, 2023, 4:19 PM

#

obsidian idol BLIP + manual species/type data works well for me. *"a cartoon character holding...

yeah for 10 images manual is ok for 30k it isnt lul

obsidian idol Jan 3, 2023, 4:20 PM

#

well, let me rephrase -- I have a spreadsheet of numbered subjects and attributes (style, type, gender, region, etc) that I use to script into a string. I append that string to the BLIP. By "manual" I mean not BLIP.

vale egret Jan 3, 2023, 4:21 PM

#

You could use all sorts of things for training pokemon, like dex entries, or the dex species name, or maybe even base stats and abilities

white current Jan 3, 2023, 4:21 PM

#

obsidian idol well, let me rephrase -- I have a spreadsheet of numbered subjects and attribute...

ikik

vale egret Jan 3, 2023, 4:22 PM

#

Then you can submit the results to the CAP project

obsidian idol Jan 3, 2023, 4:22 PM

#

vale egret You could use all sorts of things for training pokemon, like dex entries, or the...

that's what i do; sticking with based attributes such as type right now.

earnest aspen Jan 3, 2023, 4:23 PM

#

Hello,

How are people doing Textual Inversion on Apple Silicon since it appears to not be working on latest version of InvokeAi?

I have tried several google colabs including the official hugginface one, and can't get it to work?

obsidian idol Jan 3, 2023, 4:23 PM

#

results for 2d have been really positive. able to merge species and cross-type species. ok maybe "really positive" isn't the right word, but it's .. within expectations.

#

results for 3d ... well,

little hollow Jan 3, 2023, 4:28 PM

#

white current <@234041104633298954> Wait, so more pictures aren't always good? I am probably g...

If you give the dataset pictures with nothing in common to make something
It will take the only thing it has in common between them
In this case, white background and a basic shape

sudden isle Jan 3, 2023, 4:28 PM

#

#

welp, I got 5 fingers

#

but not what i wanted lmfao

#

obsidian idol Jan 3, 2023, 4:29 PM

#

vale egret Then you can submit the results to the CAP project

what is the CAP project?

sudden isle Jan 3, 2023, 4:29 PM

#

vale egret Jan 3, 2023, 4:29 PM

#

Create-A-Pokemon run by smogon

sudden isle Jan 3, 2023, 4:29 PM

#

I got it to be consistently 5 fingers, except its literal nightmare fuel

#

#

even the off fingers look decent

little hollow Jan 3, 2023, 4:30 PM

#

@white current
Try to place them by a certain category, the more round ones
The more square ones or something

Anything you can think off, and if something has nothing in common with the rest - to the bin with you

sudden isle Jan 3, 2023, 4:31 PM

#

any clue what caused this nightmare fuel with my training? I used a dataset of 11k hands

#

1e-6, 14,000 steps

-mixed_precision=fp16
--train_batch_size=1
--resolution=512
--gradient_accumulation_steps=1
--use_8bit_adam
--train_text_encoder

#

I did this twice

little hollow Jan 3, 2023, 4:38 PM

#

@white current #1045349359044280360 message

Look at this example of how learning works for embeddings at least, the chat between me and sinister, he gave out a long explanation, it lasted ~1 hours so ~ 60 70 chat logs down

#

Some visual explanation from his as well, it really helped me to understand on how to filter

#

From 400 pics i went to 30, and those 30 gave out 10x as much effect than the 400 could

obsidian idol Jan 3, 2023, 5:55 PM

#

UnicodeDecodeError: 'utf-8' codec can't decode byte 0xe9 in position 93: invalid continuation byte

#

Argh. Turns out I was converting the txt files to utf-16 during processing.

#

Headsup. They need to stay utf-8 apparently. heh

regal harbor Jan 3, 2023, 5:56 PM

#

how does the text encoder learn new ideas?

#

if I give it an image with something it's never seen before, and a description, how does it understand what in the image the new thing is?

obsidian idol Jan 3, 2023, 7:28 PM

#

regal harbor how does the text encoder learn new ideas?

Are you asking at a technical level or a procedural one? There are a lot of resources for the procedures, while the technical is mathematical voodoo for me.

regal harbor Jan 3, 2023, 8:29 PM

#

I want to understand theoretically how it thinks, so I can make the right decisions when I curate / edit training data

amber musk Jan 4, 2023, 12:39 AM

#

https://github.com/chavinlo/distributed-diffusion Idk is this what I am hoping for? Training SD over peers like Stable Horde?

GitHub

GitHub - chavinlo/distributed-diffusion: Train a Stable Diffusion m...

Train a Stable Diffusion model over the internet with Hivemind - GitHub - chavinlo/distributed-diffusion: Train a Stable Diffusion model over the internet with Hivemind

gloomy belfry Jan 4, 2023, 10:18 AM

#

amber musk https://github.com/chavinlo/distributed-diffusion Idk is this what I am hoping f...

I wouldn't use that, unless you want your name on some list

dense flame Jan 4, 2023, 11:55 AM

#

How many epochs would you recommend for a 60k image dataset on everydream?

storm parrot Jan 4, 2023, 12:31 PM

#

Hi all!
I have read several guides on training a model in a dreambooth. Everywhere it is written that file names must have a unique identifier. However, the examples in such cases are usually about learning based on one subject.
I am having trouble figuring out the correct approach to naming image files in a dataset if I need to train a model on an artstyle rather than a specific subject.

I am asking for help with the file naming approach if I am training a model based on different subjects that are similar in style.

Should I give descriptive names to the image files in the dataset?
Should I give the files a unique text identifier and include a description of the image in a separate text file?

PS: When training a model based on approach 1.5, I gave the file names a descriptive name by separating the words with spaces. The results were good. However, on version 2, the results deteriorated sharply

prisma nacelle Jan 4, 2023, 6:05 PM

#

Anyone know if adding mirrored images to the data set help with training? If so, do i need to add to the description that the image is mirrored, or should i just name the file as SubjectA Mirror (#).ext ?

little hollow Jan 4, 2023, 6:35 PM

#

prisma nacelle Anyone know if adding mirrored images to the data set help with training? If so,...

if the point of your embedding is the subject like a person - then no its a bad idea, but for a pose and such, concept why not

prisma nacelle Jan 4, 2023, 6:40 PM

#

little hollow if the point of your embedding is the subject like a person - then no its a bad ...

so those guides that have mirrored data sets for their photos of a person were not good? generally they tend to say have x amount of images then mirror it for x2 images in the data.

little hollow Jan 4, 2023, 6:42 PM

#

prisma nacelle so those guides that have mirrored data sets for their photos of a person were n...

imagine someones face with a freckles on the left side, only the left side(from the perspective of someone looking at the person)
and the person having one arm only, lets say his right

you flip the pictures, now what do you have? - sometimes a person with freckles on the left, sometimes on the right, a hand on the right, sometimes on the left

ahh ok, so it must be a person with sometimes L/R freckles, and sometimes L/R hand
4 different variants arise

#

or worse -it might make the face fully symetrical

#

what im saying is correct for 2.1 yeah?, not so sure about 1.4/5

#

should be quite simmilar

prisma nacelle Jan 4, 2023, 6:43 PM

#

oh i see, my thoughts are if you specify that it is mirrored then the AI might take that into account.

little hollow Jan 4, 2023, 6:44 PM

#

just take 4 pics and put a mirror
i think that 20 epochs are enough?
then try to see how well it performs

prisma nacelle Jan 4, 2023, 6:45 PM

#

right, in order for me to test that was why i opened the original question. If i am to specify it is mirrored, would it be done in the text description or the file name?

little hollow Jan 4, 2023, 6:45 PM

#

filename

prisma nacelle Jan 4, 2023, 6:45 PM

#

alright, cool.

little hollow Jan 4, 2023, 6:45 PM

#

the templete might give the regular one the word mirrored sometimes

#

yo, if anyone thinks im wrong tell me, id be glad to learn something new

prisma nacelle Jan 4, 2023, 6:46 PM

#

i'll give it a shot after i set up the data. the main reason is indeed because the data has asymmetrical aspects and i sometimes notice the AI doesn't see it as asymmetrical.

bright plank Jan 4, 2023, 7:07 PM

#

Hey is anyone using runpod for stable diffusion 2-1? I can't seem to get dreambooth working on it

white current Jan 4, 2023, 7:10 PM

#

bright plank Hey is anyone using runpod for stable diffusion 2-1? I can't seem to get dreambo...

There should be some tutorials on that

bright plank Jan 4, 2023, 7:11 PM

#

I've watched the tutorials I've found, the dreambooth tab just doesn't show up for me. The tutorials for the Joe Penna notebook don't work for 2-1

white current Jan 4, 2023, 7:13 PM

#

bright plank I've watched the tutorials I've found, the dreambooth tab just doesn't show up f...

Hm weird

obsidian idol Jan 4, 2023, 8:53 PM

#

For sample descriptions (filewords/captions), is it problematic to have superfluous language and punctuation? For example:

a squat, quadrupedal amphibian with bumpy, blue-green skin. It has small, circular red eyes and a short, blunt snout. Its mouth is wide with two pointed teeth in the upper jaw and four in the lower jaw. On top of its head are small, pointed ears with reddish pink insides. It has three clawed toes on each foot.

#

Not because I like to write verbose descriptions, but because there are some well-written descriptive language that I can reference, and ideally programmatically.

rain tapir Jan 4, 2023, 10:16 PM

#

bright plank Hey is anyone using runpod for stable diffusion 2-1? I can't seem to get dreambo...

It is currently not working

#

Even if you get those juicy yaml files, the training sucks as of now

#

It just doesnt train well on that model, there may be a way to do it, but I have yet to see someone do it successfully

ornate flare Jan 5, 2023, 3:52 AM

#

#

I'm assuming you're all using dreambooth

#

this runs all the way but the final model is the exact same as the initial

#

The only weird thing is how it's "only" using 9gb of ram

#

I don't think it's training at all

#

i can change the learning rate to 1 and nothing changes

split acorn Jan 5, 2023, 7:05 AM

#

where does that come from? alicatHm2

ornate flare Jan 5, 2023, 7:09 AM

#

dreambooth notebook

#

https://colab.research.google.com/github/ShivamShrirao/diffusers/blob/main/examples/dreambooth/DreamBooth_Stable_Diffusion.ipynb#scrollTo=5V8wgU0HN-Kq

Google Colaboratory

#

If i try on colab, same exact settings

#

it does work

#

decently well actually

#

#

it's the default dog whatever

#

if i try it locally however it doesnt work

#

im on a 3080

ornate flare Jan 5, 2023, 7:13 AM

#

split acorn where does that come from? <:alicatHm2:906175728490528828>

is there a different way of doing it?

#

i think the problem might be xformers

ornate flare Jan 5, 2023, 8:03 AM

#

i installed xformers

#

does not fucking work it's driving me nuts

split acorn Jan 5, 2023, 8:32 AM

#

mmm, you might have better luck with this one:
https://github.com/d8ahazard/sd_dreambooth_extension

#

or using the colab

#

There's also this, which is pretty easy, as well:
https://github.com/devilismyfriend/StableTuner

ornate flare Jan 5, 2023, 8:57 AM

#

lovely thanks

livid axle Jan 5, 2023, 10:00 AM

#

Hi there! I know how to train for faces and I know how to train for styles... but how would I do that for specific body parts like a hair-style? 🤔

prisma nacelle Jan 5, 2023, 10:23 AM

#

livid axle Hi there! I know how to train for faces and I know how to train for styles... bu...

what kind of model you working with?

livid axle Jan 5, 2023, 10:25 AM

#

I stick more with the v1.4 and 1.5 Versions 🙂

prisma nacelle Jan 5, 2023, 10:32 AM

#

so from my understanding after trying to train a character with a specific hair style, you would need samples of the data set that has many people with the same hair style. you can probably make this yourself using img2img and inpainting.

after you have enough data for it just set it up with the right descriptions and it should be able to train it.

livid axle Jan 5, 2023, 10:37 AM

#

prisma nacelle so from my understanding after trying to train a character with a specific hair ...

Ok, I try with inpainting, thank you 🙈

prisma nacelle Jan 5, 2023, 10:39 AM

#

good luck. lol i spent the last few days stuck with my training only to find out the build i was using was bugged.

ornate flare Jan 5, 2023, 12:53 PM

#

I am about to xform so hard

dark pivot Jan 5, 2023, 3:13 PM

#

I'm trying to train a textual inversion embedding for sd 2.1, but I keep getting the error Sizes of tensors must match except in dimension 0. Expected size 1024 but got size 768 for tensor number 1 in the list. Does anyone know what I'm doing wrong based on that error?

obsidian idol Jan 5, 2023, 3:28 PM

#

are your samples uniform? and "resolution" set correctly?

#

jolly bear Jan 5, 2023, 7:32 PM

#

Just the settings I use on StableTuner to train on a 12GB 3060 card. I also set sampling to more than the total number of steps to avoid any samples, I do sample epochs and save the epoch though.

#

If you don't train the text encoder you can set the batch size higher, I've used 4 successfully but I've heard people using higher numbers.

#

Train epochs can be set to as high as you want.

#

As each epoch is the total number of images seen once (each step=one image) it can take quite a while on a 3060 card, I usually set it to 5-20 epochs, but depending on your needs you might want to set it higher. I've got some pretty good results with 10 epochs and more.

restive orchid Jan 5, 2023, 7:56 PM

#

jolly bear Just the settings I use on StableTuner to train on a 12GB 3060 card. I also set ...

That guy on yt was supposed to share a notebook for Linux users.. i couldn't find any.. do u have hold of it to run on runpod or colab ?

robust urchin Jan 5, 2023, 9:18 PM

#

@torpid oar ici pour les questions

#

sur dreambooth

frank urchin Jan 5, 2023, 11:19 PM

#

no idea why i cant run this on 8GB

#

are there any settings or something i messed up?

#

keep getting this

vale egret Jan 5, 2023, 11:27 PM

#

That’s only half the error

prisma nacelle Jan 6, 2023, 1:03 AM

#

anyone get dreamartist to work for training? i can't seem to get it to work and would like to experiment with it a little to see how it stacks up with other tuning methods.

jolly bear Jan 6, 2023, 1:48 AM

#

restive orchid That guy on yt was supposed to share a notebook for Linux users.. i couldn't fin...

I run it natively on windows, can't help you with Linux.

restive orchid Jan 6, 2023, 4:57 AM

#

jolly bear I run it natively on windows, can't help you with Linux.

Oh no prob bro.. cheers

stray kindle Jan 6, 2023, 6:33 AM

#

Any tips for textual inversion with a photograph style?

vale egret Jan 6, 2023, 7:23 AM

#

prisma nacelle anyone get dreamartist to work for training? i can't seem to get it to work and ...

There’s a lot of discussion about it on the extension github page, with different people getting different results. I’d also be happy to learn the right way to do it, so lmk if you figure it out

https://github.com/7eu7d7/DreamArtist-sd-webui-extension/issues/18

GitHub

Any successful result replication? · Issue #18 · 7eu7d7/DreamArtist...

Hey guys, I am just wondering if anyone has successfully replicated the 1 image embedding and recreated similar results from 7eu7d7? Right now I have no luck testing it myself. Training time for th...

gloomy belfry Jan 6, 2023, 7:29 AM

#

restive orchid That guy on yt was supposed to share a notebook for Linux users.. i couldn't fin...

you get a notebook when you do a cloud export

peak canopy Jan 6, 2023, 9:30 AM

#

livid axle Hi there! I know how to train for faces and I know how to train for styles... bu...

@prisma nacelle I'm wondering how can we train a new model with faces? What repository or guide can I use to extend the base SD model? Any quick ideas will be helpful

livid axle Jan 6, 2023, 9:32 AM

#

peak canopy <@105856953049198592> I'm wondering how can we train a new model with faces? Wha...

https://colab.research.google.com/github/ShivamShrirao/diffusers/blob/main/examples/dreambooth/DreamBooth_Stable_Diffusion.ipynb#scrollTo=Rxg0y5MBudmd

Google Colaboratory

peak canopy Jan 6, 2023, 9:34 AM

#

livid axle https://colab.research.google.com/github/ShivamShrirao/diffusers/blob/main/examp...

Oh, I thought this is for faces. If I want to train a new model using 100s of hand images so that SD can generate good pictures of people with good fingers/hands, is dreambooth useful in that case? or any other ways we can approach this?

livid axle Jan 6, 2023, 9:36 AM

#

Dreambooth and Textual Inversion should both work for this 🙂

peak canopy Jan 6, 2023, 9:37 AM

#

livid axle Dreambooth and Textual Inversion should both work for this 🙂

Great, is there any other training methods generally available to extend the SD model?

#

just trying to learn all these tech behind the fine-tuning and training

livid axle Jan 6, 2023, 9:40 AM

#

peak canopy Great, is there any other training methods generally available to extend the SD ...

I also know this thing: https://github.com/victorchall/EveryDream
And aesthetic embedding, but the last one seems more to be for style.

GitHub

GitHub - victorchall/EveryDream: Advanced fine tuning tools for vis...

Advanced fine tuning tools for vision models. Contribute to victorchall/EveryDream development by creating an account on GitHub.

peak canopy Jan 6, 2023, 9:43 AM

#

oh yeah, I've heard about it. I thought it was using a dreambooth way. But looks like it's a different one. It's helpful.

peak canopy Jan 6, 2023, 9:45 AM

#

livid axle Dreambooth and Textual Inversion should both work for this 🙂

I'm planning to start with dreambooth first, in that case

Regularization images: `a photo of hand`  - generated with inference
Class: `hand`
Instance token: sks_hand

After training, the prompt can be like → a photo of man saying hi, sks_hand

#

Is my approach is right? any feedback?

livid axle Jan 6, 2023, 9:50 AM

#

peak canopy Is my approach is right? any feedback?

Seems rights to me, Feedback: the sks thing is a random example term, you can use something own if you like^^

peak canopy Jan 6, 2023, 9:51 AM

#

for sure, thank you so much for your answers 👍 🙏

#

It's really helpful and I'll also explore Everydream

full knot Jan 6, 2023, 2:11 PM

#

for shiro dreambooth, the instance token is the instance_data_dir right ?

#

i mean it rely on the directory name ?

#

rain tapir Jan 6, 2023, 6:03 PM

#

That's the subject

#

But yeah, I think they call it the token

full knot Jan 6, 2023, 7:12 PM

#

thanks, so for the shivaro db the only triggers is from the directory name or the whole instance prompt ?

pure blade Jan 6, 2023, 7:49 PM

#

full knot thanks, so for the shivaro db the only triggers is from the directory name or th...

what the folder name is doesn't matter, its just the path to where you want to store your training images

full knot Jan 6, 2023, 8:07 PM

#

yeah, i just want to know what is the instance "trigger" on shivaro db x)

#

since there is no specific field for it

pure blade Jan 6, 2023, 8:11 PM

#

the prompts decide that

full knot Jan 6, 2023, 8:23 PM

#

i see thanks

finite creek Jan 6, 2023, 9:04 PM

#

Hello, anybody knows of a tutorial or document going through all the settings in Dreambooth A1111 webui?

split acorn Jan 6, 2023, 9:07 PM

#

https://www.youtube.com/watch?v=9Nu5tUl2zQw

YouTube

Olivio Sarikas

DreamBooth for Automatic 1111 - Super Easy AI MODEL TRAINING!

DreamBooth for Automatic 1111 is very easy to install with this guide. With DreamBooth for Automatic 1111 you can train yourself or any other subject. Use your own trained Model to create images in your styles or of yourself. The DreamBooth training in for Automatic 1111 takes only around 30-40 minutes with a good GPU.

LINKS From Video ##...

▶ Play video

#

There is some information missing, but it's a really good start!

#

For best results, generally your dataset should be with different backgrounds, clothing, lighting, expression and different camera distance/angles. In the video they used an unideal dataset, but the rest is pretty good!

finite creek Jan 6, 2023, 9:27 PM

#

split acorn https://www.youtube.com/watch?v=9Nu5tUl2zQw

Thank you Alicat! I have followed it, its pretty good. Having issues training an object (a car), I did it once and it came out pretty good, now Im trying it again with a variation and not working so well. Not sure what went wrong...

cunning isle Jan 6, 2023, 9:52 PM

#

just had a little disucssion in #🌶｜off-topic where someone asked "how many images do you need for finetuning" .. answers varied from 10-1000s ("2.1 can't be tuned on 10 with good results" "not dreambooth"...) whats the situation , i'd basically heard "fine tuning is possible with a few dozen", but aparently more can also help increase accuracy for a model narrowed for a more specific domain?

split acorn Jan 6, 2023, 9:57 PM

#

I don't think there's an answer to that. It all depends on what you're trying to do and what you have available

#

How many images for training can vary from 1 to 1000s yeah

cunning isle Jan 6, 2023, 10:00 PM

#

(haven't started on finetuning myself , I need a new PSU for a bigger GPU first.. but basically I'm interseted in generating game art - textures, background wraps - and I have some hacks in mind to try and project onto scenery from keypoints - anyway a fine-tune on sci-fi film stills could help out I guess.. I wouldn't want it to replicate specific copyrighted things but just be better at making 'generic futuristic buildings' etc)

split acorn Jan 6, 2023, 10:03 PM

#

Here's a good guide for styles:
https://github.com/nitrosocke/dreambooth-training-guide

GitHub

GitHub - nitrosocke/dreambooth-training-guide

Contribute to nitrosocke/dreambooth-training-guide development by creating an account on GitHub.

#

a bit outdated, but the information is still relevant

finite creek Jan 6, 2023, 10:22 PM

#

split acorn Here's a good guide for styles: https://github.com/nitrosocke/dreambooth-trainin...

Thank you, this is quite helpful 👍🏻

honest nexus Jan 6, 2023, 11:10 PM

#

is there a way to resume a textual inversion with this google colab notebook? https://colab.research.google.com/github/huggingface/notebooks/blob/main/diffusers/sd_textual_inversion_training.ipynb

Google Colaboratory

prime perch Jan 7, 2023, 12:53 AM

#

Hi, im pretty new to this, but have been playing around with embeddings. I want to copy the style of art from the rules book of the RPG my group is playing. So I trained it on 50 character images like the one on the left for 100,000 steps and its produces... the right. Any idea how I train it to understand what this art style is like, or even just what a human face is? Using the embedding leads to all pictures turning into monstrosities like the one on the right. *Not sure if this is the right place as this is an embedding not a model, but I didn't see an embedding fine tune channel. *

finite ivy Jan 7, 2023, 2:02 AM

#

I'm in pretty much the same boat as @prime perch, I've trained 8 different embeddings trying to generate images with my likeness, but I get results like this (second). This is trained with 18 images similar to the first image, trained with the colab notebook that @honest nexus posted a few messages ago. Any thoughts? Image gen info: 22 steps Euler a on protogenDragon. I've tried with the default SD1.5 model, but the results are even less coherent. (at least it gets my hair right though, ha)

4274722930-2093819657-professional_photo_of_benclements_in_black_shirt.png

honest nexus Jan 7, 2023, 2:16 AM

#

finite ivy I'm in pretty much the same boat as <@82665835100897280>, I've trained 8 differe...

try another model, sd is pretty bad for textual inversion. I suggest you elldreth vivid mix

finite ivy Jan 7, 2023, 2:23 AM

#

honest nexus try another model, sd is pretty bad for textual inversion. I suggest you elldret...

Ok will do! Is there a convenient way to find that model? Google leads me to a reddit post with this model
https://civitai.com/models/1259/elldreths-og-4060-mix

Elldreth's OG 4060 mix | Stable Diffusion Checkpoint | Civitai

This mixed model is a combination of my all-time favorites. A genuine simple mix of a very popular anime model and the powerful and Zeipher's fantastic f222.What's it good at?Realistic portraitsStylized charactersLandscapesFantasySci-FiAnimeHorrorIt's an all-around easy-to-prompt general purpose semi-realistic to realistic model that cranks out ...

prime perch Jan 7, 2023, 7:21 AM

#

finite ivy Ok will do! Is there a convenient way to find that model? Google leads me to a r...

Just search for Elldreth on that site. its there. Not getting much better results from it myself sadly.

steel eagle Jan 7, 2023, 8:05 AM

#

Anyone used dreambooth to make a LORA? wondering how to use the .pt file it generates in models/LORA in A111, the additional networks extension won't load those

honest nexus Jan 7, 2023, 1:12 PM

#

finite ivy Ok will do! Is there a convenient way to find that model? Google leads me to a r...

i'm working with elldreth vivid mix and works pretty fine with textual inversion

#

I still think the best way to train textual inversion is from automatic1111, lowering the learning rate every 300 steps

little hollow Jan 7, 2023, 1:39 PM

#

oi, ACCELERATE implemented into a1111?

#

i saw a few results from training using it, they were freaking top notch

#

even replicating everything that he did gave me about idk a third of his quality?

honest nexus Jan 7, 2023, 1:42 PM

#

little hollow oi, ACCELERATE implemented into a1111?

how to activate this accelerate

little hollow Jan 7, 2023, 1:43 PM

#

honest nexus how to activate this accelerate

its in one of the diffusors, im not quite sure what it is
or how it works, i know that it does work in DB or was it HN?

winter apex Jan 7, 2023, 2:29 PM

#

cunning isle just had a little disucssion in <#1002601204901236756> where someone asked "how...

i trained myself with 20 images in SD 2.1 and turned out very good, the most important thing is the dataset

lethal totem Jan 7, 2023, 2:35 PM

#

Can we train SD on 32x32 and 7x1 images for example?

cunning isle Jan 7, 2023, 2:36 PM

#

winter apex i trained myself with 20 images in SD 2.1 and turned out very good, the most imp...

thanks for the info, thats encouraging

limber peak Jan 7, 2023, 3:28 PM

#

Is there a good low vram (sub 10gb) version of dreambooth out there currently?

little hollow Jan 7, 2023, 4:03 PM

#

new au1111 has Gradient Clipping --modes: norm/value
default is 0.1 - any ideas what it is and what it does? seems like a cool new option but figuring it out is gonna take way too long alone

finite ivy Jan 7, 2023, 4:08 PM

#

honest nexus try another model, sd is pretty bad for textual inversion. I suggest you elldret...

Currently using this, preliminary results are looking good at 850 steps at LR .005:300,.001:500,.0005

honest nexus Jan 7, 2023, 4:16 PM

#

finite ivy Currently using this, preliminary results are looking good at 850 steps at LR .0...

do you use google colab notebook or automatic1111?

finite ivy Jan 7, 2023, 4:17 PM

#

Auto1111, I couldn't quickly figure out how to get the model into the colab notebook you sent so I just went local

#

Here are the results at about 1050 steps, still takes some coaxing to get right. Might be some trouble with my dataset with multiple people, but this is pretty good!

#

First is generated, second is reference

4274723029-1424997576-photo_of_benclements_mod6_man_close_up_alone_sharp_focus.png

lethal totem Jan 7, 2023, 4:47 PM

#

finite ivy First is generated, second is reference

add some negatives

#

and this will be awesome

#

pos is: photo_of_benclements_mod6_man_close_up_alone_sharp_focus

#

maybe negative is: lowres, low_resolution, bad_light, bad_shadows

finite ivy Jan 7, 2023, 4:49 PM

#

Thanks! I appreciate the help!

lethal totem Jan 7, 2023, 4:51 PM

#

and show me example

#

🙂

#

what will change

finite ivy Jan 7, 2023, 5:08 PM

#

I'm away from my computer right now 🥲 I will keep the channel updated though. Also fwiw, this is on a 6gb card, so this embedding stuff can be accessible to more people than training a whole dreambooth model!

honest nexus Jan 7, 2023, 5:32 PM

#

finite ivy I'm away from my computer right now 🥲 I will keep the channel updated though. A...

i've tried to train with my 1060 but cuda goes out of memory

finite ivy Jan 7, 2023, 5:35 PM

#

honest nexus i've tried to train with my 1060 but cuda goes out of memory

Interesting. I believe that's the same card I have. Have you tried smaller image sizes? --medvram? --xformers? I'm using those args and I'm able to train on 512x512 images

honest nexus Jan 7, 2023, 5:36 PM

#

yep, 512x512 and xformers, but never tried --medvram

finite ivy Jan 7, 2023, 5:45 PM

#

honest nexus yep, 512x512 and xformers, but never tried --medvram

I'm not sure if it helps in training, but it certainly allows larger image generation for me

finite ivy Jan 7, 2023, 7:01 PM

#

@honest nexus I also used the modifications in this reddit post. Give it a shot and see if you can get it to work
https://www.reddit.com/r/StableDiffusion/comments/yibx9b/successful_hypernetwork_training_on_a_6gb_vcard/?utm_source=share&utm_medium=android_app&utm_name=androidcss&utm_term=1&utm_content=share_button

r/StableDiffusion - Successful hypernetwork training on a 6GB vcard

51 votes and 44 comments so far on Reddit

astral mica Jan 7, 2023, 7:53 PM

#

Can anyone point me in the right direction? I'm trying to follow Aitrepreneur's instructions to create a textual inversion embedding in runpod using the Auto1111 UI and keep on getting a {} error when trying to preprocess images.
https://www.youtube.com/watch?v=4E459tlwquU&t=659s

vale egret Jan 7, 2023, 8:22 PM

#

astral mica Can anyone point me in the right direction? I'm trying to follow Aitrepreneur's ...

Read the terminal

astral mica Jan 7, 2023, 8:57 PM

#

vale egret Read the terminal

Nothing's happening in the terminal when I create the embedding and try to preprocess the images...

glass dove Jan 7, 2023, 8:58 PM

#

hi, so i dreambooth trained using the colab for 2k steps, is it possible to go for more steps without having to start over?

#

results feel a little undertrained

full knot Jan 7, 2023, 9:36 PM

#

you can start with your previous created diffuser model

#

instead of starting from 1.5 or whatever your base is

tidal cliff Jan 8, 2023, 2:17 AM

#

anyone know why the TI templates have so many lines in them? like... painting of [name], rendering of [name], etc... what's the point of all this

#

im not sure how these templates are even used I guess

#

are they using them to create... potential images of your [name] while training... but then how does the algorithm determine if the resulting image is "good" or not... like... what's driving the loss function of the optimizatino routine

#

with each step of the training process... how does SD figure out if it's going in the right direction or not...

#

I had thought that it was using back propagation on your set of training images... using the captions that you write for each one as input and the actual photo as the correct answer

#

but in that case, what's the point of the TI templates

#

are they just there so that you produce a variety of different images for like... qualitative evaluation while the thing is running? but they serve no purpose in the actual training process ?

crisp cloak Jan 8, 2023, 3:28 AM

#

finite ivy I'm not sure if it helps in training, but it certainly allows larger image gener...

Is it possible to train embeddings using the google colab notebook?

finite ivy Jan 8, 2023, 3:36 AM

#

crisp cloak Is it possible to train embeddings using the google colab notebook?

Yes, but for high quality results of a person's likeness you will need to use a model other than SD 1.5, I posted a result a couple messages back with my results from that colab notebook and the results were subpar. I haven't done much looking into it but you need to figure out how to import a model other than SD 1.5. another user recommended me eldreths vivid model, and it's worked really well in my local installation, so I would recommend that.

crisp cloak Jan 8, 2023, 3:43 AM

#

finite ivy Yes, but for high quality results of a person's likeness you will need to use a ...

You can upload the custom model to your google drive and just copy and paste the path “ckpt path” or just insert the huggingface ckpt link.

But this is for training models. For some reason I can’t train embeddings using google colab. I’ve tried several times. I can’t find any resources on that either

finite ivy Jan 8, 2023, 3:49 AM

#

crisp cloak You can upload the custom model to your google drive and just copy and paste the...

Can you link the colab notebook you're using? This is the one I used
https://colab.research.google.com/github/huggingface/notebooks/blob/main/diffusers/sd_textual_inversion_training.ipynb

Google Colaboratory

crisp cloak Jan 8, 2023, 3:52 AM

#

finite ivy Can you link the colab notebook you're using? This is the one I used https://col...

https://colab.research.google.com/github/TheLastBen/fast-stable-diffusion/blob/main/fast-DreamBooth.ipynb

I tried it a week or two back on this one

Google Colaboratory

crisp cloak Jan 8, 2023, 3:54 AM

#

crisp cloak https://colab.research.google.com/github/TheLastBen/fast-stable-diffusion/blob/m...

I would go into the gradio app link and to the train tab but when I started the preprocessing it would keep giving me errors

crisp cloak Jan 8, 2023, 3:55 AM

#

finite ivy Can you link the colab notebook you're using? This is the one I used https://col...

Oh wow. I had no idea there was a separate notebook for training embeddings. Thankyou so much bro. I’m gonna try this today. Really appreciate all the help

finite ivy Jan 8, 2023, 3:55 AM

#

crisp cloak https://colab.research.google.com/github/TheLastBen/fast-stable-diffusion/blob/m...

Yes, ok, so there is a difference between that and textual inversion embeddings. What you linked is for training a dreambooth model, which as you stated, creates a whole new 2GB .ckpt model on whoever you provide it pictures of. This works really well, but ckpt files are large and you can only use one at a time, but TI embeddings are harder to train it seems, but you can use them on top of nearly any model, so long as the version matches

#

No prob! I only found that a day or so thanks to another user. Again, you'll likely have to figure out how to import a different model into that notebook bc the default 1.5 model is not that great at it.

crisp cloak Jan 8, 2023, 3:58 AM

#

finite ivy No prob! I only found that a day or so thanks to another user. Again, you'll lik...

Ohh. I will definitely let you know if I figure that out

finite ivy Jan 8, 2023, 3:59 AM

#

crisp cloak Ohh. I will definitely let you know if I figure that out

Please do! That notebook is in theory 4x faster than my PC, so viable results with that would be huge!

#

@tidal cliff
The templates are used to give the embedding some 'context' to what you're training. The template file provided probably isn't suitable for plug and play into most trainings. I created a custom template file named 'custom_subject.txt' that only contains "a photo of [name], [filewords]". This is sufficient for training on a likeness of a person, in fact, for my own likeness, I created a template file that only contained "a photo of [name], a close up photo of a young man" to some pretty good results. The [filewords] is a caption that describes the image. This is useful because without it, the embedding will pick up on things in your training images. Say for example you have a lot of pictures of you in a black shirt with trees in the background, then the generations of your trained embedding will favor you in a black shirt and trees in the background, so by letting the network know what is NOT you in the image, the embedding becomes more versatile. I hope that makes sense?

crisp cloak Jan 8, 2023, 4:02 AM

#

finite ivy Please do! That notebook is in theory 4x faster than my PC, so viable results wi...

https://www.reddit.com/r/StableDiffusion/comments/zqlhwn/is_it_possible_to_train_an_embedding_using/

I also just found out this link as I was asking you. Might want to give this a try as well

r/StableDiffusion - Is it possible to train an embedding using Auto...

2 votes and 3 comments so far on Reddit

finite ivy Jan 8, 2023, 4:03 AM

#

crisp cloak https://www.reddit.com/r/StableDiffusion/comments/zqlhwn/is_it_possible_to_train...

I've seen this error before actually. You have to find the line
logvar_t = self.logvar[t].to(self.device)

and change it to

logvar_t = self.logvar[t.cpu()].to(self.device)

crisp cloak Jan 8, 2023, 4:05 AM

#

finite ivy I've seen this error before actually. You have to find the line logvar_t = self...

Oh. Thankyou so much for all the help. I’m going to try this all. Hopefully I can start making the embeddings I want to 🤞

finite ivy Jan 8, 2023, 4:07 AM

#

crisp cloak Oh. Thankyou so much for all the help. I’m going to try this all. Hopefully I ca...

No prob! I never thought I really knew much about this stuff but we close to the frontier here, so experiment away!

final patrol Jan 8, 2023, 4:09 AM

#

Since LORA's been around for a bit, is there a general opinion on its usefulness? I know it's a smaller file, so ignoring that, does it have any particular weaknesses/strengths? I assume it's somewhere between TI and Hypernetwork?

split acorn Jan 8, 2023, 6:44 AM

#

it's like dreambooth lite

#

can train with less VRAM and the added bonus of smaller file sizes

#

there's also a convenient webui extension that lets you try it out with various models and lets you adjust the strength too

#

With first impressions, I prefer TI and Dreambooth to LoRA, but I'm still pretty new to it

obtuse shard Jan 8, 2023, 7:31 AM

#

finite ivy Can you link the colab notebook you're using? This is the one I used https://col...

Uhh. What’s inversion training?

empty shuttle Jan 8, 2023, 7:57 AM

#

Anyone want to test (for free) a dreambooth service focused on video game characters? I mostly am looking for feedback on whether you like the results. Link is https://polymorf.me/ . Send me a DM and I'll send you a custom test link. We use a combination of diffusers + dreambooth + some Textual inversion + some img2img depending on character

white current Jan 8, 2023, 9:58 AM

#

final patrol Since LORA's been around for a bit, is there a general opinion on its usefulness...

my lora test run

#

(very small part of a 2k image tank dataset)

ornate flare Jan 8, 2023, 10:27 AM

#

Can someone explain me something

#

can i actually train stable diffusion on 1024x1024 images by selecting resolution = 1024?

#

or if i do that does it just downsample my input to 512x anyways?

white current Jan 8, 2023, 10:32 AM

#

ornate flare can i actually train stable diffusion on 1024x1024 images by selecting resolutio...

you can in theory, but if you dont have 80gb vram or sth, you cant

ornate flare Jan 8, 2023, 10:32 AM

#

kk

#

it works on 768x768 on my 3080

#

1024 crashes just checked now

spare herald Jan 8, 2023, 10:54 AM

#

can anyone offer advice on why a hypernetwork seems to have no effect?
I got good results during training but it doesn't seem to affect the image gen

agile wadi Jan 8, 2023, 11:34 AM

#

So I've been succesfully training Embeds with my 3070ti 8gb, but when I increase the batch size over 1, I get a CUDA out of memory error - I've seen people succesfully train on 8gb with batch sizes of 4-6 before, any ideas why this is happening? I'm launching with --xformers --opt-split-attention --medvram

finite ivy Jan 8, 2023, 1:52 PM

#

spare herald can anyone offer advice on why a hypernetwork seems to have no effect? I got goo...

I've never had much success with hyper networks, but if you're in auto1111, have you made sure it's mounted in your settings? Much like you have to select an SD model, you need to select a hyper network. I trained one once and it did change image generation for sure, but not in an effective way

spare herald Jan 8, 2023, 1:53 PM

#

thanks for your help, I did have to figure out that stuff but I think I did get it working

#

I think in easy sd the setting takes time to turn on and off, in automatic1111 it does seem to work when you enable and reset the ui

#

doing smash bros it's not easy

#

I'm having some success, need to do a lot more description and having more reference images will make it better but I'm already 500 img in smh

#

went from the machine having no idea who mr game and watch is to having a pretty good idea, 8 hour training tho rip

#

most people just trying to put in one face I wish it was that easy for me

finite ivy Jan 8, 2023, 1:58 PM

#

Rip yeah. Local training is slow for me as well, I got embed training working on my 6gig card, but takes time. Can hyper networks do multiple subjects like that? I didn't know that was possible, if so, cool!

spare herald Jan 8, 2023, 1:58 PM

#

yes

#

like I said most tutorials or examples is 1 face

#

so I wasn't sure if it would be ok with 89 subjects + their sidekicks and cohords

#

cohorts

#

and with some settings it does reach what I call the singularity where all the characters are mashed up but

#

my last training went like this

#

oh I can't post pic

#

https://imgur.com/a/2ZLhWDP

Imgur

Untitled Album

#

imo

#

smarter than some of the irl people I've tried to explain smash bros to

#

it def has a hard time looking at 2 subjects in 1 pic that it doesn't know

#

so I'm having to break down like banjo and kazooie etc

#

the koopalings are a nightmare for it but I am helping

finite ivy Jan 8, 2023, 2:03 PM

#

Interesting! I'd like to see how you make out with this!

spare herald Jan 8, 2023, 2:03 PM

#

the training in most of the models for mario series subjects is just wack af and is hindering the process but we'll get there

#

if nintendo doesn't find out

#

don't tell em

finite ivy Jan 8, 2023, 2:05 PM

#

Safe with me 🤐

spare herald Jan 8, 2023, 2:05 PM

#

I had to do it cause it's like 98% anime girls and I'm like yo show me yoshi

#

and it's like some japanese guy

#

I cannot stand for this

#

I'm concerned about weights

#

like, for example the pikmin people there's 2 dudes and 5 pikmin and so I have multiple shots of the pikmin cause there's 5 colors of each and 3 stages of growth

#

it's rough being me

#

so like, is olimar and the pikmin gonna weigh more than say cloud who the machine already knows or mr game and watch who it's clueless about

#

we'll find out I guess

finite ivy Jan 8, 2023, 2:10 PM

#

So long as they're labelled correctly in your dataset though that shouldn't be too huge right? Also with such a large and varied dataset I would set the LR really low

spare herald Jan 8, 2023, 2:10 PM

#

uh

#

I'm new

#

what's lr

#

oh I got another question too

#

so

#

can I use this .pt in conjunction with deepdanbooru so I can stop labeling the shit at some point?

#

like can I feed my smash dataset another dataset of smash images and have it point out who's in the img once it knows

finite ivy Jan 8, 2023, 2:13 PM

#

learning rate, by default it's set to .005, in my recent experience with embeddings, .005 becomes too fast around 800 steps or so, so I went down to .001 and then .0005

spare herald Jan 8, 2023, 2:13 PM

#

I'm running .0000001

finite ivy Jan 8, 2023, 2:13 PM

#

But it might be something you play with. If you end up with generations that look beyond screwed up then your LR is too high

spare herald Jan 8, 2023, 2:13 PM

#

for 100000 step

#

yeah I've reached singularity before

finite ivy Jan 8, 2023, 2:14 PM

#

I'm not overly familiar with LR for hyper networks, so I don't know if that's lower high but that's definitely one of the knobs to turn as you start to refine your process

spare herald Jan 8, 2023, 2:14 PM

#

tbh around 30 or 40k it has it mostly figured

#

but like subjects that are really similar it doesn't have it figured by 100k

#

duck hunt is hard cause of the name

#

there's 2 roys

#

it's rough stuff

#

I think it's a worthy endevour I hope at least, but I'm worried when I release it nintendo gonna shut the whole sd project down haha

tidal cliff Jan 8, 2023, 2:31 PM

#

finite ivy learning rate, by default it's set to .005, in my recent experience with embeddi...

thanks for your reply re: the template file. How many vector per token have you used to get good results on a person trained on photos? Also, how many epochs did you let it run at 0.001 and 0.0005? I find it's hard to get a sense if the model learning too fast or too slowly

finite ivy Jan 8, 2023, 2:35 PM

#

tidal cliff thanks for your reply re: the template file. How many vector per token have you...

8:00 to 10:00 seems to be good for getting someone's likeness. As far as learning rate, you know it's gone too fast when you generate something with it and it looks otherworldly and not even close to sensible. When an embedding is undertrained The output images will look reasonable and you can tell it's starting to get the idea of what your subject is, but it's not quite there yet. The case of a learning rate that is too high is worse than too low. A good schedule that I found is this: .005:700,.001:1000,.0005:2000,.0001

#

This is for batch size 1 and grediant accumulation 1. That schedule would change if those values were to increase, but I can't increase them because of VRAM

finite ivy Jan 8, 2023, 2:58 PM

#

I haven't verified this, but I hypothesize that if you're training a style instead of a person, you can set the learning rate a bit higher and train for less steps

crisp cloak Jan 8, 2023, 4:17 PM

#

finite ivy 8:00 to 10:00 seems to be good for getting someone's likeness. As far as learnin...

Hey bro thankyou for all your help. I’ve been able to make the embedding successfully. Also, in the link I shared you can add any model you want to by pasting the hugging face download link in. The embedding will be trained through that model

finite ivy Jan 8, 2023, 4:18 PM

#

crisp cloak Hey bro thankyou for all your help. I’ve been able to make the embedding success...

No problem!! Can you post results? If love to see what that notebook made

crisp cloak Jan 8, 2023, 4:19 PM

#

finite ivy No problem!! Can you post results? If love to see what that notebook made

Yes ofcourse. I’m in the middle of training another embedding right now. As soon as I’m done with it I’ll post the results

plucky current Jan 8, 2023, 10:40 PM

#

Hey, can anyone give me some sugestions at finetuning since i'm a newbie?, I have about 350 images instance images and 550 concept images, all images are high quality and 1024x1024, but it seems like I'm getting A LOT of failed ckpts, any advice? (im trying to fine tune on dreambooth colab)

peak canopy Jan 9, 2023, 1:28 PM

#

Folks, What's the difference between diffusers and ckpt model types? I'm getting good results when dreambooth training using joepenna repo with ckpt file compare to the diffuser repo.

winter apex Jan 9, 2023, 2:17 PM

#

plucky current Hey, can anyone give me some sugestions at finetuning since i'm a newbie?, I ha...

when using dreambooth less is more, i havent trained a style but 350 images seem too much

plucky current Jan 9, 2023, 2:40 PM

#

alright, thanks a lot for the advice, I'm currently trying again with 100 images, all captioned and for about 1000 steps (10x the number of instance images), what about concept images, any advice? does the rule of less is more also apply? thank you

final patrol Jan 9, 2023, 3:13 PM

#

Has anyone here trained LORA? According to this guide, you can train many concepts at once, but you're not supposed to use the character name or series name, so that it would "become implicit to other tags". This is really confusing to me, because I want to add a cartoon series that has various characters in it. How would it know the difference between the style and a character? How am I to differentiate between characters?

For instance, for X-Men, on a given image, I want to be able to make Gambit hang out with Jubilee, or Storm with Wolverine, or Gambit with Storm. Maybe I just one one of the characters to hold a cat. If I can't use any character names, how would it implicitly know which setup I want when generating images?

https://rentry.org/lora_train#captions

LoRA Training Guide

/hdg/ Logo Imgur (3 Sizes)
Written by StyleAnon with some help from a few others and the Thread!
Links to other Collaboration Edition Guides/Resources
PromptAssist | LoRA Repo
LoRA Training Guide
What is a LoRA?
For Using
For Training
Diffing two models
Captions
Saving and resuming training
Instr...

split acorn Jan 9, 2023, 3:43 PM

#

Because you're training on an instance token

#

The instance token is the replacement for the "character name" or "series" basically

final patrol Jan 9, 2023, 3:45 PM

#

Got it. You are still giving each concept a name, but not the original one

split acorn Jan 9, 2023, 3:45 PM

#

yeah, using a rare token to help it with the learning process

#

You can do it another way though

#

I'll find the video

final patrol Jan 9, 2023, 3:46 PM

#

ohh thank you.

split acorn Jan 9, 2023, 3:47 PM

#

I'll find the time stamp

#

but basically they trained on something the original model already knew how to make

#

Which is another valid way to do it

#

though, I'm not sure if it's better or worse

#

https://youtu.be/FRClNMC_z-s?t=207

YouTube

kylevorbach

HOW I FAKED MY LIFE USING AI: or (THE LIFE AND DEATH OF RYAN GOSLIN...

I fooled my friends and family with photorealistic images I created using StableDiffusion and then posted to all my social media. I go into how I made these photoreal pictures and also my descent into madness.

Wanna see for yourself?

instagram.com/kylevorbach
twitter.com/kylevorbach

I'll be sharing my AI pictures that didn't make the cut and ...

▶ Play video

final patrol Jan 9, 2023, 3:51 PM

#

Ah I watched several videos, but not this one. Thank you. I'll investigate.

split acorn Jan 9, 2023, 3:51 PM

#

It's not nessesarily better overall

#

but it can work CB_nod

final patrol Jan 9, 2023, 3:51 PM

#

regarding what you were saying before, each character has to be a pre-existing token, right? I can't just do "xmenGambit" "xmenStorm", etc..?

#

I've seen models use phrases like that before.

#

But maybe not LORA tunings...

split acorn Jan 9, 2023, 3:54 PM

#

mmm

#

Generate images with those tokens and if they look good, then it should work (to some extent)

final patrol Jan 9, 2023, 3:54 PM

#

I'm sure they don't exist

#

it would likely be random

split acorn Jan 9, 2023, 3:55 PM

#

mmm let me reword it

#

if it doesn't exist, then you're better off using a rare token

#

So if "Xmen Gambit" doesn't consistently produce good quality Gambit pictures, then you're better off with like "olis" or another rare token

final patrol Jan 9, 2023, 3:57 PM

#

"Gambit" gives a gentle unsure whiff of the original character. So that would be a good candidate to train?

split acorn Jan 9, 2023, 3:58 PM

#

I honestly don't know, but my gut feeling is no.

final patrol Jan 9, 2023, 3:58 PM

#

okay

#

I'll experiment

split acorn Jan 9, 2023, 3:58 PM

#

Rare tokens, so it's completely random.
Or well known tokens, so you have high quality token you can train off of

#

I'm not aware of anyone doing anything different alicatHm2 though, it'd be cool to see

final patrol Jan 9, 2023, 3:59 PM

#

ah, all or nothing. Hmm

#

Thank you very much. I mainly wanted to know if LORA worked best with a fundamentally different approach (and it sounds like it doesn't). I'll experiment and figure something out.

median sun Jan 9, 2023, 4:06 PM

#

anybody else getting error when loading 2.x model with automatic1111? I''ve tried to get it working 4 times now and I kinda dono what to do anymore.

#

the yaml is in there renamed and auto1111 is updated

#

it just gets killed without an error and sais error in the ui

full knot Jan 9, 2023, 5:52 PM

#

someone have an example of captionned image for scenes please ?

stone bloom Jan 9, 2023, 6:06 PM

#

Any clue what might be causing problems with lora dreambooth, whilst training the 512 version of SD 2.1?

768 seems to be developing normally, but 512 immidietly turns into an abstract mess.

split acorn Jan 9, 2023, 6:11 PM

#

I've had poor luck with it too, not sure why alicatHm2

stone bloom Jan 9, 2023, 6:11 PM

#

It's so weird, I swear

split acorn Jan 9, 2023, 6:12 PM

#

TI still works great, just not dreambooth

#

I'm not sure why

stone bloom Jan 9, 2023, 6:13 PM

#

Yeah ti seemed to work fine for me as well

#

Dreambooth becomes literal black magic to the 512 base. Tried almost everything at this point. Nonema, 8bit, text encoder, non standard resolutions, preservation, learning rates. Nada.

Simply won't budge.

#

Meanwhile even 512 training on the 768 looks better 🤡

split acorn Jan 9, 2023, 6:20 PM

#

I honestly wouldn't be surprised alicatKEK

stone bloom Jan 9, 2023, 6:20 PM

#

At least I can vaguely recognize the shapes

#

agony

#

Think I only haven't tried turning off xformers and fp16 in parameters.
Then again, the 512 base would output blank, brown images without using xformers, so what gives.

full knot Jan 10, 2023, 12:55 AM

#

for the caption image names and text encoder, does every tokens counts as their own or a whole ?

#

like "An ARAV74 plane", does the whole sentence will be trained word per word and quite destroy "plane" token ?

#

or it will create a new entry in the model for the entire sentence only ?

#

or should I name the concept image file "ARAV74" only and specify after at prompting "An ARAV74 plane"

#

hmm i may be confused between concept and caption aswell

#

i still don't even know where the text encoder is reading from : the filename or the instance prompt ? both ?

misty glacier Jan 10, 2023, 8:14 AM

#

Anyone knows how to train stable diffusion inpainting?

#

I can only find [img2img inference] [img2img fine tune] [inpainting inference] sample code.

#

But I want to do [inpainting fine tune]

tacit wedge Jan 10, 2023, 10:46 AM

#

Bit of a noob question here. If a model is trained using a specific sampler, does that mean the same sampler will deliver the best results when creating images. Or doesn't it necessarily work out like that?

torn turtle Jan 10, 2023, 11:29 AM

#

which one is the best sampler that can produce optimal image with CFG Scale more than 10?

split acorn Jan 10, 2023, 11:53 AM

#

tacit wedge Bit of a noob question here. If a model is trained using a specific sampler, doe...

It doesn't necessarily work that way. DDIM does work well for both though when training realistic

#

Anime trained on DDIM can still look better using something like Euler A

#

Although my Euler A models turned out well so ChillBar_shrug

tacit wedge Jan 10, 2023, 12:32 PM

#

split acorn Although my Euler A models turned out well so <:ChillBar_shrug:87286823158772535...

Thanks for the info 👍

restive stream Jan 10, 2023, 4:11 PM

#

Did anyone finetune inpainting model to generate backgrounds for transparent images?

plucky current Jan 10, 2023, 6:44 PM

#

there are so many options to train right now, I wonder, what would be the best way to train something on a consistent style?

full knot Jan 10, 2023, 10:08 PM

#

torn turtle which one is the best sampler that can produce optimal image with CFG Scale more...

ddim if it's still the case

stone bloom Jan 10, 2023, 10:36 PM

#

plucky current there are so many options to train right now, I wonder, what would be the best w...

I'd say textual inversion is usually enough, since style is more of an adjustment in tone, rather than a totally foreign concept.

Dreambooth is kinda overkill for the most part, unless we're dealing with styles, which focus on subjects completly unknown to the base model.

With ti for style, your best shot would prolly be with a [name], [filewords] template, having manually described content of each training image in their filename. Just avoid using adjectives, which are inherent to a given style.

plucky current Jan 10, 2023, 10:48 PM

#

stone bloom I'd say textual inversion is usually enough, since style is more of an adjustmen...

thanks a lot!, how many images should I use? also, any good colab sugestion?

stone bloom Jan 10, 2023, 10:55 PM

#

plucky current thanks a lot!, how many images should I use? also, any good colab sugestion?

3-5 is usually enough, but you can't go wrong using more, unless you just feed it bad examples.

#

#

here's an example from the people behind it

#

so when you make a template, go for something like [filewords] in style of [name]

#

[name] will automatically get replaced by name of the embedding/textual inversion, and [filewords] will be replaced with whatever you wrote in filename of each image

#

as for colab, can't help, since I'm only familiar with local 😅

#

@split acorn Did you use Lora, when you tried to dreambooth SD 2.X in 512?

split acorn Jan 10, 2023, 11:16 PM

#

alicatHm

#

No, just dreambooth

#

Oh

#

I think maybe

#

Well I tried WD 1.4 as one of them

#

and that's based largely on 512 2.1

#

so yes! at least a model based on it

plucky current Jan 10, 2023, 11:17 PM

#

stone bloom [name] will automatically get replaced by name of the embedding/textual inversio...

so, if I wanted to train a pose for example, [pose] (image names) in the style of (my textual inversion) right?

stone bloom Jan 10, 2023, 11:23 PM

#

plucky current so, if I wanted to train a pose for example, [pose] (image names) in the style o...

come to think of it, maybe I'm misleading you, since I'm not sure if colabs support the method I mentioned

restive bridge Jan 10, 2023, 11:26 PM

#

in the case of fastDB, does text encoder training consist of blip auto-captioning? or does it just train the token name?

full knot Jan 10, 2023, 11:34 PM

#

restive bridge in the case of fastDB, does text encoder training consist of blip auto-captionin...

didn't find yet

#

i hope to do some serious tests on caption / text encoder soon

stone bloom Jan 10, 2023, 11:48 PM

#

plucky current so, if I wanted to train a pose for example, [pose] (image names) in the style o...

also, if I understood you correctly, the pose would be a new, separate embed/textual inversion

#

You can use multiple embeds in one generation. Only requirement being, that they were trained on the same model.

#

one embed is basically one concept, be it person, shape, style, pose, composition, action, whatever

tender silo Jan 11, 2023, 5:40 AM

#

How is Embedding and Textural Inversion different from using Dreambooth? I'm still new to this so I'm kinda clueless
I'm also seeing LoRA being thrown around

#

Let's say I wish to create a consistent output that shows a set of poses, a bunch of clothing from different eras in the style of Bloodborne
Basically I want to generate Lady Maria doing an A pose, T pose, and Flossing in various historical clothing, how would I best achieve this? Then would it be possible to change it to Queen Elizabeth II

#

Sorry for the cursed example, Im not trying to mock the British royal family, just trying to learn how they’re different from each other

little hollow Jan 11, 2023, 11:48 AM

#

stone bloom so when you make a template, go for something like `[filewords] in style of [nam...

Im doing
[Filewords], [name]
[Name], [filewords]

Works great

stone bloom Jan 11, 2023, 11:51 AM

#

Same here. Just phrased it that way, since the example is easier to understand I hope.

little hollow Jan 11, 2023, 11:51 AM

#

Sometimes adding
X, bad art, horrible art, bad painting, horrible painting, bad darwing, horrible drawing

#

Might eliminate the need to use negs(works like 20 40% of the time? Idk)

stone bloom Jan 11, 2023, 11:52 AM

#

Oh? That's interesting. Never considered training in negatives per se'

little hollow Jan 11, 2023, 11:52 AM

#

What you caption is to be eliminated from the embedding

#

What is left is what makes the embedding

#

That's why it works at all

stone bloom Jan 11, 2023, 11:54 AM

#

Makes sense, I guess that's why some of the default templates would use phrasing like "the weird X" or "dirty X"

little hollow Jan 11, 2023, 11:54 AM

#

Yep

stone bloom Jan 11, 2023, 11:55 AM

#

Just sounds counter intuitive to me, purposefuly training it on qualities that would supposedly make a worse generation.

little hollow Jan 11, 2023, 11:56 AM

#

No, what happens is that it lets you render all of the words of the org picture

What cannot be generated using those prompts is given to the embedding to regenerate it

#

Hopefully this clears it up a bit

#

And btw, id put the embedding somewhere close to the beginning of the caption, as it put more attention to the ones at the start

#

Too late and it might regen the entire original picture using only the prompts(caption) you given it

#

Lets say a dog in a forest, and you want only the dog

You caption
A forest, green, branches, leafes, sky etc...

But leave out the dog out without being captioned

It will regenerate everything beside the dog

And here comes the embedding trying to regen the dog with random words

#

Once it gets better at making the doggo come back, the loss drops

#

At 0 loss, you get q replica of the dog

#

But, if there was a stop sign you didn't caption - it might try to regen it as well

#

This might interfere and slow down the training

#

And give you stop signs

stone bloom Jan 11, 2023, 12:03 PM

#

Usually shuffle my tags, since I tend to train more abstract or generalised concepts. Something akin to generalised style guidelines, rather than a certain character/subject.

#

Not sure if it was for the better, but seemed to make sense to me, since it resembled training a checkpoint designed for a certain type of art.

#

Interesting. Someone basically had the opposite problem.
Just can't wrap my head around what might be causing this behaviour..

plucky current Jan 11, 2023, 3:02 PM

#

is there any colab that I can use to train an embending?

stone bloom Jan 11, 2023, 3:10 PM

#

Not sure, but could this be it?

https://colab.research.google.com/github/huggingface/notebooks/blob/main/diffusers/sd_textual_inversion_training.ipynb

Google Colaboratory

honest nexus Jan 11, 2023, 4:30 PM

#

stone bloom Not sure, but could this be it? https://colab.research.google.com/github/huggin...

yes it works, but not good as automatic1111 train

#

It needs an update definetely

plucky current Jan 11, 2023, 4:36 PM

#

honest nexus yes it works, but not good as automatic1111 train

that's a shame, apparently I can't train it on auto1111 with my gpu :/

honest nexus Jan 11, 2023, 5:16 PM

#

plucky current that's a shame, apparently I can't train it on auto1111 with my gpu :/

same to me, i'm trying to tweak that colab to get some decent results

whole gorge Jan 11, 2023, 5:50 PM

#

I am having difficulties with embeddings getting to the result I want

#

they always seem low res or out of scale

#

Its taking about 12gigs of GPU memory to train an embedding

#

#

its either blurry or checkerboard sometimes like this

tough flame Jan 11, 2023, 9:36 PM

#

When I fine tune a model in dream booth, should I be using the model name in the prompt?

stone bloom Jan 11, 2023, 9:49 PM

#

tough flame When I fine tune a model in dream booth, should I be using the model name in the...

I think it's either your instance token, if you use filewords, or whatever unique word you used in the instance prompt.

#

Also looks like someone's been trying to embed space marines

high venture Jan 11, 2023, 9:50 PM

#

Anybody had luck with StableTuner on 12G card?

stone bloom Jan 11, 2023, 9:57 PM

#

whole gorge Its taking about 12gigs of GPU memory to train an embedding

Shouldn't be that big.. My guess would be either:
-training with too big a batch size
-training in wrong resolution
-training with images that weren't resized and cropped to your models resolution
Other than that, if you are using autos, look into settings under the training tab.
You should probably check "cross attention optimizations while training" and "Move VAE and CLIP to RAM when training if possible"

hard peak Jan 11, 2023, 10:02 PM

#

Is it possible to use checkpoint merger to allow one to use danbooru tags with another model?

My line of thinking is that the LAION tags are pretty awful especially when compared to the danbooru tags of waifudiffusion. It would be interesting if you are able to merge WD with another checkpoint to allow using danbooru tags.

stone bloom Jan 11, 2023, 10:06 PM

#

Technically speaking yes, it's just that they'll have lesser or greater impact, depending on weights of the mix.

#

Merging is quick and easy, so just give it a go, see for yourself. Try out different proportions, and note how the different tag systems interact.

restive bridge Jan 11, 2023, 10:35 PM

#

Has anyone found the ideal class images for training faces? with portraits being the intended outputs

hard peak Jan 12, 2023, 12:04 AM

#

Yeah, the merging didn't seem to work terrifically. Tough luck for me

split acorn Jan 12, 2023, 12:14 AM

#

That should work alicatHm2 just a matter of finding the right merge combination CB_nod

coarse hemlock Jan 12, 2023, 12:53 AM

#

06368-1389850066-gpu20nvidia20graphics20card20308020on20fire.png

fading forge Jan 12, 2023, 1:59 AM

#

stone bloom Shouldn't be that big.. My guess would be either: -training with too big a batch...

Has "cross attention optimization while training" been updated recently? There was an open issue on it claiming it was negatively affecting training if enabled, wasn't sure if it's been updated (or if it ever even was an issue to begin with)

bold dragon Jan 12, 2023, 4:05 AM

#

Hello everyone, fairly new in training with stable diffusion. Hope this is not a noob question.

I trained a style with 30 images with hypernetwork. I tried different layer structures, using the preprocessed captions without further editing, and different learning rates down to 5e-8, 20000 steps.

The problem i have got is that, i was able to img2img and got a similar trained style, but the results were very messy. The lines were not straight, shading was not consistent, especially the eyes, it was totally messed up.

I wanted to find out which part had gone wrong.
Was it my data sample not large enough? or was it the learning rate or steps that was not set to the right scale? Or is it the model is already well trained but I need to fine tuning the settings in img2img instead.

Not looking for an exact answer, discussion is also welcome, really need some new ideas on what to do next. Thank you guys.

prisma nacelle Jan 12, 2023, 6:37 AM

#

bold dragon Hello everyone, fairly new in training with stable diffusion. Hope this is not a...

having messed with hypernetwork training myself, and still struggling to get the ideal results i'm constantly trying to update the training data with anything that might be "slightly" off.

I've also used dreambooth models and tried TI embeds. Overall dreambooth works well in fast learning but it also pretty much makes the model it is trained on not usable for anything else.

the hypernetwork issue you experienced is the same as mine, sometimes it just doesn't work at all and other times it works on img2img. Recent hypernetworks I've trained have been better, but there is still those problems such as uneven outputs, inconsistent lines etc.

recently trying to see if TI embeds can do any better, but that seems to be taking longer to train and hard to see if it is any better than hypernetwork.

what exactly are you training the hypernetwork to generate?

bold dragon Jan 12, 2023, 8:53 AM

#

prisma nacelle having messed with hypernetwork training myself, and still struggling to get the...

Hey shihiko, thanks for the time in replying
For the TI embeds method u mentioned, was it "embedding"? sorry if this is a stupid question, not very familiar with the terms yet.

I was training a set of chibi characters with a painting style that is similar to fire emblem heroes's chibi characters.
I have trained both embedding and hypernetwork with the same set of images, and what i have experienced are:

Embedding could generate a much closer style when i use txt2img, but one problem is that there is a hint of my original images' pose in all generated images, and when i ask to generate some poses that never existed in my dataset, the results are broken. Not sure if that was because of my prompts not accurate enough or some other issues, couldn't figure out yet.
After several tries with embedding, the same poses made me give up in the hope of using prompts only to generate sth in different compositing, so i then thought of using img2img, hoping for with a simple draft, i could get a stylized result. So I switched to train with hypernetwork. (after reading articles saying HN is better in training style) Failed several times but in the end, i tried with a setting of layer: 1,3, 0.75, 0.75, 0.75, 3, 1, LR: 5e-8:20000. Took one of the “alright” pt and was able generated sth that I think is around 50% looking alike my expected style.
So in order to improve the result, I tried to add the trained embedding model in (1.) in the prompt when doing img2img together with the HN model (2.) But no matter how i tried differently with changing CFG scale, steps, denoising, sampling method, I could never get back to the 50% in (2.)

Thoughts in mind now:

Should I continue training with embedding but in a much slower rate? However, i know my GPU is not having enough RAM to train.
If stick with HN, what other settings should I try?
Couldn't try with dreambooth, simply couldn't run the thing with my current GPU

#

omg, my message is so long, really sorry about that

prisma nacelle Jan 12, 2023, 9:05 AM

#

bold dragon Hey shihiko, thanks for the time in replying For the TI embeds method u mentione...

yeah so TI is referring to Textual Inversion embedding.

I also experience alot of "pose biases"

I think the issue here is just not having enough training data to give the AI more examples to learn from. When it keeps learning from the small data set, it improves in detail but also becomes more biased towards what the data set contains.

If you are able to get the style you want to come from the img2img it might be a good idea to get the AI some more data from generated images that are closer to what you want. Which is what I am doing, it takes a long time and trial and error, but I feel it is the best way to go when trying to be specific with what you want.

bold dragon Jan 12, 2023, 10:11 AM

#

prisma nacelle yeah so TI is referring to Textual Inversion embedding. I also experience alot...

thank you shihiko, will feed the AI with more images
lets hope the trial time end soon for both of us
good luck with ur training as well !

split estuary Jan 12, 2023, 10:13 AM

#

prisma nacelle Jan 12, 2023, 10:17 AM

#

bold dragon thank you shihiko, will feed the AI with more images lets hope the trial time e...

yeah about to train another hypernet, the embedding didn't work too well and it was like 6 hours of training lol... sometimes it isn't easy to know if the training went wrong too. the last training i did was the right character showing up but the colours were wrong and stayed wrong for the 5000 steps afterwards.

bold dragon Jan 12, 2023, 10:43 AM

#

prisma nacelle yeah about to train another hypernet, the embedding didn't work too well and it ...

wow, 6 hours! how big was ur dataset

stone bloom Jan 12, 2023, 11:02 AM

#

fading forge Has "cross attention optimization while training" been updated recently? There w...

No clue, but I've been running just fine with it, doing even like 8/9 batches in textual inverson with a 8vram laptop gpu

ornate flare Jan 12, 2023, 12:39 PM

#

What does finetuning with 8-bit adam look like?

#

Is it noticeably worse?

#

also, weird question but if i wanted to finetune on about 10000 images

#

what learning rate would be most appropriate?

hard peak Jan 12, 2023, 1:39 PM

#

Is 3-4 images truly enough to train a subject in dreambooth?

hard peak Jan 12, 2023, 1:59 PM

#

Additionally, for dreambooth, should one provide mostly closeup portraits, full body, or a mixture of the two?

whole gorge Jan 12, 2023, 2:22 PM

#

So I got a decent embedding but its struggling on the faces/helmets

#

can I add additional reference images to my dataset that focuses just on those details and then it will improve the embedding or will it get confused if you have images that are only a "part" of the whole?

#

02014-721768276-enb_space_marinev2-3500masterpiece_ultra_detailed3d_battlefield_machinery_mecha_motor_vehicle_pillarboxed_reali.png

#

I made a space marines embedding and it works pretty well for their armor but the helmets are wrong

#

and it doesn't understand if I say I want one without a helmet

prisma nacelle Jan 12, 2023, 3:21 PM

#

bold dragon wow, 6 hours! how big was ur dataset

30ish for that one. didn't work well so back to drawing board.

whole gorge Jan 12, 2023, 5:54 PM

#

Dont trust your preview settings for how your embedding performs

#

always switch to a custom model and try generating using some of the similar keywords from your captions

#

and if hypernetwork is like embeddings copy them all over so you can try myembedding-1000, myembedding-1500 etc

#

its also better to train using a generic model like the SD 1.5 one or WD

#

how is the results of a hypernetwork different from textual inversion?

dapper prism Jan 12, 2023, 8:35 PM

#

How much VRAM does EveryDream need to train a 768px SD 2.x model?

winter apex Jan 12, 2023, 8:47 PM

#

has anyone tried training a person with LoRa and got decent results?

#

i tried on myself and it was a failure

hexed bloom Jan 12, 2023, 8:59 PM

#

Anyone have a recommendation for a number of epochs using 30k+ images?

restive bridge Jan 12, 2023, 10:19 PM

#

what's the visual difference between over/underfitted text encoder vs unet?
every time a model fails i have to experiment in both directions with both unet and text because no one seems to know the visual difference between under and overfitting of the text encoder vs. under or overfitting of the unet.

winter apex Jan 12, 2023, 11:19 PM

#

will try it, thank you

#

file size is the least of my worries, i already have like 50gb worth of sd models

foggy fog Jan 13, 2023, 12:26 AM

#

Hi all, i'm using dreambooth to train files. I need to hold a lot of fine tuned models but storage is an issue considering the cpkt files are 4gigs

I saw a video that converted a tar file to a cpkt really fast. I downloaded a similar tar file and saw it's also 4 gigs so doesn't solve the problem.
https://www.youtube.com/watch?v=-6CA18MS0pY

Is there any way to just hold the weights in some smaller file format then convert them to cpkt easily. Goal again is to massively reduce storage size of the customization.

YouTube

Software Engineering Courses - SE Courses

How to Run and Convert Stable Diffusion Diffusers (.bin Weights) &...

In this video, I am explaining how to run Stable Diffusion models that not provided in .ckpt file format. Moreover, I am step by step explaining how to convert these .bin training weight / model files into a .ckpt file to use in Automatic1111 Web UI and other interfaces. Furthermore, I am explaining how to use generated ckpt file to teach your f...

▶ Play video

dim rampart Jan 13, 2023, 12:42 AM

#

what's the best collab for making those insanely amazing videos that i keep seeing on instagram? I know a few but would like to get input from others as this field is ever changing.

wide meadow Jan 13, 2023, 2:21 PM

#

i have used camenduru's colab model to run it works great but can't work with models larger than 7gb i also use nocrypt model it still needs to choose to be able to install 7gb but it's not very stable when 2 or more images, it can't be output compared to camenduru, can output more than 2 images without any problem so is there any easy-to-use model like camenduru and can load 7gb ckpt model file

obsidian sand Jan 13, 2023, 6:01 PM

#

wide meadow i have used camenduru's colab model to run it works great but can't work with mo...

This https://colab.research.google.com/github/TheLastBen/fast-stable-diffusion/blob/main/fast_stable_diffusion_AUTOMATIC1111.ipynb

Google Colaboratory

wide meadow Jan 13, 2023, 6:27 PM

#

Where can I find instructions?

hexed bloom Jan 13, 2023, 6:36 PM

#

If I have images that aren't 1:1, does keeping center crop **unchecked ** automatically force the images to be squished to 1:1?

#

I'm looking to squish images that are 512x512+ or 512+x512 when training

split acorn Jan 13, 2023, 11:06 PM

#

It shouldn't if you're using the dreambooth webui extension version. They added aspect ratio bucketing about 2 weeks ago.

novel pond Jan 13, 2023, 11:33 PM

#

I've been working on a new embedding that I'm working on anythingV3. But then.

#

RuntimeError: CUDA out of memory. Tried to allocate 1.25 GiB (GPU 0; 4.00 GiB total capacity; 2.13 GiB already allocated; 180.00 MiB free; 2.24 GiB reserved in total by PyTorch) If reserved memory is >> allocated memory try setting max_split_size_mb to avoid fragmentation. See documentation for Memory Management and PYTORCH_CUDA_ALLOC_CONF

whole gorge Jan 13, 2023, 11:36 PM

#

anyone good with photoshop scripts?

novel pond Jan 13, 2023, 11:36 PM

#

I'm using NVIDIA Geforce GTX 1650

#

and xformers...

whole gorge Jan 13, 2023, 11:36 PM

#

trying to load image, remove background, add a solid color black layer, move the layer done, flatten image, save as png

#

yes you ran out of ram

#

training in particular is very ram intensive

#

try checking this box if you havent

novel pond Jan 13, 2023, 11:39 PM

#

whole gorge try checking this box if you havent

Is there way's to get more ram?

whole gorge Jan 13, 2023, 11:39 PM

#

get a better graphics card

#

4gigs is very much low end for this type of thing

novel pond Jan 13, 2023, 11:40 PM

#

like... RTX 3090?

whole gorge Jan 13, 2023, 11:41 PM

#

anything Nvidia with more ram

novel pond Jan 13, 2023, 11:45 PM

#

whole gorge anything Nvidia with more ram

Like... Which one?

whole gorge Jan 13, 2023, 11:46 PM

#

Depends on your budget

novel pond Jan 13, 2023, 11:46 PM

#

like more then one RTX 3090?

whole gorge Jan 13, 2023, 11:46 PM

#

I don't know card stats I just went from a 1080 to a 4080

#

and there are still things I can crash on with 16gigs of ram

novel pond Jan 13, 2023, 11:49 PM

#

Damn. I was about to get ready to make some custom models...

whole gorge Jan 13, 2023, 11:49 PM

#

oh you want to make models lol

#

even embeddings takes 12gig of ram

novel pond Jan 13, 2023, 11:49 PM

#

and embeddings

whole gorge Jan 13, 2023, 11:50 PM

#

also preparing the images before training is very important I am finding

#

my first attempt I just dumped images into a folder and it didnt go well

#

00198-597856304-emb_space_marinev2-1600Mech_Punk_full_body_dark_blue_armor_1.5_holding_weapon_detailed_helmet_helmet_intricate_batt.png

#

I tried to make a space marines one and it still wont get the helmets right and you won't be able to get specific poses or chapter colors etc

#

and I've trained it through different settings and sets of images a few times

novel pond Jan 13, 2023, 11:53 PM

#

I was trying to mix Anythingv3 with some of that f222 along with some sd.1.4.

whole gorge Jan 13, 2023, 11:53 PM

#

thats just merging models

novel pond Jan 13, 2023, 11:53 PM

#

1754903917-masterpiece_best_quality_1girl_white_hair_medium_hair_cat_ears_closed_eyes_looking_at_viewer__3_cute_scarf_jacket_ou.png

whole gorge Jan 13, 2023, 11:53 PM

#

you can do that on the merge tab in automatic1111

novel pond Jan 13, 2023, 11:53 PM

#

checkpoint merger you mean?

whole gorge Jan 13, 2023, 11:54 PM

#

yes

novel pond Jan 13, 2023, 11:54 PM

#

I already did that.

whole gorge Jan 13, 2023, 11:54 PM

#

thats exactly what that does mix's checkpoints together

novel pond Jan 13, 2023, 11:54 PM

#

But it doesn't have that realistic feel that I'm currently making...

whole gorge Jan 13, 2023, 11:55 PM

#

well you are using an anime checkpoint?

#

https://civitai.com/models/3450/moistmix-v1

MoistMix V1 | Stable Diffusion Checkpoint | Civitai

This is one I have been working a long time on, and I think it's finally ready for release. A do (almost) anything model.Beautiful lighting, paintings, portraits, multiple photography styles, photorealism, anime and animated styles, alien creatures, armor, clothing, massive dreamy landscapes, abstract retro art, horror, space, nsfw, extremely de...

#

try this one

#

I ve had a lot of success with it

novel pond Jan 13, 2023, 11:56 PM

#

AnythingV3 yeah that one.

#

I'll go check that out then...

#

I also honestly want to make my own textual inversion.

whole gorge Jan 13, 2023, 11:59 PM

#

I am doing that right now

#

But it takes 12gigs of VRAM

novel pond Jan 13, 2023, 11:59 PM

#

like a anime + realistic way.

#

What do you have?

whole gorge Jan 13, 2023, 11:59 PM

#

a 4080 which has 16gigs of ram

novel pond Jan 13, 2023, 11:59 PM

#

the 12gigs of VRAM i mean

whole gorge Jan 14, 2023, 12:00 AM

#

but I just upgraded from a 1080 which I think had 8

novel pond Jan 14, 2023, 12:00 AM

#

let me guess, a GeForce RTX 4080 Graphics Card?

whole gorge Jan 14, 2023, 12:01 AM

#

https://www.newegg.com/msi-geforce-rtx-4080-rtx-4080-16gb-gaming-x-trio/p/N82E16814137766?Item=N82E16814137766

#

interesting the 3090 is more than my 4080 but has 24gigs of ram

novel pond Jan 14, 2023, 12:03 AM

#

What about the
NVIDIA GeForce RTX 4090?

#

also $1,309.99??

#

Rtx 4090 £1,599.00?

#

Jesus...

whole gorge Jan 14, 2023, 12:09 AM

#

I had not bought a new card since 2016 but I already wish I had more VRAM

novel pond Jan 14, 2023, 12:12 AM

#

Looks like the road for me to make textual inversions has already ended in a matter of a few minutes...

whole gorge Jan 14, 2023, 12:24 AM

#

trying to make one now

winter apex Jan 14, 2023, 12:28 AM

#

novel pond Damn. I was about to get ready to make some custom models...

use google colab, i always use it to train my models

novel pond Jan 14, 2023, 12:30 AM

#

winter apex use google colab, i always use it to train my models

I'm not very good at codes cause I'm a bit of a moron...

whole gorge Jan 14, 2023, 12:30 AM

#

you have to like rent server time or something for that?

#

@novel pond Im using my first collab and im literally just clicking play buttons

#

the code is already written

winter apex Jan 14, 2023, 12:31 AM

#

novel pond I'm not very good at codes cause I'm a bit of a moron...

use thelastben collab, it also has automatic1111 as an option

#

yes its literally clicking buttons and adding whats missing

whole gorge Jan 14, 2023, 12:31 AM

#

oh if its a collab that runs automatic1111 then its practically the same

novel pond Jan 14, 2023, 12:32 AM

#

Fine, is there link for on automatic1111?

#

towards google colab cause I was usally on webui.

winter apex Jan 14, 2023, 12:33 AM

#

novel pond Fine, is there link for on automatic1111?

https://colab.research.google.com/github/TheLastBen/fast-stable-diffusion/blob/main/fast_stable_diffusion_AUTOMATIC1111.ipynb

full knot Jan 14, 2023, 12:43 AM

#

does anyone knows how the text encoder is set on the shivan collab ? I mean which values are used ?

whole gorge Jan 14, 2023, 1:02 AM

#

time to test the embedding

#

fingers crossed

novel pond Jan 14, 2023, 1:02 AM

#

whole gorge time to test the embedding

Hope to see the improvement!

whole gorge Jan 14, 2023, 1:07 AM

#

#

I mean they do mostly look like miata's

#

but in my experience you have to back through the embedding and check the every X iterations models and try using some different checkpoints

#

you want to train on the base checkpoint like 1.5 but switch when checking it out

novel pond Jan 14, 2023, 1:24 AM

#

By the way does anyone know DPM++ 2M Karras Simpler? Cause I've seen others prompts using that one. But I've seen anything like on my webui.

whole gorge Jan 14, 2023, 1:26 AM

#

#

there is loads its one of the default in automatic1111

novel pond Jan 14, 2023, 1:31 AM

#

novel pond Jan 14, 2023, 1:32 AM

#

whole gorge

Here's mine ^

whole gorge Jan 14, 2023, 1:38 AM

#

fwiw I wouldn't worry about missing a sampler

novel pond Jan 14, 2023, 1:41 AM

#

hmm.

whole gorge Jan 14, 2023, 1:42 AM

#

xy_grid-0081-2937808722-emb_space_marinev2masterpiece_ultra_detaileddigital_art_in_a_battlefield_fighting_monsters_science_fiction_solo.jpg

#

see how largely similar they are for a given seed

#

xy_grid-0079-686984041-emb_space_marinev2masterpiece_ultra_detailedillustration_on_a_battlefield_science_fiction_solo_weapon_holding_sw.jpg

#

even changing steps as well doesn't usually make a big difference

#

you are more likely to find what you want by tweaking your prompt, just generating lots to choose from and then going into img2img

#

the biggest advantage for AI art is being able to take hundreds of shots at it

#

xy_grid-0090-2655482062-na_miata-2000_realistic_3dcar_driving_ground_vehicle_letterboxed_motor_vehicle_on_vehicle_seat_seatbelt_steering_whee.jpg

#

all of the karras's were similar and the first seed was the same with all samplers

novel pond Jan 14, 2023, 1:49 AM

#

I doubt mine could reach to 150 steps with the low memory I got. But there's only few changes around the prompts.

whole gorge Jan 14, 2023, 1:49 AM

#

I don't think steps uses more memory

#

just takes more time