#🔧｜finetune | Stable Diffusion | Page 10

fallen cloud Jan 28, 2023, 12:21 PM

#

I really need to give hypernetwork a chance some time 😄

#

Had a thought earlier bout what would happen if i make a model in DB then added a trained hypernetwork ontop of that 🤔 er atleast train a DB-model, then perhaps use hypernetworks for postures.

fast epoch Jan 28, 2023, 12:43 PM

#

Are you ready for the launching?

#

of dreambooth in webui google colab?

fallen cloud Jan 28, 2023, 12:46 PM

#

Hell yeah! 😁 👍

#

Just tried my checkpoint model on step 42000. Hella way to strong still though, but actually starting to look really good 😁

brazen osprey Jan 28, 2023, 12:54 PM

#

fallen cloud Had a thought earlier bout what would happen if i make a model in DB then added ...

I’ve done this a fair bit with hypernetwork a and got some good results - moving to different models other than what was originally trained the hypernetwork on. Moving between the models, for example one specifically trained towards sketching, or anime models etc. as long as the cfg isn’t too high it can be a good way to explore the subject creatively.

#

I trained a network on the hatbox ghost from haunted mansion @ Disney and then experimented with some different models. This is an example of comics diffusion (Charlie Bo artstyle) and into the spiderverse models, after training the ghost HN with the sd 1.5 pruned model. The spiderverse model is totally a unique interpretation that’s kinda fun.

00031-4164670988-concept_of_a_skeleton_with_a_top_hat_and_suit_and_a_cane_with_his_right_hand_and_holding_a_hatbox_in_his_left_hand_by_its_handle.png

00055-3675427862-concept_of_a_skeleton_with_green_hair_and_wearing_a_top_hat_and_suit_and_holding_a_cane_with_his_right_hand_and_holding_a_hatbox.png

#

I quite like the technique for “exploration”, which is necessary sometimes when the client doesn’t know specifically what they want and you can output 1000 different variations.

fallen cloud Jan 28, 2023, 1:20 PM

#

Sweet! 😁 haha, yeah looking forward to learn more about that kind of techniques also . Right now i made a controlled random code-generator for my DB modells, that tries a few hundred variations of the best prompts i managed to create, all giving the model a different style, different traits, different settings, and then giving the client a the best of the batch to look if they find any particularly style thats more to their liking, and then keep working from that viewpoint 😌 not totalt completed though, due to the AI branch keeps evolving like a motherfucking mutant spider every day it feels like 😂 #loveit

brazen osprey Jan 28, 2023, 1:23 PM

#

Oooh yeah right. If it’s a colab you could edit the code cell to pull a specific version away from the head. That way you don’t have to worry about updates if you don’t want them

#

Or a batch file etc. I don’t use invoke but I’ve seen some that have a batch file that pull the latest each time the user runs it

fast epoch Jan 28, 2023, 1:49 PM

#

https://www.youtube.com/watch?v=DK3xg8QLh_U - you find everything here

#

Made a video for it

#

Dreambooth + pix2pix

#

Textual inversion training works too

fallen cloud Jan 28, 2023, 2:16 PM

#

fast epoch https://www.youtube.com/watch?v=DK3xg8QLh_U - you find everything here

Nice one! 💪 😁 just watched it all. Im going to try it out tomorrow! Dont know if i dare open another Google Colab process (not used to GC) if that could lead to that my current training can be canceled 😅

fast epoch Jan 28, 2023, 2:18 PM

#

Yeah, if you're on the same google account, you can't run 2 sessions at the same time

fallen cloud Jan 28, 2023, 2:20 PM

#

hmm.. but can one log in with two different accounts perhaps? 🤔

fast epoch Jan 28, 2023, 2:21 PM

#

Yea

#

On google chrome

#

you have an "add" button

fallen cloud Jan 28, 2023, 2:22 PM

#

Hmm.. will check! .. really dont feel like compromising the current training now after 12 h, at 27% 😂 ..but as always, to curious to be able to wait

fast epoch Jan 28, 2023, 2:23 PM

#

#

Here

fallen cloud Jan 28, 2023, 2:41 PM

#

Jeex.. now i can actually train two models at the same time, and still produce content locally 😅 this is not good for the thing called social-life.

fast epoch Jan 28, 2023, 2:56 PM

#

:)))

#

Who has one?

#

:))

#

What do you train?

fallen cloud Jan 28, 2023, 3:11 PM

#

Right now a model of my girlfriend. Trying to get a as accurate model as possible, all the way into the bone marrow 🙂 then i have a couple of other projects to try. Going to make a painting/photo of all my parents pets they have had since i was a kid, and put them all in one photo, standing infront of the house we all lived in. Yeah, and of course some dirty shit tentacle porn. Got many female friends who's into that also and want themself portraited in different kind of tantaclisch-situations 😂

#

But firstly, trying to find the perfect formula in how to make the best of the best model possible. What kind of photos needed, what pre-editing that is the best, what training settings and amount of images needed, etc.

fast epoch Jan 28, 2023, 3:15 PM

#

:))))))))))

#

Do the woman really want that?

#

:)))

#

I'm at the step of finding the best formula

fallen cloud Jan 28, 2023, 3:16 PM

#

Haha some actually do, yeah 😂

fast epoch Jan 28, 2023, 3:16 PM

#

It's bad that the people who really know how to make good models do not tell others

#

about the settings, data images etc.

fallen cloud Jan 28, 2023, 3:20 PM

#

fast epoch I'm at the step of finding the best formula

Really? 🙂 Interesting! I would love to share information about that. Coz, as you say. Nobody shares that info, atleast not what i have found from scavenging the interwebs thoroughly for the last months 😂 ..more then the basics on how to get a avarat though

fast epoch Jan 28, 2023, 3:21 PM

#

Yea, watched many youtube videos

#

But they all failed

#

they are "average" only with face photos

#

but they don't work with styles or with full body shots

fallen cloud Jan 28, 2023, 3:37 PM

#

Yeah, feels the same way. Would be more productive for the community if people share more of their information 😄 but then again, I guess plenty of people is aiming to try to ern some money in this hype and dont want to let others onto the same path. Personally im just amazed on the tech and what in can do, and want to learn more about it 😌 ..and to create amazing art of course! 😂

fast epoch Jan 28, 2023, 3:40 PM

#

It's very risky to sell ai generated art nowadays.

fallen cloud Jan 28, 2023, 3:51 PM

#

Have you experimented with the captions also, and perhaps got some knowledge on how its best to produce them to get the best result? ..that is probably my next step in model-processing right now. My first atempt is the model i produce now, but with almost 3000 images the editing was.. quite simple. Is it woth it to edit every caption to describe exactly the image content, of is BLIP interogation with adjustments for faulty information enough?

fickle haven Jan 28, 2023, 4:40 PM

#

guys what is the path to finetune this : https://huggingface.co/Cryonicus/Gemini_Anime

Cryonicus/Gemini_Anime · Hugging Face

#

fast epoch Jan 28, 2023, 5:09 PM

#

fallen cloud Have you experimented with the captions also, and perhaps got some knowledge on ...

BLIP is decent

#

It should work without manually captioning

fast epoch Jan 28, 2023, 5:10 PM

#

fickle haven guys what is the path to finetune this : https://huggingface.co/Cryonicus/Gemini...

It's easier to complete the CKPT_Link tab

#

With this https://huggingface.co/Cryonicus/Gemini_Anime/resolve/main/Gemini_AnimeV1.safetensors

#

And yea, the path to huggingface is Cryonicus/Gemini_Anime

fickle haven Jan 28, 2023, 5:22 PM

#

fast epoch And yea, the path to huggingface is Cryonicus/Gemini_Anime

i got errors D:

fickle haven Jan 28, 2023, 5:22 PM

#

fast epoch With this https://huggingface.co/Cryonicus/Gemini_Anime/resolve/main/Gemini_Anim...

I Make ckpts , not safetensors

fickle haven Jan 28, 2023, 5:23 PM

#

fast epoch With this https://huggingface.co/Cryonicus/Gemini_Anime/resolve/main/Gemini_Anim...

will it make me a finetuned ckpt if i put this in the text space

fast epoch Jan 28, 2023, 5:23 PM

#

It should

#

Or if not, you can convert it aferwards

#

very simple

fickle haven Jan 28, 2023, 5:26 PM

#

fast epoch Or if not, you can convert it aferwards

its broken

fast epoch Jan 28, 2023, 5:26 PM

#

No

#

Don't put the link at the huggingface path

#

put it at the "CKPT_Link"

#

https://huggingface.co/Cryonicus/Gemini_Anime/resolve/main/Gemini_AnimeV1.safetensors

#

this

#

and leave the path to huggingface blank

#

Did it work?

fickle haven Jan 28, 2023, 5:30 PM

#

fast epoch Did it work?

oh my god..... r.i.p drive space

fast epoch Jan 28, 2023, 5:32 PM

#

it has 5.98 GB

#

that model

#

you can do the following trick

#

Runtime -> Disconnect and delete runtime

#

And reopen the notebook inserting the good link

#

where I said

#

This way, it will download only the gemini model

fickle haven Jan 28, 2023, 5:33 PM

#

i tought if it was a ckpt would be smaller

#

this is why i wanted a ckpt JNWFJNE

fast epoch Jan 28, 2023, 5:33 PM

#

nah

#

a ckpt is almost the same

#

but you can use the dreambooth extension directly in webui

#

I found out how to do it

fickle haven Jan 28, 2023, 5:35 PM

#

i dont undetstand

fast epoch Jan 28, 2023, 5:36 PM

#

#

You can train directly in the webui's extension

fickle haven Jan 28, 2023, 5:36 PM

#

fast epoch You can train directly in the webui's extension

i have a old version

fast epoch Jan 28, 2023, 5:36 PM

#

I made dreambooth to work inside webui

#

I made a google colab notebook

#

Like the one you use right now

#

You use google colab to train it now

fickle haven Jan 28, 2023, 5:38 PM

#

can i have the link

#

also can i have the link to the gemini CKPT. it had a horrible conversion error

fast epoch Jan 28, 2023, 5:39 PM

#

Wait a little. Gonna personalize the notebook to include exactly your model

#

Gemini

#

done

#

https://github.com/Bullseye-StableDiffusion/stable_diffusion_webui_allinone_dreambooth/blob/main/SD_All_in_One_gemini.ipynb

GitHub

stable_diffusion_webui_allinone_dreambooth/SD_All_in_One_gemini.ipy...

Contribute to Bullseye-StableDiffusion/stable_diffusion_webui_allinone_dreambooth development by creating an account on GitHub.

#

Right click on Raw and Save link as...

#

Then you open that .ipynb file in google colab

fickle haven Jan 28, 2023, 5:45 PM

#

ok how do i load it to the lastben

fast epoch Jan 28, 2023, 5:45 PM

#

you don't

#

it's separate

#

you load the file in google colab

fickle haven Jan 28, 2023, 5:45 PM

#

uh??

#

wich google colab

#

i only use pages of google colab to train

fast epoch Jan 28, 2023, 5:46 PM

#

You wish to train right now or to use the gemini model?

#

#

On what I gave you, you can either train or use the gemini model as it is

fickle haven Jan 28, 2023, 5:48 PM

#

i already did it

#

but i want to upload this link into the last ben

#

bc idk how to finetune in any other places

#

i want to train it in the last ben

#

dreambooth

fast epoch Jan 28, 2023, 5:49 PM

#

fickle haven its broken

Then put this link https://huggingface.co/Cryonicus/Gemini_Anime/resolve/main/Gemini_AnimeV1.safetensors

#

in the ckpt_link and leave everything else blank

fickle haven Jan 28, 2023, 5:56 PM

#

finetune___Stable_Diffusion_-_Discord_28_01_2023_18_56_21.png

#

UHHH

fast-DreamBooth.ipynb_-_Colaboratory_-_Google_Chrome_28_01_2023_18_56_27.png

fast epoch Jan 28, 2023, 5:59 PM

#

Yea, seems like it can't be converted into diffusers

fickle haven Jan 28, 2023, 5:59 PM

#

:C model must be broken

fast epoch Jan 28, 2023, 6:00 PM

#

yea

#

Did you try to write Cryonicus/Gemini_Anime in the huggingface path leaving everything else blank?

fickle haven Jan 28, 2023, 6:06 PM

#

fast epoch Did you try to write Cryonicus/Gemini_Anime in the huggingface path leaving ever...

yes i got errors

fast epoch Jan 28, 2023, 6:10 PM

#

Then the model is broken

#

or the dreambooth from thelastben is not updated

fast epoch Jan 28, 2023, 6:43 PM

#

Does anyone know what are the best settings for a person training in dreambooth?
Because the settings found on youtube keep failing

bronze igloo Jan 28, 2023, 7:42 PM

#

@fast epoch can you tell us more about what you are running now and what is failing?

fast epoch Jan 28, 2023, 8:51 PM

#

bronze igloo <@886639463034413166> can you tell us more about what you are running now and wh...

Yea, I'll give you examples

gloomy pike Jan 28, 2023, 9:33 PM

#

When I try training an older pt backup I get stuck at this task.

"Applying cross attention optimization (Doggettx)."

I can always train a new hypernetwork and back ups don't always freeze here but they do more times than not.

gloomy pike Jan 28, 2023, 11:24 PM

#

putting max steps higher than the step the tp file finished on fixed it.. does someone have a better understanding why? I mean it made enough sense for me to try and it worked but why? does it keep track of it's total steps and sees its self done if the steps are lower??????

gloomy pike Jan 29, 2023, 12:52 AM

#

How do you know if the pt file in your embedding is actually being called? Even if there isn't an associated pt file in your embedding, having anything extra added to the prompt despite keeping seed the same will have a change on the result. Sometimes I get an error at boot up saying my pt files have failed to load too.

bright surge Jan 29, 2023, 1:09 AM

#

Hey, are any stable diffusion experts out there that could you lend me a hand with something? I hope that this is the correct chat for this.

I have made a custom model of a face using the Google colab resources rather than my own since my laptop does not have enough VRAM for training but it handles generation just fine. However, I would love it if I could have that model as an embedding to use in other models like analog-diffusion or open-journey, rather than just the base 1.5 that I trained it on. I do not have the VRAM for training an embedding sadly. Checkpoint merging doesn't work so well and it degenerates the likeness of the custom model, or I may not have the sliders or values correct. Any tips on this? I would appreciate it so much! 🥹 .

split acorn Jan 29, 2023, 1:14 AM

#

Merging custom dreambooth models are usually, New Model (A) + DreamBooth Model (B) - Model DreamBooth was trained on (C) with Add Difference = 1 (There are a couple other numbers you could try here). If the New Model is close to the DreamBooth data you added, then it should work well.

split acorn Jan 29, 2023, 1:29 AM

#

bright surge Hey, are any stable diffusion experts out there that could you lend me a hand wi...

Here's a good video on it:
https://www.youtube.com/watch?v=xLQcWKI5OLk&t=2s

YouTube

Olivio Sarikas

Stable Diffusion: Merging Models in Automatic 1111 - The BEST Trick...

Merging Models in Automatic 1111 is the BEST way to refine and improve your Models. Checkpoint Merging in Automatic 1111 explained in a very easy away. Weighted sum and Add difference for Checkpoint Merger explaint in Automatic 1111 for Stable Diffusion. Merge any Stable Diffusion Model to mix different styles and models together. Improve the lo...

▶ Play video

gloomy pike Jan 29, 2023, 3:27 AM

#

This is what I get when I start Stable Diffusion with a hypernetwork trained pt file in the embedding directory. How do I actually use my results?

Error loading embedding cammyTrainedModel012823.pt:
Traceback (most recent call last):
File "C:\StableDiffusion\stable-diffusion-webui\modules\textual_inversion\textual_inversion.py", line 205, in load_from_dir
self.load_from_file(fullfn, fn)
File "C:\StableDiffusion\stable-diffusion-webui\modules\textual_inversion\textual_inversion.py", line 177, in load_from_file
raise Exception(f"Couldn't identify {filename} as neither textual inversion embedding nor diffuser concept.")
Exception: Couldn't identify cammyTrainedModel012823.pt as neither textual inversion embedding nor diffuser concept.

bright surge Jan 29, 2023, 3:28 AM

#

split acorn Here's a good video on it: https://www.youtube.com/watch?v=xLQcWKI5OLk&t=2s

Thank you Alicat! I will test that out. I think the sliders and values that I am using are off as I already tried that merge formula. I will check out that video and see if I can see where I am going wrong habby .

split acorn Jan 29, 2023, 3:32 AM

#

gloomy pike This is what I get when I start Stable Diffusion with a hypernetwork trained pt ...

Hypernetworks go in the "hypernetworks" folder

#

I'm not sure what Auto did in the recent updates

#

I think you can now?

gloomy pike Jan 29, 2023, 3:33 AM

#

split acorn Hypernetworks go in the "hypernetworks" folder

and i can call it in prompt?

split acorn Jan 29, 2023, 3:34 AM

#

https://github.com/AUTOMATIC1111/stable-diffusion-webui/wiki/Features#hypernetworks

gloomy pike Jan 29, 2023, 3:35 AM

#

split acorn https://github.com/AUTOMATIC1111/stable-diffusion-webui/wiki/Features#hypernetwo...

darn missed that, i was looking through the dummies guide thanks! ill see if it works

gloomy pike Jan 29, 2023, 3:41 AM

#

split acorn https://github.com/AUTOMATIC1111/stable-diffusion-webui/wiki/Features#hypernetwo...

it works! 😁 thanks! ive gone over that page you linked me so many times, hypernetworks section was so brief!

split acorn Jan 29, 2023, 3:41 AM

#

Yeah, was recently updated

dapper prism Jan 29, 2023, 3:42 AM

#

Has anyone tried finetuning with 15,000 or more training images & text pairs? How long did training take?

gloomy pike Jan 29, 2023, 5:50 AM

#

has anyone ever tried throwing desired results from training into the database with the originals to push it more in the desired direction?

split acorn Jan 29, 2023, 6:08 AM

#

yep

#

works well alicatPog

#

is how I turned a 1 image dataset into an 8 image dataset for better variety, by cherrypicking and editing the results and then feeding them back in

fallen cloud Jan 29, 2023, 12:43 PM

#

dapper prism Has anyone tried finetuning with 15,000 or more training images & text pairs? Ho...

Biggest imageset so far is my current model in training. ~3000 images, and the traning time in Google Colab was aprox 45 h.

fast epoch Jan 29, 2023, 12:46 PM

#

Idk, but the dreambooth doesn't work anymore

#

they updated something today or yesterday

#

in the webui

fallen cloud Jan 29, 2023, 1:00 PM

#

Damn 🫤 just woke up and was planning to try that DB out 😂

fast epoch Jan 29, 2023, 1:05 PM

#

Fixed it

#

downgraded to a lower version

#

Seems that this model doesn't have "sha256"

📎 train_dreambooth_working.ipynb

fallen cloud Jan 29, 2023, 1:14 PM

#

Nice! 😁 then i will try that one in a bit. The current model worked out.. well.. not good at all 🫤 became way to strong, and some elements lingered on into every render, like a painting she was working on and a bed she lay on. And those also makes it almost impossible to put other preferences into the image despite the setting from the original image.. hmm..

#

Might be a good model to merge though, if it turned out that strong 🤔

fast epoch Jan 29, 2023, 1:29 PM

#

Gonna refix this

#

#

Textual inversion training

#

I don't really get the results on which I wish

#

still training

#

trying to make a model for Margot Robbie just to see what are the best settings

#

Bad that the "textual inversion masters" don't show us some settings

#

And how do I know if the model is flexible enough?

#

This server has around 21000 people online

#

And no one answers :))))

#

Nah, the dreambooth is still not working

dapper prism Jan 29, 2023, 2:27 PM

#

fallen cloud Biggest imageset so far is my current model in training. ~3000 images, and the t...

What GPU?

scenic mural Jan 29, 2023, 4:13 PM

#

Does anyone remember M.U.S.C.L.E men? I want to fine tune a model that can imagine new variations. Is there a notebook that would be particularly suited to this?

raw wraith Jan 29, 2023, 4:59 PM

#

I heard some people saying that instead of training models on subjects/styles it's better to train a lora and then merge the lora with a model
Any truth to this?

neat mural Jan 29, 2023, 6:16 PM

#

hi guys need some help with training embeddings ? i have 2 pcs, 1 works fine and the second with better hardware doesn't seem to pickup any of the images i do, even if they are the same input

tacit bronze Jan 29, 2023, 7:36 PM

#

chai from hi-fi rush

fallen cloud Jan 29, 2023, 8:48 PM

#

raw wraith I heard some people saying that instead of training models on subjects/styles it...

Haven't tried that yet, but it sounds as an interesting idea though! If you find out more I'm interested to know more!

fallen cloud Jan 29, 2023, 9:05 PM

#

fast epoch And no one answers :))))

Yeah, feels like this channel aint overflowing with active users 😂

fast epoch Jan 29, 2023, 9:10 PM

#

I think that I fixed the dreambooth extension

#

Doing tests now

fallen cloud Jan 29, 2023, 9:16 PM

#

Oh, holding my thumbs for ya!

#

Im running some tests on the model in training. Step 104 000 to step 124 000 in evaluation now 🥳

fast epoch Jan 29, 2023, 9:38 PM

#

📎 SD_All_in_One_with_dreambooth_and_with_everything_workingv2.ipynb

#

Updated and tested

#

working as of now

#

it has dreambooth, lora training and pix2pix

fast epoch Jan 29, 2023, 10:11 PM

#

What are the best settings for person training in LoRA?

#

11 photos

tepid sundial Jan 29, 2023, 10:14 PM

#

I've had really good results with 6-12 images and using the default training scripts available in the lora repo

#

Quality of images have a very big impact in my experience

split acorn Jan 29, 2023, 11:12 PM

#

raw wraith I heard some people saying that instead of training models on subjects/styles it...

Yeah, LoRAs are faster, more flexible, smaller, and can produce good results. Since it's just training the specific weights instead of the whole model. I would recommend CB_nod

#

Kohya's repo for LoRA training is the best ATM, imo

#

Here's the GUI version

#

https://github.com/bmaltais/kohya_ss

GitHub

GitHub - bmaltais/kohya_ss

Contribute to bmaltais/kohya_ss development by creating an account on GitHub.

serene flicker Jan 30, 2023, 12:43 AM

#

I need some help with TI. I've trained a really good one on some ghosty things before, it was fantastic, worked great. Today, I have been trying to train another one, but when using all the same settings, it just doesn't work! I have preprocessed all of the images, all are the same size, have captions, etc. But after looking at the training files and testing out some of the produced .pt files on a sparate device, it actually isn't training anything. DIfferent embeddings produce pixel-prefect copies, and it doesn't actually look like anything in the training data. It just look like a normal generation. Has anyone else faced anything like this before? Any help would be greatly appreciated.

#

Here are the settings from the embedding training

📎 settings-2023-01-29-16-12-07.json

#

I've tried training three times today and this has happened all three times

serene flicker Jan 30, 2023, 1:35 AM

#

I'm trying restarting sd and deleting venv, I doubt this will work

#

But we will see

frank ibex Jan 30, 2023, 1:35 AM

#

In your txt2img tab, did you select a model?

serene flicker Jan 30, 2023, 1:35 AM

#

I did notice a weird "x/800000 steps" in the command line that would increase with every step of an image

serene flicker Jan 30, 2023, 1:35 AM

#

frank ibex In your txt2img tab, did you select a model?

Yep,sd1.5 as seen in the settings

frank ibex Jan 30, 2023, 1:36 AM

#

that's what can trip me up from time to time, just making sure it wasn't something like that

#

I haven't had any issues where nothing is training though

serene flicker Jan 30, 2023, 1:37 AM

#

It is a very weird issue

winter apex Jan 30, 2023, 1:37 AM

#

split acorn https://github.com/bmaltais/kohya_ss

i really need a colab of this repo, it can also extract a lora file from an already trained dreambooth model
https://www.reddit.com/r/StableDiffusion/comments/10kuzmh/how_to_extract_small_lora_file_from_custom/

#

i love how the people in civitai are slowly transitioning to LoRAs instead of textual inversions and dreambooth models

serene flicker Jan 30, 2023, 1:40 AM

#

I don't understand really what lora is, I tried it once but it wasn't working because i kept running out of memory (I can do it now since I figured out the mem issue) but I don't know how to use it. Is it like a dreambooth model? Or something else fancy?

#

alright, I'm retrying the training, I will probably know within the first two training images if it's working ot not

#

Ok, based on the first training image I think it's working? The outfit is more similar to one of the input images than what I was getting previously.

#

I'll make sure to look at the second and third to really make sure

#

weird, I am still getting this line

#

I do not have it set to 80000 steos

#

it only goes up with the images generated during training, but then it also went up with images generated normally. I think that's the issue maybe? Or could be a side effect of whatever the issue is?

#

well that line is gone now

#

Must be a weird bug

#

But it might be working now

#

I guess the only way I will know for sure is if I test the embedding for differences in an image

#

Oh I think it's working!

#

Not really what I am going for, but this is only at 200 steps

serene flicker Jan 30, 2023, 3:07 AM

#

Nevermind, training is still broken :(((

nova finch Jan 30, 2023, 3:08 AM

#

Why

serene flicker Jan 30, 2023, 3:08 AM

#

It just doesn't actually train anything

serene flicker Jan 30, 2023, 3:08 AM

#

serene flicker I need some help with TI. I've trained a really good one on some ghosty things b...

This is my issue

nova finch Jan 30, 2023, 3:09 AM

#

Fuck

split acorn Jan 30, 2023, 3:57 AM

#

there's a colab version, one sec

serene flicker Jan 30, 2023, 3:58 AM

#

I still think that second line has something to do with my issue and I have no clue why it's there

split acorn Jan 30, 2023, 4:00 AM

#

winter apex i really need a colab of this repo, it can also extract a lora file from an alre...

https://colab.research.google.com/github/Linaqruf/kohya-trainer/blob/main/kohya-LoRA-dreambooth.ipynb
https://github.com/Linaqruf/kohya-trainer

serene flicker Jan 30, 2023, 4:46 AM

#

I'm curious if having "overwrite old embedding" checked in the create embedding tab is the issue, htough I just did a clean install of the webui so I guess I will never know

#

not the solution I guess, either things. I guess I have to revert to an old version.

#

wait

#

It might be working?!

#

Hoenslty idk at this point, the training images are very different between 50 and 100 steps

#

ill wait a few hundred more steps then go to bed

#

nah I don't think it's working, that 100 step training image was probably coincidence?

viral bison Jan 30, 2023, 6:18 AM

#

fast epoch

Heya I'm complete noob in coding, could you help me how to install this ?,

viral bison Jan 30, 2023, 6:19 AM

#

viral bison Heya I'm complete noob in coding, could you help me how to install this ?,

any steps would be appreciated

#

thanks in advance

fast epoch Jan 30, 2023, 9:26 AM

#

viral bison Heya I'm complete noob in coding, could you help me how to install this ?,

Yo

#

Download that file

#

Search "google colab" on google

#

click on "upload", then upload the downloaded file there (on google colab) and then run all the cell codes

#

but not all at the same time

#

Step by step

#

https://www.youtube.com/watch?v=DK3xg8QLh_U&t=13s - made this video tutorial

YouTube

Bullseye-StableDiffusion

World Premiere - Dreambooth and Instruct-pix2pix on Stable Diffusio...

Thanks for watching!
I created a Discord server for discussions/help about Stable Diffusion on Google Colab: discord.gg/rH9YXMYfpT
You can download the notebook file for Google Colab from: github.com/Bullseye-StableDiffusion/stable_diffusion_webui_allinone_dreambooth/blob/main/SD_All_in_One_with_dreambooth_and_with_everything_workingv1.ipynb
Jus...

▶ Play video

#

If you still have issues, let me to know

#

Working on a method to launch the notebook from google drive and to save all the progress there

#

atm

#

to make google drive as a "HDD/SSD" for launching webui

fast epoch Jan 30, 2023, 11:16 AM

#

Made it to run on google drive memory

#

so you don't have to redownload everything all the time

fallen cloud Jan 30, 2023, 11:20 AM

#

@fast epoch my virus-guard jumped up and down for some trojans when i installed that colab into my drive btw. could it be because of the civitai extension perhaps?

fast epoch Jan 30, 2023, 11:20 AM

#

fallen cloud <@886639463034413166> my virus-guard jumped up and down for some trojans when i ...

Yea

#

for one of the extensions or for the newest xformers

#

or even the model can be

#

if it's ckpt

#

the safetensors are the safest

winter apex Jan 30, 2023, 1:38 PM

#

split acorn https://colab.research.google.com/github/Linaqruf/kohya-trainer/blob/main/kohya-...

cool, im gonna try and see what happen

#

thank youu

viral bison Jan 30, 2023, 1:45 PM

#

fast epoch Yo

Huge thanks for explaining everything, I will try it out and reach you out if I have any problems

fallen cloud Jan 30, 2023, 2:43 PM

#

@fast epoch I will try to load a different model with it later on and see, looked like it wored though! 😁 👍

fast epoch Jan 30, 2023, 2:45 PM

#

fallen cloud <@886639463034413166> I will try to load a different model with it later on and ...

Yea, it works

#

Even if the progress will stop at a certain point in webui (it will show like "1 hour left" and nothing changes), if you check the code from google colab it is doing the job

#

training epochs...

fallen cloud Jan 30, 2023, 2:46 PM

#

Anybody has a good way of telling when a model is overtranied, and when to train it further, but with smaller steps etc? 🤔 rignt now i've trained a model with aprox 3000 images, 200 000 steps. Now I have to evaluate which save of the model that is the best, and were it started to get overtranined. Right now I've mostly been guessing and gone by feeling,. But perhaps someone here has more experience of Dreambooth models?

fast epoch Jan 30, 2023, 2:47 PM

#

When you'll see "model saved" or something like this, it is really finished and you can reload the webui page

split acorn Jan 30, 2023, 2:48 PM

#

fallen cloud Anybody has a good way of telling when a model is overtranied, and when to train...

There's a couple ways. For example, visual distortions (there's a certain look overfitted models get) and you can do overfitting tests.

#

for the overfitting tests, basically make prompts that require like the character changing outfits or changing styles

#

or a prompt that doesn't fit the base data

#

if the result always puts on a certain outfit that you trained on and doesn't do anything else, then it means it's overfit

fast epoch Jan 30, 2023, 2:49 PM

#

viral bison Huge thanks for explaining everything, I will try it out and reach you out if I ...

If you still have issues, you can ask for help on my server.

split acorn Jan 30, 2023, 2:49 PM

#

If you need to have a low CFG in order to get good results, it's probably overfit GoatUppies

fallen cloud Jan 30, 2023, 3:14 PM

#

split acorn if the result always puts on a certain outfit that you trained on and doesn't do...

I will try to sett a trial prompt, working trough all of the step-savings that has been produced and see. I have never tried to train such a big model eralier, so its a first trial and error now i suppose. But the feeling I had when i run a few testruns from step 2000 up to 122 000 steps, it felt like it was very hard to make the AI to use the model in almost any kind of "nre situation", it kinda clinged on to the original images and settings all the time. So far its been no visual distortions though. Im about to run the last batch from 122 000 steps - 202 000 steps now and see what the result will be.

I have a feeling that when working with one single model and such a big number of images, perhaps i need to work more with the captions too. Describe more if expressions and postures and such.
I dont know though, in the end thats just a feeling that I perhaps need to try, but hopefullt has somebody already tried that and knows some about it 😂

viral bison Jan 30, 2023, 3:29 PM

#

fast epoch If you still have issues, you can ask for help on my server.

Added

split acorn Jan 30, 2023, 3:37 PM

#

At the end of the day, it just comes down to what your goal is

#

If the model does what your goal is, then you're golden

#

even if it's overfit

fallen cloud Jan 30, 2023, 3:46 PM

#

split acorn If the model does what your goal is, then you're golden

Mmhm true that. Im actually not sure if i have a specific "goal". Im trying to see what it takes to make a as perfect model as possible. To be able to catch a human persons all looks, personal physical traits and quirks into one model. If it is possible and how good of a model it is possible to produce, and perhaps find out a formula for being able to do that 😌

#

Test the boundaries of the AI modelwise so to say

split acorn Jan 30, 2023, 3:47 PM

#

To be able to catch a human persons all looks, personal physical traits and quirks into one model.
Then you can test for that, and if it can do that, then you're golden

fallen cloud Jan 30, 2023, 3:49 PM

#

Yeah. Thats why i have a feeling i need to specify that in the captions. What is going on in the pictures. How else should the AI know what is what. I dont know though how "strong"/important the captions is for the result during Dreambooth training.

#

If the captions are vital for the result, i would not have any problem sitting down a week and write the captions manually. But when i dont know if it would be a waste of time or not i'm really not there to invest that time yet 😂 ..probably though, even if nobody knows of i cant find out how vital it is, i will probably try anyway some day. But hopefully there are people who knows more about this than i do and can guide me on the right path 😁

split acorn Jan 30, 2023, 3:54 PM

#

The captions are a huge contributor to quality. For large datasets, people batch caption. Tho, inaccurate tags do hurt the quality but it's just a matter of the cost of time vs quality

#

Also if you're doing large datasets, are you caption training (finetuning)? DreamBooth is good for like a few concepts but finetuning is better for many

viscid cedar Jan 30, 2023, 3:57 PM

#

I am unable to message gobot

#

Is there any subscription needed

split acorn Jan 30, 2023, 3:57 PM

#

viscid cedar I am unable to message gobot

#📣｜announcements message

fallen cloud Jan 30, 2023, 4:05 PM

#

split acorn The captions are a huge contributor to quality. For large datasets, people batch...

Thats what i did this time. i BLIPet the captions, then searched all the files for errors it usually does, like describing the female model as a hi, and misinterpreting tattoos for bracelets and stuff.

I have done like 50-60 different models using dreambooth out of friends and family mostly, trying different settings for getting the best result. So for "avatars-training" i have a formula. But when truing to get body language, natural poses etc into the mix, and also needing to up the image quantity im back to being a newbie it feels like. A long way to go and the variables are far greater when working with larger image sets, and trying to get it more detailed.

Caption training? Hmm.. i do train the text_encoder in dreambooth if that is what you mean.

I'll post my settings. brb.

#

This is my current setting (first tryout) for the large imageset with 2948 images.

split acorn Jan 30, 2023, 4:07 PM

#

are you using an instance token?

fallen cloud Jan 30, 2023, 4:07 PM

#

On that i must say no.

split acorn Jan 30, 2023, 4:07 PM

#

You're probably finetuning then if you're learning off the captions and aren't using an instance token / instance prompt

fallen cloud Jan 30, 2023, 4:09 PM

#

Yeah, no im not using any instance tokens. Then perhaps its even More important that the captions are describin the main images correctly and as accurate as possible 🤔

#

My next wounder about captions is, how detailed should they be. Is a few lines alright.

#

Like this is pretty much the standard format of the caption (just took one out of the batch on random)

"a woman with a necklace on her neck smiling at the camera with a smile on her face and a necklace on her neck"

split acorn Jan 30, 2023, 4:13 PM

#

If you want a model that's good at making women, necklaces, smiling at the camera, then yep!

#

But if you want it to "catch a human persons all looks, personal physical traits and quirks into one model" then you need to include those captions on those images

#

Or at least, including those captions will allow you to get them when you prompt for them

#

if they're missing it won't happen unless it's overfit (and that will only mean some things are possible)

fallen cloud Jan 30, 2023, 4:18 PM

#

Now when actually discussing it with someone it suddenly feels so obvious that is the way to go 😂

Well.. i'll see what this model is capable of doing at least, and then start working for more detailed information in the captions in the next one.

Now i saw that dreabooth actually renamed the captions-folder from "captions" to "captionsoff" also 🤔 perhaps me adding the captions.zip and captions in the ../model/cations/ folder manually perhaps didnt work at all.

#

Dreambooth refused to let me add the captions during the image-upload phase and started to abort due to that "model(1845).txt" is not a supported imageformat. So i had to add them to the drive manually before I started the training.

#

😮‍💨

split acorn Jan 30, 2023, 4:20 PM

#

GoatNLT

#

Yeah, I only local, so I'm not sure how the colab versions work exactly. alicatCry

fallen cloud Jan 30, 2023, 4:22 PM

#

Mmhm.. im going to try @fast epoch webUI version now instead of lastbens fast-dreambooth. Hopefully it will work better. Atleast until i've upgraded my computer so i can run all this locally instead 😅

#

@split acorn do you know how extensive you should write the captions also?

split acorn Jan 30, 2023, 4:42 PM

#

If your goal is to ""catch a human persons all looks, personal physical traits and quirks into one model"
So make sure to include "looks, personal physical traits and quirks, body language, and poses" for example alicatPog

#

I'm not sure what the limit is or how to go about that in the most optimal way, but that's the general jist of it

#

You could try small datasets first with various captioning methods

#

to figure out which one works best for what your goal is and then scale it up from there

fallen cloud Jan 30, 2023, 4:46 PM

#

Good idea there. Perhaps its better to acctually go through each different set of images, containing different kind of expressions and traits, to get that set to work in a model first, then when all different "sets" are working, add them up into one model containing them all 🤔

split acorn Jan 30, 2023, 4:47 PM

#

mm mm, is an idea alicatPog

#

For more complicated models or for training that includes "sets", I would recommend Kohya, personally

#

but that's just me alicatPog what you're doing could work perfectly fine

#

https://github.com/kohya-ss/sd-scripts

#

Lets you rebalance the sets easier

#

(or at least it's one that im familair with that allows easy balancing)

#

There's a colab for it too, but I can't speak to how good it is or if it's better vs what you're using

#

Everydream is nice too alicatUwU

fallen cloud Jan 30, 2023, 4:51 PM

#

Ooh.. i haven't tried Kohya yet. Found it somewhere yesterday and actually has an open window with a colab running kohya open, thinking of checking it out. I think someone mentioned that Dreambooth is better for smaller amount of images, and kohya could be better for, as you said, more complex models. Everydream was also mentioned in the same sentence as Kohya so i have a window with that one too. But hasnt found a cloab of it yet 😂

fallen cloud Jan 30, 2023, 4:52 PM

#

split acorn (or at least it's one that im familair with that allows easy balancing)

If I have any thoughts when trying it, perhaps i can check with you for some pointers?

split acorn Jan 30, 2023, 5:01 PM

#

Yep, sure. I haven't done any large scale models yet though, so someone else might provide better feedback alicatPog

#

There's a server for DreamBooth and EveryDream and many people for Kohya hang out on a couple servers (no official one, that I'm aware of)

#

they might be able to help more than I can

fallen cloud Jan 30, 2023, 5:07 PM

#

Sweet tx ^_^

split acorn Jan 30, 2023, 5:28 PM

#

https://www.youtube.com/watch?v=dVjMiJsuR5o

YouTube

koiboi

😕LoRA vs Dreambooth vs Textual Inversion vs Hypernetworks

There are 5 methods for teaching specific concepts, objects of styles to your Stable Diffusion: Textual Inversion, Dreambooth, Hypernetworks, LoRA and Aesthetic Gradients. The question is: which one should you use?

In this video we review 3 key research papers, look at the underlying mathematical mechanics behind each method, analyze data from...

▶ Play video

#

is a good video, as well

fallen cloud Jan 30, 2023, 6:05 PM

#

split acorn https://www.youtube.com/watch?v=dVjMiJsuR5o

That was a interesting one informationwise 😂 thanks for sharing!

bronze igloo Jan 30, 2023, 8:03 PM

#

Anyone have this issue?

Renders look most like subject during training preview - then turn into a completely different person

I have noticed that the live previews look amazing, albeit a bit blurry, during the beginning of the render process. Then, after about half way through, they start morphing into some unrecognizable subject, which usually ends up looking like a weird relative of the subject, or they just turn into a senior citizen.
I have created 4 models so far based on different subjects, all with the same settings/amount of training. Two of them produce some pretty amazing results, while the other two behave in the way I just described.
Why does this happen and are there any tips on how to prevent this?

fallen cloud Jan 30, 2023, 8:05 PM

#

I have had that issue also, dont know why or how to solve it though.

inner meteor Jan 30, 2023, 8:49 PM

#

bronze igloo Anyone have this issue? Renders look most like subject during training preview...

is this for Dreambooth or TI embeds?

bronze igloo Jan 30, 2023, 8:49 PM

#

@undone portal dreambooth

#

using huggingface diffuser example

inner meteor Jan 30, 2023, 8:49 PM

#

ahh yeah i have that issue before too. i am assuming you using later version of Dreambooth

#

i actually switched back to early december build

#

cause i was struggling to learn while they kept changing the code

#

and i had same exact issue .... never resolved.... but i also haven't gone back to new code

#

december builds were much simpler

#

but they didn't work with SD2.x

#

anyone know any servers or locations that work with embeddings/TI? i'm having issues getting mine to look like my subject. i did tutorials.... SECourses had a good tutorial with sample iimages... i followed along... got great results... then when i put my images in... horrible

#

so i'm guessing it's my source images. but i have no idea why. they are clear and i even changed all the backgrounds

bronze igloo Jan 30, 2023, 8:53 PM

#

@inner meteor what do you mean by "version of dreambooth" do you mean their train_dreambooth.py?

inner meteor Jan 30, 2023, 8:55 PM

#

yeah you can go to old versions of the code

#

let me get url

#

so ... like here - https://github.com/d8ahazard/sd_dreambooth_extension

GitHub

GitHub - d8ahazard/sd_dreambooth_extension

Contribute to d8ahazard/sd_dreambooth_extension development by creating an account on GitHub.

#

if you click on COMMITS

#

on the right hand side...

#

see under <>code

#

then you can download the build from any point in time

#

so what i did was uninstalled the extension

#

and used an old code base from mid december

#

i know they slowed down and are now focusing on stabilizing what's there

#

but i'm not sure if it's "polished" yet

#

i'd look at tutorials you following, and look at the date they posted... then get a build from around then

formal grail Jan 31, 2023, 12:18 AM

#

Anyone now of a way to solve the saturate image output in Stable Diffusion (Automatik 1111). I feel most images are over-saturated by at least 30%, giving the images a kind of childish comic look.

rain tapir Jan 31, 2023, 12:38 AM

#

Is it just me or is pix2pix super inconsistent af?

#

Like it seems only once in a blue moon it actually does something without making the image look like bullshit

last kernel Jan 31, 2023, 3:54 AM

#

Hi everyone,
I am looking for available options to fine-tune stable diffusion inpainting for a custom dataset and need some help. I found two open-source models- one by runaway ml https://github.com/runwayml/stable-diffusion, the checkpoint is provided but the training code seems to be missing there, and the other one in the StabilityAI stable diffusion repo https://github.com/Stability-AI/stablediffusion for which I am currently trying to run the existing model, Not sure whether training scripts are available or not.
Is the training or fine-tuning code available on GitHub or hugging face for stable diffusion inpainting? or is anyone able to fine-tune the text2img stable-diffusion or inpainting model?
Also, the major thing is what are the minimum hardware requirements to fine-tune the model? I do see some stats posted for the model trained from scratch. Can't find any info related to hardware specs for fine-tuning?
Apology for any redundant questions. I started exploring stable diffusion last week only and recently joined this discord channel.

GitHub

GitHub - Stability-AI/stablediffusion: High-Resolution Image Synthe...

High-Resolution Image Synthesis with Latent Diffusion Models - GitHub - Stability-AI/stablediffusion: High-Resolution Image Synthesis with Latent Diffusion Models

serene flicker Jan 31, 2023, 4:44 AM

#

serene flicker I need some help with TI. I've trained a really good one on some ghosty things b...

https://github.com/AUTOMATIC1111/stable-diffusion-webui/issues/7264 I FOUND MY ISSUE

GitHub

[Bug]: RTX 4000 loose Train Embedding Effect · Issue #7264 · AUTOMA...

Is there an existing issue for this? I have searched the existing issues and checked the recent builds/commits What happened? When you Train Embedding on RTX 4080 or 4090 (after last Torch and xfor...

#

(I'm not on 40 series but it's the same issue I am having)

median rose Jan 31, 2023, 7:32 AM

#

Hey folk!

#

Can anyone please tell me what can be done about this?)

split acorn Jan 31, 2023, 7:54 AM

#

median rose Hey folk!

You could try reinstalling python. Make sure to install for all users and click the "Add to PATH" box.

median rose Jan 31, 2023, 8:26 AM

#

split acorn You could try reinstalling python. Make sure to install for all users and click ...

already tried a couple of times, with python the paths are registered, the command line recognizes the command. Python version according to documentation

#

oh.. looks like it looks like I slightly mixed up the channel in dc (

stone garden Jan 31, 2023, 12:58 PM

#

Hi guys! Is there anyone succeed in using dreambooth to fine-tune stable diffusion inpainting? Neither https://github.com/huggingface/diffusers/tree/main/examples/research_projects/dreambooth_inpaint nor https://github.com/ShivamShrirao/diffusers/blob/main/examples/dreambooth/train_inpainting_dreambooth.py works

GitHub

diffusers/examples/research_projects/dreambooth_inpaint at main · h...

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch - diffusers/examples/research_projects/dreambooth_inpaint at main · huggingface/diffusers

GitHub

diffusers/train_inpainting_dreambooth.py at main · ShivamShrirao/di...

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch - diffusers/train_inpainting_dreambooth.py at main · ShivamShrirao/diffusers

winter apex Jan 31, 2023, 1:28 PM

#

stone garden Hi guys! Is there anyone succeed in using dreambooth to fine-tune stable diffusi...

i have read that you can train a normal dreambooth model and then merge it with the inpainting model with a certain config, it will give you a good custom inpainting model:
https://www.reddit.com/r/sdforall/comments/zyieht/how_to_turn_any_model_into_an_inpainting_model/

r/sdforall - How to turn any model into an inpainting model

188 votes and 41 comments so far on Reddit

winter apex Jan 31, 2023, 1:36 PM

#

bronze igloo Anyone have this issue? Renders look most like subject during training preview...

Something similar happened to me and the problem was that i deleted the background (to a solid color) to isolate my subject in my training images, and it gave me very inconsistent results. I tried again but with the original images and it turned out great

remote latch Jan 31, 2023, 3:55 PM

#

Hi! I would like to share my finetuned model:
v2-base and v2-1 fine-tuned with NovelAI-like aspect-ratio-bucketing https://huggingface.co/ttj/flex-diffusion-2-1

ttj/flex-diffusion-2-1 · Hugging Face

hexed bloom Jan 31, 2023, 5:34 PM

#

200 epochs. Does this mean my learning rate was a little too low, since it still seems to be learning, or should I keep going with the epochs?

#

Anyone know? PepoThink

serene flicker Jan 31, 2023, 5:35 PM

#

I honestly don't know what loss means for training stuff, could you explain?

hexed bloom Jan 31, 2023, 5:37 PM

#

Basically it's the penalty score for how bad the model's prediction is. The higher the number the less predictable the model will be = bad results

#

A "perfect" theoretical model would have a loss of 0 for example

serene flicker Jan 31, 2023, 5:37 PM

#

But that would take forever to train?

#

I'm currently at 407 epochs in my currently training embedding

hexed bloom Jan 31, 2023, 5:37 PM

#

A perfect 0 will always be impossible I believe

serene flicker Jan 31, 2023, 5:38 PM

#

Yeah that makes sense

#

But my loss wildly changes a lot

hexed bloom Jan 31, 2023, 5:39 PM

#

I noticed that a higher batch size leads to less jumping around

#

I'm doing batch 32

serene flicker Jan 31, 2023, 5:40 PM

#

Ah, I am only doing 5

#

with a training set of 25 images

hexed bloom Jan 31, 2023, 5:42 PM

#

Ah yeah I'm doing about 6800 images 😭

serene flicker Jan 31, 2023, 5:42 PM

#

Dang

#

I am only doing an embedding so that's not necessary for me

hexed bloom Jan 31, 2023, 5:43 PM

#

Yeah of course

#

I'm trying to see if I can do huge data properly and seems to be working well, but it's all photography stuff

serene flicker Jan 31, 2023, 5:43 PM

#

Ah

#

I just released an embedding today and I plan to release another one in a few hours

hexed bloom Jan 31, 2023, 5:44 PM

#

Oh noiice!

#

I'm still trying to master the craft lol

serene flicker Jan 31, 2023, 5:44 PM

#

Is your model on anything specific?

serene flicker Jan 31, 2023, 5:44 PM

#

hexed bloom I'm still trying to master the craft lol

I'm just getting really lucky with mine I think 😆

hexed bloom Jan 31, 2023, 5:45 PM

#

It's specific to modern art photography I would say

#

Anything from portraits, animals, and weird art stuff

#

I photographed some friends pets to use for the animals so they always come out 😂

serene flicker Jan 31, 2023, 5:48 PM

#

Aw that's cute

#

I've been wanting to train something on my cat. I have like 300 photos of her on my phone anyway

hexed bloom Jan 31, 2023, 5:49 PM

#

Yeah do it up!

#

The biggest thing I learned was, make sure they are all in different backgrounds and settings

#

That makes the biggest difference

serene flicker Jan 31, 2023, 5:49 PM

#

Well she is an indoor cat, so the backgrounds are very similar

hexed bloom Jan 31, 2023, 5:50 PM

#

Bathroom, bedroom, by the window, kitchen, in fridge, in bathtub, etc etc

serene flicker Jan 31, 2023, 5:50 PM

#

why would my cat be in the fridge

hexed bloom Jan 31, 2023, 5:50 PM

#

And ofc in a box

hexed bloom Jan 31, 2023, 5:51 PM

#

serene flicker why would my cat be in the fridge

serene flicker Jan 31, 2023, 5:51 PM

#

hexed bloom

very cute

worn fable Jan 31, 2023, 6:20 PM

#

what would be good LORA settings for characters , Using Kohya_ss variant ?

fallen zinc Jan 31, 2023, 7:20 PM

#

is it possible to fine tune the instruct-pix2pix models with textual embeddings / LoRA? has anyone tried this?

#

I'm wondering if it's possible to teach instruct-pix2pix to do geometric transformations, like "rotate the cube"

hushed delta Feb 1, 2023, 1:13 AM

#

Anyone have a guide of how to utilize textual inversion files in Automatic's client?

#

so .pt files

thorn vigil Feb 1, 2023, 1:39 AM

#

anyone able to help a fine-tuning noob? i'm stuck on getting the process beyond initialization because of my column naming. Various TypeErrors. Using the ImageFolder method

#

Traceback (most recent call last):
File "/notebooks/training/diffusers/examples/text_to_image/train_text_to_image.py", line 730, in <module>
main()
File "/notebooks/training/diffusers/examples/text_to_image/train_text_to_image.py", line 474, in main
if image_column not in column_names:
TypeError: argument of type 'JpegImageFile' is not iterable

Is this gist of it.

frank ibex Feb 1, 2023, 7:57 AM

#

hushed delta Anyone have a guide of how to utilize textual inversion files in Automatic's cli...

Put your pt files in sd-webui/embeddings/. Under the "generate" button on the top right, the third button is called "additional networks" - it will open a menu with embeds, hypernetworks, loRA, the whole shaaabang

river hatch Feb 1, 2023, 8:16 AM

#

Question for those using DreamBooth: For training faces what has been your best settings? I'm getting inconsistent results. My settings have been 1e-6 training, 2-4k steps, with and without class images, with and without instance prompts. Wondering if you have found a config that has worked well for you

hushed delta Feb 1, 2023, 8:17 AM

#

frank ibex Put your pt files in `sd-webui/embeddings/`. Under the "generate" button on the...

R they enabled by default

frank ibex Feb 1, 2023, 8:19 AM

#

Yea when you load the ui from a terminal window, you should see an output with all the names of your embeddings

#

rapid perch Feb 1, 2023, 11:06 AM

#

So I understand that having different backgrounds is pretty important for textual inversion. Does anyone have experience with masking their image and adding plain colored backgrounds? Or would that defeat the purpose of textual inversion?

finite creek Feb 1, 2023, 1:07 PM

#

Im getting blurry images after training a model, does it mean its over training? only used 69 epochs, learning rate of 0.000001, 16 input images and 10X for reg. images

#

I think I used these same settings another time and got good results

split acorn Feb 1, 2023, 3:46 PM

#

rapid perch So I understand that having different backgrounds is pretty important for textua...

yeah, doing that can work fine. Just make sure to describe the background. And keep in mind if all your pictures have it, then it can start sticking with your generations

hexed bloom Feb 1, 2023, 5:00 PM

#

This graph means that I should continue with my training, as the loss is still going down, correct? My sanity sample prompt still holds the original artist's style.

upbeat tulip Feb 1, 2023, 5:33 PM

#

Can anymore kindly explain to me or refer to some resources on how to finetune the inpainting checkpoint of the stable diffusion model on my custom dataset?

atomic cedar Feb 2, 2023, 3:05 AM

#

Yo guys me and my mate are very new to this, how would we take a hyper realistic image of a person that was generated and feed it back into stable diffusion to finetune or fix certain elements of the photo?

#

Any reference point of where we can start to research or look into?

indigo orbit Feb 2, 2023, 8:29 AM

#

When training a woman's face with textual inversion, should avoid pictures the woman whose head is turned sideways, like when she is lying down?

cloud raven Feb 2, 2023, 9:42 AM

#

hi guys, i am trying to train SDv2.1 with Dreambooth but i'm having some problems with the results. The context right now is people in wheelchairs but the idea is to extend it to others disabilities. I'm using 10 instance images of persons in wheelchair and 200 class images generated by the model before training. But the resulting images are too plain compared with the original model and there is often "extra fingers, extra limbs, deformed face, deformed wheelchair, etc". All of these words are in my negative prompt but it seems that is not enough. I know that "person in wheel chair" is recognized by the model but it has the same problems with deformities even with negative prompts. I am using diffusers repository btw. Do you know if i the approach that im following is right or should i change it? or if it exists some repo that do something similar... any kind of advice is welcome, thx

abstract crag Feb 2, 2023, 12:39 PM

#

atomic cedar Any reference point of where we can start to research or look into?

you can do it with inpainting(manual masking parts to be replaced), and also there is pix2pix option now which I havent tried but it works by instructing AI to make changes on given image

ember mulch Feb 2, 2023, 2:14 PM

#

Hi guys, what is the current best colab notebook for training a dream booth model? Im trying to have it train on garden blueprints

fast epoch Feb 2, 2023, 6:01 PM

#

Do you know if there's a google colab notebook made for textual inversion training except this one https://colab.research.google.com/github/huggingface/notebooks/blob/main/diffusers/sd_textual_inversion_training.ipynb ?

Google Colaboratory

rain tapir Feb 2, 2023, 10:49 PM

#

cloud raven hi guys, i am trying to train SDv2.1 with Dreambooth but i'm having some problem...

10 instance images is too little

#

I usually run 100+ for good results

#

I also recommend using ur own reggies and not self-generated

#

source: master dreamboother

gloomy pike Feb 3, 2023, 1:24 AM

#

Can anyone point me to a place that can give me good examples of under and over training and when to decide to stop training or change rates?

#

ive recently tried 1e-5 for 2000 steps and then going to 1e-6 but I still don't really know what to look for... some stuff I guess with smaller databases start getting rainbow edges and details a lot sooner, what can I do?

#

i use a really smooth image upscaler for small images.

west thunder Feb 3, 2023, 4:30 AM

#

I have a question about training with LoRA. What should my dataset look like for a person who isn't already in the SD dataset? About how many pictures and what should their minimum or ideal resolution be? I've seen some AMAZING LoRAs but mine are coming out garbage. I think it's my dataset.

split acorn Feb 3, 2023, 4:30 AM

#

10 can work just fine (people generally recommend 10-100 for most things), and the whole purpose of regularization is to basically tell the program "these images are normal". There'd be no point if the reguarization images weren't created by the model you're training on. Any good results would just be placebo if you're not.

gloomy pike Feb 3, 2023, 4:43 AM

#

split acorn 10 can work just fine (people generally recommend 10-100 for most things), and t...

Hey, can you give me a tip on what to look for when starting on a dataset and choosing when to chance to a finer rate? Should there ever be a point where my results look exactly like my source or should I always be trying to keep it a little ahead so it looks a little like my source but not exactly? Are rainbowing edges a sign of a blown hypernetwork?

split acorn Feb 3, 2023, 4:46 AM

#

You're hypernetwork training?

gloomy pike Feb 3, 2023, 4:47 AM

#

split acorn You're hypernetwork training?

yes...

#

is it a bad way?

split acorn Feb 3, 2023, 4:48 AM

#

Mmm one sec

gloomy pike Feb 3, 2023, 4:48 AM

#

i kinda like it but I hear it takes I while, im having issues recognizing what good and bad training looks like early on though :S

#

I've trained all this last week on a few things with a wide variety of result

split acorn Feb 3, 2023, 4:51 AM

#

overfitting can look like rainbowing edges yeah, like the quality of the generations loses overall quality

#

Honestly, it just depends on what your goal is

#

If you're doing a character hypernetwork, what you could do is test if you're able to switch details of the subject

#

if you're not able to, then it's a sign of overfitting

#

If general prompts (non-super specific ones) are looking quite similar to your source images, then that's also overfitting

#

backgrounds can give it away

#

From my experience, with smaller datasets, it's pretty easy to figure that out alicatKEK

gloomy pike Feb 3, 2023, 4:55 AM

#

split acorn overfitting can look like rainbowing edges yeah, like the quality of the generat...

will the size of the dataset effect how many steps you can go before it starts falling apart? If there are these rainbow artifacts presents or will using other ai generated photos to learn on also increase this effect?

#

oh ok

split acorn Feb 3, 2023, 4:56 AM

#

I'd personally recommend LoRA over hypernetworks though, because with hypernetworks your modifying the layers indirectly, where LoRA is directly. But I honestly have no idea what one is better or not, I just prefer the direct control.

split acorn Feb 3, 2023, 4:57 AM

#

gloomy pike will the size of the dataset effect how many steps you can go before it starts f...

Yep! size of dataset and number of steps until it breaks are related

#

with larger ones you can typically get away with more steps, and with smaller ones, it tends to break sooner CB_nod

#

I don't understand the second question though

gloomy pike Feb 3, 2023, 5:01 AM

#

ok, excelent, about the second question. Ive used some ai generated photos within some of my hypernet datasets and I would find those details showing up sooner, It was a small set though too.
What about tags? Does it help if I go ahead and load them up with a bunch from deepbooru? I've had some sets that had most prompts the same and I found it go down hill faster but then again it was a set of around 20 images. My problems might all be related to my dataset quantity. I understand better now.

gloomy pike Feb 3, 2023, 5:02 AM

#

split acorn Yep! size of dataset and number of steps until it breaks are related

just a last question, is there any broad chart or recommendation that cross-references number of images by most effective steps???

#

like if im training 20 img, should I start with a higher rate and change before 1000 steps?

#

ive been going twice beyond that

split acorn Feb 3, 2023, 5:03 AM

#

mmmm

#

https://github.com/AUTOMATIC1111/stable-diffusion-webui/discussions/2284
https://github.com/AUTOMATIC1111/stable-diffusion-webui/discussions/2670

#

These are some good sources of information on hypernetworks

#

Number of images vs most effective steps changes depending on the model, settings, dataset and goal so alicatKEK BUT these links have some good general recommendations for settings

serene flicker Feb 3, 2023, 5:06 AM

#

I just trained a lora model for the first time, does anyone know why I would be getting this error when trying to use it?

gloomy pike Feb 3, 2023, 5:07 AM

#

split acorn mmmm

Thank you for your help! The possibilities with this tool seem almost unlimited, can be quite overwhelming lol

serene flicker Feb 3, 2023, 5:08 AM

#

serene flicker I just trained a lora model for the first time, does anyone know why I would be ...

I should probably try asking in #🤝｜tech-support

split acorn Feb 3, 2023, 5:08 AM

#

serene flicker I just trained a lora model for the first time, does anyone know why I would be ...

You could try the extension instead (ironic given the error but alicatKEK )

#

https://github.com/kohya-ss/sd-webui-additional-networks

GitHub

GitHub - kohya-ss/sd-webui-additional-networks

Contribute to kohya-ss/sd-webui-additional-networks development by creating an account on GitHub.

serene flicker Feb 3, 2023, 5:09 AM

#

Ah, is that a replacement for the button thing?

split acorn Feb 3, 2023, 5:09 AM

#

this was the original

#

and yeah, an alternative

#

I prefer this, also gives you fancy sliders

serene flicker Feb 3, 2023, 5:09 AM

#

I mean, a different lora I have seemed to work

serene flicker Feb 3, 2023, 5:09 AM

#

split acorn I prefer this, also gives you fancy sliders

Ah

#

I will install it

split acorn Feb 3, 2023, 5:10 AM

#

This extension should work for all of them (?)

#

(well, assuming they were trained on Kohya, I suppose, or recent db extension)

serene flicker Feb 3, 2023, 5:10 AM

#

I used the dreambooth extension for training

split acorn Feb 3, 2023, 5:10 AM

#

Ooohh that might be the cause

#

older dreambooth extension training might not be compatible

#

I think the only way to use that is to merge your lora into a model

serene flicker Feb 3, 2023, 5:11 AM

#

Oh :(

split acorn Feb 3, 2023, 5:11 AM

#

yeahhhh was dark times back then

serene flicker Feb 3, 2023, 5:11 AM

#

Well the only reason I did it was because I had a large dataset with 200 images, would textual inversion respond well to that?

split acorn Feb 3, 2023, 5:13 AM

#

I personally have no idea. LoRA responds well to large datasets like that though

#

one sec, I'll check alicatPog

serene flicker Feb 3, 2023, 5:13 AM

#

Thanks :)

split acorn Feb 3, 2023, 5:16 AM

#

Yep! TI can do larger datasets like that, as well. CB_nod

#

I haven't made any TI that large though, so I can't help with that alicatKEK

serene flicker Feb 3, 2023, 5:17 AM

#

I only have an 8gb gpu, and have been successfully training some things with around 25 images, so I wonder if my settings should change. Should I use gradient accumulation? I feel like it makes training slower but I would be running this one over night.

#

I also should probably have a pretty high vector count

split acorn Feb 3, 2023, 5:18 AM

#

high vector count isn't always a good thing. Kindly like higher DIM count with LoRAs

serene flicker Feb 3, 2023, 5:18 AM

#

split acorn high vector count isn't always a good thing. Kindly like higher DIM count with L...

Honestly I don't know what DIM is (I never followed a guide when making my lora earlier)

split acorn Feb 3, 2023, 5:18 AM

#

GA can work if you want higher batch size but don't mind sacrificing time, since you're limited on VRAM

serene flicker Feb 3, 2023, 5:19 AM

#

I don't understand why higher batch size is better to be honest

#

I get good results on a batch size of 5, which trains pretty fast. I also have GA to 1.

split acorn Feb 3, 2023, 5:20 AM

#

How it was explained to me is DIM can be seen as how many points on a curve there are. The more points, the more information it absorbs, but some of that information can be "noise" from the image, stuff that you don't want it learning on

#

higher batch can improve the training alicatPog but like everything, there are limits

serene flicker Feb 3, 2023, 5:21 AM

#

So a batch size of 5 and a gradient accumulation size of 5 would get me 25 images at a time, but take like 5 times longer.

split acorn Feb 3, 2023, 5:21 AM

#

especially with smaller datasets

serene flicker Feb 3, 2023, 5:21 AM

#

Ah

#

So if I have a large one it's not really necessary?

#

And do you think more steps would be better with a large dataset? I would assume so

split acorn Feb 3, 2023, 5:22 AM

#

I'd just defer you to the link alicatLove

serene flicker Feb 3, 2023, 5:22 AM

#

Good idea 😆

#

My settings

#

Theoretically should work

#

I guess I will see in the morning

#

Thank you for all your help @split acorn :)

#

Tis on its way, if I have time before school tomorrow I might share results. Depends.

split acorn Feb 3, 2023, 5:29 AM

#

Yosh, good luck GoatUppies

serene flicker Feb 3, 2023, 5:41 AM

#

split acorn Yosh, good luck <:GoatUppies:920183785570586634>

Well shoot, forgot I had a faulty power supply and everything just shut off really loudly so the training failed

#

Dunno if I want to get up and retry

#

Maybe tomorrow

#

Maybe I should install those new batteries into my power supply that I have

#

Anyway imma go back to sleep

somber roost Feb 3, 2023, 3:11 PM

#

Dream Studio is trolling me

#

I'm trying to generate a yellow surfboard, but I get a weird blurry blob instead 😭

high venture Feb 3, 2023, 8:09 PM

#

Worked long time with dreambooth from last year probably November commit, updated to the newest one, and the training process is going, but the model learns nothing, just receive the random images. What could it be?

serene flicker Feb 3, 2023, 10:54 PM

#

high venture Worked long time with dreambooth from last year probably November commit, update...

I think it has to do with xformers and the latest versions of cuda and torch. The same thing happens with textual inversion. Basically you gotta revert to an older version of the webui and delete venv to redownload all the python packages.

serene flicker Feb 3, 2023, 10:55 PM

#

split acorn Yosh, good luck <:GoatUppies:920183785570586634>

Well, my computer didn't shut off while I was at school, and it made it to about 7000 steps in about 9.5 hours.

#

I am tesating it all now

serene flicker Feb 4, 2023, 3:34 AM

#

split acorn Yosh, good luck <:GoatUppies:920183785570586634>

Welp, I hath released the embedding. https://discord.com/channels/1002292111942635562/1071271601699553320

crimson wasp Feb 4, 2023, 7:00 AM

#

Seems there's a bug in many popular models where the first word in the prompt is ignored, which might explain strange training behaviour with them, as well as a potential way to fix it
https://www.reddit.com/r/StableDiffusion/comments/10baavg/bug_warning_with_some_models_other_than_sd14_like/
https://github.com/arenatemp/stable-diffusion-webui-model-toolkit

split acorn Feb 4, 2023, 8:12 AM

#

serene flicker Welp, I hath released the embedding. https://discord.com/channels/10022921119426...

Ooooo nice job alicatPog

fallen cloud Feb 4, 2023, 11:37 AM

#

Somehow fast-dreambooth wont accept me uploading captions with my images in google colab anymore.
I always get a error message that 'modelname (xxxx).txt is not a recognised image file', and the training stops..

So I have to add the captions to the session manually afterwards. But when training, dreambooth renames the 'captions' folder to 'captionsoff' despite me checking the external captions box. So i supposes that it's not using my captions 🤔 Perhaps someone here can confirm if that might be the case, and even better.. tell me how to be able to force fast-dreambooth to use the manually added captions in the model-training 😂

Anyone had this issue and knows if my feeling is true?
And if so, how to fix it?

wispy tulip Feb 4, 2023, 2:38 PM

#

Did anyone have any luck with training LORAs on objects, in particular weapons? I am struggling with what ratio to maintain between the pics of weapons themselves and people wielding them.

gloomy pike Feb 4, 2023, 11:51 PM

#

Hello, a brief question, I tried using batch generation with masks and I could not find the results anywhere in the output folders and only a grid layout of the results in the designated output directory.

serene flicker Feb 5, 2023, 4:19 PM

#

Finished another embedding :) https://discord.com/channels/1002292111942635562/1071827026991915049

prime rivet Feb 5, 2023, 10:22 PM

#

Here is a thing people might find interesting. You can finetune the model a lot with something as simple as Ben's fast dreambooth without touching UNET or Textencoder. Just train concept with good images and the model improves accordingly on that concept.

#

Since it basically just finetuned the text encoder more.

#

So if you are struggling with something, just concept train in DB and you can improve it. This allows you to keep the model otherwise intact.

fast epoch Feb 5, 2023, 11:05 PM

#

Is it a problem if I wish to train a person who has the same background in almost all the photos?

fallen cloud Feb 5, 2023, 11:07 PM

#

fast epoch Is it a problem if I wish to train a person who has the same background in almos...

Could be. If the pictures are from pretty much the same angle the background often seems to "stick" and can be really hard to change to whatever setting you'd like. Sometimes it works better and sometimes its hopeless to remove it.

split acorn Feb 5, 2023, 11:07 PM

#

If you caption the background, and add that background to the negatives, you can kinda get around that

#

It's best to not have them all the same though, yosh

#

But even that doesn't really fix it sometimes

fallen cloud Feb 5, 2023, 11:08 PM

#

I have some image-sets that seems to stick no matter how much i train the images. Thinking about trying to remove/replace the background manually on those to try to get them to work.

split acorn Feb 5, 2023, 11:08 PM

#

Yep, honest the best way to salvage the dataset

#

Imo

#

I've done all gray backgrounds, and then captioned it, and it worked quite well. Though, when it started to overfit, you could see the gray leaking in the background

fast epoch Feb 5, 2023, 11:11 PM

#

Then what's the best approach? Selecting the person and leaving with a transparent background or to swap the background's color in every single photo?

split acorn Feb 5, 2023, 11:11 PM

#

White is popular

fast epoch Feb 5, 2023, 11:12 PM

#

to make the background white in every photo?

split acorn Feb 5, 2023, 11:12 PM

#

Yep

#

Just make sure to caption it

#

I'm not sure what other people are doing though for transparent backgrounds. From what I've seen/heard, that's the most popular option

fast epoch Feb 5, 2023, 11:13 PM

#

Nah, I don't use tags/captions

split acorn Feb 5, 2023, 11:13 PM

#

I only did gray because my subject had white hair

fast epoch Feb 5, 2023, 11:13 PM

#

The best method without captions

split acorn Feb 5, 2023, 11:14 PM

#

Oh yeah, I don't think that'd work

fast epoch Feb 5, 2023, 11:14 PM

#

It worked

#

:))))

#

with different backgrounds

split acorn Feb 5, 2023, 11:14 PM

#

It's looking for similarities between the images, after all

#

Oooh yeah with different backgrounds yeah, that's fine, that'd work

#

But if they're all the same, I think it'll leak in super easy

#

At least from my experience with that method

fast epoch Feb 5, 2023, 11:16 PM

#

so to add different colors to the background leaving only the person the same

split acorn Feb 5, 2023, 11:16 PM

#

You'll get your person and the backgrounds will be of varying solid colors

#

I think

#

Not sure if they would be solid or a mess of different colors, would be interesting to know alicatPog

fallen cloud Feb 5, 2023, 11:19 PM

#

Im planning to try different methods for that. Blurred background, solid colours and change the background to mixed backgrounds. Future project though. Glad to hear this worked out for you though 😁 👍 makes me hopeful for all of my "useless image-sets"

split acorn Feb 5, 2023, 11:21 PM

#

Wooo

#

Yeah go for it and feel free to share how it worked out alicatPog

fallen cloud Feb 5, 2023, 11:23 PM

#

Absolutely! 😄 👍 Kind of curious of what makes the best result for this issue.

warm ridge Feb 6, 2023, 12:01 AM

#

So hey, I've just started trying to experiment with textual inversion to train some embeddings in automatic1111, and uhhhh... Yeah this kinda thing is what I'm getting. I've left the learning rate on the default 0.005, but watching it go, it's generating utter garbage without any discernable difference all the way from step 50 to 5000. I wouldn't be surprised if it was just not matching the training concept well, but I don't know why it's mangling everything so badly like this or where to start on fixing it. (Not every image is this bad, but all the rest are still super grainy junk.)

split acorn Feb 6, 2023, 12:10 AM

#

I got previews that looked like jumbled messes when I clicked the "preview via txt2img" button and not having a prompt in the txt2img tab, but I'm not sure personally

mellow meteor Feb 6, 2023, 12:52 AM

#

warm ridge So hey, I've just started trying to experiment with textual inversion to train s...

This is my new wallpaper now 🙂

main scaffold Feb 6, 2023, 9:21 AM

#

Couldn't launch python
exit code: 9009

help pls guys

fast epoch Feb 6, 2023, 12:58 PM

#

Did you notice that the last dreambooth update made the extension to produce worse results (trainings)?

vague pulsar Feb 6, 2023, 2:12 PM

#

main scaffold Couldn't launch python exit code: 9009 help pls guys

did you follow the github installation guide? Did you install python3 and did you add it to your PATH environment variable? Are you using anaconda, miniconda or anything similar?

fallen cloud Feb 6, 2023, 2:20 PM

#

fast epoch Did you notice that the last dreambooth update made the extension to produce wor...

Kinda felt so also, but trained such a large model without captions so took for granted it was due to that 🤔

hazy schooner Feb 6, 2023, 8:38 PM

#

fast epoch Did you notice that the last dreambooth update made the extension to produce wor...

I've heard people say it was because of something related to pytorch though not sure

stone garden Feb 7, 2023, 1:27 AM

#

them nips

stone garden Feb 7, 2023, 2:03 AM

#

stone garden them nips

Pretty hot right

abstract plover Feb 7, 2023, 3:03 AM

#

hello people, over the weekend we've released a service to let people train SD 1.5 using Dreambooth as a fast and easy service: https://dreamlook.ai/create-models

winter apex Feb 7, 2023, 3:13 PM

#

abstract plover hello people, over the weekend we've released a service to let people train SD 1...

i would be really interested in LoRAs, are you planning to add it?

abstract plover Feb 7, 2023, 3:19 PM

#

winter apex i would be really interested in LoRAs, are you planning to add it?

Yep that’s definitely something we want to add

fast epoch Feb 7, 2023, 10:03 PM

#

The newest webui is so bad. Made a model with dreambooth, tested it on the newest version of the automatic1111's webui and it generated some bad-decent images. Then I used the same model, with the same prompt on an older version and it generated way better images.

split acorn Feb 7, 2023, 10:37 PM

#

It might be worth it to just use one of the standalones, since then you can avoid all the potential dependency nightmares, especially considering that auto1111 updates so often

finite creek Feb 8, 2023, 9:24 AM

#

Hello, anybody know how to find the commit number of this video? for both Stable diffusion and dreambooth? https://www.youtube.com/watch?v=9Nu5tUl2zQw&t=194s

YouTube

Olivio Sarikas

DreamBooth for Automatic 1111 - Super Easy AI MODEL TRAINING!

DreamBooth for Automatic 1111 is very easy to install with this guide. With DreamBooth for Automatic 1111 you can train yourself or any other subject. Use your own trained Model to create images in your styles or of yourself. The DreamBooth training in for Automatic 1111 takes only around 30-40 minutes with a good GPU.

LINKS From Video ##...

▶ Play video

fast epoch Feb 8, 2023, 12:05 PM

#

Hello
I have a question about what Ben wrote in his notebook
Image
Does it mean that we can train even with 1080p or 2160p images?
The maximum resolution there is 1024
But it also wrotes "or larger"

#

obsidian sand Feb 8, 2023, 5:12 PM

#

fast epoch Hello I have a question about what Ben wrote in his notebook Image Does it mean ...

Not recommended, training will take too much times and probably colab will crash.

split acorn Feb 8, 2023, 6:02 PM

#

We're just not there yet, at least for 1024 x 1024.

For apsect ratio bucketing, you can get 256 x 1024 however! For 512x512 model training alicatPog so if you have a 1:4 image, it would get resized accordingly.

#

I don't recall last ben having aspect ratio bucketing, however

shy cosmos Feb 8, 2023, 6:05 PM

#

#1047197565365538826 is now under the Stable Diffusion category

still adder Feb 9, 2023, 1:30 AM

#

fast epoch Did you notice that the last dreambooth update made the extension to produce wor...

Try using an older version of bitsandbytes, for me that fixed things a bit

stuck parrot Feb 9, 2023, 3:27 AM

#

https://www.reddit.com/r/StableDiffusion/comments/10xjx8l/i_made_a_new_caption_tool_made_especially_for/

r/StableDiffusion - I made a new caption tool. Made especially for ...

0 votes and 0 comments so far on Reddit

oak gust Feb 9, 2023, 7:08 AM

#

can you still train loras using dreambooth?

stuck parrot Feb 9, 2023, 10:37 AM

#

oak gust can you still train loras using dreambooth?

You can use Kohy's scripts to to so. not sure about the a1111 extension

fast epoch Feb 9, 2023, 12:19 PM

#

The dreambooth extension is so bad

#

You can't even compare the extension's results with the dreambooth's script results

#

Same for LoRA

serene flicker Feb 9, 2023, 1:53 PM

#

https://discord.com/channels/1002292111942635562/1073240057038782514 Just finished a new embedding!

keen cosmos Feb 10, 2023, 1:18 PM

#

hi! I trained two textual inverison embeddings, one with my girlfriends face and other with mine. The problem is, when I use both of them in the same prompt, somehow it only transforms the faces of the characters in the face of the first prompt word (i.e. myFace). What I am doing wrong? is there some configuration in training that i missed? or is it just a finetuning problem. THank you in advice!

split acorn Feb 10, 2023, 3:21 PM

#

Typically, if you want multiple faces with multiple embeddings, they would need to be generated seperately

#

so, for example, through inpainting

#

There are some repos that allow for multiple prompts for one generation which could you let you do both (I think Comfy UI could do this) but by default, I don't believe repos like Auto1111 or InvokeAI support it natively

tame lily Feb 10, 2023, 4:38 PM

#

quick question, probably has been answered plenty of times - can I merge two checkpoints but the base is the depth map one from SD?
problem is I believe there is a tensor size difference as the depth map model seems to have one more value compared to other models

fast epoch Feb 10, 2023, 5:06 PM

#

Is normal that the upscaler fixes the bad eyes of my model?

#

I mean when I generate 512x512 images without upscaling, the eyes are pretty bad. When I use the upscaler, the eyes are very good.

dim wharf Feb 10, 2023, 7:19 PM

#

does anyone know why im getting this errorwhen merging with pix2pix

#

if thisis not the right channel im sorry

dapper prism Feb 10, 2023, 11:41 PM

#

Is there a lightweight dataset tool that simply displays the caption and image, and lets you edit the caption? I have a dataset that I need to refine the captions for

brittle lagoon Feb 10, 2023, 11:44 PM

#

what format is the dataset in?

#

you will probably need a ryo solution, but I know how to open most formats in python

dense bridge Feb 11, 2023, 12:20 AM

#

Hey guys. So I'm trying to train a embedding to use on a ckpt that is already heavily stylized . Call it ckpt 1.

When I train using images generated with "ckpt 1"
And then ran on ckpt 1 they seem super oversaturated. Clown faces and high contrast colors.

If I use trained embedding on stock SD 1.5, they perform much greater. But this is reverse of my desired result.

So I think what I need to do is use normal unstylized images to then train on ckpt 1, so then my embedding will not be overstyled when I use it on ckpt 1.

Would love some feedback or insight!

dapper prism Feb 11, 2023, 12:22 AM

#

brittle lagoon what format is the dataset in?

Its just image files and text files, where each text & image pair shares the same filename but obviously use a different extension

#

Basically the standard format

#

I just want a quick and easy way to double check all the captions visually

brittle lagoon Feb 11, 2023, 12:24 AM

#

I haven't found anything ready made

dapper prism Feb 11, 2023, 1:55 AM

#

brittle lagoon I haven't found anything ready made

Looks like this powershell script does the trick

📎 Caption-King.ps1

#

https://github.com/Jukari2003/Caption-King

GitHub

GitHub - Jukari2003/Caption-King: Streamlines AI image training set...

Streamlines AI image training sets for AI tools like Stable Diffusion - GitHub - Jukari2003/Caption-King: Streamlines AI image training sets for AI tools like Stable Diffusion

brittle lagoon Feb 11, 2023, 2:41 AM

#

dapper prism Looks like this powershell script does the trick

Thanks! I've been looking myself

stuck parrot Feb 11, 2023, 3:47 AM

#

dapper prism Looks like this powershell script does the trick

I have a Breadboard fork that is designed for captions as well

#

https://github.com/theovercomer8/breadboard

GitHub

GitHub - theovercomer8/breadboard: SD Captioning VIA Breadboard

SD Captioning VIA Breadboard. Contribute to theovercomer8/breadboard development by creating an account on GitHub.

pliant drift Feb 11, 2023, 3:56 AM

#

Say i want to train Dwarf fortress style SD. This is a game that generates long text descriptions of every crafted item by every creature in the game. Something like a "Jagged twisted metal sword, crafted of the highest quality, menacing with bones and spikes of granite" would be an simplified example. Obviously i would want a huge data set with tons of great tagging to do this training. Photos of metal ore, polished metals, different quality of materials, seperate photos of various metals that are twisted, jagged, smooth, bent, hooked, stones crafted into different shapes, photos of stuff adorned with bones and spiky stuff, leather, straps made of various quality, the list goes on an on. I could write 100s of different goal images for this set i want to develop no doubt.

What i'm wondering is, could I use SD to generate this set, curate the hell out of the results, and expect a healthy model from that? would the minor imperfections of SD like, train in harder and lead to inbred models?

dapper prism Feb 11, 2023, 4:14 AM

#

stuck parrot https://github.com/theovercomer8/breadboard

Thanks for the suggestion! Currently, I find the powershell script's simplicity is really useful. It requires no installs and works on whatever windows device I run it on (its portable). I also don't need to do any autocaptioning right now (your tool seems to be more geared towards that), just manual fixes from stuff that was autocaptioned (and captioned by others).

stuck parrot Feb 11, 2023, 4:16 AM

#

yea, the editing of captions part is in the works. i've just been focusing on making an autocaptioner first

main breach Feb 11, 2023, 2:05 PM

#

I don't know if I'm in the right place, but is there some documentation available regarding Block Weighted merging of diffusion models? Maybe someone documented their experiments and findings? Or are we all still stabbing in the dark seeing what sticks?

digital totem Feb 11, 2023, 7:56 PM

#

I get good results (very similar to my face) in first part of processing, then it's getting different af. Why?

#

we can see the process in webui

#

it happens in every lora model i created

split acorn Feb 11, 2023, 9:59 PM

#

main breach I don't know if I'm in the right place, but is there some documentation availabl...

https://rentry.org/Merge_Block_Weight_-china-_v1_Beta
https://rentry.org/BlockMergeExplained

Merge Block Weight Magic Codex 1.0Beta

Sources:

picture
doc
*Direction should be changed to influence.
Merge Block Weight Magic Codex 1.0Beta

Introduction
Getting Started Tutorial
2.1. Installation
2.2. Feature Introduction
MBW Fusion Introduction
3.1. Style Change
3.2. Improving the overall quality of the model (Composit...

What is Block merging?

Please send me feedback
This guide is still work in progress. Any and all feedback is highly appreciated, it doesn't have to be suggestions, even questions regarding things you didn't understand can help me figuring out what to refine. For the moment I can be found in /sdg/-threads, but I might m...

main breach Feb 11, 2023, 10:02 PM

#

split acorn https://rentry.org/Merge_Block_Weight_-china-_v1_Beta https://rentry.org/BlockMe...

hot darn, thank you!

split acorn Feb 11, 2023, 10:24 PM

#

Yep, no problem alicatLove

livid axle Feb 12, 2023, 8:20 AM

#

When I merge Models sometimes it works fine. And sometimes the resulting images get weird colours. I dont see the weird colours in the preview-images while rendering, but in the end they are there. Can anyone explain how this happens and how to avoid that?

split acorn Feb 12, 2023, 9:02 AM

#

https://github.com/klimaleksus/stable-diffusion-webui-anti-burn

GitHub

GitHub - klimaleksus/stable-diffusion-webui-anti-burn: Extension fo...

Extension for AUTOMATIC1111/stable-diffusion-webui for smoothing generated images by skipping a few very last steps and averaging together some images before them. - GitHub - klimaleksus/stable-dif...

#

This extension helps you avoid that

#

@livid axle

livid axle Feb 12, 2023, 10:09 AM

#

split acorn This extension helps you avoid that

Thank you!

split acorn Feb 12, 2023, 10:12 AM

#

No problem alicatLove

fallen cloud Feb 12, 2023, 11:04 AM

#

hmm, lastben fast-dreambooth are behaving strangely in Colab again 🤔

indigo orbit Feb 12, 2023, 11:36 AM

#

Been failing to train an Asian lady's face well on LoRA. I used 18 images. Should I use more pictures? Most images I used were close-up. Should I provide different poses? The file is only 9 MB btw

hexed bloom Feb 12, 2023, 4:07 PM

#

indigo orbit Been failing to train an Asian lady's face well on LoRA. I used 18 images. Shoul...

Your images should be different from one another, including background, clothing, poses

untold halo Feb 12, 2023, 8:51 PM

#

dapper prism https://github.com/Jukari2003/Caption-King

super duper thanks for that 🙂 I was looking for something better than mass tag editor extension.

fallen cloud Feb 12, 2023, 9:02 PM

#

Anybody has any knowledge about if it is possible to "over explain" a caption when preparing an imageset for training? Or is it "the more information the better" when coming to captions?

dapper prism Feb 12, 2023, 9:04 PM

#

untold halo super duper thanks for that 🙂 I was looking for something better than mass tag ...

yeah, sometimes its nice to just have a simple tool for cleaning up the manual and autotagging outputs

untold halo Feb 12, 2023, 9:05 PM

#

any other small useful tools worth to mention ?

dapper prism Feb 12, 2023, 9:07 PM

#

untold halo any other small useful tools worth to mention ?

having a simple script for converting webp to png would be one

#

Like this one I made recently with ChatGPT: https://gist.github.com/ProGamerGov/c49d872b86fffd37be9f1fd118d89f97

Gist

Convert all '.webp' images in a dataset to '.png'

Convert all '.webp' images in a dataset to '.png'. GitHub Gist: instantly share code, notes, and snippets.

#

Some of the dataset tools don't play nice with webp images, so its handy to convert them to a more well supported format

untold halo Feb 12, 2023, 9:15 PM

#

lucky for me and everybody, xnview (free) allow batch conversion without issues 🙂 but good to know (not everybody wants to instal whole image viewer for that small thing)

#

Found that as well (not useful as I not use tags for image descriptions) BooruDatasetTagManager
https://github.com/starik222/BooruDatasetTagManager

GitHub

GitHub - starik222/BooruDatasetTagManager

Contribute to starik222/BooruDatasetTagManager development by creating an account on GitHub.

main breach Feb 12, 2023, 11:43 PM

#

So a question about training: when I train a lora for SD1.5 on a specific face, and in some photos of that face, the person is wearing lipstick, do I add "wearing [color] lipstick" to the caption if I want to avoid the training paying attention to the lipstick? Do I understand that correctly?

dapper prism Feb 13, 2023, 3:16 PM

#

untold halo any other small useful tools worth to mention ?

Just remembered this spell checker tool that I find really useful for helping fix grammatical and spelling issues with my captions: https://github.com/tbroadley/spellchecker-cli

GitHub

GitHub - tbroadley/spellchecker-cli: A command-line tool for spellc...

A command-line tool for spellchecking files. Contribute to tbroadley/spellchecker-cli development by creating an account on GitHub.

fallen cloud Feb 13, 2023, 3:43 PM

#

Are there any extra good captioners besides BLIP for regular photos? 🤔 BLIP kinda sucks sometimes in so obvious images. (havnt googled even yet, just crossed my mind)

dense bridge Feb 13, 2023, 7:23 PM

#

hey guys im trying to train an embedding for a girl useing the Babes 1.1 ckpt from civitai, i can create the look i want through prompting but i would rather create an embedding so i can just call up "sally" and get a close enough version.

Can i train useing images generated from Babes 1.1? use those images and then do i train useing stock 1.5 cpkt or on the babes 1.1 again?

it would seem that when i train on the babes 1.1 the embeddings are right fucked, hyper contrast over saturated looks.

id really appreciate if anyone with some experience in embedding training could DM me please. thank you!

hazy schooner Feb 14, 2023, 1:01 AM

#

fallen cloud Are there any extra good captioners besides BLIP for regular photos? 🤔 BLIP ki...

Depends on how willing you are to use colab or a python script: this tool has BLIP 2, GIT, Coca, and CLIP

https://github.com/theovercomer8/captionr

autumn obsidian Feb 14, 2023, 6:22 AM

#

Hi, I'm using theLastBen for training 2.1-512 model on a dataset can anyone explain me about the concept training used by him. Also, how is it different from other methods

fallen cloud Feb 14, 2023, 10:34 AM

#

hazy schooner Depends on how willing you are to use colab or a python script: this tool has BL...

Ooh.. well thanks! Ill check that out during the day! 😄 👍

main breach Feb 14, 2023, 1:52 PM

#

Ok I don't get it, one LoRA guide says "use at least 100 repeats and 1 epoch" other guides say "use 5-10 repeats and 10+ epochs" I've seen LoRA trained for 35~42 epochs... seriously what gives? I've tried to Train 10 images * 10 repeats * 10 epochs VS 10 img * 100 rep * 1 epoch, both result in very similar models. the single epoch LoRA might be ever so slightly more accurate... Is there one right answer?

dapper prism Feb 15, 2023, 1:26 AM

#

Anyone else experimenting with 'Diffusion With Offset Noise'? It seems to solve the issue with training on really dark and really bright images, and lets you move the render output average away from the default half way between black and white: https://www.crosslabs.org/blog/diffusion-with-offset-noise

Diffusion With Offset Noise

Fine-tuning against a modified noise, enables Stable Diffusion to generate very dark or light images easily.

cobalt sorrel Feb 15, 2023, 4:35 AM

#

dapper prism Anyone else experimenting with 'Diffusion With Offset Noise'? It seems to solve ...

@stuck parrot

stuck parrot Feb 15, 2023, 4:39 AM

#

yep

#

i have a lora out for 1.5 and 2.1-768 that implements it

narrow tinsel Feb 15, 2023, 1:09 PM

#

Anyone have experience with aspect ratio bucketing? Are there issues I need to look out for?

dapper prism Feb 15, 2023, 10:23 PM

#

How many epochs are people finetuning SD 2.x models for these days?

serene flicker Feb 15, 2023, 10:40 PM

#

https://discord.com/channels/1002292111942635562/1075546510986596432 Made yet another embedding, I think this one came out really well

cobalt sorrel Feb 16, 2023, 1:10 AM

#

Anyone else having problems when training embedding? This error: A tensor with all NaNs was produced in Unet. This could be either because there's not enough precision to represent the picture, or because your video card does not support half type. Try using --no-half commandline argument to fix this.

hot breach Feb 16, 2023, 1:37 AM

#

main breach Ok I don't get it, one LoRA guide says "use at least 100 repeats and 1 epoch" ot...

repeats and epochs are fundamentally the same thing

main breach Feb 16, 2023, 2:09 AM

#

Yea I gathered as much by now, I guess it's more adventageous to use epochs if you're going to save them and are worried about overtraining the model

hot breach Feb 16, 2023, 4:02 AM

#

hopefully whatever you're using is letting you save ckpts along the way regardless of using epochs or repeats, matter of using the right tool and using it properly

hoary stone Feb 16, 2023, 5:21 AM

#

Hi All! Question. If I wanted to fine tune SD2.1 with my face, and a friends face. Can I do this in one finetune, or do I need two different models? What would be the process of labelling?

dense bridge Feb 16, 2023, 5:50 AM

#

hey all, i am looking for some help training an embedding. if anyone has some experience i would love to chat. please DM me. i am trying to replicate/embed a character similar to this:

fallen cloud Feb 16, 2023, 9:09 AM

#

hoary stone Hi All! Question. If I wanted to fine tune SD2.1 with my face, and a friends fac...

You should be able to do that in the same model. Just my renaming the imagesets to different keywords. For exampel "therealmiscanalysis-(1).png.. therealmiscanalysis-(2).png.. etc and you friends to miscanalysisfriend-(1).png.. etc.." i haven't tried training multiple subjects in the same model since i first started though. Heard it sometimes can mix up the data, or that one model gets "weaker" than the other. But worth a try!

cloud basalt Feb 16, 2023, 6:57 PM

#

Is it possible to train LORA with multiple concepts together instead of just one concept?

brittle lagoon Feb 16, 2023, 11:34 PM

#

cloud basalt Is it possible to train LORA with multiple concepts together instead of just one...

yes

narrow tinsel Feb 17, 2023, 3:43 AM

#

Are there any guides for large scale model fine-tuning? Like how to make similar models to what's on huggingface or civitai? I've found tons of guides to textual inversions, dreambooth, lora, etc. But very little for large model fine-tuning. Found the pokemon model guide, and training parameters for waifu. But I'm having trouble figuring out how to design a good data set of ~1000 pictures. How many pictures to have of each body position, head shots, locations, characters, etc. Basically the ratios used in designing the data set.

hot breach Feb 17, 2023, 5:38 AM

#

narrow tinsel Are there any guides for large scale model fine-tuning? Like how to make simila...

https://github.com/victorchall/EveryDream2trainer/blob/main/doc/DATA.md that might be a good start, but I think in general there are no stepwise guides for this type of stuff

#

a few mores hints buried in here perhaps: https://github.com/victorchall/EveryDream2trainer/blob/main/doc/BALANCING.md#do-my-concepts-or-subjects-really-need-to-be-equalized

#

get friendly with tensorboard, start paying attention to what is going on with your training

#

this is a bit old from old version of the trainer, but some more ideas there from training a sorta large set of 1600: https://github.com/victorchall/EveryDream-trainer/blob/main/doc/README-FF7R.MD

#

the link over to huggingface from the above ff7r readme has more info as well

half mortar Feb 17, 2023, 6:38 AM

#

fast epoch The newest webui is so bad. Made a model with dreambooth, tested it on the newes...

This is why I keep reverting back to Dec 31 2022 on my computer. Nothing new is working for Automatic 1111.

narrow tinsel Feb 17, 2023, 7:04 AM

#

hot breach https://github.com/victorchall/EveryDream2trainer/blob/main/doc/DATA.md that mig...

Thanks. I've already read all of those and I review them occasionally to see if they've been updated. Unfortunately you're right, there are no stepwise guides for large scale models. But I'm slowly working my way through it. Right now comparing EveryDream2Trainer Vs WebUI Dreambooth extension on how their different bucketing types affect model training. Maybe I'll write a guide if I ever manage to create a good model.

hot breach Feb 17, 2023, 4:08 PM

#

fundamentally its just training image:label pairs, so most of your effort should be tuning how you caption and tuning hyperparameters

#

ed2 takes care of aspect/size stuff on its own, I'm pretty confident in the code that handles that

#

there's a video on crop jitter on my youtube channel that explains most of that process but its not something you need to lose sleep over as its automated

near juniper Feb 18, 2023, 12:54 AM

#

Hey guys, new here
I was wondering if it was possible for my AI to improve its artstyle when re-creating my character's model (from digital drawings) with LORA but I don't know what parameters to be increasing or adjusting for it to grasp the details. The only thing I have done is change the Learning Rate and Unet Learning Rate by adding an extra 0 after the decimal place per training run and using the latest .safetensor model as the LoRA network weights. (I now have 4 safetensors files for each stage of its learning).

When using txt2img I notice that when it is generating an image, it can look amazing when it's still blurry and then the final image comes out distorted, over saturated or the good shading downgrades and I was wondering if im missing a setting or prompt to fix this?

#

Any links, resources or advice would be very appreciated

random star Feb 18, 2023, 9:18 AM

#

#

does anyone know what color augmentation does for Lora?

random star Feb 18, 2023, 9:49 AM

#

and also, should i use regularization images?

fallen cloud Feb 18, 2023, 2:44 PM

#

Pausing my ctpk tranings for a while and was thinking of trying out som LORA-traning instead. For how many steps would be recommended for a batch of 100 images? Anyone got some hints? Right now i put it on 670 repeats, but when looking around it seems unclear of what would be the best number of epoch that would be ideal 😂

#

Im using Kohya locally this time. Would prefer if the was a good colab-version though so one can keep on the content-creations meanwhile.

#

So hint for good lora-colabs har also very much welcome 😌

main breach Feb 18, 2023, 6:09 PM

#

Some guide I watched said you should have 1500 training steps.... whatever that means. So for 100 images that would be 1 epoch of 15 repeats

#

It's also something I'm trying to figure out atm

fallen cloud Feb 18, 2023, 8:40 PM

#

Went down to 100 repeats though. Now im struggling with Kohya google colab, which wont work. Or well.. ir works, but the samt lora i trained earlier today which tock me 1,5 h now takes 15,8 h 😂

mental frost Feb 18, 2023, 10:26 PM

#

Finally getting started on LORA training
Impressed at the time it takes, but I think I need to work on.... something lol
My first guess would be captions in general.... I think
Not sure if I need to be more or less specific atm though

narrow tinsel Feb 19, 2023, 3:16 AM

#

hot breach ed2 takes care of aspect/size stuff on its own, I'm pretty confident in the code...

Yeah, I've noticed that image:labels pairs are extremely important, possibly the most important aspect of training. There is also the way training handles captions: how many tokens it accepts and shuffling. Now if only I could figure out how to use multiple gpus for training so I can train faster. Thanks for all the help.

hot breach Feb 19, 2023, 3:19 AM

#

any fine tuning stuff is ultimately doing image:label pairs, but "dreambooth" is only using a simpler token/class label for the caption effectively

#

if you use per-image captions you can increase the value of training by providing more information via a longer or varied caption per image

narrow tinsel Feb 19, 2023, 3:20 AM

#

fallen cloud Pausing my ctpk tranings for a while and was thinking of trying out som LORA-tra...

Generally, with a batch size of 1 --> 100 epochs. Your training image set also affects this: If you have images that are very similar, this can cause over training. I just did some tests on a training set of 137 images, and 100-150 epochs seemed best.

hot breach Feb 19, 2023, 3:20 AM

#

a few repos let you do that, I think kohya, and joepenna as well

#

you can label an image not just "cloud strife" but "cloud strife holding his buster sword" or "cloud strife standing in the midgar city slums district"

#

or "close up of cloud strife with a serious look on his face" or "cloud strife, full shot, facing to the side" etc

#

you get more value from the training that just labeling everything "cloud strife man" as traditional "dreambooth" would have you do it

narrow tinsel Feb 19, 2023, 3:22 AM

#

I don't use dreambooth, only fine-tuning. And I manually caption each image using a sentence to describe the image, followed by a series of of tags mentioning details, specific body positions, environment, frame, etc .

hot breach Feb 19, 2023, 3:22 AM

#

this is the way

narrow tinsel Feb 19, 2023, 3:22 AM

#

seems to work well, hoping it will work well with shuffling so I can test everydream trainer more.

hot breach Feb 19, 2023, 3:23 AM

#

I'm not sold on using shuffling unless it is a booru tagged dataset

narrow tinsel Feb 19, 2023, 3:23 AM

#

thanks for all the help

hot breach Feb 19, 2023, 3:23 AM

#

I've been kicking around some better ways to do data augmentation on captions, its a more complex problem

#

there's a yaml driven captioning method, but its more complex and theres no good tool to make the yamls for you

narrow tinsel Feb 19, 2023, 3:23 AM

#

I use the booru style, but not the specific tags. captions have over 75 tokens, so I got to do something to get all tha tinfor in.

hot breach Feb 19, 2023, 3:24 AM

#

ED2 supports a .yaml just like a .txt, but the yaml format is sort of complicated and again no real tool for it

narrow tinsel Feb 19, 2023, 3:24 AM

#

never tried yaml

hot breach Feb 19, 2023, 3:24 AM

#

at some point I will build some sort of parquet/pandas DB-driven caption and meta data format for everything, and have something fancier to drive data augmentation on captions

narrow tinsel Feb 19, 2023, 3:25 AM

#

looking forward to see what you come up with

hot breach Feb 19, 2023, 3:26 AM

#

yeah myself and a few other contributors have been kicking around ideas on what to do here, it would be nice to have like, say, subject, verb, direct object, then preposition phrases [] that can be randomly picked every epoch

#

its a secondary NLP problem, and creating the data is also very labor intensive so it needs to be automated

#

blip and other captioning programs.. sorta help at least, someone has been messing with training BLIP to learn specific character names, too

narrow tinsel Feb 19, 2023, 4:18 AM

#

Once I learn how all this works, it might be worthwhile for me to learn some programming to help develop. I'm using this for my business, so if I can make it more profitable, it'll be worth the investment.

mental frost Feb 19, 2023, 4:19 AM

#

Any general advice for LORA training, particularly for portraits/faces?
From the little reading I've done so far, DreamBooth might be better for faces
Wondering if I can get anything similar with LORA since it takes so much less time IIUC

narrow tinsel Feb 19, 2023, 4:24 AM

#

Where can I learn about the parameters of the stable diffusion model? I've heard: DALL-E 2 has around 3.5 Billion parameters, Imagen has 4.6 Billion, the first Stable Diffusion model has 890 million parameters. And talk about "extended parameter models" and "having to split data set into two models because the resulting models would have too many parameters." But I can't find any specific info on what the parameters are. I'm guessing it refers too stable diffusions CLIP or Imagen's T5 model, but I can't find more than that.

dry totem Feb 19, 2023, 5:12 AM

#

hello, is there a way to colorize a monochrome image? for example, i want to colorize a sepia image. by the way, im using auto1111

brittle lagoon Feb 19, 2023, 5:30 AM

#

Try deoldify. It's a neural net speficifally trained to do that

dry totem Feb 19, 2023, 5:55 AM

#

thanks, will try that!

crimson wasp Feb 19, 2023, 10:13 AM

#

narrow tinsel Are there any guides for large scale model fine-tuning? Like how to make simila...

Waifu(sp?) Diffusion 5 released their source code, and they did a massive overhaul model. Techniques they used seem to include 10% caption dropout, which the SD 1.5 release notes also mentioned as helping somehow, and also randomizing the prompt order and occasionally dropping some parts of it (since they use an image tag list as the prompt) https://github.com/waifu-diffusion/network-trainer

#

They also used varied aspect ratios, rather than a set square resolution

narrow tinsel Feb 19, 2023, 10:28 AM

#

crimson wasp Waifu(sp?) Diffusion 5 released their source code, and they did a massive overha...

Thanks, I'll check it out.

vapid zealot Feb 19, 2023, 2:56 PM

#

/imagine

#

/dream prompt

river cypress Feb 19, 2023, 4:04 PM

#

Anyone have a blank safetensor file

#

Like for the merge checkpoints it's possible to merge loras into models if we have a blank safetensor file right?

cunning vine Feb 19, 2023, 4:29 PM

#

anyone here knows how i can add more layers to the output of SD? i'm trying to fine tune the model to get more channels as output. so like stability has done here: https://huggingface.co/stabilityai/stable-diffusion-2-depth which outputs 4 channels (i'd like to make it output more than that)

stabilityai/stable-diffusion-2-depth · Hugging Face

fallen cloud Feb 19, 2023, 11:28 PM

#

Im training CTPK-models mostly, but are thinking of "boosting" them with a lora on top of the base-model. Has anybody tried that. Im thinking of what will be the best. Use the base-model for the face, and then the lora for body and postures, or the opposite around.

Feel like they collide when the images-sets are a bit to similar. (faces and bodys etc in both)

marsh quartz Feb 20, 2023, 2:56 AM

#

how i can use bot pls ?

dreamy sentinel Feb 20, 2023, 8:32 AM

#

So, I just noticed I've been training loras wrong all the time (or so it seems). I used cosine with restarts as scheduler but never adjusted the number of cycles. Didn't have any luck finding sources talking about the correct way of defining the correct amount either. You guys have any advice?

plucky current Feb 20, 2023, 10:29 AM

#

I have a huge database of images, all treated and 1024x1024 mostly with the same style, pose, concept, what would be the best way to finetune that style? Ive heard that most anime models are already overtrained, would that be a problem? Thanks

narrow tinsel Feb 20, 2023, 12:14 PM

#

cunning vine anyone here knows how i can add more layers to the output of SD? i'm trying to f...

no idea. but if you do find out, let me know. I'll look around too.

worthy orchid Feb 20, 2023, 2:27 PM

#

plucky current I have a huge database of images, all treated and 1024x1024 mostly with the same...

have you tried doing any training? embeddings are a pretty easy place to start. you don't need a huge amount of images though, and it's better to have fewer with better quality and better prompts than just more images

plucky current Feb 20, 2023, 2:40 PM

#

worthy orchid have you tried doing any training? embeddings are a pretty easy place to start. ...

I tried all of them, with different steps also, got some interesting results but never what I was looking for, I see those huge models on Civitai like anything/grape/etc and I wonder how they did it, I've heard that dreambooth gets overkilled by 100+ images so I dont think that is the answer, I also tried lora, and that was probably the best results I got so far, I do want to retrain a improved version, and I would prefer it too be a checkpoint, so that I could freely use other loras, sorry if I didn't explain the situation clearly, and thanks for the reply

worthy orchid Feb 20, 2023, 2:42 PM

#

did you write custom prompts for the embeddings, or did you use auto generated ones

plucky current Feb 20, 2023, 2:45 PM

#

the first training I did was on embeddings and it was a long time ago so I dont remember clearly, but I dont think so, also the last checkpoints I did were with auto danbooru caption and cleaned to remove undesired stuff, embendings work well with over 100+ images?

split acorn Feb 20, 2023, 2:46 PM

#

dreamy sentinel So, I just noticed I've been training loras wrong all the time (or so it seems)....

Basically you adjust your resets if it doesn't train properly and feels like it's getting "stuck". There's no one size fits all answer so it's a lot of just testing and finding out.

split acorn Feb 20, 2023, 2:48 PM

#

plucky current the first training I did was on embeddings and it was a long time ago so I dont ...

Honestly, for getting practice or getting a better idea, I'd recommend starting with a small dataset first and then adjusting until you're getting good results and then start scaling up.

#

You really don't need that many pictures to have pretty good results. And after awhile, doing bigger datasets becomes easier since you'll have a good idea of good datasets vs bad ones and what captions work and what don't.

#

And it's waaaayyy less work to adjust with small datasets

plucky current Feb 20, 2023, 2:55 PM

#

Thanks, any more advice? or it is just test what works until it works?

worthy orchid Feb 20, 2023, 4:32 PM

#

probably a good idea to create like 4 different sets with different caption methods, or different groups of input images and run them all for the same time so you can compare

kind lodge Feb 20, 2023, 5:58 PM

#

Hello, anyone have
advice on training a model for medical illustration? My initial plan is to first train it to mimic my style using DreamBooth, and then train it on anatomical concepts using LORAS.

sweet otter Feb 20, 2023, 7:38 PM

#

hey guys! does anyone have any experience with using BLIP/deepbooru for captioning?

I have a very large dataset (30k+ images) and im not sure if im wasting my time generating captions. deepbooru gives me a shit ton of tags that seem generic (ie, bokeh, out of focus, girl, etc) and BLIP gives me somewhat more accurate tags but still generic (man standing with a light behind him, etc).

Do i need to be using captions? Do they help really with anything?

#

blip seems so bad lmao

#

how the hell did the pokemon dataset use blip???

worthy orchid Feb 20, 2023, 7:47 PM

#

yeah those pokemon descriptions are atrocious

#

i was told smaller more accurate datasets are better than bigger ones, so I'd try doing a set of like 100 with hand-written captions and see if you get better results

#

though I'd be interested to know if doing 30k images with autogenerated captions still works

dapper prism Feb 21, 2023, 2:33 AM

#

In general, which of the 2 options is better for a finetuning a Text to Image model: A dataset of 1000 carefully labeled images (with low quality images manually filtered out) or a million images with auto generated captions and auto generated aesthetic scoring? Basically is quantity better than quality?

copper basalt Feb 21, 2023, 3:27 AM

#

What docs are folks using to learn how to fine tune an inpainting model? These are the only docs I've found so far on the topic: https://github.com/huggingface/diffusers/tree/main/examples/research_projects/dreambooth_inpaint

GitHub

diffusers/examples/research_projects/dreambooth_inpaint at main · h...

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch - diffusers/examples/research_projects/dreambooth_inpaint at main · huggingface/diffusers

worthy orchid Feb 21, 2023, 3:10 PM

#

would 50k blip captioned images work better for style training an embedding over 100 hand captioned ones?
what about a hypernetwork?

sweet otter Feb 21, 2023, 4:59 PM

#

im interested in this answer as well

#

i just finished captioning my 30k dataset

#

ill see what happens

fallen cloud Feb 21, 2023, 5:34 PM

#

Can somebody trow me some numbers for training a dreambooth model.
Approx. 100 images on a person. Try to train for photorealistic.

I feel i have stuck in a loop and need to try some new ways.
And would love some shared knowledge.

Base model,
Unet-steps,
Unet learning rate,
Text-steps,
Text-step learning rate.

Would really appreciate it 😅

serene flicker Feb 21, 2023, 5:55 PM

#

worthy orchid would 50k blip captioned images work better for style training an embedding over...

For a style embedding I would say the hand captioned works better. Blip often gets things very wrong and repeated and doesn't caption things in a way a person would often enter a prompt.

worthy orchid Feb 21, 2023, 6:09 PM

#

i know, the blip captions are garbage, but does it actually make a difference in the results?

plucky current Feb 21, 2023, 9:04 PM

#

what are concept images for? are they usefull to train a particular style?

#

also, is it usefull to ''reflect'' the images used to fine tune a model?

sweet otter Feb 21, 2023, 11:55 PM

#

worthy orchid i know, the blip captions are garbage, but does it actually make a difference in...

lets say i have 5000 steps done in my embedding

#

i have 1000 images and 1000 blip captioned txt files

#

do i need to make a second embedding in order to test the results without captions?

#

and what would i do--simply remove the txt files from the dataset directory?

worthy orchid Feb 21, 2023, 11:59 PM

#

sweet otter and what would i do--simply remove the txt files from the dataset directory?

yeah that should do it i think. though I'd copy the folder first so you have a folder of images for each embedding

sweet otter Feb 22, 2023, 2:07 AM

#

i made a new folder, copied the images, deleted the txt files, and got this error

#

\stable-diffusion-webui\venv\lib\site-packages\torch\cuda\amp\grad_scaler.py", line 336, in step
assert len(optimizer_state["found_inf_per_device"]) > 0, "No inf checks were recorded for this optimizer."
AssertionError: No inf checks were recorded for this optimizer.

#

then went back and re-booted the prior (captioned) embedding training, and it started normally

#

makes me think that maybe it failed because there were no txt files

#

in which case ill need to have txt files but have them blank perhaps

#

heres a sample of the directory:

#

#

heres what it spat out after 5000 steps

#

#

definitely not there yet, the art looks literally like what DallE did 6 months ago

#

going to dinner while this runs another few thousand steps then will try to do a noncaptioned version

stone garden Feb 22, 2023, 11:31 AM

#

good idea, spell icons. How was it captionned for the training you showed here ? all using the same token or... ?

#

lots of possibilities in the captioning approach here

#

you could use the main "wowicon" token, plus the class, the type of spell, the main color, or so many other ways to describe an icon here. Never thought of that one before, good find

obsidian sand Feb 22, 2023, 5:05 PM

#

fallen cloud Can somebody trow me some numbers for training a dreambooth model. Approx. 100 ...

Base: I use my own model but Hassanblend 1.4 is good too.
Unet steps: 65 per image. I use 30 img.
Unet Lr: 1e-5 with lr scheduler polynomial.
Text steps: 350
Text Lr: 1e-6

fallen cloud Feb 22, 2023, 5:07 PM

#

obsidian sand Base: I use my own model but Hassanblend 1.4 is good too. Unet steps: 65 per ima...

Thanks! 😄 will give that a try next model!

obsidian sand Feb 22, 2023, 5:09 PM

#

I did that on LastBen colab, idk if that will have the same results on local/different colab.

fallen cloud Feb 22, 2023, 5:11 PM

#

obsidian sand I did that on LastBen colab, idk if that will have the same results on local/dif...

I usually also go with lastben, coz im used to it. But will try Kohya's dreambooth lateron and see if there's any difference between them 🙂

sweet otter Feb 22, 2023, 7:32 PM

#

stone garden good idea, spell icons. How was it captionned for the training you showed here ?...

heres an example of the blip captions

📎 00076-0-Ability_Ambush-realesrgan-x4plus.txt

📎 04154-0-INV_Axe_17-realesrgan-x4plus.txt

04968-0-INV_Helm_Robe_RaidMage_I_01-realesrgan-x4plus.png

📎 04968-0-INV_Helm_Robe_RaidMage_I_01-realesrgan-x4plus.txt

📎 09189-0-INV_Misc_Gift_02-realesrgan-x4plus.txt

📎 09269-0-INV_Misc_Herb_19-realesrgan-x4plus.txt

sweet otter Feb 22, 2023, 7:32 PM

#

stone garden you could use the main "wowicon" token, plus the class, the type of spell, the m...

the danbooru captions are horrendous

stone garden Feb 22, 2023, 7:33 PM

#

yeah those have no consitency

#

you don't teach any useful tokens there 😢

sweet otter Feb 22, 2023, 7:34 PM

#

00134-0-Ability_BossFelOrcs_Necromancer_Purple-realesrgan-x4plus.png

📎 00134-0-Ability_BossFelOrcs_Necromancer_Purple-realesrgan-x4plus.txt

📎 15841-0-INV_MISC_Ring_mop12-realesrgan-x4plus.txt

23002-0-Spell_Shadow_ShadeTrueSight-realesrgan-x4plus.png

📎 23002-0-Spell_Shadow_ShadeTrueSight-realesrgan-x4plus.txt

23013-0-Spell_Shadow_ShadowWordDominate-realesrgan-x4plus.png

📎 23013-0-Spell_Shadow_ShadowWordDominate-realesrgan-x4plus.txt

📎 23015-0-Spell_Shadow_SiphonMana-realesrgan-x4plus.txt

stone garden Feb 22, 2023, 7:35 PM

#

would it have been manual caption, I would have gone with something like one of those templates ;

Mage WoWicon ice : Iceball
Hunter WowIcon beast : Recall
... (I haven't played wow in a while)
this way, it would make a model able to spit out think on that same format easily

sweet otter Feb 22, 2023, 7:35 PM

#

ill try it

stone garden Feb 22, 2023, 7:36 PM

#

you don't need all the icons to try

sweet otter Feb 22, 2023, 7:36 PM

#

are there any other tokens you think it should know?

stone garden Feb 22, 2023, 7:36 PM

#

do a test on 50, it should start to lend results, given this is a style

#

I don't this so, I would stick to "wowicon' as main token, and specify the class and specialisation (mage, ice) on each, as secondary keywords. the last part, the real spell name, is more here for regularisation : by having lots of small tokens used only once, you make it so that overtraining will take longer before happening, letting you more room in terms of trainng steps

#

not sure how many class there is in WoW now, but I would use a total of 100 icons, evenly spread on the classes, and split each class "budget" between each of the 3 specialization, taking the most interesting icons

#

if that makes sense

sweet otter Feb 22, 2023, 7:41 PM

#

can you give me an example for 1 image?
wowicon, warlock, drain life, shadow, green

#

like that?

marsh hedge Feb 23, 2023, 4:19 AM

#

👋 hi friends!

Sorry if this is too self-promote-y, but I figured folks in this channel might appreciate this blog post I wrote today, where I walk through using LoRA fine-tuning with Stable Diffusion on replicate.com

https://www.shruggingface.com/blog/self-portraits-with-stable-diffusion-and-lora

Making Self Portraits With Stable Diffusion and LoRA

In this post, we walk through making self portraits with Stable Diffusion and LoRA

indigo orbit Feb 23, 2023, 12:56 PM

#

Hi. I'm trying to train a LoRA for proper figure skates. What kind of training photos do I need? Close-ups of one of them not worn, close-ups of one them worn, close-ups of both of them worn, or wide angle of them worn?

marsh hedge Feb 23, 2023, 2:07 PM

#

indigo orbit Hi. I'm trying to train a LoRA for proper figure skates. What kind of training p...

Yeah, If you tried all of the options you mentioned, I believe you would get some decent results!

hollow niche Feb 23, 2023, 2:14 PM

#

Hi everyone, hope this is the right place to ask: I am about to train/tune for the first time(I have some cool 3d models I can render out). Could anyone here point me to good resources to pick a model(Dreambooth/LoRA/Textual Inversion) and maybe a step by step? That'd be amazing. Thanks!
(I am using Auto1111 on RunDiffusion, btw)

worthy orchid Feb 23, 2023, 2:36 PM

#

hollow niche Hi everyone, hope this is the right place to ask: I am about to train/tune for t...

Embeddings are easiest so start there. This tutorial helped me: https://www.reddit.com/r/promptcraft/comments/zyc5eh/stable_diffusion_detailed_tutorial_on_embeddings/

r/promptcraft - [Stable Diffusion] Detailed tutorial on embeddings

10 votes and 1 comment so far on Reddit

#

Start with a small group of images with accurate prompts. You can do the blip thing to get your started, but you'll want to go in and fix them.

hollow niche Feb 23, 2023, 3:43 PM

#

Awesome! Thank you, that's super useful materials! 🙏

jaunty surge Feb 24, 2023, 5:33 PM

#

worthy orchid Embeddings are easiest so start there. This tutorial helped me: https://www.redd...

I was wondering do I want to ask, or I need to search carefully, and I found your links! Thanks!

#

Btw if there a way to understand - do I need to train LoRa for stylistic or it's just an Text inversion, how I decide? I know it's matter of many trials, but ...

worthy orchid Feb 24, 2023, 5:35 PM

#

embeddings can learn styles

#

loras are better and faster, but harder to setup

jaunty surge Feb 24, 2023, 5:40 PM

#

YASS, thank you so much!

arctic jasper Feb 24, 2023, 6:49 PM

#

is there a good guide anywhere for training a style in dreambooth that's up to date? I tried a training last night and i just get errors when i attempt to use it. Just having no luck at all with training whatsoever on dreambooth, lora, or textual inversion, so i'm definitely doing something wrong.

File "C:\AI\stable-diffusion-webui\modules\devices.py", line 152, in test_for_nans
raise NansException(message)
modules.devices.NansException: A tensor with all NaNs was produced in Unet. Use --disable-nan-check commandline argument to disable this check.

turbid karma Feb 24, 2023, 6:50 PM

#

Cat and dog cartoon

#

Give me pictures about dogs

serene flicker Feb 24, 2023, 10:08 PM

#

My first successful attempt of a v2 of an embedding, out now! I think it came out great! https://civitai.com/models/11642/digital-diffusion-21

Digital Diffusion - 2.1! | Stable Diffusion TextualInversion | Civitai

Create amazing art in a "digital art" style with this 2.1 embedding! v1 or v2? You may notice that there are two versions available. Which should you use? While v1 can respond better to more complex prompts, v2 works with simpler prompts and just adds detail and color to them. v2 had a larger dataset than v1 so it is more diverse as well. I ...

forest yew Feb 25, 2023, 12:57 AM

#

I've had my first attempt at doing a TI training today. I'm running it on a 12GB 2060 and could only set the batch size to 2, but I've read elsewhere that others are getting much larger batch sizes on their 12GB GPUs. Any pointers to what I might have different which is causing me to run out of VRAM for the training?

serene flicker Feb 25, 2023, 2:09 AM

#

forest yew I've had my first attempt at doing a TI training today. I'm running it on a 12GB...

I am able to do batch sizes of around 5 fairly quickly on an 8gb 3070. Make sure you have "use cross attention optimizations while training" checked! Though I think this might still be broken on newer versions of the webui, I have been training on one from early january with older versions of xformers and cuda instaleed since that is what breaks it. By broken, I mean it just doesn't train anything into the file.

forest yew Feb 25, 2023, 2:59 AM

#

Ah yes, I don't have that checked because I saw a lot of reports about it being broken. My first attempt finally finished (about 7 hours on 16 images) and I feel like I'm already pretty close to working workflow but also like I have done something fundamentally wrong. Doing a Prompt XY, some epochs generate extremely odd results, like it will change from a very good replication of the face to very strange things, like the image attached or what looks like patchwork dolls, before becoming accurate again.

worthy orchid Feb 25, 2023, 2:11 PM

#

would taking my least favorite outputs from my embedding, then training a second embedding on them, then putting that into the negative prompt, help me get better results?

stone garden Feb 25, 2023, 2:13 PM

#

worthy orchid would taking my least favorite outputs from my embedding, then training a second...

that's a convoluted way of thinking but... maybe ? it could, but it could also have big problems. Like if you liked any part of your failed output, and that a similar part was in another failed output ? it could be learned, and then when used in negative, it would try to repulse something you like .
Also, there will be lots of weights opposing themselves between the two embeds, I worry that the negative would cancel a lot of what the positive brings

worthy orchid Feb 25, 2023, 2:14 PM

#

yeah i could see how that would be a problem, especially for something as simple as embeddings

stone garden Feb 25, 2023, 2:14 PM

#

even like the general style

#

if it's a cartoon

#

both embed will for sure learn that

#

even the fails will be cartoons I mean

worthy orchid Feb 25, 2023, 2:15 PM

#

could i use the first embedding in the prompt templates? doesn't that help negate info you dont want to be trained on?

stone garden Feb 25, 2023, 2:16 PM

#

hum... can you though ? I mean, will the embeds be triggered during training on the caption ?

#

I'm not sure those activate during training at all, but I could be wrong

worthy orchid Feb 25, 2023, 2:17 PM

#

maybe i can do some tests to figure it out

stone garden Feb 25, 2023, 2:25 PM

#

if you do, please hit me up with the results, I'm always interested in things like that

pure spear Feb 25, 2023, 3:30 PM

#

I’m trying to train an art style on dreambooth. It’s abstract silhouettes of things and the details play a huge part. It seems when training on 512 images I lose some of that detail. Is there a way to train on a higher resolution? I’ve heard it’s useless because it automatically resizes images anyway. Is that true?

indigo orbit Feb 26, 2023, 8:50 AM

#

So I’ve been training a LoRA without much success. I used 20 images. Been wondering if I should load up my training folder with a lot more (50-100) images that isn’t as good as the original images. Will more images dilute the effect from the original pool of images, or will it be constructive?

last oyster Feb 26, 2023, 9:00 AM

#

indigo orbit So I’ve been training a LoRA without much success. I used 20 images. Been wonder...

how many network rank u using, what is the size of the lora file u produced

indigo orbit Feb 26, 2023, 10:32 AM

#

last oyster how many network rank u using, what is the size of the lora file u produced

What's a network rank? It's 9 MB in size

last oyster Feb 26, 2023, 10:49 AM

#

indigo orbit What's a network rank? It's 9 MB in size

U probably use the default network rank which produce 9 mb file，try train it with 128 network rank and network alpha, I comes a long way to realize this, too.

#

On the training parameter, finds network rank and network alpha, set it to 128

indigo orbit Feb 26, 2023, 10:56 AM

#

Oh! Forgot to mention that I used Kohya ss to train my lora, so...

last oyster Feb 26, 2023, 10:56 AM

#

Me too using kohya ss

indigo orbit Feb 26, 2023, 11:27 AM

#

last oyster Me too using kohya ss

Why are all the tutorials telling me to use the main image directory which contains the folders of my training images for different loras, rather than the specific folder of the training images for my intended lora?

#

And it refuses to train if I choose my specific folder for my intended lora. Huh...

last oyster Feb 26, 2023, 12:19 PM

#

indigo orbit Why are all the tutorials telling me to use the main image directory which conta...

I think kohya ss like to expect there is only one folder on your img

#

Like 100_name

#

I haven't try to train multiple folder tho

indigo orbit Feb 26, 2023, 12:41 PM

#

last oyster I think kohya ss like to expect there is only one folder on your img

Yup that's what I came to suspect as well. Thanks for confirming

indigo orbit Feb 26, 2023, 12:57 PM

#

Does this page look right for training white figure skates? I don't even know if the parameters on this page is used for the training lol

#

How many repeats should I pick? I have 28 images

#

What should my destination training directory be?

last oyster Feb 26, 2023, 1:09 PM

#

indigo orbit Does this page look right for training white figure skates? I don't even know if...

I did not use the tools tab

#

U define the output on the folders tab

#

I did not use the tools tab at all 🤣

#

https://youtu.be/9MT1n97ITaE

YouTube

Olivio Sarikas

LORA: Install Guide and Super-High Quality Training - with Communit...

The Easy Starter Guide to Installing LORA on Automatic 1111 for Stable Diffusion. Follow my super easy Lora setup guide and learn how to train your Lora files for super-high quality portraits. Use Realistic Vision V1.3 as the base model for extremely detailed and realistic results. Get better portraits with Lora, the super fast training tool tha...

▶ Play video

#

I use this tutorial

indigo orbit Feb 26, 2023, 1:20 PM

#

Yeah that's what I used. It's awesome, but it's not enough for my use case

unique cloak Feb 26, 2023, 1:28 PM

#

those seem like al right parameters to me.
Repeats don't depend on the image count : they are multipled by the image count. It's how many time each picture is trained on. Using the default or recommanded values on that seems the best. I haven't trained LORA but I did a lot of dreambooth, so the measures on this aren't the same. I would train on 100 to 200 repeats usually on subjects training like here.
about the destination folder, any empty folder on your disk will do. it's temporary data for the training
But the main difficulty that leads to good or bad quality results is usually the dataset. It's easy to not see some repetitions, some lower quality photos, to remove all texts, ... Numerous error and biases can happen, but to know what to change, the main way is to try to understand what problem your previously trained model had, and fix the dataset accordingly. That can be adding pictures for poses you want but didn't have, a close up or two to help on fine details learning, or removing pictures that repeat something that got trained by error the last time.
Or it could also be under/over training.

indigo orbit Feb 26, 2023, 1:39 PM

#

So if I have 100 training images, how many repeats should I have?

unique cloak Feb 26, 2023, 1:42 PM

#

like I said, it doesn't matter in that way. You'll have the same number of repeats.
100 repeats on 50 pics = 5000 steps
100 repeats on 100 pics = 10000 steps
it multiplies.
Repeats are "how much" you need to train the model on the new concept. It mostly depends on if your concept is easy or not to get for the AI, from your dataset.
Last important parameter that isn't there is Learning Rate, it's "how fast" the model trains, it's how much each step is allowed to train the model at once. You don't need to change it here, I just wanted to be more complete

indigo orbit Feb 26, 2023, 1:59 PM

#

Network Alpha = 1 ok?

unique cloak Feb 26, 2023, 2:00 PM

#

those I don't know, there aren't any networks to set in dreambooth

indigo orbit Feb 26, 2023, 2:08 PM

#

Is the logging folder only for debugging?

unique cloak Feb 26, 2023, 2:22 PM

#

and outputs sometimes. it depends on the tool. mine puts everything in it, models, image, tensorflow, ...

#

since I never use Lora, I can't tell

deep sentinel Feb 27, 2023, 8:15 AM

#

Does anyone have an idea for converting the custom trained text2img model to inpainting model , rather than Automatic 111 Ui, any script to do the conversion

unique cloak Feb 27, 2023, 10:46 AM

#

deep sentinel Does anyone have an idea for converting the custom trained text2img model to inp...

well, I don't think there is anything about this because :

"txt2img models" can also do inpainting. badly but they can, we had only that for quite some time
inpainting models are trained in a different way, they don't learn the same weights, and don't look for, or retain the same information, so there is no conversion possible from what I got from the dreambooth trainings I used and read

#

so you would need to retrain on the same dataset. Also possibly needing to use an inpainting model as base

#

(they have a different inner structure/yaml than classic models)

deep sentinel Feb 27, 2023, 11:08 AM

#

I have seen a script which can do the work, https://github.com/huggingface/diffusers/issues/1619
Hope you'll look but i still get an error while doing so

GitHub

Adding additional input channels to model after intialization / Con...

Have scoured the docs for an answer to this, to no avail. Is it possible to add additional input channels to a model after initializing it using .from_pretrained. For example (taken from your Dream...

#

The error which I'm getting is Image and Mask must have the same batch size,

I trained my standard dreambooth text2img with a batch size of 4 and I'm thinking that this might be an issue to do so.

Can you @unique cloak look into it

unique cloak Feb 27, 2023, 11:11 AM

#

deep sentinel I have seen a script which can do the work, https://github.com/huggingface/diffu...

thanks a lot, I had no knowledge of this !

unique cloak Feb 27, 2023, 11:11 AM

#

deep sentinel The error which I'm getting is Image and Mask must have the same batch size, I ...

right now, sad Guizmus has lots of things to do because he is sad following a hack. so not for a little while, I don't even have my training tools ready for now. sorry

deep sentinel Feb 27, 2023, 11:13 AM

#

Yeah ok no problem, if anyone in the server can solve the issue it's happy to look at it

indigo orbit Feb 27, 2023, 12:59 PM

#

Am i really training on 1 epoch by default?

tribal frigate Feb 27, 2023, 5:48 PM

#

What does it take to train a flexible model? Like if i wanted it to be able to respond to any prompt? Would i need all the possible subjects covered in the data set or can it extrapolate once it's seen enough variety?

And what kind of dataset are we talking about for a model with reliable results. Hundreds, thousands, millions of pictures?

ripe sentinel Feb 28, 2023, 3:35 AM

#

Does any one have a good tutorial about making embedding? I want to learn a bit more about setting number of images step and idk I feel kinda lost

indigo orbit Feb 28, 2023, 7:07 AM

#

I read from a YouTube comment that on LoRA training, if I increase the batch size, I should also adjust the learning rate. Can somebody confirm this, and how do I adjust it? Proportionally, or inverse proportionally?

fallow pier Feb 28, 2023, 7:07 AM

#

is this typical for starting DB training? I'm training 30 images, probably more than I needed but I will let it run if it looks good so far

rn_image_picker_lib_temp_cfaf029e-7056-4d5e-a475-f655afa6aab3.jpg

unique cloak Feb 28, 2023, 8:54 AM

#

fallow pier is this typical for starting DB training? I'm training 30 images, probably more...

the training doesn't seem to have started yet in that capture. it's preparing the class images, and should take some time since you went with 9k pictures. This is a step you won't have to do twice anyway, those are class pictures you can use in other trainings too.

#

so all good for now

indigo orbit Feb 28, 2023, 2:38 PM

#

Using kohya to train a lora, I have managed to train a face of a person and she looks 70% accurate. I've been wondering where my stopping point should be - it should be at a point right before it's considered 'overtrained', right? If so, then what are some definite indicators that my subject is being overtrained? Would the subjects have deformities of the same kind as when their LoRA strength there is too high?

unique cloak Feb 28, 2023, 2:52 PM

#

this is not for realistic style, but Nitrosocke made a guide that had a comparison as answer to this :

#

https://github.com/nitrosocke/dreambooth-training-guide

river cypress Mar 1, 2023, 4:27 AM

#

Has anyone tried to merge loras

stone garden Mar 2, 2023, 8:32 PM

#

from which check point would you finetune the model? 14 or 15? I think i could have used a few epochs more. There are 2 lora files used trained by me. One for the trench coat and one for the comic art style of Joëlle Jones. It's about the art style, the trench coat i'll fine tune another time.

#

okey, this might be stupid to use a 2nd lora file to check out which check point I should use. And i'll go for another training run with double the amount of repeats.

#

okey maybe I should use a second Lora to have some more highres resources instead of just the 512 base model which produces crap images with a slightly altered prompt

stone garden Mar 2, 2023, 11:22 PM

#

okey, can go even a step further i guess... lets double the repeats and also the epochs this time

forest yew Mar 3, 2023, 1:13 AM

#

serene flicker I am able to do batch sizes of around 5 fairly quickly on an 8gb 3070. Make sure...

I did a second install of A1111's webui and set it up to use xformers and Cross attention optimisation and it's happily running along at batch size 16 with VRAM usage switching between 7.1GB and 10.2GB. I didn't need to do any additional steps to get xformers to work.

serene flicker Mar 3, 2023, 1:26 AM

#

forest yew I did a second install of A1111's webui and set it up to use xformers and Cross ...

Interesting, I guess it works now. I saw that it was working with the 0.17 dev build I think, I guess that was finally moved into a1111.

#

Thanks for letting me know, I have been missing some of the new features of later versions

arctic jasper Mar 3, 2023, 7:48 AM

#

anyone know how to make the small lora files from dreambooth? Its making these massive 4GB files, i thought they were supposed to be like 100MB or so

stone garden Mar 3, 2023, 8:24 AM

#

Adjust the network alpha and the other one to a lower value. Keep under 256 for dimm.

#

It's wise to keep them the same value. At the moment I use 255 for both.

#

And for better results in the lower end of the noise spectrum, I set the offset noise to 0.1

sweet otter Mar 3, 2023, 9:17 AM

#

hey guys

#

when im training a model/lora, i notice its been saving extra checkpoints as its been moving along

#

now i want to run the model more iterations, but its finished

#

how do i "continue" training?

#

add more steps?

#

add more images to the directory?

sweet otter Mar 3, 2023, 10:27 AM

#

stone garden from which check point would you finetune the model? 14 or 15? I think i could ...

@stone garden what is this cross-section/how are you getting this in training?

stone garden Mar 3, 2023, 10:29 AM

#

I guess you use automatic1111? Not been using automatic1111 now for a while, but there's a save checkpoint every certain steps or something. If you have 1200 total steps and 200 at save after certain steps the you get 6 checkpoints in total. Place them from your trading folder into your models/Lora or models/stable diffusion folder. If you want to continue select the checkpoint you want instead of the base model and raise the total amount of steps.

stone garden Mar 3, 2023, 11:46 AM

#

sweet otter <@456226577798135808> what is this cross-section/how are you getting this in tra...

Select x/y/z plot at scripts choose prompt s/r at x and y.
So you added a Lora file like this lora:whayevername-00001:0.6 at x you type 00001,00002,00003,etc...

#

At y you type 0.6,0.7,0.8,etc..

#

The first value is the string/integer you want to replace in the prompt followed by the values you want to replace it with, separated with a comma

tiny wolf Mar 3, 2023, 2:26 PM

#

I'm training a lora model right now, but I see that you have to use a regular model along with it when you generate images? Or could I load the lora safetensor both as the main model and in the extension?

#

Or would I be better off just using dreambooth if I want to maintain the style of the lora model?

unique cloak Mar 3, 2023, 2:41 PM

#

tiny wolf I'm training a lora model right now, but I see that you have to use a regular mo...

LORA models go "on top" of a model, and kind of merge with it during use, they apply their weights change inside it. It's why you need a base model.
You can merge for true a LORA inside a model, an make a ckpt where the LORA is inside, close to as if you had trained it without using LORA, as if directly in dreambooth

#

(I'm checking what I said on merging, I have a doubt)

tiny wolf Mar 3, 2023, 2:43 PM

#

I've never done training before so not sure what that meant lol

unique cloak Mar 3, 2023, 2:43 PM

#

https://github.com/cloneofsimo/lora/#merging-full-model-with-lora

#

yep, it's possible

#

ok I'll rephrase

tiny wolf Mar 3, 2023, 2:43 PM

#

So how would I do that in the webui?

#

I don't need a good GPU if I were using dreambooth right

unique cloak Mar 3, 2023, 2:44 PM

#

dreambooth is quite high on VRAM yeah, higher than LORA

#

need a better GPU for dreambooth than LORA usually

#

dreambooth is training a model. Input is a ckpt, output is a ckpt trained with what you wanted. When using that new ckpt in AUTOMATIC, it knows the new stuff and you can prompt on it.
LORA is kind of the same, but instead of being a big 2GB file output, it's a lot smaller file to share. It's a little lower quality than dreambooth usually.
When I was talking about merging, I meant, LORA training gives you a LORA file, that you can already use in your automatic. But you could also take that LORA file, a ckpt, and merge them into a single ckpt. This would make it close to if you had trained on dreambooth

tiny wolf Mar 3, 2023, 2:49 PM

#

But if merged a lora with an existing ckpt I'd be getting "style" from that merged ckpt

#

What I really want is the style of the lora

#

Meaning I don't want any other influences

#

Does this really mean dreambooth would be better in this case?

unique cloak Mar 3, 2023, 2:53 PM

#

that means that you want to merge LORA with the model it was trained on. During training, LORA starts from a base model too, and those are its "default" weights

#

in almost any case where you have the sufficient hardware and time, dreambooth feels better to me yes

tiny wolf Mar 3, 2023, 2:55 PM

#

unique cloak that means that you want to merge LORA with the model it was trained on. During ...

So would I just create two safetensor files and merge them somehow?

#

I really don't have the GPU power to do dreambooth 😅

#

Only like 6 GB

unique cloak Mar 3, 2023, 2:58 PM

#

1/ train LORA with your pictures and any model as base model
2/ get a LORA file back from that training
3/ use that LORA in your automatic, no need to merge, it already works
or
3/ merge that LORA with the model you used in 1, and yes, get a new model, almost equivalent to as if you had just done a dreambooth on 1

#

(lots of those chans 😉 )

manic patio Mar 3, 2023, 3:03 PM

#

Oh yeah, just seeing the few posts that are here lets me know I'm in the right space. Thanks for the recommendations and the tips.

unique cloak Mar 3, 2023, 3:04 PM

#

no problemo, I love this stuff

manic patio Mar 3, 2023, 3:05 PM

#

Here's the full output from my first attempt at training. I'm using 10 images with 10 captions (imagename.png and imagename.txt) and it was set to 120 epochs, batch size 10, fp16, gradient accumulation steps = 1

#

#

The results from testing the prompt aren't bad, but there's some anatomy that's a bit off and I think it could be much better. I'm trying to train a specific facial expression

unique cloak Mar 3, 2023, 3:06 PM

#

ok so that's about 120 repeats

#

so yeah my thoughts, but keep in mind I mainly do dreambooth, not LORA

#

it's going down at a normal speed, so it could still be trained without problem to me

#

if you have some specifics that are not good, like face, maybe the dataset doesn't have clear shots, easily understandable by the AI, to train on

#

looking for those details specificaly in the dataset, and adding/changing a pic for a close shot of it can help on that side

#

but first of all I would add more steps

#

go up to 150 repeats

manic patio Mar 3, 2023, 3:08 PM

#

unique cloak so yeah my thoughts, but keep in mind I mainly do dreambooth, not LORA

Yeah, that's a great idea, IMO. The parts that are messed up aren't entirely present across the dataset

#

oops, I replied to the wrong sentence.

#

about the dreambooth vs. Lora statement.. I'm using this notebook:

#

I'm not sure what the differences are in just using the dreambooth extension on A1111 vs. this type of notebook approach

unique cloak Mar 3, 2023, 3:10 PM

#

manic patio about the dreambooth vs. Lora statement.. I'm using this notebook:

that statement was mostly to say, the repeats/loss value I'm used to expect may not be the same and I could also say some wrong things in there

manic patio Mar 3, 2023, 3:10 PM

#

ah okay

unique cloak Mar 3, 2023, 3:10 PM

#

LORA is trying to mimic how dreambooth works

#

but it changes one thing major

#

it saves the differences it's making in the model, and makes a file with all those differences (small file) where dreambooth makes a new model with all the changes (larger file)

#

the reality of it is that there is still a loss of quality, it's why you can add more "layers" in your lora, to keep track of more changes that would happen in the model and have a higher quality

#

so LORA calls itself dreambooth too, because it's effectively what it mimics

#

the dreambooth extension does dreambooth the classic way, making a ckpt

#

there is also a LORA extension to do it in automatic too

manic patio Mar 3, 2023, 3:14 PM

#

unique cloak there is also a LORA extension to do it in automatic too

I didn't know this.. I'll have to try that with the next dataset I put together. ty

#

You're a wealth of knowledge and I appreciate you taking the time to reply to these questions. I'll make a few adjustments in the direction we've discussed here and post results a bit later. 🙏 thank_you

#

Actually one last question before I start the notebook and play the waiting game.. if you don't mind...

#

In testing the previous training attempt, I had 1 of the 4 test images for a prompt come out 99% perfect and the other 3 were quite poor.

#

What do you make of that situation when it occurs?

#

Is this another sign of under-trained but trending in the right direction?

unique cloak Mar 3, 2023, 3:33 PM

#

manic patio What do you make of that situation when it occurs?

this always occurs, and is a way for you to check for bleeding. in my tool I have 3 of each
Bleeding is when your concept starts to appear in other things, unprompted.
The good one is a picture made using one of your captions
The bad ones are the same but with CFGS to 0, meaning it will ignore the prompt

#

if you start to see things in there that come close to what you are training, this means you are bleeding all over

#

it's kind of another way to "overtrain"

#

this means you need more/better class pictures (can also be called regularisation pictures, depending on the tool)

manic patio Mar 3, 2023, 3:37 PM

#

unique cloak this means you need more/better class pictures (can also be called regularisatio...

They're called regularization images in this notebook and I assumed them to mean "images that contain similar concepts to what you are training but are not specifically the concept you're training" if that makes sense. Did I have that right?

unique cloak Mar 3, 2023, 3:38 PM

#

One thing I should add, regarding the loss value : it's not always very relevant... Depending on the concept, the loss can just mean nothing at all even. it's based on a flawed function, you can't really rate if the model is close or not as simple as that, it has a hard time evaluating what you want it to really train

unique cloak Mar 3, 2023, 3:38 PM

#

manic patio They're called regularization images in this notebook and I assumed them to mean...

yes, it's usually that. Like training Steeve Jobs as main concept (instance data) and "a man" as class concept (regularisation data)

#

depending what you do, it can be very generic

#

it's a way for the model to not forget what it knew before

#

to "keep it grounded"

manic patio Mar 3, 2023, 3:39 PM

#

cool, I can likely improve on what I provided it then

unique cloak Mar 3, 2023, 3:39 PM

#

yes, what you provide it will be trained on too in the end

#

by doing targeted training on your topic and generic training on regularisation, the model can keep more things in for longer. training is always a learn-forget relationship, end models still has the same size

tiny wolf Mar 3, 2023, 4:31 PM

#

unique cloak 1/ train LORA with your pictures and any model as base model 2/ get a LORA file ...

So either one works, but merging that base model would mean introducing things from it right?

spare marsh Mar 3, 2023, 5:31 PM

#

I'm currently using A1111 on my home PC with a 10GB RTX 3080 card. I am interested in fine tuning and training to be able to more readily reproduce certain character types or models. For example, training it to be able to generate images of D&D races like Dragonborn more effectively. I don't want to pay for a cloud computer so I know options are limited in that regard but things with SD and AI in general are progressing so quickly that it's easy to lose track of what options are available and are best recommended. Is Textual Inversion the best way to go here still? And if so, is there a guide for getting the best results? As an example, should I be looking to create an image set with close up of the face, full body, side view, back view, etc.. to get the most coverage and cover as many bases for various prompts as possible? What is a good target number of images? Should I be using things like file naming to help guide the AI as to what a prompt for the provided image might be to help it train better and identify the subject of the image better (as opposed to background elements or the like)? Any other information on the various settings and how they might impact the resulting embedding?

sweet otter Mar 3, 2023, 7:25 PM

#

im a bit new to training--ive trained a LoRA using dreambooth and it finished. iwant to continue trianing. what do I do?

#

do I re-use that same model and just add more steps/epochs?

sweet otter Mar 3, 2023, 8:06 PM

#

manic patio

what is the file path to find the full loss chart over the entire trainig session?

#

i can only see the loss-per-epoch currently. in the model>dreambooth>[model name]>logging folder

manic patio Mar 3, 2023, 8:09 PM

#

sweet otter i can only see the loss-per-epoch currently. in the model>dreambooth>[model name...

I'm not sure about dreambooth specifically. I was doing the training in a Lora/Dreambooth notebook. It had a log file that I could use inside of the notebook directly into Tensorsensors or whatever the name of that package is

#

https://colab.research.google.com/github/Linaqruf/kohya-trainer/blob/main/kohya-LoRA-dreambooth.ipynb

Google Colaboratory

#

this is the notebook

forest yew Mar 3, 2023, 8:54 PM

#

serene flicker Interesting, I guess it works now. I saw that it was working with the 0.17 dev b...

Oddly enough, it's actually the broken 0.0.16rc425 that is installed, but it seems to work. Perhaps there is another variable at play and it is a combination of factors which cause it not to work?

woeful kettle Mar 3, 2023, 9:30 PM

#

I'm not sure if this is the right channel, but I'm right at the very beginning of trying to figure out how to generate images of a homebrew fantasy race. I have a few pieces of art with them depicted, and I have text descriptions, but I'm not how to use those or whether they will be enough?

#

like if the model had never heard of warcraft orcs before, how would I tell it?

manic patio Mar 3, 2023, 9:56 PM

#

woeful kettle like if the model had never heard of warcraft orcs before, how would I tell it?

There's lots of models out there that know what that is

#

I'm no expert, very much learning the ropes, but the first step will be to curate 10 to 20 (10 is usually plenty) good examples of your custom race or things that are really really close to their likeness.

#

Then pick a pre-existing model to serve as the base model

#

for example:

#

#

Here is the result of the first generation for "orc" as the positive prompt with "text" being the negative prompt

#

I used abyssorangemix3 model for this, though that model is probably WAY hornier than what you're after, but it does know what an orc is

tribal frigate Mar 3, 2023, 10:01 PM

#

Has anyone tried to train a model with a 12 GB VRAM? If so, how was it... is there any chance to do models with small samples on a GPU like that?

woeful kettle Mar 3, 2023, 10:08 PM

#

manic patio There's lots of models out there that know what that is

thanks for the reply! I'm not totally sure I understand when you say to pick a pre-existing model as the base model? Do you mean that I should get 10 or 20 images and use them to fine tune an existing model?

Can you walk me through how it would work a little more? Like suppose I had 20 images of my homebrew Octopus people called "Foobars" -- what do I do to train the model? How do I get the txt2img output?