StableDiffusion is too big to fine-tune | Learn AI Together | Page 1

gritty tendon Apr 26, 2023, 6:27 PM

#

I need to fine-tune a text-to-image model. I'm trying to do it with StableDiffusion but 16Gb seems not to be enough VRAM even for 64x64 images and batch size of 1.

Is this normal or I'm doing something wrong?
Do you know some lighter version of StableDiffusion?
How could I configure SD to make it have less parameters?

Thank you!

sacred goblet Apr 28, 2023, 12:40 AM

#

Dreambooth likes ~18-25GB ram, but there are variations that use less, I believe. You are best off using colab to be honest.

gritty tendon Apr 28, 2023, 9:58 AM

#

sacred goblet Dreambooth likes ~18-25GB ram, but there are variations that use less, I believe...

All right thank you!
Btw afaik Dreambooth and Lora are only for fine-tuning. How could I train StableDiffusion from scratch (not pretrained) for a different dataset (not LAION)?

sacred goblet Apr 28, 2023, 10:58 AM

#

I'm not actually sure. Something akin to dreambooth but with a higher learning rate. I'm not sure if it freezes anything on top of the VAE components and text encoder, but there are repos that help you to fine tune these too.

However, I'm not sure exactly why you'd want to? Given a semi-large dataset you can use some variants of dreambooth to fine tune SD into a very different, if not more, capable model. Examples of these trained endpoints are available and are very good. To get SD level results from scratch would cost your tens of thousands in compute power, I'd assume. Plus the 10TB of space for the dataset.

gritty tendon Apr 28, 2023, 2:43 PM

#

sacred goblet I'm not actually sure. Something akin to dreambooth but with a higher learning r...

I want to train a SD model that can only generate 64x64 of faces without having been trained for anything else. I dont' want a generic model but a text-to-image model that can only generate different types of faces

#

Do you know some reliable small text-to-image model?

sacred goblet Apr 28, 2023, 3:39 PM

#

https://github.com/cientgu/VQ-Diffusion

GitHub

GitHub - cientgu/VQ-Diffusion

Contribute to cientgu/VQ-Diffusion development by creating an account on GitHub.

#

This is essentially stable diffusion

#StableDiffusion is too big to fine-tune