#✨|sdxl
nvm have it. just need to figure out how to use g vs l and all that haha
If this isn't modern art idk what is
also recommend installing this plugin, it allows you to install missing custom nodes/models https://github.com/ltdrdata/ComfyUI-Manager
Im not sure how to get creepier images. Its just seems to be too tame here
so im using the workflow for umm sytan how do i make ultrawide photos?
Do the base and refiner models share the VAE?
these are looking pretty sinister... yay
edit those to a different aspect ratio
Have you tested Dreamshaper?
I don't want to get too far ahead of myself at the moment, or be too optimistic, but it looks like there could potentially be hope for the 3090
It is still refusing to boot in my new computer, however I just got it to boot no problem with the old computer (with a brand spanking new windows install)
N
Alright, I can confirm now with no uncertainty now that the GPU is the problem
ComfyUI - I want an Image Browser like A1111 where prompt and settings can be recalled ... is there such a thing in comfyUI?
Dreamshaper XL, quite different results comparing with the base
Delete VENV and then restart ComfyUI?
im 100% sure seller knew ab it.
have u contacted theM?
They have been working with me for multiple days now to try and find solutions. I have tried everything feasible under the sun to try and give them the benefit of the doubt, but I can't do it anymore
I have now officially escalated my original claim with PayPal to their money recovery team, and they will be severing our connection and taking information from both sides
It is undeniable that the GPU is faulty now, and I should not be held responsible for a graphics card that started showing problems within less than 1 hour of appearing in the mail.
I have spent over 24 hours cumulative in the last 4 days working on this problem, and I found absolutely zero solutions, and nothing but a long list of reasons why all of these issues are pointing directly at the graphics card being dead on arrival
damn welp that blows, deal was too good-ish to be true in the end. hopefully u get ur money back
😦
Carpe Diem. For what it's worth - bend over backwards to buy a new (not used) GPU. Bite the bullet - and the expense - you'll live longer 🙂 Do you have patreon/PaypalMe at all?
For anyone else getting errors when launching SDXL using the Vlad Automatic installation, try launching the app using this command line ".\webui.bat --backend diffusers". I have a i9 13900k 4090 system, but had to use that to make it work.
you can buy him a really big cup of coffee, link is on his github
I have buy me a coffee, but there is no chance in hell I'll be able to afford a new 3090, nor would I want to, cause the GPU is too expensive for what it's worth
Can SDXL save images in .png format retaining the metadata like SD normally does? Default seems to be jpg with no metadata.
I am using VLAD A1111 - I start from a command line --medvram --backend diffusers
What does --medvram do?
It recently stopped working due to xformers conflicts - but since I am not using --xformers, I am not sure what to do - except jump to the excellent (but featureless) ComfyUI
Medium VRAM usage - I use an 8Gb RTX2070
--lowvram - even lower vram usage
I think I just read that using tiles low ram gpu can work with Vlad A1111... ComfyUI seems like a headache.
Literally nothing has changed ❤️❤️😘😍🥰♥️♥️💙💜
--medvram does not seem to be a working command line argument for ComfyUI in case anyone was wondering.
I was surprised at how easy to instal and setup ComfyUI really is - and don't worry about its "noodle nodey interface" - its a cinch!
I only use --medvram in VLAD A1111
I'm gonna miss the entire launch of SDXL cause some jackass decided to sell me a broken GPU
Share a desktop with a friend?
Did you buy it used?
It doesn't just work like that, I don't have any friends willing to do that, nor would I want to put that burden on them
Yes
I'll get my money back eventually, but it will be long after SDXL is interesting anymore
We'll be on 4kSDXL before you know it - or 8kSDXL!!!!! 😄
GPUs are pretty much the only thing I don't trust used due to Digital coin mining. They use the absolute crap out of a GPU and sometimes completely mess up the bios of the card with custom crap
Used is my only option, so I'll just have to take this risk again I guess
As I push towards an A1000 - I might have a very usable RTX2070 available ....
What card and budget are you looking for?
It's not that I don't have a graphics card, it's that I don't have a graphics card capable of doing the research that I'm trying to publish alongside my workflow
Oh no - there's me thinking out loud again! Ignore me 🙂
It is not all me trying to sound ungrateful, I immensely appreciate your guys' support, but I don't expect anything from the community
I keep getting this error when I try to get to the part of local tunnel - Earlier two parts go smoothly.
/tools/node/bin/lt -> /tools/node/lib/node_modules/localtunnel/bin/lt.js
- localtunnel@2.0.2
added 22 packages from 22 contributors in 2.14s
Traceback (most recent call last):
File "/content/drive/MyDrive/ComfyUI/main.py", line 64, in <module>
import cuda_malloc
ModuleNotFoundError: No module named 'cuda_malloc'
I do have a GPU that will run SDXL, but I don't have a GPU that will do what I need to do with SDXL, which is the whole point why I scrape together every penny I had (and even every penny I didn't) in order to buy this GPU
I need minimum a 3090. It has to be 24GB VRAM
So I'm gonna have to wait for this guy to drag out this claim as long as possible before he has to fork over the money, and then go through this whole process again with a new seller
How much was that one?
Immensely frustrating is the most understated way to put this
cuda_malloc is CUDA Memory Allocation
$600
Hell, I don't even have the money for it. My mom paid it for me to give me some time to sell my 3080 and 3060ti to pay it off
So I am just stuck, and so is my mom until this jackass hands my money back over
ModuleNotFoundError: No module named 'cuda_malloc' - you probably have a PATH error - and installed this to your appdata folder and not the ComfyUI_windows_portable_nvidia_cu118_or_cpu\ComfyUI_windows_portable\python_embeded\Lib\site-packages
Are you in the US?
which workflow are you using for this? I've tried creating 9:16 images but the perception is always weird
Yes
I'm going to sleep. I'm so done with today and all of this.
is the file cuda_malloc.py in that ComfyUI folder?
because it's in the repo; https://github.com/comfyanonymous/ComfyUI/blob/master/cuda_malloc.py
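A quick way to answer the question above is to check the folder directly. This is only a minimal sketch: the helper name `missing_files` is made up for illustration, and the folder path is the one shown in the traceback, so adjust it to your own setup.

```python
from pathlib import Path


def missing_files(comfy_dir, required=("main.py", "cuda_malloc.py")):
    """Return the names from `required` that are not present in comfy_dir."""
    root = Path(comfy_dir)
    return [name for name in required if not (root / name).is_file()]


if __name__ == "__main__":
    # Path taken from the traceback above; change it if your copy lives elsewhere.
    comfy_dir = "/content/drive/MyDrive/ComfyUI"
    missing = missing_files(comfy_dir)
    if missing:
        print("Missing from", comfy_dir, ":", ", ".join(missing))
    else:
        print("main.py and cuda_malloc.py are both present")
```

If `cuda_malloc.py` shows up as missing while `main.py` is there, the copy on Drive is probably older than the `main.py` being run, and a fresh download of the repo may be the simpler fix.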
If I choose two images in batch in A1111 each will be different because it increments the seed by 1 for each image. How do I get SDXL to do the same?
on comfy?
SDXL is just another model
its not a UI
on Vlad A1111.
SDXL also increments 1 by 1 - set Seed to -1
SD.NEXT/Vlad - set Seed to -1
If I set seed to 500 in A1111 and set batch to 4 images, it will increment each image by 1 seed. Right now in Vlad A1111 it makes all images exactly the same, apparently using the same seed for each one.
No, if you set the seed at 500 - it stays at 500 - setting it to -1 it will increment
the only other way is to set up Agent Scheduler, and increment 500, 501, 502, 503 etc for each new picture
Thanks. I'll look into it more.
I agree that in "ordinary" A1111 500 will self-increment by 1 step at a time
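The batch behaviour described for ordinary A1111 can be sketched like this. It is an illustration of the behaviour discussed in the chat, not code from A1111 or SD.Next; the function name `batch_seeds` and the 32-bit random range are assumptions.

```python
import random


def batch_seeds(seed, batch_size):
    """Return one seed per image in a batch.

    A fixed seed N gives N, N+1, N+2, ...; a seed of -1 picks a random
    starting point and then counts up from there.
    """
    if batch_size < 1:
        raise ValueError("batch_size must be at least 1")
    start = random.randrange(2**32) if seed == -1 else seed
    return [start + i for i in range(batch_size)]


print(batch_seeds(500, 4))  # [500, 501, 502, 503]
```

If a UI reuses the same seed for every image in a batch, entering seeds produced this way one by one gives the same spread of images by hand.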
Yeah, thought so. Can't get Vlad A1111 to do it. When I change seed to -1 I get error... cannot reshape tensor of 0 elements into shape
Sounds like a bug - I have an Xformers bug - so I cannot get into Vlad today ,,, have posted at Github and await his advice
yea but it says cuda_malloc not found while trying to install localtunnel as part of comfyUI on google colab. Thanks
Sounds like an install PATH error ...
I will redownload the notebook and try from scratch again on colab
Does anyone have a good workflow with SDXL I could copy?
If you want prompts (A1111 and Comfy) I got prompts
Prompts would be good
I could use some new ideas
I like some of the galaxy in a bottle things... or would like to try
Often when d/loading, stuff can end up in %appdata% folder instead of in python ... if u forget to select PATH when installing Python
the new LlaMA2 has reward and safe model,just like the nsfw combined with SD,but still a long way to catch up with GPT4
This is awesome. Is there a prompt or workflow for this?
Seems to use a test.ckpt and says not found. Is the ckpt on Hugging Face? Tks
Simple - Galaxy in a Bottle, style of Breathtaking night landscape with syellecafen
I used @high skiff's as a starting point
3rd attempt and we are getting close
Are you using a custom upscaler for this? When I load this into ComfyUI by dragging, I get an error about a SDUPscaler node
yes you would need to get this and put it in your custom nodes folder in comfyui
Awesome. Can't seem to get the workflow to work though on Colab ComfyUI.
Great let me give it a shot.
Ill give it a shot
ahhh trying to replicate/improve on the default Comfy Prompt
Is that what the default one is?
yarp
so the , with the empty space and , is not a mistake?
These are starting to look really good. BUT... what I want is for the bottle to be glowing and illuminate a pitch black room with the light emanating from the glowing galaxy in the bottle
that's not a mistake. I understand it as "kicker"
what does that mean?
something to steer the conditioning. you can actually add random words and see the composition change
the white space is a conditioning without a meaning
try with and without and check the difference
(some models more than others)
So I can use that in any prompt and it fills it in with something random?
no
Excellent that worked in Google Colab also. Had to add a snippet of code to cd into custom_nodes and do a git pull. Tks
it just steers the conditioning, the same as like changing the resolution by 10pixels might end up with a totally different result
no problem... always happy to help
that is interesting... I'm going to give it a try
to break the monotony (eg: generating always the same face) I often add a completely random unrelated word at the end of the prompt
it can't be something with a strong weight (like say... "banana") but it often works to get that little unexpected variation
nightime intergalactic landscape in a bottle syllecefan SDXL 0.9 ComfyUI
nice
Actually, Syllecefan is probably the name of the originator of the prompt I modified 😉
Cowgirl in white preparing pomegranates in a glass bottle

Dope
When I made my prompt - they came out very blue without me asking to - yours are very gray/green - you asked for those colors?
nope not at all
damn, did you buy it from the link I shared with you?
Ain't she gawjus?
^..^<
Thanks 🙂
mmmm, nice melons
lmao nice mellons
melons
Her name must be Mellonie? 😄
Mama! I'm scaahed!!!! 😦
G.O.A.T. 😄
lol
Pomegranate Cowgirl In White, White Horse
that is an interesting style
FULL PROMPT - Painting of white clad cowgirl pomegranate juice white horse blue sky, preparing and slicing pomegranates by Diego Rivera VICTO NGAI Zdzisław Beksiński georgia okeeffe
It is possible to train using Kohya Trainer for SDXL. Check this out https://github.com/kohya-ss/sd-scripts/discussions/662
@timid sonnet are you running your own chkpoint?
This is getting pretty good here
Cool - the right side especially
Very Route 66 - nice!!!
this one is awesome
Do you have a powerful nVidia GPU?
Here is the ComfyUI d/load - but only if u have an nVidia GPU.
you can even run on AMD; but its slower
U can use the dream command here in Stability AI, or use SDXL 0.9 on NightCafé and/or Clipdrop.co
Go to bot cahnnel to use / dream
bot channel
Anybody know how to find prompt used in ComfyUI - is the info stored somewhere?
If you feel that I am not communicating effectively and you didn't know what I meant ...
... but you dID kNOW wHAT I Meant 🙂
it makes the generated image better
the pedant police has arrived
Nep - SDXL is a two-stage process. The SDXL Base model makes an image. This image is txt2img format. This image in the second stage is fed into SDXL Refiner in a kind of (but not exactly) img2img fashion. Some of the txt2img data is also sent from the Base model - so the SDXL Refiner is used as the 2nd stage of the SDXL process
... kind of ... 🙂
it is an img2img, just at latent level
you are describing SDXL 0.9, which is a pre-release. This behaviour may or may not remain the same on full release of V1
latent2latent 🙂
I'm using my own, but it should be fine unless you are pushing the resolution too far. Also make sure the height and width and target height and width on the SDXL Clip encoder match aspect ratios with the empty latent you supply to the Ksampler.
Students face, looking determined and focus.
Go to the Bot Channels and use / dream
Because the Bot Channel will produce SDXL 0.9 images for free
SD devs said to just use 4096
at least for the 0.9
If you try use a 1:1 aspect ratio for clip and then a 9:16 for the image it will stretch out the perspective. It's why people keep getting long people in their generations
Are we flirting? 🙂
what do you consider the bare minimum "powerful GPU" to generate images because I quite happily use a 3 generation old 1080ti without any issues
width/height and target_width/height need to be the same (4096)
but it doesn't need to match the latent
But you don't have a powerful GPU - or - But you don't have a powerful GPU?
I'm just reporting what they said on the official server
I've done loads of testing. If it doesn't match the latents aspect ratio it messes up the perspective, usually more noticeable with people.
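For anyone who wants to test this themselves, here is a small sketch that checks whether the size given to the SDXL CLIP encoder has the same aspect ratio as the latent, and that scales a latent size up to a 4096 long side while keeping its ratio. The helper names are hypothetical and nothing here calls ComfyUI; it is plain arithmetic to compare the two setups being discussed.

```python
from fractions import Fraction


def same_aspect(cond_w, cond_h, latent_w, latent_h):
    """True when the conditioning size and the latent size share one aspect ratio."""
    return Fraction(cond_w, cond_h) == Fraction(latent_w, latent_h)


def scaled_cond_size(latent_w, latent_h, long_side=4096):
    """Scale the latent size so its longer side equals long_side, keeping the ratio.

    Results are truncated to whole pixels, so odd sizes may be off by a pixel.
    """
    scale = Fraction(long_side, max(latent_w, latent_h))
    return int(latent_w * scale), int(latent_h * scale)


# A 768x1024 portrait latent with the two conditioning choices from the chat:
print(same_aspect(4096, 4096, 768, 1024))  # square conditioning, portrait latent
print(scaled_cond_size(768, 1024))         # ratio-matched conditioning size
```

Generating the same seed once with each conditioning size is the easiest way to see whether the stretching shows up on your own setup.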
My SDXL 0.9 in ComfyUI is often 768x1024 - to make poster size
One was a statement - the other was a question
got SDXL working on A1111 without silly extensions, this is amazing!
I can't say I experience the same but haven't tried much
I stick with the numbers @high skiff used in his workflow and as long as you dont go far beyond the bounds of 1024x1024 I havent noticed any distortion issues.
I.e. the same behaviour as SD1.5 or 2.5, as long as you don't try to generate stupidly oversized images then you get little distortion
I tend to (mostly)stick to these max generated sizes
2.39:1 1280 x 536, 1024 x 432, 768 x 322, 512 x 216 (Cinema/anamorphic)
3:2 1280 x 856, 1024 x 680, 768 x 512, 512 x 341 (Professional Photography)
4:3 1280 x 960, 1024 x 768, 768 x 576, 512 x 384 (General Pictures)
5:4 1280 x 1024, 1024 x 816, 768 x 616, 512 x 408
16:9 1280 x 720, 1024 x 576, 768 x 432, 512 x 288 (Wide Screen)
16:10 1280 x 800, 1024 x 640, 768 x 480, 512 x 320
Yeah these are fine. I'm talking about the sizes you can provide the SDXL CLIP encoder
You can mismatch sizes, which in my experience can cause strange issues when the aspect ratios don't match.
left aspect ratio match, right all 4096
Well the smaller ones aren't
I leave them hard coded at 4096x4096 and dont touch them
It's usually portraits where it's more noticeable. But it doesn't always happen. I was just answering the guy what my workflow was for the portrait image and why I'd changed it.
and providing i stick to the sizes I mentioned I see very little if any stretching
Changing those clip values can change the images drastically. It's quite interesting.
not saying that you are wrong, I'm sure you did more experimenting than myself
I just reported what I was suggested
Yeah I saw that suggestion too. It's fine most of the time. I just like messing around with the values and I noticed that when I was getting stretched out people that helped.
ayy man you finally made it work 😄 congratulations
and reasons
do you even lift
the case is so dirty but its a beast... ill take it to the car wash later and hose it down
I saw carpet and almost had heart attack. But then noticed that pc is lifted up.
i run sdxl on my fridge /s
I run it with pen and paper /s
the bruh intensifies
wat
im glad you dont have to type these very long negative prompts anymore
(most cases=
dumb question, dues easyn work on sdxl in A111 or does it need to be retrained?
does easynegative work on sdxl
soap water always works 🙂
i cant seem to be able to get them to show when using sdxl
@eternal fog is there a way to link a text node to these nodes
so I can just type in the prompt once
These came out pretty good... yay space
I wish I knew how to train a model or embeddings or a LoRA this could be pretty cool to just load up when I want
Those are text nodes. They go into the SDXL clip encoders. If you always want the same text in both just connect 1 of them to both text inputs.
you can think of it as something close to img 2 img, the output from the base model gets sent there for detail and refining
Just because I feel like I couldn't get this exact being right in any other ai so I'm excited sdxl can do this
Oh, that's really cool
nope I'm actually a genius. I managed to get it to work with A1111
Spoopy new creepypasta?
What did you do to get this to work in A1111?
There's a branch you clone
there is a branch that lets you load it
okay, this isn't simple so bear with me- first step is to download this :https://huggingface.co/stabilityai/sdxl-vae
then change commandline_args to --opt-sdp-attention --xformers --theme dark --no-half-vae, well, you don't have to do all that, but you do need --no-half-vae
and that's pretty much it, it works WAY better than comfyui
It's not the nodes, it's the inference. A1111 has its own magic tricks
I really like your images but I have to say I really like ComfyUI I have been able to get some pretty amazing results so far
I'm going to run your A1111 setup though and compare. I like good surprises
depends, you have a lot more control and margin for experimentation with the node system
I also like that it is so easy to share different workflows..... that part is miles ahead of A1111
just drag an image or load the .json file
yeah the image metadata stuff rocks
Idk, I personally have a lot of experience with nodes. and SDXL works insanely better on A1111 for me. It's also much faster
I think I will still set it up and give it a try
For the SDXL-VAE what exactly am I downloading.... Just the sdxl_vae.safetensors file???
yeah, add SD_VAE to quicksettingslist then select it.
Do I still need the base model and the refiner?
just the base. it doesn't support the refiner just yet, but man, this is great
I recommend using hiresfix for now, it kinda does the job
Ok so just this file??? and where do I need to put this?
yes
models/VAE
Ok and where is the quicksettings list you were talking about... Can you screenshot that?
user interface/quicksettings
I did that and it still says failed to load model then it reverts back to another model
disable all extensions except for crucial ones, happened to me as well.
I use Vlad AUTO1111 for SDXL, and ordinary AUTOMATIC1111 for SD
Do I need to get the model without safetensors?
U can d/load both diffuser and safetensor - diffuser goes in the stable-diffusion-diffusers folder - and then appears under Models drop down
Start VLAD/AUTO1111 using webui --medvram --backend diffusers
bro are you blind?? the fans are on the back of the case
NO IT'S NOT. what kind of PCs do you know??? the air should go out of the radiator
plus, no one cares what PC you have. leave him alone
where do I put the command line arguments?
Open a CMD Prompt, then open the stable-diffusion directory where A1111 is - then write in the Command Prompt webui --medvram --backend diffusers
okay, I can immediately tell you are genZ
of course you don't
well, usually they do. people learn things. but no one here judges you before you start judging them. you kinda violated the server's rules.
cough cough 
okay, good luck I guess. I NEVER talked to someone that doesn't know what year they were born
Off topic
Well that escalated quickly
I take no sides.... except the dark side of the force
What do you all think of these???
Do i still need the --medvram with a 4090?
hmm
--opt-sdp-attention doesn't work in vlad1111
Ok it runs with the SDXL model... Do I need any other files besides the Base Model or is it all good on just that one in Vlad?
That looks fantastic... Did you run the prompts I had?
If its running - don't touch anything! It'll be OK ...
yep
no VAEs or anything needed?
but if it does crash, just delete VENV, and wait the 20 minutes it'll take to rebuild it 🙂
Like I say - if its up and running - leave it alone 🙂
Just rerunning some old MJ prompts thru ComfyUI
did any of you get img2img to run on vlad1111? 👀
you're hilarious
I think it can be done on comfy 🤔
sd next
Someone made a different version of A1111 it has some different features and supports SDXL
First few good results from Vlad A1111
I haven't used this interface before and haven't used A1111 in about 6 months... Can anyone tell me how I can upscale these or get these to be upscaled automatically as they are generated?
Not the exact one, but one very closely linked ._.
oh ok, did you talk with him so you can return it back to him?
Likely gonna need to force it on him, but we'll see
I've been in contact the whole time
ok, i hope you have done the transaction through paypal g&s
he is not talking about you
What are you talking about, you have nothing to do with this conversation lol
he's mistaking you for someone else, or just trolling 🤷♂️
Nep is always a troll, or at least I hope he's not serious lmao
I think Nep is just spoiling for a fight!
Just ignore him ...
Yummy, this rum and raisin ice-cream is way cool - let me write up a prompt so all y'all can have some! 🙂
Troll prompt
looks more ghost-y 👻
She 8 all the eyes scream 😦
eyes scream
👀
tbf, if she suddenly showed up on my house, I'll give her all of my ice cream 😨
I'd give ice-cream to these two ...
Just playing around
Much less grief using Topaz GigaPixel imho - but that's just me!!! 😄
Trying out multiple colors , behind a textured glass, looks good till upscale.
I only have access to an old license for GigaPixel so I'm not sure if it will be able to do the upscaling I need
Some freaky pix
SDXL allows me a one pass Upscale of 6x - so my 1024x1024 becomes 6144x6144. Without SDXL, it used to be a 6x followed by a 2.75x etc
But one day, and just for fun - I'd like to try an AUTOMATIC1111 Upscale/InPaint/OutPaint/ControlNet etc etc
Some Neon Flamingoes
Here is a wildcard Prompt = Stock Image ... it has amazing results
dont know how I misread your prompt. but here's "Neon Flamingo Evangelion"
What prompts are you using for these?
a crazy old scientist with a long white beard holding a blue light, still from a fantasy movie, cg artist, the electric boy, by Adam Paquette, inspired by Rube Goldberg, brandishing cosmic weapon, 3 - d 8 k, juno promotional image, sad wizard, asian old skinny scientist with a big beard and beard, by Peter Mohrbacher
Does sdxl support image to image
local yes
How would I use image to image in ComfyUI?
A1111
add a load image node, then encode vae, then feed into the first sampler, set denoise to .25 or start step to above 0
do you have a workflow setup with this already?
A1111 doesn't work with the refiner yet, but it's definitely the best way to use SDXL
for some reason it doesn't work for everyone, but when it will, that's going to be the meta
i'll make a quick one
@west breach after testing the prompt node, two things. When selecting no style, the addition of an L clip box and output would indeed be handy. Also, I guess this may be unfixable, but since the styles are a wildcard, it makes ComfyUI start at the beginning, removing the built-in optimization. For example, before, if you wanted to increase a sampler's steps, the run would start at that sampler step; with the node, though, it restarts the entire gen, as it thinks it's a completely different input. Otherwise everything is working amazingly, thank you.
if you aren't using wildcards, set the control after generate to 'fixed' to stop it triggering the whole workflow
might be better to create another node without styles? then you can enter your own style prompts
Ahhhh. Wow. Much intelligence you possess.

I understood some of those words
Yeah that’s how it’s rigged up currently. It’s just a mild inconvenience to switch out the pipes
haha, dUo an I are talking about a custom node I made
so that doesnt have anything to do with the JSON you provided?
no, that shouldn't require any custom nodes, I hope
Hey peepz, can somebody give me a hint how to make an image perfectly round? I have an image of a mushroom from beneath and it is not round. Thx guys!
@west breach FYI this is not quite 16:9 as it equals 1.75
The correct measurements are 1368x768
Oh apart from an inconvenience it would unclutter two nodes
resolutions have been set to the closest multiple of 64
what was wrong with using 8 ?
Can you send a link to the upscaler model you use?
eg if I use your AR of 16:9 for creating wallpapers there are gaps each side of the desktop on my 16:9 monitor
do you have any? you can select a different one
Thank you
U can blame SDXL. They trained it at these ratios.
Yeah it's using the resolutions listed in SDXL documentation. You can add your own by adding a user_ratios.json file to the root dir of comfyui
holy crap folks... I can run ComfyUI and VLAD A1111 at the same time and it doesn't crash... IDK how that works but its awesome
it's like running excel and a web browser at the same time.
They are 2 different programs
It's called multitasking ;o)
just can't run batches at the same time
unless you have 2 GPUs and have one allocated to each program ;o)
ya but my graphics card still only has the same amount of vram to pull from
Prompt = Stock Image
Diffusion models are trained in 64 increments which means there's no real 16:9, you can generate and crop to get your desired ratio
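The "generate and crop" suggestion comes down to simple arithmetic. The sketch below works out the largest centered crop box with a given ratio; the function name `center_crop_box` is made up for illustration. The tuple order (left, top, right, bottom) is the same order Pillow's `Image.crop` expects, so the result can be passed straight to it.

```python
def center_crop_box(width, height, ratio_w=16, ratio_h=9):
    """Return (left, top, right, bottom) for the largest centered box with the given ratio."""
    if width * ratio_h > height * ratio_w:
        # Image is wider than the target ratio: keep full height, trim the sides.
        new_w = height * ratio_w // ratio_h
        new_h = height
    else:
        # Image is taller than (or equal to) the target ratio: keep full width, trim top and bottom.
        new_w = width
        new_h = width * ratio_h // ratio_w
    left = (width - new_w) // 2
    top = (height - new_h) // 2
    return (left, top, left + new_w, top + new_h)


# A 1344x768 render (ratio 1.75) cropped to true 16:9 loses 6 rows top and bottom.
print(center_crop_box(1344, 768))  # (0, 6, 1344, 762)
```

The cropped 1344x756 image is exactly 16:9, so it fills a 16:9 desktop without gaps once scaled.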
Ok, I just know that @wicked frigate (I think it was) has done an A1111 ratio plugin thingy that used multiples of 8 for that, which can give you a true 16:9 (I haven't checked how far out any of the others are)
Yeah there are workarounds, but you'll get the best quality when generating at the trained resolutions
link me up im interested in that bud
cheers mate

Ill do it in ab an hr and report back. Or a little later.
That looks like sex tho.
I have to go to bed anyway 😄
Goodnight
also added a new HaldCLUT node for applying film emulation to the final image @upbeat summit
before you go can you upload the image you have in that screenshot. that looks great
this is either a very subtle troll or just coincidence lol
I'm not trolling ?
why would that be a troll? you got bad juju.
@narrow seal different aspect ratios aren't always possible on web services. try using dreamstudio, they've got aspect ratio settings.
So it's not possible on web
the website you use doesn't appear to allow aspect ratio changes. dreamstudio does
hey I did say "or just coincidence" 🙂
It was just mildly amusing that only 10 minutes earlier we had this
is that clipdrop?
Yes it is
/dreamdog
I thought that channel was for clip drop stable diffusion
it's weird that they keep the image generation limited to squares without negative prompts, but they're still pretty good. they've got all the other tools you can use with pro too.
naw this server is for all things stability and this channel is for all things sdxl. clipdrop is a site stability owns
You can turn your square photo to 16/9 but it regenerates sides, and it doesn't give good results
the SDXL bots are looking pretty solid ATM
Very excited to see how much more quality I will be able to squeeze out on launch
i dont think uncrop uses sdxl yet
Guess it's sdxl 0.9
Are you guys using 1.0 somewhere there
Well i have no idea what I'm talking about too. Just saw clip drop, images were good and bought the pro plan
You could try the bots which are 1.0 candidates
yeah, the bot uses SDXLa SDXLb and SDXLc
you can use the 1.0 candidates for free in this server in one of the bot channels
uncrop is the tool they have to extend edges and stuff. it doesn't use sdxl 0.9. it uses 2.1 i think
Unlimited generation? Or does it have limit
unlimited
I think one of them is better than the other two
Damn i have paid for nothing then
for sure, or one is really bad of the 3, cause I keep seeing some ugly ass gens
you wouldn't be able to tell. results are randomized. you'd have no way to know if a result comes from a certain model.
one of the models is like super HDR look, and it looks crusty and gross lol
that way people can't cheer on their biases
all I know is some of the images look consistently as bad almost all of the time
Idk man, I think I can slightly tell, even with random settings
x8 is required due to how the code works, x64 is sometimes preferable just because the model was trained on x64 inputs so it tends to be kinder to those
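The x8 versus x64 difference is easy to see with numbers. This sketch snaps a 16:9 width to the nearest multiple for a 768 pixel height; the helper names `snap` and `width_for` are hypothetical and the code is plain arithmetic, not taken from any UI.

```python
def snap(value, multiple):
    """Round value to the nearest multiple (never below one multiple)."""
    return max(multiple, round(value / multiple) * multiple)


def width_for(height, ratio_w, ratio_h, multiple):
    """Width for a given height and aspect ratio, snapped to the given multiple."""
    return snap(height * ratio_w / ratio_h, multiple)


for multiple in (8, 64):
    w = width_for(768, 16, 9, multiple)
    print(f"x{multiple}: {w}x768, ratio {w / 768:.4f} (true 16:9 is {16 / 9:.4f})")
```

At x64 the width lands on 1344, which is the 1.75 ratio mentioned earlier, while x8 gives 1368, which is much closer to 16:9 but not one of the sizes the model was trained on.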
does anyone have a clue how to generate a profile of the Hollywood director Christopher Nolan that looks like him?
i'm just confused why sdxl doesn't work well on a lot of celeb images
realistic people is unreal
Like a sort of celeb and protection thing
love these tanks it making. this lora is working well. i think i'll publish on civit on sunday, my next day off. Want to try a few more things with it. Will take these findings into the 1.0 version
a new master of Monopoly.... Dare to play... try passing go with this guy
one of the models, or settings combinations that happens pretty frequently looks like shit lmao
what is thissss
I am not sure what is up with whatever is happening with the hyper overly tonal contrasted images lol
the tank looks dope, the rest looks crusty
something feels very weird with the SDXL bots at times haha
the lora uses starcraft unit images to train. i cut the background out of 90% of the images. so if i don't prompt a background it generates against grey. yeah the background is just "grassy meadow" and it do look jank. that aint my lora, that's just sdxl being bad at grassy meadows
In that case I'd assume its just bad prompting
genuinely what is happening with the tonal contrast lol
I hope they release info on why this looks so shit so we can steer clear
That's my assumption too. I still gotta up my game with prompts. I don't have a bunch of saved clips for xl yet. Might cobble a cheat sheet together on sunday
Prompting on xl is sooooo weird and different
Anyone got tips?
I already have some documentation on it, tho it looks like 1.0 is gonna make all of the community info useless tho
same issues for me. i just bang out prompts to get a test out. no real thought to them other than how do i draw out the lora best?
naw it still uses the same two clip layers. the prompting methodology should still have a ton of relevance
we will have to see
Oh shi any mod here? I made a prompt to bot-1 that doesn't comply with the rule #4. Can anyone delete?
any suggestion about the adjustment?have u tried with modify workflow or different prompts?
Sorry didn't realize there was a gore restrictions, though it was for nudity
greetings
At the moment I am extremely discouraged from all of my SDXL work, so I am on hiatus right now. Trying not to be too down about the whole thing
you got a bit of a sky is falling view on 1.0. maybe you've been listening to pseudo a bit too much
that guy seems like a bit of an anchor
Pseudo and I know a lot more about the way SDXL works than a lot of people do, and it has a lot of core issues, but they shouldn't be things we couldn't fix honestly. Its monumentally better than any previous SD version, even with its issues
And I don't have a sky is falling view on SDXL, I am actually very excited, which is why I am upset because I will be missing the start wave of interest in SDXL
Magritte and Varo - The Multiverse, Snow Crash, in the style of rene magritte and remedios varo
thats a vague statement. sdxl is so niche that of course anyone with interest will know more than most people.
being discouraged based on conjecture is just a personal choice. i'm sorry to hear you've gone down that road. hope to see you on the other side
Did you even read my message?
like, genuinely
cause it seems like you just brushed it off lol
oh no. "do you even read?" is not a reply that bodes well. ttyl sytan. enjoy your friday afternoon
or morning whatever it is where you at
Bro, I just said I am not upset at all with SDXL and you are using this condescending tone of "that's a shame to see you be so stupid"
Like, I legit just said I am only discouraged cause I don't have the means to do the research I had planned anymore lmao
If you think I have some negative view towards SDXL after pouring almost every waking moment of my time into it, IDK what to say
Like yeah, SDXL has some issues, but everything has issues, and I look forward to working with people to try and fix/lessen the effects of them
yeah. thats what i said. that you hate sdxl... 🙄
ngl. these conjecture spirals are dumb
my meowmeows! i love them 😻
"You being discouraged based on conjecture is just a personal choice. I'm sorry to hear you've gone down that road"
Anyways, I haven't, there was legit no conjecture at all in my statement, but ok
I gotta convince the team to make SDXL 2.0 an exclusive catpic generation megamodel. It's what's best for society,,
hmm, probably having a common issue, but I'm trying SDXL in A1111 and getting either black image gens with the vae on HF or off-color images with an older vae
actually, that is without the refiner. and it still might be the best way to run SDXL (A1111)
industrial revolution began when we started sharing cat paintings. coincidence?
Reason I am discouraged = I was sold a faulty GPU and will now miss all of the research I had planned for the release of SDXL
reason I am NOT discouraged = some weird conjecture thing you are assuming from me @trim orbit
He never actually said he hates sdxl
SDXL rocks, even just 0.9. I have been able to do some amazing things with it already, which is WHY I am upset, cause I lost the means to do so by being scammed by somebody
I was speaking towards this context. you never brought your gpu up this morning, and you should really not assume that's the primary focus of a topic in #✨|sdxl
use the VAE that's in the model (in auto that's, uh, turning VAE off, since the auto VAE option is actually a VAE replacement option). Also make sure you launch with --no-half-vae
will give it a shot thanks
also, when does A1111 implement the refiner?
^..^<
I mean, its not conjecture if its something I have live tested
Prompting in 1.0 will be different for sure, but I am not sure just how different
It looks like it could be different in a positive way
SD.NEXT (VLAD) AUTOMATIC1111 works with refiner ...
but comparing identical gens from just a week ago, images don't look even remotely the same now, which means you need to prompt a different way for the same output, which is fundamentally a change
but I think it could be one for the better honestly
it seems to listen a little more (to its own detriment often honestly)
SD.NEXT is a bad UI. I tried making identical images using a 1.5 model with both, the A1111 version was WAY better.
thanks i know that. hyperboles are fun for dramatic effect. the flippant tone of that post is conveyed well, at least i thought so, with the eye rolling emoji. maybe i could add a /me rolls eyes extraordinarily command to get that across better? next time i'll try. thanks for chiming in on this "doesn't matter at all" topic
alright, so you aren't being an ass to me exclusively, dope. Good to know for the block note
it's not really a contest. they're both just gradio ui's that have beta support for sdxl. automatic's doesn't prompt sdxl right either.
I use Vlad for SDXL (and ComfyUI); and I use Gradio AUTOMATIC1111 just for SD
I kinda wanna try A1111 for SDXL, just to run some direct comparisons sometimes against mixed diffusion
I was talking about normal models. A1111 is by far the best
I would disagree very much with that, most people can't even figure out how to install it
available on the release_candidate branch today. git switch release_candidate
I have used ComfyUI and Vlad A1111 - image quality is equal ...
people keep forgetting about all the other user friendly uis
... in SDXL
its not very hard at all, I have had people who have like no program or SD experience install it in like 10 minutes
yeah, but what about compared to partial diffusion?
that's just selection bias then. both do 1.5 "normal models" the same and if they aren't, you don't have the settings tuned to match each other. likely different vaes
idk, I tried both- A1111 was insanely faster and better, and it didn't even use the refiner.
A1111 has a CFG up to 30 - ComfyUI up to 100 - so CFG 10 in A1111 is actually CFG 30 in ComfyUI?
they're both just gradio ui's and arguing about what does 1.5 "normal models" in #✨|sdxl makes zero sense
That did it, thanks. I set the VAE setting to "None" and not "Automatic" in this case (just noting for clarity and for anyone looking later)
a1111 is not faster, at best it's going to be the same speed as comfyui
no, they are the same, just limited in range as over 30 is basically never used
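For context on the CFG question: the scale is a plain multiplier in the classifier-free guidance formula, so a slider's maximum does not rescale it. A minimal sketch with toy numbers, assuming list-based "predictions" in place of real tensors; the function name `apply_cfg` is made up for illustration and is not taken from either UI:

```python
def apply_cfg(uncond, cond, scale):
    """Classifier-free guidance: start from the unconditional prediction
    and move toward the conditional one by a factor of `scale`."""
    return [u + scale * (c - u) for u, c in zip(uncond, cond)]

# Toy stand-ins for the two noise predictions (hypothetical values).
uncond = [0.0, 1.0]
cond = [1.0, 3.0]

# The result depends only on the scale value itself, not on how far
# a UI lets the slider go.
guided = apply_cfg(uncond, cond, 10)
print(guided)
```

So CFG 10 would mean the same thing whether the UI caps the slider at 30 or at 100.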
and that is pretty optimistic to expect as well
man, it was 20% faster. I tested both
configuration layer 8 issue
I run A1111 and comfy together now (I use A1111 for inpaint cause it works so much better for me over comfy), and A1111 is always at least 10% slower for 1.5
then your inference isn't optimized
what do you mean?
compared to comfy? if comfy is slow for you, then you're doing something wrong
comfy is a much lighter install, it makes sense that auto would have some bloat and slow it down a bit
See my models list ...
comfy is actually much faster than auto now with that BF16 VAE. That thing is black magic
among the popular UI's that people can install easily, comfyui is unequivocally the fastest out there
Idk man. I tested all settings with both, A1111 was much better and faster
that seems like a physical error, there is no reason comfy would be slower otherwise
configuration issue then. telling you man
Comfy has a much more efficient way of diffusing
theres no reason comfy should be slow
ComfyUI was up and running 30 seconds after it landed on my desktop. A1111 I think arrived on a Wednesday, (or was it a saturday)?
20% sounds like the difference between SD1.5 vs SDXL at 1024x1024
your testing is flawed
I am gonna have to agree with flowwolf here right now
this is a good profile, but not the realistic Christopher Nolan i wanted
i think people are using default values in automatic. the 512x512 width and height
SDXL is 4 x (512x512) - so on my 8Gb VRAM machine, it can take about 4 times as long as SD
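The "4 x (512x512)" figure is just the pixel-count ratio. A quick sketch of the arithmetic, assuming the usual 8x VAE downscale for the latent size; the helper names are hypothetical, and actual generation time need not scale exactly with pixel count since the SDXL U-Net is also larger:

```python
def pixel_ratio(w_a, h_a, w_b, h_b):
    """How many times more pixels image A has than image B."""
    return (w_a * h_a) / (w_b * h_b)

def latent_size(width, height, vae_factor=8):
    """Latent resolution the U-Net works on, assuming an 8x VAE downscale."""
    return width // vae_factor, height // vae_factor

ratio = pixel_ratio(1024, 1024, 512, 512)
print(ratio)                      # 4.0
print(latent_size(1024, 1024))    # (128, 128)
print(latent_size(512, 512))      # (64, 64)
```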
or people comparing samplers that call the unet twice on comfy vs euler on a1111
yeah, or maybe an inefficient node setup to achieve the same results
Yes, some models make a hi res pass
idk. I'm making images like this with SDXL on A1111 with 6.7it/s =\
on what GPU?
4070TI
that seems weird
actually it will be faster on automatic than ComfyUI
i love that eular is popularized as a word in the lexicon again. 1) because it sounds like Bueller, and 2) because Euler discs are frigging cool. i won't pretend to know anything about the mathematician himself though. maybe he was cool. probably just some nerd though.
Someone anonymously post a ComfyUI picture, and an A1111 picture ... we can then have a guessing game over 10 items ... winner gets an Nvidia A6000
...an A6000 which has done a year of data mining 😄
on that note, flowwolf is out for the day. it's friday! you know i gotta get down on friday
sounds like Bueller? They are pronounced way different (which I learned to my own dismay lmao)
Euler = Oiler
thats just the correct way of saying it . like that matters
Euler was pretty cool though, gave us lots of handy math
idk, whatever my A1111 inference is it's way better and faster than comfy is for me =.
fair enough
As no-one said to his girlfriend "eulerv me?"
I know its Oiler now, and I still say you'll-er lol
everyone says it "eular" like it's spelled. eular. eular. eular. see.
Euler, Euler, Ferris Buehler!!!!!
Is this A1111 SDXL? But not the Vlad version?
Euler isn't that good. In my opinion DPM++ 2M Karras is best
took too many math classes where you'd get chalk thrown at you to call it "you'll-er" 😛
yeah
yeah haha
I only found out recently 😅
My favourite models are most of them 'cept DDIM
SDXL doesn't work with DDIM
I find it interesting how many people are against DDIM, when its won my blind vote for quality in this server basically every time for SDXL-
i use ddim cause it's hella fast
sdxl works fine with ddim
Heun, Uni-PC, Euler and Euler A, Karras ++2M,
there's no reason it shouldn't work
then once i find a prompt i switch to something that cooks longer
okay now i'm really out. ☮️
although, I will say that Pseudo did just have a big discovery with an issue with DDIM, and I think he fixed it, but don't quote me on that
I got a very faint kind of sketchy outline using DDIM ... but no, it's not good
In A1111 at least
I don't have any problems with DDIM. but DPM++ 2M Karras is just as good, and it's slightly better with detail
my whole workflow was made specifically to work with DDIM, tho I have a big feeling the sampler preferences will change with 1.0 cause of how different it is
is there a ddim_eta setting in comfy? its hidden away in a1111 but i like it a lot, adjusting between .7-.9
comfy feels like magnitudes faster than a1111 for me
this has to be one of the most fucked SDXL images I have ever seen lmao
I am at a loss for words lol
you can change ddim_eta in the code but it's not exposed in the ui
so i can try to make myself a node ?
I heard that Kohya optimized his SDXL training scripts, so maybe I am not completely out of hope for my research
I got myself a beef and curry node at Lidl
although my tests are likely to be way slower now unfortunately
Node?! Noodle 😄
Nodette
How much optimized?
no idea, hopefully enough for me to not be completely screwed on my research
I'm going to wait until 1.0 before I try that Lora - First Ever SDXL Training With Kohya LoRA
Isn't SAI going to release specialized finetuning tools with SDXL1.0?
no idea
@indigo carbon Check these out
they said they are, but I wouldn't hold my breath
they could release tools, but that doesn't mean they are gonna work/be good
Were you not able to use the 3090 or 4090? I forgot what u had
its ruined, completely dead
Wait when did that happen?
they sold me a scam GPU, and now I am fucked for like the next month
Were you overclocking?
no, the GPU itself was already dead
he sold me a faulty card
either that or it miraculously died in transit, in the original box, with an antistatic bag, wrapped in bubble wrap inside another bigger box with no cosmetic damage 
Carz
wait, what did you do with your old gpu?
I still have it, but it can't do what I had planned
What did you get it off?*
I dropped all the money I had (and a lot of money I didn't) on that 3090, and now I am completely tied up until this jackass gives me my money back in paypal claims
ah, sucks, but at least you're not totally hosed
Ebay paypal?
no, paypal goods and services
they are on my side ATM
but I need him to not fight it forever so I can get a new GPU ASAP, otherwise all he is doing is hurting me for no reason
Since it's goods and service you should not have that hard of a time it's not like you did friends and family
its not really the hard time thats the problem, its that its gonna take forever and I am gonna get a GPU WAY after the launch of SDXL when my research will be meaningless
Could always be worse
it could be snowing
I'm hoping the zero terminal SNR model wins, so that offset noise isn't needed. It's difficult to train with offset noise.
that's kinda a toxic mindset
Discounting all issues as "it could be worse" is a good way to get complacent
Didn't mean it as a toxic comment, just in general
Teddy Bar Samurai
I had a car I was gunna sell to a friend for $13,000 and it burst into flames as he was driving it home, before he had paid me, cuz I was being nice and I was in a different state. Afterwards he wouldn’t pay me because he said “it was the car's fault”, even though his friend test drove it and it did fine. Now I’m out 13k. Lawyers said it’s too hard of a case to handle and unfortunately would probably go more in his favor.
I'm not trying to say it can't be worse, cause of course it can, but that doesn't discount that I have been royally taken advantage of while trying to do good for myself and others.
I am not even out $600 right now, my mom is, and thats not fair to her either
Are you a wizard?
Let’s hope it gets resolved soon!
I am strong arming the fuck out of the seller, cause I am done
holy crap, I really like the way SDXL generates so far.
I am giving him the option to just fold early, or I will bury him in proof and evidence in that paypal claims court
It’s very good!
I'm just playing around with old prompts in 1.5 style, are there any best practices for XL? I feel like with SD 1.5 I gradually worked my CFG down all the way to like low 4s to get anything good out of them
I don't recommend using prompts from 1.5 for SDXL
you will likely be hindering results that way
@high skiff what was your old GPU?
yeah, SDXL prompting is way easier than 1.5
How so? (I just started using this 20 mins ago so steer me correctly )
MSI 3080 Trio X
You can train LoRAs at least then
in 1.5 you use tags, in SDXL, it plays a lot better with linguistic prompting
1.5: Dog, grass, sunset, field, detailed, photograph
SDXL: A photograph of a dog in a grassy field at sunset
it also allows you to do more specific prompting a lot easier
oh, I always wrote slightly more narrative tags anyway (rather than the booru style)
tho its not as consistent as it should be at the moment
and then a lot of attention pushing
there is a little hope for my research not being completely DOA, but not having access to a proper GPU to test my theories is gonna hurt my findings for sure
my biggest limit now is time, as training a LoRA on the 3080 will be way slower than on the 3090
weezord
oh yeah, lora:add_detail would be insane with SDXL
wonder if one day text2img will be combined with text generation
my fractional offset kinda already does that built into SDXL's pipeline
It's not too bad, depends what you are wanting to do. Something like 1200 steps takes about an hour with a decent sized dataset.
yeah, hopefully I can find better settings than I was using
And it seems like LoRAs overcook really quickly anyway
cause even on the 3090, that Na'vi LORA took 2 hours, so likely closer to like 3 on the 3080
So you only need like 600 steps or less
compared to 1.5, no way
you can be very brutal with SDXL compared to 1.5
let me do some math, just a sec
Interesting, maybe that's because of the blue skin. Everything I've tested it was done within 30 minutes.
Well "done"
oh yeah, finetuning SDXL should be way easier than 1.5, SAI said they will release tools capable of that.
I trained my Na'vi LoRA for 6900 steps
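The math behind the "2 hours on the 3090, closer to 3 on the 3080" estimate can be sketched like this. The 1.5x slowdown factor is only an assumption read off that guess, not a measured benchmark, and the helper names are hypothetical:

```python
def seconds_per_step(total_hours, total_steps):
    """Average time per training step for a finished run."""
    return total_hours * 3600 / total_steps

def estimate_hours(steps, sec_per_step, slowdown=1.0):
    """Projected run time on a card that is `slowdown` times slower."""
    return steps * sec_per_step * slowdown / 3600

# 6900 steps in roughly 2 hours on the 3090 (figures from the chat).
sps_3090 = seconds_per_step(2, 6900)

# Assumed 1.5x slowdown for the 3080 (a guess, not a benchmark).
hours_3080 = estimate_hours(6900, sps_3090, slowdown=1.5)

print(f"{sps_3090:.2f} s/step on the 3090")
print(f"about {hours_3080:.1f} h on the 3080")
```

Halving the step count, as suggested for LoRAs that overcook quickly, would halve the estimate under the same assumptions.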
only question is, will we be able to use them
I feel like if I did that long with my stuff it would majorly overcook
likely, yes
I've seen people recommending wildly different training settings though
I've been using mcmonkeys
mine actually did end up being pretty overcooked, but I found better results from under-driving the overcooked LoRA than using the properly saturated one
so it's like those precooked chicken strips you thaw out and microwave
the results came out pretty damn good by using the overcooked LoRA at .5
left is raw SDXL, right is with the same LoRA trained into 1.5 as a refiner pass
I get great results from half powered Loras I train too
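Why running a LoRA at .5 tames an overcooked one: the strength is a multiplier on the low-rank weight delta that gets added to the base weights. A toy sketch with nested lists standing in for tensors; `apply_lora`, `matmul`, and the tiny matrices are illustrative assumptions, not any UI's actual code:

```python
def matmul(a, b):
    """Plain nested-list matrix multiply."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)]
            for row in a]

def apply_lora(weight, up, down, strength=1.0):
    """Return weight + strength * (up @ down).

    weight: out x in, up: out x rank, down: rank x in.
    """
    delta = matmul(up, down)
    return [[w + strength * d for w, d in zip(w_row, d_row)]
            for w_row, d_row in zip(weight, delta)]

base = [[1.0, 0.0],
        [0.0, 1.0]]
up = [[2.0], [0.0]]      # rank-1 toy LoRA (hypothetical values)
down = [[1.0, 1.0]]

full = apply_lora(base, up, down, strength=1.0)
half = apply_lora(base, up, down, strength=0.5)
print(full)
print(half)
```

At strength 0.5 the base weights move only half as far, which is one plausible reading of why an over-trained LoRA can look better when under-driven.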
How does SDXL do avatar stuff without a LoRA?
It tries if you add tons of emphasis and contextual tokens
I've found it very difficult, even impossible, to get "alien" skin colours
and that's the least bad one lol
But generally sucks
I tried to make a red skinned demon yesterday
And it kept either putting them in red skin tight clothes, or would cover them in blood
even with my LoRA, you have to strengthen it with the term "cyan skin", which helps immensely
Was going to say that's not half bad attempt in my experience. Of course it's your best case finding lol
specifying man or woman immediately tries to make it human. I found "humanoid" is a decent alternative that lets me make lizard people somewhat consistently
Loras work really well to nudge existing knowledge into the front more. Like, big boobs. Holy I get humongous boobs if I train 5 pics of big boobed ladies.

Interesting, I'll give that a go.
so yeah, the LoRA is a big success lol
Looks like the guy from Jack Reacher's avatar
oh, cool side effect
Cause it still didn't pick up the blue skin so well, you can actually do different color skins
I got purple one time, but I deleted it to save some space for a grid
Can you make a blue skinned alien dog with it?
but yeah, overall it works really good
Next hardware acquisition, a NAS for grids
not in the capacity to be running SDXL on my PC at the moment unfortunately
😦
got to get everything back together from the hell that was fucking around with this broken 3090
my 3080 is in, with no drivers ATM lol
Looks like it's tonemapped like the first movie. Wonder if a new dataset consisting only of stills from the 2nd movie would improve quality some.
anyways, I have chores to do, I will come back and test some more with the LoRA when I can
I tone mapped the images to fix that, these images were just early and bad prompting
gaddamn this is good....how has img2img been? sadly i'm not getting any great mind blowing results. it looks like images sd1.5 models can reproduce, i'm seeing no leap in image quality sadly
you can see I improved the tones a lot more later on
Jacked
I have a type 😅
without the cyan skin reinforcer lol
unique effect lol
but like I said, it should be able to do any color skin cause of that
even was able to pick up some of the headwear
and the neck radios on the earlier image
it can also blend with specific people if prompted properly
I've noticed that Loras get more of the smaller details brought into xl
got it to blend with Taylor Lautner
Very well too
I have theories on how to improve results, just scared to try and train a LoRA again and get burned
I got stable diffusion earlier but for some reason everything I generate is trash
What're the best settings?
anyone getting these high quality results using img2img in sdxl?
define trash and what model are you using
what checkpoint are you using?
that's really old and doesn't gen good stuff typically
for sd 1.4-1.5 models you wanna use more finetuned models
depending on what you're trying to make
but the one that gens the best stuff right now is the SDXL model
i can send a reference image of what i'm trying to make, quality wise
Could I send you it in DMs, it's a little NSFW but I don't wanna make NSFW but the quality of it is really good
sd 1.4 isn't gonna produce good results
you're gonna want XL model or a finetuned model
define medium end
1650 ti
that's low end
oh
you'd wanna look up finetuned 1.4-1.5 models and probs stick to like 640x640 if you're even able to handle that
you barely have any VRAM iirc
fast food?
try more than one word? i don't know any model that does good with just one word
like "A mcdonald's restaurant" or something
how can i merge sdxl model?
find another model finetuned on SDXL and merge it with that? something like Waifu Diffusion XL for instance?
... how many imgur dead links have leaked into the training set, oof
in the same way you can't merge 2.1 models with 1.4-1.5 i think you need to merge sdxl models with other sdxl models
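A rough picture of why merges only work within one architecture: a basic weighted-sum merge blends tensors key by key, so both checkpoints need the same tensor names and shapes. A minimal sketch using dicts of lists as stand-in "checkpoints"; the function name and the key names are hypothetical, not real SDXL state-dict keys:

```python
def merge_checkpoints(model_a, model_b, ratio):
    """Weighted sum: (1 - ratio) * A + ratio * B, tensor by tensor."""
    if model_a.keys() != model_b.keys():
        raise ValueError("architectures differ: tensor names do not match")
    merged = {}
    for name, a in model_a.items():
        b = model_b[name]
        if len(a) != len(b):
            raise ValueError(f"shape mismatch for {name}")
        merged[name] = [(1 - ratio) * x + ratio * y for x, y in zip(a, b)]
    return merged

# Two toy "SDXL" checkpoints sharing the same (made-up) key.
sdxl_a = {"unet.block0.weight": [0.0, 2.0]}
sdxl_b = {"unet.block0.weight": [2.0, 4.0]}
merged = merge_checkpoints(sdxl_a, sdxl_b, 0.5)
print(merged)

# A toy "1.5" checkpoint with different keys cannot be merged.
sd15 = {"unet.other.weight": [1.0, 1.0]}
try:
    merge_checkpoints(sdxl_a, sd15, 0.5)
except ValueError as err:
    print("merge refused:", err)
```

In this sketch a ratio of 1.0 simply returns the second model's weights.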
I wonder, was your training set collected before or after Imgur did that big purge
Ooo that's spooky
I have many sdxl models and I want to merge them
Many? There's about 3 public ones lol
sdxl
sdxl refiner
wdxl
not waifu diffusion
art diffusion
dreamshaper xl
Hmm so I tried this and it works ok, but as soon as I try to make them a particular gender, it just completely ruins it and then they are just human with red clothes again 😦
add gender specific features at a very low weight
You wouldn't want to merge the refiner, I'm not sure it's even possible.
don't merge the refiner
okay. I just want to merge other models. is it possible?
does that exist?
only a kind of like alpha version of it. https://huggingface.co/hakurei/waifu-diffusion-xl
oh, cool
I didn't know we started finetuning SDXL already
what other SDXL finetunes are there?
Some people got it early. That waifu one is just a test.
Yeah it doesn't want to do it. Adding female even at weight 0.1 instantly just makes it a normal woman
i don't see why it shouldn't be, but it might be more intensive than merging other models since XL models are huge
Is there any reason you want to merge dreamshaperxl with sdxl when it was trained on it?
It would be equivalent to merging dreamshaper7 with the base 1.5 model lol
with wdxl
I guess you could do it for the ollobrains approach where you merge literally everything for fun lmaoo
wait there is already dreamshaperxl??
wow that was fast. is it official or fanmade?
I can already see him merging mega model with sdxl 1.0 when it releases
oh no, not that guy
Hahahahah
I meant like features that implies female without saying it. On 1.5 tunes "feminine" worked okay. Think "midriff" worked as well since it had booru data.
Something similar probably works on SDXL. I made a lizard woman once but I didn't save the image so i forgor
as in proper lizard fully scaly no ears etc. like the government agent kinda lizard
i have a technical question if anyone is willing to help me out:
So i got these product mockups that i want placed on various backgrounds.. how do i tell SDXL what type of product is in the image? im using the image to image masking function to generate the background for it.
i know the image is placed there, but can it know to make the background fit? how do i achieve this magic? 😄
holy shit, it does: https://tensor.art/models/617046080397449350
that's dreamshaper SDXL
sometimes i get lucky depending on the prompt, but sometimes i wish it could have generated a more content aware background with the image im feeding it.
is it also considering the type and size of the product that is beneath the mask?
im building a product photography application based on SDXL API and can't figure some stuff out.
besides the fact that sometimes it changes the product under the mask for no particular reason... even though mask is full black.
maybe inpainting is the route u want to go? alternatively you could try only generating the background and then edit the product in the old fashioned way with an image editor
I've managed to get 1 out of 100 tries and I can't post it because they are naked lmao
If you're trying to force a specific color instead of just species mixing that might be harder
I just want red skin lol
d e m o n
Yeah that doesn't work
As soon as you mention anything to do with a woman
It just stops even trying
Even just clothes
sdxl needs to touch grass confirmed
My demon women have blue skin on my bench prompt though
think I had "body paint" or something in it
One thing it's doing which I understand why, but it's annoying. Is making normal colour skin, but then with very very sharp red lighting
which it just kinda ignored and made the skin itself colored
lighting so sharp it turns the skin red ig
I can get fantasy skin colors but then the whole scene is that color
I'm updating one of my libraries but after I'll test those non-human prompts a bit. I had it working super well in 1.5 tunes like Realistic Vision and Dreamshaper
Yeah it's doing that too
if comfy has BREAK syntax support could try that
Don't believe it does
hm
Every single time it gets it close, they have no top on lol
maybe skip the color on CLIP L but that might make it harder to get at all
colour is only on CLIP G
It sort of does dudes, but it looks like a guy in a mask lol
This is the closest I've got when trying to make them female
But there's always normal skin on it somewhere
I am back
alright, my PC is setup with the 3080 installed and working drivers
3090 has been banished to the shadow realm, as I can't even stand to look at it ._.
no ai other than parti has been able to do this, wow
SDXL in ComfyUI no specific model
alright, time to test my na'vi LoRA more
have fun 🙂
right is slightly sharper?
coherence is great in both
yeah, SDXL Learns a lot for sure, especially since this is not with a text encoder, just the U-Net
and according to a new reddit post by stability, training SDXL 1.0 will be a lot better / easier
It's still trying to learn the ears, so I will be adding more images into the dataset for later
looks good, but too close to the actual movie character so likely heavily overfitted and spitting out stuff close to the dataset
she looks fairly different to the actual movie, and I am prompting simply
thats what she looks like in the movie
so there is already a lot of facial structure variation
looks similar, with a bit of that "handsome squidward" thing sdxl likes to do to faces, with a strong jawline and high cheek bones
Can I merge sdxl with my Lora? Right now i have to choose LoRa (lyco actually I think) or refiner. With both I crash . Last time I tried at least.
I tried merging them in comfy but I get an error and idk if I'm doing it right. I set sdxl as both models and attach the Lora to the second checkpoint loader. I set the merge model node to 1 to just copy the second checkpoint, but it errors
@spark bear
there you go, prompting for some different ethnicities
I will likely be able to generate more racial combinations, and then train these back into the LoRA
youre better waiting for 1.0 and training again.
for sure
I am just dicking around right now, learning some stuff
might change with 1.0
There was a reddit post earlier from one of the SAI Devs that implied LoRA Training was easier on 1.0
But they'd also trained for an insane amount of steps so
¯_(ツ)_/¯
depends on how many repeats
I hope I can still make TI embeddings with 10-15 images 🙂
if you train like how Caith does, thats not much at all
What do you mean ?
30 epochs on 50 images how caith does it is like 20 minutes
On what GPU?
@boreal bough Care to elaborate? I don't wanna give away your info
I mean you just don't need that many
I've done 6 epochs on a image set of about 150 and it worked fine
50 epochs on 1200 images seems silly
if you knew what his method was, you would understand why he does that, tho i don't wanna give away his info on it
its very interesting the way he does it
And it would take hours, even on a decent GPU
it produces good results, just in a very different way
again, not the way he does it
like I said, 30 epochs on 50 images in SDXL would take like 20 minutes on a 3090
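To compare the numbers being thrown around here, epochs and image counts can be converted to step counts. A sketch assuming the common convention that one epoch covers every image `repeats` times, with batch size 1 unless stated; the helper name is hypothetical and this says nothing about Caith's actual method, which is not described in the chat:

```python
import math

def total_steps(images, epochs, repeats=1, batch_size=1):
    """Optimizer steps for a run: batches per epoch times epochs."""
    steps_per_epoch = math.ceil(images * repeats / batch_size)
    return steps_per_epoch * epochs

small = total_steps(50, 30)      # "30 epochs on 50 images"
huge = total_steps(1200, 50)     # "50 epochs on 1200 images"
modest = total_steps(150, 6)     # "6 epochs on about 150 images"
print(small, huge, modest)

# If the small run really takes about 20 minutes, that implies:
sec_per_step = 20 * 60 / small
print(f"{sec_per_step:.1f} s/step")
```

Under these assumptions the 1200-image run is 40 times more steps than the 50-image one, which is why the two sides estimate such different training times; repeats and batch size shift both figures.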
Saying, not the way he does it isn't helpful lol
Thats cause I don't wanna share his info if he doesn't wanna share it himself. Just know that it doesn't work how you would assume, and there is really no speed hit to it
so if SAI is using his method for training them, then it could take way less time than you would assume
I know I was blown away when I tried it lol
It just feels illegal haha
Anyway my point was, that them saying it's "Easy" to train but then providing an amount that would take ages, especially if you don't have some server grade or top end GPU isn't what I'd call "Easy"
and my response is, if they train it like Caith does, then it would run fine on a consumer GPU
likely an 8GB GPU could do it


