#✨|sdxl
1 messages · Page 45 of 1

what's going on with the ticket tool?!
hahaha wow
It's entertaining 😄
also what is that save image node? I dont seem to have all those options in mine
I think we have to take it down...
@meager sundial
IT'S GETTING WORSE
technical support!
oh
it's gone
Aha!
very temporary
same for me, should be fine :)
was hoping to get this tho
there are custom nodes though that could have more options, I don't know where to find them tho
yeah that looks custom
why does it not see my checkpoints?
anyone else have the problem that bodies are getting streched out when you make the height more then the width?
I think its from an extension uhm
wonder if its any of these
did you put them in your models/Checkpoints folder?
image gallery one?
anyone got a comfyui workflow for image-to-image with masking generation?
the model should be a .safetensors file inside your models/checkpoints folder
can I use the refiner in comfy ui?
what's this? somehow it even has my nodes 
I was using "textured skin" as a stylistic prompt. But I think it also depends on what style of photograph you are going for.
its one of these, couldnt figure it out yet
lol just did a "high res fix" in comfyui
Happens sometimes. I've had some luck mitigating it by using the SDXL text encoder node and setting the height, width, target height and target width to match the aspect ratio of the latent you are using.
ill just get em all xD
oh thanks... I made a silly mistake in the "extra model path" document :D
its a node manager, helps with installing
ah that's why I suddenly have more issues on github 
probss
incredible how there is already a node manager for a model that came out yesterday. Where did you find it?
mine aren't in there... thank god
it's a node manager for comfy as a whole, the ui's been around for a while
^
I need to figure out how to upscale without loosing so much detail 
Which upscaler are you using?
Does comfyui has an api like A1111 has txt2img?
I was thinking we need something that detects how busy an image is and automatically adjusts the number of steps on the sampler running on the 2x image
You can upscale by small margins, 1.1x each time wont lose much
The long run
apparently, yes, never tried it personally but there are some examples in script_examples dir
Differentiql changes are almost always superior to to large ones
sdxl is gonna be a game changer
Oh damn okay sorry, that's above me 😅
the more fields the better the quality
Can I use both the Vae and Refiner with the base SDXL model in ComfyUI? Sorry for asking, I am new to this.
Hmm thanks. Then I might as well give comfyui a try, at least until A1111 implements proper refiner usage
are you using refiner for the ultimate upscaler?
Optical polarizers have the same thing
Use 1 to change an angle and you lose between 0 and 100% opacity
Use thousands and you can continue spinning it without losing any opacity almost at all
yep
have you tried lower resolution tiles?
I thought 1024 would be fine since its XL 
Will do
My god here I was thinking I knew about 80% there was to know about 1.5, and now everything is so exciting again - I could watch tutorials and try different things for hours, it's so much fun
Watching tutorials for hours = Fun? 
sounds like absolute pain with how every channel tries to maximize time, peddles sponsors, ads, etc
this is the manager node for anyone who wanted to kno earlier:
https://github.com/ltdrdata/ComfyUI-Manager
The dumbest ones are those that take literally 3 seconds to do with 10 minute vids
yessir
I abhore it that the dislike button is gone
the longer you watch tutorials, the more things you have on mind like "Oh I gotta try that one!" And you just watch until you can't take it anymore and gotta try it. It's a lot like edging
well yeah you gotta be picky about the chanels you watch, true
clickbaits be like: REVOLUTIONARY FREE AI MODEL
Is SDXL the 1.5 killer?
10 things you should know about ai/sd scary
Went wrong
thanks for sharing your workflow and how it is set up! I really like some of the ideas
you must be really suspectible to clickbait if you are consinstetly playing those ones
my pleasure. It is heavily laid out for my personal usage
I think it's more of an issue with every google search yielding those,but probably getting too offtopic 
Short vids wont generate lots of cash, so yt doesnt promote them, they are usually at the page 50 or below
page 50? if you look for a specific extension or method or something you'd see one in top 5
Top 5 if you are looking for something that is known
are you just searching ' ' and scrolling until you get a vid you need, i dont get it page 50
I mean, maybe?
Thank god GPT is here
if you watch a lot of channels you trust, the algorithm will recommend you more of those. Just don't be tempted by clickbaity bullcrap
Hey, that's me!
skin still seems a bit more overprocessed then everything else but pretty good
As an AI language model I can't recommend any YouTube tutorials since they might be unsafe.
What does the ascore inside the Clip Text Encoder?
Btw, GPT has lost its accuracy due to PC bs... it cant even do math now
I meant just asking gpt directly instead of yt/google/reddit surfing
Just use Falcon Locally with OogaBooga
Cant
Takes too much space with the stuff it downloads
how can you outpaint with comfy?
no chance is asking GPT about stuff like sdxl that have came out 2 years after what it's trained on is better than just google
Oogabooga with Falcon 13B is like 20GB? 🤔
You can collect all the documentation on SDXL and train a LoRA on your LLM lol
btw if you thought GPT was too PC try Llama-2 with the default system prompt, it thinks even mayonnaise recipes are harmful...
I meant what it downloads into disk C
had like 70gb free, after oobabooga had like 35, and i still cant find where in the FUCK DID IT PUT ALL OF ITS SHIT THERE
eh local gpt, especially one small enough that you can train even loras for also doesnt sound like it'd be that insane compared to just ctrl-f-ing and skimming the documentation youd train on
lol that sounds odd. Oogabooga has a one-click installer that's usually quite slim. I'd recommend having ai stuff on a seperate SSD from your OS-Drive anyways tho
but i guess gpt4 + large context + docs yeh i can see that
true
AHHH!
I have a fully uncensored model, ask it how to make an explosive shuriken and it will lend a hand
Yeah im still looking for the right settings
same ^
yeah just saying the base models are kinda out there with the safety stuff
it seems like there are 2 comfy workflows now, one is using the refiner on top of a complete image from the base model and the other one is refining an image with leftover noise
nous-hermes trained on llama2 works great for that matter
that does look slightly more realistic at first glance (tho whether it is better aesthetically, not sure)
i'm not sure which one is supposedly corect
Just browse the huggingsface leaderboard for LLMs and look for the score on HellaSwag, as this seems to be the best benchmark rn
The safety is whats killing their intelligence sadly
They became delusional at a certain point where it lies to you due to censorship
if you use gpt4 through the API (if you have access) and give it a different system prompt it's way less censored
Although I don't disagree, it's kind of hard to pinpoint what exactly made chatGPT worse. I doubt that the only problem is HRL and alignment
I wouldnt say there is a correct way, just different approaches and we will see which yields the best results sooner or later
yeah we can be pendantic about what's correct or not, the community can find new ways to use it
but i guess official way maybe a better word for it
Well, i know when it happened
Sadly talking about it will get me in trouble here
i guess i could look at the original repo to see roughly how it's "supposed" to work
I love nous-hermes
I tried loading in your workflow and got this issue
fundamentally they are the same workflow just with different levels of noise
there's a new model from july (you can see it in the api), but you can still access the model that was there from march and is 100% unchanged if you need
I'd assume the example comfy workflow for sdxl is what would pass as official, since sai and comfy seem to be an item now
you just need to use the api access instead of the site
You can replace it with a normal image save node
hey I'm working with a developer on image creation for fun, what do you guys use for this case? google colab? With colab the session interruption is super annoying any solve for that?
Okay
Agree with this, there's no right or wrong within limits of course. 🙂
Im using gpt4 for coding, but ill keep that in mind for math and phys related stuff
Early 2023 was the last time the model didnt screw up math badly
if you need an LLM for coding, check out "Starcoder"
it was trained on a lot more code afaik
I think i know that one
Visual studio addon right?
for coding there's the alpha testing for the new gpt4 version of codex or whatever if you request access from github
but I never tried it after getting access
you can also try phind though it's getting dumber lately somehow
I haven't tested it personally but thought it was another .safetensor model for the OogaBooga Webui
OVNI files unclassified
wow this is actually great 😂
Mmmm yeah oobabooga is a no go
Ill try to load it in a collab ty
I tried the new chat copilot for vscode. I didn't find it very useful
the best copilot is still your brain as of now
Gpt4 works great, but you need to be precise with what you need
exllama has pretty minimal requirements and webui if you just want to run some llm locally btw, ooba feels kinda bloated
Question... is there going to be an official version of this? Is this a good way to fix it or is there a better way to use FP16? https://civitai.com/models/117188/sdxlfixedvaefp16
This is merge model for: 1. 100% stable-diffusion-xl-base-1.0 https://huggingface.co/stabilityai/stable-diffusion-xl-base-1.0 2. sdxl-vae-fp16-fix ...
true, I think ooba is more for building upon and creating a front/backend for products
I mostly just use copilot to autocomplete my code. When I tried the new chat thing, I found a lot of the suggestions were no good
3.5 screws up a lot
And i mean a LOT
Use 4
if this is the same as the huggingface vae then it works well, I trained a lora with it and wasn't getting any nans
I'm not sure what the legality (both for my contract and stuff like gdpr) are with sharing everything open in my vscode or repos to an US company so I've never tried it
yes it's a merge based on this https://huggingface.co/madebyollin/sdxl-vae-fp16-fix
Im doing a project in dataset management and training, gpt4 is priceless
Sorting it all, center cropping, finding corrupted pics that are most likely useless, binary corrupted img search
Dataset purity sorting
Everything
In all honesty, without gpt4 it would be impossible
the few times I tried gpt4 for more complex stuff it helped but I still had to change it at the end + prompt it a bunch of times and at that point it didnt seem faster than me just reading online, trying and adjusting
You need to make sure that your logic doesn't have a backdoor
Or it might givr you a statement that it will just autofill that backhole to close the function without warning
And make sure to not ask it to make anything longer than 2k tokens
I have access to 16k I think or maybe 8k
So ~50 lines or less
How well does it remember at 8k 16k tokens?
I remember it losing context with more than 2k with its acvuracy
(just checked it's 16k) not sure haven't experimented recently
Mmm i remember a graph comparing some models to gpt4 at longer context
It got worse with more tokens, but its better than anything else
There was a llm that could handle 10m or 2m tokens, but it had ~ 30 40% context accuracy
Relative to what they tested
I'd still rather get a finetuning API so I can effectively handle really long 'contexts' like we could do before
Mmm im going to try to make one in august, one with memory instead of token knowledge
like with gpt2 I just trained it (for a month on free colab) on all my conversations and I'd want to recreate that for gpt-4 https://svilentodorov.xyz/blog/gpt-15b-chat-finetune/
Previously in this series - finetuning 117M, finetuning 345M
Titans are woking up
Mmm you're not starting to train it with hard material right?
You should train it from easy to hard
Much faster learning
eh, you do a bunch of epochs and at least then it's random sampling from the training data
given that you start from what it was trained on already, nto from scratch
from scratch it'll take forevr without a big budget I'd imagine
Mmmm try to sort it a bit
Btw, mind sharing the notebook?
I've last done it with gpt2. You're much better using the llama 2 stuff if you are doing it today
Got it
so far SDXL 1.0 looks about the same as SD v2.1 just with better PR and marketing
rememmer this is by the same guys that wanted to cancel Automatic1111 and now doing the same by designing the XL model in such a way that it works better on ComfyUI
ive never seen sd v2.1 handle text
I used 2.1 since release up to few months ago, it is obsolete to 1.0
not to speak of 2.1 being way more lackluster at understanding prompts in general
2.1 was bad at everything basically beside having an exteemely fast embedding training time
It needed sometimes ~600 prompts in the neg to get same img quality as none in sdxl
show me base 2.1 creating this masterpiece:
Someone have a good tutorial material to make a Lora with XL 1.0 ?
Eyes, hands, anime, faces etc... obsolete relatively to 1.0
I had some with my embeddings
Crystal embeddings, bird embedding and 3d can do that in 2.1, but with messed up eyes and feet
Yeah it's definitly possible but not just with the base model
I think auto just wasn't very interested in the whole nda and stuff so he's lagging behind
Can you use Roop on Auto with SDXL?
There is a technique to do that with 2.1 base
Someone made a prompt that made gpt 3.5 do prompts based on your input
It was extremely precise
I'm really surprised how much effort people put into not sharing their Workflows. You take advantage of an open source work that took years to develop, you steal the prompts, the settings from everyone, to end up being selfish. It's profoundly lame as reasoning.
Insane quality
I missed something during yesterdays live. they said something about sdxl now being more drag and drop ready. what was that about?
Did you know how are made Spaghetti ?
I'd imagine they meant it's more usable out of the box rather than with extra tools and finetunings
i think there was more to it than that but im not sure
would be really great if they could reupload
the audio kept cutting out so not sure
they talked about some new ui thing, might have been that
wondered if they somehow merged the refiner into the base model
pog
my GTA V workflow?
for the people that dont know about this website, it's awesome:
https://supagruen.github.io/StableDiffusion-CheatSheet/
seems to work better than with 1.5 too
Thanks for sharing 🙂
Sure, sharing is not really the watchword these days
Im a n00b with comfyui, do you know how to use the sdxl1.0 vae on It?
Also this one is good: https://proximacentaurib.notion.site/e28a4f8d97724f14a784a538b8589e7d?v=ab624266c6a44413b42a6c57a41d828c
prompt 1-3: “a portrait of a character in a scenic environment by [artist]”
prompt 4-6: “a building in a stunning landscape by [artist]”
work in progress (ノ*・ω・)ノ*. ☆゚ @proximasan @EErratica @KyrickYoung @sureailabs
For information about this project see: About: Image Synthesis Studies Database 🦜
Navigation:
- use Ctrl+Shift+L/ Command+Shift+L...
I mean, I have the workspace for using SDXL 1.0 and the refiner but I don't see a way to add the VAE
there's a vae loader node, you can plug it to encoder and decoder instead of the checkpoint vae
vaes should be placed in models/vae by default
I'll try
Yes, I did that, so the problem was I wasn't using the right module.
So, It would be like this:
yep
Thanks, It was pretty obvious once you told me, but I wasn't able to pull It by myself.
can i install sdxl on i7 intel iris xe laptop 8gb ram?
What GPU?
I think it's integrated gpu, intel iris xe
Then its a no
You need a beefy GPU
Even if the generation of images is slow, it is not possible?
A good GPU, and 32 GB of ram is suggested
is someone facing problem of similar generation with small variations only? whenever I try to generate images by adding tag "Art by Yoji shinkawa", I always get similar images unless I change rest of the prompt but still, it mostly follows same vibes. Unlike in other models, every other Different seed always give different image.
Cpu mode might do it, but you need a pruned model with 8gb ram
you should be able to run it on cpu with enough ram but the speed will be abysmal, also fp16 will be upcasted on cpu iirc
so it will require more ram than gpu
768 img should take you ~2 to 5 minutes with cpu
comfyui runs on CPU but good luck with the speed, It's faster to draw it yourself in paint.
yeah, CPUs can't do FP16
I have worked on it in google colab but it seems that I have to pay
Yep, If you use too much colab they make you pay
Collab gives you free use for a certain amount of CU, and it wont give you a gpu if they are all used
Buy cu, its not that expensive
But buy it with free use, not the one that disappears each month
just use different google account. lol
nowadays everyone got multiple account
there's only so many google accounts you'll be able to make before they start asking for verifications and stuff
2 google account basically allows me to use google colab for whole day.
It is the first time I have used colab I have not seen any option to use what you say for free
idk
I do alot of training and automation tasks using google colab
longest runtime was 10 hours.
it depends on your overall usage and luck I guess 
after that, I wasn't able to use it anymore, then I switched to another google account to continue train my dreambooth.
🤔
I'm saying switch to another google account...
You can run Auto1111 with SDXL model
tho it's slow, better option is use ComfyUI but i don't know if it has colab version.
what account?
I don't understand sorry
hello - when is v1 available on Beckrock please?
You said that google colab is no more giving you free access, right?
name this creature
It is the first time I use it, I have tried to start this and I get an error. https://colab.research.google.com/github/TheLastBen/fast-stable-diffusion/blob/main/fast_stable_diffusion_AUTOMATIC1111.ipynb#scrollTo=p4wj_txjP3TC
I almost thought of Night King...
what kind of error?
in the last step it gave me an error
in last step?
I'm going to run it again I don't remember anymore, I'll give you a screenshot wait
yesss
Ngrok token...
register and get a token from here. https://ngrok.com/
If you don't provide token, there is no way to access hosted site by colab, Ngrok acts as tunnel between you and colab. because you cannot directly access colab's local web server.
QUESTION: on my video card 1080 ti 11 gb, 1 image is generated for 1 minute sdxl 1024-1024. is it possible to speed it up somehow? what speed do you have on which video cards?
30 seconds for me on RTX 3060ti. 8gigs vram
Hey fellas. I've been used SDXL on Auto1111 and I've seen that switching btw refiener and base takes hella long.
That is if I use comfyUI*
almost 8 minutes in auto1111.
I have 3060 12GB and 32GB system ram.

I already registered but I have no idea what that is
Are models stored in HDD?
refiner by itself needs like 6gb additional vram so the unets are switching between cpu and gpu, hence the slowdown
Here, now I do like almost everyone here, I show the images of my good Workflow, hiding my EXIF data so as not to share. It's fine not to share when working on open source.
Yep. Should I move to SSD?
ofcourse dude.
I also use comfyUI. I have 50 seconds generated . it's a very long time. ((
Oh my. I always installed Auto1111 on my D HDD.
Let me swap to SSD and test it that way.
It's 130 seconds if I also use refiner. 
hdd mostly affects the initial model loading speed to cpu ram tbh
like on cold load, but if you're keeping them cached it shouldn't be an issue
no, nothing, it was great image you generated there buddy. Legend of Zelda!
In Auto1111 with how much denoise should I use the refiner?
yeah but It needs alot of ram to store both refiner and base. and it isn't the case for Auto1111, it keeps only one which you have currently selected.
Is that supposed to be sarcastic?
you can change how many models are cached in ram in settings
halp! colonel sanders is haunting my house 
Oh, how do I change that?
Nah, It was an instant reaction for being so closed with workflow, but it's fine, no offense. Sorry if that offended you.
I would like to have both refine and base cached if I can with 32Gigs ram
just ctrl + f cache in the setttings tab, I haven't used auto for a while but it should be there
is it just me or is sdxl not actually that good
@visual glade (or indeed anyone else) is there a way of embedding URL links inside workflow ?
Oh nice. Yeah that worked.
interesting fail
Oh you wouldnt say that if you have seen the 1.5 base.
very fluffy 🙂
I have seen 1.5 :D and yeah that's kinda a bit bad too, 2.1 is okayish
This is exactly where I want to go with my message, people here come to take everything, and do not share when they find something interesting. I remember a time when everyone worked to pull SD up, not to keep its secrets like a jealous bird incubating its eggs.
All the images in the chat have been removed from the EXIF data, so as not to share... Well done guys, great mentality
dam, It looks super cool tho
hello everybody
Hmm. The 1.0 Vae seems somewhat baked.
chill buddy. many people copy image directly instead of uploading, Copied images are stripped of metadata.
what a time te be alive
I am playing around with how I prompt it though
This is after refiner.
Either I am doing something wrong or there is something wrong with the vae.
there is something wrong with the vae, try the fp16 fixed one from hf
hmm 
https://huggingface.co/madebyollin/sdxl-vae-fp16-fix/blob/main/sdxl_vae.safetensors
this one doesn't have the weird horizontal artifacts and works with fp16 training
lora - around 10gb vram + 10gb ram
yours looks way better - different style. but still similar in composition
finetune - a fuckton
yup
I'm using 9.5gb right now with 1024x1024 buckets, so yes, should be
kohya seems to have fixed some memory issues
Damn thats really well optimized.
Can you send the parametres you're using?
I need to go and grab a pack of cigs and then I'll be right back.

I'm not really doing it the script way but hold on
Feel free to try my workflow 
It's free upto a point. I use Google Colab with SDXL 0.9 for my gens
hey buddy, you seem to have alot of knowledge about comfy, can you please tell me how can I interage things like BLIP or deepbooru in workflow?
Use any ComfUI notebooks and you should use CPU and GPU for free to a certain extent. Use two accounts to use it for 10 hrs
This looks pretty well: https://civitai.com/models/116197/jenna-ortega-sdxl
1.8GB's for LORA though
What network dimension did they use?
vae = "madebyollin/sdxl-vae-fp16-fix"
network_module = "networks.lora"
network_dim = 64
network_alpha = 64
network_train_unet_only = True
mixed_precision = "fp16"
save_precision = "fp16"
caption_extension = ".txt"
full_fp16 = True
optimizer_type = "Adafactor"
lr_scheduler = "constant"
lr_warmup_steps = 0
learning_rate = 1e-4
min_bucket_reso = 512
resolution = 1024
max_bucket_reso = 1536
bucket_reso_steps = 64
max_train_epochs = 1
save_every_n_epochs = 1
train_batch_size = 1
gradient_accumulation_steps = 10
vae_batch_size = 1
enable_bucket = True
flip_aug = True
gradient_checkpointing = True
sdpa = True
cache_text_encoder_outputs = True
cache_latents = True
cache_latents_to_disk = True
noise_offset = 0.0357
these should be the ones that matter somewhat
I have the token, what do I do with it?
It seems that ComfyUI is the way to go for SDXL based on few posts I have seen in comparison to A111. ComfyUI on Google Colab takes about 125 seconds (after loading the models) to generate
I'm not sure if there are any nodes for deepdanbooru or blip, you'd probably have to code it yourself
paste it in input box where it asks for ngrok token.

I get around the same time indeed with COMFYUI ON colab.
Thank you!
comfy isn't really the best tool for tagging datasets imo
What about Lora training? are you using comfy ui for lora?
What's your preferred Sampler for SDXL?
nvm, you are using koyha.
I'm using kohya's scripts for training but you can use the loras with comfy, yes
okhy, thanks for info.
Do I have permission to download this? It's the best thing I've seen in months
my depression is gone
nah dude, I don't use colab.
It's local for me.
The first one is without the refiner and the seccond is with refiner.
The ees always have artifacts, what might be the cause of that?
colab is fine until you boot up a UI with it
what's your ratio of steps from Base to steps from Refiner?
It's same for me too
for now, I use base model only
plus it's faster to use single model.
50 base, 10-25 refiner. Using 0.3 denoise.
Are you using the VAE "rasterized" image for the refiner or the latent image? I had much better results when handing over the latent
I have no idea how I can send the image without VAE.
"Where do you see yourself in 30 years"
How can I send the latent image?
If Auto1111 then you are using Img2Img for refiner meaning rasterization. simply put you have decoded latent into image. or else it's not possible to use Img2Img.
Oi, I'm using Auto1111.
Ah, sorry
what should i do now?
Guess I'll have to wait until they incorparate a decent workflow.
you got the link buddy
delete it before anyone else abuse it.
Probably. I was 100% for auto1111 aswell but switched yesterday to Comfy, I don't regret it
Cause this loading base and then refiner is not really optimal.
Delete the image*
risky frisky
Living on the edge
blue text is link. click on one which says ngrok as domain.
it is bad? what's happening?
Is there a way to zoom in/out ComfyUI on Android? It always opens the menu to add a node
so glad that comfy is taking off right now
Does Comfy have inpainting support?
I don't see any link
Node Based Workflows on android? I'd recommend just not doing that, sorry if that's not very helpful 😅
I think so, haven't tried it though. Would interest me too - has anyone tried it?
can you dm me the image? I'll point it out for you.
yes thank you
you can do inpainting in comfy but there's no way to use a brush inside the ui like with auto, you'd have to make the mask yourself in photoshop or something and then load it (can be a separate mask or alpha channel)
Must be rough for poor automatic and what he went through with all the leaks and drama involved, but as a consumer, Comfy is fantastic. I'm used to Nuke and Davinci Resolve (VFX editor by day) so node-based workflows are much preferred imo
there is a clipseg node that kind of works for inpainting, you just specify what in the image you want to edit and it makes a mask for you
Thanks for the info - this out-of-the-box painting tool in automatic always blew my mind, shame it's not in comfy. But oh well, could probably code a bridge between Gimp and Comfy or smth
Oh! Like the SEG-controlnet?
yeah
You can use the load image node for inpainting. right click on the picture > open in mask editor
that's cool!
Oh! Wait that's brilliant
also preprocessors https://github.com/Fannovel16/comfy_controlnet_preprocessors can do segmentation as well
oh there's a mask editor now?
No way! That already works with SDXL?
SDXL1.0 Base+SDXL1.0 Refiner. I think i fell in love with SDXL1.0!
always been there
mind blown
controlnets don't work with sdxl but preprocessors only rely on pixel space images so there's no problem using them for various stuff
Did they filter out nudity in sdxl training lol
well yeah it's not a problem to generate a depth mask, I'd rather have a way to let the depth influence my result in the way controlnets worked with a1111 afaik
that's probably going to take a while, I reckon illyasviel would have to train new models
depth maps are still pretty useful on their own
we already have some cooking 🍳 just not quite ready
oh that's nice to hear 
This is correct.
love the colors on this!
also my gpu seems to go up to 106c whenever I generate a picture x)
Icey, Icey
Well anyway even if SDXL is harder to prompt right now, it does look promising for the future. I can totally see how this is a step towards getting really good high quality images out of image gen AI
It does what it does really well
starting to get messy on full body SDXL1.0 Base+SDXL1.0 Refiner.
when you put 1 too much 0's into cfg
why the size difference?
I've noticed that generating images with no prompts in SDXL results in more coherent outputs than SD 1.5's nightmare fuel
it was cold outside...
(sorry)
probably pruned, right?
No, it's reduced precision, fp16 vs fp32
ya'all running comfy?
yep
comfy is comfy
A1111 is just too slow
anyone on AMD GPU ?
been trying comfy with 6950 xt and i have no luck over 512 generations
About how long does a generation with 50 iterations split across base and refiner take in comfy?
Someone told me about SD.NEXT which is supposed to make AMD GPU with windows work
I heard AMD is best to work on Linux instead of windows....idk
haven't had the time to look at it yet
Im using 60 steps total on a 3080. takes just under 13 seconds
yeah i read that PyTorch with ROCM might help but havent bothered testing
A1111, haven't tried comfyui yet but thinking about it
Base model is supported but refiner usage is not implemented yet
You have 8gb or 10gb version?
Has anyone had any luck finetuning your own style with the new SDXL model?
10
lora, yes
I see, I have 8gb
Is that using Kohya ss?
yeah
It doesn't realy follow your prompt. 😄
Is that at 768x768? I'm only getting 1.2 it/s and it took me 2m 18s to make this on my 3080 😢. I'm running my own implementation in Unreal so I have a lot of optimization to do
SDXL will do nude if you prompt tribes it seems
full 1024 sq
2.68it/s
good idea, ill try without tribes promt 🙂
pruned versions I would surmise
keep in mind that's probably only the generation part, with 10gb vram you'd have to offload to cpu between base and refiner and also use tiled vae decoding, all of that adds a lot of time
Training Precision = bf16 <- fp16 can and will cause NAN errors in many situations
Clip Skip = 1 <- no novelai in sdxl
resolution = 1024x1024
Gradient = true by default
Cache Latents + to disk on by default
Net Dim = 8
Alpha = 1
Train on = Unet only <- those who dont know the difference, should REALLY use unet only for now
Learning Rate = 1e-3
Optimizer = AdamW8bit
LR Scheduler = constant with warmup
Warmup Ratio = 0,05
Save Precisioun = bf16
bucket res = 512 x 2048
offset noise = 0 (cause then it auto sets it to the correct value)
Notes:
• good captioning is still relevant
• Dataset size of 30~60 to avoid typical issues. 10 or more will work fine if you know what you're doing. under 10 works if you really know what you're doing.
• set repeat to 1, do 40 epochs. 20 epochs will prob be your 'ideal' lora. so save every few epochs. 200 for super complex, multiple concept/object loras.
•image count x repeats x epochs = steps``` This is still good for lora training right
also I'm getting around 1.3it/s at 1024x1024 with comfy and a 3060 12gb for the record
Hi, I need some guidance on how to preserve a white and transparent background persistently. Occasionally, white backgrounds are successfully generated, but mostly the generated color appears to be a shade of grey instead or a totally different background. I've noticed an inconsistency in the results, although I've been using the words 'white' and 'transparent' in my prompts quite often.
Yeah, I swap models as needed which adds a few seconds delay in and I have attention slicing, cpu model offload and vae tiling enabled - I use Diffusers as my backend
you can use fp16 with the fixed vae, in fact you should use full fp16 training, it will reduce the vram usage a lot, clip skip is unnecesary, other than that it's all as normal, kohya claims adafactor works a lot better for sdxl, offset is 0.0357 by default for sdxl iirc
this vae solves nans in training with fp16

that sounds about normal then, diffusers implementation is probably a bit slower than comfy
Ah cool, good to know. Unreal is taking up a few gb VRam as it idles but I disable realtime rendering in the viewport when generating an image which doubles the generation speed
im kinda sad sdxl is so not really working in auto for me, hope they improve the performance. on Comfy its just fine
When will sd beat midjourney
already does
is there a workflow update? or which workflow are you using in comfyui? mind sharing?
Is this better than mj v5
How
the answer is: depends. But in some areas it surely outperforms MJ whereas in others it falls behind
nono im just using a normal workflow, just comparing comfy with A1111
i cant use sdxl on Auto bec. of the performance
if you want creative output that does not do exactly what the prompt says MJ does a better job at suprising you, but if you talk about doing exactly what your prompt says SD outperforms MJ by quite some ammount
question though: Does someone know how the "style" works. Is it just a text addition to the prompt, a lora or a specialized model of the base?
a picture in comfy takes max. 2 minutes, when i generate in Auto 1111 it takes 20 minutes or more
Nono im saying the quality Which is better
WAS NOdes Suite ahs a Image Remove Backgtround Node.
You could further refine by then adding/merging theis to a plain coloured background
depends.
Auto1111's performance never was really good, especially compared to diffusers, but with SDXL it got unusable for me 😄
You mean in A1111?
SD, MJ and Firefly are still having different target audience pretty much
no, on the website and the bot
#✨|sdxl message
As far as I know they use these, could be wrong tho
I
how do people get it follow the prompt so percisely
I've been using them and style is implemented quite accurate
Any tips for using the Kohya ss for SDXL on Google Colab?
I just have "drawing and 3d" on negative prompt
Keep running out of memory when I start training
Anyone know how I can run SDXL models in automatic 1111 I've updated it but I was told SDXL works by loading two models or something ?
In A1111 right now simply use base model
did they merge them or does it simply not support refiner yet?
do you have qauility loss doing that ?
Doesn't support refiner yet. Technically you can use refiner for img2img but that is not really the same
not noticeable
Depends on picture you prompt
yeah i know 😅
Some get more impact than others
if you want to use Base+ Refiner + Upscaling workflows with SDXL please use comfyUI. Much better control
Yeah comfyUI is the go to right now
any links to how to set that up ?
and if you have an working auto1111, you can reuse models+venv in ComfyUI so the switch is quick and reversible
Cool, easier will be for me to try out
You can also drag and drop the images made with comfy to get the workflow used
I'm seeing on Civitai a few of the people putting down SDXL models now don't seem to have refiners
4x-ultrasharp is better
Check out this video. This has good directions for Comfy on Google Colab and also locally on your computer (Assuming you havea a powerful GPU). - https://youtu.be/FnMHbhvWUhE
Updated for SDXL 1.0. #ComfyUI is a node based powerful and modular Stable Diffusion GUI and backend. This UI will let you design and execute advanced Stable Diffusion pipelines using a graph/nodes/flowchart based interface. In this video I will teach you how to install ComfyUI on PC, Google Colab (Free) and RunPod. I will also show you how to i...
brilliant! thx a lot
oh the old "Powerful GPU" sketch again
What exactly is a "powerful GPU" ?
I quite happily use a 6 year old 1080Ti
You do not need a latest generation card.
Sure its nice to have but you don't "need" one especially if you are just tinkering for fun
Ok does anyone have a config for sdxl loras
Cause this is getting real questionable how people are even making one that's 49 mb
Cause I can get 90mb but that's as close as I get

And most of the answers I get are fuzzy or do not line up well.
i would definately still call a 1080ti powerful. it was flagship in it's day
its just a phrase that grabs my Goat
When I started I was quite happi8lyusing a 980Ti, I only upgraded for the extra 5Gb VRAM
oof. wasn't the 980ti the one that split the vram so it had a fake 8gb?
no , it had 6Gb VRAM OOTB
i was an amd boy back then because i was only playing video games and was happy to compromise on the price
i was thinking of the 970
8 dim 8 alpha should be around 50mb I think
the other params don't really matter much size-wise
conv-dim too I guess
which was a 4Gb card but yes I do seem to recall something about some mfrs doing something dogy
Question
So if I use an anime character for a sdxl lora
Given sdxl's limited dataset due to photorealism, I shouldn't expect it to pull it off until we get an anime model by someone right?
pretty much, yeah, your mileage may vary
Yeah cause its a 4 arm character
what do you mean limited dataset?
you can totally train a decent Ranni I think, she'd just be less anime-ish
Because you can't do anime that well with SDXL
Also meant a char like this
Purple arms, 4 arms, etc

It can't do it, and the lora doesn't DO IT
sdxl isn't exclusively photorealism
It's HEAVILY though
I say that becuase 90% of posts are people doing realistic characters/creations vs anime type stuff here.

And the anime ones are extremely mediocre in comparison to sd 1.5
I don't think it was trained on 6 b m booru images like nai so it's not that great at anime tbh
i don't know what that means? people are mostly generating photos with it yeah. anime only gets so good. 1.5 exists for that. but here, simple prompt Anime robot in the style of studio ghibli, ponyo
Yeah
That's why I say should I wait til someone makes an anime model
sdxl is an anime model. it has crazy anime capabilities. comics too.
Aight let me try that rq
I mean all 1.5 anime checkpoints have their parentage in NAI and not default 1.5. SDXL has better anime outputs than default 1.5 tho
ooo, thx for the link
@visual glade i saw you mentioned yesterday that SDXL TensorRT works right?
does it have any limitations compared to a normal SDXL?
No problem
What should I do if I use SDXL in auto1111 and it works badly?
How do you set Train on = Unet only in kohya? Just set TE LR = 0? Or a flag?
Comfy
Im letting everyone else crash and burn to learn then asking to get the easy way out

ok so yeah they were right
Need a base model + a refining model?
i need to reference an anime artist lmao
I'm adjusting your workflow a bit with sytan's workflow I'll send an image after I test a bit
Let me know what you come up with 
Only base required but have both. Depends on the workflow
wtb memory upgrade for my 4080 so i can train stuff lol
not trying to be right. just trying to dispell disinformation
Its called 4090 
Its prolly my lora based on these circumstances
@uncut steeple 
Or maybe its just the anime dataset is too honed to avoid customization of this calibre

there are a lot of rumors being pushed that suggest sdxl needs a massive amount of training to be capable of anythhing. it's not a photo model. it's a foundational model.
4 arms do be weird
I know you dont wanna hear it, but I have 2 
"Oh it did. I think it took about 3.5 hours to train this for 2 epochs on my 3090. I've trained my last few 1.5 LoRAs at 1024 this took substantially longer."
i've been making loras with my 4080
batches of 2
not perfect, but it is still quite good, some fine tuning and it would be a lot better
the texture of the skirt 😍
im sure civati ppz are on it
I'm aware
I have the entire collection of civitai on a document.
Models and loras

I'm the guy. The one who decided to be a masochist listing this all
So im constantly updating to hope some loras come out too
https://huggingface.co/thehive/petrichor-SDXL-Finetuned-Fp16
Doesn't seem to be released. But here sdxl anime finetuned model
I aint donwloading 1gb+ of a lora tho
Why does the bot make good looking anime pictures but locally it looks bad with the same prompt. How do i add anime as style like the bot has?
its released just epnding review of application
@trim orbit ah, ill look into that
try adding "Detailed, hq, high resolution, " etc... any time i add anime to a prompt it just goes full 90's anime
the L prompt, like in here should do it - wasn't made by me, but a remake of my ver
or what i have here(my workflow, made by me)
Already tried that, still doesnt look like the bot, but thanks
These look amazing, i will try that
Thanks
there's a prompt node that has a "style" option, much like dreamstudio and the other bot services offer. i haven't found an actual release of that node though. Joe has teased it in some images. i think you'd want something like that to invoke the internal "style" or however they're calling those presets
Try this:
Style: Anime
Positive: anime artwork {prompt} . anime style, key visual, vibrant, studio anime, highly detailed
Negative: photo, deformed, black and white, realism, disfigured, low contrast
Thanks, i will try using comfy ui and look for something like that node.
Where do i put the "Style:Anime"
Okay thanks, will try that
Of various styles
Thx
Anime using SDXL base+refiner
Damn, that looks nice
Do i need to use Comfy for using the Refiner?
the people (especially legs) are really messed up
the metadata seems to be lost on discord, is there a way to get it or see your prompt? Or would you be able to share it?
Put it in Png Info
How would I use a LoRa.safetensor with SDXL? Using diffusers
I can see the Prompt
I should retry this in SDXL1 (this was 0.9)
or if it was generated in COmfy open in Browser, savem dragndrop into COmfy
Tried that locally and as with the bot. The bot (right) still looks way better somehow. Installing Comfy now
Way better lol
I'll atttempt to fine-tune a LORA with Samara Weaving now.
its i s alearning curve but I've settled on a Daily layoutthat works for me and is a tadge more pleaseing onthe eye (plus its quicker & more memory efficient than A1111 for many people)
Bot>>>SDXL
that's wicked good. 0.9 in some ways makes better pics since they structured the release of 1.0 for better training. my loras i made in 0.9 even affect 1.0 way better
If I didn't fight god every day, that layout would be terrifying.
The bot sdxl looks so much better idk why
really ?
Quite structured to my Eye
You have your main parameters on the LH side, the main image output in the centre and a HiResFix thing (with options) on the RH side
Would be cool if the showed us how do use "style" locally
Here is the weaving dataset if anyone wants to try aswell.
oops. I am one buck.
Yeah don't group them in with that idiot

I like how an autism meter gets called out but pets on a plate don't
that's an angry cloud!

Funny thing is i am autistic, so he was correct
Damn
zamn
Did you use Comfy?
yup
Are you using multi prompts?
yup
Studio Ghibli, Anime Key Visual, by Makoto Shinkai, Deep Color, Intricate, 8k resolution concept art, Natural Lighting, Beautiful Composition
, artistic, creative, contrasting, detailed, 8k, expressive
Thx, will try that now too
the bot is sdxl but with a lot of the settings randomly changed for you to a lot of best case kind of settings.
Empty class room anime
nice
does the supporting terms make any difference than putting it on the linguistic positive? or is it just for workflow?
Is that an extension?
How the fuck do you have that in stable?
is what an extension?
comfyui is engineered diferently than gradio UIs
multiprompting was introduced in MidJOurney I thought.
Stable has never had that.
has anyone tried something like xyz grid in ComfyUi to check samplers?
has it now. SDXL has two text encoders
I still can't understand how that can be.
It just has multi prompting?
oh shjiot
My Prompt Input Area
random?

one clip layer takes a linguistic style prompt. so word salads work less good there, and the other is classic vit clip and works just like the old prompt styles
yup, its a WAS Suite Node, pulls a random prompt via API from Lexica
i've been manually using one button prompt extension in webui, to create a bunch of randomized prompts then feed them through comfy ui
I tehn run everything through some text manipulation steps to feed to the correct place and also to output the prompt used to a txt file (still have some work to do on that)
take them round the back of the bike shed and whisper sweet nothings to them
No i mean
how do i get the nodes of a picture on civitai
it has 29 nodes but idfk how to copy it to comfyui

How did you make them do a sharp turn?
metadata should be in the imaghe
nvm got it..
same prompts but higher res (1024x1024) on 3070. Much better
i copied the nodes
just drag drop to see if that works
drag drop the image
nah i copy paste the node thing
Link Tidier. Save in \ComfyUI\web\extensions (0=straight 1= slight curves 3 = Standard) https://drive.google.com/file/d/11pAM5HW3S72DQv4Qw6cXQXTwCrLFlN2o/view?usp=sharing
LMAO
I can't make a planet shaped like a spoon :(
no matter if I phrase it or make a word soup
Hi everyone, I downloaded the new SDXL base, refiner and lora. It is an available option in the checkpoint drop down menu, however I cannot seem to select it. It just loads for a while then reverts back to the originally selected model. Any idea why this might be happening?
terminal messages?
I feel you.
Ooh, ty
it tries then fails and reverts back. could be a few things. wrong yaml file. need to update the ui. or you need some kind of launch option enabled.
how do you guys get bot level images? whats hte secret?
good prompts
Can you give examples of what you mean?
I'm using the SDXL 1.0 model which by default outputs great.
The right setup
Seems like it fails due to size changes, can you show us your workflow?
Ah wait, that is au1111?
Honestly, comfy is messing my brain. By the way has anyone succefully implemented this type of workflow:
1.5 Fine Tuned Model -> Highres fix -> SDXL Base+ Refiner and vice versa
How do you get sdxl image portraits, which are NOT out of focus ?
I tried everything i could think of :
Neg Prompt: blur, blurred, out of focus, bokeh etc.
Pos Prompt : different cameras, tele lenses, sharp, crisp, f-stop, etc.
I cranked up the weights, which somehow sometimes had the opposite effect.
Different Samplers, cfgs scales and steps.
But to no avail, only some parts of the face are in focus the rest is not.
I generated 400 already and none were in focus.
no loras and every checkpoint needs to be baked in
have you updated auto1111 today?
question how do u use that entire UI with sanity
sorry im quite new to this. how can I show you my workflow?
no actually, havent updated for a while
does anyone use comfyUI? i dont find DPM++ 2M Karras sampler? Or what is it called in ComfyUI?
that's your problem right there.
Seems like you are using au1111, not comfy so yeah you cant
Go to tech-support channel
And also do a git pull
guys what is the use case for refiner ?
Creating an image at low res then refiner with higher scale with 0.5 denoising strenght ?
Native au1111 cannot use sdxl as before of ~2 weeks ago
From 2 weeks ago to date i have no clue
Seems to be fixed now! Thanks
pick dpmpp_2m as sampler karras as schedules
I gotta say that 1.0 looks great but can also look awful what there is a lot of visual clutter.
It well just mesh smaller details together even when refined.
It's a great model and I can see it working much better when the community makes checkpoints from it.
i tried copying your layout, most of its red? 
Does anyone know when dreamboothing will be supported for SDXL? I tried both the A1111 extension and the Joe Penna dreambooth repo to no success. I know it's brand new and devs are still working on things, just curious if anyone has had any luck 🙂
ahhh i see thats explains it ty mate!
thanks!
check all of the below
Credit & Notes:
modified using original from https://github.com/SytanSD/Sytan-SDXL-ComfyUI
HRF modified from https://civitai.com/models/107144/sdxl-09-semi-official-workflow
Seed with Text from https://gist.github.com/alkemann/7361b8eb966f29c8238fd323409efb68
Multiple Nodes from WAS Suite, Efficiency nodes & Deerfu ::
https://github.com/WASasquatch/was-node-suite-comfyui
https://github.com/LucianoCirino/efficiency-nodes-comfyui
https://github.com/Derfuu/Derfuu_ComfyUI_ModdedNodes
styles sheet with SDXL Styles + others https://drive.google.com/file/d/1IZq_0CGTbfxlAdIMsjz3VwRzHjl3JL-n/view?usp=sharing (config in WAS Node Suite fpr location)
you need to add the Path to the styles2.csv (or whatever you decide to call it) in "\ComfyUI_windows_portable\ComfyUI\custom_nodes\was-node-suite-comfyui\was_suite_config.json"
So in my case its
"webui_styles": "Z:\AI\ComfyUI_windows_portable\ComfyUI\styles2.csv"
SO you can save it wherever you like
Link Tidier. Save in \ComfyUI\web\extensions (0=straight 1= slight curves 3 = Standard) https://drive.google.com/file/d/11pAM5HW3S72DQv4Qw6cXQXTwCrLFlN2o/view?usp=sharing
Hide
Anyone have a good workflow for comfyUI for the new alpha dreamshaper model?
Thanks!
will custom checkpoints be lighter in terms of size? Like pruned 1.5 ones? Using refiner in auto1111 is too slow workflow. it has to load the model every time you change from base to refiner.
My A1111 gens take around 8 minutes on my 3080. Using ComfyUI, they are taking roughly 15 seconds. Do I just have some setting wrong with A1111 or is it just not working that well currently.
A1111 isn't well setup for it right now
try this prompt: positive: iphone photo {prompt} . large depth of field, deep depth of field, highly detailed negative: drawing, painting, crayon, sketch, graphite, impressionist, noisy, blurry, soft, deformed, ugly, shallow depth of field, bokeh
A1111 needs tons of optimisations
Just making sure I wasn't missing some crucial setting
what does . do in a prompt?
a1 was cobbled together quickly at the beginning but i think sdxl proves that it has outgrown it's uses. The community will learn better workflows as we grow.
now we only need controlnet and animatediffusion for ComfyUI
btw does the dreamshaper alpha model need a refiner?
it doesn't, in my experience
If you ask me, now is a good time to try experimenting with getting used to ComfyUI, if you haven't already. There's more reason than usual with A1111 being behind and it'll benefit you in the long run!
can comfyUI do outpainting?
ok so just upscale after basemodel and thats it so cool
You can also try the prompts in the sticky. #✨|sdxl message
i really like nodes if i can do controlnet animatediffusion and roop eventually im happy and stay its sooo much more performant
Thank you for the reply, i´ll give it a whirl 🙂
invokeAI's latest build supports SDXL and is looking real sleek, I might consider it an alternative to both auto and comfy
Well that’s not nice
Personally like the control Comfy gives me, but I do want to ask, how does it compare performance wise?
Not sure if it does anything
Thats even better, awesome, thx 🙂
doubt it has too much effect, if any but if it does, you'd probably have to dive into the actual weights to find out, It's probably really good for artificially making other tokens less connected by putting a few to a ton between them
CLIP is wild on on the inside, lol
i heard people saying CLIP is like magic
https://github.com/twri/sdxl_prompt_styler/blob/main/sdxl_styles.json this looks good for prompting
i wish stuff like that would be baked into the metadata of the model and that UIs like comfy and a111 would just read it from the metadatata and make dropdowns based on them
What json setup would you recommend
wheres the download for sdxl 1.0?
i dont have a recommendation yet. i'm still figuring out what works good for me
If you mean SDXL, here: https://huggingface.co/stabilityai/stable-diffusion-xl-base-1.0 and here: https://huggingface.co/stabilityai/stable-diffusion-xl-refiner-1.0
It's in the pinned
Those are just a few examples that work. You can come up with your own.
ah thanks thats the last thing i was thinking whew
shouldve known its pinned
they aren't just examples that work though, some of them are based on how the model was trained and not random
oh hey that's twri's script! this guy knows what he's talking about
Just glad to save you from tearing your hair out looking for it!
@uncut steeple try this
I made it, and added a few prompts other than the sai ones.
its very nice
Offset lora: 0->0.5->1
oh do i have t make an account in hugging face to dload?
The most incredible thing is SDXL, it responds very well to basic styling.
I will once I got the missing nodes 
Thanks, happy you like it.
I think if you just git clone it should let you sidestep that
hank
ComfyUI_experiments/sampler_tonemap
ComfyUI_experiments/sampler_rescalecfg
Is it me or why is text i put in the prompt coming back in the pictures it generates?
i think thats it
ngl im a human and i wouldnt know what to even attempt to draw if you asked me to draw a planet shaped like a spoon lol
haha yeah it happens now. SDXL sometimes tries to make text
a decal is how I thought I could trick it.
@wet nacelle wait im guessing you got it to work??
unfortunate
I thought
someone else also had issues last night, i dont think they resolved it either lol
Yah all those guts hanging out, shudder
closest i've gotten
Looks really similar to mine, any real improvements?
oh cool, i can train a sdxl lora with 1.2gb of vram to spare with 4080
yeah, it's almost the same, just some stuff so you can use a higher cfg scale without burning the image and I added A score stuff from sytan
basically you'll see less burn artifacts and the A score makes the images better i think
any way to know which selection was taken from dynamic/wildcard prompt in comfy, or is it impossible to see?
i was wondering this myself. seems to be no but i would be completely happy to be proven wrong
damn this one looks cool as fuck
cosmic god cereal
The styles used in the bot channel - are these natively embedded in the new SDXL model, or are they just adding "extra" prompts to my prompt when I choose them?
more sortof but not quite. here's the prompt i've gotten to. 3d render of a spoon orbiting a star, in cosmic glory, colossal sized spoon with earth like features and textures across it's surface

You can do a 1st 8 steps with
A ball on a spoon
Or a ball in a spoon
On the 2nd pass lets say steps 8-12 a planet in a spoon
ghoul
some prompt editing might help a lot. there's hope yet @kind pewter
How are you using multi prompting?
I'm trying to use it and am not receiving any changes.
now thats an interesting trope never seen before, zambies eating like civilized people
Please reread #✍🏼|rules-and-tos , epsecially Rule 4 
thank you! & @delicate grotto
who broke rule 4
DM me and I'll look later. On a conference call for work
Sytan's workflow is set up for multiple text inputs
Can you send it?
comfy right?
walking dead lol.
a winning image #🏅|pantheon message
it is ?
? si ti
I also added random seed and the seed will be the same on all passes, to me the details are nicer but I haven't tried with other prompts yet
Yeah that happens
ngl. they look 15
As someone who hasn't tried LoRA before, what notebook (or tutorial) should I use to train a model for SDXL 1.0? I used Dreambooth when it came for SD 1.5 but I never tried it for SDXL 0.9 and I also never used LoRA
who
kohya-ss with the gui is what i use. its nice and easy to work out
Thank you, I'll try to use that one then
kohya's has always been touted as the best since dreambooth apparently breaks with updates
anyone know why my faces look like they've had lawnmower accident?
yes. your images. lets not expand on it. You do you. i'm just noticing the same trend with woman portraits as 1.5 had
Does anyone get this when trying to use a LoRa they have made on Kohya ss? modules.devices.NansException: A tensor with all NaNs was produced in Unet
(8k UHD, soft lighting, high quality, film grain: 1.1), a cow
Negative prompt: NSFW, cartoon, painting, illustration, (worst quality, low quality, normal quality:2), (deformed iris, deformed pupils, semi-realistic, CGI, 3d, render, sketch, cartoon, drawing, anime:1.4), long neck, Photo of a zombie, Photo of a television screen, Photo of a camera lens, photo of unspeakable Horrors, Photo of a pile of body parts and blood, Close up photo of filleted skin, Photo of a cat, Close up photo of a tooth removal
Steps: 10, Sampler: LMS Karras, CFG scale: 3, Seed: 1409750279, Size: 768x768, Model hash: 31e35c80fc, Model: sd_xl_base_1.0, Version: 1.5.1(8k UHD, soft lighting, high quality, film grain: 1.1), a cow
Negative prompt: NSFW, cartoon, painting, illustration, (worst quality, low quality, normal quality:2), (deformed iris, deformed pupils, semi-realistic, CGI, 3d, render, sketch, cartoon, drawing, anime:1.4), long neck, Photo of a zombie, Photo of a television screen, Photo of a camera lens, photo of unspeakable Horrors, Photo of a pile of body parts and blood, Close up photo of filleted skin, Photo of a cat, Close up photo of a tooth removal
Steps: 50, Sampler: DPM2 Karras, CFG scale: 7, Seed: 1409750279, Size: 1024x1024, Model hash: 7440042bbd, Model: sd_xl_refiner_1.0, Denoising strength: 0.4, Version: 1.5.1
how do I use refiner in auto1111
1.5 had this issue, it was fixed with "restore faces" i believe. idk if that works with sdxl tho, mabey it does
I've removed it, please refrain from posting content that is generally uncomfortable to view (mark as spoiler if needed) - esp for death/gore stuff
a lot of these services are trying to build brands by buying exclusivity deals. i hate it.
nah
skeletons (non-gore type) are ok, right?
Yep that's fine!
Does the ComfyUI not work? It lists Intel Arc as a card that can be used.
codeformer still works super well when it needs to. adetailer extension does a good job if you use it with a lora too
or embedding even, or just a good prompt
ya i was also going to note how 1.5 has that issue. could be a training thing. makes sense that images tagged as "girl" would be younger folks
stupid question but what's codeformer?
boy and girl token tends towards children aesthetics. woman and man tend towards adults. sdxl uses the same clip for one of the text encoders.
@trim orbit have you tried training a LoRa for SDXL yet? I'm trying to run a LoRa I made in the Kohya webui but getting thrown a NaN issue
ping, this is the answer
to get women, either prompt woman, female, or feminine
What is the render time for 32 steps on the AMD?
it's another AI that's great at restoring faces
girl will get you ~8-20 year olds most of the time, depending on other context in the prompt
a bit, seems a bit blurry for 3d
i got a few NAN's when is tarted. in the GUI there's a tab called guide, offers good tips. you need to --network_train_unet_only, use bf16, and cache the text encoder outputs. ever since going that route, i've not had problems with nans
anyone know a good upscaler for sdxl
iv got 3d in the negative
remacri
but idk what to do
do you guys know any proper workflow with lora and refiner combo? I currently don't use the refiner though
Great stuff, cheers
i find even when youre using "girl" in more adult contexts, the face still looks like they're working through puberty yet
Super stable results at 768 one to one aspect ratio
yeah... I guess vegan art is banned
maybe try analog photo or something like that
good luck!
a good prompt for detailed skin
can't disagree with you there though
(8k UHD, soft lighting, high quality, film grain: 1.1), Raw photo of Hugh Jackman as a steampunk clockwork
Negative prompt: NSFW, cartoon, painting, illustration, (worst quality, low quality, normal quality:2), (deformed iris, deformed pupils, semi-realistic, CGI, 3d, render, sketch, cartoon, drawing, anime:1.4), long neck, Photo of a zombie, Photo of a television screen, Photo of a camera lens, photo of unspeakable Horrors, Photo of a pile of body parts and blood, Close up photo of filleted skin, Photo of a cat, Close up photo of a tooth removal
Steps: 10, Sampler: LMS Karras, CFG scale: 3, Seed: 1409750279, Size: 768x768, Model hash: 31e35c80fc, Model: sd_xl_base_1.0, Version: 1.5.1


