#✨|sdxl
extremely wrinkled man, with HUGE pores, many face scars and a LOT of stubble, sweaty skin, high cheekbones and deep laugh lines,
reject waifu, embrace skelly!
this isn't a bad shot
load upscaler model, insert image into upscale, rescale to desired resolution
then in negative put smooth skin, makeup
thanks for the help
this was my result
nice
whoahh we like makeup tho 🙂
@indigo vinemedic
i keep mascara in positive prompts whenever i prompt for a lady, its such a nice detail for eyes
what do you use as negative prompt?
it can, just do full-body in positive and closeup in negative
it should do it better than 1.5 due to how they trained it
rule of thirds photography as a positive prompt worked nicely for me in 1.5, i wonder how sdxl reacts to it
i think too, doing just a little token towards the eyes, draws attention to the eye and the latent space provides more details in those focused attentions
wicked
has anyone been able to train on 1.0 yet
/settings
"Can't" is a bad default assumption. You should rather lean towards "I'm prompting wrong" when you're unable to achieve a result
Does anyone know when dreamboothing will be supported for SDXL? I tried both the A1111 extension and the Joe Penna dreambooth repo to no success. I know it's brand new and devs are still working on things, just curious if anyone has had any luck 🙂
Lykon seems to have fine-tuned Dreamshaper XL
all the loras i've made for 0.9 work on 1.0 and i trained one last night. worked fine. people doing full models too.
How much Vram will I need for a full model?
@floral island
i can't train for some reason
i keep getting missing model/unet
Will renting a 3090 with 24GB's on Runpod be enough?
Missing model directory, removing model: G:\stable-diffusion-webui\models\dreambooth\SDXL Model test 1\working\unet
nice. i do love relevancy
you used something from my prompts to make that? 😮
use kohya-ss to train instead. i don't like how poorly the extension updates
back when i used the extension, everytime i updated it'd need to be a full reinstall with me making sure the requirements.txt files didn't conflict manually
my mistake
Err no, sorry, I added you by mistake 🙂
ill download it
https://github.com/bmaltais/kohya_ss this one is up to date and is wrapped with a nice gui
It was a random effort, rather than something like Arcimboldo
look into my beady eyes 😮
im gonna sell my soul
ComfyUI crashed my PC with too high of image generation settings and now I get this issue trying to load it up. Any solutions?
not in this economy. supply and demand. everyone wants to sell out now

ok, I have some ideas for a future version of stable diffusion
- stable diffusion needs to become a 3d editing program, possibly based on Blender, in which there are several primitive 3d models of people, animals and other objects or buildings, as well as the option to upload your own low poly 3d models, which will create a framework for generation. This is the only way it will be possible, for example, to make a comic or a storyboard with actors who are guaranteed to have their hands and feet in order, or to make a crowd of pedestrians in a city with the correct body geometry.
- Stable diffusion should identify the characters in the composition and give them an ID number in order to be able to use this character or many characters in the future when changing locations.
- The 3D program should be able to edit the parameters of the character's body.
- As well as the ability to generate faces with a variety of parameters, mixing, and loading parameters from photos.
- It should be possible to upload a photo of a face and generate its 3d model as a meta-human.
I do not know where to stir ideas, so I put it here.
Reinstall Python?

Idk comfy is easy to install, reinstall?
i don't know how comfy ui would've messed up your python folder. don't think that's possible
Reinstall everything?
but that's whats wrong. your python folder got haggard
😦
@trim orbit ill just take your soul then
you probably did something else and that broke python install
Does deleting VENV work in ComfyUI?
ComfyUI worked fine until my PC crashed
i'm happy to provide. i've got plenty of soul always
stable diffusion is only the image generating parts, all of those things that you mentioned are to be developed by 3rd party developers
not sure why i got the flowers, but eh... i'll take it ❤️
The PC crashed and took ComfyUI with it? OR ComfyUI crashed and took the computer down?
that's a very different program. what stable diffusion is is more like an open sourced rendering component. blender uses a raytracing component as its renderer. blender is the visual suite that uses that renderer as part of its architecture
id post this into #🤝|tech-support , hopefully areytis and lone are active
how long does the generation take for you guys with euler a, 80 steps and batch of 4? Its around 20 minutes for me and I feel something's wrong. (edit 8gb ram rtx)
https://www.cycles-renderer.org/ this is the renderer i think that blender uses
PC crashed and took ComfyUI with it.
I think?
python being borked speaks more to me that something else happened
does anyone have a clue how to generate mythological symbols like this with appropriate prompts
the symbols are likely being added after
that looks like a gryphon so you can just prompt for that
and that's a 1.5 render. you knew that already. you're posting that for an audience to see. not a real inquiry. but yeah, reminder, this is the sdxl chan
I know how Blender works, but there is already a stable diffusion add-on for Blender, so why not develop this promising direction? The SDXL version has not solved the problem of crooked hands and only a 3d program can solve it
good i'm glad you know, then you'll understand what i mean by a smaller component of a larger project
how about the feature extraction? i mean the major design style, can we make it through sdxl?
How long does the generation take for you guys with euler a, 80 steps and batch of 4? Its around 20 minutes for me and I feel something's wrong, in sd1.5 its 30 seconds at max (8gb ram rtx 3070)
developed by a 3rd party. SAI is just here to provide the image generation part
this is why the devs were hyped about the community because its on us to go fucking crazy with it
with stable swarm and comfyui and stable studio all being developed alongside stable diffusion, there will be convergence into a fully featured studio app like blender at some point. your suggestions are good and clear. it's a good direction and where they're headed already i believe @amber pecan
awesome ^^
controlnet will help a lot for getting hands out of sdxl
i think thats just a simple style that you can change to in the settings
okay
I’m super new to SDXL is it the github to get it?
i need to redownload 1.0 since the model weights got released
pretty sure thats just dark mode

no, thats a picture of automatic1111, its a setting there
settings -> User Interface
youll see this
Great!
I just download the GitHub?
To get the automatic1111?
Ok
I haven’t downloaded anything
Can possibly at least implement this function 2) ?, stable diffusion should be more stable and not random, the work should be controlled by the creator of the pictures, otherwise it's just an endless empty game of generating random pictures
- Stable diffusion should identify the characters in the composition and give them an ID number in order to be able to use this character or many characters in the future when changing locations.
i love the git clone url command that's so popularized with this realm.
i think there might be more steps, the github has very simple installation instructions so you can just follow those
i know git commands aren't the most user friendly, but i lovem
Great thanks!
- ?, stable diffusion should be more stable and not random, the work should be controlled by the creator of the pictures, otherwise it's just an endless empty game of generating random pictures
if you can figure out how to do this then im sure many companies would love to hire you lol
so control net is currently not compatible in SDXL?
correct there are no controlnet models for sdxl currently
controlnet gives artistic compositional control. a studio set up for building that controlnet input would be really nice. its likely already underway by a few teams.
it wont be too long before there is controlnet for sdxl. it is inevitable /echo
people are already using blender as an open pose creator
ideally we dont even use a gui to generate images, we could just generate images from scans of our brain activity like from that one research paper a few months back
the second setting is set to "fixed" which will make your seed static, its overlapped so its hard to read
just set that to randomize
imagining SAO worlds with a simple thought
if you trust yourself to the elon neuralink tho XD
Unfortunately, I am not a programmer, but the solution to this problem lies in the fact that the AI has to explain what is in the picture and Meta has succeeded in this. And then it will be a new level of generation, through dialogue with AI.
https://toyxyz.gumroad.com/l/ciojz?layout=profile here's a decent looking one
Blender version 3.5 or higher is required. Download: blender.org. Guide: https://youtu.be/f1Oc5JaeZiw
Character bones that look like Openpose for blender Ver93 Depth+Canny+Landmark+MediaPipeFace+finger. In version 93, new tools were added. -Openpose_attach: Using this tool, you can render the images you need for multiple controlnets at once using t...
for some reason my answer to that is just better prompting lol
have they? i havent kept up with them
no amount of systems on top can solve laziness
ya currently we're limited to just trying our* best to get the AI to generate what we want. ideally tho we dont need to
soon, public ads will use focused microwave beams to scan brains of people in the area and target diffusion generated ads to display specifically towards them
so they'll try to sell me anime figures whenever i'm buying stuff from the supermarket huh
i hope i don't have to learn to use comfyui to use sdxl, my pc can't handle it on auto11, it takes too long T__t 3050 8gb user <-
a boxer boxer
no need to learn how to use nodes when you can just be lazy and steal workflows like i did 🙂
It's already light-years beyond developing genuine artistic ability to paint or draw or take a photograph yourself- further development is always underway but as it is the simple leap from "no generative ai" to "nearly indistinguishable from photographs when properly processed" is astronomical, and likely not to be reproduced in scale for a good while if ever in terms of the technology
i can never steal workflows because what i make is niche
i use auto for an hour long session, and images start taking 10min on the vae step. there's a huge memory leak somewhere. closing and opening it again helps. sometimes its only 10min of work before the vae step starts getting longer and longer to cook
If you want ai to be better sooner though, feel free to collaborate on the open source nature of many of them
Auto is famous for memory leak sadly. In my experience comfy is too, with the new sdxl base model at least
at first it was 10mins per image on my end, had to use medvram for auto11 to generate properly
maybe it's just pytorch
gonna be checking 1.5.1 now tho
Finally got wildcards to work in comfy 
Could be . I'm not a programmer, nor am I particularly versed in the technology of generative ai. I just keep restarting it lol
well by workflow i mean just simple stuff like getting it to generate an image and using the refiner. with sytans workflow for example, you can load it up and just get straight to prompting, very similar to a1111. assuming you simply prompt and generate as your normal workflow of course.
Is there a checkpoint model somewhere I can drop into DiffusionBee or other SD installations?
What nodes? Also nice.
still taking too long with 1.5.1 without medvram on my gpu, i guess i'll be using comfyui for sometime ;s
GL with generations guys ._./
i updated automatic1111, downloaded the "refiner" model and put it in.. and it sucks. i feel like i skipped a step, what am i missing?
Are you generating at least 1024p?
yes, im trying 1024x1024
the steps go: create image with base model -> img2img with refiner. the refiner itself shouldnt be creating images
ohhh
a1111 is a 2 step process right now, comfy's is better imo since you can just do 1 click
Using Impact node now, had issues getting it to run before
But its acting weird for me still 
Can't wait for controlnet, gonna be game changing again
Also takes a ton of VRAM
Whereas comfy runs on 8
Thanks!
after clicking 100x to set up a workflow
just press the arrows on the control after gen to the right until u see randomize
another ux thing i hate about comfy, go to pan the screen and it doesn't catch my space bar, so i click to drag but accidentally click a node and drag it way off messing the whole layout
sometimes i just get fed up and load auto
When dragging image into comfy is it possible to change only prompts and seed and keep the workflow? @visual glade
I just want to see if anyone's used the SD Ultimate Upscale script with sdxl, or if it is compatible
going to try to get stable studio running with comfy tonight again. i just have to symlink it all up with my existing folders i think
Sometimes with this method images are flattened in my case
It works
is it worth using without controlnet tile?
hello everyone .... so i have a 3060 8GB vram and i downloaded dreamshaper SDXL1.0 and i tried to run it at 1024/1024 resolution on automatic1111 with a basic prompt, no lora no nothing, but every time it keeps saying Out of cuda memory ... why is that ? like other SD1.5 models work fine in that resolution
already on
i forced myself to kohya ss
a1111 holds too much stuff in memory. i have 6gb vram and comfy is at least running stuff out at 1024 for me (without refiner)
okay fine i guess i have to install comfy and get used to it
I gotta give this comfy a try some time...
i installed it a while back but it didnt do anything a1111 didnt so i skipped it
yeah its kinda like a dog walking on hind legs for a while but
well "work" is something it can add to the list 😉
comfy deffo for middle end machines though. im sure a1111 will work tearfree eventually
is comfy the one with the weird nodes thing?
yeah. its painless if you already know blender or use music DAWS but hard to wrap your head around a pegboard UI at the same time as grasp AI art generation
What DAWs use a node graph?
I've used the setup with Blender and in Fusion with Davinci Resolve, but haven't tried Comfy yet.
not node graphs per se, but you use your ins and outs and ducking and tracks and stuff, handy concepts for comfy
change to randomized and enter 0 as value
ITC: we complain about comfyui like children going for their first haircut.
i dont wanna! what if it hurts?
comfy rules but i still can't figure out masks + compositing
Ah I see what you mean. Stuff like digital modular synthesizers
tes
eys
yes
in fact, i'd kinda like it if comfy could be controlled with mixers or make macro surfaces for it.
what the fuck!
How do I find the negative_prompts used in the bot generation?
diffusers is awesome for pretrain models
something about that image makes me expect it to move suddenly
so much fun
or drag and dropping an image that was already generated with comfy
where is the best place to learn how to use comfy? for a complete beginner
https://github.com/comfyanonymous/ComfyUI_examples and olivios videos
thank you sir
Just finished my first LoRA based on SDXL 1.0 (Trigger J3nn4) (using Jenna Ortega as a Test here)
well damn
has some weird videogame engine aesthetic to it i love it
like early dierectx
This is without the refiner or VAE.
directx
Is it private or public? Good work
Looks nice
now that you said it we need a direct comparison :P
What’s the question tho?
How are we supposed to know?
just use clip and try to work it out
This is new clip
Since I used random highres images from the internet I will keep it private. I just used Jenna as a test subject
the fuck! no
should get close enough to a working prompt
should I ping staff?! the fuck!
any1 got any face detailer workflows going?
good luck genning today generators. Stay classy. I'm out
did you look it up?
drag this into comfy UI
Why do you mention 6 billions of images?
"What's that?"
moments before a FHRITP
Got dammnit give me the promt
which parameters did you use?
it is included in the metadata
What is metadata
because that's the amount of images on danbooru and also the size of the nai dataset according to nai at least
//oh sorry I meant 6 million
too harsh of me
the workflow and prompt stored in the image itself, stuff you don't actually see visually
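For anyone who wants to look at that hidden data outside of comfy, here is a rough sketch using only the Python standard library. It assumes the image is a PNG and that the metadata sits in plain `tEXt` chunks; the key name `prompt` used in the demo is an assumption about what the UI writes, and `read_png_text` / `make_chunk` are made-up helper names. It will not find anything in a JPG.

```python
import struct
import zlib

PNG_SIGNATURE = b"\x89PNG\r\n\x1a\n"

def read_png_text(data: bytes) -> dict:
    """Return the tEXt key/value pairs found in raw PNG bytes."""
    if not data.startswith(PNG_SIGNATURE):
        raise ValueError("not a PNG file")
    found = {}
    pos = len(PNG_SIGNATURE)
    while pos + 8 <= len(data):
        # Each chunk: 4-byte length, 4-byte type, payload, 4-byte CRC.
        length, ctype = struct.unpack(">I4s", data[pos:pos + 8])
        payload = data[pos + 8:pos + 8 + length]
        if ctype == b"tEXt":
            # tEXt payload is "keyword", a NUL byte, then the text.
            key, _, value = payload.partition(b"\x00")
            found[key.decode("latin-1")] = value.decode("latin-1")
        if ctype == b"IEND":
            break
        pos += 12 + length
    return found

def make_chunk(ctype: bytes, payload: bytes) -> bytes:
    """Build one PNG chunk (only used to fabricate a tiny demo file)."""
    crc = zlib.crc32(ctype + payload) & 0xFFFFFFFF
    return struct.pack(">I", len(payload)) + ctype + payload + struct.pack(">I", crc)

# Demo on a fabricated, pixel-less PNG. For a real file you would pass
# open("some_image.png", "rb").read() instead (hypothetical file name).
demo = (
    PNG_SIGNATURE
    + make_chunk(b"tEXt", b"prompt\x00a boxer boxer")
    + make_chunk(b"IEND", b"")
)
print(read_png_text(demo))
```

Dragging the image into the UI is still the easy route; this only shows that the text really is sitting in the file next to the pixels.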
comfy doggo vs. sdnext doggo
Do they know what comfy is though? that is the question.
How do i see it?
upscaled comfy doggo
But the interesting thing here are the spec, right?
20 images 20 repeats
40 regularization images 1 repeats
stopped at epoch 8
used epoch 7
Training time ~1.5h on a RTX 4090 (64GB RAM)
see what I mean.
yep
see above
what about LR, did you train text encoder?
https://github.com/comfyanonymous/ComfyUI
download this and read what is written on there
This is why I don't understand why new people join the server and ask these questions. It's as if they haven't tried to search Youtube for it. @bold osprey
okay
can't believe i'm a comfy shill now
the legs look like they don't have bone damn
otherwise the color grading is insane
yes, TE and UNET too
are you guys using any icons for SDXL / Comfy? I want to tuck it neatly into my taskbar
Anybody using it in A1111?
not recommended
shart
The arm with two hands is curious hahaha
If I make a large batch of images in comfyui, how can I find the seeds of the individual images from the batch? They don't seem to increment and I can't find them recorded.
drag and drop the image you want the seed from in comfy
That just sets up the workflow for the whole batch.
you have to use the index from batch node, or something like that, forgot the exact name
do you lads know of a youtube video that explains all the nodes in comfyUI in good detail? And some of the addons? Really wanna learn this stuff to make full use of it
oh...
the seeds are somehow between the numbers so just incrementing by 1 doesn't work like with auto
In automatic 1111 discarding next to last Sigma seems to help
Here are amazing ways to use ComfyUI. This node based UI can do a lot more than you might think. Especially Latent Images can be used in very creative ways. You can inject prompt changes. You can combine latent images to new results. Stop render steps and finish the rendering after you changed the prompt, sampler and settings. A world of possibi...
Thanks 
a1111 and its related forks do function, but the results i've seen haven't been great. even if you manage to get things running smoothly enough, an overnight update could screw it all up
Is it doing a random seed each time and forgetting them?
Yeah, my jaw dropped, when I saw "6b"
I know that waifu diffusion project has 15m dataset
eyy! Olivio!
yeah you can also just set a seed in the sampler if you want to
what did you use for the text encoder learning rate?
it's not random, just that the seed counts for the whole batch, not 1 image
Perhaps I can use increment, but that's far less convenient than random.
Has anyone tried this combination sd1.5 512px for base and sdxl 1024 for refiner, will it work????
could you please explain in simpler way. it went over my head 🙈
thank you for the delivery
What did you use as your reg images?
yeah that would be like every anime image on the internet ever made probably 
So many wheels for a light load
I would love to be able to recreate a single image without having to regenerate the whole batch. I've got a generation I really like, but it's part way through a batch.
oh... i was thinking about using it in deforum 😕
I do it the other way around. SDXL Base -> Juggernaut as "refiner" after upscaling.
0 will be the first batch image, 1 - second, etc
BRO this video literally explains how all this shit works in less than 3 minutes
Thanks.
So, someone yesterday made this with 1.0 but, when i installed necessary node packs from the extension manager, i still keep getting these errors. Any idea?
HELP
helped me getting into comfy as well
use comfy manager to install missing nodes
anyone get a fine tune / LoRA / etc. working yet?
Anyone Plz Help me. After i generate one or two images. it shows me that out of memory error. I am Using 1660 Super 6Gb vRam.
ok
40 images generated with Photon (sd 1.5) that were using "Jenna Ortega" in the prompt (2048x2048) from various views and angles
https://github.com/ltdrdata/ComfyUI-Manager
is it this one?
that's the one
on A1111? same issue with me
Read the whole message :P Already got the missing nodes installed, even clicked "install missing ones" and still no go :S
But, how many of these images are pages from manga... Because it can be good to have not only arts
LoRA, yes
So it's weird how missing node addons are installed, but image/comfy still errors saying it's missing 
hahahah yes bro I DID IT
I literally just made this workflow
that looks great
let's gooo
and cute!
1.0 trained? Or can one use lora from below SDXL? 
what happens after the restart
Running some testing on it and getting images, but will be releasing it later 🙂
1.0 trained, took about an hour
You can use 0.9 LoRA's but they don't work as well as retraining
getting the refiner in there will do magic to your result
"text_encoder_lr": 0.0004,
"unet_lr": 0.0004,
"learning_rate": 0.0004,
can someone help me understand how swarmui multithreading works when it is using a backend that, presumably, doesn't have that?
same thing. For every addon i installed myself/added that was missing/sdxl related, i closed the CMD to restart, and it started to install dependencies, but once running again, it still complained 
Oh damn! With kohya?
yep!
I trained both TE and Unet, even though I'm not sure how it handles the TE at this time, being that there are 2
Wow, so just regular kohya training? Cause i tried it before, but it constantly complained about stuff in the wrong folder lol. So gave up back then 
Yeah thats the next step :D
I need to try out UNET
I have 16 GB ram and 12 GB vram, using sdxl on comfy ui and it keeps killing itself,
```
got prompt
model_type EPS
adm 2816
making attention of type 'vanilla-pytorch' with 512 in_channels
Working with z of shape (1, 4, 32, 32) = 4096 dimensions.
making attention of type 'vanilla-pytorch' with 512 in_channels
missing {'cond_stage_model.clip_g.transformer.text_model.embeddings.position_ids'}
torch.Size([1, 1280]) 512 512 0 0 512 512
torch.Size([1, 1280]) 512 512 0 0 512 512
100%|███████████████████████████████████████████| 20/20 [00:05<00:00, 3.69it/s]
Killed
```
I use Derrian Distro's Easy_LoRA_Training scripts, the Gui is a little better for LoRA's in my opinion than Kohya_SS from bmaltais
Can someone give a solution?
I thought it only worked with 1024px So it now works with 512p??? What about 256px??
anything below 512px with the base results in noisy and overdone images
honestly I have no idea, but hope someone helps you. It shouldn't be killing itself after a gen
@high skiff trying your workflow for the first time. you're a legend, thank you
That's the base model. The base model is supposed to make a 128x128 latent image
there is no generation,
if im understanding right
just don't
Thanks, that worked great.
Did you check your output folder? It looks like it ran 20 steps and then died
using different conditioning dimensions and target below 1024 I think you can get images that look normal but 1024 is still the default
?
On my system it takes 1.5hr, I want to speed it up 512>1024
128 Latent = 1024 pixels, just to be clear
yes works fine
base makes 128x128 latent
(or is optimized for that, I suppose you can make images from it just fine)
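To put numbers on the "128 latent = 1024 pixels" point, a tiny sketch. It assumes the usual 8x per-side downscale of the Stable Diffusion family VAE and 4 latent channels; `latent_shape` is just an illustrative name, not a function from any UI.

```python
VAE_DOWNSCALE = 8      # assumption: the VAE shrinks each side by a factor of 8
LATENT_CHANNELS = 4    # assumption: 4-channel latents

def latent_shape(width: int, height: int, batch: int = 1) -> tuple:
    """Latent tensor shape (batch, channels, h, w) for a pixel resolution."""
    if width % VAE_DOWNSCALE or height % VAE_DOWNSCALE:
        raise ValueError("width and height should be multiples of 8")
    return (batch, LATENT_CHANNELS, height // VAE_DOWNSCALE, width // VAE_DOWNSCALE)

print(latent_shape(1024, 1024))  # (1, 4, 128, 128)
print(latent_shape(512, 512))    # (1, 4, 64, 64)
```

So a 128x128 latent and a 1024x1024 image are the same picture in two spaces; no upscaling happens between them.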
not with only the base and without the separate vae
Mine just stops after final step of generation
I did, no output
that's kind of a strange graph since the image decoded from base latent will also be 1024, likewise the latent from refiner will be 128...
Which conditions did you use?
makes it sound like there's some sort of upscaling going on when in reality there isn't
you are what you eat.
That is why there is no image, and possibly could be a bug killing it
comfy has nodes for text encoder where you can specify width/height and target width/height
if you specify target width and height lower than 1024 but width and height higher than it, the result should in theory be pretty decent as well
SDXL Aspect Ratio is nicer because it has all the trained ratios ready for selection
@hard fractal how goes control net? 🙂
did you double check your VAE is not hardcoded in the command args of the startup script
possibly, instead I set the resolution separately for the latent image
that's for latent but from my understanding the target dims should also match the latent size
might be bullshit though
Can't say for sure what works better yet
It prompted to use --no-half-vae , so I did
makes sense though could also just be downscaled
this is the way
now would you look at that
ooh, with a multiplier now. still need a proper math'd out node though
you know more than i do! I just like the settings being accessible 🙂
gotcha, i put the 0.9 vae in the VAE folder and then selected it under settings and saved it. did a full restart of auto1111 and then it worked for me. search my prior messages to see what i was getting before and if it's just stopping, check the command window for an error you can google
i learned from the devs last night that target width/height should be the resolution you're generating at
it's handy, i just threw it together as a quickie until something better comes along.
Ok. It's not showing any error. It just sort of hangs
idk, does it work? if it works, then great! (i haven't tried)
its hit and miss honestly
I just use this for multiplying since it outputs both before and after
trying something out with the CLIPTextEncodeSDXL node
did you go to the output folder to confirm if it generated anything but just didn't render it in the web.. sounds like you have a specific issue here with your install but web1111 works
No i mean that I checked the output folder, there is no output image
Hey Annuvin. Can you share the setting you use in your kohya gui lora training?
The Json file?
Maybe the saving is crashing it, that is an odd issue. Have you checked if there is an open issue about it on ComfyUI github?
If you're using an upscaler, wouldn't you want to use the same seed to generate the higher res upscale with the exact same details, just more of it in the higher res? Watching Olivio's video
I think I pasted them earlier? not really sure I have any jsons with them, I use my own script with variables and stuff
Oh you make the training using the script? I wanted to use the gui for the training. (
Thank you though.
oh I don't use any guis 
that said if you're using bmaltais' repo or something most of the arg names should match up
for those struggling with MISSING NODES:
don't forget to load the SDXL Vae https://www.huggingface.co/stabilityai/sdxl-vae/blob/main/sdxl_vae.safetensors otherwise you get chromatic aberration & noise over your generations.
No you won't
it's jpg not png
Error occurred when executing KSampler:
mat1 and mat2 shapes cannot be multiplied (154x2048 and 1280x768)
File "C:\Users\emerg\Desktop\webui\StableSwarmUI\dlbackend\comfy\ComfyUI\execution.py", line 144, in recursive_execute
output_data, output_ui = get_output_data(obj, input_data_all)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "C:\Users\emerg\Desktop\webui\StableSwarmUI\dlbackend\comfy\ComfyUI\execution.py", line 74, in get_output_data
return_values = map_node_over_list(obj, input_data_all, obj.FUNCTION, allow_interrupt=True)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "C:\Users\emerg\Desktop\webui\StableSwarmUI\dlbackend\comfy\ComfyUI\execution.py", line 67, in map_node_over_list
results.append(getattr(obj, func)(**slice_dict(input_data_all, i)))
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
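A note on reading that error: a matrix product needs the inner dimensions to agree, and here they are 2048 and 1280. The numbers suggest conditioning that is 2048 wide meeting a layer that expects 1280, but the log does not establish why. Below is a minimal sketch of the shape rule itself; `matmul_shape` is a made-up helper, not PyTorch's code.

```python
def matmul_shape(a: tuple, b: tuple) -> tuple:
    """Shape of a @ b for 2-D shapes, or raise like the error in the log."""
    (rows_a, cols_a), (rows_b, cols_b) = a, b
    if cols_a != rows_b:
        # Inner dimensions differ, so the product is undefined.
        raise ValueError(
            f"mat1 and mat2 shapes cannot be multiplied "
            f"({rows_a}x{cols_a} and {rows_b}x{cols_b})"
        )
    return (rows_a, cols_b)

print(matmul_shape((154, 1280), (1280, 768)))  # (154, 768)
try:
    matmul_shape((154, 2048), (1280, 768))
except ValueError as err:
    print(err)
```

In practice this usually means two pieces of the graph that do not belong together got wired up, rather than a wrong resolution.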
if you use a vae loader with the sdxl vae it'll be fixed
I saw you were having trouble with the workflow from my image, did you try the one that I uploaded as a file? If it’s from the image yes I use custom nodes.
you're not supposed to set latent size to 128 but idk if that's the problem
that's the dimensions for the image in pixel space, not encoded
@polar epoch if you're having trouble using the workflow I used for that image use this instead. I took out the custom nodes for you. Or does it still give errors? #✨|sdxl message
Aye, the file worked, just oddly not when dragging over that old man pic
The fuck are you talking about.
yes there are faint lines but it's not noticeable on surface level.
and all I'm saying is you will get rid of it if you use the official vae they posted.
Yeah the old man pic is my private workflow haha. It has custom nodes and stuff. That I use for ease of use and to make pretty pictures like bloom and stuffs
if you don't find it necessary then no problem.
It's super noticeable even at 2 monitor height-distance view. /shrug
yup. 100% very noticeable
i will never not notice it looking at a pic
looks like the refiner doesn't like working at 1024x1024 but being passed a 128x128 latent?
I can see it from the couch 4 meters away 
I just use the other VAE for it, no issues there
I'll be autistic just for you sweetie.
they reverted it so it's the official vae version now
Also, is there a setting/way/addon that has a "PNG info" where it notes what nodes were used, settings for nodes and such? Similar to png info in automatic, that way i can add nodes where due if it's needed without shifting my entire current workflow :P
the sdxl_vae on the main branch on huggingface is the proper one now
Damn dude, really pulling out that card.
the official vae still has fp16 issues afaik, the one from madebyollin doesn't
Ah ok :P Could you take a screenshot of the workflow so i can see what you used? As i wanna achieve the same level of details you guys get :P
It doesn't seem to like any resolutions I set in that configuration and spits out seemingly random matrix multiplication mismatched dimensions
i've had no issues with using the one with this hash from the official upload
That's what i used to get VIA's workflow, but they seem to use even more custom ones the manager doesn't have :P
Does anyone have the SDXL LoRA fine-tuning parameters for KohyaSSGUI?
it might be working for inference or you're using fp32 unknowingly
For the best images sytan will be releasing a workflow with all stock nodes that will achieve great results! Hopefully he will release by this weekend. I’d just wait for that.
for training this is the best in any case https://huggingface.co/madebyollin/sdxl-vae-fp16-fix/blob/main/sdxl_vae.safetensors
that moment when bitches think that giving help to the people that don't want it is a good idea.
my workflow hasn't changed much, are you asking for inference workflows to be posted or...?
just messing around :p
lol
Aye. I can do addon nodes as well, just need to learn the basics of nodes and what goes where to achieve X :P
Probably a custom sdxl aspect ratio helper that I got from someone. You don’t have to use it.
I hunt workflows for sure but i just check to see if meta is intact, if not, i figure its TOP SECRET stuff
I'd love to use it actually :P Would be perfect for ultrawide generations :P
Ajani?
Yeah I may end up making a YouTube video lmao. For those struggling with comfy or something for 2.0 when sytan comes out with his workflow idk
:p
I’d send you the file but I’m not home right now 😦
Aye. I'm smart, but not so clever sadly. So takes me fairly long time to get the basics of anything new and foreign 
What is the point of including stuff like this upscaling in comfyui nodes?
the blurry one is the upscaled one lol
Are you latent upscaling with a low denoise?
its just a bilinear
No worries, i'm gonna watch youtube videos that explains what is what, why place X there and what it achieves hehe
maybe you forgot to disable add noise?
you're supposed to use that upscaling with a second pass
for a normal upscaler check out hat-l/esrgan ones
So you're saying that if I use this VAE I should have fewer issues than the ones I already don't have?
I'd be using a VAE without having to use it to begin with correct?
Instead of using the official one I should instead be using a fork that supposedly fixes the issues I don't see nor have.
Am I understanding this correctly?
huh? I wasn't even responding to you
No worries. You can do it! I believe in you
He's in need of attention
Is that the hacked together one I put on here lol?
Just ignore him lmao
possible? I forgot which workflow it came from.
You've been timed out for 1hr
See if you can play nice with everyone without being so rude when you come back 
if you have to know the official sai vae produces nans in fp16 precision which makes your loras and stuff basically broken
I linked the official vae from the official fork https://www.huggingface.co/stabilityai/sdxl-vae/blob/main/sdxl_vae.safetensors
Ohh. There we go, think i found out why certain nodes didn't wanna show. Some of them didn't install dependencies
I just like the easy selection lol
This vae is meant to be used alongside SDXL
If you don't trust the fp16 fix then trust the official vae
It looks like mine, I just took throttlekittys and added a multiplier to it
it could be mine. i never intended it to actually be used. I guess i could make a gist and register it with the manager
i defer to you guys on that. I like it. I'm lazy and it helps things along
As i downloaded the 1 click install/portable comfy, how would i pip install stuff to it? Just simply open CMD in its folder and install the missing module? or what's the way for venv ones outside of anaconda?
do it. I like it.
Yeah yours is the one I use. I love it! Thanks for making that. At least I think it is lmao
Besides them holding the cup on the wrong side with the force, I like this image 🙂
force sip
I'm using it
what does width and height do?
seemingly nothing lol
nothing. I just wasn't sure if the node needed values; I had a massive headache trying to get the node to run in the first place because of windows(?), and was too annoyed to fix it further
literally copy/pasted to a new file and it worked
Does the newest version of Easy Diffusion contain SDXL 1.0 ?
Which folder do I put the refiner for ComfyUI?
with your other stable diffusion models
If you like Easy Diffusion, check out Arthemy. They are making a super cool application imo, I like their evolution workflow (just img2img but super sleek) I don't think Easy Diffusion supports SDXL yet though, but I could be wrong
Don't know if this helps but I've added a routine to my workflow that writes the prompt & seed used (want to add some other parameters in when I get 5) to a .txt file in the output directory with the same filename as the image generated from it.
NB full info on any custom nodes, style sheets etc needed to work are in the Credit & Notes textbox in the workflow
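For anyone who wants the same idea outside of a node graph, here is a minimal sketch of the sidecar-file approach in plain Python. The function name `write_sidecar` and the `prompt:` / `seed:` layout are made up for illustration; this is not the WAS Suite implementation or the routine from the workflow above.

```python
from pathlib import Path


def write_sidecar(image_path, prompt, seed):
    """Write prompt and seed to a .txt file with the same name as the image.

    Hypothetical helper: the two-line text layout is an assumption,
    pick whatever format you find easiest to read back later.
    """
    image_path = Path(image_path)
    # Same folder, same stem, .txt extension instead of .png
    sidecar = image_path.with_suffix(".txt")
    sidecar.write_text(f"prompt: {prompt}\nseed: {seed}\n", encoding="utf-8")
    return sidecar
```

Calling `write_sidecar("output/ComfyUI_00001_.png", "a corgi in a flower field", 42)` would create `output/ComfyUI_00001_.txt` next to the image, provided the `output` folder already exists.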
Alright i will look into that. Thank you for your quick response 👍
where is the ''trainer ready'' now?
looks like it's working 👍
took me a while lol
How much coke was on the table before you started vs after? Jk lol but nice images
hahahah nice joke (5kg)
kinda boring
Thanks! :D
And is there a addon that has like a png info node, and would read from current workflow you have, state "node missing", list nodes, and highlighting with a red outline between the nodes image originally used and where rest is fine to note what's missing?
oi you taking long to recommend its scary
how to config sdxl aspect ratio?
less cartoony, bigger dynamic range of colour, but I feel its more a problem with sdxl
hey, that looks cool. I've built something similar - a metadata .txt generator with just nodes. It helped getting to know comfyui and it's fun building components. I should probably start learning to build my own nodes 😉
i use an extension (a1111) sd-webui-aspect-ratio-helper
Ya.. no img in output folder. It was generating earlier but then stopped. Found a similar issue on A1111's GitHub. It's an issue with vae it seems.
nah nah i used an artist name for that
its actually fun mixing artists yall should try it
Im just tinkering with this at the moment from WAS Suite
im uncultured I dont know any artists xD
looks nice. which UI are you using? 🙂
a1111
why, cant you run it on a1111?
oh, is refiner working already?
SDXL can run on auto1111.
yes but you cant use the vae-ft-mse-840000-ema-pruned.ckpt vae
or actually any vae i tried
are you using some theme to get the connections so straight? 
because it will mess up the image at the last step
How'd you get those connections to be square?
i updated webui to 1.5.1 for it to work too
@fierce hollow @jagged cove
Link Tidier. Save in \ComfyUI\web\extensions (0=straight 1= slight curves 3 = Standard) https://drive.google.com/file/d/11pAM5HW3S72DQv4Qw6cXQXTwCrLFlN2o/view?usp=sharing
you need sdxl vae for that
thanks 
ah - great idea! looking good 🙂
wow, comfyui is a game changer for sdxl; kept getting oom with sdnext (i'd get maybe 10 images generated before crashing, but was always at 95% gpu vram utilization). sitting at around 75% vram used with comfyui. rtx 3070, 8 GB ram.
If you're using Auto1111 I recommend using the fp16 VAE for SDXL that you can find on huggingface as that has a better performance.
gawsh, haven't done stuff in a long time, like... pull requests. but i did it! 😮
Ah, SO much cleaner. Thanks!
you mean the third party vae?
ill try that thanks
just got the sdxl vae
literally no difference than no vae
that i can see
you want clean ?
This is my daily driver workspace in ComfyUI ;o)
Managed to hide most of the lines altogether
Few random SDXL images made in Comfy today
Yes, then you do not need to use the no half VAE param and I'm pretty sure you get better performance.
workflow.json PLZ
I love that.
They're in the images
you might be interested in https://github.com/space-nuko/ComfyBox it's like a cross between comfy nodes and normal ui
How do you get that image save node?
This one
thanks, will try that tomorrow 👍
Tried it, liked the idea, my workflow refuses to load, move on 🙂
fair enough
I was so hesitant to move from A1111 but i get 10x better speed with comfy
its part of WAS Nodes Suite
@shy kelp Just throw the image in Comfy. There's the workflow
How do you do that?
Save the image from here first and then drag&drop it in Comfy
Has anyone tried making Loras yet?
open image in browser , save to PC, dragndrop into comfy
mfish: Yes, LoRAs work great
unfortunately some of the nodes do not show up in the WAS nodes when you update using manager. I don't know what to do about that lol
Did you use Kohya?
Interesting how does it do that??
I did get a bug though When loading the graph, the following node types were not found:
Text Multiline
Text Concatenate
update manually using git clone
exif data or something I'm assuming
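Close: as far as I know it is PNG text metadata rather than EXIF. ComfyUI's stock save node appears to store the graph as JSON in PNG `tEXt` chunks under the keys `workflow` and `prompt` (treat those key names as an assumption), which is also why images that went through a site that strips metadata won't load. A rough stdlib-only sketch for peeking at those chunks; it ignores compressed `zTXt` and `iTXt` chunks:

```python
import struct

PNG_SIGNATURE = b"\x89PNG\r\n\x1a\n"


def read_png_text_chunks(data: bytes) -> dict:
    """Return {keyword: text} for every uncompressed tEXt chunk in raw PNG bytes."""
    if not data.startswith(PNG_SIGNATURE):
        raise ValueError("not a PNG file")
    found = {}
    pos = len(PNG_SIGNATURE)
    while pos + 8 <= len(data):
        # Each chunk: 4-byte big-endian length, 4-byte type, body, 4-byte CRC
        length, chunk_type = struct.unpack(">I4s", data[pos:pos + 8])
        body = data[pos + 8:pos + 8 + length]
        if chunk_type == b"tEXt":
            # tEXt body is "keyword\0text", both Latin-1 encoded
            keyword, _, text = body.partition(b"\x00")
            found[keyword.decode("latin-1")] = text.decode("latin-1")
        if chunk_type == b"IEND":
            break
        pos += 12 + length
    return found
```

With a real image you would read the file in binary mode, pass the bytes in, and run `json.loads` on the `workflow` entry if it is present.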
did that too and still missing nodes. Not really sure what to do.
I did a bunch with kohya's scripts earlier, they work quite well for the most part, still playing around with the settings
usually i can figure it out lol
Do you need the sdxl dev branch still?
delete completely from custom_nodes and then do a fresh git clone
I swear nobody knows this lol
afaik yes, the main branch is outdated by a couple weeks
I have so many QoL ideas. I guess I need to build some front-end helpers. I think it would be really nice, especially when you build compact builds, when you hover over a pin that it shows the source / destination node - maybe even highlighting the corresponding source / destination node with a border.
will try that as well
Yeah I didn't believe it at first either 😄
Yes, here is one I made to get cleaner white backgrounds.
Top is SDXL 1.0, bottom is SDXL 0.9.
Left is no LoRA, then 0.9 LoRA and then 1.0 LoRA to the right.
you also need to use sdxl_train_network.py, not train_network.py
Cool cool. Gonna get the branch now. Thanks! So it wont work with the GUI?
oh and dont forget to restart Comfy after updating
I used the bmaltais/kohya_ss version.
with bmaltais' gui? I don't think so unless they also have an sdxl branch, too much stuff to modify otherwise
I'm seemingly missing Text Multiline & Concatenate nodes, those sound like pretty basic node names, should I just reinstall my swarmui?
I have that installed, I can't find it though
bmaltais/kohya_ss version has SDXL support on the main branch.
ah I see
did you restart comfy after updating?
Can you please share your Lora config file?
of course, still no dice, trying more things
I have been using 2e-6 for checkpoint fine-tuning on 1.5 based models and have no idea what lr etc to use for xl lora.
I'm sorry I don't know how to install the missing custom nodes in SwarmUI. Didn't get it to work at all so I don't know how that works.
If in doubt ask the maintainer on his git page
Look awesome please share image with this workflow 🙏👍
ok!
if you use adafactor it will set the lr for you based on the initial one which is like 2e-6, it's pretty cool
it gets lower/higher depending on loss
Oooh, I see!
does kohya_ss work on AMD gpus?
do you need the vae or is it built in? I mean I know there is a vae, so whats the advantage to not having it?
I also have no idea about how to set the steps on Kohya. What exactly is epoch and repeat?
epoch is how many times all the datasets repeat, repeat is per dataset within an epoch 
not sure that's understandable...
Oooh
if you're using a single dataset epoch and repeats are basically interchangeable, some settings like checkpoint saving rely on epochs though iirc
So for checkpoint training I usually do number of images x 100
So for LORA I should, lets say I have 20 images
Config for that LoRA with paths removed.
I should do 40 epochs and 50 repeats?
There is no need to do repeats unless you need to balance your dataset.
depends on how many steps you want, that's probably going to be an overkill though
if you have one dataset that sounds equivalent to just having 2000 epochs
20 epochs 1 repeat is probably what you want to aim for.
But do 40 epochs and save every 10 to be sure.
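To make the epoch/repeat arithmetic concrete, here is a small sketch. It assumes the usual kohya-style convention that one epoch walks every image `repeats` times and that one step is one batch; the function name and the default batch size of 1 are made up for illustration.

```python
import math


def total_steps(num_images, repeats, epochs, batch_size=1):
    """Estimate optimizer steps for a single-dataset run.

    Assumption: steps per epoch = ceil(images * repeats / batch_size).
    """
    steps_per_epoch = math.ceil(num_images * repeats / batch_size)
    return steps_per_epoch * epochs


# 20 images, 50 repeats, 40 epochs: every image is seen 2000 times
print(total_steps(20, 50, 40))   # 40000
# 20 images, 1 repeat, 20 epochs
print(total_steps(20, 1, 20))    # 400
# 20 images, 1 repeat, 40 epochs, checkpoints saved every 10 epochs
print(total_steps(20, 1, 40))    # 800
```

With a single dataset only the product `repeats * epochs` matters for the step count, which is why the two are basically interchangeable there; epochs still decide when checkpoints get saved.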
I reinstalled everything and I got this now
It did not even pretend to create an image unlike the last time.
Setting up MemoryEfficientCrossAttention. Query dim is 640, context_dim is None and using 10 heads.
Setting up MemoryEfficientCrossAttention. Query dim is 640, context_dim is 2048 and using 10 heads.
Setting up MemoryEfficientCrossAttention. Query dim is 640, context_dim is None and using 10 heads.
Setting up MemoryEfficientCrossAttention. Query dim is 640, context_dim is 2048 and using 10 heads.
Setting up MemoryEfficientCrossAttention. Query dim is 640, context_dim is None and using 10 heads.
Setting up MemoryEfficientCrossAttention. Query dim is 640, context_dim is 2048 and using 10 heads.
Setting up MemoryEfficientCrossAttention. Query dim is 640, context_dim is None and using 10 heads.
Setting up MemoryEfficientCrossAttention. Query dim is 640, context_dim is 2048 and using 10 heads.
model_type EPS
adm 2816
making attention of type 'vanilla-xformers' with 512 in_channels
building MemoryEfficientAttnBlock with 512 in_channels...
Working with z of shape (1, 4, 32, 32) = 4096 dimensions.
making attention of type 'vanilla-xformers' with 512 in_channels
building MemoryEfficientAttnBlock with 512 in_channels...
left over keys: dict_keys(['cond_stage_model.clip_l.transformer.text_model.embeddings.position_ids'])
Killed
Does anyone know if SDXL support tileable / seamless generations natively? If not I'm looking for that and can hire someone too if anyone is interested. Thanks in advance.
it's not steps, it's how many times you train over the whole dataset, even 50 sounds overfitty at least naively looking at that
probably, yeah, 1000 is what people usually recommend as a good starting point
stuff like flip aug also factors into the number
That's why I said do 40 epochs, save every 10 and see when it works the best.
Searching the github it looks like a memory problem
And this is why 90% of Civitai LoRA are deep-fried.
But I have 12 gb Vram and 16 GB ram
Oh I see. I understand now.
I mean, if you have 1 image that's going to fry, sure
So for the config, I should leave it as default and choose adafactor with 2e-6 lr.
This one was 20 steps.
huh I dont see anything sdxl 1 on civitai yet, Id have thought someone wouldve uploaded something by now
dreamshaper
found the problem! Your Refiner strength is way too high! Saw a video of someone testing Refiner strength and the more it goes from 0 to 1 the longer the faces get!
oh yeah i can see it from his profile but not from search
It doesn't seem to load the refiner standalone
the option of tiling should be available still in automatic1111 am I right? still didn't test Im downloading now
I think you might have set 1.5 in the filtering.
You can change it by clicking the filtering icon on the right and choosing sdxl
no I specifically set it to sdxl 1.0 and I get 0 results (and I get some when I set it to 0.9)
does it crash when loading the model into ram or vram?
ah maybe some other setting actually because now I see it, probably just had a style on or something
It crashes when I add to queue, not when I selected the model
thanks I really don't know. I have been using a 2.1 model available on replicate to date. I don't know what I am doing at dev level so just need to find the right help.
will there be an inpaint model released aswell ?
that's rather weird, but I remember people had problems with memory when their page file was too low, might want to check that
I am using linux btw and I dont have problems running SD 1.5, what is the min spec for sdxl? I am also using xformers
you should be able to load the base model with 8gb of vram, but it will require around twice that to load into ram first
as in, around 16gb of cpu ram, but I'm not sure if it actually crosses that
that's why page file/swap issues matter in any case
Maybe I am running out of ram? I have 16 GB ram, couple of GB must be occupied by my OS
If you can afford it without any issue I do recommend upgrading from 16 GB RAM to 32 GB or higher.
It will make the SDXL experience so much nicer.
Is there any tricks I can use to reduce ram usage for now? I just want to experiment.
I just checked and I'm only using around 6gb for loading, strange
Very interesting
one thing you might want to try is --dont-upcast-attention when starting comfy
@upbeat summit are you here by chance?
also --normalvram if that's not set already
A problem with the low VRAM settings is that it is kind of leveraging your RAM and if that's also relatively low you will get nowhere, but sure try.
--use-pytorch-cross-attention (sdp) might or might not work better than xformers too, worth a try 
Yes
what folder do lora's go in in comfy ui's directory?
as per response to someone else with the same problem, might be worth asking the question at the maintainers git page
Only trick would be to quit anything you don't need right now.
Got anything here? Quit anything running
Quickbooks uses alot of computer resources or is the qb not quick books?
@visual glade I've got a request: Can we get a UI function to store/load values across multiple nodes for the current graph?
Say, sometimes I like the base model only, and sometimes I like base+refiner, and I have my graph set up to do either- but I have to manually click around to convert from one to the other. If I had a dropdown to do that at will, that would be ...comfy.
It should mostly concern itself with values on nodes, and their muted state, not position or anything extra.
Oh no, that's a torrent program lol
These are my tasks atm, then you can see what uses the most and quit anything using significant amount
Just use it like aspect 3:4 in positive prompt?
you select what you want from the dropdown.
requests probly should go in the discussions/idea section of comfyui's github, theyre tracked and responded to better there
what is a good number of steps for model and for refiner ?
Why ur letting Google use 5gb of ram?
@fleet veldt I swear this is a bot. They've only messaged once in the server and spammed reacted my reply.
It's as if it knew the last message I sent before I was timed out.
am not a bot..and try 839 times back before this even went public

then you were somehow very tentative.
Will I really buy a $500 GPU to generate cute women? ||kekking hard rn, I'm not even running it at the correct 1024x1024 size yet|| I am thinking about switching to comfy, but also worried about losing all those cross attention and lowvram optimisations :/ (RTX 2060, 6GB)
I know a man whos using sdxl comfyui with OOTB settings plus one of those no issues
last 3090 i got was a used\refurb just needed fresh paste. +$750
comfy should be way more optimized than a1111's from what ive read
my 1080ti with OEM cooler plus fitted with an EK waterblock was <£200 and runs just fine. A1111 or comfy, SD1.5, SD2.1 & SDXL
sure it's 1/4 of the speed of a 3090 but who cares
Wow, I'll probably install it this night then. So no optimization needed, it'll run better just by installing comfy's UI?
I do a bit more than just inference. and run huge batches during inference.. but gotta do what works for you. M40 wasn't enough so I kept throwing money at the problem 🙂
Just Playing
python3 main.py --dont-upcast-attention --use-pytorch-cross-attention and closing discord worked. Generation in 147 seconds. Thanks @polar epoch @fierce hollow
is any of you using an upscaler with comfyui and sdxl?
i really hope the next gen nvidia cards finally ship with more memory, last time they increased memory on the cards was like what Titan? Titan RTX half a decade ago?
```
UpscaleModelLoader:
- Value not in list: model_name: '4x_NMKD-Siax_200k.pth' not in []
UpscaleModelLoader:
- Value not in list: model_name: '4x-UltraSharp.pth' not in []
VAELoader:
- Value not in list: vae_name: 'sdxl_vae_1.0.safetensors' not in []
```
Anyone know how to fix this? I have the files but I can't seem to get it to locate them, I don't know which folder its trying to read them from but I've tried 'upscale' and 'upscaler' in the models folder
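The empty `[]` in those errors suggests the loaders found no files at all in the folders they scan. In a default ComfyUI install the folder names are, as far as I know, `models/upscale_models`, `models/vae`, `models/loras` and `models/checkpoints` (base and refiner both go in `checkpoints`). A quick sketch to check what is missing; the mapping and the function name are assumptions for illustration, and a custom `extra_model_paths.yaml` would change where ComfyUI actually looks:

```python
from pathlib import Path

# Assumed default layout: loader node -> subfolder of ComfyUI/models
EXPECTED_FOLDERS = {
    "UpscaleModelLoader": "upscale_models",
    "VAELoader": "vae",
    "LoraLoader": "loras",
    "CheckpointLoaderSimple": "checkpoints",
}


def find_missing(comfy_root, wanted):
    """Return the (loader, filename) pairs that are not in their expected folder."""
    models = Path(comfy_root) / "models"
    missing = []
    for loader, filename in wanted:
        if not (models / EXPECTED_FOLDERS[loader] / filename).is_file():
            missing.append((loader, filename))
    return missing
```

Feed it the loader/filename pairs from the error message and your ComfyUI folder, move the files it reports, then restart or refresh Comfy so the dropdowns pick them up.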
ok, due to shockingly popular demand, I am gonna be starting a patreon with hopes to further fund my AI research and provide even better things in the future
takemymoney.gif
Yep.
Don't go too wild on upscalers, or it'll chug on 64GB ram and 24GB video memory as a afternoon snack 
where can I put "CUDA_LAUNCH_BLOCKING=1" for comfyui?
well I have only 12GB of GPU RAM (4070) and 128GB of RAM. BTW which upscaling method are u using?
because with 4k ultrasharp it comes out too sharpened
Pretty small filesize for 16kX16k
8k x 8k is already like 350mb lossless lol
I've had great results on foolhardy. My go-to upscaler for basically everything. definitely worth trying
Will your upcoming SDXL workflow be locked behind a paywall then?
no, that will be live and for free on patreon
sorry
on github
anime prompt through an analog photo style filter
but more advanced documentation and future research will likely be behind the patreon. At least to some extent to try and support myself, as I am fully unemployed
I have to use .exr at work XD that size would go into the multiple gb
when ur just starting out it's better to do Ko-fi FYI
which one? Remacri Remacri Backup Mirror ?
Patreon is better when u have an established base of people following you
Wonderful to hear, looking forward to it. Thanks for your efforts!
and Ko-fi takes no cut of the donations
Is the base SDXL model capable of image2image? I assume so? I'm getting blurry results right now tho.
Will join in once your page is setup. All the best.
kofi is better
why not use somewhere like https://www.buymeacoffee.com/ rather than Patreon (which takes 8%)
@high skiff seriously consider Ko-fi if you're just starting out on a platform for donations; use Patreon once you're more well-known and have an established base of users willing to subscribe.
Yes the base model is capable of img2img
snap (almost) lol
I have had buy me a coffee up for a while, but I assume that people either don't see it, or its not a common platform that people have access to
Ahh thx. I must be doing something wrong then. All good. Thx
ko-fi is moar well known than that too xD
I am setting up a kofi right now
cool 
you do ? Its not obvious on your github ;o)
Curious what workflow you use to achieve that :P
scrolls to bottom and walks away shame faced
i don't upscale myself cuz no point. the current settings with AI produce an image that's the size I'd want to use. my workflow is only for 1024x1024
LoRA: I just tested 8 low quality (iPhone front camera) images with 20 repeats over 10 epochs. The results are insane for that input
I set one half up, but turns out they want your phone number, and as it's my personal and only number, i chose something else instead :P
I put it at the end to not seem desperate 
Patreon's gonna want it too, ain't it?
Ah, so it's not an upscale, it's simply just resizing the image
Not that i can remember when i set mine up.
here it is in the read me 😅
once @ the beginning and once @ the end lol and use a visual for it
heck, SDXL can spell "Donate"
you could make a donate visual/logo with it
transparent it out and make it fit nicely
I upscale with the "highres fix" equivalent where it resamples the image a bit. But i wanna use a proper sampler and redo the same steps for more details
alright, I will update things to be a little more forefront, now that I have more community support and legitimacy behind my name now
I have a new upscale workflow coming out
2048x reliable
Anyone know if there's a DML method of running SDXL?
Would appreciate any recommendations for ComfyUI arguments and settings to utilize for a 3060 with 6GB VRAM and 16GB RAM? The refiner is a no go in most of the workflows I've tried in ComfyUI.
Ye, don't just use a small link nobody will notice, consider making a DONATE logo
breh sick af prompt
alright, will do haha
what does this mean File "D:\stable-diffusion-webui-1.5.1-RC\stable-diffusion-webui-1.5.1-RC\modules\textual_inversion\textual_inversion.py", line 283, in create_embedding
cond_model([""]) # will send cond model to GPU if lowvram/medvram is active
File "D:\stable-diffusion-webui-1.5.1-RC\stable-diffusion-webui-1.5.1-RC\venv\lib\site-packages\torch\nn\modules\module.py", line 1501, in _call_impl
return forward_call(*args, **kwargs)
File "D:\stable-diffusion-webui-1.5.1-RC\stable-diffusion-webui-1.5.1-RC\repositories\generative-models\sgm\modules\encoders\modules.py", line 141, in forward
emb_out = embedder(batch[embedder.input_key])
TypeError: list indices must be integers or slices, not str
im trying to create a Textual Inversion
Can you tell me how to do this from a phone?
On your phone?
xD
uh.
it can spell DONATE fine kekw
Yes
Most likely you'll need to run it through Google Colab.
enjoy a small pizza ;o)
Is that an app?
It's not an app, it's a website.
Made with xl 1.0?
However, I'm not sure whether a Google Colab has been made for SDXL yet.
yep
Hey guys.
The guy speaking in russian is trying to figure out how to run it on a phone.
Is there a google colab link for SDXL yet?
I'm doing it through a different one; on the computer it's fine, but from the phone it doesn't work at all
I bet I could make a dope DONATE logo lol
I mean - the bots are still working
a lot of people in the chat here didn't know that when they just wanted to mess around
hells yeah
it can even pull stuff off like this
lol
With lora? Mine are all mutated with both versions 😅🥲
splitting the word up real stylized
corgi flowerfield donation? 🤣
mmmm perhaps
If you have the Automatic1111 repository set up on your PC, you can add --share to the command line to get a public URL you can access from your phone.
God google translate has gotten a lot better
Lol
u need to gen at a minimum of 1024x1024 or a ratio of similar size
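"A ratio of similar size" in practice means keeping the pixel count near 1024x1024 while changing the shape. A minimal sketch of what an aspect ratio helper might do, assuming a target area of 1024*1024 pixels and snapping to multiples of 64 (both are assumptions for illustration, not the logic of any particular node or extension):

```python
import math


def sdxl_size(aspect_w, aspect_h, target_area=1024 * 1024, multiple=64):
    """Pick a width/height near target_area pixels for the given aspect ratio."""
    ratio = aspect_w / aspect_h
    # Solve width * height = target_area with width / height = ratio
    width = math.sqrt(target_area * ratio)
    height = width / ratio

    def snap(value):
        # Round to the nearest multiple, never below one multiple
        return max(multiple, round(value / multiple) * multiple)

    return snap(width), snap(height)


print(sdxl_size(1, 1))    # (1024, 1024)
print(sdxl_size(16, 9))   # (1344, 768)
print(sdxl_size(3, 4))    # (896, 1152)
print(sdxl_size(21, 9))   # (1536, 640), for the ultrawide crowd
```

The snapped sizes are only approximately the requested ratio, which is the trade-off for staying close to the resolution the model was trained around.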
I am trying to come up with a new 1.0 release prompt that will be even more prolific than my corgi prompt
Language gaps are starting to slowly not exist.
Can't wait for real-time voice to voice translation.
new SDXL is 🔥 amazing stuff, anyone know if inpainting works yet? (on comfyui)
damn that armor is close to overwatch O:
what artist/style is it?
man, I could only imagine
This basically just a little bit of modification #✨|sdxl message
train an RVC on your own voice, feed a real time TTS translator into it
We already have RVC, which is an open-sourced voice to voice deepfake software.
Use GPT-4 or DeepL for translation
Yeah.
oh yeah, I love RVC haha
is better than GT
I hang out with their staff when I am not here lmao
I cannot afford GPT-4, so what I usually use
GPT-4 is free
is a locally-run LLM.
it's on Microsoft Edge
they were showing me a new thing they implemented just a few days ago
chatGPT is free, not 4
Oh you mean Bing-GPT4.
I have access to it of course
That doesn't make sense, since they're quite literally the same model.
if you didn't know
I do know.
They also add another model on top of it, prometheus.
prometheus+gpt-4
Ah.
it's a much better stack than just gpt-4 alone
That I did not know.
I've literally used this for all my tl needs for JP->en en->JP etc
it works for any language practically
and it's waaaay more fluent than GT
you can also ask for specific tones
like casual/friendly/for a discord message/etc
since some languages use diff words for diff tones
like JP you don't wanna use formal speak for discord xD
bing makes terrible custom dnd goblins!
I don't know why that has become my turing test XD but that's the golden standard I now judge LLMs by
like what kind of custom gobby do u want? actually this is sdxl not general, getting offtopic xD
good point 🤣
Base 0.9?
1.0 base, 1.0 refiner, 0.9 vae
Oooh
Wait in automatic1111 what did u use for the checkpoint? The base model or the refiner model?
i'll send u a dm for what i got with something i composed just now
How do i properly install missing modules for the 1 click AIO version of comfy? Added a few more addons to it
if it doesn't come with a requirements.txt
can just keep pip installing every time it fails?
like pip install numba
was suite should have a requirements file
if there is a requirements.txt, get to that and open that folder in terminal/git bash/your preferred cmdline
but idk if you can just pip install it with the portable version, might need to activate the venv
pip install -r requirements.txt
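The catch with the portable build is that a bare `pip` may belong to a different Python (system or Anaconda), so the packages land where Comfy can't see them. Running pip as a module of the interpreter that actually launches Comfy avoids that. A hedged sketch: the `python_embeded` folder name is what I believe the Windows portable zip uses, and the custom node path below is a placeholder, so adjust both to your install.

```python
import subprocess


def pip_install_command(python_exe, requirements):
    """Build a pip command bound to one specific interpreter."""
    # "python -m pip" installs into the interpreter that runs it,
    # not into whichever pip happens to be first on PATH.
    return [str(python_exe), "-m", "pip", "install", "-r", str(requirements)]


def install_requirements(python_exe, requirements):
    """Run the install; raises CalledProcessError if pip fails."""
    subprocess.check_call(pip_install_command(python_exe, requirements))


# Placeholder paths, shown relative to the portable folder:
cmd = pip_install_command(
    r"python_embeded\python.exe",
    r"ComfyUI\custom_nodes\some-node-pack\requirements.txt",
)
print(" ".join(cmd))
```

For a single missing module the same idea applies: run `python.exe -m pip install numba` with the embedded interpreter instead of a bare `pip install numba`.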
we are way past this but i left this genning for a while (and this is waaaay too big to be a logo tbh) but i thought this was badass lmao
that is pretty sick haha
results from my very low quality image input LoRA (8 images, 20 repeats, 10 epochs) All training images were similar to the photo of me, same background, same shirt. (my goal was to test the results on basic input that anyone can create on the fly. I used my iPhone front camera)
that looks great 🙂
here are some more
with kohya?
Yes.
wen Pixelass - Idiotproof Easy Step by Step Guide for Simple Lora Training release?
parameters are the same that I use with 1.5?
Interesting insight -> "{token} man" >> "{token} male person". If you use male person SDXL won't put you in images like "baking cookies"
It would add numba to anaconda's location on my C drive instead of within the venv, so as it's contained, it can't see it
@boreal bough what are your thoughts on 1.0 lora params? 1e-3 worked well for me on 0.9, but on 1.0 it looks way worse
or SDXL Lora's for Dummies manual would work too 😉
speed?
he makes way too bloated videos, but this is basically what I used as a base: https://www.youtube.com/watch?v=AY6DMBCIZ3A
Updated for SDXL 1.0. How to install #Kohya SS GUI trainer and do #LoRA training with Stable Diffusion XL (#SDXL) this is the video you are looking for. I have shown how to install Kohya from scratch. The best parameters to do LoRA training with SDXL. How to use Kohya SDXL LoRAs with ComfyUI. How to do checkpoint comparison with SDXL LoRAs and m...
You can port attention masking from lora_diffusion on github and use mediapipe selfie segmentation to create an attention mask to ignore the background. That works incredibly well
I don't remember but fast compared to 20 images + 40 reg images.
My own face loRA did not use any reg images.
LR?
how complicated is this lol
Hi, has anyone tried to copy paste the A1111 venv to comfyUI ? I'm on a very slow internet connection (around 300kB/s) so I'd prefer not to have to worry about it
LR, TE, UNET all at 0.0004
Base+Refiner+Upscale SDXL1.0
Not too complicated, essentially just this bit: https://github.com/cloneofsimo/lora/blob/bdd51b04c49fa90a88919a19850ec3b4cf3c5ecd/lora_diffusion/cli_lora_pti.py#L339C1-L363C1
I believe ComfyUI is not using too much space, but I could be wrong, I think I just unzipped the 1.5 GB file and that was it.
gotta make a venv & conda install & activate all the stuff then iirc
i'm not gud with portable packages/using python stuff on diff drives xD
i usually just keep all the stuff installed on my main drive and use any python stuff on my main ssd
HI there. Should I be waiting for some sort of official update to A1111 before I can use sdxl 1.0?
would that create a transparent background? basically what rembg would do?
update, its already supported
Comfy creates its own venv I believe
Sort of - but better. It's more like "don't learn this part of the image" instead of "this part of the image is black/white/whatever" which might be picked up by the model
thx, will play around with it.
still messing around with it. I can confirm that 1/16, 5e-4 will take about 200 hours to train. so no need to go there XD
what do you use for upscaling?
might be time to change scheduler
@high skiff Did you finalise your upscale workflow yet?
still working, goal release is in a few days along with 1.0 workflow, my kofi, and my documentation
got a lot going on ATM
Could you send me a screenshot of your ksampler setup? I want to try it on mine and see what's so different. Because the way I'm trying to do it is seemingly ass.
yeah, i can share in DM's, just please keep it between us cause its not finalized just yet
SDXL+Refiner
wait then how did he do this with 4e-4? #✨|sdxl message
RTX 5000 series rumored to be 32GB of memory for the flagship, we could be seeing 64gb professional cards 😮
where'd ya hear that?
Too bad they prolly won't come out until 2025 lol
Hmpf. Installed A1111 yesterday, and was able to generate 512x512 images super quickly (eg ~10 seconds). Today, it's more like ~10 minutes. I'm trying to figure out what the hell I did, or messed up, to make it slow down so much.
yup but still xD
after the 4000 rumors being proven totally false do you really trust some random rumor?
remember when 4090 was supposed to be 48GB
and make all the AI enthusiasts happy
lol
it's still like 1.66x faster which makes me jelly