#✨|sdxl
Subscribers don't use bots as a rule. That is for trials, which are an on or off deal. Meaning they disable trials quite often due to overwhelming demand.
no, i am talking about the time its taking to process it, 71s/it
no
oh, thats a lot xD
yeah
I think its ~5s/it for me with batch size 12
the estimated time is 345hrs ~ 14days
my batch size is 1
its random, quite possibly that returned result has the word astronaut in there somewhere, i dont know. I dont own lexica and i didn't create the node as I said yesterday so stop asking stupid questions please.
Fancy a random prompt collector generating random prompts, whatever next!!!!
as i used only 85 images, i used 1
ok sure,
sorry
doesn't matter. Do batch size as high as you can. But that won't solve the timing issue. Dunno whats wrong there
used these
- use lower rank. 32-64 is usually enough
- don't train the text encoder (in particular not with rank 256 X_x). Use --train-unet-only
- if it is still slow, try to use fp16 instead of bf16. It might be that your graphics card does not support bf16
its crazy that kohya_ss does not have a separate parameter for unet dims and text encoder dims. if you train text encoder, a dim of 1-4 is more than enough. For unet you need larger dims (12-64)
i followed secourses tutorial
ok sure
does 3080 (laptop) support bf16?
if you train unet only you can also check "cache text encoder outputs"
probably, but just google
i didnt get any results on google for that, so asked you
the text encoders are HUGE and training them with dim=256 is insane. You can get totally overfitted results on them even with dim=1
I also just googled and it says yes
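A quick way to check bf16 support locally instead of searching, as a minimal sketch. It assumes PyTorch is installed in the same environment kohya uses; the helper names `pick_mixed_precision` and `detect_bf16` are made up for illustration.

```python
def pick_mixed_precision(bf16_supported: bool) -> str:
    """Return the mixed-precision mode to try first."""
    return "bf16" if bf16_supported else "fp16"


def detect_bf16() -> bool:
    """Ask PyTorch whether the current CUDA device supports bf16.

    Returns False when PyTorch or a CUDA device is not available.
    """
    try:
        import torch
    except ImportError:
        return False
    if not torch.cuda.is_available():
        return False
    return bool(torch.cuda.is_bf16_supported())


if __name__ == "__main__":
    print("suggested mixed precision:", pick_mixed_precision(detect_bf16()))
```

If this suggests fp16, that matches the advice above to fall back from bf16 when training is unexpectedly slow.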
this?
but this one has diff context
when you have cache text encoder outputs checked, shuffle caption won't work. anyway, this is a highly recommended parameter for SDXL training
even with rank 64, should I train the text encoder? it isn't adding any new words or stuff, just adding the keyword (name)
training text encoder can help but it overfits extremely fast. So I would not train text encoder and unet together
how to disable text encoder training?
train only the unet. That should be enough
with 256 rank?
--network_train_unet_only is the parameter
just do rank 48 and increase if for some reason that does not work for you
thank you
ok
over here?
no. I think you can't do that in the GUI. You have to run the commandline
@dense chasm i installed kohya scripts can you help me use your config, how should I run it?
the readme is in chinese
how is the addon called for those node ui/control that everyone uses?
@ionic dragon try these settings
got one this error
is this kohyascripts?
yeah
how to run kohya_scripts, the readme is in chinese?
I"m using the GUI
oh, i even have the gui, thats what i was trying until now
lemme use your settings
does anyone know if there is a possible way that i can insert in the prompt the subject of the image and not generate that particular part but rather keep my initial image?
let's say i want to generate a juicy cheeseburger, but i provide an image with a cheeseburger.
so this is the config file?
can i give the parameter somehow like a studio photography of [init_image] resting on a wooden plate?
yes, you just need to fill in the folder etc
i am using image-to-image with masking, but sometimes i get a new burger image behind my burger image.
ok thank you
example of my problem.
so i want it to create only the background for this image, yet i want the prompt to know what initial image is.
my initial prompt:
In this stunning product photography scene, a clear glass jar emerges from a creamy, dreamy substance with swirls of splashing cream behind it. The cream has the perfect amount of thickness and texture, making it look light and fluffy.
To highlight the unique features of the jar and cream, Rainbow elements are merged with glassy textures that add an elegant edge to the overall aesthetic. Ambient light illuminates the scene from above, creating a soft glow that accentuates each element without overpowering any of them.
The background is framed by a clean white surface that gives the impression of being in a minimalist studio.
The color tone is understated yet vibrant; it pops just enough to draw your eyes into each element without becoming jarring or overwhelming. The balance of composition follows contemporary proportions that have been proven successful by professional product photographers over time. Overall, this scene showcases a strikingly beautiful product ready for consumption by anyone looking for something fresh with natural appeal!
so basically this keyword: a clear glass jar should be my init_image as a variable or prompt parameter.
maybe this is something of a feature request for the Stable Diffusion team.
the config file you sent doesn't train the text encoder, right?
Prompt? looks amazing
don't think so
thanks
i dont want to train text encoder, how can I do it?
"Amidst a sun-dappled forest, a weathered stone arch stands, overgrown with ivy and surrounded by vibrant wildflowers. Beyond it, a meandering cobblestone path leads toward distant, mist-shrouded mountains. A rustic wooden bench beckons, nestled beside a babbling brook. The scene evokes a sense of nostalgia and adventure, as if plucked from a fantasy epic. Light filters through ancient trees, casting enchanting patterns on the forest floor. This evocative composition captures the essence of a hero's journey, where every step holds the promise of discovery and wonder in a world brimming with untold stories."
is it possible when i use a lora for myself i add another lora onto it? like use 2 loras to for example pixelate me?
I got it from ChatGPT by asking this: "Describe a realistic photograph in a way that an image generation AI can understand, in around 100 words."
@west breach got this error
check this?
to not train text encoder, I think you add --network_train_unet_only to the additional parameters field
but I haven't done that myself
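For running from the command line instead of the GUI, here is a rough sketch that only assembles the command string and does not launch anything. It assumes kohya's sd-scripts with `sdxl_train_network.py` launched through `accelerate`; the three paths are placeholders, and rank 48 is just the value suggested earlier in the chat.

```python
import shlex


def build_lora_command(model_path, data_dir, output_dir,
                       rank=48, precision="bf16", batch_size=1):
    """Build a UNet-only SDXL LoRA training command as one shell string."""
    args = [
        "accelerate", "launch", "sdxl_train_network.py",
        "--pretrained_model_name_or_path", model_path,
        "--train_data_dir", data_dir,
        "--output_dir", output_dir,
        "--network_module", "networks.lora",
        "--network_dim", str(rank),
        "--train_batch_size", str(batch_size),
        "--mixed_precision", precision,
        # Skip text encoder training entirely.
        "--network_train_unet_only",
        # Only sensible when the text encoder is not trained;
        # caption shuffling will not work with this enabled.
        "--cache_text_encoder_outputs",
    ]
    return shlex.join(args)


if __name__ == "__main__":
    # Placeholder paths, replace with your own.
    print(build_lora_command("sd_xl_base_1.0.safetensors",
                             "train_images", "output"))
```

In the GUI, the same two flags would go into the additional parameters field, as suggested above.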
I'm getting 2.1 s/it on my 4090. Not sure if that's slow or not
with batch size of 4
will do both
is making a lora better than dreambooth?
I think it's just faster and produces a smaller file? but maybe a little bit less quality
thanks
i am just using your settings, and will check the speed
Ima try it in a pixelart/anime style
Post results here 👍
@west breach how many steps do you use to train LoRAs?
I do 10 epochs, save every 1 epoch and compare the results from each file to find the best output
how many repeats?
5
should be okay
I'm training on about 3000 images, but it's for a style
but isnt 17400 steps too much
im training a face
my gpu is performing worst on kohya, dk why
35s/it
try with 1 repeat and 5 epochs?
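To see how repeats and epochs turn into steps and training time, a small arithmetic sketch. It assumes the usual counting of images × repeats per epoch divided by batch size; the function names are made up for illustration.

```python
import math


def total_steps(images: int, repeats: int, epochs: int, batch_size: int) -> int:
    """Steps per epoch (rounded up) times the number of epochs."""
    steps_per_epoch = math.ceil(images * repeats / batch_size)
    return steps_per_epoch * epochs


def eta_hours(steps: int, seconds_per_it: float) -> float:
    """Estimated wall-clock time in hours at a given speed."""
    return steps * seconds_per_it / 3600


if __name__ == "__main__":
    # 85 images, 1 repeat, 5 epochs, batch size 1
    steps = total_steps(85, 1, 5, 1)
    print(steps, "steps")  # 425 steps
    print(f"{eta_hours(steps, 71):.1f} h at 71 s/it")
    print(f"{eta_hours(steps, 3) * 60:.0f} min at 3 s/it")
    # The 17400-step run from the chat at 71 s/it
    print(f"{eta_hours(17400, 71) / 24:.1f} days")
```

At 71 s/it the 17400-step run works out to roughly 343 hours, which lines up with the ~345 hours kohya estimated.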
are you sure it's not maxing out your vram? I got super slow it/s when it went over my vram and started using the shared ram
oh ok
Nice! Are you using base SDXL or any models/LoRAs?
my ram is being shared, as i am doing other tasks too, but not shared vram
It's using 1.5GB of shared gpu memory
you seem to be the person who can run my workflow without issues 🙂
I have some tweaks inbound
Yea. I absoltely love it! Great work
what does shared vram mean?
like 2 applications using the gpu?
It's using your RAM so it doesn't crash, but it makes everything very slow
oh, so thats the reason why its toooo slow?
how to optimize the settings you sent for 16gb vram?
Prompt styler? Does it modify / add something you predefined to the prompt?
hmm, what should I change in my kohya settings to make it use around 14gb vram?
what's shared gpu memory?
Yes. As an example i can add Pixel art as a main style and then as a supporting style anime or whatever you want and then it tries to generate something in that genre, which always worked. I used pixel art and anime to generate things like these: https://cdn.discordapp.com/attachments/1100170514670039070/1137010784598245376/Parameters_anime_artwork_isometric_style_silkpunk_isometric_shop_desig__pencil_graphite_vector_art_seed-0ts-1691154895_idx-0.png
are you running anything else that's using gpu memory? like do you have comfy or a1111 open?
no
just kohya
Cool, thanks! I'll have to look into that..
it's just using your regular ram so it doesn't crash but it's only meant for quick spikes in usage, as it slows everything down a lot
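One way to see how close a run is to the dedicated VRAM limit, as a hedged sketch. It assumes PyTorch with CUDA; `torch.cuda.mem_get_info()` returns free and total bytes for the device, and as far as I know that covers dedicated memory only, not the shared spillover Task Manager shows. The helper names are made up.

```python
def describe_vram(free_bytes: int, total_bytes: int) -> str:
    """Format dedicated VRAM usage in GiB."""
    gib = 1024 ** 3
    used = total_bytes - free_bytes
    return f"{used / gib:.1f} GiB used of {total_bytes / gib:.1f} GiB"


def query_vram():
    """Return (free_bytes, total_bytes), or None without PyTorch/CUDA."""
    try:
        import torch
    except ImportError:
        return None
    if not torch.cuda.is_available():
        return None
    return torch.cuda.mem_get_info()


if __name__ == "__main__":
    info = query_vram()
    print(describe_vram(*info) if info else "no CUDA device visible")
```

If the used figure sits right at the total while training, spilling into shared memory is a likely cause of very slow iterations.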
Trying that out. I now just need the links for these:
You havent added them to notes
atleast it looks like you havent
are you using batch size 1?
yep
I cant recall , i think the version you have has only 2 styles. the revised uses 3 . 1 that applies globally to all prompts and then the 2 you have now
not sure if a lower network rank uses less vram, try setting it to 64?
ok
impact pack IIRC, thought i had put the link in the notes box
Oh yea, didnt see that. My bad
theyre for the facedetailer ive added
Yea. Even tho i dont generate that much with faces ill take a look at the results!
ChatGPT + SDXL again "In the photograph, a sunlit cafe nestled on a charming cobblestone street invites a moment of respite. Tables with checkered cloths spill onto the sidewalk, adorned with steaming cups of coffee and freshly baked pastries. The aroma of espresso mingles with the melodies of distant laughter. Patrons, some engrossed in books or conversation, bask in the warmth of dappled sunlight filtering through lush vines overhead. The facade, a rustic fusion of brick and timber, exudes Old World charm. The scene captures the essence of leisure and camaraderie, a timeless tableau where everyday moments become cherished memories."
Your results are really impressive!
Too many / small people always get messed up :c
@west breach now it works, 3s/it
thank you
hooray! glad it's working 🙂
im sure you can figure out how to disable it if you dont need it 🙂
Yeah 🙂 Using this great workflow: https://github.com/SytanSD/Sytan-SDXL-ComfyUI
good old @high skiff 🙂
Hey that's him
It seems like that i installed everything but im still missing the node MMDetDetectorProvider
Did i forget to add anything to the right path?
it should be in Impact Pack
and i presume you restarted?
the comfy server not the pc
I stil want to get it installed. Just give me 5 minutes and i got it fixed
@soft zealot Could a face detailer fix the faces in my results ^^
Thanks for the workflow 😄
YMMV. This is Stable Diffusion. This is The Way
@soft zealot Got the error with it not being in the path fixed. Im just gonna do a reinstall and hope that its working then
@soft zealot Did you install the 4th optional option?
pretty certain the answer is no
but I havent eaten since 19:00 last night and its currently 13:20, just waiting to be called for an op to remove my gall bladder
Oh man, go eat! Wishing you luck for the OP that everything goes right
Im just gonna try to install only the missing node
maybe that will fix it
sadly I cant eat until after it, cant even drink water atm
Oh that sucks.
@soft zealot Found out why
MMDetDetectorProvider and other legacy nodes are disabled by default. If you want to activate these nodes and use them, please edit the impact-pack.ini file in the ComfyUI-Impact-Pack directory and change 'mmdet_skip = True' to 'mmdet_skip = False.'
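The edit described in that note can also be scripted, as a minimal sketch. It assumes `impact-pack.ini` is a standard INI file with a section header (shown here as `[default]`, which is a guess); the function searches every section for `mmdet_skip` rather than relying on that name.

```python
import configparser
import io


def enable_mmdet(ini_text: str) -> str:
    """Return the INI text with mmdet_skip set to False wherever it appears."""
    parser = configparser.ConfigParser(interpolation=None)
    parser.read_string(ini_text)
    for section in parser.sections():
        if parser.has_option(section, "mmdet_skip"):
            parser.set(section, "mmdet_skip", "False")
    out = io.StringIO()
    parser.write(out)
    return out.getvalue()


if __name__ == "__main__":
    # Hypothetical file contents for illustration.
    sample = "[default]\nmmdet_skip = True\n"
    print(enable_mmdet(sample))
```

To apply it for real, read the file from the ComfyUI-Impact-Pack directory, write the result back, and restart the ComfyUI server.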
what the hell
now im missing all of them like its on True
Odd I don’t recall doing that but then again I’ve had impact pack installed for a while
``error: subprocess-exited-with-error
× python setup.py egg_info did not run successfully.
│ exit code: 1
╰─> [6 lines of output]
Traceback (most recent call last):
File "<string>", line 2, in <module>
File "<pip-setuptools-caller>", line 34, in <module>
File "C:\Users\Edgar\AppData\Local\Temp\pip-install-ccnlm28v\aliyun-python-sdk-core_15345e11038e400faa83fabc496920f6\setup.py", line 41, in <module>
VERSION = __import__(PACKAGE).__version__
ModuleNotFoundError: No module named 'aliyunsdkcore'
[end of output]
note: This error originates from a subprocess, and is likely not a problem with pip.
error: metadata-generation-failed
× Encountered error while generating package metadata.
╰─> See above for output.
note: This is an issue with the package mentioned above, not pip.``
Might need to revisit the face detailed bit at some point
@soft zealot really weird that. After changing mmdet_skip = True to mmdet_skip = False, Im getting the same message like on startup.
Thats super weird
Could you send me your impact pack in your custom nodes folder?
That should technically fix it
Which is better?
Hi, have opt-outs been respected for the training of SDXL? The question, when I asked about it a few days ago, had no answer :/
Id choose left because less people
For me, right looks MUCH better at first glance, but then you notice how details are messed up
Left is more "accurate" but uglier
B is better overall I think
Is there an existing UI that uses ComfyUI in the background and is able to do inpainting and stuff?
Is it possible to run SDXL with 6 gigs of vram?
Currently using 6.1gb... maybe?
--medvram or --lowvram and you should be fine
Ah ty
comfyui has inpaint example
ComfyUI has an inpainting feature that can be used to fill in missing or damaged parts of an image. You can find more information about how to use it in the ComfyUI repository on GitHub
(1) Inpaint Examples - GitHub. https://github.com/comfyanonymous/ComfyUI_examples/blob/master/inpaint/README.md.
(2) problem with inpainting in ComfyUI. : r/StableDiffusion - Reddit. https://www.reddit.com/r/StableDiffusion/comments/12dn8jj/problem_with_inpainting_in_comfyui/.
(3) ControlNet 1.1 Inpainting · comfyanonymous ComfyUI - GitHub. https://github.com/comfyanonymous/ComfyUI/discussions/603.
Any inpainting workflow that is entirely in ComfyUI? That would be the dream, say you're happy with an image and you want to improve it, would be nice to send it to a second workflow where you can draw an inpainting mask directly in the UI, etc..
Another cool prompt, too many messed up people though 😦
"In the photograph, a bustling city square thrives under the midday sun. Skyscrapers and historical buildings converge, their facades reflecting modernity and tradition. Streams of people crisscross the scene, some immersed in their phones, others savoring street food from vendors lining the plaza. Fountains shoot arcs of water, creating a refreshing oasis amidst the urban frenzy. A street artist captivates onlookers with a vibrant mural in progress. The square's vibrancy is mirrored in the diversity of languages and fashion styles. This snapshot captures the heart of cosmopolitan life, where cultures collide and harmonize in a dynamic tapestry."
I found some examples of inpainting workflows using ComfyUI on GitHub¹. You can also modify the mask and the sampler nodes to get different results³.
(1) Inpaint Examples - GitHub. https://github.com/comfyanonymous/ComfyUI_examples/blob/master/inpaint/README.md.
(2) What am I doing wrong with my Inpainting Workflow?? - GitHub. https://github.com/comfyanonymous/ComfyUI/discussions/639.
(3) comfyanonymous/ComfyUI_examples: Examples of ComfyUI workflows - GitHub. https://github.com/comfyanonymous/ComfyUI_examples.
what the hell, my workflow settings are gone
i literally am at the basic nodes that were when i started ComfyUI for the first ime
time
@vivid tide Do you think that could work for fixing details like faces while keeping the same style?
I'm not sure, but I think it depends on the model and the mask you use. You can try different models and masks to see what works best for your image. Maybe you can use the anythingV3 model or the faceV2 model for inpainting faces.
same prompt "a cat", same model (unstabilityXL), same seed, 120 different styles
Do you think it perform better result than original base 1.0?
Did you generate that at one time?
Paintography 🙂
I think so. Could do it with the original base to check it, but it took half a hour to render. I used DPM++ 2M Karras with 30 steps plus refiner and took a while
running v1 of my finetune data through lora training right now x_x
Yes, used the "Prompts from file or textbox" script with an archive with the 120 styles taken from https://github.com/Douleb/SDXL-A1111-Styles. Reformated with the help of ChatGPT (that thing is amazing)
but how did you get it to generate 120 different cats plus styles at once
I said it. I used "Prompts from file or textbox" script in A1111.
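The reformatting step could look roughly like this, as a sketch rather than the script actually used. It assumes the usual A1111 `styles.csv` columns (`name`, `prompt`, `negative_prompt`) with an optional `{prompt}` placeholder, and that "Prompts from file or textbox" accepts one `--prompt "..." --negative_prompt "..."` line per image. The sample style row is invented.

```python
import csv
import io
import shlex


def apply_style(style_prompt: str, subject: str) -> str:
    """Insert the subject into a style, or prepend it if there is no placeholder."""
    if "{prompt}" in style_prompt:
        return style_prompt.replace("{prompt}", subject)
    return f"{subject}, {style_prompt}" if style_prompt else subject


def styles_to_prompt_lines(csv_text: str, subject: str) -> list:
    """Turn each style row into one line for the prompts-from-file script."""
    lines = []
    for row in csv.DictReader(io.StringIO(csv_text)):
        prompt = apply_style(row.get("prompt") or "", subject)
        line = f"--prompt {shlex.quote(prompt)}"
        negative = row.get("negative_prompt") or ""
        if negative:
            line += f" --negative_prompt {shlex.quote(negative)}"
        lines.append(line)
    return lines


if __name__ == "__main__":
    sample = (
        "name,prompt,negative_prompt\n"
        'anime,"anime artwork {prompt} . anime style, key visual","photo, realism"\n'
    )
    print("\n".join(styles_to_prompt_lines(sample, "a cat")))
```

Pasting the output into the script's textbox with a fixed seed gives one image per style for the same subject.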
Oooh, did/could you please share the workflow or an image that contains those nodes? 🙂
in theory, you could also use the infinite grid generator, then throw those into the prompt matrix
(stableswarmui)
Well, then i had to rename each image with each style using a python script chatgpt wrote for me, and pasted them in ten collages using photoshop. All automatized.
Sure! It uses WAS nodes only. Here's a screenshot. If you have any questions let me know. I can also strip it down for you later and send you the build, but it could take a bit because I need to finish some work right now.
It was an experiment that I'm still using ;). All this would make much more sense as a node.
But it's very flexible. You could potentially save any text information of a gen in a .txt file.
Vineyards!
Thanks so much, will look into it. I am checking out some other nodes and if I find one that does the trick, I'll let people know
there are a couple of nodes that can do txt files, but this way I could create a format that makes sense for my workflow. I'm using the metadata for my image management tool, so I can search and filter for prompts etc.
are there any good and non over exaggerated workflows for ComfyUI?
Them's fighting words! 🙂
its not like everyone sees to point to use like 100 nodes only to spit out 1-4 images for preview
Check Sytan's workflow out: https://github.com/SytanSD/Sytan-SDXL-ComfyUI
I plan on doing the same, somehow get the metadata into Lightroom but the data in the JSON is just too much. I'm gonna check out https://github.com/wallish77/wlsh_nodes because it may have some of the functionality
i may have put it the wrong way, it was not to insult others
thx 😄
Oh no, I got it.... I was making fun of those 400 node flows too 🙂
yeah I'm creating a xmp from the metadata text files with some custom scripts I wrote heh
see - I check every day for new comfyui custom nodes. I haven't seen this one - or forgot about it 🙂 thanks - I'll check it out
I'm not really happy with the filenames. no node offers a suffix. so I'm trying to build my own save image / save text node now. I want my filenames to be in a certain format
You're welcome, I haven't looked at it yet but will do so here in a minute and I hope it works. I managed to use ChatGPT to create a Python file that loads AUTO1111 metadata direct from the PNG file into Lightroom but it still needs some tweaking to fit edge cases. I can't imagine doing that for a JSON file and the insane amount of variables ComfyUI has
well it isn't really that hard. you just look for the node in the json data that includes your prompt and get that entry.
that is what json was made for - you can easily process the data
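A rough sketch of that lookup. It assumes the "prompt" JSON that ComfyUI embeds in its PNGs (readable with Pillow via `Image.open(path).info["prompt"]`) maps node ids to objects with `class_type` and `inputs`, and that prompts live in `CLIPTextEncode` nodes. Custom text nodes would need their own class names; the sample graph is invented.

```python
import json
from typing import List


def find_prompt_texts(prompt_json: str) -> List[str]:
    """Collect the text of every CLIPTextEncode node in a ComfyUI prompt graph."""
    graph = json.loads(prompt_json)
    texts = []
    for node in graph.values():
        if node.get("class_type") == "CLIPTextEncode":
            text = node.get("inputs", {}).get("text")
            # Skip inputs that are links to other nodes rather than plain strings.
            if isinstance(text, str):
                texts.append(text)
    return texts


if __name__ == "__main__":
    sample = json.dumps({
        "3": {"class_type": "KSampler", "inputs": {"seed": 42, "steps": 20}},
        "6": {"class_type": "CLIPTextEncode",
              "inputs": {"text": "a cat", "clip": ["4", 1]}},
        "7": {"class_type": "CLIPTextEncode",
              "inputs": {"text": "blurry, ugly", "clip": ["4", 1]}},
    })
    print(find_prompt_texts(sample))
```

This does not tell positive from negative prompts; for that you would follow the sampler's `positive` and `negative` links back to the text nodes.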
oh ok lol. No i dont need really much because SD isnt even my main gen AI tool. I use it as a supplement for specific needs
It's hopeless guys. I can't use SDXL 😦
Auto1111 - always running out of vram.
ComfyUI - takes 25-30 mins for 1 image.
I have GTX 1660 Super 6GB VRAM. It's over 😦
Oh, I get that... what I was thinking of is, if the number of nodes is variable, as are the configs for each one, it would lead to an overwhelming number of possible keywords and hierarchies for those keywords in Lightroom. Plus things like scoring as keywords... I set my script so Lightroom could have a parent keyword for the score and a subkeyword for the first decimal place because otherwise there'd be 100 dropdowns clogging the menu
yeah, true. you could prepare your workflows, give your node an unique name and than target that node for all your metadata extractions
depending on what kind of data you want to extract. I use prompts (clip_g+clip_l+negative+refiner...), model and seed. the rest is saved in the image's metadata anyway
maybe they will release some pruned version of SDXL.
we now have 2 models for SDXL, loras, fine-tuned models and if you look at some of the more elaborate workflows up to 4 - 5 models for one image gen (base diffusion+refining+upscaling+finalizing...). it's getting complex and crazy 😉
4x_NMKD-SuperscaleV2_2k.pth
4x-UltraSharp.pth
4x_NMKD-SuperscaleV2_2k.pth
4x_UniversalUpscalerV2-Sharp_101000_G.pth
or Topaz Photo AI
Yes. And i dont see so much improvement. So at the end of the day i find myself returning to A1111.
you'll see what you want to
Oh yeah it uses like 3,5 for me lol
Very nice
that clearly says 3 and 5 out of 6 and 0. #science
ultrasharp not bad,i use frequently
yeah, it's great that there are choices. You can build very simple or unique, complex and wild workflows with ComfyUI.
I really like the flexibility to build my own workflows. In A1111 you are more focused on your prompt and it's not like the hood is up and you start rewiring the whole machine ;). But I like that.
i download 4x_comic_dataset_115k and 4x_NMKD-Superscale-SP_178000_G if the latter on serves well for general purpose
classic 🙂
yeah I switch between _2k and SP_178000_G
Precisely, in A1111 i focus more on the final result. Comfy is great to learn and to experiment but for working i prefer A1111. Plus all the extensions and the scripts, img2img, inpainting, outpainting, etc. For now.
Love how this one looks
What was the prompt for this?
parameters
anime artwork isometric style silkpunk isometric shop design, combine steampunk elements with chinese mythology and motifs, bamboo, silk . vibrant, beautiful, crisp, detailed, ultra detailed, intricate . anime style, key visual, vibrant, studio anime, highly detailed
Negative prompt: deformed, mutated, ugly, disfigured, blur, blurry, noise, noisy, realistic, photographic, photo, deformed, black and white, realism, disfigured, low contrast
Steps: 20, Sampler: DPM++ 2S a Karras, CFG scale: 7, Seed: 3973761187, Size: 1024x1024, Model hash: 31e35c80fc, Model: sd_xl_base_1.0, Denoising strength: 0.75, Version: 1.5.1
Style: Pixel art and anime
https://www.mediafire.com/file/hypqlg6vafmdb4t/styles.csv/file
Ty
Yeah, a1111 is a complete package in many regards. ComfyUI will also get lots of new stuff in the coming weeks and months.
I'm mostly focused on prompt engineering and research. I was missing a couple of features that are really important to me like prompt editing and using fine-tunings right inside the prompt, but there are already ComfyUI custom nodes that are being developed with those features.
For me personally I would feel very restricted, locked in a cage if I would use anything other than a node based system like ComfyUI right now.
i think a1111 inadvertently evolved towards being fully integrated studio software. i don't think it's doing it very well though.
you're of the scientific type. I am more of the lazy bastard type that wants everything already done. 😄
holy smokes... you are using Comfy now??? Awesome!!!
comfyui was very much designed with a purposed goal. Be a phat assed back end node graph for this stuff. awwwww yeah.
paperpapsppsppapepwe papers
bruh
long
true
although the other one reminds me of the Nazi silhouette propaganda @cinder aspen
yes, I am. the main reason I started adapting to ComfyUI is because Comfy said there will be AIT support for SDXL
xD
i just have to find out how to actually use one of those upscalers
Advanced Infantry Training? lol
jk
i like thiss
I get ya
@spring fulcrum this is what AIT is
So that means you can train models? or LoRAs?
both
yes, also x3 speed (as I demonstrated in the video)
ok the upscaled is better
these are so pixelgame ready lmao
hella
Are there command line args to make it go that fast? How did you get 56 its/s
im hoping for nintendo 64 lora for sdxl
are fascism and nazism and their symbols actually censored in SDXL?
no? maybe? I don't know yet? Test it yourself?
gotta find out
AIT triples speed on newer cards without changing precision or changing outputs. I usually get ~20it/s on my 4070ti, but with AIT it's 3 times faster
seems like Sytans workflow also has downscaler for whatever reason so whatever you upscale also gets downscaled afterwards
I guess my question is... How do I use AIT? I've never heard of that before.
@visual glade is working to support AIT on ComfyUI. what I used in the video I demonstrated is a custom node that only works on 1.5 models
ohh
😦
Well I'm sure it will be out before ya know it.. Shit moves fast in these circles
prompt?
no
"stardew valley style, weapon sprites"
ah pixel?
yeah
that is low poly game art
wrong context sorry.
You misunderstood.
all good - I didn't want to correct you. was just stating the obvious 😉
This why I dislike the timestamp feature.
You would be able to understand what I mean if the timestamp was always visible next to the image at all times.
i like what discord has done with the image galleries when you post multiple in one message. but i think they should go a step further and when someone posts multiple images in a row, combine them into one gallery.
i bet people would hate the new change and find ways to break it though
i keep forgetting to do that myself. i like the feature but forget to use it
yeah but they should really give you a preview so you don't have to post cropped heads etc
some kind of crop control would be nice.
looks like slimerancher lmao
well it's a trade off. now it's more compact but images are getting auto-cropped. so now I'm trying to think every time before posting multiple images how to arrange them. a preview would really help 😉
It is
No it's always been like this.
Timestamps were always bad
I wasn't talking about the timestamps. just the image cropping that didn't exist 8 months ago, before the gallery feature was introduced
well as pretty much every company like this does, it will become worse and worse over time
but yeah timestamps are confusing
just cherish what it is now, because it'll be a little bit worse soon
The story of Discord
Discord is about to do MJ a favor so they apparently can make inpaint in MJ directly in Discord if i understood correctly
dunno how that is going to work out
really? that seems a bit cumbersome. but what do I know
anyone who gets to the point where they want to inpaint will pay for photoshop instead
discord can be a nice collaboration and community tool. almost browser / desktop app parity (electron isn't great, I know). the worst part about discord is their stance / approach to security and their unwillingness to improve it.
MJ's real competitor is Adobe and it doesn't look good for MJ
not if they want it all "automated" plus dont want to pay for Adobe sub
but on the other hand...
what about the new dall-e release?
that one dude that paid 260$ in one month for MJ
nice
thats more than i pay for Adobe CC, Autodesk and Maxon combined per month
easily
and eben Substance
even
I think I used up my mj tokens once and then never used it again. is that the one that uses tokens?
oh, well then that's what I did
photoshop is half the price of MJ pro
he paid 30$ subscription + extra 50 GPU hours
far more, bcs there is photo bundle of PS + LR for 11,99€ here
but i heard the argument from some that they dont want to pay for an overpriced product that wont even automate the work for them
I used to pirate the adobe programs, but then had a little window pop up one day offering all the adobe suite programs for 30 a month if I just stopped pirating stuff
i mean Firefly/gen fill is there dunno what this guy meant
plus in a few months all the fancy tools will be released either
I think MJ is just a gimmick that is successfully capitalizing on a small window of opportunity. I dont see them pivoting to much else. They'll never beat adobe.
happened in photoshop one day. we'd been playing cat and mouse for a while
so it was a good compromise
until MJ comes with their web UI AND some fancy tools like SD and Adobe FF have at this point they will still be far behind in that regard
Macromedia had better tooling than adobe at one point too
I'm loving all these anti-comfy memes on reddit. these people are a trip.
one thing is to be said tho, MJ trained on much more images and therefore also contains copyrighted materials. Firefly doesnt so you can forget generating celebrities, famous characters etc.
"why isn't this product I'm not paying for specifically catered to my wants and needs! I demand justicde"
They're so sulky . i think it's just a few toxic fans of the vlad and auto's UI doing it
well, I grammer errored, there, but you get the gist
it's so crazy to me
also comfy is hired by Stability now as i understand it, so there's all sorts of resent there
why are they hating ComfyUI?
emotional insecurities
because nodes scary!
one can argue that they dont need that spaghetti of nodes for their needs, but why the hate
if you dont need it, dont use it
or use it according to your limitations
I actually can't wrap my head around it. yes you can make a huge bowl of noodles and have 10,000 nodes. or you can run a few
because it's popular and they like what isn't popular. queue insecuriteis
the ego is a very fragile thing
I'm just having fun, watching my LoRA training material get prepared 🙂 (using my v1 LoRA to prepare my v2 LoRA) to dial in the style.
people gotta realise that there are several workflows that people want, need and use.
I just wonder what sort of lives they live. and my instinct would tell me it was young people or kids, but they probably aren't
I don't know, some people just don't know how to think abstractly or solve things on their own
my workflow doesnt have to be ideal for other people and vice versa
the context is important
but is yet ignored
this is true. take me for example. 40 and still a big kid
yes my age is a large number as well, lol
cozy cottages 😄
Hah sorry
nah, it's quite alright. I am not offended
i love it when they're in one post
Yeah 10 in 1 takes up less space
people who aren't technically familiar just hate to learn something new and they already spent a lot of time getting used to A1111
I don't really mind at all. I'm just wondering if something changed
it'll be funny when MJ releases their webui and it's just a rebranded auto1111 fork
and that's why everyone is doing it now
wasn't commenting out of annoyance, just observation
These people should really chill out when using bleeding edge software then imo.
I haven't really used discord much. tried to sign up like 3 years ago and it autobanned me for being a spammer when I hadn't even posted. maybe because of my vpn or something. and then it wouldn't let me use my email address to sign up
and then didn't really care to jump through hoops to use it
if they want accessible then there's services like dreamstudio or clipdrop or /gags, midjourney
well bottom line to me is if you're not paying a penny for the service, don't expect it to be catered to your wants and whims
when discord decided I was a spammer because I used a vpn I didn't go on a meme creation campaign to express my outrage. I just used something else that worked for me
blaming the world is so much easier than personal responsibility though
Any links for training resources?
for what specifically?
A LORA on SDXL 1.0 or a base model.
the same idiots that claim that other workflows than theirs are just "dogsh*t"
i think that's a wholly separate problem. ego and pride tied to these releases.
well if you're look for sdxl stuff go to huggingface, or prepare for some waifu nsfw fun and go to civitai
I have to wonder what sort of impact that website has on the userbase, lol
its more that the userbase has had an effect on that website, and it's become a feedback loop
the first time I went there I was quite perplexed, and a bit horrified. not really a prude about things, but my goodness
yeah
just didn't realize so many people took it to that level, lol
i stopped talking to people about generative AI when i realized they'd google it and see Civit and then think that's how i use it
when we train a lora, is it always the last produced lora is better than the ones produced in between?
i actually swore not to get into this (ComfyUI based workflow) kind of stuff before. Well i didnt swear but i said i dont want to get there
depends on the training parameters. sometimes i've found that the one made halfway through was the best. x/y grids help a lot for testing
and then even with open source language models similar things going on. it's just something that I missed I guess. I knew people were into anime, but not at such a weird level
this and all the uneducated people on art that think learning to do art properly takes supertalent and the process of doing it by yourself instead of using generative AI is horrible and not worth it and would take a tremendous time to get there
I haven't trained loras in months, and then only on 1.5. I'm assuming it's similar with sdxl?
I should probably look that up
so for that a1111 is better to test?
yup, that's me
back in my day you downloaded your mp3s from sketchy sources and you appreciated them
for x/y automation, i have not found something for comfy quite yet
Anyone else feel like an addiction checking to see if there are new fine tune checkpoints a few times per day?
i bet theres something
Could somebody do me a d0g$h1*e LoRA?! 🤩🥳🤠
what if god smoked cannabis by weird al.mp3
Weird-Al-Yankowitz-LoRA
yes, but the fact that it takes 5-10 minutes for them to load on my computer kind of tempers that a bit
you can do grids with stable swarm ui
@hardy cipher 5 to 10 minutes? Whaaa?
that's something I don't really remember why I remember
are you loading models quickly, crate?
Cilla-Black-LoRA-LoRA-laffs
checkpoints or loras?
Checkpoints
what gpu are you on?
x/y/z comparison?
yes
cool
It's an i7 and an RTX Quadro 4000
I think switching through checkpoints is always instant in comfyui
well if you can flip through them instantly I'm quite happy for you. but I have not had any instance in a1111 or comfy on either my pc or on colab in which they changed instantly
SDXL needs a lot more GPU VRAM, especially since it was only just rolled out
man he was supposed to be a nazi general with Hitler moustache O_O is there img2img for SDXL 1.0?
so I would like to congratulate you on your space age technology
SDXL .pt files at that!!!
I know, this guy is trashtalking because I said the models take a while to load
I think it was a bit of humblebragging
you'd better train your own small size lora model, a better choice
I'm not trashtalking... I honestly had no idea you could even have that issue in comfy... mine totally do lag in auto so I know what you mean
there are a lot of open source codes in github with minimized vram usage, good luck
as does anything
with all images i can find on Hitler, Donald Trump and nazi generals with their outfits?
ugh
never done that before
merge model,the attraction of SD open source imao
alright, you guys got me. my computer sucks and I should load everything instantly
Someone please remind me how to Symlink B:/Stable-diffusion to C:/Stable-diffusion ?
my specs arent ideal either but yeah, SDXL at least works
although not as fast as id like
I've never had a single complaint about it
but other people I guess think I should feel bad about it
next year or in 2 i will have a self built beast
ideally next year
i need it anyway
Anyway my point here was, what new fine tunes do you guys like? I'm already kind of done with the base
Admin cmd, mklink /d
model load time on comfyui is limited by your drive read speed
or your ram if it starts paging when loading it
I have 64 gb ddr4 and a solid state drive. not sure how much space it needs to work with though
Is that all? 🙂
my model load times caught turbo speed when i bought a new gen 4 nvme drive
my only bottleneck is the gpu. and since it's a laptop, not really a small task to upgrade that
you need to put the source and target after the command. But I forget which one first
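For the order: with `mklink /D` the new link path goes first and the existing folder goes second, i.e. `mklink /D <link> <target>`. Here's a rough Python sketch of the same idea. The folder names and the `make_dir_link` helper are just placeholders, and it assumes the real folder already exists and the link path doesn't. Note that `os.symlink` takes the arguments the other way round (target first, link second).

```python
import os
import tempfile


def make_dir_link(target: str, link: str) -> None:
    """Create `link` as a directory symlink pointing at the existing `target`."""
    # os.symlink(existing_path, new_link_name): the reverse of mklink's order.
    os.symlink(target, link, target_is_directory=True)


# Demo inside a temp folder so nothing real gets touched.
# On Windows this needs an admin prompt (or Developer Mode enabled).
base = tempfile.mkdtemp()
target = os.path.join(base, "Stable-diffusion-real")  # placeholder for the real folder
link = os.path.join(base, "Stable-diffusion")         # placeholder for the link path
os.makedirs(target)
make_dir_link(target, link)

print(os.path.islink(link))
print(os.path.realpath(link) == os.path.realpath(target))
```

Anything written through the link path ends up in the real folder, so the UI can keep using the old location.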
Idk my models load pretty quickly on my RTX4090ti, I have 1tb ram btw
agreed,self trained model load maybe time saving,more simple and well read,no mention some nodes not necessary
next year a RTX 4090, 64 GB DDR5 RAM or 128 GB if they finally fix the issue until then and i9 13900K processor. All that unless newer gen parts come out
idk, my models load alright on my dual a100 server
and i iwont have to think about that anymore
it's just the law of diminishing returns with high end gpus
next year?AI speed if you endure one month, one year for your life span
I just don't know if I want to drop 1500 dollars on one, lol
Fr tho it feels like shit is gonna change ridiculously fast over the next few years
thought about getting an egpu, but have to decide if that's even worthwhile
so turns out swarmui has nothing to do with stable diffusion, but it looks super cool
hm?
100%|█████████████████████████████████████████████| 20/20 [00:00<00:00, 24.38it/s]
Prompt executed in 1.50 seconds
That's SD1.5 safetensors with the model loading time included
computational power develops fast enough, nvidia invest a lot into this niche market
i'll probably upgrade to meteor lake when it comes out. 1) it's got meteor in the name. how cool is that. 2) it'll have ML features
https://www.intel.com/content/www/us/en/newsroom/news/ai-coming-to-pc-at-scale.html#gs.zl9826 not really a rumor. just not much info about it
Intel and Microsoft collaborate to advance artificial intelligence for Windows 11 PCs.
meatier lake
hmm, maybe I should feel bad about my inferior hardware
Would it be any better than a GPU tho?
Other than cheaper ram ig
it'll probably just open different doors
does ram even matter anymore?
Bruh
seems like ram just exists these days
Oh u mean normal ram
yes
Yeah get 16gb and u chillin
yeah, models can run off your vram ideally, or your cpu, or your hard drive, or your external usb thumbdrive
but the ram? nah, that's irrelevant
i just got 64 from 32 and im chillin
SD 1.5 has only one text encoder (CLIP ViT-L/14), whereas sdxl also uses OpenCLIP ViT-bigG
it's a trade-off between complexity and quality
or else i have to build 64 GB DDR5 RAM and later upgrade to 128
Mostly just seems to be the first step in getting good ai accelerators to the wider public, which is very nice
Does anyone know of any image to video models coming soon other than gen2 and pikalabs
they're calling it a neural VPU. vision processing unit i think. it'll probably work with things like depth estimation or motion vectors.
almost thought those were sketches until I saw the little binder rings on both sides of the notebook in the one picture
Haha, well, they are sketches, but, the prompt (I'm working on a giant prompt guide) is actually for on lined paper
But there's also on Bristol as well
tbh I don't know what's real anymore
real pictures look fake, fake pictures look real
Tried to create "Man's best friend", almost
Haha, yeah, IKR? I'm going into depth about it--should be fun.
DOGGOS 💖
deg
last one is a photo of doges
so if I render something with a 1.5 model, then run it through the sdxl refiner, which vae should I use to decode?
was supposed to be Harry Potter 😄
1.5 image -> SDXL 1.0 vae encode to latent -> SDXL 1.0 refiner -> SDXL 1.0 vae decode to image.
ahh, so I guess if you just send the latent strange things will happen?
they are different
I've come to realize doing things incorrectly in that regard really doesn't make pretty things
WW2 officer Potter
anyone has any lora workflows?
https://huggingface.co/frank-chieng/michelleyeoh/blob/main/sdxl1.0base_lora_upscale.json it's the basic base model chain with lora and refine upscaling workflow,the github comfy examples of lora for consideration,anyway it's a big trade-off between complexity and quality of final images
u-net and xformers should indulge in for a while,denoise is very interesting process
how to download the file
i dont see any option
you can download this image and drag it into comfy workflow
base model->load lora->base ksampler should work fine.
how about the refine model?sdxl is much more gpu vram loaded,the community is making every efforts on the new models now,no mention of diffusers,PR on the way
Do you mean using lora with refiner?
You might try Sytan 1.0 workflow which has upscale with 3rd ksampler using the base. It could also apply the lora on the 3rd ksampler to increase the lora likeness after the refiner.
setting up my ComfyUI workflow like:
it's good to know,Sytan workflow is great as far as i know
@mossy canopy The way that you set up the upscaler is fantastic. I don't use it often due to how long it takes but what I do notice is that it retains the texture and grit detail of the base image unlike all the other ones I've used.
(blurry wacky distorted:1.6) of elon face
wicked shit
To share a bit of my current workflow. It likes refiner guided noise (1~3 steps) -> [base + lora-> refiner] * N loop -> 2x upscale with base + (lora* 0.65 strength)
i grabbed a pack of various depthmaps from comfynet, and the depth model sarge released for XL really isn't accurate at all
I usually target maximum details with easy to understand workflow.. so anyone can tweak it to their needs including me..
wicked
okay,When everybody adds fuel the flames rise high.welcome
how are these?
i still need to improve them
the 1st one looks good
but 2nd looks very bad
@dense chasm can you guide me how to use kohya scripts?
it's a long story.i run on google colabs,if you have a 16-24G vram, you can run local well enough with kohya scripts,diffusers asks for more
i have 16gb
youtube has a bunch of lora training of sdxl as well,but not for more details
there arent many scripts, most of them are on gui
i see, a lot of developers confused or not give a shit of kohya cuz the original is japanese annotation, but we have google translation loll..
left is [base + 0.85 loha -> refiner] * 5 loop, right is the left -> 2x upscale -> (base + 0.65 loha) * 0.65 strength denoise. The upscale part really help to fix the eyes and add detail.
savior Jensen Huang
yeah true, can you please tell how I can run your file?
Here is the one I use:
kohya's documentation translated is 1000x better than documentation on the web extension side
I also notice loha works better than lora with similar training time (at least on a person).

pure python run on colab,anyway,if there is a will there's a way
i could photoshop his face into this
just roop'er
hmm?
roop a doop
if i had better spec i could generate much more images at once with a high speed
same man. same
didnt even know about this lol
i mean i enjoy doing it myself too but gotta try this one as well
doing it yourself has benefits too. roop always has a level of synthetic to it
Apocalyptic Barbie...
for comfyui i've set this and this up to use it with gens. https://github.com/ssitu/ComfyUI_roop https://civitai.com/models/24690/comfyui-facerestore-node
Roop has not been updated for a long time for ComfyUI. Does not play well with MTB nodes... had to delete it in the old end.
true, although one can do it with gen AI and then work on top of it with PS for example. Sometimes the effort isnt worth it and the generated stuff makes it only harder
had to go through that experience
i used it yesterday. updated last week on the link i gave. what's MTB nodes?
hm
the node i linked just takes an image input
bro what happened to you Jensen
the first link i gave is standalone roop
Jim did
@cinder aspen
Jensen = Jim + Henson = Jim Henson
I will check, but been using MTB instead, it has everything you need... https://github.com/melMass/comfy_mtb
i dont think that an animation toolset being incompatible is a reason to tell someone not to bother with roop.
especially when the context is a single face swap
That's not what I did... why diss me if I'm telling you my experience... use it, or dont...
So, am I an office numpty or, is this just not possible? In the example ComfyUI SDXL workflow here: https://comfyanonymous.github.io/ComfyUI_examples/sdxl/, the STEPS node is a primitive and the output is determined when you connect it to a follow on node, in this case the steps input in the KSAMPLER. For the life of me, I cannot get that output to connect to a node that ALSO accepts an integer. Has anyone been able to do so? It looks like you can't even add a reroute to it after it's connected to steps, which is odd. The unattached node is what I am trying to connect to. Any ideas?
misinformation irks me. people confidently saying something like "roop hasn't been updated for a long time" when the github shows consistency. makes me all dissy.
perhaps i misunderstood your point. your reply was to someone expressing interest. but like you said, use it or don't . it's just another tool.
I gather what i'm seeing in A1111, while using SDXL. Is the VAE decoder hitching?
It gets close to finishing a render and the progress bar pauses at 80% and then completes the image after a few seconds of waiting.
Does anyone else have this issue?
I wonder if A1111 could fix this in the future... 🙃
You must have gotten up on the wrong side of bed... was trying to help... I really did have a bad time trying to run both... so I hadn't seen that it was updated... just take from it, don't run both at the same time, I will check now if the issue was solved... just don't make me your punching bag pls.
🙄 .
I see there are 3 now... do you have experience with the other 2? All referring to roop...
you're not showing me that roop lacks updates if you think that's what you're doing
Error still persists... I will try the other.
its not about me or you. the information you gave was that roop hasn't been updated for comfyui towards someone expressing interest. time and a place for your complaints.
seems like its more of an mtb developer problem too. one would expect them to be compatible with other tools. not the other way around
Ok I'm out... not here to be misinterpreted and reprimanded!
Part of it may be that ComfyUI gets pushed by SF heavily, even for a target audience and for workflows that would work better in a solution like A1111 or SD.Next - in the next step SF even tries to offer a similar frontend. Some of the staff also talked down other solutions - things like that will cause a reaction, even without fanboys. The main problem I see is that if all tools are controlled by SF they can push censored stuff, watermarks, etc. even harder, because their own tools - the new defacto standard - will not make them optional.
With comfy, I get awful eyes
the tools are open source. this is just conspiracy theorizing garbage
impactpack has a great detailer node
sure, but 99% won't use forks, just the thing some youtube influencer showed them.
why be concerned about watermarks when also expecting every image to have metadata?
metadata can be removed easily. watermarks can't without reducing quality.
open sourced. watermark can be removed with one # character probably
If one starts at step 0 and stops at step 31 why will comfy only do 31 steps?
i mean at a later time. not by editing the code or forking
we're also talking about a denoising deep learned system. wouldn't be hard to train it to detect and flawlessly remove the watermark either. It's just all such conspiracy hubbub. usually comes back to world government tracking us all nonsense.
how many steps should it do when end of steps already set at 31?
32 if my logic is correct.
especially since the watermark would be created by open sourced code. teaching a system is even easier when you have a fundamental ground truth to provide it
With SF you mean Stability AI itself?
this is the most stupid explanation I ever heard 😂
the server is called stable foundation so i guess they mean stability
So people are mad that the tool is eventually landing in their hand instead of "the people"?
it seems people don't know what open source means
step 0 didnt do anything, I think
it's a conspiracy to track us and control us and the 15min cities!!
Smells like radical/militant FOSS people
tl:dr
many ppl even think that open source mean free
once you are the market leader and all standard tools are in your control, you can influence things a lot and also change the rules.
overall i just wanted to mention that things like that can cause reactions - i myself use comfy without much hesitation. (at least for workflows like txt2img that work better than other solutions)
hmm yes. recognizing that the licenses the code is under are free and permissive is radical fanatic behavior. ||/s||
Adobe wants to know your location
That's odd.

https://github.com/comfyanonymous/ComfyUI/blob/master/LICENSE iz a giant global conzpirazies guyz
Might be wrong. I am not confirmed
there is no control in open source tools. Everyone can make his own forks
gonna trackzx you!
I mean people that react like idiots for every single sht that in their eyes endangers their "autonomy" and "free stuff"
There is 4 face swap nodes you can try... MTB and the original Roop and 2 other... Haven't tried it.
is it generally understood that more sampling steps creates more weird skin folds and errors? or is it just me.
guyz lets make a standard to control all the other standardz!
Stability tries to control our mind by releasing SDXL. We should use SD1.5 which is safe. Oh, SD1.5 was also released by Stability. We should use MJ instead
no 1.5 was runway ml who won't release gen 2 now
yes, but most people don't. so controlling the standard fork still gives you much control. just as people still use a1111 for sdxl, even if sd.next has much better implementation at the moment.
how's that control going for a1111?
hmmm, you are right.
so you expect other people to implement tools for you and at the same time blame them for doing so?
quite well, if you see that many influencers still advertise it over sd.next, just because a1111 generates more clicks.
maybe just learn programming yourself
but 1.5 was released by runway ML, stability didn't even want to release the model 😒 ||/j||
i was just giving one reason why there is resistance to the comfyui push. pushing always results in resistance - that's literally a law of physics (and life in general).
When you are having a good time regardless
thank you 🙂
I love comfyui because
- it works and doesn't crash on my system
- if I break a setting I can just load my original JSON and everything is back to normal
it is time for my own good to shut down my SD terminal 😭
I need to do actual work on my computer and not accumulate anymore GB of images until I've sorted what I have
anyone knows a way to fix hands?
nope
😦
no one knows how to fix hands
I think it doesn't end at that step, it ends before the step
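On the 31 vs 32 steps thing, here's a tiny sketch of the counting. It assumes the end step is exclusive (like Python's `range`), which is what the behaviour described above suggests; I haven't checked it against the ComfyUI source, and `steps_run` is just a made-up helper name.

```python
def steps_run(start_at_step: int, end_at_step: int) -> int:
    """Count sampling steps, assuming the end step is exclusive (an assumption)."""
    # range(0, 31) yields 0..30, i.e. 31 values: step 31 itself is never run.
    return len(range(start_at_step, end_at_step))


print(steps_run(0, 31))
print(steps_run(0, 32))
```

Under that assumption you'd set the end step to 32 if you really want 32 steps starting from 0.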
stable diffusion models don't understand anatomy
not just SD models
well tbf neither do I
well Midjourney in some way fixed them
me neither but i have to learn anatomy
well I think if you start creating a more hybrid scenario as far as models are concerned it might be taken care of. but visually I don't know if it's possible to really fully grasp the 4 fingers and a thumb thing
sdxl does a lot better it seems
is DDIM_eta just 'eta' ? is there any custom node that would let me paste in whatever variable I'd want to try and expose? https://github.com/comfyanonymous/ComfyUI/blob/c5d7593ccfb4dd3a97175e01b9fa883086f5d8b4/comfy/ldm/models/diffusion/ddim.py
problem for me is not 4 fingers, but distorted
that too
😄
one reason why comfyui is being "pushed" is just because if a1111 has that much trouble getting a simple base + refiner pipeline implemented properly there's no way they are going to be able to keep up with what's coming next
so it's better for users to start moving to tools that will keep up
Agreed. I think there was just people panicking over the thought of using tools that are more complex/complicated.
The community will eventually get both, with ComfyUI and a more basic UI alternative. Whether or not A1111 can catch up, it will just be a matter of time before things rebalance again.
Another thing is, some individuals tend to believe that they are being manipulated by big corporations to use or do certain things. (This isn't the case here) But that is what some of them may have thought.
for some people it can be suboptimal, tbh for me its overkill to use ComfyUI but i use it more for fun or supplementary for my main gen AI tool
but trust me, there are more complicated stuff than this
also, it doesnt have to be a spaghetti of nodes
you can also have a simplistic workflow
yeah, I feel like the meta for the future of this community is to use ComfyUI as a backend, and like a fancy UI with a bunch of features as a frontend. this way you get maximum efficiency and still have a chill interface
Indeed, Gradio seems like a major bottleneck.
30.58s without / 25.54s with --gpu-only - that's a lot
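Quick arithmetic on those two timings, just to put a number on "a lot". The only inputs are the 30.58s and 25.54s figures quoted above; the variable names are mine.

```python
without_flag = 30.58  # seconds per run without --gpu-only (from the message above)
with_flag = 25.54     # seconds per run with --gpu-only

saved = without_flag - with_flag
pct = saved / without_flag * 100

print(f"{saved:.2f}s saved per run, about {pct:.1f}% less time")
```

That works out to roughly 5 seconds, or about a sixth of the run time.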
Either way. It's great that we have both. Really great. We have just got an issue with some UIs that are still heavily relied on needing to catch up. Which i admit must be pretty irritating for the SAI staff to say the least. Something will be done eventually. I'm sure.
people got too much in comfort zone, they better not come up with "adapt or die" because they are already dead by now
I want to keep things more simple since some of the Comfy workflows are already messy/busy. Do I need to think about the 0.9 VAE? like is it that much better to use it? vs just the 1.0 base and refiner as they are?
yeah for the people who don't want to deal with the nodes someone just has to take a good workflow and make a nice UI that just sends that workflow to the comfyui backend and just shows the generated images
this is how it should be
its good when there are multiple ways to work with a tool including having different UIs
but the critique remains, people got too lazy and too much in comfort zone
foreshadowing degeneration
I think the best idea is to have like a frontend UI that has all A1111 features and is capable of changing the ComfyUI workflow for more customization
I mean yeah, there should be a frontend with the most commonly adjusted settings visible, and have options to select the most frequently used workflows or whatever. and then advanced settings etc. for adjustments. that is how lots of programs work. finding that balance. currently, even though it has helped me understand how it works and is fast, comfyui doesn't have this. Isn't there some kind of Stable Swarm thing that uses Comfy as a backend?
heck, maybe even stable studio is capable of that
the biggest issue I have when loading a new workflow is, without destroying the neat arrangements, I can't track what connects to what
maybe I'm missing a shortcut or something, like when I select an output/input to a node, to tell what the input/output is that is connected
so I end up just dragging stuff apart for the first time when I'm figuring out what the workflow is actually doing. then I'm like Aha! then I make it neat again. which is okay I guess
but I've only dealt with 2 workflows or something. what happens when every fine tune has it's own, which appears to be the case?
yeah you can just reload it after
yeah I do that.
what's the easiest way to try out a sdxl lora I just trained if I can only do it online?
makes me not want to try new ones though that is for sure haha. I'll let the fine tunes mature and see what is most popular first
also, is there a feature to have like ctrl+z in comfy? I feel like that would be insanely useful when making workflows
but I do enjoy how ComfyUI allows me to better understand how the internals work. Since I'm already on the path to understanding that haha
like ComfyUI is somewhat similar to making game shaders and materials and whatnot. all of those have a ctrl+z mechanism for when you fuck up and want to go back
one thing I can't figure out is why some workflows won't load for me. I can load most of them. and then I can load most of the images from a1111 that I've tried and it'll show me the workflow I used there. but then some workflows just won't load. and some images won't load their workflows. and I can't really figure out why
anyone?
colab maybe?
I literally can't run comfy on colab. I deleted it all from my drive, started over, reinstalled. and that damn ip address/password is always wrong. I have no idea what the deal is
use colab
use stableswarm
yeah, i have that setup. haven't started using it yet
also, I googled swarmui yesterday and that's a github for robot swarms
fyi, if you're into robot swarms
if I have dual 4090s, should I be trying StableSwarm? or just sticking to ComfyUI more directly? I don't know much about StableSwarm, is there an official blog/article about it from SAI? Thanks
dual 4090 just for SD?
i'm not $$ enough to have A100 etc.
nah, you need an H100
no, I have it for other reasons, I just have it. normally I just set primary as #1 instead of #0 as I might be gaming while training on the other,
but say I Want to do 100 images, then it might matter
anyways it isn't a big deal
I gather what i'm seeing in A1111, while using SDXL. Is the VAE decoder hitching?
It gets close to finishing a render and the progress bar pauses at 80% and then completes the image after a few seconds of waiting.
Does anyone else have this issue?
I wonder if A1111 could fix this in the future... 🙃
.
I posted this earlier, but i don't think anyone saw. I gather this issue isn't fixable yet?
(I have a 3080, btw.)
1024x1024?
maybe I should switch the localtunnel in the colab to that cloudflare thing others are using then
hmm, not sure. if it's in a1111 there's all kinds of fun stuff going on in there
has anyone else had the same issue I mentioned? I'm just wondering if it's something on my end or what's going on
but really expected it to work when I reinstalled. just odd
Nvidia is going to build a supercomputer with 4.600 H100 to run AI. Skynet on steroids.
really? well if so, that's a lot of something they'll be able to do
not sure what, but something
how much improvement can we observe from a lora trained at 4k steps and 17k steps
I've had them get worse with too many steps
but not sure what method you're using or if it would work the same
main thing is there should always be progress as far as the model's accuracy. once it stops getting ore accurate then it starts overfitting
*more
what are you using to train it?
face
I meant what method
but when is overfitting?
70 steps are enough?
there are countless options as far as this stuff is concerned. but think of it like you're carving the training into wood or something. so up to a point you're just creating concept, adding the details, refining them. but then get to a point where you can't really train it anymore. it knows all it can know from your data. so after that you're just carving into the wood deeper
but it's not learning anymore
I mean, it is sort of learning in a sense, but it's not learning in a way you want it to learn
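A toy sketch of the "past a point you're just carving deeper" idea: if you score each saved checkpoint on images it wasn't trained on, the score usually improves for a while and then gets worse again, and the turning point is where overfitting starts. The loss numbers below are made up, and `best_checkpoint` is a hypothetical helper, not part of any trainer. In practice people often just eyeball sample images or an x/y grid per checkpoint instead.

```python
def best_checkpoint(val_losses: list[float]) -> int:
    """Return the index of the checkpoint with the lowest held-out loss."""
    return min(range(len(val_losses)), key=val_losses.__getitem__)


# Made-up held-out losses for 6 saved checkpoints: improving, then overfitting.
losses = [0.30, 0.22, 0.18, 0.17, 0.19, 0.24]

best = best_checkpoint(losses)
print(f"best checkpoint: #{best} (loss {losses[best]})")
print("later checkpoints are worse:", all(l > losses[best] for l in losses[best + 1:]))
```

So the last checkpoint isn't automatically the best one; here the fourth one wins.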
I tried training a lora with 5600 steps, but results are poor
very poor
while with SD 1.5 were great
what method? steps are arbitrary, depends on the method you use, learning rate, etc
0.0004
constant
this
it also took 4 hours on a 4070.....
loss at the end was like 0.15
I don't know. I'm sure someone here knows a lot more than me. I'm just telling you what I know about the concept in general. haven't tried training anything on sdxl
the colab now has a cell to use cloudflared
its voodoo. throw some chicken bones at it
whoa, dude. that's awesome! thanks!
I was going to add it eventually since localtunnel is a bit annoying with the ip/password thing
I've been a bit crippled with trying to do too much with sdxl using my 6gb of vram, lol
same with me and my RTX 2070 on laptop
yeah, I'm still confused by why it wouldn't work for me. makes no sense to me, but that's not uncommon for me
took me a few minutes to get 4x base rendered images + 4x upscaled images
at the same time
need to try out some lora training now
it actually doesn't do a terrible job considering. but once I get past 1024x1024 it gets mad at me
well I can upscale, but then sending it back through to render anything becomes a real grind
does anyone run loras or skip steps on the refiner model or the negative conditioning?
curious about running loras on the negative conditioning, seems like it could be strategically beneficial at times
my negative output image is pretty legit. not sure what's going on there
hmmm... running with --gpu-only after a couple of generations GPU stays at 100% while generating but it's not progressing. 100% gpu but progress is frozen... time to investigate
that's no good
yes, but when it works it's faster (30s without / 25s with - for my current settings) but something is apparently wrong heh. it doesn't happen without --gpu-only
your dim is unnecessarily high and you should not train the text encoder with that
you probably totally overfit
train unet only
SDXL is easier and faster to train than 1.5. I got much better results with SDXL Loras than with 1.5
Do you have a good json? I'm training a style not a person
just add the option to prevent text encoder training
depends, sdxl has much more parameters, so it takes longer. But it needs way fewer steps to learn something
can you share a sample json to train a style with kohya?
@midnight shuttle "xy_grid.py" is either no longer used or something.
the new file that doesn't cause problems is xyz_grid.py
so i disabled/removed xy_grid.py and no more error.
I'm going to try to load that json on colab in a bit to investigate. curious to see if I can decipher what's going on
I mostly just trained dreambooth models before, but the entire checkpoints. then figured out how to extra the loras from those. but haven't tried anything with sdxl
*extract
tried the same exercise with 120 styles of cats like my previous post (search for it to compare) this time with Dreamshaper 8 (SD1.5 based) not even close to sdxlUnstableDiffusers_xl quality
Gonna be training a model on 5000 images, any recommendations for dreambooth training? Do I need to use naturalization?
kitties
Learning rates, epochs, all of that im not familiar with model training only loras
I was using like 5-20 pictures with dreambooth, lol
If the learning rate is too high, the training can "overshoot" and miss the optimal state. If the learning rate is too low it will take forever to train the model. So having a learning rate that's a bit too low is better than too high.
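A toy illustration of that, nothing to do with real SD training: plain gradient descent on f(x) = x², whose minimum is at 0. The three learning rates are picked only to show the three behaviours and are not recommendations for any actual trainer.

```python
def descend(lr: float, steps: int = 50, x: float = 1.0) -> float:
    """Run plain gradient descent on f(x) = x**2 and return the final x."""
    for _ in range(steps):
        grad = 2 * x       # derivative of x**2
        x = x - lr * grad  # standard gradient descent update
    return x


for lr in (1.1, 0.1, 0.001):
    print(f"lr={lr}: x after 50 steps = {descend(lr):.6g}")
```

With lr=1.1 every step overshoots the minimum and x blows up, with lr=0.1 it settles near 0, and with lr=0.001 it has barely moved after 50 steps but is still heading the right way.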
does anyone have a guide on creating a sdxl checkpoint?
I was using 0.001 for loras but I figure that's far too high for model training
Is there any way to do gradient accumulation?
Hyperparameters such as learning rate can really only be optimized by trial and error. Try a few different rates and see which one produces better results.
I feel like a bit of a dumbo asking, but what's the most efficient method for installing extra node packs in comfy while using colab? should i install the manager? thought about just copying them into the folder but then I can't imagine that'll work out too well
@nimble heart This sucks if true. https://wccftech.com/amd-rdna-4-navi-4x-lineup-rumored-to-not-include-any-high-end-gpus/
Focusing resources on Instinct maybe?
I bet so
MI350 or 400
soon enough gamers will be glad to get any gpus. response from the industry...shut up and go console
Console or cloud streaming. Cloud streaming is the way of the future. Full control of everything and no piracy.
Yep, except too many have not better than dial up
if its true i bet then it just means the 8990xtx will be another rdna3 . but i did predict AMD would exit out of the high end gaming market now that intel is coming to town
It will be crazy if Intel becomes the GPU leader for consumers. I hope they will also have some good AI support.
i dont think they're ready to exit high end yet and this rumor is just suggesting rdna4 is mobile focused
in comfy's Colab there's the section with all the model downloads you can uncomment. one of those lines is to install a custom node for controlnets, just modify that with the git url you want
I would bet on intel more than AMD since their code is better
brilliant. thanks.
I made a Workflow for Image2Image using SDXL. It's mostly automated, will be making a few tweaks to it based off suggestions (first will be an auto image scale adjuster based off the input image).
The steps across the models are automatically adjusted based off your selected 0-100% denoising value.
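One way such an adjustment could work is sketched below. This is a guess at the general idea, not the actual math in the workflow: `total_steps`, `base_fraction`, and the base/refiner hand-off rule are all assumptions made for illustration.

```python
def plan_steps(denoise_pct, total_steps=30, base_fraction=0.8):
    """Map a 0-100% denoise value to (start, end) step ranges per model.

    Assumed rule: lower denoise skips more of the early steps, and the
    base model hands off to the refiner at a fixed fraction of the schedule.
    """
    denoise = min(max(denoise_pct, 0), 100) / 100.0
    start = round(total_steps * (1.0 - denoise))   # steps skipped up front
    handoff = round(total_steps * base_fraction)   # base -> refiner boundary
    base_end = max(start, handoff)                 # base may get zero steps
    return {"base": (start, base_end), "refiner": (base_end, total_steps)}

for pct in (100, 50, 10):
    print(pct, plan_steps(pct))
```

With these assumed defaults, a low denoise value leaves the base model with nothing to do and only the refiner touches the image.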
So while this doesn't offer total flexibility in being able to adjust your own steps, it's a pretty fool-proof workflow.
Any other suggestions are welcome.
May integrate a random prompt option to my flow, I've seen some great stuff with that as well
i am using a1111 and the one button script. there should be some node for it in comfy too.
A fun extension could be a big board of switches, knobs, sliders that are used to directly generate weights without even using CLIP. So you would adjust all the controls and see what comes out.
I'm gonna be taking a look at the frontend code soon and seeing if I can fiddle my way around the error redwalls that come with switching off nodes but leaving input connections.
If I can figure that out, could make some nice toggle workflows
comfyui is basically like a modular synth environment, but with ai stuff instead
Yes but it still uses prompts. This extension I am talking about would only be for fun not for practical use. Just a fun way to generate images by trial and error and not using any words.
well there might be a way to set it up so it effectively scrolls you through the latent space. not sure how all that would work. but people smarter than me might
I would just look to see how weights get assigned by CLIP and instead make like 50 different controls that generate weights in a deterministic but incomprehensible way. I'll never have time to do it but it could be a fun toy. Only a toy though.
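A toy version of "deterministic but incomprehensible" could look like the sketch below: hash the control positions into a seed and draw a vector from it. The vector here is just a list of floats; how it would be shaped and fed into a model in place of CLIP output is left open, and the function name and dimensions are made up.

```python
import hashlib
import random

def controls_to_vector(controls, dim=8):
    """Turn knob/slider values into a repeatable pseudo-random vector."""
    # Serialize the control values so identical settings hash identically.
    payload = ",".join(f"{value:.4f}" for value in controls).encode()
    seed = int.from_bytes(hashlib.sha256(payload).digest()[:8], "big")
    rng = random.Random(seed)  # private generator, seeded by the controls
    return [rng.gauss(0.0, 1.0) for _ in range(dim)]

print(controls_to_vector([0.1, 0.5, 0.9]))
print(controls_to_vector([0.1, 0.5, 0.9001]))  # tiny nudge, unrelated output
```

Because of the hash, nudging a slider gives a completely different vector rather than a smooth change, which fits the trial-and-error toy but not the "scroll through latent space" idea.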
I like the unclip model concept, but doesn't seem super popular for some reason
Is there a bot to generate images in the discord ?
really? I thought the most powerful CPU is that one AMD made, right?
what's jesus got going up up there?
"Thread Ripper" I think?
random things
looks like he might be getting into a bit of grab ass hijinx
nice
dude, I had a random mushroom cloud appear in one of my images yesterday
Is there a way to upload photos and have SDXL change them like you can do with Midjourney?
yeah, mushrooms happen
there are ways, but it is better to wait for sdxl controlnet to be released
very creepy
32 mb image file, lol. good lord
it got a bit deep fried in the rendering, but I do believe that's a mushroom cloud. didn't ask for such a thing
that is a weird image. Looks more like news than AI
what's weird is I now hold my ai images to a higher standard than actual images
or photographs rather
I zoom in on them and realize the detail isn't on point in the distance and that does not work for me
looks like a photograph of beirut after the ammonium nitrate explosion
man, what's weird is it was just making normal weird stuff and then that popped out of nowhere
yep. It is like the jesus pic.
that is good to finish this session
man, you have a peculiar style
yeah, just trying to do things a bit outside the norm
Anyone doing "Upscale Mixed Diff" like in @high skiff's workflow? I'm trying to prevent it from smoothing out the final result but I'm not very successful..
ooh, got the manager installed on my colab comfy. time to get the party started
this discord is not related to comfyUI or ?
cuz i cant run server backend of main.py
You could probably get help with that here still, but it's not directly related
well i downloaded stabilityAI, the windows installer that does everything automatically, and only on the last step, when i'm already in the web gui, it doesn't want to open or read the ComfyUI main.py. the script does some things and then windows asks me how to open the file. i select powershell and nothing happens: backend error. i can't generate any AI images. i'm using a 6700 xt as well though
you should ask your questions in #🤝|tech-support though
ups
I'm not sure why that would be happening, and all of my example images I have given, there's really no smoothing of detail
Perhaps it's the pixel upscaler you are using
Or, if you changed the step count/scheduler/sampler, then that would likely have negative effects
what really grinds my gears is I haven't been able to load your workflows for some reason. my install just won't load them and I don't know why
@high skiff It might be the upscaler but it doesn't look like it imo, I'm also using the same one as you: "4x_NMKD-SuperscaleV2_2k.pth"
And the smoothing is pretty minimal
yep. it was taking longer to tune the mixed upscale settings than to just use Ultimate Upscaler. So when using that workflow I don't use it
I have my own upscale workflow now so I don't use Sytan's file much anymore
AI is much more a software problem than a hardware one
AMD is good at hardware but their software isn't great
@heady vale Would you mind sharing that workflow?
not just yet. Im still working on it
gotcha
True. ROCm isn't even close to CUDA. Even if it is, it takes AMD like half a year to catch up to Nvidia, and by the time AMD does that, Nvidia already updated like 7 damn times