#✨|sdxl
i'm also training blip2 with another lora. will see if that helps
instructblip is the shit
based on vicuna
worth checking out
I originally hand captioned that dataset, someone just suggested I try clip-interrogator on it to see what it gave me
The results I got from just training on a white image, nice brightening of the scene.
you can train in comfy?
no. you can infer in comfy
ah
Just a friendly reminder, we need everyone to go and vote on the bot channels, please select what image you like best: "A" or "B", this is for helping improve and finetune what will be the gold ⭐ SDXL 1.0 model.
sometimes it's really hard to pick which is better
wow lmfao
Do we get a prize for the person that has the most votes? Edit: actually scratch that, if we did get a prize people would just vote without actually looking at the images
oh, Diodotos is one of the people in our research server
absolute gigachad
do you accept lurkers?
its not my server to link to, tho I wouldn't think so, as we are testing things that will likely fail while trying to find good results, and our whole idea is to not let that unfinished info out to the public
anyone get this error on hugging face?
{"error":"module 'diffusers' has no attribute 'StableDiffusionXLPipeline'"}%
grrrr, it's so annoying. with the upscaling at the moment it seems like it's a choice of upscale enough and get good eyes, but wash the detail out of everything else. Or keep the detail, but have bad eyes.
That looks like you don't have the newest version of diffusers that supports the XL Pipeline
I'm just using the hosted inference api here https://huggingface.co/stabilityai/stable-diffusion-xl-base-0.9?text=A+summer+day+in+new+york
is this outdated @eternal fog ?
I think so as the how to section says "from diffusers import DiffusionPipeline
pipeline = DiffusionPipeline.from_pretrained("stabilityai/stable-diffusion-xl-base-0.9")"
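(For anyone hitting the same `has no attribute 'StableDiffusionXLPipeline'` error locally: a rough sketch of a version check. The 0.18.0 cutoff is an assumption taken from the SDXL 0.9 release notes as I remember them, and `parse_version` / `supports_sdxl` are made-up helper names, not part of diffusers.)

```python
import re
from importlib.metadata import PackageNotFoundError, version

# Assumption: SDXL pipelines first shipped in diffusers 0.18.0.
MIN_SDXL_VERSION = (0, 18, 0)


def parse_version(text: str) -> tuple:
    """Turn '0.19.0.dev0' into (0, 19, 0); missing parts count as 0."""
    numbers = [int(n) for n in re.findall(r"\d+", text)[:3]]
    while len(numbers) < 3:
        numbers.append(0)
    return tuple(numbers)


def supports_sdxl(diffusers_version: str) -> bool:
    """True if this diffusers version should expose the XL pipeline."""
    return parse_version(diffusers_version) >= MIN_SDXL_VERSION


def installed_diffusers_version():
    """Return the installed diffusers version string, or None if absent."""
    try:
        return version("diffusers")
    except PackageNotFoundError:
        return None


if __name__ == "__main__":
    found = installed_diffusers_version()
    if found is None:
        print("diffusers is not installed")
    elif supports_sdxl(found):
        print(f"diffusers {found} should have the XL pipeline")
    else:
        print(f"diffusers {found} is too old, try: pip install -U diffusers")
```

This only helps on your own machine; the hosted inference widget runs whatever version the host has installed.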
exactly! You win an image of a complete A100 card for this realization 😜
I've not forgotten about you. I'm trying to fix a few issues in the workflow.
How can we get into the research team?
Oh cool, that's better
Not too sure, we were all just kinda together already
The img2img pass is really annoying, if you go too far it doesn't fix the face and hands enough. But if you don't go far enough it fucks the background.
and its between 2 steps where this change happens
What about LLAVA or minigpt4 for captioning?
IMO Llava hallucinates too much
even at lowest temps
KOSMOS-2 is out
For something like image training captions KOSMOS-2 would probably be pretty great
Minigpt4 is verbose
I see the good old learning by repeating the failure of others. You guys criticized Stability for not explaining why they made the choices they made, and now you use the same fluff speak to do the same. god I hate the immature attitude the people involved in these AI fields have about open source, it's grab and take for them. so glad that the (non-AI) software fields I'm more active in are so much more open to sharing
The information is shared with people who know how to use it. With all due respect to the community, stable diffusion users are not the most educated on the tools they use
The goal is a robust and effective tool/workflow for the masses, not the people that have the ability to make their own things as they please
And besides, I also do research documentation for things I don't properly understand for people to mess with, so please don't come at me/us like we are doing it for some form of walled garden or something. We owe nothing to this community, and dedication to make something functional for all is harder to do than just throwing out poorly documented and untested findings, if I do say so myself
this is a lot of text to say ur mad that u cant join someone's personal discord lmfao
Congrats, you made sure they'll never be educated this way, it's really a dumb argument, but as everyone says, you do you bro 🤡
its going to be fine
how's the new card @high skiff
also, as an addition, I have nearly 3000 words of documentation for my workflow specifically on how AND why it works
So if people wish to read it and understand exactly why these choices were made, they can feel perfectly fine and supported doing so
dead
not completely, but effectively
not mad, just baffled by that general attitude, it's totally contrary to what i'm used to in opensource development
yes, let me just put out information I do not understand with improper documentation and then expect people who know less than I do to fix it for me, rather than running around with my poorly made/optimized product and spreading misinformation on behalf of my name. How baffling an idea
it was nice not having a condescending self centered person around for 24h
24hr? I thought you were only muted for 1
ive never been muted
its not like dead dead, but its looking like its headed that way
I have a few more extreme measures to test first, but my hopes are not high
Chaz the raver denies allegations of improper conduct online
lmfao
A response from one of the members in the server about letting randos in lol
ComfyUI SDXL 0.9 personal workflow. Can't wait for 1.0, damn ! 😄
1.0 has been the first model in almost a year that I've been eagerly awaiting getting my hands on.
Can't believe it's almost been a year since 1.4
yeah it has been forever since 1.5 release. especially in AI time lol
This was among the 1st images I generated in Midjourney beta a year ago almost day for day...
This is what I have so far, with a custom node I've modified for aspect ratios.
The 2nd pass after an upscale isn't needed for all images. But it will fix faces if needed, with the downside that it tends to smooth out other features. I don't believe that's something that can be fixed though and is intrinsic to the model. It affects certain images more than others.
wild to see how far we've come
@azure oxide do you mind if I DM you?
That is so insane, I remember using image generators before they were even diffusion based, back when Wombo dream was still running its clip master or whatever
*masher
@visual glade did you know there's a major determinism difference between images generated under --gpu-only and normal vram?
I was worried it was a code regression but it's gpu-only or not
Reproduced back and forth, bigger than gpu non-deterministic sampler differences
Yeah I probably have some VQGAN+CLIP images from 2 years ago somewhere lol, back in the days it was close to dreams and imagination was doing the rest 😄
Basically lol
Although I will say, it was impressive how much it could capture the essence of certain things, while being extremely abstract
which samplers are you using?
nah go for it
how big of a difference is it?
I need to add you as a friend, that's why I wanted to ask
dpmpp_sde_gpu, which varies from inference to inference but not as dramatically
will give an image compare, one moment
oh on that one it's normal
When trying to send a friend request, it says that you're not actively accepting friend requests from other people, so you have to send me one lol
but specifically gpu-only has a different determinism for that one?
Trying to DM you has more steps than my entire workflow lol
oh strange
french revolution on VQGAN+CLIP around june 2021 I think, generated on a Google Colab in several minutes lol, 400x296
I think it's a privacy thing
same workflow / seeds etc. Only varies slightly between inferences. gpu-only ALWAYS gives a nose, normal VRAM always gives the lack of a nose, for example
different arm placements
That is extremely strange, that's a very big difference
you get tiny differences normally but the gpu-only flag makes it very different, euler_a also changes between gpu-only and not
but slightly
yeah that's why I split them into _gpu and regular
is euler_a also non-deterministic?
I don't think I made that one deterministic yet
OK I thought it was because of the per inference difference, not the wider difference there
and this is now, a couple of minutes on local machine, ComfyUI + SDXL 0.9
sent
thanks!
what GPU if I might ask?
g-g.... GREEN SCREEN!?
Lol I have to switch it to normal or sum I had insider build because it was a work around since I had a i7 7700k and it wasn't compatible but I upgraded to the i9 13900k so idk
About time everyone gave Windows the boot and switched over to Linux *hides*
I can understand the sentiment, but I am just not there yet
lol it was soooooo bad djskanfjhnadsfjhnbdf the 4 cores killed me
Would if I could play games on it properly
yessssss
I just switched to a 12600k, even that is a monumental improvement
i would use linux but this is my gaming pc
what stops you from playing games properly? 😄
lol i had a 1080ti and a i7 7700k switched to 3070 last year and just now i got the i9 13900k
Anti-Cheat, lack of proper support for Force Feedback Wheels
bro lmao
I had a 7700k and a 1080, then went to a 3060ti, then this PC lmao
If you must game, dual boot is never a bad idea, though I hear Steam/proton is getting pretty good these days.
prob gonna get the 5090 or 6090 and still have my i9 13900k lol
online gaming?
we are just alternate universe versions of each other lol
Yes
Side note- it's wild how much minute boilerplate prompt stuff that we've gotten used to affects SDXL outputs compared to SD... A few term swaps you think are benign can change an image from CG to hyperrealism
lol i have no choice but to do it in parts lol dropping 1k on a gpu and then 1k on a cpu at the same exact time is hard but 1k on a gpu then 1 year later on a new cpu is easier
for sure, 100%
I hear you can drop $100 on an apu and get similar results to a $1500 gpu.
yeah, GPU's are improving much faster than CPU's at the moment in terms of capability improvements
maybe $1500 GPU from 12 years ago haha
you could pull an Apple and say your iGPU is as fast as a 4090, but really mean just in tools that can't use the 4090's compute lmao
@high skiff you may laugh, but we're talking about an APU, not a GPU 🙂
i have this motherboard https://www.amazon.com/ASUS-Gaming-Intel®-Motherboard-Thunderbolt/dp/B0BG6KQPWD the i9 13900k all the same lol and i got 2x32gb ram stick ddr5 at 6400mhz
Thanks to Gigabuster.EXE for his help! https://forum.level1techs.com/t/mi25-stable-diffusions-100-hidden-beast/194172/1
second it can leverage the 4090, the M2 GPU looks like a graphic calculator lol
I know what an APU is lol
tho that is very cool to see
Good. So when I said a $100 APU, I didn't mean a $1500 GPU from 12 years ago. Meant what I said 🙂
I was assuming you meant APU as its more common consumer term definition, my bad
Finally got koyka or whatever it's called to work lol, one green screen and reinstalling python is all it took
idfk no more
Anyone here ever tried running sd (training) on a tpu?
ah yeah, part of the reason that i never update nowadays
@hearty ginkgo This line: Tried to allocate 20.00 MiB (GPU 0; 8.00 GiB total capacity; 7.16 GiB already allocated; 0 bytes free; 7.30 GiB reserved in total by PyTorch)
Your VRAM is exhausted, in other words.
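(If you want to read those numbers without squinting, here is a small sketch that pulls them out of the message. It only assumes the wording quoted above; `parse_oom` is a made-up helper, not a PyTorch function, and other PyTorch versions may phrase the error differently.)

```python
import re

UNITS = {"bytes": 1, "KiB": 1024, "MiB": 1024 ** 2, "GiB": 1024 ** 3}

# The message quoted in the chat, joined onto one line.
MESSAGE = (
    "Tried to allocate 20.00 MiB (GPU 0; 8.00 GiB total capacity; "
    "7.16 GiB already allocated; 0 bytes free; "
    "7.30 GiB reserved in total by PyTorch)"
)

# One pattern per field; each captures (amount, unit).
PATTERNS = {
    "requested": r"Tried to allocate ([\d.]+) (\w+)",
    "total": r"([\d.]+) (\w+) total capacity",
    "allocated": r"([\d.]+) (\w+) already\s+allocated",
    "free": r"([\d.]+) (\w+) free",
    "reserved": r"([\d.]+) (\w+) reserved",
}


def parse_oom(message: str) -> dict:
    """Return each field found in the message, converted to bytes."""
    result = {}
    for name, pattern in PATTERNS.items():
        match = re.search(pattern, message)
        if match:
            amount, unit = match.groups()
            result[name] = float(amount) * UNITS[unit]
    return result


if __name__ == "__main__":
    info = parse_oom(MESSAGE)
    for name, value in info.items():
        print(f"{name:>10}: {value / UNITS['GiB']:.2f} GiB")
    if info["requested"] > info["free"]:
        print("request is larger than what is free, hence the error")
```

The part that matters is the last comparison: the 20 MiB request is tiny, but with 0 bytes free on an 8 GiB card even that cannot be served.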
like always
@desert copper I just watched the video you sent, and thats really cool, tho nowhere near a $1500 GPU
3it/s at 768x768 is far from the near 10 I get on my 3090 when its working 😅
glad to see I am not the only one that edits messages multiple times and still has typos lol
for SDXL, I don't think so
Oh rip
I know its been more optimized since I last tested it, but it was using 17GB VRAM for me at BS1 before
tho I have a friend that i think said its working on a 3080 now
It's near enough when you don't want/have $1500 to drop on a GPU, but sure, it's AMD, and we're still not there with AMD drivers quite yet from what I understand. But price/performance ratio is about as good as it gets.
And it's also, what, 5 years old? So it does pretty well, considering.
oh for sure, I totally agree
its a dope option for those with not much money, but a lot of skills
@desert copper oh my god the background of their website is satisfying
i figured out linux and if i can do it, there's hope for anyone
Protip: don't be afraid of the command line on Linux. 🙂
I am getting this weird points/pixels in all of my images while using sdxl 0.9 with automatic1111. does anyone know why this happens?
If you're into AI and still afraid of the command line it's going to be rough for you
Thats the SDXL 0.9 watermark, its in all images (unless you use comfy UI, cause Comfy learned how to turn it off)
can consider renting gpu or just using cloud -- especially if you only plan on training a little bit. or trying it out. cards will keep getting better. price-- hard to say since there is massive demand for the AI-capable hardware
I saw a site with outrageously good prices
how do i enable low vram for kohya
it was like $0.40 an hour for a 4090 IIRC
alright, i just tested out sdxl0.9, it's not as good as I thought it would be
with like 12 cores and 128GB RAM
I didn't know that one either, that's a really rough watermark
you have to know what you are doing
yeah, it is
oh, i know, trust me
it can show up as slight pixel discolorations, or in diffusers it shows up as a bunch of red pixels
I wasted so much time trying to fix it 😄
speaking of watermarks, I hope we've got rid of all of those images with watermarks from the model.
Probably best to avoid anything with a watermark, not least those stock image libraries that are now suing Stability 🙂
Let them keep their walled garden. We don't need anything from it 🙂
do you know which commit he did it in? Just curious what that looked like
no clue, from what I know, it was stripped out from the start
bro, the new 4090 version of my 3090 is HIDEOUS LMFAO
my nice clean and sleek 3090
I don't think I'm doing this correctly, it doesn't add up that a 1.5 model outdid SDXL0.9
and the new 4090 version of it lmao
let me have a peep
AssertionError: network for Text Encoder cannot be trained with caching Text Encoder outputs / Text
tbh it's kind of hard to top something like that, but...
hmm...
oh, you need to use a special command
ok thanks
just a sec
you can't train text encoders yet for LoRA's
(we are working on that in the research server)
at least not well
rip i turn that off and it jumps up to 20mib usage again ejhfbsdjhbfsdhjbsd
Idk man, I was almost confident that SDXL could be better than 1.5 finetuned models in this kind of thing =\
@hearty ginkgo --network_train_unet_only
where do i use that?
you need to add that to the optimization args in kohya
it should decrease VRAM a bit as well, and make training times way faster
let me double check, just a sec
k
yes, there
for me, it was about 2x faster with that (as it didn't waste time on the TE)
SDXL is great if you use it correctly
for sure
oh man, the stylization on that is fire
here's something a little fancier
it has to be used as part of a pipeline with the refiner
egh you got any other recommendations to try to lower vram?
I haven't done anything with that at the moment
you can try cache latents, cache latents to disk, and enable gradient checkpointing
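(Putting the VRAM suggestions from this thread in one place: a hedged sketch that builds the extra arguments as a list. The flag names are the kohya sd-scripts ones as I understand them; `low_vram_args` is a made-up helper, and the check against text encoder caching just mirrors the AssertionError quoted earlier. Check `--help` on your own install before trusting any of it.)

```python
def low_vram_args(
    unet_only: bool = True,
    cache_latents_to_disk: bool = True,
    gradient_checkpointing: bool = True,
    cache_text_encoder_outputs: bool = False,
) -> list:
    """Build extra command-line arguments aimed at lowering VRAM use."""
    if cache_text_encoder_outputs and not unet_only:
        # Mirrors the trainer's own complaint: the text encoder cannot be
        # trained while its outputs are being cached.
        raise ValueError(
            "caching text encoder outputs requires --network_train_unet_only"
        )

    args = ["--cache_latents"]
    if cache_latents_to_disk:
        args.append("--cache_latents_to_disk")
    if gradient_checkpointing:
        args.append("--gradient_checkpointing")
    if unet_only:
        args.append("--network_train_unet_only")
    if cache_text_encoder_outputs:
        args.append("--cache_text_encoder_outputs")
    return args


if __name__ == "__main__":
    # Paste the printed flags into the additional/optimization args field.
    print(" ".join(low_vram_args()))
```

In the GUI most of these are checkboxes, so the list is mainly useful if you launch the training script by hand.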
and na'vi do not look stretched in that format 😄
yeah haha, they are very lanky as is lol
so this LoRA quality is with no text encoder, just pure unet
it looks even better with my refiner I made which is the same LoRA trained in 1.5 as a very small pass
are you using the refiner?
yeah
SDXL vs 1.5 fix pass
1.5 looks great! the styles are not 100% comparable. I guess the overall style is interpreted differently
what kind of lora is best for SDXL training? just normal LoRa or ...
I have only tried LoRA
but some of my colleagues have suggested LoHa's
Try that workflow or this one:
https://github.com/SytanSD/Sytan-SDXL-ComfyUI
yeah, this is kinda sad, because SDXL has about x5 more parameters and it didn't beat a 1.5 model for the most part =\
for those that want to take the beach with them
well is it base sd 1.5? because a fine tuned model would definitely react differently
nope, it's a 1.5 finetuned model
but still
let's see what an even more fine-tuned SDXL model can do 😉 I mean for a base model it is really great
prompt: multiple mason jars on ornate table containing the cosmos, (containing dioramas of wonders of the world), ultrarealistic, ultradetailed https://cdn.discordapp.com/attachments/1101178530865352815/1131358264270000258/multiple_mason_jars_on_ornate_table_containing_the_cosmos_containing_dioramas_of_wonders_of_the_world_ultrarealistic_ultradetailed_steps-51_style-Photographic_seed-0ts-1689807218_idx-0.png
If you're testing things you know work well on 1.5 and then redo it on SDXL then of course 1.5 is going to look good.
I think there's something wrong with what you are using because it's not supposed to look like that
I used the workflow @high skiff made
your images have a1111 metadata though?
wha- I'm currently on ComfyUI, I think there is a save format or smth
Probably using that horrible extension that puts ComfyUI into Auto1111
the one he linked IS mine
yeah, that's the one I'm using rn
it has yet to be updated, as I am fighting with my GPU
can I see how you are prompting it in that case?
yeah, hold up
SDXL - considering that it is 0.9, not finetuned, and we're still wrapping our heads around prompting - I am happy with this model
i give up sytan lol
I think you can only go so far with 'simple concept, photographic'. Probably using an established realism upscaler would benefit more..
If you want to compare them ask for actual scenes and not simple concepts.
30 year old Julie with blonde hair and roots showing is emotional and breaking up with her raver boyfriend 31 year old chaz with short brown hair at the Leeds City Centre Rave Memorial in a scene from the BBC drama A Raver's Last Chance, Chaz pov looking at Julie, hella emotional, dramatic scene
optimus break dancing
do those with the bot?
in automatic1111
noice
my colab crashes a lot using SDXL with comfy's colab, is there any startup argument that might help? i can do a high-RAM runtime but it doesnt seem to fix anything on its own
Idk man, this whole ComfyUI shtick isn't working for me very well. I think I'll wait for the full release and A1111 support
I don't like how it wants 2 prompts
You don't need 2 prompts hypothetically
how is auto1111's support coming along (For SDXL)?
Using it right now
31 year old raver with short brown hair Chaz finds himself in a neo-sci-fi-futuristic neon Leeds in the year 2099 in a scene from the BBC drama movie Cyberpunk'd
yeah seems very active
extension?
Last two days was using the extension, yes (running locally)
But from today tried out A1111 1.5
If you are interested
Working well
Is that using the 1.5 SD Model or is Automatic1111 at v1.5 itself?
Automatic1111 at v 1.5
using sdxl 0.9
nice... what improvements have you seen. did they make any changes to the UI?
For now the UI is the same. Pretty much the only improvement I have paid attention to is that I can load and use the sdxl model
Some samplers are not supported when using sdxl
With refiner?
Thing about the refiner is that I don't see it being used in a comfortable way yet. The only way right now would be using it through img2img
But base model itself giving good results
Oh, then I'll wait some more time. The base on its own isn't as good as some 1.5 finetuned models and it's way slower =\
I guess after the full release this wouldn't be the case
I hope, at least
I believe so
Also, when is the full release?
Maybe together with sdxl 1.0 😄 your guess is as good as mine
SDXL should be almost as fast when sampling 1024x1024 as SD1.x
It's about 30%, even on ComfyUI.
I usually get about ~16-20it/s. On SDXL with all optimizations I get about 5it/s
Idk, I think it would run better after A1111 works with it in the full release. For now it's all too experimental to get results as good as 1.5 models stretched to their limits.
It does follow prompts way better than 1.5, but the detail isn't quite there yet
And it's slower. So I think I'll stick with 1.5 until the full release
those are terrific!
is this at the same resolution for both?
comfyui is pretty close to the peak performance you can get with a pure pytorch implementation, the way to go faster is stuff like AIT or TensorRT but they are not that easy to implement in a way that is transparent to users
Looks like some weird advert for strawberry shampoo or something.
the refiner doesn't work as designed in automatic. the way sdxl was designed is you do enough steps in the first model to get an image built, then pass the latents to the refiner model to finish the denoising. you can use it other ways, but that's how it was designed.
Automatic's pipeline is limited to one model one prompt, so you're forced to do things the other way.
do the whole image from the base, let the vae cook it, pass it to img2img, load the refiner, set the denoise to 0.4 or 0.5 and run it again. higher tends to cook too much. it's really sloppy this way. it should be done in latent space
yeah refiner for img2img gives you very diminished results compared to leftover noise in latent space
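(To make the two approaches concrete, a rough step-count sketch. The 0.8 handoff fraction is only an illustrative assumption, not an official number, and the img2img estimate assumes the UI runs roughly steps × denoise steps, which is the usual default but can be changed in settings. Both function names are made up.)

```python
def split_steps(total_steps: int, handoff: float) -> tuple:
    """Latent handoff: the base model runs the first `handoff` fraction of
    the schedule, then the refiner finishes the remaining steps on the
    same, still-noisy latents."""
    if not 0.0 < handoff < 1.0:
        raise ValueError("handoff must be strictly between 0 and 1")
    base_steps = round(total_steps * handoff)
    return base_steps, total_steps - base_steps


def img2img_steps(total_steps: int, denoise: float) -> int:
    """img2img workaround: the finished image is re-noised and only about
    `denoise` of the schedule is run again by the refiner."""
    if not 0.0 <= denoise <= 1.0:
        raise ValueError("denoise must be between 0 and 1")
    return int(total_steps * denoise)


if __name__ == "__main__":
    base, refiner = split_steps(30, 0.8)  # 0.8 is a hypothetical handoff
    print(f"latent handoff: base {base} steps, refiner {refiner} steps")
    extra = img2img_steps(30, 0.4)
    print(f"img2img route: base 30 steps, then refiner about {extra} more")
```

The difference shows up in the totals: the handoff keeps one schedule shared between two models, while the img2img route runs the full base schedule, decodes through the VAE, and then adds refiner steps on top.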
inb4 dumbfucks nagging at me once again for saying this
will A1111 not make it like, load both the base and refiner? that's not how it's supposed to be
I think there should be another dropdown next to the checkpoint dropdown, but for the refiner
I wonder will it be possible to run this on auto next week by just doing a normal git pull?
yeah the a1111 codebase might have difficulty handling multi model pipelines
people will be so mad if they will have to click 3 buttons for using the refiner
no way A1111 will look at the refiner and just go ''nah, fuck that''
i just meant in general
im ready to answer any questions
git switch release_candidate
git pull
maybe update venv
there's at least 2 UIs that can handle the full pipeline and there's probably going to be more next week
if your favorite ui doesn't handle it you can just switch
comfy i know. What's the other?
Idk, he has no reason to. some 1.5 finetuned models are somewhat better than SDXL0.9, I already tested that before
the vlad fork I think has it properly implemented or if it's not it's going to be because he's using diffusers
Not sure exactly what you're referring to but please stop making bad GUI's a badge of honor.
There is no shame in wanting a good user experience.
Idk about Vlad's UI. I tested it in comparison to A1111 with all the same parameters, optimizations, sampler, etc.. the image I made with Vlad's UI was not as good as A1111
i dont think that was what he was implying. think it was more "auto better fix it" instead of welcoming the bad ui
a1111's seems to prompt to the openclip layer and vlad's seems to prompt towards the clip layer. that is my assumption from testing
good ui's on comfy prompt to both
cheers
i'll ask again when 1.0 is out lol
just playing with the bot until then
But sounds like it is still too early to switch to Auto1111 then if they still got issues.
that branch will be live by 1.0 release. wont need to switch
seems hella powerful to me, i mean so did the 0.9 but all the channels are solid imo
what other UI is there that does that except Vlad's?
i should say good layouts for comfyui
did emad say theyre releasing their own ui? maybe not there were so many things
are dynamic prompts and generate forever available for comfy ui?
They did open source the thing DreamStudio is using.
generate forever is the unlabeled checkbox in the extra options
But currently SAI has no good UI of their own.
i think so . i'm not a comfy guru . just started
already is a stable ui released https://github.com/Stability-AI/StableStudio
If you have the tools inspect their "repasting" work. Also clean it with 99% isopropyl if you have it and let it dry.
is it new? or does it suck lol. never heard anyone talk about it
it's the FOSS version of dreamstudio. i think it's good. no updates since release though
It has no dedicated backend as far as I'm aware other than SAI's API
it can't run anything. it's just a frontend. but a good frontend though
I have been peenged
I've actually taken a look at it, it's fantastic
When I try to train a model I get this error: src/tcmalloc.cc:283] Attempt to free invalid pointer 0x594800d99740. Anyone knows why ?
I inspected all the thermal pads and everything
They make excellent contact with the active back plate, great surface area, great finish
also isopropyl the pci fins and even the slot on the mobo if its used
The GPU core and the VRAM temps are perfectly acceptable
I did that first thing
It's likely everything that you're going to list I've already done, but I'm 100% open to suggestions
is the PC still running fine when the screen blacks out or does music and stuff stop playing
if it was an old gpu you didn't care about I would suggest the oven trick as a last resort
Wtf is the oven trick
did you pray to the elder gods?
I can't use sytan's workflow without it crashing. I tried with --use-split-cross-attention and got two images instead of one before the crash. any advice?
oh you are on paid colab, you should use: --highvram
ok will try ty
the oven trick is removing the heatsink and anything that could melt from the GPU and putting it in the oven at 380F for 8-10 minutes
i thought that was thoroughly debunked?
this isn't working. I just finished setting this up
I have fixed a total of 3 GPUs this way
used to do that for poorly manufactured xbox 360's
it only fixes hardware failures where solder isn't seated right
huh, nice. i finished two off. I know it was always controversial, but i still have a distant memory of "here's why it doesn't work blahblah"
it's something you should only try if the GPU is broken and not under warranty
because at that point it doesn't matter anyways if it doesn't work
When I try to train a model I get this error: src/tcmalloc.cc:283] Attempt to free invalid pointer 0x594800d99740. Anyone knows why ?
and if it works you feel like a mad scientist
Tey chatgpt
Try*
good idea
31 year old short brown hair raver Chaz sitting in the cold morning light in his Leeds apartment after the Leeds City Centre Rave Disaster, dramatic scene, hella emotional Chaz, powerful moment, from the BBC drama A Raver's Last Chance 1989
31 year old short brown hair raver Chaz from Leeds UK is guest starring on the new Game of Thrones series, dramatic moment
pretty accurate
anyone know how to use negatives on the bot
I'm testing out some LoRA training and I'm happy so far. (Top is with base only, bottom is with LoRA)
With the goal of being able to prompt X as Y a lot more consistently. So here Seth Rogen as Pikachu.
Neither in the dataset I used for training.
though i never know what to put for negatives lol
I tried using chat gpt, but it doesn't give me any specific answer and since I don't know anything about programming I can't fix my issue
You don't really need negatives other than if you want to remove something from your image.
Thats the only thing my documentation for SDXL doesn't include, because I self admit not really knowing what I am doing with the negatives lol
SDXL and negatives are weird
lol that error probably needs lots more information before being able to figure it out. like all the previous errors you have had up to this point that you might have ignored, the system, how you are training, what libraries you are using, etc.
okay I see
31 year old short brown hair raver Chaz from Leeds UK is the newest Avenger in a scene from the Marvel movie End Game Part 2: The Return
I put my motherboard in the oven regularly because my room was so damp idk what was wrong with it but putting it in the oven or heating it up with a hair dryer worked
based off the error it is from code executing in tcmalloc.cc line 283, tcmalloc is Google's custom implementation of the (C) malloc
did you do any manual compilation?
are you using up to date libraries during the compilation or runtime? sometimes bugs are fixed down the road
knowing the full stack you are using would help, but not sure
I followed a YT video and I used DreamBooth
And TheLastBen
From what I understand
I'm new to this thing so I don't understand everything really well yet
okay looks like fast-stable-diffusion does have a dependency on libtcmalloc
Can u ss the error u get
are you using the most up to date fast-stable-diffusion?
can you update libtcmalloc? not sure if that will help or hurt things
not sure if it's really that library vs some code base that is just fucking up lol.
src/tcmalloc.cc:283] Attempt to free invalid pointer 0x594800d99740
Nope
are you using Colab?
But I get the same error
Yeah
you might want to comment there and see if there is a resolution
While you're waiting you might just want to try another training program to see if that works for you. Given how long since update I don't think it would support SDXL anyway.
Since I'm new, I don't know how to properly train a model or a LoRA. That's why I've been following a yt tutorial. So right now I don't have another training program
There should be plenty of guides to using Kohya-ss gui and it's not too tricky to setup.
If it is of any help, this is the video I’ve been following: https://youtu.be/c6r25rT8DV0
what model are you fine-tuning? is it even SDXL?
i couldnt get the kohya gui to work in colab i use https://github.com/Linaqruf/kohya-trainer he has new xl notebooks if you look in the files
hey guys, what do i need to try sdxl on 4gb vram?
till now i did miracles with the gc
i run stable diffusion with this set COMMANDLINE_ARGS= --always-batch-cond-uncond --opt-split-attention --xformers --medvram
31 year old short brown hair raver Chaz sitting at the pub having a pint reflecting on the Leeds City Centre Rave Disaster, hella emotional scene, dramatic, focused on Chaz, from the BBC drama A Raver's Last Chance
pretty f'n solid for just a bot with a dummy prompting imho. hands not perfect but close enough lol
(not cherrypicked 1st gen) combining sdxl and illuminati diffusion to achieve more detailed results
don't make jokes about his deformed hand, he is pretty sensitive about the subject
he has an amputated finger but that would be from a farming accident
discord compresses the image a lot
I don't think Discord is doing any compression here.
idk why but the image went from 768x1152 to 436x655
You need to click on "open in browser" to get full version.
open in browser is the original
Thanks for trying to help me
it might be just worth trying another tutorial/other software
Yeah I’ll try that tomorrow
Today was the first time I tried to train a model
kool
I’ll do more research later
MOOOOOOOREEEEE
I have a question for the community
in regards to how you would all prompt a specific thing
sun rays going through foliage on a subject, I can't think of a proper way to tag that
Linguistic: An enchanting photograph capturing sun rays filtering through lush foliage onto a subject.
Supporting: nature, ethereal, magical, backlighting, dappled light, atmospheric, dreamy, natural beauty
Negative: Blurry details, distracting artifacts, overexposed highlights, lack of clarity, lack of focus on subject, uninteresting composition, excessive noise, flat colors
sun rays going through foliage
dappled lighting!
thank you!
I knew it had a term, i just forgot it
god rays
dappled lighting was exactly what I was looking for
cinematic shot of sunrays hitting Yann Lecun through foliage, god-rays
I am actually using it for 1.5, so I am trying to find terms that work well there
dappled light is the actual correct term for it, though 1.5 does not seem to know what it is
You should use clip-interrogator if you haven't already.
try
Speckled light
Streaming light
Radiant beams
Golden glow
Sunbeams
Illuminated foliage
Mottled light
Light shafts
Sun-drenched ambiance
Luminous patterns
alright, I think this specific model just does not understand this as a concept
oh well
I am sure I could train a LoRA on it at some point if need be
I have no idea why I got a painting vibe with that lol
thats much better
Training a LoRA is often the fastest way.
I honestly have no Idea how to train LoRAs or embeddings. I always seem to find a way to mess it up.
yes the training data is filtered. you'll have to wait for tits and ass until the fine tunes.
the vanilla model might be able to do some stuff
No, SDXL is not censored.
Popachu. (Base only as running the refiner is too heavy with LoRA)
i think the bot rooms here gave me nudity (not intentionally)
but even with filtered training data it should be able to get a basic sense of the anatomy
Great so I’ll have to wait two weeks until sdxl is fully unleashed
actually let me try local NSFW prompts. see what it knows
Because it’s currently on a leash
2 weeks? I wouldn't assume anything
The internet is fast
based off of what happened, there is no release date. just hopes the 'event' is next week
1 week I give ppl
Oh so 1 week delay is not official?
event is something like 'super stage' . I have no idea what that even means.
just hints wink winks of the release
just don't get your hopes high. assume nothing
but likely it'll be within the next few weeks (the release)
Well whatever, I think in 1 week after official release that’s when sdxl will be unleashed
but you are correct, just wait, the model is great, it'll be fine-tunable, and you'll get what you want
Because it’s currently on a leash
It is not on a leash, apart from licencing of course, but nothing prevents you from finetuning it for yourself already.
It’s gonna be SDXL unleashed after ppl take off its leash
the person is talking about where 1.0 candidate inferencing is available.
and if those places are censoring. no big deal. we are on the same page
Yup, basically this.
the training data does have filtering too but Emad has hinted at 'no worries' by saying stuff like 'it learns fast'. it should understand anatomy well enough. I agree, let the community go wild soon enough
Yes, the bot channels have rules, but the models are not censored.
The training data has some quality control and should really have been filtered more to avoid too many duplicates and watermarks. But that's not censorship.
are you suggesting the training data in no way has filtered out NSFW? lol
i just made boobs 😊
Huge if true
Maybe SDXL is not on a leash after all
it has. just not enough to where the model gets stunted into infantilism forever. It is a strong model.
Sure it has filtered some stuff, but that's quality control. You do not want the things it has filtered away in the dataset.
@placid coral easy answer is, if NSFW cannot be done [even with fine-tuning], it is dead on arrival. and I feel like it won't be dead on arrival. rejoice
Okay so it still has a filtering leash on… seems like we need a fat cat to help unleash the beast, so to speak
I mean to include fine-tuning
base model might not be able to do stuff, but it should be very strong foundationally
to where you can make it do sooo much, more than it already can do, which is already very strong
Yup, what this guy said.
where it can do styles well, just via prompting
just patience. the tooling is being worked on simultaneously
Training tools are already ready so go ahead and train it if you want, just remember to follow the license.
though it would be training 0.9
Just note that you will need to retrain on 1.0 when that comes out if you want to release stuff, but you can test the waters now.
which I Think is fine honestly, especially if the wait is forever, but if it is just a week, then waiting for 1.0 is good
if I had more time off I'd def practice training a lora on 0.9, since people have been doing that. Also curious about dreambooth, but not sure what the requirements would be for that
I just want people to test it for themselves so you can stop spreading the idea that SDXL is censored.
Please no more meaningless Dreambooth models that should be a LoRA model instead
arguments revolving on semantics only go so far on the internet anyways, especially in chatrooms
genitals are not in the training data by the looks
I've been running it locally since release
tits and ass it'll give you all day, but genitals are replaced by blank skin or clothing you didn't ask for
I've never done a lora before so it was probably bad, but I trained a lora on 10 images of 1.5 outputs and while it artifacted and sucked (my fault probably) it DID add back genitals properly
so there shouldn't be any concerns
once people who know more than me start working on it, it'll give you whatever you want
though you might have to do a refiner lora too, since the refiner 're-censored' everything
yeah in that case, the filtering is defended by 'safety'. like I Said, semantics. What matters is what matters -- practically speaking. The model will do what people want it to re NSFW [with fine tuning]. People shouldn't worry about that IMO.
ohhhhh re the Refiner. very interesting
I didn't make a refiner lora (idk if you can? I assume so?) but I assume that would fix it
refiner doesn't change a pic much if you set it up right
kool
refiner is very good at adding small details
but when some of those small details had to be lora'd in, it'll remove them too
all datasets are culled. even what runway ml created. omg censorship. its so cherrypicked. the self righteousness is .. just.. ugh. we know why you want the boobs. it's not any glorified freedom ideal
just lay off on the censorship thoughtspeak. it's dum
?
we aren't complaining
we are just talking about whether it's hard censored like 2.1 or just not in the training data
i'm over it
2.1 was 'hard censored'? what does that mean
at no point have we complained that it's censored
loras/finetunes etc couldn't get it to do nsfw properly
2.0 had a mistake in the data filtering that was reversed in 2.1, it was never intentional.
which is the reason it's such a big topic around sdxl
big oopsidoodle there
it was filtering out boats
the embeddings i tested in comfy worked
in the refiner too
last i tried to load them in auto they failed though
31 year old short brown hair raver Chaz from Leeds UK is going mental rocking out on the stage with a glowing electric guitar at Glastonbury 2008, absolutely epic solo, mental, good vibes
here we go again
hey how often is the huggingface diffusers library used anymore?
does anyone know why i get the grid lines with the ultimate upscaler
35mm color photograph of 81 year old short grey hair male furious raver Chaz from Leeds UK going mental in the supermarket trying to buy bagels, mental, good vibes, still from 1972 british comedy film
Yes, the ultimate upscaler splits your image up into several chunks and upscales them separately, and you see the lines between the chunks.
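A rough sketch of the chunking described above, in plain Python. This is not the upscaler extension's actual code; `tile_boxes`, `tile` and `overlap` are hypothetical names for illustration. The idea it shows is that neighbouring chunks can share a strip of pixels, which gives a blending step something to hide the seam with.

```python
def tile_boxes(width, height, tile=512, overlap=32):
    """Return (left, top, right, bottom) boxes that cover the whole image.

    Neighbouring boxes share `overlap` pixels. With overlap=0 the boxes
    only touch, which is where hard grid lines tend to show up.
    """
    if tile <= 0 or overlap < 0 or overlap >= tile:
        raise ValueError("need tile > 0 and 0 <= overlap < tile")
    step = tile - overlap  # distance between the starts of two tiles
    boxes = []
    top = 0
    while True:
        bottom = min(top + tile, height)
        left = 0
        while True:
            right = min(left + tile, width)
            boxes.append((left, top, right, bottom))
            if right == width:  # reached the right edge of the image
                break
            left += step
        if bottom == height:  # reached the bottom edge of the image
            break
        top += step
    return boxes


# A 1024x1024 image with 512px tiles:
print(len(tile_boxes(1024, 1024, tile=512, overlap=0)))   # 4 tiles, edges only touch
print(len(tile_boxes(1024, 1024, tile=512, overlap=32)))  # 9 tiles, 32px shared strips
```

More overlap means more tiles and more work, so it is a trade between speed and how well the seams can be hidden.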
how do i not split it into chunks?
lion invisibility cloak
you have to use mega ultra ultimate giga upscaler
oh ok
hey, i am a little out of the loop and tried searching the discord and reddit already.
What is the best way to run sdxl when you need to use a cloud service? is it supported by Google Colab?
😂
35mm color photograph 60 year old raver Chaz from Leeds UK going mental in the streets because he forgot to buy milk, mental, good vibes, still from 1972 british comedy film
I can feel that one coming
yeah with that you should be able to run the base SDXL model on free colab but the refiner might be a bit too much
are you messing with me?
I guess you run out of normal RAM on Colab?
But yeah ComfyUI works well in the cloud with the right hardware.
yes but something with that name might exist soon enough
@shy kelp I can't 
35mm color photograph 40 year old raver Chaz from Leeds UK going mental because he stole pizza from the ninja turtles, mental, good vibes, still from 1972 british comedy film
Thank you Arron
3 seconds per image in the cloud is really nice.
seeing those images appear in front of me - priceless
that's an amazing image!
finding some interesting images from the overnight run
damn - amazing stuff
there's plenty of broken images in the batch too 😄
bottom row center 
of course - that's part of the process 🙂
😄
soldier in a tropical storm 😄
now we are going terry gilliam style 😄
the glam style is inspired by kitsch quirky movies from the 70s 80s
I've found using RealESRGAN_x2 to upscale can cause similar colours to blur together into one blob
who is @shy kelp the raver from Leeds UK? I tapped into his latent space life and he's so consistent lol (probably the british comedy token)
SDXL 0.9 VladAUTO1111
very pretty
So, SDXL Base is txt2img; then the image produced here is fed to refiner - so the 2nd stage of SDXL is img2img
Thank you 🙂
the refiner still takes prompt input. it can be distinct from your main prompt too. also it can be set up so that you don't really get an image in between models: the refiner can take over and run the last steps before sending it to the decoder and outputting an image.
Any tips on trying to get it to edit NSFW content? I uploaded an image and it was immediately flagged for being 'inappropriate'. Probably because it was haha.
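A minimal sketch of the "refiner runs the last steps" setup, as plain arithmetic. `split_steps` and `base_fraction` are hypothetical names, and the 0.8 default is only an illustrative assumption, not a recommended value. How a given UI exposes the hand-off (for example start and end step inputs on an advanced sampler node) is outside this sketch.

```python
def split_steps(total_steps, base_fraction=0.8):
    """Split one sampling schedule between the base model and the refiner.

    Returns ((base_start, base_end), (refiner_start, refiner_end)) as
    half-open step ranges. The base hands over a still-noisy latent, so
    no finished image exists between the two models.
    """
    if total_steps < 1:
        raise ValueError("total_steps must be at least 1")
    if not 0.0 <= base_fraction <= 1.0:
        raise ValueError("base_fraction must be between 0 and 1")
    handoff = round(total_steps * base_fraction)
    return (0, handoff), (handoff, total_steps)


base, refiner = split_steps(30, base_fraction=0.8)
print(base)     # (0, 24)  -> base model runs steps 0..23
print(refiner)  # (24, 30) -> refiner finishes steps 24..29
```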
anyone knows how to adjust the denoise value between 0 to 1.0
I know this is an SD server but did anyone notice that ChatGPT using GPT 4 has upped the limit from 25 messages every 3 hours to a limit of 200 messages every 3 hours?..... That is awesome
I thought it was 50
code interpreter is awesome
log out and log back in see if it changes
maybe even clear your browser cache
they are also well known for rolling things out in small batches at a time
i cleared site data and all
did you apply for a higher cap?
because i legit have the gpt 4 api but i cant get a higher cap its bs
I would say give it a week max and you will have it... Nope... I had no Idea they were even going to do that... I just happened to notice when trying to get chat gpt to help me with some error code for this LLM I'm working with
bs
If you are directly using the GPT 4 API the only limit is your wallet... as long as you have a large enough amount set and deep pockets to go with it they let you use it as much as you want.
oh ik that part
i got approved for 120$ a month limit but the api playground and chat gpt itself give much different results
Right now I'm trying to setup the Oobabooga UI with the following picture and it doesn't like it. It keeps saying I'm out of RAM not VRAM so I'm doing a fresh install:
what gpu you have?
if you want to try it you can always make a nat.dev account
Whats that?
That looks interesting... Does it cost anything?
5 dollar min
Cause I just put the last 15$ I had in the gas tank today
well its all good... I'll just keep trying to wrangle this beast into working for now... all it costs me is time and electricity
i would just let you log into my account and finish the credits i have left off but i connected it through gmail rip
whats crazy is the claude ai is 100k token limit
Its all good I wouldn't do that to ya....
LLaMA is 4096 token limit
Ya claude looks awesome
Still nothing produces code like ChatGPT on GPT 4 with Code Interpreter
give me something to ask chatgpt and the new LLaMA, i can do a comparison response
hmm
ask it the woodchuck rhyme and see if either of them answer correctly
for math ask them to solve 2x+6=12
They should get x = 3
lol
but claude get it right i think
Michael scores a 95, 87, 85, 93, and a 94 on his first 5 math tests. If he wants a 90 average, what must he score on the final math test?
Possible Answers:
86
88
96
84
90
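For reference, the test-score question above can be checked with a few lines of Python. `required_final_score` is just an illustrative helper name: the six tests must total 6 × 90 = 540, and the first five already add up to 454.

```python
def required_final_score(scores, target_average):
    """Score needed on one more test to reach the target average."""
    total_tests = len(scores) + 1
    return target_average * total_tests - sum(scores)


print(required_final_score([95, 87, 85, 93, 94], 90))  # 86
```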
ill ask it that
claude got it
later all!
So I was able to get the 13B parameter model to load with no problem. I tried the 70B parameter model. I think it's just too big for my little 4090. I'll have to wait for the 34B Parameter Model to drop in a few weeks. They said they needed to work on that one for "safety"
chatgpt supports different languages best so far, LLaMA is far behind
they just dropped LLaMA 2 today... maybe its better
I barely speak english well enough so other languages are a moot point for me
This Greenscreen bs is killing me
I can't wait till I get my gen5 m.2 drive and can just do a clean boot 😅😂😅😂🤣
Based off that you will never be able to run GPT 4
It's just so big
Well if it was open sourced and someone created a 4-bit quantized version you might be able to run it on a single H100 with 188GB VRAM or maybe even 48GB A6000 who knows
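A back-of-the-envelope sketch of why quantization matters here, assuming memory is dominated by the weights alone. It ignores activations, context cache and framework overhead, so real usage is higher. `weight_memory_gb` is a hypothetical helper, and GB means 10^9 bytes.

```python
def weight_memory_gb(params_billion, bits_per_weight):
    """Approximate memory needed just to hold the weights, in GB."""
    # params_billion * 1e9 weights * (bits / 8) bytes, divided by 1e9 bytes per GB
    return params_billion * bits_per_weight / 8


for size in (13, 34, 70):
    fp16 = weight_memory_gb(size, 16)
    int4 = weight_memory_gb(size, 4)
    print(f"{size}B: {fp16:.1f} GB at 16-bit, {int4:.1f} GB at 4-bit")
# 13B: 26.0 GB at 16-bit, 6.5 GB at 4-bit
# 34B: 68.0 GB at 16-bit, 17.0 GB at 4-bit
# 70B: 140.0 GB at 16-bit, 35.0 GB at 4-bit
```

By this estimate a 70B model at 4-bit still needs about 35 GB for weights, which is more than the 24 GB on a 4090.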
I need some Ideas for Pictures to generate.... Any suggestions are welcome
A lion
This would be one hell of a hitchhiker
Is this in sdxl?
yep
What workflow are you using 😭😭😭😭
Just try to steal from this candy store
Bro your shit is so much better than mine
What's your prompts?
Install this... https://github.com/ssitu/ComfyUI_UltimateSDUpscale
Then press the clear button....
Then drag the picture of the lion into your comfyui after closing it out and reloading once you install the thing I mentioned.
When you drag the picture in it will give you exactly what I used
that looks really good to me it might just be arranged differently than mine
this is the best true black i got today
That looks really good
by any chance do you know why i get those black grid lines?
in the black area?
also what are your ultimate sd upscaler settings? just ss them to me if you can please
Somebody did but then removed it, they said you should bring your pc to Best Buy or something
been super busy all day, haven't even been able to do much for the GPU
Dam
Same lol
I have work at 9 am and I never sleep so I usually end up going to bed at around 4 am
Lmao same
Whatcha got
Generate a car carrier it's legit impossible
ok out of 16 images these 2 were the closest to some sort of messed up reality:
Lol told you it's impossible
Never got anything close on stable diffusion but an hour ago after like hours and days of trying I got one semi good on Midjourney
if only ComfyUI had some sort of image to image then it would probably work
those are the 2 best i ever got but it took prob 2 months of trying over and over on Midjourney
either way its fuck
aint no way i got a watermark using Midjourney
try doing truck carrier with cars in the back
Wtf did you use a prompt for that lol
A photo of a demon a dark demonic scary landscape
Question what's the difference from those 2 different batch counts
the one on the left will generate multiple images at once and put them in a grid. this takes longer and seems to use more RAM... the one on the right will generate images one at a time but will give you the amount of images you put in there. I always use the one on the right to generate them one at a time
I think its faster and never crashes my system
If I edit the batch count that's right under Queue Prompt, will it still show them all in the save image node?
it still saves all of them to whatever folder you had it setup to save in but it won't show you them all at the same time
try it and you'll see.
I'm in my bed lol got tired at sitting at my desk xd I have no glass on my pc case so that section of my room is hot asf but my bed is nice and cold yk
just a heads up, if you have the PC resources to do it, doing multiple images in the same generation does in fact save time
Well I live in Florida and I'm convinced fpl is turning my ac off and on
but if you do not, then turning up right side batch size
I should give it another go. I upgraded my setup since the last time I used batch size instead of batch count
I feel like he's about to say "Bueller", "Bueller" lol
@high skiff what happened to pseudo? And did ur 3090 get fixed?
Pseudo left the server, and I have not at this moment

i'm getting Forbidden Zone vibes out of this
@spring fulcrum try to recreate this
yeah, nice
just casually genning 32x 600x904 images at the same time lol
comfy UI is so efficient man
ComfyUI Is soo good I can finally show pictures of my mountain girlfriends lol
going to refine this terran style lora more with 768 resolution overnight. just as an experiment . see y'all on the other side. goodnight
@high skiff Ya doing the batch size is better.... However when I tried to do 12 batch size My system went OOM when they hit the upscaler
oh yeah, upscaling at high BS is intense
Is that a RAM issue or do you think thats VRAM or maybe page file? I have 64GB DDR5 and an RTX 4090 with an i9 not that that really comes into play but its a pretty decent setup and it still goes OOM
scorp i got a photo idea for you
whatcha got
even a single 2k image using up all my 24gb of vram when decoding
idk why you got that issue, im at 4x with my ultimate upscaler and i can do a batch of 12 on a 3070
are you upscaling to 4096*4096 too?
i think so
i think the max i tried with the upscaler was 10. let me try 12 right now. i think i did 16 before though so im not sure, ill do a 12 real quick
Your setup is on the left, mine is on the right... Maybe that makes a big difference in how it uses the resources... I haven't adjusted mine. I got my setup from someone else and just left it as is.
yes 12 works for me
the blur isn't the difference, but use linear for mode type. based off what @high skiff said to me, if it's none it doesn't do any upscaling
your denoise, mode type, and mask blur are different than mine
the blur was because i got some weird grid lines when doing all black images
sytan said denoise should be 0.075 and mode type linear
it's super sensitive, even really low numbers I was seeing eyeballs show up in dark areas
Is that because of denoise?
yes
So I should just make it 0 technically
then it's not doing anything
let me give it a try with a high batch count now... Ill set it to 24 just to see if it breaks
It's just Denoise though
that makes it do nothing
But is denoise needed?
yes
denoise is how much noise is added in order to refine new detail
0 means 0 noise, and no changes
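A small sketch of the denoise idea in Python. It is modelled loosely on how some img2img pipelines turn a strength value into a number of sampling steps; the exact mapping in any particular UI is an assumption here, and `steps_for_denoise` is a hypothetical name.

```python
def steps_for_denoise(total_steps, denoise):
    """Map a denoise value in [0, 1] to (start_step, steps_actually_run).

    denoise=0 skips every step, so the input image comes back unchanged.
    denoise=1 runs the full schedule, so the input is effectively replaced.
    """
    if not 0.0 <= denoise <= 1.0:
        raise ValueError("denoise must be between 0 and 1")
    steps_to_run = min(int(total_steps * denoise), total_steps)
    start_step = total_steps - steps_to_run
    return start_step, steps_to_run


print(steps_for_denoise(20, 0.0))    # (20, 0)  nothing happens
print(steps_for_denoise(20, 0.075))  # (19, 1)  only the last step is re-run
print(steps_for_denoise(20, 1.0))    # (0, 20)  full regeneration
```

Under this mapping a very low denoise only re-runs the tail of the schedule, which fits the small-detail refinement described above.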
so far so good with a BS of 24
Are you trying the trex eating a human?
trying but I think its no dice on the eating a human part
You have the 4090 you can prob do a 49 batch max without it crashing tbh
Rip lol
lol
Those are the best I got last night in Midjourney. in stable diffusion I got pretty much just drawings of a trex lol
I mean this dude looks like he's about to be eaten
Is this "the greatest of all time" or is it a GOAT?
Lol
totally both. Those were my GOAT GOATs
@high skiff I'm not sure if it hung up here or if it just takes a long time to see the progress bar move
honestly, you are playing with fire, so just be careful lol
with the 24 image BS? my cpu and gpu temps are like 45 degrees Celsius
how long does it take yours to process that many?
Well anyways folks I'm going to let this batch sit overnight and bake.... I'll let you guys know if I fried my GPU in the morning.... well... later this morning anyways.
Gave it a quick try with 0.9, not perfect 
1girl,hat,longhair,bag,walking,solo,blackhair,shoes,simplebackground,orangebackground,(cat:1.4),backpack,bluefootwear,profile,socks,shorts,baseballcap,longsleeves,blush,whiteheadwear,fromside,leash,hood,fullbody,bluesocks,lora:tuyafengge_20230707170048:0.9,
1girl,hat,longhair,bag,walking,solo,blackhair,shoes,simplebackground,orangebackground,(cat:1.4),backpack,bluefootwear,profile,socks,shorts,baseballcap,longsleeves,blush,whiteheadwear,fromside,leash,hood,fullbody,bluesocks,
Hey there, you can generate images in #1100170312106127410 for example 
Another quick and dirty try
hi anyone knows how to do img2img using sdxl? It seems can only refine a picture, but I want to change the whole style of original one
More denoise
Is your GPU working now?
Nope
Although I haven't really been able to work on it at all today, I've been very busy with other things
has anyone tried placing a real product and generate the surrounding scene around it with SDXL masking REST API function??
I already submitted a goods and services claim with PayPal, they haven't covered
*they have me covered
i get quite amazing results, but sometimes the masking not doing the trick, and it generates another bottle behind my original product. can anyone suggest some tricks to make the original product blend with the generated environment?
i know it's working to generate okish results with SD 1.5 and 2.1 but im interested in this new engine to do it as the results are better.
im building a product photography application, do you guys find it interesting ?
the text is ai generated?
no, that bottle is the original bottle of bvlgari man in black eau de parfum
i got some generations with text in it, not the best, so i must have the original product integrated in the scene.
this one is fully generated by sdxl. text included.
yet that is not the orignal bottle of the brand..,
Whats your setup for generating with masks? Im intrigued
what do you mean the setup?
im sending the init_image, the mask source and the mask_image. for the mask source im using black and white
So you were sold a lemon! Sorry to hear!
https://civitai.com/models/111435/latentbyratio-comfyui-jnode-sdxl-sd15-sd2x It contains an img2img and upscale workflow and two custom nodes. For the load upscale model, you could replace it with the original one.
To support my work, you could buy me a coffee https://www.buymeacoffee.com/JasonAICreator This node is aims to help with different models on differ...
Thanks man, I will try it!
I mean can you share your comfyUI setup for example? 
I never used masks so I kinda wanna try it
i don't have this made in comfyUI, don't know how to do it there... i coded everything on a backend.
if anyone could share a workflow with sdxl masking in comfyui id be glad to give it a try there maybe i can spot some parameters to help me fix this masking issue.
I already reported this to SD team that a bug is there with the masking in sdxl.
they noted, and hopefully we can have a fix for it
another thing that's interesting: even if my mask is fully black on the subject, and i only want SDXL to modify my white area, sometimes i get strange modifications on the product subject. with a mask applied fully black, no spots left, it just doesn't care about it.
it does not fully change my original image portion, but it tends to do freaky stuff on the text.
im testing with Alpha_channel mode see if that improves anything.
if any smart people here want to join this conversation and give me some good ideas to achieve the perfect background scene change without adding extra artifacts to the original subject, id highly appreciate it.
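One common trick for the problem described above is to paste the original subject pixels back over the generated result, using the same black and white mask. A minimal sketch under stated assumptions: images are plain nested lists of pixels, the output has the same size as the input, and the convention is the one mentioned earlier (black = keep the product, white = let the model change it). `composite_with_mask` and `threshold` are hypothetical names, not part of any SDXL API.

```python
def composite_with_mask(original, generated, mask, threshold=128):
    """Keep original pixels where the mask is black, generated ones where it is white.

    original, generated: rows of pixels (any value, e.g. RGB tuples)
    mask: rows of grey values 0..255, same size as the images
    """
    if not (len(original) == len(generated) == len(mask)):
        raise ValueError("images and mask must have the same height")
    result = []
    for orig_row, gen_row, mask_row in zip(original, generated, mask):
        if not (len(orig_row) == len(gen_row) == len(mask_row)):
            raise ValueError("images and mask must have the same width")
        result.append([
            gen if m >= threshold else orig  # white -> generated, black -> original
            for orig, gen, m in zip(orig_row, gen_row, mask_row)
        ])
    return result


original = [[(10, 10, 10), (20, 20, 20)]]
generated = [[(99, 99, 99), (77, 77, 77)]]
mask = [[0, 255]]  # keep the left pixel, accept the right one
print(composite_with_mask(original, generated, mask))
# [[(10, 10, 10), (77, 77, 77)]]
```

A hard paste like this guarantees the product text is untouched but can leave a visible edge, so a slightly feathered mask may blend better.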
anyone knows if in ComfyUI there's a custom node existing that acts like a branch killswitch ? Like I have multiple branches on the output of a node, for example different upscale models that are followed with different flows and I want to test a single one, I'd like to mute the other ones. I'm trying to create it myself, I have a node with a boolean and upscale model inputs, and upscale model output. If bool=1 it continues, but I can't stop the branch if I put it at zero as it seems it absolutely needs to continue.
yup can Mute Nodes with CTRL+M but I find that doesn't stand out.
My method is to use what I call "jumper" (cos they remind me of old school dial up modems)
Ie Reroute nodes that can be used to break the flow
and with the IMG2IMG branch "jumpered" on
you sometimes get errors with jumpers... but I tend to do the same
Ok so you just cut the link between 2 nodes ? There's not really need for a special node for that I guess ? If you click on the disk in the middle of a link between 2 nodes you can delete it
(it's probably cleaner visually using your method though)
yup , just delete the link if I dont want that branch to run
usually Ctrl m is enough
Yes I didn't know this shortcut, time for my crap dev talents to stop 😅
I just dont find that stands out as clearly personally but yes it does work
Another example of where I use it is to switch between "Standard" and "Enhanced" Prompting
I was getting so many Xformers errors with Vlad AUTO1111 - I now have Cumfy - it is so easy to setup 🙂
Where to put sdxl Diffusers in CumfyUI?
Can you share the configuration of your 'prompt style' selectors? I'm adding a function like that to my bot, and it'd speed things up a lot.
My first CumfyUI Output (not yet sdxl)
But this is the SDXL Channel 
Yes I agree - where do I place the SDXL Diffusers inside CumfyUI?
also completely random question but how did you get your nodes to connect with straight lines? xD
use the regular SDXL model, not the Diffusers version
Safetensors? OK
Its in the settings
Link Render Mode
whats the best comfy ui config rn for sdxl?
Im using this with Searge's SDXL nodes
thx
Not saying its the best, but I had the most success with it
What video card is recommended for SDXL? I want to generate 1920x1080 images with it. I can cope with slow generation speed but I think that my current card RTX 2080 8GB may throw CUDA OOM errors
More memory is better. A 3060 12GB would be the cheap option.
Just make sure you don't confuse it for the 3060 Ti, which is a completely different card.
3090/4090 are ideal, if you can afford it. More for future-proofing, but if nothing else, having the memory to run SDXL and a web browser is nice.
3090 is too expensive for me. Thanks for recommendation.
Budget Option 1080ti , has 11Gb VRAM and just works
I have an RTX2070 8GB VRAM - generates good quality SDXL, and then I upscale using Topaz GigaPixel - much cheaper and quicker than a 1920x1080 native off-the-board generation
Up scaling is nice but original generation can generate more details
Agreed - but how much detail is enough?
there's no benefit in generating much past the trained sizes (512 for SD1.5, 768 for SD2.1, 1024 for SDXL) as it starts generating mutations & errors
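A sketch of how one might pick a generation size that stays near the trained pixel count for a given aspect ratio, then upscale afterwards. `native_size` is a hypothetical helper, and snapping to multiples of 64 is an assumption made here for illustration, not a documented requirement.

```python
import math


def native_size(aspect_w, aspect_h, trained_side=1024, multiple=64):
    """Width and height with roughly trained_side**2 pixels at the given aspect ratio."""
    area = trained_side * trained_side
    ratio = aspect_w / aspect_h
    width = math.sqrt(area * ratio)
    height = math.sqrt(area / ratio)

    def snap(value):
        # round to the nearest multiple, never below one multiple
        return max(multiple, round(value / multiple) * multiple)

    return snap(width), snap(height)


print(native_size(16, 9))                   # (1344, 768) for SDXL
print(native_size(1, 1, trained_side=512))  # (512, 512) for SD1.5
```

For a 1920x1080 target this would mean generating at about 1344x768 and upscaling by roughly 1.4x, rather than generating at 1920x1080 directly.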
it may work but..........
GigaPixel does a fine job - fit that against the expensive hardware to get 1920x1080?!?!?
Bog standard 1024x1024 in SDXL on my 1080ti takes around 60-70 seconds, on a 4080 would be 5-10 seconds probably
But I guess the way things are going, it'll be all native 8k video soon ... ! 🙂
My RTX 2070 8Gb VRAM is getting me one 1024x1024 every 12 seconds
4090 with 30 steps dpm sde takes about 7-8s
4090 is also lightyears ahead of a 2070
With the batch processor on Vlad A1111 - I can set up 100 or 200 different prompts, leave it grind away overnight, and by breakfast - hey presto!!!!
You know some peace!
Takes 21 seconds on my 2080ti. So that's good to know.
mild variation of my prompt from yesterday, just trying out artist names 🙂
nice work! @peak dove
Thanks, just putting CumfyUI thru its paces
cumfyUI is the nsfw version? 😂
Tee hee
something that @visual glade should address
the one I posted?
ComfyUI is better than I thought it would be - and so easy to setup when compared with Vlad AUTO 1111 SDXL
its way faster
Its way faster to setup for sure
we can only unlock SDXL to its full potential using comfy
It also launches faster, im impressed every time
You want the prompt?
comfy take 3secs
if you want to share the setup with the community great! I was just liking the work, it's looking nice
a vivid watercolor depiction of diverse Rococopunk Afrofuturist women with beautiful and bold head wraps walking toward a huge moon in the background, holding hands, walking away from the camera, collaborating in a creative and productive environment, women empowerment poster, photography, inspired by the styles of victo ngai and vladimir kush
should have comfy meta data in it
oh really? i thought discord erased all that
They reverted that iirc
but here is the prompt anyways : (Photomontage:1.2) A jarring image where elements of various photographs collide and coexist. A cat with the wings of a dove leaps from a toast, above a sea teeming with swimming umbrellas. It's a realm where the laws of nature are suspended and the impossible becomes real
hey, I used the A1111 branch that supports SDXL0.9 base. I tried figuring out how to use it myself, but it gives me this error, help?
Whenever you get overwhelming errors like this just delete the VENV folder and run webui again
A1111 uses diffusers - u must set the diffusers checkbox in settings
Discord doesn't. But there is a bug in Chromium browsers where it changes the encoding of it if you copy paste from the browser window and it stops it working.
Add this argument at the start of webui.bat file - webui.bat --upgrade
Chrome likes to serve images in Google's own format, .webp, instead of jpeg
A lot of places (like Discord, Photoshop) hate .webp!!!!
I do too
Kewl
Doesnt discord use webp for all of their guild icons etc?
good old hokusai 😄
Im not sure what hokusai is 😛 but this is a tattoo I have, I was looking for new ideas
That's an old pic though, half of it is shaded in
hokusai tidal wave, very famous painting
that's the original

