#🤝|tech-support
ComfyUI can do the same as Auto1111 but is completely node-based. You can create very large and specific workflows in it
it's good for more advanced users, but can be very overwhelming
if you need python 3.11.2 you can install 3.10.11 too and then change the path in the auto1111 settings
but if you dont need it, best would be to uninstall it
yes you need to use the lora
ok, thanks, what value would you use as lora weight?
0.6 is recommended by the dev
I've installed "3D Openpose", and generated the four images it spits out in the extension with the default pose, but when I press "Send to txt2img" there is nothing in the controlnet and the image generates w/e so the pose is clearly not being sent. There is no error message in the UI, and none in the terminal. Is the extension (ver: f2d5aac5) not working properly with 1.7.0 or am I missing something?
I've restarted SD as well
thats normal xD this 3d openpose extension is 8 months old and not updated
incidentally i used a weight of 1 with these settings.. and ...
oh boo! when I try the normal open pose the pose isn't really getting the look I want 😛
Is there a working 3d poser?
no weight needs to be 1
lora weight 0.6
you can still save the images and put them into controlnet manually
looks good
great. so the settings are ideal?
was that with weight 0.6 and lora 1 ?
I tried that, but that didn't work either. The image it saved was dimmer so I don't know if SD got confused.
I will try it again I guess xD
used 1 for that one, but yeah i could play around with lower values
because fooocus added directml a few months ago, and auto1111 has had this fork for a year, and my guide covers all the needed steps
yeah the fooocus documentation is... lacking, to say the least
#👍
@ornate elk if you got few mins free now, can you show an example of face id from your end of the settings with this image, cause not all of the images are working out too well for me ..
it worked and properly allocated it but did not actually make it faster
I've seen a lot of people mention blender for poses, is there a good resource for how to use blender to play around with a pose?
ooooopsy
also @ornate elk i think there is a clever way of using lineart with face id for pretty realistic effect
oh okay, why with lineart?
did you edit the webui-user.bat? like in guide step 5
the combination is for something as realistic as this
i did but i fucked it up
already figured it out lol
sometimes when im reading things my eyes have the habit of skipping lines just completely accidentally, or jumping from say, the amd guide to the nvidia guide without noticing
although i'd prefer to have better control over face id alone, so im curious what output you can generate with that image i shared so i can compare my settings
ahh yea, i know that
so you want it as realistic ?
the facial look yes but not entirely the pose
@ornate elk im also using face id plus
ill try
one thing i notice about face id is that the image we generate it may not follow the exact angle or positioning of the face, it just uses the facial resemblance of the source image .. which is perfectly ok for creating variations
yea thats what ip-adapter always does
you can fix a position with a openpose with face mask
did i mess up by putting a space here
yes or lineart for details
normally that doesnt matter
but you may need to delete the venv folder if you still get the skip-torch error
i deleted it all and am retrying now lol
i assume setting it up for nvidia first may have borked it
i made the guide, the fork is from lshqqytiger on github
gotcha
fooocus is cool but kind of ass
its terrible at making anything even remotely specific
amazing if you type "burger" but add more than 2 words and it shits itself
im having trouble with my art, idk why it makes it like this. it does this at the last minute before completing and idk why its doing this. i could give more information if needed
the prompt was "fortnite burger"
yea its made for easy prompting and getting crazy artistic results
hey, whats your model and vae ?
i find they rarely have much to do with the prompt
where would i find that at?
what webui are you using?
woah auto1111 is way faster
uh, i actually dont know where i would find this information
do you use a local programm?
then in the browser, scroll to the bottom and check the version
i think so, yeah its on my local fine
yea it depends on the models and its generally better optimized as it supports more backend optimisations
you're using the basic 1.5 model, thats over a year old so thats why
says version: v1.7.0
it was working earlier but idk why its doing this
yea, you should get community-made models, these are much better in quality and have specific art styles,
the largest database is Civitai.com, there you can download them for free.
For example get the Dreamshaper v8 model from here:
https://civitai.com/models/4384/dreamshaper
DreamShaper - V∞! Please check out my other base models , including SDXL ones! Check the version description below (bottom right) for more info and...
Put that file into models/stable-diffusion folder
oh yeah i was using those on fooocus
thought you meant i should be using a newer version of auto
nope, auto1111 is up to date
so any possible reason why its doing this last minute
i disabled filtration on this
and people are way too horny
whats the model name, thats in the top left corner?
ok this is sick as fuck
you mean the check points?
yes
currently using MFCG Doll Mix v2-No VAE.safetensors, tried other checkpoints and it keeps messing up at the last minute
like its working give the right colors until very very last second it messes up
okay, i checked your image metadata,
your sampler + the upscaler settings dont work together
dont use 4 hires steps, use 10, and set the denoising strength to 0.5
then use DPM++ SDE Karras for example
also lower the lora weight to 0.8
can you explain why? just so i can learn
You used a latent upscaler, these upscalers can generate additional stuff into the image by denoising.
but 4 steps are too few for them. 10-15 is a good value.
The denoising strength defines how much the image gets changed by the upscaler, a value like 0.7 is too high and can deform the image.
Samplers are all different, some work better with Latent Upscalers than others.
Would recommend, Euler a, DPM++ 2M SDE Karras, DPM++ SDE, but not with Heun together.
There are also Upscalers like Esrgan4x+ that work better with any samplers, and they dont add stuff to the image, so if you just want a little enhancement then they are good too
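To make the hires fix advice above easier to check against your own settings, here is a small Python sketch. It only encodes the rules of thumb from this conversation (10 or more hires steps for latent upscalers, denoising strength below 0.7). The function name and the idea of detecting a latent upscaler by its name are assumptions for illustration, nothing in the webui enforces these limits.

```python
def check_hires_settings(upscaler: str, hires_steps: int, denoise: float) -> list[str]:
    """Return warnings based on the rules of thumb from the chat (not webui limits)."""
    warnings = []
    # Assumption: latent upscalers can be recognised by a name starting with "Latent".
    is_latent = upscaler.lower().startswith("latent")
    if is_latent and hires_steps < 10:
        warnings.append("latent upscaler with fewer than 10 hires steps")
    if denoise >= 0.7:
        warnings.append("denoising strength of 0.7 or more can deform the image")
    return warnings


# The settings that caused trouble, then the suggested ones.
print(check_hires_settings("Latent", 4, 0.7))
print(check_hires_settings("Latent", 10, 0.5))
```

The first call should report both problems, the second should come back empty.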
@fair oxide here is what i get with id-plus
tried realistic vision model
ahh ok, i think some of the models i tried have different outputs depending on how they are trained
i checked with same settings too
ill try now with face full as comparison
sure, would love to see that
did you use default values?
hmm, face looks bit too different from the source, you suppose?
could be the realistic vision model
maybe a bit, but for me face full is still better than face id, because it can actually move over the skin and face accents better, every extension that uses insightface (like Reactor, roop, or faceID) smooths the face too much
interesting, i have to try that on my end
im actually gonna try it now
@ornate elk just to be sure, you used these models?
cool
yes, and no lora, as it doesnt need one
right... ok im gonna try that image with that settings
what was the resolution?
640x640
strange that shouldnt happen with that res
everytime you get that error, you need to restart the webui to clear the vram
i'm not using adetailer when using ip-adapter btw
i use hires fix to smooth it a bit
yes me too, hires fix is active
this shit dont work lol
what are your settings?
disable hires fix and it will work
You're pushing the resolution up when you have that on.
i mean its a 24GB card
on windows its very limited, because directml is poorly optimized in vram usage
rocm would be much better, but we need to wait a few months for that on Windows
you can use hires fix, but just use these settings:
Resolution 512x512, 512x768, or 540x960 (for FullHD)
Hires steps: 10
Denoising strength: 0.5
upscale by: 2
Upscaler: Esrgan4x as example, or latent bicubic
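A quick bit of arithmetic for the resolutions listed above, as a Python sketch: the final image size is just the base size times the "upscale by" factor. The function name is made up for this example.

```python
def hires_output_size(width: int, height: int, upscale_by: float = 2) -> tuple[int, int]:
    """Final image size after hires fix: base size times the upscale factor."""
    return int(width * upscale_by), int(height * upscale_by)


for base in [(512, 512), (512, 768), (540, 960)]:
    print(base, "->", hires_output_size(*base))
```

That is why 540x960 is listed for FullHD: with upscale by 2 it ends up at 1080x1920.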
@fair oxide you see what i mean, left face full, right face-id
that was the input image
yep i see it, you are right about face id smoothing out too much
face-id-plusv2
better than face-id plus
nice, i'll grab that one too
@ornate elk can you offer some explanations on these 3 parameters ?
i assume control weight would refer to more closely following the source the more weight you have?
weight is how strong the controlnet itself should be applied, (like lora weights)
starting step is the point where the controlnet kicks in, as a fraction of the total steps
Same goes for the Ending Step. (if set to start 0 and end 1, it will force the controlnet until the last step)
so reduce the end step so the face doesnt get too forced on an image if you want to upscale for example
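Here is a rough Python sketch of how the starting and ending step map onto sampling steps. It is a simplified model for illustration only: the function name is made up, and the exact rounding the ControlNet extension uses internally is an assumption.

```python
def controlnet_active_steps(total_steps: int, start: float = 0.0, end: float = 1.0) -> list[int]:
    """0-based sampling steps where ControlNet is applied (simplified model).

    A step counts as active when its position i / total_steps falls in [start, end).
    """
    return [i for i in range(total_steps) if start <= i / total_steps < end]


print(len(controlnet_active_steps(20)))       # start 0, end 1: every step
print(controlnet_active_steps(20, 0.0, 0.5))  # only the first half of the steps
```

With end set to 0.5 the second half of the steps runs without the controlnet, which is what gives the model room to finish the face on its own.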
great thanks!
also with face full you can switch to an unrealistic model and it still gets the face right:
same input image from above
how many loras do you need to make something not look like deformed dogshit
no lora
you need a good negative prompt first
some quality tags in the prompt too
did you try dreamshaper v8?
for example, add these to the negative prompt:
disfigured, kitsch, ugly, oversaturated, grain, low-res, Deformed, blurry, bad anatomy, disfigured, poorly drawn face, mutation, mutated, extra limb, ugly, poorly drawn hands, missing limb, blurry, floating limbs, disconnected limbs, malformed hands, blur, out of focus, long neck, long body, ugly, disgusting, poorly drawn, childish, mutilated, , mangled, old, surreal, text, blurry, b&w, monochrome, conjoined twins, multiple heads, extra legs, extra arms, fashion photos (collage:1.25), meme, deformed, elongated, twisted fingers, strabismus, heterochromia, closed eyes, blurred, watermark
oh ok
i was just using something called "easy negative" because i thought it would do it for me lol
thats an embedding (textual inversion), that will work too if you downloaded it and put it into the embeddings folder, and then added it to the negative prompt
i did but everything looks like shit
feel free to post an example
do you use an SDXL model?
whats that
a model thats 6gb large
definitely not
okay, then what are your txt2img settings?
this shit is just nightmare fuel
did you downloaded a community checkpoint?
nope
I'm looking for "restore faces" option, but I don't find it, how can I find the option?
get v8, not sdxl
https://civitai.com/models/4384/dreamshaper
must not be on civitai
oh
its just called dreamshaper
oops
sorry im a total end user with this shit
its hidden, to get it back go into Settings - User Interface- Quicksettings, and there add Face_restoration,
then hit apply and reload ui
no problem at all 🙂
@deep canyon here is an easy example:
"C:\Matrix\stable-diffusion-webui-directml\repositories\stable-diffusion-stability-ai\checkpoints" right path?
note says put unCLIP checkpoints here
nope, stable-diffusion-webui-directml/models/stable-diffusion
ty
This stuff?
then you can click the reload button near the models(checkpoints) dropdown
and then you can select it
uprank 😄
they all seem to be coming out deepfried
can you show your settings again?
thats strange, did you change anything?
can you paste an image in here?
hmmm seems to be coming out fine now
strange
maybe its just RNG to get deep fried ones?
make sure to not run Wallpaper Engine or Epic Games Launcher in the background, also whitelist the webui in any browser adblocker
also sometimes a simple restart fixes it
this thing will not stop making women lol
i even just put "boy" as the prompt
still women
lol
yea had to take wpp engine off startup and quit using it because it kept crashing my shit
yea a common error ^^ had crashes tooo with it when using SD
yea, i disable it everytime before starting a game
i dont get why it oversaturates stuff so often
extremely oversaturated? like before?
do you run any other programm or multiple browser tabs in the background?
yea
discord desktop with github pages and civitai
nothing else really
except HWInfo and Aquasuite and a music player
minimize all other programs, and close the civitai tab xD that site is a browser lagfest
these oversaturations come from lagging, when some other program uses too many resources while generating, then the image bugs out and gets corrupted.
happened to me a few times, but its normally very rare, and i dont have anything open, also using firefox with maybe one other tab
yeah it only happens on the last step too
like it looks fine before
aaaaand I just closed auto by accident
xD
can you paste an image in here, so i can check the metadata?
make sure to upload the original output
its in the outputs/txt2img folder
i think the lora i was trying to use is broken
i removed it and its not oversaturated now
yea
that can cause this too,
you didnt said you used a lora xD
i would have figured it out in seconds with that info
my bad
no problem, you have to check if a lora is made for 1.5 or sdxl
you find that info on the civitai page when downloading the lora or model
they have a little info box
Base: 1.5 or SDXL
says 1.5
okay, then it can work with dreamshaper v8,
but loras still can be "overtrained"
then you have to adjust the Number after the lora, lower it to 0.5 and try again.
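For reference, the "number after the lora" is the weight in the `<lora:name:weight>` tag in the prompt. A minimal Python sketch that builds such a tag; `my_lora` is a placeholder name, not the lora being discussed here.

```python
def lora_tag(name: str, weight: float = 1.0) -> str:
    """Build a lora prompt tag in the <lora:name:weight> form used by the webui."""
    return f"<lora:{name}:{weight:g}>"


# "my_lora" is a hypothetical file name used only for illustration.
prompt = "portrait photo, " + lora_tag("my_lora", 0.5)
print(prompt)
```

Lowering the weight just means changing that last number, for example from 1 to 0.5.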
still overcooking
what lora is that?
im not at liberty to say 💀
okay, then try an other lora to check if that works
even 0.3 its cooked af
its rare but loras can be just broken
its the one it was trained on
so im leaning towards broken
other people have it working tho in comments so idk
you can share the lora with me in private if you want me to take a look
xD ah okay
(not derogatory just actually gay)
why is it now saying "RuntimeError: Not enough memory, use lower resolution (max approx. 640x640). Need: 0.2GB free, Have:0.1GB free" it was just working earlier
ahh okay
whats your GPU and whats inside your webui-user.bat?
2080ti super max q
wdym what's inside my webui-user.bat
when you right click and edit the webui-user.bat
@echo off
set PYTHON=
set GIT=
set VENV_DIR=
set COMMANDLINE_ARGS=
call webui.bat
yep got it working
baller
now i can make my homie regret showing me stable diffusion
hahaha 😄
okay, at the line set COMMANDLINE_ARGS=
you need to add:
--xformers --medvram-sdxl --no-half-vae
then save and relaunch the webui-user.bat
that will increase the performance alot and lower vram usage
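If you would rather not edit the file by hand, the change above can be sketched in Python. This only rewrites the text of the file shown earlier; the function name is made up, and you would still need to read and write the real webui-user.bat yourself.

```python
def set_commandline_args(bat_text: str, args: str) -> str:
    """Return the .bat text with the COMMANDLINE_ARGS line set to the given args."""
    prefix = "set COMMANDLINE_ARGS="
    lines = [
        prefix + args if line.strip().startswith(prefix) else line
        for line in bat_text.splitlines()
    ]
    return "\n".join(lines) + "\n"


# The default file content as posted above.
original = (
    "@echo off\n"
    "set PYTHON=\n"
    "set GIT=\n"
    "set VENV_DIR=\n"
    "set COMMANDLINE_ARGS=\n"
    "call webui.bat\n"
)
updated = set_commandline_args(original, "--xformers --medvram-sdxl --no-half-vae")
print(updated)
```

Only the COMMANDLINE_ARGS line changes, everything else in the file stays as it was.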
thank you
Np
Hi again, thanks for your rapid and helpful response. It seemed to install this time without error but I'm still getting an error when I try to generate an image. It says RuntimeError: Unspecified Error and this is what the code looks like. Thanks!
Hey, would need to see the whole cmd output
Can you close and relaunch the webui-user.bat?
Then try the 1.5 EMA pruned model
With default settings
I only have 2 gigs of VRAM so I put the --lowvram condition in but was I supposed to add --no-half in addition to --no-half-vae or --no-half instead of --no-half-vae? My GPU is AMD RX 560
Ah okay, that's right on the edge
Then you should download a 2gb model to test. Like Dreamshaper v8
Thanks
I can't guarantee that it will work with 2gb
Could someone help me figure a problem out. So I'm currently using a SDXL 1.0 as my main model. But sometimes there will be character LoRAs I want to use that doesn't exist for SDXL, only for 1.5. Obviously I can't use it on SDXL. Is there a good workaround for this problem? I've tried making the image first in SDXL then using that output in img2img and switching to a 1.5 model with the LoRA, but the result are just not good.
@ornate elk what kind of prompts are you using that you found have helped custom trained models with dreambooth look more like the person youre trying to get
the workaround is doing the same training to create a lora for SDXL, if you consider that a workaround
I had to update the graphics card driver again and thought it was almost going to work. I could see the image loading up with dreamshaper8 and it got to 90% but then it hit a runtime error. Could not allocate tensor with 268435456 bytes. Not enough GPU video memory available. Might try once more with all other programs closed.
Brought down height and width slightly and success! Quite slow but I think it's as good as I'll get. Thank you.
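To put the number from that error into perspective, here is the conversion as a short Python snippet. The only input is the byte count from the error message.

```python
requested_bytes = 268435456  # value from the error message above

mib = requested_bytes / 2**20  # bytes -> MiB
print(f"{mib:.0f} MiB")
```

So the failed allocation was a single 256 MiB tensor, which is a lot on a 2 GB card once the model itself is loaded. That fits with a slightly smaller width and height being enough to get through.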
Well not really
hello! I have a question and would love some help ^^
basically I made 2 loras, each of a different character. I'm trying to do images with these two characters together in the same image, but they keep mutating into each other (swapping genders, clothes, hair colors, eye colors) and a lot of extra heads/arms/legs/hands are generated as well.
I've tried using control net and I've tried regional prompting, but nothing seems to be working for me and I can't find much info online
Could anyone please provide me with a prompt or other solution that might work? I'm using SDXL and this checkpoint https://civitai.com/models/261336/animapencil-xl
any help would be very appreciated as I've been unable to figure this out and it's been driving me crazy for a few days!
Thank you in advance 
blue_pencil-XL meets ANIMAGINE XL 3.0 License : Fair AI Public License 1.0-SD You should share the merge recipe if you release a model merged with ...
hello! I have a problem... my load image nodes arent working properly, i cant choose a new image from a path anymore after some updates today
my browser console says:
Error calling extension 'Comfy.smZ.WorkflowImage' method 'setup'
Object { error: TypeError }
error: TypeError: callerLine.match(...) is null
anyone maybe knows how to fix this?
id rather not reinstall comfy 😦
i dont know if this is the right channel but the anatomy and everything fucks up at the end and things are getting worse and worse the more i use this
That doesn't really help unless you share things like examples and screenshots of your configuration.
yeah prolly not best since im making some nsfw content currently
i mean i do request the help though
@upper narwhal DM me the images with the messed-up anatomy and i'll see what i can advise.
is this normal that with some SDXL checkpoints i only get noise output in my animatediff workflow? while others work just fine?
i mean i get that some work better than others, but here with animatediff some checkpoints produce absolutely nothing, just random noise
omg i see something! clip skip helped
still really bad tho
Where?
No problem, the image quality is epic!
Thanks! 😎😊
I haven't custom-trained a model with dreambooth yet, would recommend training a lora or using a faceswap tool or ControlNet IP-Adapter
Nice good to know, what you can try too is to install the Multidiffusion extension with Tiled VAE support.
With tiled vae enabled it can stop the crashes most of the time. So you can use higher resolution.
Is there a tutorial somewhere for the Multidiffusion extension?
You dont need one. Its literally installing the Extension, via the Extensions tab.
Click on Available, Load from, to get a list.
Then search for Tiled Diffusion. Then install.
Then restart the webui-user.bat.
And then you only need to enable the Tiled VAE Option and test it with default settings
No problem at all
The bots are currently under maintenance #1047610792226340935
Means all of the bots are under maintenance?
Yes
Thanks
👌
all of my generations are coming out as odd smudges, i just started using stable diffusion today but i couldn't find anybody else talking about this. if it helps i'm using sdxl base with comfyui and default settings, but no matter the prompt it always turns out like this. it generates the same smudge look when i use my nvidia or my cpu so i'm unsure where to go from here. if anyone has an idea what i'm doing wrong i'd really appreciate it!
🤝
Hey, did you try the standard sdxl workflow from the comfyui github?
Also what's the resolution?
yep, first image here is perfectly standard and other two are just "half orc standing"
as for resolution its 512x512 as is standard
Then something is wrong
For sdxl a standard resolution of 1024x1024 is needed
Np, 1.5 models are for 512x512
thank you so much i apologize for my ignorance
No problem, the default sdxl workflow should use 1024 by default
So if it was set to 512x512 that would be strange.
But good that it works now 🙂
actually it doesn't hold up
i thought it did at first
it's still smudges just slightly higher resolution
damn
is it a hardware issue? a download issue? should i try reinstalling?
Do you have sdxl and the refiner model?
yep
Then delete all nodes and load that image into comfyui:
That contains the default sdxl+refiner workflow
i guess i wasn't using the right defaults? i just used it as installed, i didn't realize there was another set entirely
let me try this
Here is a helpful site for basic workflows:
https://comfyanonymous.github.io/ComfyUI_examples/
Okay
honestly i'd think it was my gpu's fault or something if the cpu didnt do the same thing
I'm looking for a way to monitor progress, memory usage etc directly from the A1111 UI, anyone know any extensions that would do all or some of these?
Using the UI through --listen on another computer and I can't access the command line easily
can you help me please
has there ever been anyone with the same issue?
You should install auto1111 via my install guide in the pinned messages of this channel.
Maybe that webui will work for you
There is an extension for that:
https://github.com/vladmandic/sd-extension-system-info
Too little info to help
Perfect, thanks! Will check it out.
it's not the webui, it's something about stable diffusion itself apparently
just installed auto1111 to the letter
Screenshot all settings pls
In txt2img
Also what model (checkpoint) did you use?
Try another model please
any recommendations?
DreamShaper - V∞! Please check out my other base models , including SDXL ones! Check the version description below (bottom right) for more info and...
Its 2gb
there's absolutely no blurring or artifacting on this one, so i guess it's just hardware limitation on my part or something?
sorry, i'm still new to this
but thank you so much for your patience and help
this one works great
its definitely an older GPU, nvidia geforce rtx 2070 super, but what tripped me up was that the cpu produced the same results just slower
Maybe the model is broken. Try this sdxl model:
https://civitai.com/models/133005/juggernaut-xl
For business inquires, commercial licensing, custom models, and consultation contact me under juggernaut@rundiffusion.com Update: Try out the Inpai...
Also for sdxl use the 1024x1024 resolution
this one works just fine, i guess i just downloaded a broken model or something? i'm honestly confused as to how that's possible, but point is it's working now and thank you
Could be a corrupt file then
You can also redownload the 1.0 one
Hello to all friends
I wanted to ask a question, in the control net and openpose extension, I also downloaded the control_v11p_sd15_openpose [cab727d4] model and put it in the extension, and even when I click on the run preprocessor option, it gives me the pose of every photo I uploaded, but when I write a prompt, the image It does not create a pose. I don't know where the problem is, I tried almost all the options, but it still only works with the same prompt, what should I do to solve this problem?
keep getting this error with "next view" when I try to turn batch Images Into a video, says there Is a video file missing but Im pretty sure thats the one Its supposed to create out of the Images
??
I see you are using DreamShapeXL_Turbo model, it's an SDXL model, right? You need SDXL ControlNet models for SDXL models. Try if that helps.
You installed the webui inside a onedrive folder. Thats not recommended and causes path and permission issues
hello, could someone tell me how to run Stable Diffusion on Kaggle
You could also try a different workflow, first create a base image with an SD model + ControlNet to get the right pose, then use that image as a base and do some refining with inpaint using the SDXL model.
I remember reading something about it on their website
see if this helps https://youtu.be/dpM02YMj8FY?si=uXrdlXIIW1vcMcAI
You want to use Stable Diffusion, use image generative #AI models for free, but you can't pay online services or you don't have a strong computer. Then this is the tutorial you were looking for. By watching this tutorial, you will learn how to use Kaggle free cloud service with famous Stable Diffusion #Automatic1111 SD Web UI as easy as it is ru...
These are the models I have, I think they are all sdxl right?
Oh yes, the problem was with the models
I tried another model and it made the image for me in the controlnet pose
Thank you very much
I only recognise the DreamShaper, sorry. Check the source where you downloaded them from. Civitai, Hugging Face etc
My problem is solved, I will go the rest of the way by myself 😘
Great! 🙂
ComfyUI question, is there like a steps to CFG ratio to follow? I'm getting decent images this way, first KSampler 10 steps, 3 cfg, 0.86 denoise, upscale to 1.5 before going to second KSampler 12 steps, 3cfg, 0.50 denoise, before going to last upscaler of 1.5x on Ultimate SD Upscale. With that much Denoise on second KSampler, is it worth doing say 25 steps on the first and 5 cfg? then second at 35 steps 7 cfg?
Does someone know if there is a way to avoid this? Would making a low res image and then progressively upscaling it step by step work?
@ornate elk just to share this observation with you, most of the images I generate using SDXL models, those are fine with 512x768 resolutions along with hires fix as usual workflow.
Your python version is too old.
Check my install guide in the Pinned Messages in this channel
What's your GPU?
And what's inside your webui-user.bat?
RTX 4060 Laptop/ 8GB VRAM
Pardon?
Not sure what you mean by that, excuse me..
a file named under root webui folder
to see the content of that file you'd have to right click and edit to open in notepad
Should be this one then, yea?
that should basically show something like this ...
Ahh this one 1 sec
I had --xformers in as well until just today when I decided to try it out without it
--xformers is crucial to optimize performance greatly if you are using nvidia cards
So shall add it back in there?
wait for @ornate elk he was going to suggest something im guessing
I've heard that results can vary when using it and trying to recreate images from copied generation prompts. But yes no worries I'll wait thanks 🙂
well to recreate an image it requires using same seed and settings, but xformers is a built in technical feature to optimize SD performance for nvidia GPUs regardless you want to create or recreate an image
But if it's important, I'll add some background, I'm trying to make an image as high resolution as possible (if possible the same resolution as my phones screen) for a wall paper or so 😃 those don't have any quality yet. -
these are the conventional resolutions you should set your image size to then hires fix by 2x to upscale
I have the exact generation data, same checkpoint version, Lora version applied, clip skip, seed etc. But kept ending up with a different, disturbed image. So I figured its either --xformers or hires fix isn't transmitted when copying an image's parameters so I'll just have to figure that out myself
that could be anything about the settings such as LoRA, source image (if used), embedding and so on, but not xformers
Ah ok.. I've set it to the exact parameters of my screen ratio divided by 4 and then wanted to upscale x4.. maybe I have to use another ratio then tho ..
think of xformers as performance related not image manipulation.
for most tasks you'd be using 512x768 then upscale by 2x using hires fix .. that's the usual convention, but you can also use other larger resolutions, in fact with SD 1.5 models you can even use 512x2048 and the output would be symmetrical. you can experiment with those set dimension i showed above and mix them up
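For working backwards from a target wallpaper size, here is a small Python sketch. The phone resolution of 1080x2400 is a made-up example, and snapping to multiples of 8 is an assumption about what the width and height sliders accept, so treat the output as a starting point to experiment with.

```python
def base_size_for_target(target_w: int, target_h: int, upscale_by: float, multiple: int = 8) -> tuple[int, int]:
    """Base generation size that reaches roughly the target after hires fix.

    Each side is divided by the upscale factor and rounded down to a multiple
    of `multiple` (assumed slider step).
    """
    def snap(value: int) -> int:
        return max(multiple, int(value / upscale_by) // multiple * multiple)

    return snap(target_w), snap(target_h)


# Hypothetical 1080x2400 phone screen, with 2x and 4x hires fix.
print(base_size_for_target(1080, 2400, 2))
print(base_size_for_target(1080, 2400, 4))
```

With a 4x factor the base image gets very small, which may be part of why a 2x upscale from a larger base is the usual suggestion.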
I see... Interesting
Well, to start things off I added --xformers back now and deleted the venv folder to get a clean first boot of automatic1111 (I think)
you didnt have to delete venv folder to add back --xformers
Oh.. anyhow... 😅
the only reason you may need to delete the venv folder is when you have lots of updates patched over a period of time and bugs are getting in the way
During this boot now I can see it suggests to upgrade. Do I just add '-- upgrade pip' behind where I also wrote --xformers and then restart again?
typically you should not update the webui every time you launch it, since nightly builds could break things. to actually update, go into the webui root folder, open cmd there and type git pull. apart from webui updates, you are encouraged to update your extensions within the webui: go into the extensions tab, check for updates, then apply and reload.
Sitting in this menu for a good couple of minutes now.
Okay 👍🏻 I'll do this update another day then.
thats normal
It's done now.lol
Your card needs: --xformers --medvram-sdxl --no-half-vae
For the best performance
But they are all sure to not decrease the outputs? What's the downside to using them?
there is no downside in using them, xformers had an issue which was that you cant recreate an exact 100% same image, but that seems fixed with the 0.0.20 version,
the command args dont reduce the quality in any way
New to the SD world, and I am starting with what I believe to be a simple task. The task is using personal images of myself to generate realistic portrait images. I am on a local installation of the kohya trainer and Automatic1111/dreambooth to generate the text-to-image. The training data set is all in 512x512 resolution with the naming convention "subject(1)", ..., "subject(n)". Using this trained model, I then merge with a Civitai model (50/50 blend). The resulting generated image looks terrible (non-human, poor resolution, insert any undesired trait of a realistic image). This issue seems to be independent of the prompt itself. Any suggestions? (investigate a new data set with higher resolution, trainer issues, Civitai model selection)
Okay that's crazy. I just wonder then, why isn't it fundamentally recommended to use all of those commands even if you don't 'need' them if they don't have a downside? Basically just to make 'extra' sure your experience is the best it can be
yea, thats a good question, the devs dont want to force any of these arguments, to keep it clean at install
for example, if you would install Auto1111 to use it with your cpu, you would get errors because xformers is for nvidia only
or --medvram-sdxl is only useful for 8-11gb vram gpus
so it wouldnt make sense to add them for all
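The reasoning above can be written down as a tiny Python sketch. It covers only the two rules mentioned here (xformers for nvidia, --medvram-sdxl for 8-11 GB cards); the function name and the vendor strings are made up, and other setups may need different arguments.

```python
def suggest_args(vendor: str, vram_gb: float) -> list[str]:
    """Suggest launch arguments using only the two rules of thumb from the chat."""
    args = []
    if vendor.lower() == "nvidia":
        args.append("--xformers")       # xformers is for nvidia cards only
    if 8 <= vram_gb <= 11:
        args.append("--medvram-sdxl")   # described as useful for 8-11 GB cards
    return args


print(" ".join(suggest_args("nvidia", 8)))
print(suggest_args("cpu", 0))
```

Since the right set depends on the hardware, it makes sense that the install ships with an empty argument line.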
Does AMD automatic1111 still not work with Olive optimization? I keep getting all kinds of different errors
it works with olive partly but isnt recommended to use
I could get the Ui to work and it would be 15s per iteration so I tried the amd community post and followed it but it doesn't work lol
So for amd cards its just not usable until more support is added officially?
if you got the --skip-torch-cuda-test "error" thats fixable
im using SD Auto1111 directml fork with my Radeon RX7900XTX, without Olive, on Windows
Yeah i got around that by adding the argument
yea thats the wrong approach, adding that will use cpu only
with Olive optimization I could not get it to work but generally I got both Auto and tiger to work
so It is not using the gpu at all? I saw 100% gpu usage in task manager
I installed the directml btw
if you want a usable webui with the best support but with mediocre speed, you should follow my install guide, its in the pinned message of this channel
what's the average speed you get on your card per iteration? I only have a 6800XT
if you want something really fast, but with less support i would recommend the Shark webui. Much better than Olive, but has the same problems (converting models, for every resolution, no multi lora or extension support, etc)
i can check, one moment
I understand but I just want to know your speed per iteration so i can see if its worth it or not
I gave up on Olive version and was just going to find some good online generator like getimg
Also does Controlnet work with the set up you have? I usually use that a lot
Linux is still the best option for AMD right now. But we will get rocm support on windows in the coming months.
i would say its pretty worth it with a 6800XT to use it locally
I can try your guide, what about controlnet?
getting 3-4 it/s
controlnet works completely
alrighty, Thanks
only sdxl can be slow as its a 6gb file
I ran Fooocus yesterday, it worked but it was terrible lol
yea Fooocus added amd (directml) support a few months ago, but didnt add other optimisations
auto1111 directml fork from lshqqytiger is the best we have right now in terms of use and support
the next best is comfyui with directml
then Shark webui for 20it/s speed (but lacks features)
sounds good, I used comfy last year on my laptop with a 1060 6gb lol
I do love the interface of comfy
okay, I will follow your guide and post here if I get any errors
Yea sure 🙂
Is the bot down?
do i need --medvram?
Yes
I got 16gb doe
I got 24 and need it too 🥲
loool
Directml is not optimised in vram usage thats why
RIP
We will get Rocm support in the next months so everything will change to the good
ok mine is installing now
yeah hopefully we get the support soon
and it actually works
unlike olive
yea i think it will
ALRIGHTY it works and is 4seconds per it
Make sure to test 512x512, euler a, 30 steps
Also important: make sure wallpaper engine and epic games launcher dont run in the background
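The two speed figures quoted in this thread use different units ("3-4 it/s" versus "4 seconds per it"), which makes them easy to misread. A small sketch of the conversion for the suggested 30-step test; the helper names are made up for illustration and 3.5 it/s is just the midpoint of the quoted range:

```python
def to_its_per_second(seconds_per_it: float) -> float:
    """Convert 'seconds per iteration' into 'iterations per second'."""
    return 1.0 / seconds_per_it


def seconds_for_run(steps: int, its_per_second: float) -> float:
    """Rough sampling time for one image, ignoring model load and VAE decode."""
    return steps / its_per_second


slow = seconds_for_run(30, to_its_per_second(4.0))  # 4 s/it as reported
fast = seconds_for_run(30, 3.5)                     # midpoint of 3-4 it/s
print(f"4 s/it    -> about {slow:.0f} s for 30 steps")
print(f"3.5 it/s  -> about {fast:.1f} s for 30 steps")
```

So 4 s/it is roughly 14 times slower than 3.5 it/s, not slightly faster.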
so for models from civitai I just paste into models/stable-diffusion right?
yeah they're closed
Yes
okay, Can I make higher resolutions as well?
I like 1024x576
16:9 ratio
Could work. I don't know your cards limit. You have to try.
But I know the upscale limits mostly
Also important. Every time you get an Out of memory (vram) error, you need to restart the webui to clear the stuck vram
ok
go on the extensions tab, click on Available, click on Load from, then you get a list
type sd-webui-controlnet
then install
then restart the webui
alrighty
for the controlnet models its important to use the smallest that are available, like these:
https://civitai.com/models/38784/controlnet-11-models
there are multiple on that page so here is a quick overview how to get them with their config file
put everything in models/controlnet folder
No problem 🙂
one more question if a model on Civitai has XL in their name, It is not going to work with mine right?
also where do I get SD 2.0 is that usable?
it can work but is slower
oh nope, stay away from that xD
it will work but its so badly supported
you also need config files for some 2.1 models
not worth it
its good but still 1.5 is the most used as it has more community made loras and model variants
you can download an sdxl model like Bluepencil xl and try it out
so stuff like this would be slower comparatively?
https://civitai.com/models/260267?modelVersionId=293564
yes it would be slower and could crash when going too high in resolution
i see
but if you do like 1024x768 for example that will work good
yeah I usually do that or 512x512
sdxl models are trained on 1024x1024 resolution
1.5 is trained on 512x512
best resolution for portrait is 512x768
or if you want a FullHD image later then use 540x960
i see
upscaling by 2 and you have HD
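The arithmetic behind the 540x960 suggestion, as a tiny sketch (the `upscaled` helper is hypothetical): doubling both sides lands exactly on a portrait Full HD frame, while the usual 512x768 portrait size does not.

```python
def upscaled(width: int, height: int, factor: int = 2) -> tuple[int, int]:
    """Return the output size after upscaling both sides by the same factor."""
    return width * factor, height * factor


print(upscaled(540, 960))  # (1080, 1920), portrait Full HD
print(upscaled(512, 768))  # (1024, 1536), close to but not Full HD
```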
its tiled rendering, not needed for now
but later on Tiled Vae can help in situations where vram is limited
i see
okay so it is working however images are mostly ugly rn, when I used get img I had to make longer prompts so this is also similar to that right?
like detailed prompts otherwise you get ugly images?
yea you need some good quality tags, like high quality, highres, detailed, etc,
and more important some good negative prompts
for example add these to negative and create again, it will be much better image
disfigured, kitsch, ugly, oversaturated, grain, low-res, Deformed, blurry, bad anatomy, disfigured, poorly drawn face, mutation, mutated, extra limb, ugly, poorly drawn hands, missing limb, blurry, floating limbs, disconnected limbs, malformed hands, blur, out of focus, long neck, long body, ugly, disgusting, poorly drawn, childish, mutilated, , mangled, old, surreal, text, blurry, b&w, monochrome, conjoined twins, multiple heads, extra legs, extra arms, fashion photos (collage:1.25), meme, deformed, elongated, twisted fingers, strabismus, heterochromia, closed eyes, blurred, watermark
okay, what about hires. fix and refiner
you can completely ignore refiner
hires fix is very nice to get much better quality outputs, but it slows down the process and uses a lot of vram.
so if you want to use it then i can give you the best settings
maybe later
what's the best place to get prompts like Lexica was nice until they added their own model to it
Civitai has a site with only images
im getting really weird color issues with the model I sent above
is it because its XL model?
what's your model and VAE ?
maybe because you used 512x512 with sdxl
higher than that it ran out of memory lol
try 768x768
AMD
euler a is good, DPM++ SDE and DPM++ 2M SDE Karras also
for testing, Euler A is probably the best speed / safe compromise
you may not be able to use hires fix with sdxl models because of the high vram usage of both together
but you can also upscale in img2img without vram limit
that should work but SDXL model doesn't like 512 lol
yess ong
I understand that. By the way, is Automatic1111 using VRAM only? I have only 8GB VRAM but 64 GB RAM if that's any helpful for anything
the image generation part is only done with vram, you can only store some models in ram for quick switch if they arent stored on an ssd
Aye okay
@ornate elk Will the Ui save my last prompt?
or settings
How do I not start from scratch each time lol
also what is Lora, Am i able to use that with AMD set up?
Loras are smaller files trained on a specific character or style. It helps the model to create that char / style.
Yes you can use loras.
Check for their base version.
1.5 loras only work with 1.5 models and sdxl loras with sdxl
Yes you can click the blue/white arrow to get the last settings used
ok
cool
ty
Stability API is down.
I've Installed Posex
According to the tutorial, this interface should be shown
However, I just see this
I don't see the option to modify any pose
How to fix?
Make sure the webui is whitelisted in any browser adblocker
I've finally downloaded another one which the tutorial man offers in case that doesn't work
Posex is pretty outdated, guess its broken
But what exactly would be the problem if webui is not whitelisted?
Visual glitches and issues in img2img tab or some extensions dont work
Yea it takes a while, depending on what your goal is
Also with new extensions and features the learning never ends
SD itself is quite easy, learning its dozens of extensions on the other hand.
Start simple, play with it and add extensions to your workflow one by one.
I have some aims with SD, I guess I will have problems, so I will come here to ask experts for help. But now I'm watching a course on YT
As you just learned, be mindful of outdated tutorials on YT.
Text tutorials are usually easier to update and therefore often the most up-to-date.
You can't go wrong by simply reading the official manual / github page of whatever you're using.
I got some experience last summer, but I left SD and I have returned
Maybe I should look at text tutorial, but I get easily lost with that, I'm not sure where exactly to look for text tutorial
Also I am not so good at following text tutorial
But I'll consider
reddit, stable-diffusion-art, github page of whatever extension you're using, etc
I guess I will come here again if I have doubts with text tutorial
This channel for everything technical, for prompting there is #📝|prompting-help
which webui i can use instead of AUTOMATIC1111, im getting new errors with AUTOMATIC1111 every second
you could try comfyui like i did
is it working properly?
yes it is, here is the install guide which is also in pinned messages
What errors?
I've been getting cuda out of memory errors, now its not allowing me to use sdxl based checkpoints
i've had enough of it
What's your GPU and what's inside your webui-user.bat?
You're most likely missing the right performance settings
RTX 3070 ti
i've deleted the diffusion so. Don't have the bat
Oh xD
Yea you just missed the needed performance settings in the bat
Then sdxl would have worked
Can you give me tutorial link? I can always download it back
You can checkout my install guide in the Pinned Messages of this Channel. There is everything you need to know and tweak at the install.
Your card needs --xformers --medvram-sdxl --no-half-vae
Hello there!
I am trying to use (guided by yt video) MeshGraphormer Hand Refiner inside a ComfyUI but i got an error.
I am using ComfyUI installed by Stability Matrix on a 6900XT and Ryzen 5900X on Windows 11. I already found that this problem exists because of the amd gpu - my question is - is it possible to fix that, or do i just have to swallow the sad truth and avoid using MeshGraphormer on an amd gpu?
Error message:
Error occurred when executing MeshGraphormer-DepthMapPreprocessor:
new(): expected key in DispatchKeySet(CPU, CUDA, HIP, XLA, MPS, IPU, XPU, HPU, Lazy, Meta) but got: PrivateUse1
Yea that means it expected cuda or CPU but got directml, so that won't work
Is there a way to redirect it to using cpu?
If its a controlnet model and preprocessor you can run controlnet on CPU only mode
size mismatch for model.diffusion_model.input_blocks.0.0.weight: copying a param with shape torch.Size([320, 9, 3, 3]) from checkpoint, the shape in current model is torch.Size([320, 4, 3, 3]).
im getting this error
When did you get that?
just now
When switching the model or when generating?
switching
Most likely your webui is not updated then. Check the version numbers at the bottom in browser
version: v1.7.0 • python: 3.10.11 • torch: 2.0.1+cu118 • xformers: 0.0.20 • gradio: 3.41.2 • checkpoint: 6ce0161689
i've tried it twice so its the same error 2 times;
it looks like a mess
i mean it can be cause of the model maybe
let me download another one
Good model, downloading it
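A hedged reading of the size-mismatch error quoted earlier in this thread, since its cause is never confirmed in the chat: the second number in each shape is the UNet's input channel count. A 4-channel first layer is the standard txt2img layout, while a 9-channel first layer is what inpainting checkpoints use, so the message is consistent with an inpainting checkpoint being loaded against a standard model config. The sketch below only parses the message; `input_channels` and the lookup table are illustrative names:

```python
import re

# Input-channel counts of the first UNet conv layer for two common layouts.
KNOWN_INPUT_CHANNELS = {
    4: "standard txt2img/img2img UNet",
    9: "inpainting UNet (4 latent + 4 masked-image latent + 1 mask)",
}


def input_channels(error_text: str) -> list[int]:
    """Extract the second dimension of every torch.Size([...]) in the text."""
    shapes = re.findall(r"torch\.Size\(\[([\d, ]+)\]\)", error_text)
    return [int(shape.split(",")[1]) for shape in shapes]


error = (
    "size mismatch for model.diffusion_model.input_blocks.0.0.weight: "
    "copying a param with shape torch.Size([320, 9, 3, 3]) from checkpoint, "
    "the shape in current model is torch.Size([320, 4, 3, 3])."
)
ckpt_ch, model_ch = input_channels(error)
print("checkpoint:", KNOWN_INPUT_CHANNELS.get(ckpt_ch, "unknown layout"))
print("loaded config:", KNOWN_INPUT_CHANNELS.get(model_ch, "unknown layout"))
```

If that reading applies, trying a different (non-inpainting) checkpoint, as done here, is a reasonable first test.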
When we can expect the server to start the services again?
Idk, but we will get that information first in #1047610792226340935
Hehe, yea keenly waiting for it. I need to make a creative Fun Friday LinkedIn Post.
@ornate elk guess what, google bard can now create images like dalle3
If you dont mind waiting times, you can use the stable horde bot in browser, for free
Yea heard about it too
Is an account needed?
just your usual google account that you use for gmail
Draw a photorealistic image of a majestic golden lion standing proudly on a rocky cliff overlooking a vast savanna landscape bathed in the warm glow of the setting sun. Include tall grasses swaying in the gentle breeze and a few zebras grazing peacefully in the distance.
draw a visual representation of the following keywords in landscape format: a girl, beach, golden hour, surfing, ocean waves, orange sky.
kinda coarse but it got the context
Yea a good start
yeah, its powered by gemini
Are there any benefits from using ComfyUI over Automatic1111? or just different workflows? I tried ComfyUI and it was a bit confusing so I couldn't give it a fair comparison and don't know if I should invest the time in it since I am pretty comfortable in 1111.
Very interesting, I can't download these images on my Smartphone. Not even if I open them in browser
oh im not sure im on pc
Draw a candy’s enterprise
What file type are these images?
Thank you! It was quite easy to start generating, maybe it is worth looking into.
png
i dont buy into the idea of walking the extra length when ai should be doing the heavy lifting
If it add flexibility I am not against it
i have tried adapting to comfyui, the wiring is impossible
Its highly customisable with individual workflows making it good if you want the full control over everything.
But if you're familiar with auto1111 and just want to change some settings then there is no need to switch.
You can also have both installed and comfyui can use the models folder of auto1111, so no need to copy models over
I don't like the rats nests that can happen xD I get UE5 nightmares
For me comfyui is too much work to just get some images. You can waste hours making the perfect workflow. And when downloading other workflows you never know which preferences they were built around, or if there are better ones.
Full control sounds nice, especially if I get comfortable using it.
I feel there are a lot of shots in the dark with A1111 which might be lesser if I get comfortable in ComfyUI.
I don't mind spending hours, I usually play with A1111 while at work during downtime
there is an obvious pattern where txt2img are going if you look at dalle3 and now google imagen2, those will never require you to manage clutters of wire like comfyui and where google and dalle goes from here is to maintain ease of use
auto1111 is quite intuitive to say the least
also consider midjourney for ease of use... if users are meant to deal with underlying technicalities that defeats the purpose of AI
I like to have a lot of control ^^ but I am no picasso 😛
Or at least to know enough to have the level of control I feel I need
Midjourney and dalle are for the normal people who just want to get some instant output with nothing to worry, like apple users for example
I like having more Control over it, especially in the terms of digital Privacy
what you are currently missing should be provided through the webui interface, unless of course you want meaningless credibility by doing what should be automated
well comfyui interface design kills creativity for me and encourages more of a mundane tasks that Ai engine should be taking care of
But yea webuis like Fooocus are nice, because they do the heavy lifting mostly, if you just want some good outputs
a1111 has a nice balance of control and ease
although not complete
Yea auto1111 is the perfect mix
the LLM has been out long before text 2 image, and there are still lots of developments and improvements going on with LLM
so im guessing in time we will see more intuitive designs with text 2image
Yea for sure! In 2024 it will change a lot
hoping so, for sure
I think I will spend some time with ComfyUI, even if I am not super excited about the rat nests and debugging that mess 😛
Okay, I can't download these 3 images because of the file name length
yeah the filenames are EXTRA long lol, i pruned them for you here
Works now instantly thx
here is my take on what comfyui offers at present, the control ppl seek with it will be obsolete as Ai tech advances
That is probably true, and when it comes to that I won't mind using those tools as well.
taking too much time, 108s per it
is this normal
Depends on the settings and GPU, but normally not
i dont know about the setting but i am having amd gpu with 12gb vram
but still looking for knowledge on how to minimize time
Hopefully you did the steps required for amd to work in comfyui
By installing the Directml backend
yes yes
that is fast
but my images get pixelated
so i want some bigger quality image thats why i choose 1024
x1024
Did you use an sdxl model for that too?
no i didnt use sdxl model i used this one
@wraith bronze since you have already looked into comfyui with certain likings and limitations, have a look at SwarmUI
Oh thank I will, the more to try the better ^^
np, SwarmUI takes comfyui node orientation and offers a more easy to use interface unlike all the wiring of comfyui
Thats okay, most people use lower resolution because the 1.5 models are trained on a resolution of 512x512.
Then you need to upscale the image to improve the quality
Generating at 1024x1024 won't help with 1.5 models. It will give you a bad output
thankyou thankyou
Imagine
thankyou buddy
I'll look at it when I get home, right now I am remoting into my machine at home, Discord is not popular at work! 😛
create a portrait photo of mona lisa in modern art style
Prompt: Knight with bare buttocks running across the field to attack. Rear view. Funny. Anime.
@ornate elk unlike BING and dalle3 i dont see any limits or caps to using google bard to create images
Ah okay, maybe at first they will not limit it
oh no, i dont want them to set any tokenized limits lol
holding high hopes for google 🙂
draw an anime girl at a boardwalk on a sunny afternoon with candy floss in her hand. she is wearing a yellow t-shirt, black pants, and red sneakers.
i had to re prompt shorts from pants otherwise it wont generate image
SD1.5 is trained for 512x512, SD2 for 768x768 and SDXL for 1024x1024.
You can generally ask for about 30% lower/higher res for each of them with little to no visual artifacts.
Also, yes you can get much higher resolution for each of them but it involves upscaling.
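A rough sketch of the 30% rule of thumb above. The native sizes come from the message itself; snapping to multiples of 8 is an added assumption (the latent space is downscaled by 8, and the webui sliders move in steps of 8), and `safe_range` is a hypothetical helper:

```python
import math

# Native training resolutions as stated in the chat.
NATIVE = {"SD1.5": 512, "SD2": 768, "SDXL": 1024}


def safe_range(model: str, tolerance: float = 0.30) -> tuple[int, int]:
    """Side lengths within +/- tolerance of native, snapped inward to multiples of 8."""
    native = NATIVE[model]
    low = math.ceil(native * (1 - tolerance) / 8) * 8
    high = math.floor(native * (1 + tolerance) / 8) * 8
    return low, high


for name in NATIVE:
    low, high = safe_range(name)
    print(f"{name}: roughly {low} to {high} px per side")
```

By this estimate 512 px sits well below the comfortable range for SDXL, which matches the poor 512x512 SDXL results reported in this channel.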
a girl inside a plane flying above the mountains
ok i think i should stop with all the images
Maybe we should port SD to minecraft redstones?
it took a year to complete this queue with sdxl, maybe my machine is not built for sdxl
asking for 512 height with an sdxl model ? chances are high your output will be trash.
so what is the best size for that
Here's a list of some possible resolutions for SDXL:
- aspect ratio: 4:3 - 1152x896
- aspect ratio: 2:3 - 832x1216
- aspect ratio: 9:16 - 768x1344
- aspect ratio: 21:9 - 1536x640
- aspect ratio: 1:1 - 1024x1024
- aspect ratio: 3:4 - 896x1152
- aspect ratio: 16:9 - 1344x768
- aspect ratio: 5:4 - 1152x896
- aspect ratio: 3:2 - 1216x832
- aspect ratio: 4:5 - 896x1152
- aspect ratio: 9:21 - 640x1536
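One way to use the list above is to pick the entry whose real width/height ratio is closest to the frame you want. This is only a sketch: the sizes are copied from the list (duplicates removed), the helper name is made up, and the ratio labels in the list are approximate, so the code compares actual ratios instead.

```python
import math

# Unique (width, height) pairs from the SDXL list above.
SDXL_RESOLUTIONS = [
    (1152, 896), (832, 1216), (768, 1344), (1536, 640), (1024, 1024),
    (896, 1152), (1344, 768), (1216, 832), (640, 1536),
]


def closest_sdxl_resolution(width: int, height: int) -> tuple[int, int]:
    """Return the listed size whose aspect ratio is nearest to width:height."""
    target = width / height
    # Compare ratios on a log scale so 2:1 and 1:2 are treated symmetrically.
    return min(
        SDXL_RESOLUTIONS,
        key=lambda wh: abs(math.log((wh[0] / wh[1]) / target)),
    )


print(closest_sdxl_resolution(1920, 1080))  # (1344, 768)
print(closest_sdxl_resolution(512, 768))    # (832, 1216)
```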
That would be awesome haha xD
you could try 512x768 (width x height) with sdxl if you want to save up some resources, but best not to use 512 for height as @vocal burrow pointed.
im going to test that out right now on sdxl model
which one is best for performance
upscaling latent or upscaling image
Draw the nine circles of Hell are like big mountains covered with snow. little devils wander around each of them
512x768 and flipped on SDXL along with hires fix by 2x
what is your pc configuration
it came out fast in your case
well for one i used lower res as i just mentioned that saves up memory and im using rtx 3060 gpu with 12gb vram
right now its stuck in ksampler for 4 min
im using automatic 1111 btw
i am also having decent graphic card 6750xt 12gb ram
is that AMD gpu?
yes yes
ahh ok, AMD gpu compared to nvidia specs are bit on the lower end in SD optimization
okay
Wouldn't recommend using sdxl with AMD 12gb GPUs, its slow and you have a better time with 1.5 and upscaling
If you want to try out auto1111 for AMD, checkout my install guide in the Pinned Messages of this Channel
look at that my ram and disk got bottle neck
Is that right now while you generate?
Because your GPU usage is at 0%
this happens, i dont know why, when i get stuck at ksampler
its always my ram and disk that get bottlenecked
You should install any webui on SSD, and then store only the models on HDD
Most likely your windows pagefile is stored on the HDD, you would need to check that and switch it to the SSD
@ornate elk as typical as it is, bard too is sensitive to prompts over ridiculous words
i installed windows on the ssd, you can see its the c disk
I know, I'm not talking about windows
The pagefile is a file that is used for caching (virtual RAM), it gets used when your Ram usage is maxed out.
That file should be located on C by default. But for some systems its stored on their HDD, slowing down everything
Ahh okay xD
thankyou thankyou
an example
Ah well the typical over censoring
i forgot to replace her with him, but it worked
problem solved
lol
What was it?
me do not know
Okay what did you change? xD
page file, and you can see there is no ram or disk bottleneck
Thats a common error for the Directml Backend. Not enough vram for that resolution
Perfect
So it was on the HDD as I thought
yes it was
Hello
can anyone help please
i am using amd gpu 12gb ram, comfyui
while using sdxl 1.0
Lower the resolution to 768x768
So, memory leak (python) on automatic1111 on Ubuntu. Any advice on how to keep it from accumulating?
Have you restarted before you tried?
no
Because every time you get that error you need to restart comfyui
Hey guys, if we ignore the benefit of Vram for training, is there any scenario where rtx 4070 is better than rtx 3090?
Is there any latest benchmark for stable diffusion I can check?
you dont need to you can check it on website based on cuda core
J
I remember there were some accelerations released later which boosted the performance for the latest rtx. Since I have been away for a while, I thought I should ask if those accelerations were only for rtx 4000 or for old generations too.
Can't remember the name of the acceleration or if its released by nvidia themselves.
how upload image
Can I add google LUMIERE to my AMD ui or is that not yet available?
Has anyone gotten instant-id in auto1111 webui to work well? It works but the result isnt very good. Colors are usually washed out, weird contrast, etc.
#🤝
How long the bots will be down?
Can you show an example of said messed-up image?
from the looks of it. your cfg might be too high
what did you set it at?
wait no. it might be vae problem
I've tried a range between 4 and 6
best if you post your image here with the metadata
Does anyone know how to install comfyui to a docker server
to do so, drag the image from your folder into this chat.
Does it require a vae? I think it depends on the model, in this case I've tried Juggernaut and Albedo, both sdxl
i have no clue till you share more info.
best way is to send one of the messed-up images to this chat from your folder.
dont copy paste the image to here.
followed this to a T. https://github.com/Mikubill/sd-webui-controlnet/discussions/2589 same settings. tried cfg between 5 and 6. Only difference might be the main model
Instant ID project https://github.com/InstantID/InstantID Instant ID uses a combination of ControlNet and IP-Adapter to control the facial features in the diffusion process. One unique design for I...
please send this. Would be really helpful.
Grabbed a different image since I was using a personal one. I even grabbed the similar positive and negative prompt and steps (also tried lowering the steps) from an example from the model page.
The colors are a bit weird, since this model has VAE baked in, I dont think I need to provide another one
@unique glacier
This is an image I've made in Blender using two 3D models from a video game. I want to run img2img on this image to generate another image that should be similar, but with a better appearance.
I'm looking for a model which allows me to get a result which is as close as possible in style to this image, which is an official render of the character, but I don't know what model to download. Could anyone help me with that?
see if this fits https://civitai.com/models/118086/3d-animation-diffusion
Cool!
I'll check it out
Noice
Yea it works okay
What models have you tried with it, every one I try has not great results, usually ends up blurry and odd colors and contrast.
Tried it with Dreamshaper xl or juggernaut xl
I tried with Juggernaut and it doesnt come out very well. Pretty much followed the examples to a T.
Its not that good but if you lower the cfg to 4 its ok
an example of instant id but i used cfg scale bit high in this one. can you show your output for reference?
Im not at the PC to share examples
I posted an example if you scroll up
Is your output supposed to be that bright? I can get some similarities to the face, its kind of everything else that falls off.
looking at your post, are you using any lora, embeddings?
no
yes its supposed to be like that given the settings i used
try the control weight and end step both to around 0.8
0.8 helps a little. Ill mess around with some stuff. Thanks for the assist
sure, you could try lowering the control weight a bit more
53 seconds at 15 hires steps
45 seconds at 10 hires steps
Every other setting isn't changed in those two generations.
X2 upscale, esrgan_x4, 0.38 denoise, euler a, 41 sampling steps, 1x1 batch, 512x768 size, CFG 5, Realistic Vision 6
Are those timers acceptable on the faster or slower side for single images generations?
Thanks in advance:)
I also am using --xformers --medvram-sdxl and --no-half-vae
would be helpful if you mention your gpu spec
I didn't want to mention it because I thought there is like an average estimation time that images generally take to generate. Like 30 seconds for an image at 30 steps being good, 20 seconds very good, 50-60 seconds being acceptable but +2 minutes not good
- if that makes sense 😅
But sure -
RTX 4060 mobile 8GB VRAM
I7-13700K
64GB RAM
all that would largely depend on your gpu power, sd parameters come later
i would say fairly ok with that gpu
Whilst browsing I saw a comment like this and thought - wow is his GPU that fast or do I have to sort out some issues with mine?
but to optimize your generations, you could go with 30 steps and 10 steps for hires
But then there was another guy who said it took him +40 minutes for a single image because he's generating on a potato PC, so I wanted to check where I stand with mine 😂
Most checkpoints work best at around 30 steps anyways iirc no?
Like that there wouldn't be too much benefit/different from 30 to 70 or so
your gpu is primary when it comes to speed, then comes ram and cpu, and after that it would also depend on the models you are using and other settings along with lora in some cases
30 steps is ideal cause in most cases you won't notice strong artistic differences going above 30 steps
Okay 👍🏻
So checkpoints do alter generation time and even loras can be a factor for that, I thought of that before but the biggest difference so far I saw was from different samplers
if you are extremely picky about fine detail which is usually a rare case scenario you could try steps in between 30 - 50, anything above is waste of time
For details that most often the only Lora I actually use is one to increase/adjust details so I think I can safely stay at 30 yea
yes they do and to be specific SD checkpoints and XL checkpoints vary greatly, but not to be mistaken each sd or each xl would vary in resources
Ah okay I was just gonna ask if SDs are faster than XL haha thanks
XL are more resource demanding due to their design and size
btw @ornate ruin speaking of steps, there are LCM lora and turbo models that can produce pretty good results with as little as 6-8 steps
Really hahah I might check that out for fun if I stumble upon one of those on Civit
I have like 20 checkpoints installed rn bc I install all checkpoints and loras that I see if they were used on an image I like 😩
Its also common knowledge that on the free tier if Google 'catch' you using an SD GUI then they'll ban you.
Colab is there to run interactive script not GUI's
Diffusers scripts are safe; AUTO, Comfy, InvokeAI etc. not so.
8 steps and cfg scale 2
I don't even know what that means to be completely honest
I'm just running SD Automatic1111 locally on my PC is he not speaking of the same thing?
¯_(ツ)_/¯
Oh that's actually a good image, nice
yes its leveraging LCM technology, you can learn about it here https://huggingface.co/latent-consistency/lcm-lora-sdv1-5
cuts down generation time more than half and speeding up your workflow
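For anyone who prefers a script over the webui, here is a minimal sketch of the same idea with the Hugging Face diffusers library, using the 8 steps and CFG scale 2 quoted above. Assumptions: diffusers and torch are installed, an NVIDIA GPU is available, and the base model ID `Lykon/dreamshaper-7` is only a placeholder for whatever SD 1.5 checkpoint you use. This is not the setup used for the images in this chat, which came from Auto1111.

```python
# Settings quoted in the chat: 8 steps and CFG scale 2.
LCM_SETTINGS = {"num_inference_steps": 8, "guidance_scale": 2.0}


def generate_with_lcm(
    prompt: str,
    base_model: str = "Lykon/dreamshaper-7",  # placeholder SD 1.5 checkpoint
    lora: str = "latent-consistency/lcm-lora-sdv1-5",
):
    """Generate one image with an SD 1.5 model plus the LCM LoRA."""
    # Imported lazily so the settings above can be inspected without a GPU.
    import torch
    from diffusers import DiffusionPipeline, LCMScheduler

    pipe = DiffusionPipeline.from_pretrained(base_model, torch_dtype=torch.float16)
    # LCM needs its own scheduler in place of the model's default one.
    pipe.scheduler = LCMScheduler.from_config(pipe.scheduler.config)
    pipe.load_lora_weights(lora)
    pipe.to("cuda")
    return pipe(prompt, **LCM_SETTINGS).images[0]


# Example (downloads several GB on first run):
# generate_with_lcm("portrait photo, detailed").save("lcm_test.png")
```

The LCM LoRA must match the base version, so the sdv1-5 file above only pairs with 1.5 checkpoints.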
Is that something someone can learn? I thought it's just a Lora/checkpoint you'd use. LCM technology makes it sound un-beginnerfriendly
I don't do much anime/cartoon stuff but how long did that image take you roughly?
you can learn about its application and the files you need to grab
Ah alright
No he's talking about running Auto1111 on the server Google provides for AI development, there's a free tier which people were abusing to do SD remotely for free.
Google clamped down on it, if you pay them they allow it
2 seconds probably, it was one of my older gen, not now
Okay no I don't do that.. thanks for clarifying
Neat
np, but LCM is not limited to anime, you can apply that on realistic images too
@ornate ruin this for example is also with 8 steps and 2 cfg scale
Sick
took me about 2- 3 seconds at best
i have just completed this one and it took like 50-60 seconds roughly
working on improving the time, thanks ill most likely take a look into LCM next
that timing sounds fairly reasonable on your rig
and SD setup
Okay that's good to know tho 🙏
Is there documentation on that model to read? (I am not so good at searching for that yet)
all of the details on using it are on that page.