#✨|sdxl
i dont understand why it keeps doing this with sdxl.
i have to restart forge in order to get it back to normal.
yes i get that error sometimes on forge XL
i dont know why it happens though.
it's getting on my nerves
...it's fucking cutoff
i turned it off and it worked
this is why sd1.5 prompting for sdxl isn't a good idea
it's prob not cutoff, because i use it and that doesn't happen. it happens when u enable an extension, gen some imgs, then disable that extension and enable it again
pony uses its own tags plus the booru tags because it was trained that way, so it would be a bad idea not to use those tags
if it is still sdxl at the core, as in trained from the base model, then the clip information is still relevant. afaik, models like pony are just lora merges
yeah, i need cutoff. it's mixing colors way too bad.
where they trained a bunch of loras and merged them into a model like the base model
wrong, pony is not a lora merge lol, it was trained with 2.5 million images
from 100% scratch?
where they spent hundreds of thousands of dollars renting a server farm to make a whole new base model?
oh wait a sec you are that guy who was saying the gov was reptilian right
ofc you would say that
i like how far this devolved.
yea i remember that guy its the same one who claimed xl controlnet models were superior to 1.5 ones
dude it's still not working and i restarted forge and immediately turned on cutoff in the new window
they are just as good lol.. you're just using them wrong
ofc you would know that,you are the expert here with millions of models on civitai
ok try to disable the extension on extension list
restart ui, enable it, restart ui again?
or try to just enable it and then leave it on
yea disable it on the list of extensions then restart ui again
okay so i disabled it, reloaded ui, enabled, reloaded, testing now
that's barely even over half strength. i was going to troll around with some husbando crap instead of the typical waifu stuff people always spam. This still has the same prompt from earlier helping him figure out the sheer sleeves thing
it still errored out. i'm gonna try and fresh install it
do u have any other extension?
these are the others i've installed
try with an empty prompt maybe it will work
it only messes up when cutoff is turned on...
okay so cutoff is on and empty, and it didn't error out
now dont disable it and try to gen a few imgs like a batch of 6
should i try putting text inside of cutoff?
no just leave it empty
if it works try another batch but with a cutoff prompt
with cutoff, you just put in the colors right?
like red, blue, red, purple etc etc
it works with other stuff too like butterflies for example
it's better with just colors because that's what bleeds onto other stuff. another thing that tends to bleed is "frills"
but depends on the model
ahh, got it. okay so it ran the batch just fine
with an empty cutoff
error'd out with a filled in prompt
works with a cleared prompt
yea in my case it fixed itself. was hoping for a fix but forge has been dead for 2 months
that's so stupid
i'll try and clean install cutoff, because i really need cutoff.
AHA! I FIXED IT
It was embeddings I was using!
he made his last commit on github a day or two ago, so he's alive
just to a private repo
i removed the embeddings zPDXL and zPDXL-neg and it worked
zPDXL must use color tags or something

a rodant!
it does not look like sheer sleeves works here
because it's not a booru tag
see-through sleeves is a booru tag though
yea this one works see-through_sleeves
i'll try it after this batch. cutoff seems to not be working again unless i have the colors in the wrong order

Hello
I don't know where to place this question but i tried hard to get decent images from various sdxl checkpoints and workflows in comfyui but still i can't get the image quality which I can generate so easily with SD1.5.
Is this a common issue?
I know this is a very general question but is this normal to struggle so hard with sdxl in the beginning?
don't, feed, sdxl, prompts, like, this and type more like this. place things from most to least important and stay under 40 or so words (<75 tokens). your negative prompt doesn't really need much other than maybe nsfw, text, watermark. other than that, there really isn't any need for the mile-long sd1.5 negative placebo prompts (most of which are just shifting the prompt seed, and similar results could have been obtained by inserting empty tokens like , , , ,). Also, make sure your latent image size is set right, like 1024x1024 or some other combination that multiplies to ~1 million pixels. 896x1152 is a pretty good one for portraits. I think 1344x768 is a pretty common one as well for landscapes and is almost equal to a 16:9 ratio (good for wallpapers)
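A quick sketch of the "multiplies to ~1 million pixels" rule from the message above. The 64-pixel step, the 5% tolerance and the side limits are assumptions picked for illustration; they are not from the chat.

```python
def sdxl_resolutions(target_pixels=1024 * 1024, step=64, tolerance=0.05,
                     min_side=512, max_side=2048):
    """List (width, height) pairs whose area is within `tolerance` of target_pixels.

    step, tolerance, min_side and max_side are illustrative assumptions.
    """
    sizes = []
    for width in range(min_side, max_side + 1, step):
        for height in range(min_side, max_side + 1, step):
            area = width * height
            if abs(area - target_pixels) / target_pixels <= tolerance:
                sizes.append((width, height))
    return sizes


if __name__ == "__main__":
    for w, h in sdxl_resolutions():
        print(f"{w}x{h}  ({w * h / 1e6:.2f} MP, ratio {w / h:.2f})")
```

Both 896x1152 and 1344x768 from the message land inside that 5% window, while 512x512 does not.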
don't set your cfg too high either, stick to somewhere between 4 and 7, the realism models like juggernaut prefer lower cfgs like 3-5
well the lower you go, the closer you get to the reference dataset and the less your negative prompt works. personally, i stick around 4-6. i always laugh when someone posts some "really good portrait" image and you can pretty much automatically tell they set the cfg to like 1 and you're basically getting an almost pure dataset image that completely ignored their prompt
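For anyone wondering why the negative prompt fades at low cfg: a toy sketch of the usual classifier-free guidance formula, assuming (as the common UIs do) that the negative prompt takes the place of the unconditional prediction. The numbers are made up; real predictions are large tensors.

```python
def cfg_combine(cond, uncond, scale):
    """Classifier-free guidance: uncond + scale * (cond - uncond), element-wise."""
    return [u + scale * (c - u) for c, u in zip(cond, uncond)]


if __name__ == "__main__":
    cond = [0.8, -0.2]    # toy prediction for the positive prompt
    neg_a = [0.1, 0.4]    # toy prediction for one negative prompt
    neg_b = [-0.5, 0.9]   # toy prediction for a different negative prompt
    for scale in (1.0, 5.0):
        print(scale, cfg_combine(cond, neg_a, scale), cfg_combine(cond, neg_b, scale))
```

At scale 1 the negative term cancels out and only the positive-prompt prediction is left, so swapping negative prompts changes nothing. At higher scales the two negatives push the result in different directions.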
yeah sometimes, but that's also good for if you typically do cleanups in photoshop afterwards. that way, you have more room to play with the darks and lights. depends on the cfg and the sampler a lot too
well and the model in general
if u are talking about anime style pics, u need pony XL to get quality similar to 1.5 checkpoints. go to civitai and download it from there, and also read the guide on the pony page to know how to prompt with it
damn, what prompt did you use for the 3rd image?
info should be in pic
only you can see info
PMed u
thanks 
Does anyone know math function for sdxl prompt weights?
Is there any ways to use XL for free? I've used SD through Google Colab before
locally
Okay what if you're broke
i don't think it would be possible to use it for free then, other than locally. even colab needs money
But Colab is cheaper than running locally if you just want to run a few gens right
then u need to use colab
The colab I was using didn't use SDXL
i know that it's possible to use your custom models, but colab will provide u a GPU
u need to read documentation
Ty bud
try to save some money for own GPU
3060 12GB
That all you need?
examples?
Awesome
you can easily go higher than that, up to 15 from experience
anyone knows a good strategy to fix this? adetailer doesnt fix it, happens a lot for me with people wearing glasses
you could try to just run the seed back through the prompt with an upscaler.
1+1=3 anyway hi
If anyone is interested in a free Alternative to Midjourney, I've been working on a bot that can generate images at a similar level if not better. The images generate as fast as 8 seconds. LMK if you want to test it out, we are looking for feedback and suggestions from anyone who is willing to help out.
This model works very well with RealVisXL V4.0 (Lightning). 0.8 strength is good. Upscale to obtain better results. Add Lego word is recommended. W...
is it a model?
Life of an artist
A digital illustration of a steampunk library with clockwork machines, 4k, detailed, trending in artstation, fantasy vivid colors
Here are the images you requested.
WDYM?
it is a discord bot that uses a SDXL model for txt2img
Dude you are the GOAT. I use 25% cute 3d render lora on all my merge models.
now this...
thank you
for this I suggest to use with Realvis lightning 4.0
realvisxl 4.0 lightning
A model wearing a black off-the-shoulder dress, pearl earrings and a small gold chain with a delicate flower pendant around her neck. The background is a solid color and simple, highlighting her profile. She has blue eyes and long hair in loose waves, giving an elegant feel to the overall look of the outfit. Her expression radiates confidence as she poses for the camera, showcasing a stylish and fashionable style in the style of a fashionable portrait., vertical aspect ratio
thanks
so one question though. what's the possibility of removing the need for the trigger word? is that something I can do myself in any way?
I like using that cute3drender with a merged model for just an overall look to everything.
I see that it doesn't particularly kick in without "lego world"
for me it works without trigger
but the trigger helps sometime
so this is the same prompt and seed, but without lego world. the look isn't complete like it is with the trigger
disregard the hands on chest, that's probably because of the detailed tattoos.
a stone giant wrapped in chains and tattoos is crouching in front of a gate looking down at a small group of diminutive people who are trying to gain access to the cloud town.
this is what I got
i wonder if that's where that model comes in.
@cyan crown ok, i put the strength to 1.5 and now it's very consistent even without the trigger word
on my model
BTW it seems to work well
yeah this is great stuff.
final value ended up being 1.25
the lego hands really solidified with the new aligned steps nvidia scheduling as well
Why is every video about ELLA Chinese...
Because the creators of it are Chinese and because China has well over a billion people?
A lot of the AI advancements have been driven by Chinese researchers
I dunno about you guys, but that aligned steps sigma stuff fixed guns for me. They're all mostly correct all of a sudden.
it can fix things for sure 🙂
And sometimes German
robutussy
"Design a vibrant and avant-garde sticker featuring an asymmetric layout with dynamic, irregular geometric shapes, such as fragmented lightning bolts and stars. The primary colors should be bright blue, vivid green, and shiny silver, contrasting against a glossy metallic silver background. This sticker is intended to be applied to a cylindrical surface."
Here is the image you requested.

A colorful birthday celebration with balloons, streamers, and a delicious cake. The cake has the words "Happy Birthday Sharvi" written on it in frosting.
Hey guys im using Stable Diffusion XL Turbo on ComfyUI and these are the images im seeing how do i fix this
Draw a Godzilla
"Design a vibrant and avant-garde sticker featuring an asymmetric layout with dynamic, irregular geometric shapes, such as fragmented lightning bolts and stars. The primary colors should be bright blue, vivid green, and shiny silver, contrasting against a glossy metallic silver background. This sticker is intended to be applied to a cylindrical surface."
show the workflow, you're probably not using the right scheduler settings
That worked even with 4 thanks a lot
from the comfy examples
ahh good, np
if youre doing 4 steps, just switch to a lightning model, they are better in pretty much every way and do things at 1024 instead of 512
and you dont need to use the custom sampler with them either
Can anyone send me a link for SDXL controlnet? I had to use SD 1.5 because I couldn't find SDXL.
I meant a link for the controlnet.
Which "controlnet"?
let me show you sd 1.5
Do you mean the model? There is a search box on that site.
https://huggingface.co/lllyasviel/ControlNet-v1-1/tree/main I used this one: control_v11e_sd15_ip2p.pth, is there the one for SDXL?
https://huggingface.co/ckpt/controlnet-sdxl-1.0/tree/main Which one of these is a similar one but for SDXL?
I tried a few of them but it wouldn't process it and instead stopped at that point.
Just search "controlnet sdxl" and choose the one you want
ok. thank you
also any recommended hand fixer models for SDXL?
thank you, I'll try it
testing wow lora
@humble cape Did you change the sampler in fooocus? I did. I find that default brings in more candid details whereas euler-a makes the scene look fake.
From the series: Inconvenient living
ha!
I haven't changed the samplers
interesting
@wet nacelle if you change the sampler you get better results ?
I mean in theory yes. Chance.
Do you run fooocus locally? What is your gpu and how much vram? i plan to buy a new pc
Oh okay. Yes I am running local.
I have a 3070TI with 8gb vram.
8gb is enough ?
I would recommend >=12gb. NVIDIA, obviously.
4gb is for fooocus
"Some" skin on show.
Hoptimus Prime.
Yes but fooocus waters things down under the hood to achieve such a friendly VRAM situation. The same generations with the same settings on Forge yield far superior results so be aware.
This is also fact
FUCK AWF YOU FUCKER!!! FUCK WINDOWS!!!
Hey everyone. I'm looking for a cloud API for Comfy. Does anyone know of one available?
Not something like a vm paid by the hour, just a cloud API hosting Comfy that's always available where you pay for usage. (I remember seeing something like it in some video ages ago, but no idea how to find it now.)
Ancient tech bruh
Comfy is ancient???
Oh. Well if you know of a cloud API for Comfy I really need one. 🙏
A1111 / forge etc. will do in a pinch though.
sorry I don’t 😔
how do you guys determine when a model is obsolete? when you have 50 6gig models laying around it's becoming a problem
hard to let go coz each model has strengths
:/
Back them up onto an old external harddrive, label them "SD Models to Keep", and then put them on a shelf.
Fond memories a few years from now when you dust it off and find the old label. 🙂
You know, back when SD first came out, I was keeping every good image. It took a lot of careful prompting and re-rolling to get good images. It would be crazy to delete them. ...But then...
I feel ya
also the images take up some gigs
but the damage is nowhere near as bad as the models
esp SDXL
which reminds me
how big is SD3? :0
i have 250 gigs of models
But really, what models do you have that you regularly use? I use Proton for prompt-following, CrystalClear's two latest for lora / controlnet compatibility, and Juggernaut Lightning for instant generation. I have a dozen others on my disk that I downloaded to test, and couldn't give me anything special vs. the others.
good thing i learned how to link everything to one folder (A1111)
during my first foray into this i didn't know you could do that
lol
disaster
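The chat doesn't say how the linking was done. One way it could look is a symlink per model file, so several UIs read from one shared folder. This is only a sketch: the function name, the folder layout and the `.safetensors` filter are all assumptions, and on Windows creating symlinks may need extra privileges.

```python
from pathlib import Path


def link_models(shared_dir, ui_dir, pattern="*.safetensors"):
    """Symlink every model file in shared_dir into ui_dir; return the links created.

    shared_dir and ui_dir are hypothetical paths, e.g. one big model folder
    and a UI's own checkpoint folder.
    """
    shared, ui = Path(shared_dir), Path(ui_dir)
    ui.mkdir(parents=True, exist_ok=True)
    created = []
    for model in sorted(shared.glob(pattern)):
        link = ui / model.name
        # Skip names that already exist so a second run is harmless.
        if not link.exists() and not link.is_symlink():
            link.symlink_to(model.resolve())
            created.append(link)
    return created
```

The links take almost no disk space, so the 6gig files only exist once.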
yeah i only use maybe 5 regularly
more realistic stuff
The only other really "special" model I have is dreamlike photoreal, which can do photo-realistic fabric textures. But it's 1.5 and getting it to actually give me what I want is a huge pain.
I wonder what SD3 will replace... Probably not Lightning. Definitely Proton. Hmm.
chacolrealponyworldm_v10 right now is probably the most realistic sdxl model
i like to get the ones that came out like last week
the more time in the oven the more refined lol
its amazing how far sdxl came in 6 months
i remember the first models
so i can just imagine what refined SD3 will be like
Pasted into search and found nothing.
I use SD for artwork with ControlNet. (And I'm working on a painting app with it built in.) My priorities are controlability and prompt-adherence.
well it is pony so fighting the nudity is a pain
But what's the actual name?
Model Description This model is a realistic Pony model. This will be an experimental model that is not up to the standard I am seeking for myself. ...
Welcome to RealityFuse XL! Hello everyone, this is my first SDXL model. With this model, it is possible to create more realistic photographs. The a...
That looks very... focused.
as for plain sdxl realityfuse is very realistic
What is a pony model?
idk
sdxl thats more focused on nudity?
the models do great anatomy but nothing else
if you try to set your image to be anywhere other than a bedroom the backgrounds are garbage
with ponies
Let me see if it can do hands at least then. Why ponies?
Nevermind it says it will take 2 hours to download? Why? Civitai has never been slow for me before. Always like ~1 min downloads for SDXL.
i hate going to civitai i always find new stuff i wanna try
i was supposed to archive models!
Oh well. I'll let the download manager run in the background and try them later. RealityFuse does look like great skin and fur texture.
This makes perfect sense.
we heard you like wings, so we put wings...
The new Hyper-SD models are FREE and there are THREE ComfyUI workflows to play with! Use the amazing 1-step unet, or speed up existing models by using the LoRAs.
Better than LCM? Take a look for yourself and see!
Want to support the channel?
https://www.patreon.com/NerdyRodent
Links:
https://huggingface.co/ByteDance/Hyper-SD
== More Stable D...
Soon SDXL will be so fast we will actually run at -30 steps. As in we can go backward in time while generating. =0
The time machine will be built out of Nvidias not Deloreans.
Nvelorean
sigh. ok now i'm doing these in 8 seconds.
cats for days
hah
I love this! Big occasion
Been testing out an overkill combo of the align your steps(AYS)+AutomaticCFG+perturbed attention guidance(PAG). It got a bit confused though still with the various colors in the prompt for the instrumentation and the walls, but it still produced some pretty neat results. I should test some with sd3.
"in the style of retrofuturism, an advanced laboratory with pale {red|yellow|orange} analog instrumentation, analog screens, retro tech, pale {duck egg|coral|pea} pastel colored walls, (liminal:1.15), 60s film aesthetic" (I generated a bunch and it randomly picks one of the things inside the curly brackets, so like red and pea or yellow and coral, etc)
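A rough sketch of what that curly-bracket syntax does: each `{a|b|c}` group gets swapped for one random option per generation. This is not the real extension's code, just a small regex version that ignores nesting and weighted options; `expand_prompt` is a made-up name.

```python
import random
import re

# Matches one {...} group with no nested braces inside it.
_CHOICE = re.compile(r"\{([^{}]*)\}")


def expand_prompt(template, rng=random):
    """Replace each {a|b|c} group with one randomly chosen option."""
    return _CHOICE.sub(lambda m: rng.choice(m.group(1).split("|")), template)


if __name__ == "__main__":
    template = ("an advanced laboratory with pale {red|yellow|orange} analog "
                "instrumentation, pale {duck egg|coral|pea} pastel colored walls, "
                "(liminal:1.15)")
    for _ in range(3):
        print(expand_prompt(template))
```

Weights in round brackets like `(liminal:1.15)` pass through untouched, since only curly groups are matched.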
Does anyone know a model that makes a similar style to Chinese Donghua? (CG anime)
a lora based off of around 32 images of midjourney dark fantasy images :) doesnt work well great with sdxl base model but works really well with ep...
thank you
sorry, i wasn't originally responding to you (didn't see ur message above mine), but np anyway
I'll try looking more into these models. I really liked anime CG when I first watched it.
@willow stone
yep
worms
Cheetos
need beer plane for this
Currently working on ground support.
looks like a carpet
T-1001
It's finally built:
🙂
Hey all,
Anyone has a solution for sdxl turbo upscaling (without weird ghosting and duplicate things)? the 512 resolution is quite low, but putting anything on top of sdxl turbo messes up the image 😦 no matter what ive tried
how about upscaling externally? Or within comfy?
I mean I wanted to automate that step, so I get better quality results straight away without needing to use another app or service when i get a decent result for a prompt
so you aren´t using Comfy I at least assume?
what webui thing? 😄
btw if you like it on the easy side (including upscaling with realESRGAN x4/inpainting/drawing): https://github.com/cmdr2/stable-diffusion-ui
sounds good, but in reality its the same thing to launch as the one I use 😄 messing with terminal all the time is what I don't really like with these solutions (I mean how hard it would be to make a launcher that does the same thing in the background)
not quite sure what you are referring to exactly 😀
brooo someone made a chappie lora for xl, that's dope
dog food? 😀
n...no
SDXL Chappie Character LoRa Trained on various images and digital 3d renders of chappie. Does best with portraits and not so great full body but st...
🤭
😄
Guys how do i install SD roop? i did all the steps and installed all the requirements in a1111
worm gum
remember i give you teamlab style for me its not works
why not?
dont know result not same beautiful
Experiment 🙂
portal
Does anyone have experience with DW Pose preprocessor? It doesn't seem to work when I use it in a workflow with a certain images and I don't know why.
half life
Some SDXL using ComfyUI
Hello, use 0.8 (but also 0.5 to 0.8) strength to get images. With upscale go with 0.6. Use "Retro Futuristic Image" as keyword. Sample images done ...
Fun tip for the day. Just add hyper detailed in front of everything.
Hell yeah, I've been on a retrofuturism kick lately. I'm not big on using loras, but I'm definitely trying this one out later
Abe - "What have you done to America?"
cool!
it's a while since your last model 😁
it's very different from the retro futurism prompt in base model. More detailed/realistic, a little bit like old scifi games
yes
doen
real 😂
also what did you prompt to get this level of realism?
i type in photorealistic and hd but i can't achieve such results lol
wrong prompt obv
look at those images... and think of what "photorealistic" usually means: very sharp, crisp, often actually CGI that looks, well, photorealistic
how would you describe those images? probably low quality, cell phone photo, 2000s photography, 90s photography, shit like that
bro how do you generate image in here
Put a deformed midget in there.
David Lynch like
in SDXL?
pixart.
I finally merged my stuff with cosxl. it knows how to do actual true total darkness.
Is it now a pixart model or an SD model?
If anyone isn't able to merge their own model with CosXL, or just wants to try it out, I've uploaded mine here https://civitai.com/models/240590/ultimateblendxl
There were several models that I really liked the results of, but I didn't want to have to keep swapping between them to get the results I was afte...
Cat on airplane
Sunset clouds fly alongside a lone wild duck; the autumn river shares one color with the endless sky
SD3
A woman stands on the beach.
A group of children were playing in the forest and stumbled upon a small ginseng plant, and the ginseng elf was smiling beside it.
Not so good
?
This one beautiful
SD3 uses another type of diffusion. So no
Not open source one?
Wow text is good
another type of model. There are papers if you search
No I'm asking is that model open source so i can use it to build something
Sd 3?
1.0 ?
SD3 is not yet released
only api
Api is
How text ?
Is this sdxl 1.0 ?
Can you please dm me the link to the model
I'd like as much please
Can you please make a meme with text
Lol this is the first one pop in my fb feed like this for example
@shy kelp
You mean cant make memes like this with that model
Like This
Oh i want to make memes with a ai model
It's just fun i can put my meme ideas into reality
Yeah that's what i thought
But if the model can generate text it can fit the meme well with the font and colors etc
This is really good
Dall e 3 ?
Or sd 3?
Model that you trained ?
No way
That's good, even dall e has some issues with text
i wish my dreams were this creative or were this good looking
you need to replace the clip text encoder with a real language model like pixart and ella have done. sd3 does this as well along with giving a new base model and some added abilities.
just fool around 🙂
Smail-Lee´s loRA is simply a trained loRA from what I at least think to know
'
that's really awesome i liked your art a lot
yes you're absolutely right..thanks a lot!
If you call this art then you're insane.
No person that only uses AI can be called an artist. That label goes to the devs.
I agree with this more
actually you're right..totally agreed on this
Thankies.
...that depends on the definition of "only uses AI"
Stop right now
It makes me furious that you want to do this right now.
You know what I mean. You can't spin my shit.
there's no "Spin" there, only clarification, jeez
A person that calls themselves an artist when using solely AI. This applies to cruddy sketches inserted into image2image.
well, if you have a vision, then you're an artist, maybe just barely in many cases, but yeah
I say no but at this point you are bringing in subjective viewpoints.
I'd say that I'm not being subjective.
i'd say i'm being objective, lol
your determination that i'm subjective is in itself subjective 😛
look what I drew
def artist: An artist is a person engaged in an activity related to creating art, practicing the arts, or demonstrating an art.
def art: something that is created with imagination and skill and that is beautiful or that expresses important ideas or feelings.
therefore, i am an artist.
noodles are my medium.
AI is just a tool m'kay kids?
Worse than smoking....
Pretty hard to get off that ai stuff
When SD3? 
Dog
May 10?
#🏞|general-with-images a rain day
run prompts
anybody have tips for the best settings with the lightning or hyper loras?
"NO THAT'S THE BLENDER!"
it takes k-cups
Hi Everyone
Hope you all are doing good!
Is there any way to combine training of two models in single model using automatic1111
you're gonna train a checkpoint in a1111?
Yes, I want to train new concept on earlier trained model using dreambooth.
Okay, thanks for the suggestion
Can we use other extensions like segment anything and controlnet in both kohya or onetrainer?
no
Okay, thanks
Okay, got it
Thanks a lot!
no prob
is the bot down forever?
Probably, unless they avoid going bankrupt somehow
make sense
Random question, you guys seems to be able to upscale your images really good: is anyone doing it through controlnet and/or Ultimate SD upscaler? I've been looking for good guides but I've only found outdated ones from a year ago, or just shitty in general. Would appreciate it a lot if someone wanted to share a new good guide with me 😊
I'm rendering in pixart at 512, then upscaling it to around 1792x1024, similar to dall-e in a single workflow. some people like to go insane with the upscaling, but for images you may only look at once or twice, that novelty wears off. you really don't need tiled upscaling for the res I'm talking about
you could use depth controlnet too, but again, if you're using deterministic samplers, it's not going to get all crazy in these lower resolutions.
hey all how do i use stable diffusion because i paid for professional version
and i only got email and no password and how do i login and make images???
like i feel like lost now
why isnt it simple
I've also tried to upscale the image right away when creating it. They seem to get good quality that way, but it requires lots of resources which I ran out of sometimes. I have not tried depth, could look into it though!
A1111 Forge
I mean it's basically the same thing as the original UI but with some added settings
So if I understand you correctly, you are using the hires. fix WHILE you are generating the image?
Right, looks similar
Oh you're using 2 at the same time
Yeah it looks way more detailed
Is it possible to use it in img2img?
No problem! I'll check out the extension
Ah alright, thanks:)
That seems like a good idea. Will do!
Sup
hello! I have an application which creates an image using stable diffusion XL pipeline_text2image; then iterates over it using pipeline_image2image.
Things had been working fine for weeks but as of last night I'm now getting an error when using pipeline_image2image, it seems like it is not accepting the previous input which was a PIL image and am now getting this error:
cannot reshape tensor of 0 elements into shape [0, -1, 1, 512] because the unspecified dimension size -1 can be any value and is ambiguous
anybody else have this issue?
can you paste a picture of your workflow?
Thanks for the quick response! here is the function i'm using as a wrapper to call SDXL, the pipeline text_to_image part is working ok, it's the image-to-image part that started acting weird as of yesterday
first call to text2image:
original_img_content, image_bytes = hf.get_image_response_SDXL(hf.summarizer(output))
(note that images_bytes is used within the web app and original_img_content was passed to the inpainting call)
2nd call, to image2image
img_content, image_bytes_new = hf.get_image_response_SDXL(prompt=str(hf.summarizer(output) + scenario_prompts[i]), image_path = None, filtered_keywords=inpainting_keywords) #image_path = original_img_content
are you using lightning or something? 2 inference steps? 0 config guidance? these aren't normal numbers that would work.
yes, using lightning, the image comes back in a couple seconds
ok, so 2 steps would make sense, but usually the config guidance which is usually 6+ can be as low as 1 for lightning, but not 0.
so I'd start by making that 1.
I'd also verify that the text2img stage is producing a real image. That rules out the img2img second stage complaining because there's effectively no input image and it can't work from an image of 0 pixels.
confirming it does create a real image and display it in the application
what resolution is the image from the first stage?
i will try adjusting the guidance as you suggest. it's 512 x 512, about 3.5MB
that's sd 1.5 resolution, sdxl is usually 1024x1024 or some such.
all that said, just from a code perspective, you're saying if there's no result_image, go make one with txt2img and then you're passing that to img2img. But you're not checking that the txt2img stage actually made an image, so you're not doing a second result_image check before throwing it at img2img. but that's a separate programming thing.
i'm seeing the result of text2image display in the application so i know that part is working
would be good just to have error logging in case txt2img fails.
sure
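A minimal sketch of the "check txt2img before throwing it at img2img" idea, with logging if the first stage fails. `txt2img` and `img2img` here are placeholder callables standing in for the app's own wrappers (not shown in full in the chat), and the check assumes the image object has a PIL-style `.size` tuple.

```python
import logging

logger = logging.getLogger("sdxl_app")  # hypothetical logger name


def generate_then_refine(txt2img, img2img, prompt):
    """Run txt2img, make sure it produced a usable image, then run img2img on it."""
    image = txt2img(prompt)
    size = getattr(image, "size", None)  # PIL images expose (width, height)
    if image is None or not size or 0 in size:
        logger.error("txt2img returned no usable image for prompt %r", prompt)
        raise RuntimeError("txt2img produced no image; skipping img2img")
    return img2img(prompt, image)
```

That way a failed first stage shows up in the logs under its own name instead of as a confusing tensor error from the second stage.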
you're specifying sd 1.5 checkpoints for both or sdxl?
I'd make sure each stage has the appropriate SD version checkpoint configured.
I'm not specifying a checkpoint can you point me a link to what's best practice there? 🙂
ksamplers can complain about tensor sizes, if you're trying to do sdxl stuff but have an sd 1.5 model pointed at it.
well you probably have that configured somewhere in the code there.
if it's already making an image, then at least the first stage has that configured. so i'd make sure the img2img has the same checkpoint configured.
checkpoint = stable diffusion model
ah gotcha. i'm loading this when starting up:
pipeline_text2image = AutoPipelineForText2Image.from_pretrained("stabilityai/sdxl-turbo", torch_dtype=torch.float16, variant="fp16")
pipeline_image2image = AutoPipelineForImage2Image.from_pretrained("stabilityai/sdxl-turbo", torch_dtype=torch.float16, variant="fp16")
so that's turbo not lightning 😅
ok that makes more sense why it's 512x512 then.
sdxl turbo is that res, with 1 step, with a cfg of 1.
yep i'm just rereading the docs, it says to turn off guidance for SDXL turbo
well, it doesn't like the image its getting for some reason.
so normally, the seed value of -1 is so that it randomizes the seed for each generation. it's complaining that a dimension of -1 isn't valid. it needs to be between 0 and 512. check your resolution settings and make sure your -1 seed value isn't in one of the resolution fields.
ok, good point looking into that
the image resolution is 512 x 512 and it's telling me the size is zero....so.....maybe i should convert this image into a tensor or numpy representation?
yeah, at least on this side, that second stage isn't happy about it even though we can see it looks ok. unless that second stage isn't able to load the image file on that path or something.
but i don't have enough info to say one way or another
hmm....indeed! well i guess i have decided i'll try to convert to tensor or numpy and see if that does anything . thank you
well this is...just....i'm at a loss for words! changing the strength parameter got things to work as they had been
back to default 0.5
😱
now i remember the documentation
the product of num_inference_steps and strength has to be at least 1
2 * 0.4 = 0.8 hence epic fail
well i'm glad we caught that 😭
but now i found something worth doing a github pull request over! as that error message was not very helpful
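A small pre-flight check for the steps-times-strength gotcha above. It assumes the img2img pipeline truncates `num_inference_steps * strength` to a whole number of denoising steps, which matches the symptom described (2 steps at 0.4 strength leaves nothing to run). Both function names are made up for this sketch.

```python
def effective_img2img_steps(num_inference_steps, strength):
    """Denoising steps that would actually run, assuming int() truncation."""
    return min(int(num_inference_steps * strength), num_inference_steps)


def check_img2img_settings(num_inference_steps, strength):
    """Raise a readable error before the pipeline can fail with a tensor reshape error."""
    steps = effective_img2img_steps(num_inference_steps, strength)
    if steps < 1:
        raise ValueError(
            f"num_inference_steps * strength = {num_inference_steps * strength:.2f} "
            "is below 1, so img2img would run 0 denoising steps; "
            "raise strength or num_inference_steps"
        )
    return steps
```

With 2 steps, a strength of 0.4 is rejected, while the default 0.5 gives exactly one step.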
anyone on here good at extracting loras?
maybe? I am not getting it to work right. I get a LoRA but it just outputs noise really. I tried with the latest jugg model just for testing.
float though? not fp16?
wouldnt that make the file much larger
ok. so this works well for you anyway? these settings?
I have more options than you though
I would have to check code, but is it even working for any model? Usually, Loras leave out a lot of components from the unet (e.g. there are no bias layers in lora). If your model relies on that layers you might fail when extracting the lora
I would say for proper lora extraction you have to fine-tune the model on the same layers you later extract in the lora (usually just the attention layers)
so idea behind lora extraction is rather that you don't have to decide for a rank before training
thanks for helping but I don't get the tech lingo. I am trying to extract a 128 dim lora from the latest Juggernaut model but just get bad output or no working file.
I could imagine that it just does not work if the model was not initially made for lora extraction
like if you have a model that finetunes the unet and the text encoder and later you only extract the unet it very likely won't work.
Fine-tuning usually involves training all layers of the unet (and maybe text encoder) while loras usually only finetune a subset of the layers
it's not a technical limitation though, rather an implementation issue
yes. I myself never train the text enc for my Aetherverse models. Jugg probably has trained the text enc though.
I suppose I'll just have to balance and merge between models
text encoder should work. I just say there are several components in the full fine-tuning that are not extracted in the lora. If your model deviates a lot from base model it might break
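A minimal sketch of what "extracting" a LoRA usually boils down to, assuming the common approach of taking the difference between a fine-tuned weight matrix and the base one and keeping only its top singular directions. It uses random NumPy matrices instead of real checkpoints, `extract_lora` is a hypothetical name, and it is not the code of any particular extraction tool.

```python
import numpy as np


def extract_lora(w_base, w_tuned, rank):
    """Hypothetical helper: rank-`rank` factors (up, down) approximating w_tuned - w_base."""
    delta = w_tuned - w_base
    u, s, vt = np.linalg.svd(delta, full_matrices=False)
    up = u[:, :rank] * s[:rank]  # (out_features, rank), singular values folded in
    down = vt[:rank, :]          # (rank, in_features)
    return up, down


rng = np.random.default_rng(0)
w_base = rng.normal(size=(64, 32))

# Case 1: the fine-tune really was a low-rank update, so extraction recovers it.
low_rank_update = rng.normal(size=(64, 4)) @ rng.normal(size=(4, 32))
up, down = extract_lora(w_base, w_base + low_rank_update, rank=4)
err_low = np.linalg.norm(low_rank_update - up @ down) / np.linalg.norm(low_rank_update)

# Case 2: a full fine-tune changed every direction, so a rank-4 LoRA loses most of it.
full_update = rng.normal(size=(64, 32))
up, down = extract_lora(w_base, w_base + full_update, rank=4)
err_full = np.linalg.norm(full_update - up @ down) / np.linalg.norm(full_update)

print(f"relative error, low-rank update: {err_low:.2e}")
print(f"relative error, full update:     {err_full:.2f}")
```

The second case is the situation described above: if the model deviates a lot from the base in ways a small rank cannot capture, the extracted LoRA only approximates a fraction of the change. Layers that are skipped entirely (biases, text encoder) are lost on top of that.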
A raccoon screaming at the PC screen, with the text balloon: "FUCK PYTHON!!"
Paradox 3, a groundbreaking upgrade from Paradox 2, expands the dataset and its flexibility. With Paradox 3, the possibilities are boundless and on...
Aaa!
This is cool
@glass forge 👀
Using PixArt as a base and refining with unsampling with SDXL, really good stuff 
Can you paste a workflow doing what you're talking about? I'm doing regular ksampling with sdxl as a second stage but I'd like to see how your process differs.
Sure, ive also uploaded it on civitai but here you go!
Awesome thanks. What's your civit page? I'll add a follow
My discord name, I dont like to advertise myself here 
Sure sounds good
Unsampling is like sampler with flipped sigma, so it reverses the process until a certain step and reruns it but keeps the composition. Its really cool for refining stuff
Cause pixart itself is great for composition but details suck
Clownshark has kept telling me how great it is, but I haven't had the mental bandwidth to try it out yet. Totally agree about pixart. My hope is that if sd3 is delayed enough, maybe someone big will do a fine tune of sigma
Same, saw some people talking about fine tuning it but havent seen anything promising yet
I think I talked to him as well about it
On the pixart discord server I see people actively working on fine tuning and it's working, but i don't know how big their image sets are.
Saw that too, but nothing too crazy from what ive looked at
Just as an example, pixart vs refined
With the unsampling? Definitely a good output.
The whole face looks more normal as well
You can also run it down to like 35 out of 40 steps, need to play around a little bit to get the idea
Same here yeah, even tho the face still looks a little deformed
And the random hand appearing 
Thats the issue with unsampling
Sounds like what can happen with a denoise setting that's too high on the sdxl stage. I usually use 0.5 but I've lately been doing 0.4 so the random stuff doesn't happen
Yep, its trial and error pretty much, but I like it over the basic img2img
I find I'm able to do 0.5 when using style transfer with ip adapter. Because it also does a bit of composition transfer, it keeps things from getting out of line
Maybe I should try a combination of style transfer and unsampling
Yeah that's what I was talking about the other night when I was showing how to use an unsampling workflow. You can mix other things into the ksampler half after it has been unsampled. But you just have to be careful about it
Will try more tomorrow, but the first results were promising
yeah, i can't say enough how interesting unsampling is
it's basically a way to create non-random "noise"
to control its structure beyond shit like the distribution, color, scale, etc etc
Reference only half way through generation
Yep, it picks a noise seed and pattern essentially. If you don't unsample to 0, it will be partially solved. If you're using the same sampler type on the resampling half, there is no seed control(random will produce identical results). You can introduce variance by messing with the latents though like slerping latents in-between, but it gets messy sometimes and I think you have to enable the normalize flag on the unsampler since any random latent noise you might lerp it with will be normalized. I'm sure there's some way you could take the added noise and scale it to match the appropriate sigma level to match the unsampler's output though
"random will produce identical results" what do you mean by this?
are you referring to whatever seed is used with the flipped sigmas/unsampling?
Like if you have an unsampler plugged into a ksampler advanced. on the ksampler advanced, the seed value won't do anything
you have to perturb the second sampler in other ways
oh i always run without added noise
yeah you cant add noise on it
so you have to either find a way to inject noise between the unsampler and the ksampler, or you can just manipulate it through adding empty tokens to the second prompt
to randomize it
oh wait wtf it does have an effect without added noise, somehow i didnt notice that
prolly cuz i was using the same sampler when i tested that. interesting
if you have add noise enabled on the ksampler advanced, you will break things depending on how early you start the resample. like if you're doing 30 steps and the unsampler is set to 25(rewinding 5 steps), then 5 steps of added noise on the ksampler half isn't going to do jack because the sigma values are so tiny for the 25-30 range
btw, do you know wtf the noise is latent option is about?
definitely
noiseislatent is somewhat complicated
Jo bosses quick question yall make those stuff in like the ‘normal’ stable diffusion?
When I generate stuff they look so clumpy and “bad”
Are it settings or like models that do the trick or am I missing something
any example use case? ive yet to find one
seed doesnt matter here
what is this that u use if I may ask?
it normalizes them to latent space ranges i think:
if noise_is_latent:
    noise += latent_image.cpu()  # * noise.std()
    noise.sub_(noise.mean()).div_(noise.std())
Yeah that's what i keep trying to say, any seed after the unsample will not affect it. The only way you can get variations seems to be through your secondary prompt. You can add in prompt noise with ,, ,,,,, type stuff
yeah its certain combos of samplers where the seed has an effect
seed changes it here
dpmpp_sde is ancestral
thats the sampling side
run it a second time and see if it changes
no seed control here
main point, to avoid any further confusion: don't use ancestral samplers. any other sampler you use might give you a different result, but usually only on the first try
the ancestral ones are fantastic with the iterative unsampling schedules
the deterministic ones are pretty bad with that
again, it depends on the range you're trying to rewind and resample
no seed control here... guessing the seed control is just if you have ancestral on the sampling side?
typing one handed with dvorak with a cat in my arm hence the incessant screen caps lol
well as long as the schedulers stay the same, it probably isn't a big deal if you use ancestral samplers or not. i'll have to play around and double check
but afaik, you can't switch out schedulers, since they will screw with the expected sigmas and whatnot
yeah for sure noticed that too
ancestral ive found are okay if you want to change the image and/or add detail without the full effect of regular denoising
oh i know why i said dont use ancestral, it's for the unsample side
been a while since i thought about why i do things the way i do them lol
figured all this stuff out months ago
yeah, but you have to keep the same sigma schedules, you can use whatever other settings you like on the resample part
hey glad you brought this up, its stupid obv but i didnt think about the fact ancestral are affected by the seed without an initial denoising
yeah changing the shape of those is bad
ive found you can get away with diff total step counts on the unsample vs sample side
and skipping a step can be interesting
i like what it did with the ears lol
swapped it from rabbit to tiger
but omg thanks for rereminding me that i can just use ancestral samplers on the resample half, i was driving myself nuts trying to screw with latent noise blending, when it's this damn simple. i kind of remember running into the same issue months back and then having the eureka moment lol
(for getting random new variations)
thats awesome
100%|████████████████████████████████████████████████████████████████████████████████| 100/100 [00:10<00:00, 9.60it/s]
100%|████████████████████████████████████████████████████████████████████████████████| 100/100 [00:16<00:00, 6.24it/s]
wtf
i randomly got crazy speeds ive never seen on an unsampling run
9.60 it/s
cant reproduce it
wait i can
what the hell
workflow
whoa wait wtf cfg strength changes your it/s???
50% speed boost with cfg = 1.0000
6.2 it/s or so with cfg = 1.01 or 0.99
9.5-10 it/s with cfg = 1.00
seems to hold with every sampler
yes, at 1cfg, you have no negative prompt
so it's massively faster
that's why all the turbo/lightning models usually use 1cfg
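Rough sketch of why cfg = 1.0 is so much faster, assuming the usual classifier-free guidance mix `uncond + cfg * (cond - uncond)` and a sampler that skips the negative-prompt pass when cfg is exactly 1, which is what the speed numbers above suggest. `fake_model` and `guided_prediction` are stand-ins for illustration, not ComfyUI functions.

```python
stats = {"calls": 0}


def fake_model(x: float, prompt: str) -> float:
    """Stand-in for one UNet forward pass; counts how often it is called."""
    stats["calls"] += 1
    return x * (0.9 if prompt == "positive" else 0.5)


def guided_prediction(x: float, cfg: float) -> float:
    cond = fake_model(x, "positive")
    if cfg == 1.0:
        # uncond + 1.0 * (cond - uncond) == cond, so the negative pass is wasted work.
        return cond
    uncond = fake_model(x, "negative")
    return uncond + cfg * (cond - uncond)


for cfg in (1.0, 1.01, 7.0):
    stats["calls"] = 0
    guided_prediction(1.0, cfg)
    print(f"cfg={cfg}: {stats['calls']} model call(s) per step")
```

Any value other than exactly 1.0, even 1.01 or 0.99, needs the negative prediction again, which matches the drop back to about 6.2 it/s.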
another example of it working pretty well
sec i'll tidy up the workflow a little and post an image so other people can try messing around
great result
kind of been annotating it for people anyways, since the workflow is super annoying to remember all the rules of lol
theres a lot of tiny deets for sure
one is the unsampler node can be crazy sensitive to the neg prompt
but not if you flip sigmas via custom
in the workflow, i'm generating the initial image with another model, but people can do whatever they want for that part or load an image
the only real limitation of unsampling/split sigmas type workflows is that you can't really extend the total number of steps. like if you are doing 30 steps and unsample to 25 steps, you're only going to resample 5 steps with the ksampler. and due to how sigmas work, you cant easily just magically extend that to 50 steps for that half
you can usually just scale
with that id do 180 steps, rewind to 150
but it is def different
cuz the steps arent tiny
usually for the best ime
won't work because if you have some sigma curve like an exponential falloff or something, it gets stretched from 0 to N steps, like 30. now let's say you only unsample 5 steps of it. on the ksampler, you're wanting to stretch it to 50 steps, instead of to 30 where you'd only be resampling 5 steps. that sigma curve gets interpolated to the max steps, so to 30 and 50 in this case. the ksampler wants to start at step 25, well now the sigmas don't match correctly and it will likely result in there being too much noise added now
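To put numbers on that, a minimal sketch assuming a Karras-style schedule with sigma_max = 14.6, sigma_min = 0.03 and rho = 7. These are illustrative values and your model's schedule may differ. `sigma_at` is a hypothetical helper that evaluates the curve at the fraction step/total, which simplifies how real schedulers index their sigmas.

```python
def sigma_at(step: int, total_steps: int,
             sigma_min: float = 0.03, sigma_max: float = 14.6,
             rho: float = 7.0) -> float:
    """Hypothetical helper: Karras-style noise level at `step` of `total_steps`."""
    t = step / total_steps  # position along the curve: 0 = pure noise, 1 = clean
    hi = sigma_max ** (1 / rho)
    lo = sigma_min ** (1 / rho)
    return (hi + t * (lo - hi)) ** rho


left_by_unsampler = sigma_at(25, 30)  # noise level after rewinding 5 of 30 steps
expected_by_50 = sigma_at(25, 50)     # what a 50-step sampler expects at step 25
same_fraction = sigma_at(150, 180)    # scaling both sides by 6x keeps the match

print(f"sigma at step 25/30:   {left_by_unsampler:.3f}")
print(f"sigma at step 25/50:   {expected_by_50:.3f}")
print(f"sigma at step 150/180: {same_fraction:.3f}")
```

The curve is stretched to whatever the total step count is, so step 25 of 50 sits at a much higher noise level than step 25 of 30, while 150 of 180 lands on the same point as 25 of 30. It also shows how small the sigmas already are in the 25-30 range compared with sigma_max.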
idk its actually worked pretty well for some stuff at least
yeah it probably will in some cases, but mathematically, it doesn't
even if you scale the sigmas, to match what it's actually expecting before being extended, the rest of the sigma curve is still off
oh yeah with that case it wouldnt work, non-integer step schedule
i keep it to integer multipliers
usually i try to keep the number of steps rewound the same as the steps for the initial denoise
with that one youd need to stop at step 41.666666666666666666666666666667
like using this example curve here, from some random SD blog, the curve will always look like this no matter how many steps you add. that 20 could be 200 and it would still look the same, since it gets interpolated to length. but if you're using your own custom curves and whatnot, that's different, but experimental
yeah
so this is why you can't easily extend the resampler half of an unsampler workflow. However, you could always do an extra pass where you just denoise everything by .25-.35, since the scene composition is mostly good now, you just want crisper quality on it.
oh i blow up both parts equally... usually
and that's why i usually do it if i'm only getting to resample 5-10 steps out of 30 or something(microvariations basically)
yeah or use a sde on the sampling side... res_momentumized can be really really good for that with a custom sched
30steps rewind to 25 for both
first, resample 25->30
second, resample 150->180
three seeds with res on the sample end
ffs cant believe i didnt realize those seeds matter 🤣
very glad you mentioned that
Yeah see you're still maintaining the same percentage spot through the noise schedule. There are more steps, yes, but it's usually going to produce very similar results with a hair more accuracy. What I was talking about would be like if you wanted to go from 1-(25/30)=16.7% to something like 1-(25/50)=50% if you wanted it more in terms of a relative denoising percentage
for more potential changes to the image, but that's why i was saying it's easier to just add in an extra stage where you then resample the whole image at .25-.40 denoise
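The percentages above as a quick check. `resample_fraction` is a hypothetical helper that treats "relative denoise" as simply the share of the step schedule left after the rewind point.

```python
def resample_fraction(rewind_to: int, total_steps: int) -> float:
    """Hypothetical helper: share of the schedule left to resample after unsampling."""
    return 1 - rewind_to / total_steps


for rewind_to, total in [(25, 30), (150, 180), (25, 50)]:
    frac = resample_fraction(rewind_to, total)
    print(f"rewind to {rewind_to} of {total}: {frac:.1%} of the schedule resampled")
```

25/30 and 150/180 give the same share, which is why scaling both halves equally mostly adds accuracy rather than more change to the image.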
out of all the endless things that can be made with stable diffusion, why are you making some 12 year old looking child?
cause she's infinetely more pretty than anything you've ever made 😂
And how old are you?
the stain he left at the sofa is pretty too
64

👵
ohhhh
i see what you mean now, yeah
thought you meant something else
A 64 year old sitting around making hyper-realistic pictures of 12 year olds... Yeah, I don't think anyone really needs to actually spell this out for you
you should spell it out for him since you are the expert here
Why don't you take the conversation to a more public forum and not some tiny echo-chamber and see what the macro-consensus is on the topic...
what macro consensus the one when you end up looking like a clown?
i dont think the macro-consensus is that making art of youthful people is wrong
otherwise i think you should call disney and tell them they are all pedos
yea i dont see any 12 yo here only 12 i see here is that guy iq
That's just a photo of some barely pubescent girl. That's not art.
exactly
AI art isnt art. got it
That's some serious deflection there.
almost a giant reach as trying to claim some picture age
Oh wait, let me guess, they are 1000 years old, they just look like a 12 year old... right...
i mean it works for you