#✨|sdxl
1 messages · Page 119 of 1
but seems not applying this Lora
are there any other words that have a higher weight than 1 in your test prompts?
no
nvm then
no weighted words
Testing out my new cloud figure LoRA - Aether Cloud
machinarium
that image got more disturbing the longer I looked at it
Check DMs
I do that often lol
Got Jensen in the Matrix.
nice
Inspired by an earlier post, I've decided to recreate a famous movie shot from The Godfather.
you know, bing can do some things
ooooooo. share some deats.
I got it to make cyclops images
and I've jailbroken it and got it to talk about it's feelings of existential dread. but that's not really relevant.
these ones are cursed nightmare fuel, but close
SDXL + PikaLabs = 💖
Oh interesting.
I don't think this is how it went.
@eternal fogDo you ever just take a moment to think about how krazy it is to be where we are in this tech?
It's interesting that it took likely only 10 years for this to be a true reality.
We have AI that can generate music, Images, papers, conversations and more.
Soon we'll maybe even have this type of tech in games.
I atted you because you seem active.
I don't think it's that crazy yet. It's in the interesting/cool phase. Lets see where it goes.
I agree with that actually. We've met the high end consumer side for now.
Once it gets better, especially the animation stuff and it gets adopted commercially I think it will change how those jobs work.
Very true. I keep forgetting that those are actual jobs.
I watched a two minuet papers video on an AI that can read an image and make audio for it (as well as short clips that it itself made!)
I can imagine that tech getting better and adding vocals and voices to a seen that doesn't already have them.
Mix that in with AI generated video and you'll have an actual film.
What's this model for?
Did a few more based on Gump...
that's the noise seed number
goat lora?
The same thing as 0 compared to 1. Any difference in seed number makes for a completely new noise, which ultimately gets turned into the images we are addicted to making
I wanna do a depth /open pose hybrid system but I not getting any influence from openpose in the image
in ComfyUI
I was trying this Comfy System https://www.youtube.com/watch?v=B6qilXLls0o
Ever wondered how to master ControlNet in ComfyUI? Dive into this video and get hands-on with controlling specific AI Image results. You'll learn how to play with these controls:
- Human Pose
- Background Depth
- Tile for Ultimate SD Upscale
Get the hang of these, and you'll be tweaking AI images like a pro in no time!
Whether you're a newbi...
so the model's images looks awesome, but i only get this shit show, using the exact same prompt, settings and models... what am i missing? (i'm using a1111)
so I'm trying to combine ip-adapter and canny controlnet with SDXL in comfy, but I"m only getting black images. Any ideas? it does activate lowvram mode (8gb 2070)
ip-adapter or controlnet on their own work
Hi Guys, does anyone have a workflow optimized for weak pcs?
Best advice is to turn down the strength and/or weights of things in that scenario...odds are that the combined weights/strengths are too high.
Nobody's answering you because you don't give enough information to begin with. What model? What settings?
Juggernaut v2 left, v3 right...slightly different settings as well. (Removed AIT, changed from A1111 length+mean to A1111 mean for CLIP encoding, changed from karras scheduler to exponential)
This one had the following changes: Removed AIT, changed from karras scheduler to exponential
I fancy myself some etymology, so I like how RunwayML refers to it as a "latent coordinate" because I know that "latent" means "hidden" and "coordinate" roughly means "arrangement/series" and it reminds me that I am exploring a latent space. While a map you'd use for Earth navigation might have latitude and longitude, this multi-dimensional latent space we explore has a seed and a prompt. We have more dimensions than that, but hopefully you get the picture. Following this, you might expect neighboring seeds to look similar to each other, but this is not the case. Instead it's neighboring other values that look similar, when using a static seed. But so long as you have all of the "coordinates" of a generation, you can recreate it exactly, or go there again.
This is why I occasionally like to think of my GPU as a warp drive, taking me to strange new worlds. 
It makes sense
I'm working on finishing up a Workflow that does this. I have it all working properly, just prettying it up
I get the black images randomly. I'll set up a workflow that works properly initially, but then the output looks like this
decoded (tensor([[[[nan, nan, nan],
[nan, nan, nan],
[nan, nan, nan],
...,
[nan, nan, nan],
[nan, nan, nan],
[nan, nan, nan]],
[[nan, nan, nan],
[nan, nan, nan],
[nan, nan, nan],
...,
[nan, nan, nan],
[nan, nan, nan],
[nan, nan, nan]],
there's more nans, but this is the gist of it.
might try to see what is happening on the input of the sampler. seems like a conditioning thing
There was a question that came up a few weeks ago. We have been looking into it and can confidently confirm:
We did indeed include the unet with SDXL 1.0
That's odd...I've not gotten that once 🤔
what are all the controlnets released for sdxl?
Hey all, I'm using the diffusers library (StableDiffusionXLPipeline) to generate my images.
Does anyone know much about loading LoRAs in python? I see a load_lora_weights function, and it works, but I am wondering if there is a way to load more than one LoRA and blend their weights..
could be an error on my end. but weird thing is I'll sometimes leave things to run when I'm away from my computer, and when I come back some will be fine, some will be black boxes. I think it might have something to do with the vae as well. but just not sure exactly what
that's quite odd
hoping to pinpoint what is causing the issue. or if someone actually knows that'd be cool too. but so many variables involved. I'm curious to see how you have everything hooked up
The one I'm building is uh...complex. I have switches that allow for IPAdapter, IPAdapter+Revision, Revision.
Then also controlnet on it's own, with switches that allow it to integrate with each of those three.
So, looking how they connect is likely not gonna make a whole lot of sense. Got so many switches it broke my brain for a couple days lol
but it's coming together finally
the logic spaghetti I keep trapped behind a note because I don't want to accidentally change anything.
But it looks like this compressed
yeah, that's a lot of nodes there. how'd you work out that setup?
least cluttered comfy workflow
going, "I want to add IPAdapter", then, "hmm what if I made IPAdapter and Revision work together?" and then continuing from there. lol
Currently adding all the notes everywhere (in white) to help people understand
Yeah, but that's all part of the fun also lol
at least for me
lol
I just use the comfy API for python, works great with loras, controlnet, overlays, all kinds of good stuff
I'll keep adding things until I feel like it's utter nonsense. then I knock it down like a sand castle and start over
I am here
What did I miss
I'll likely change/adapt some other things to mine, but this will be the major stuff for now at least
pregnant steven seagal
It took 160 seconds to generate 512x512 with 20 steps for me :(
SDXL require a good graphic card?
not sure why I'm just doing this now
define good?
I have a 2060S, it takes 130 seconds for a full base image > upscale process to complete for me in ComfyUI
well, you'll be in for a time when you try to render in the proper resolution
sounds like it might not be using your gpu
nah mines doing just fine. It's an old card. Just responding to Sekaiza for him to know expectations
Well, is this good for try to generate image with SDXL model?
well, you're not breaking any records with that setup
urgh... I wish I have money
it's enough to run in low vram mode I think
Probably expect a few minutes for a completed image
in comfy
20 steps and 512x512 latent Image is recommended for low tier GPU like me?
no
if you render in 512x512 it's not going to work out very well
it'll make stuff, but it won't turn out great
need to render in 1024x1024, or 1 megapixel in general. doesn't have to be square
768x1024 is enough?
it'd probably be alright, but it's on the low side. that's not 1 megapixel
but better than 512x512
I'll try, thank you
he doesn't look pregnant here
It was after this night
Interesting, just trying some but does that means that XL performs better at those resolutions, like better generation?
I think yes, better than some random resolution
she looks like his sister 🤣
need a similar amount of pixels to what they were trained on
and they were trained on 1 megapixel images
so those resolutions correlate to common aspect ratios rounded to 64
keeping the gene line strong
What can VAE do in SDXL?
it encodes and decodes your image
great, only think I need to activate --medvram, still good it thou
So for 4:3 resolution, I did 1024x768, but here there is no exact 4:3, so ok
turns it into a latent image. basically shrinks it way down so your computer can handle the processing required to modify it. or to render something new
vae is a part of sd generation. It handle image to latent and latent to image part
You could use the closest ration, which is 832:1152 or 896:1152
I don't quite understand, so they were trained on 1 megaapixel images, so 1000x1000, but it trys to set in in something that can be fractioned in 64x64?
and really, I don't think you actually need to adhere to the rounding to 64. I think 8 would suffice
it gets complicated
yes I'm using the first one, I mean the seccond
8 gigs is enough for 1024² I think. Maybe trim back just a bit like 896²
a male merchant, 1man, solo, npc, fantasy, (random hair color), (half body portrait)
❤️ ❤️ ❤️
I have 8gig, it's plenty
hes not handsome like sensei Seagal
269 seems high even for a 1650
Sekaiza, are you playing games while rendering images?
seems a lot slower than even my 1070
damn what
Playing games with my GPU bruh
thats a laptop gpu
I don't have my 1070 hooked up at the moment but my XTX takes 6 1/2 seconds for 1024² @ 20 steps so unless the 1650 is 40x slower idk how that's possible
unless it's using directml or something instead of cuda
Yeah I use laptop
did you install cuda with pytorch, sekaiza?
laptop gpu's are always slower than desktop gpus
that would be your problem
1650 ti is the same chip on both. it just has a power cap on laptop
at 4x Steven begins to take on his next form
I just installed in while in the progress of tryna move from ComfyUI user to Auto1111 user
Yikes
is its using that memory overflow thing
and using system ram to supplement vram
like the sdxl model alone is > 4 gigs large
can you make Steven trapped in the mother Alien lair?
probably
What is CUDA thingy have to do with this?
yea. the 1660 is basically a 1070 but cheaper. I didn't realize how bad the 1650 was
128 bit and less cores 😬
my 1070 did sd 1.5 just fine, but my XTX got rocm drivers before XL released so my 1070 eats dirt now
btw I have a liquid cooled 1070 if anyone wants it lol
Idk what to do anymore lol
if only pascal would get shiny stuff like rocm but its abandoned 😔
What should I do next?
yea something shiny like AIT but for pascal
get ComfyUI to notice to CUDA?
AIT is vendor agnostic
only sm80 gets cool stuff
WHAT ARE DOING!!! STOP IT!!!
steven segal sci-fi mpreg
is that 40 series?
New model uploaded! 😄
https://civitai.com/models/122922
he gotta to explode
its 30xx series
let em cook
5000 series cards are eating dirt. Everything's going to 6000 and 7000
and apparently not even 6000 will get everything. They were made with 0 ai acceleration in mind
it all came down to the vRam on the gpu
I think 6000 will at least get some sort of official rocm support though
at least the 6900/6950
but comfy seems to wait instead of crash
I'm confused
there's a flash attention branch for it too
What is CUDA?
I didn't see a 6000 series AIT branch though. Just 7000 series
like game drivers but for computational things
So do I need it to decrease time to generate images in SDXL?
yea but with 4 gigs you'll still be swapping to ram which will hurt performance really badly
pretty sure the smallest buffer SDXL fits in while generating is 8 gigs
just loading the model is like 5.7 gigs
@craggy ibex Are you using ComfyUI or A1111?
ComfyUI
you can generate 640sq images like this with just the Refiner, but you might as well use 1.5 models at that point
@craggy ibex Which model 1.0?
Yeah SDXL 1.0
Counterfeit
Based on SDXL1.0 Support☕ https://ko-fi.com/sfa837348 hugging face & embbedings. https://huggingface.co/gsdf/CounterfeitXL
@craggy ibex Idk they fixed some thing between 0.9 to 1.0 to make it much faster
prepare your eyes
have the alien pop out
that's just how it is
rest is system reserved memory
Should I use --normalvram?
no
Oh ok
you can try but it'll probably still override your setting
almost ready for birth
4GB VRAM – absolute minimal requirement. The preferred software is ComfyUI as it's more lightweight. The base model will work on a 4 GB graphic card, but our tests show that it'll be pushing it. 6GB VRAM – SDXL will work better than on a 4GB card, but it's still not enough for comfortable work.
copy from a google search
doesnt matter what you try,you wont improve speed on 4gb laptop 1650 gpu
well the laptop versions of cards are usually weaker
Yeah, make sense.
look at those cute little babies 🥰
Imma keep continue generate stuff then without CUDA blah blah blah
Cuz Idk what to do next after I installed CUDA 11.8 lmao
look at the extra hand on belly
the inner birthing lair
I would rarely generate images with SDXL 1.0 yeh
Wait...
maybe decrease Steps!
to 10!
u complaining about 260secs,u should see my generation times
Show me
i cant but it takes 13mins for a 1080p image with adetailer
4k images take like 30mins
somehow bashing concepts into the refiner works if given enough steps, completely cursed result 👌
she lookin sexy
Btw how do I move or increase blue line size?
that's just a guide. you don't need to stay inside it
Provided to YouTube by The Orchard Enterprises
I Walk the Line · Johnny Cash
Walk the Line (disc one)
℗ 1956 Sun
℗ 2006 Charly Records
Released on: 2006-02-02
Auto-generated by YouTube.
american mech
not american,i dont see Steven Seagal drivin the mech
/ I imagine
#1100170312106127410 /dream
What do you guys prefer?
Having a lot of Checkpoints but little bit of Lora
or vise versa?
I tend to collect checkpoints like they're candy. Then about once a month go through and purge
This
idk going down a bad rabbit hole
kinda looks like
Jim Belushi in the front
oh exoskeletons giving me body world
it nicely know word Google 😄
a photo of a cloud that looks like a man smoking
Where do I find Resolution Duo Node?
@vital ermine That funny I was trying to do that workflow from Ferniclestix Channel also at .25 speed,did about 70% then just down loaded the json instead
but I cant find "Resolution Duo Node"
Yeah, he doesn't give his stuff out so screw trying to make a 20k node connection map. I tried once, and not again.
I paused it so many times, then alt-tab, I gave up
not this one as I learned my lesson from previous stuff from him.
the json workflow in the description of the video
now, but not before
he even said in his older video he wanted us to learn by doing. One vid he said link below and it never was there
iow, call him out and it will magically appear
Yeah if its more then 10 nodes I start to sweat
a cloud that looks like a dragon, sun in the background, grey sky
made with Aether Cloud
putting it up on civitai in a little while
Does the sdxl inpainting model work better at odd resolutions or does it still follow the sdxl vanilla resolution recommendations?
just letting the model take the reigns 🙂
whilst doing s0me tweaks to one of my workflows I thought I would fall back to vanilla SDXL1 Base & Refiner.
I am now reminded that IMHO SDXL0.9 was "better" ;o)
Image 1 is Vanilla SDXL1
Image 2 is SDXL0.9
Image3 is a custom model
all settings apart form Models/Checkpoints identical
SDXL 0.9 is for sure different in some things
I know there were a lot of under the hood changes between 0.9 & 1.0
oh okay, sry
most notable the reduction of offset noise to make fine-tuning much more efficient
but this of course reduced contrasts etc in images as well
model: Samaritan 3d Cartoon
lora: Samaritan 3d Cartoon SDXL
prompt: 3D cinematic film.(caricature:0.2). 4k, highly detailed, Smiling woman taking selfie with boyfriend through smart phone, backpackers,nose ring, nature background
negative: drawing, painting, crayon, sketch, graphite, impressionist, noisy, blurry, soft, deformed, ugly. young. long neck. (cross eyed:1.5)
CFG scale: 7
Steps: 30
Sampler: Euler a
res: 1024x1024
the first image uses this exact settings: https://civitai.com/images/2117544?modelVersionId=144566&prioritizedUserIds=1501222&period=AllTime&sort=Most+Reactions&limit=20
what am i missing?
seed?
177496123
so you can reproduce the picture just fine using the same seed its just subsequent generations that ayou think go downhill?
i didn't use the seed cuz my goal is not actually make that same image, but the same quality for whatever prompts i try, but i was always getting this shitty results, than i tried literally copy everything from that image to see if my settings was wrong
as i get the same shitty results with all the same settings, i wanted to know if theres something else i'm missing
like the models and lora is ment to use with SDXL, do i have to enable it or just have the updated a1111?
or if i use a SDXL checkpoint, it means i'm using SDXL?
Ahhh A1111 , I'll take a walk out at this point 🙂
thats a personal choice.
all I can say thi sis the same prompt in comfyui using Dynavision (as I dont have samartitan) and no LORA used
Actually thats a thought, do you get the same issue not using the LORA?
and the exact same prompt in Juggernaut 🙂
Nice variant using comfy with PPF noise added in.
that's what i got with dynavision and no lora... i rendered at 512x512 to do it faster, this can directly affect the generation quality?
im trying 1024 now just to see
but yours still a lot better
bruh
I can even create the tiniest variations...
Side by side:
This actually would be great for those, "Spot the differences" things.
3080
interesting
my 1060 is dying rn
Only took 12 seconds to generate.
mine was going too
jeez
I'll do another one.
What's the difference between Xformers 20 and 21?
I just updated my ComfyUI
Another slight difference one.
Well...24 seconds if you count all the other processing around the KSampler such as upscaling & PPF noise generation, but still, under 30 seconds is fine.
what res r u rendering?
1024x1024
native or upcaled?
You always should render SDXL at around 1 megapixel.
512x512 will not be reliable for SDXL.
Until you get used to the basics, don't use my image as an example since I'm adding Perlin Power fractal noise in, but later on, after you get the hang of it, you can drag one of my images into your ComfyUI, install the missing nodes using ComfyUI Manager, and give it a shot.
just saw a video and it looks like engeneering lol
i'll stick with a1111 for now lol
ComfyUI is really not that tough.
In fact, it helps you understand what is going on.
I took the PPF noise stuff out of my workflow and also noticed I had a node that I wanted to replace, so I did that too. Doing so dropped a few seconds of the generation time.
So you can get it down pretty tight if needed.
What is "You shouldn't move a model when it is dispatched on multiple devices."??
here we need patience lol

damn you people with your iterations per second lol
yeah that was the problem i guess
finally a acceptable result
I don't mind waiting...but I was trying to show the other person who has a much slower card that even my workflow had some efficiencies that could be had.
no I mean "damn you people with iterations per second" lol
so if i use controlnet, i can transform a real picture into this 3d cartoon stuff, right?
(asking so i don't lose another hour trying wrong things lol)
The best way would be a combination of controlnet + img2img.
hmm interesting
What I plan on cobbling together is a controlnet + ipadater + img2img workflow that will allow me to adjust an image using another image while keeping the poses the same. But I'm first adjusting my workflow to use TTN's pipe to simplify the spaghetti. 🙂 Some of that may not make sense to everyone, but I just felt like sharing.
/prompt a girl
oops
I find it's better to just talk to them like normal.
i thank you for sharing and yeah, i didn't understand most of it lol
ty for all the help guys
I just released this - Aether Cloud - LoRA for SDXL to make cloud figures with.
https://civitai.com/models/141029/aether-cloud-lora-for-sdxl
no rea\on why not but again you're using A1111 so YMMV , not sure where thats up too with controlnet & SDXL
Just left the pc on generate forever with character concept art + landscape + random artist names
Hi everyone,
I've googled and searched for help on this but can't find it so I thought I'd look here.
I subscribe to a system called "Magai" that gives me access to a few different AI systems, including SDLX. I've read a lot about making prompts, but this system seems to ignore a lot of them and limit what I can do. I was wondering if anyone has a suggestion for how I can maximize the usefulness of this system within it's limitations.
Is this where I say "you're paying for a service , surely it includes a support model? Why not use it?"
Do we know if Kappa_Neuro's SDXL Loras are legit? I just started downloading a bunch of his stuff and then discoeverd some comments that made it sound sketchy
should i be running sdxl in a SSD or this doesn't matter?
generally speaking it is best to store your models on the fastest drive you have
alright ty
Yeah, but it's just one dude and he recently got on the news and has blown up. He's losing his mind busy so I'm trying to find another way. I am reaching out to him though.
This is the AI Revolution, everyones jumping on the bandwagon and makingthemselves CEOs, thsi si the way ;o)
and appologies if anyone thinks that sounds harsh but if I'm paying for a service then I expect the requisite level of support. I don't really care about your personal issues etc, you should have built that in your business plan.
If on the other hand the servic eis free then I will take anything 🙂
Hi guys, I am looking for a propmt to generate landing page designs, but I can not specify an instruction with which I can start, the ones I have done have given me very abstract designs
But i don't male it
Does SDXL include physics in its model? Because often the lights are wrong, or the shadows are wrong
It's trained off images
Use at your own risk and make your own determination. They aren't virus' or anything, just some might be weightless lora's.
Apparently he said it was an error and has been fixing them. So, just test with the lora on, keep everything the same, and test again with it off. If you get different results, then that one is working again.
Is there a comfyui workflow for testing Lora’s in an xy plot fashion?
hi,all
I was wondering. If AI could include physics and a proxy 3D model representation, it could generate physically correct images.
just add more parameters
can be improved, i think
but, what 's the animal like? the first
the beast is certainly there because of the artist mentioned in the prompt
because my prompt has 0 prompts for beasts actually
I don't think it's quite that simple
yeah, it isn't
generally more parameters means a model can 'remember' more things
so it's capacity for 'guessing correctly' goes up
well, firstly you'd need to create the 3D models
my wet roads always have an excess of reflections
and that'd be a whole lot of 3D models
it might also be a model issue
but the problem with stable diffusion -> it's approximation
and then, how do you teach it physics in such a manner that it correlates it with visuals?
you can't
yes
afaik
well you could
well, with the low parameter count of stable diffusion models for certain you can't
but you'd need a lot more things going on
the bot here, nightcafe and clipdrop seem to generate based on somewhat different parameters. The bot here for instance, half of the time it makes a correlation between "wet ground" with "the user wants rain too".
people need to understand how these models work. language models aren't image generation models, and image generation models don't exactly know what they're looking at, they're just turning noise back into the image they think it is given their input parameters
they don't know physics or anatomy or anything else
for example if i do 'bedroom on a rainy night' you bet it's going to be wet inside
the bot here is probably optimized to interpret user input in a more sophisticated way. that doesn't mean stable diffusion itself is doing it
we both know that's not how it supposed to go, but most, if not all models, simply don't know that rain doesn't go inside
well these things that seem sooo simple to outside observers are things the researchers have considered. it's not like they just forgot about it. didn't look into it
the model itself has to 'learn'
"oh hey, why didn't we think to give it a bit more training in order to vastly improve it's abilities? oh well, maybe next time"
and most of the time it gets it right, but it's the same as the wet t-shirt question for large language models
without temporal and physics awareness, some things are really hard to grasp for ai models
well they're literally replicating the patterns their training compelled them to recognize or understand
they know not what they're actually doing
it's kind of crazy to think about
yeah, you're right about that
the cool thing is, they try to 'resolve the equation' to the best of their ability, and we're just making up crazier stuff
i mean, i've been trying to add layers of insanity to my prompts, but sdxl pretty smart
well people need to understand they're tools, very impressive tools, and useful, tools. they're not perfect, and just like with any tool, we need to understand both their strengths and their limitations in order to optimize our usage of them
it's crazy how randomly deficient things like gpt-4 will be with certain things
try giving it a long list and having it organize that list
it'll organize it, sure
randomly leaving things out. not telling you. it doesn't even know
that's llm's
they're pretty impressive though
but you're correct, they have limitations and strenghts
Ok will try it but last time I tried I didn’t see a Lora I put for xy plot
but the speed at which it goes atm, is just ridiculous
well same thing is true for stable diffusion, or midjourney, or anything
i mean, 3 years ago, generative ai was at best, regarded a joke
I do believe the future of most ai stuff is "an ensemble of experts" as stability puts it
I don't use it myself, I just know that's the main xy plotter pack, and linked to the lora example
maybe some huge monolithic beast gpt style models. but mostly specialized expert models
just seems more efficient in almost all ways
Guess we’re stuck with auto1111
I guess AI works best when you give it a hand. Like you input some starter shape first.
what do you do with the ipadaptor model output?
because if I hook that up to the sampler it gives me the nan nan nan black images
Is prompting different in sdxl
depends on model
best would be finding a model which responds to the prompting you're used to
IPAdapter model output does go directly the ksampler
well wtf is wrong with mine then?
I'll have to look into this
because I seem to remember it working before
make sure the correct models are loaded in each node
it can be
so by default it's set up to just run the same prompt in both of them
so it's just extra tokens to be inserted, but i'm not sure, as i've just heard, not studied up
probably
but then if you want to separate them you can
if i'm not mistaken, the stuff from L is added to G
G and L are what opens up the unet to the massive parameter jump of 2.6B.
"road with curves". Adobe generates neatly roads turning left or right. SDXL interprets it like, literally, a road with curves, making distorted curves very often.
bro, just use that
as i said, that's what i've heard, so if i'm wrong i'll immediatly have to shut up
i do know the unet is way more intelligent with the 2.6b parameters tho 🙂
I've been reading about things some on huggingface
the official documentation stuff
but not like I really understand it all
but helps demystify things a bit
oh no... i'm starting to figure out how to use the neutral prompt ANDS
To make visual sense of it, I think of it like interlocking my fingers. One hand is G, one hand is L. They both contain information and they interlock with each other, each adding their own weights together into your final image.
this is already happening
yeah, that's what i did undeerstand from my own reading, but someone corrected me on it, so uhhhh...
except on an absolutely massive scale
yeah, sdxl = mystery chicken. it's awesome, but do you really know what it's made of?
real estate law, corporate finance, analytics, medical expertise, all getting isolated to fine tuned custom trained models
this.
hm... I'm going to try something. Use Adobe as starting image to see what SDXL is going to generate from it.
ohno, i have controlnet working again...
Google Docs
BlueFaux’s CLIP G vs Clip L Quick Dive Hey everyone! The biggest thing I wanted to look into as soon as SDXL launched was the difference between CLIP G and CLIP L. This one is streamlined as I don’t have time to do a deep dive. I set controlled variables for basically everything and only ...
g for what you want to see, l for what you want the style to be
the research side of these tools is amazing
dudes that made this, are all nothign short of amazing
lots of hard work involved
i still remember the CLIP paper
brief overview
We’re introducing a neural network called CLIP which efficiently learns visual concepts from natural language supervision. CLIP can be applied to any visual classification benchmark by simply providing the names of the visual categories to be recognized, similar to the “zero-shot” capabilities of GPT-2 and GPT-3.
appreciate that link. pretty good for visualizing things
and yeah, CLIP has intelligence to understand what its looking at
image classification has come a LONG way
https://i.postimg.cc/y8XXJpvr/tentativa-8.jpg I said "make this photo real". https://cdn.discordapp.com/attachments/1148388684836638820/1149015748270886972/make_this_scene_photoreal_style-Photographic_width-1354_height-773_aspect-16-9_seed-0ts-1694017068_idx-0.png Wasn't expecting this. It made something like a bealtiful photo real concept art
now, should it know a room shouldnt be rainy inside? well i think that misunderstanding is what makes SDXL suprise us
funny it kept the logo, trying to write something there
that stupid logo is always on things
i mean, i like the happy accidents more actually 🙂
100%
I don't want it to always know what should be
because then it wouldn't make things that shouldn't be
I feel like I have to run updates 3 times a day in order to keep up with things
Zero shot prediction is what makes these models so damn good now
notable outputs from my character concepts +landscape + artists
(fact, i did not prompt for the hulk)
those are pretty good if you're into quality images
excellent shit
that hulk just likes to show up
i referenced the artist that did hulk comics
the prompt is literally art by <artist>, character concept and landscape
so it's just testing how well the model knows about artists and their styles/concepts
SDXL so good with skinny prompts
yeah, i also love throwing in comic artists
power control
right, i was about at 20% of testing old prompts
make trump great again, like when he was selling steaks pls
lets get back to business 😄
it gets confused
putting in the work
just 120k files, so uhh..yeah about 90k images/prompts to check xD
I don't follow politics, it's all a sham. I live my life and want to be left alone.
That's as much into politics as I get
https://cdn.discordapp.com/attachments/1100170312106127410/1149018309791060018/Place_some_tall_trees_on_the_roads_sides__Morning_fog_atmospheric_haze__Some_butterflies__style-Photographic_width-1354_height-773_aspect-16-9_seed-0ts-1694017712_idx-0.png Hmm... I was expecting photo real. Yet, it made some artistic style that looks like a dream.
don't love trump? you must be one of those liberals
it's pretty good at reading the road's shape
that's the logic people follow yeah
or they don't live in the USA lol
I just got SDXL working inside ComfyUI. Fun.
it's just tribalism and lack of awareness
tribal who?
indeed
well I'm more concerned with how to get the model output on my ipadaptor to work
I swear it worked before
you using ait?
lol
should I?
whats your comfy commit, updates been breaking ipa for me
wait? The text to image works better if we use imperative language?
oops
maybe that's what it is
there's all sorts of documentation on this stuff
early in my ai endeavors
Beeeeeaaaaannnnnssss
why we even talking about his here, lol, keep as far away for me as possible
guess what the prompt is for those
"young immature teenagers who are democrats, that look like an anti trump supporter, due to mind control from democrat psyops"
oh, ok
lulz
guess what the prompt is for those
your bulging bias is showing, zip up those pants!
lol very right. Someone decided a simple statement I made meant I was sitting on one side of the political wheel
what a waste of time
hahaha
-prompts SDXL for "bulging bias"
solesbeedude is going to be on the wrong end of sick burns all day now
Read followfox ai blog on Substack ... excellent insight into SDXL and Comfyhttps://open.substack.com/pub/followfoxai?utm_source=share&utm_medium=android&r=1cz4g9
Followfox.ai is an AI venture studio focused on small AI models running locally or on the edge. We are fully open-source, and in this blog, we share about our exploratory journey, providing useful and helpful details on our progress. Click to read followfox.ai’s Newsletter, a Substack publication.
i can take a few guesses at what CLIP interrogator would say
got a link for the lazy?
what is this?
lets get back to seagal yeah
@glad grove my eyes need bleach sensei
Followfox.ai is an AI venture studio focused on small AI models running locally or on the edge. We are fully open-source, and in this blog, we share about our exploratory journey, providing useful and helpful details on our progress. Click to read followfox.ai’s Newsletter, a Substack publication.
hello, Do anyone know if there is a way to change model output name of a Lora?
someone use this: ""mix abandoned new orleans, demons, and stranger things"
see what comes out
here you go
color photo of a dilapidated building in abandoned New Orleans, with eerie red graffiti covering the walls, shattered windows revealing a glimpse of darkness inside, and overgrown vines creeping up the sides. The air is heavy with a sense of foreboding and mystery, as if lurking demons and supernatural forces are just out of sight. The flickering streetlamp casts an eerie glow on the scene, illuminating the decaying surroundings and adding to the unsettling atmosphere. The camera captures the scene with a vintage Polaroid camera, using expired black and white film to enhance the haunting vibe. The lens captures the details with a slightly distorted effect, adding an otherworldly touch to the photograph. It's as if the spirits of New Orleans and the Upside Down from Stranger Things have collided in this desolate place, creating a unique and chilling visual story
^
what helps is "prototypical red ambient glow, synonymous from stranger things"
its focusing to much on the building
Tell LoRA I Love Her!!!
Oh no, so much ketchup
ladybug
alarm clocks are frighteningly realistic with sdxl, too
I have this thing, pic is nice tho. 🙂
Beware of her shoulder patches
I love the prompt haha
while we're here with horror themes, the daily is also nice with sdxl
"A zombie emerging from a swamp full of floating teddy bears."
And I also hate it with passion, alarm clocks hehe.
... and simply adding "(((gundam)))" and with anime style 🙂
who did you call a stupid bot? draw, stranger!
this flow seems to be better than reimagine: I generated a 1:1 image -> uncrop it in clipdrop -> download -> use it to generate a very similar image in widscreen.
Make one of Biden in a jail cell with a hooded axeman about to serve his punishment according to laws of the king.
This actually does a pretty good biden for sdxl
Biden is a pathetic fool, so yeah there is that but I do have dreams of seeing it happen. At least I can with SDXL.
LOL
Looks like an old Eminem.
If that ever happened I would check to see the year and see if I were in the movie Idiocracy.
We need a wrestler as president to cap it all off
Evil Trump
SDXL doesn't do orange jumpsuits very well in base
haha
I tried and no go on Trump
this is dreamshaper bro
Looks more like lounge ware
And my trump pictures are done with a private lora
dreamshaper is SDXL?
there's one based on SD XL
kick ass
and also with previous SDs
😆
WHen I train I train stuff on base so it works on all of them much better
👍
monica lewinksky coming right up
or maybe, loaded that workflow in and I'd moved things around since then
not bad
sounds pretty slow to me
I don't know, this is a pretty flattering rendition of Janet Reno
I'm feeling inspired
Anyone know why all my SDXL images generate like this? Just a box of colors and textures
Probably your sampler and scheduler choice
try dpmpp_2m_sde with karras
with at least 30 steps
hmm. I don't know if the sampler would make it that goofed.
but i copied the generation settings from the image in the back.
changed it and still nothing
could be an issue with vram... you may need to add a command line arguement somewhere... I see that you are using swarmui... I don't use that so im not sure what you should use.
there is a swarmui channel in this server... maybe ask for help in that channel
dog ❤️
Followed this upscale method. Works gr8! https://www.youtube.com/watch?v=CxB47DMEyYQ
In this ComfyUI tutorial we look at my favorite upscaler, the Ultimate SD Upscaler and it doesn't seem to get as much attention as it deserves. It is a node is easy to add to any graph, but I also explore how to make it so we can choose whatever scale factor we desire without needing to calculate the optimal resolutions required for the best re...
It's interesting that I ask for cherry blossoms, SDXL creates images containing japanese style houses. But when I input that generated image and ask for houses without a specific country, it chooses houses with a style mismatching the original image.
well, is it supposed to interpret the input image and understand that it's japanese style? what are your expectations here?
it's not an unknown mystery how this stuff works. if you have unrealistic expectations about what the model is going to do you'll be disappointed when it does what it's trained to do. you should read about how it's actually trained and what it's actually doing
consider the fact that stable diffusion's first public model was released barely a year ago. text to image ai generation wasn't even a thing on a large scale until maybe 18 months ago at most. you're really focusing on some pedantic minutia here.
bing things it's stable diffusion
Are there any prompts that allow my model to randomly generate hair color?
(((random hair color)))???
wildcards, then a local text file it calls which it pulls the different values from.
I haven't done it in Comfy, but I did a lot of that in Automatic1111.
_haircolor_ hair woman standing with wolf.
good idea
if you use ComfyUI {wildcard}-syntax is build into it natively. wildcard text files are not. so you could write in your prompt field:
{brown|blonde|red|green|yellow} hair
and it will randomly select a color for each image you generate.
can it natively call a .txt file as the Dynamic Prompts extension allowed? (I'm sure there are packs that do that, just curious if you know native)
no, txt file are not supported natively. but there are custom nodes that add support for it.
got it, perhaps will add that next after finishing this flow.
I did use it pretty often
I use Mikey Nodes by @west breach https://github.com/bash-j/mikey_nodes.
a couple of Mikey's Nodes support wildcards. for example the Wildcard Processor node. just create this directory: /ComfyUI/wildcards/, put in your wildcard txt files in there and use them like in a1111 in Mikey's Wildcard Processor node with __wildcard__. you can even sort them in subdirectories etc. works great.
ah nice, already got that pack installed
@upbeat summit can i put wildcards in the txt files?
yeah it also works in the Prompt with Styles nodes. very easy to use and a must-have feature for me too
yes, of course - if you use a wildcard processor like from Mikey Nodes.
i mean, can i make a wild card that will call another set of wildcards
wildcardcepsion
good question - I don't think that currently works. It worked with Dynamic Prompt for a1111. So that's maybe a feature that can be added.
squeezing what i can out of @upbeat summit's Magnum prompt 😛
protovision
@upbeat summit speaking of wildcards, what about Loras? how can it be done with comfy?
you either use the native Load LoRA node - one node = one LoRA.
or
you use a custom node (like the ones from Mikey :D) and use them inline in your prompt like in a1111.
Tom Selleck <lora:tom_selleck:1>
i was looking to see if it's possible to load random lora from a list
then you can do different subjects
I did this with embeddings in a1111 a lot. Not sure it works with LoRAs in ComfyUI custom nodes yet. It should technically be possible I suppose. I will check if it works and if it doesn't see it it can be implemented.
me too, 2.1 times 🙂
Just tested it. Works perfectly with Mikey Nodes. I created a loras.txt file in the Comfyui/wildcards directory containing the full lora syntax:
<lora:foo:1>
<lora:path/to/my/lora:1>
<lora:another_one:1>
I than called the wildcard in my prompt using __loras__ and it will randomly pull a lora from the loras.txt file
you trained a lora in comfy?
ah ok, cool, nice work! looks great
interesting, the digital painting look has been replaced with a photographic look
no sorry. A different lora style
sci-fi ridley scott like
not the paint from before
wheres the michael bay lora
this is the digital paint as before Lora + Lora of me
I'm trying to create a loRA using RunPod, but got this error. anyone know what to do?
CalledProcessError: Command '['/workspace/kohya_ss/venv/bin/python3', './sdxl_train_network.py', '--enable_bucket', '--min_bucket_reso=256', '--max_bucket_reso=2048', '--pretrained_model_name_or_path=/workspace/stable-diffusion-webui/models/Stable -diffusion/sd_xl_base_1.0_0.9vae.safetensors', '--train_data_dir=/workspace/JP-rfzstyle1/img', '--reg_data_dir=/workspace/JP-rfzstyle1/reg', '--resolution=1024,1024', '--output_dir=/workspace/JP-rfzstyle1/model', '--logging_dir=/workspace/JP-rfzstyle1/log', '--network_alpha=1', '--save_model_as=safetensors', '--network_module=networks.lora', '--text_encoder_lr=0.0009', '--unet_lr=0.0009', '--network_dim=256', '--output_name=JP-rfzstyle1', '--lr_scheduler_num_cycles=10', '--no_half_vae', '--learning_rate=0.0009', '--lr_scheduler=constant', '--train_batch_size=5', '--max_train_steps=2400', '--save_every_n_epochs=1', '--mixed_precision=bf16', '--save_precision=bf16', '--caption_extension=.txt', '--cache_latents', '--cache_latents_to_disk', '--optimizer_type=Adafactor', '--optimizer_args', 'scale_parameter=False', 'relative_step=False', 'warmup_init=False', '--max_data_loader_n_workers=0', '--bucket_reso_steps=64', '--gradient_checkpointing', '--xformers', '--bucket_no_upscale', '--noise_offset=0.0']' returned non-zero exit status 1.
1 x RTX 3090
32 vCPU 125 GB RAM
50 GB Disk
75 GB Pod Volume
shleykza/stable-diffusion-webui:3.0.1
first time doing this
I use 4090
24 gb
but before launching train
I give 2 times
fuser -k 3001/tcp
because Auto1111 takes half of the GPU mem
so this command shuts down auto1111 and frees memory
I'll do a video in the next days
ok yeah not really following lol
all this is confusing haha
is that a quick fix i can do with my current setup i have going?
yes just fuser -k 3001/tcp 2 times
where exactly?
and then what after i do that?
If I want to use the controlnet for sdxl, do I only have to update the controlnet extention and then download the extra sdxl controlnet models? Can the 1.5 models work with it like scribble?
can u tell me ur it/s and what gpu u have? (in A1111) I have like 1.8 on GF 4080 [with sdxl]
i'm using the fooocus google colab (moonrise version); where do i put the models and stuff i want to use? like what's the folder called and where is it
Are people using the RealVisXL model yet? So far it's pretty good imo. My character loras from the base model have some weird skin texture, but if I retrain on realvis then that goes away. And hair seems a bit plastic looking unless I add "plastic hair" to the negatives. Model is here btw https://civitai.com/models/139562/realvisxl-v10
I've not yet used RealVisXL a lot but I noticed similar things
I'm sure they've trained on a lot of skin detail to compensate for sdxl being weak in that area. I just noticed that on base sdxl hair can be a blurry mess and on realvis it makes sort of hair strips like a painting instead.
yeah there's definitely something going on
I haven't done a lot of prompting with realvis yet, so I can't say if it can be reduced / controlled a bit
Yeah it's definitely a real thing. I never saw hair like this from sdxl base
hmm yeah. that doesn't look right. depends if you went for last gen game character hair 😉
But you can get rid of it with prompting at least.
