#✨|sdxl
1 messages · Page 169 of 1
you should try to mass ping staff again for better results
stability staff too?
Ok, hi 🙂 - where do i go?
yea devs,comunity guides and even elon musk 🗿
Jeez finally got a 13. Sdxl doesn't like generating "13"
lol
just type your prompt, i'll make a few key changes and then send you the output 😄
I was told you can create images in this discord channel though??
correct, i'm even more intelligent than GPT 4
Ok, so you can't then?
The bots are down. Should be up inside of a week or so with SD3
Yeah probably after the testing period it will be an SD3 bot
cant wait for next week where they invite more people
I wonder if it will be 10 or 12 people 
I did scared animals looking at all the @ symbols, but it just made it an A
lmao
having multiple people doing actions really isn't an SDXL/SD1.5 thing without regional prompting
okay I used highresfix which gave the mane a female face and vise versa
this is how it looked before
im inpatient, SD3 would take these prompts to life
What's the original prompt for that?
I lost it but its basically about a man holding a knife behind a woman
like some horror film
exactly lmao
I don't even remember having gun in my prompt but it still puts it in there
but there was, then it would make sense why it generated like that
@copper kraken Please dont mass ping all the staff, simply reporting a message is enough, thanks.
No trigger world. Play with strength. "The Classic" style is a fusion of historical aesthetic with modern technology, where traditional elegance me...
Roger, sorry. What's the best way to report it?
You can react to a message with ⚠️ or right click Apps->report to staff
Hm, when I tried hitting the exclamation reaction, it appeared then immediately disappeared
happened both on my browser and on the app
(that s proof that it is working)
Yeah that means it works, the bot takes the reaction and sends the messages to a sepperate channel for us to review
oh! ha, I thought that meant it didn't work.
good to know. thanks.
i thought the exclamation might not be working if it was just me sending it without it being in response to a specific message
As usually, it's just 2 people there with it. Not one using it on the other. Maybe censorship of the model?
idk DreamShaperXL
also I'm using lightning which might hinder prompt adherence further
some sdxl models are better at darker scenes, but cascade base model is exceptional
thats not how censorship of these models works. it happens in the training set, and it happens in output classification. they can't lobotomize specific knowledge out of weights.
I kind of expected you'd know better, but i'm often mistaken so /shrug
just basic token bleeding. classic attention problem from the text encoder
i still like the ada lovelace sneaking up on alan turing images i was doing with regional prompts. probably could make her look a lot more murderous and knife wielding
Hello hello !
I'm trying to generate something veerrryyy specific with SDXL but it seems to be very limited in this regard
fun fact. Ada lovelace was the FIRST computer programmer. Since she wrote code that would run on Babbage's difference engine
||I want to generate a pair of slingshot bikini on a female model ,but the character either goes out nude or with simple bikinis no matter the lora or the model that I use , how could I troubleshoot this, do I need to train my own model ?||
lol i want to generate something VERY specific ||its porn||
you need to prompt for huge monkin bazonkas . might help
as in ? I'm a newb
type "huge monkin bazonkas" in your prompt
eh
Hah anyone ever tell you that you have an abrasive personality?
do I need to train my own model ?
yeah but its usually after i tell them how wrong they are and while they're being overtly defencive.
Well when you start off with insults out of nowhere, maybe that's why they get defensive.
the whole entire "it's a censorship problem" conversation has long past the civility lines. You're on the hostile side. I'm not sure why you're surprised that someone might be tired of it by now.
festivalman do you know a solution to my issue ? thank you !
If you feel that civility needs to end on a happy place with silly pictures, you need to take a break from the internet.
I just expect honest discourse but that's not what happens during the "censorship" conversation. People are entrenched in hyperbole and throw out wingers like "they edit the weights after training". instead what we get is ad hominem in most cases.
I mean, if you can't handle a little jab that would be tame by BBC standards, maybe you're not the one cut out for the internet?
thats fooocus. Upscale/variations are img2img for them yup. The fooocus readme is really useful if you're first exploring this app
All these projects tend to have a readme and if they don't the author is being lazy (imo).
thank you !
looks great
thanks
wait a sec haven't I seen you masslevel
I've been on this discord for quite some time 😄
Lies. You aren't even here right now.
😛
yeah me too, that's why I'm asking
and I think I have
ideogram should license SD and have this option on their website.
Yeah an ideogram to SDXL pipeline would be great
1 thing where sdxl really doesnt shine is textures in clothes, they always look kinda messed up
what in particular about clothing texture is messed up?
Yeah I don't there's anything wrong with SD textures, SD is only as good as it's user
I've noticed this especially with cascade. if you don't specifically tell it to give you good output, it'll give you an average non-sharp image
maybe that's true of sdxl base as well and i'm just too used to the finetunes where I don't need to do that.
"ridiculously enormous"
yes
Fantasy XL to bring your images to the next level. Play with the weights, and using, and not using, the activation word in your prompt. Feeling gen...
oh its general awareness
Removing unwanted artifacts really helps
wow thats nice
that one is way cool
perfect prompt example to generate everything ? who got an example
everything and negative prompt nothing
Does sdxl inpainting work? Are their specific models for it similar to 1.5? I tried searching but couldn't come up with anything definitive.
Ancient costume figures, True style, Play Chinese musical instruments, White background

Here is the image you requested.
It does work and doesn't need a specific model. I think it's better to use the model you used to create the image to start with.
Just search for posts of mine in this channel and use the new differential diffusion workflow. Works with any sdxl model and you don't need anything special other than making sure you blur your masks so they have a gradient and make sure you use 30+ steps
Good afternoon, my name is Maxim. I'm from Russia. For a school project, I created a short cartoon using neural networks only. Please take a look and support. https://youtu.be/ruANV24h0Dw?si=Wr7Nbyo0Wr2_QB3-
Короткометражный мультфильм "Парк" - невероятно увлекательный короткометражный мультфильм, созданный с использованием нейросетей.
I like these better than my emoji in a desolate landscape phone wallpaper example. But yeah differential diffusion is game changing and super awesome. It really removes like 70% of a workflow that you'd normally spend un-fudging it with delicate resamples
Again, can't stress it enough to people that want to play with it: you HAVE to have gradients(blur works) in the mask for it to work right and HAVE to have decent step counts.
Did did you download it somewhere?
Or if you did it yourself, could you share it?
I created it, and shared it in that image.
could you share a json?
It's in the image, just drag and drop it
thanks it worked!
You can delete the fooocus nodes, or just leave them disabled. I thought they made it worse.
which nodes are focus nodes?
going back to XL lightning is a shock to the system when you can create 8 images faster than 1 in cascade
a tiny plane with a huge banner floating after its tail with the colorful word "dahmane !!!" written on it, seen from the beach, many jumping women looking at it with raised arms, blue sky, sunny weather
You should prompt for a large plane, the tiny ones do not seem to work.
a plane with a huge banner floating after its tail with the colorful word "dahmane !!!" written on it, seen from the beach, many jumping women looking at it with raised arms, blue sky, sunny weather
a large plane with a huge banner floating after its tail with the colorful word "dahmane !!!" written on it, seen from the beach, many jumping women looking at it with raised arms, blue sky, sunny weather
a large plane with a huge banner floating after its tail with the colorful word "dahmane !!!" written on it, seen from the beach, many jumping women looking at it with raised arms, blue sky, sunny weather
seems ok to me

if you intend to use a bot, that is i think currently not available on this server
a plane with a huge banner floating after its tail with the colorful word "dahmane !!!" written on it, seen from the beach, many jumping women looking at it with raised arms, blue sky, sunny weather
a plane with a huge banner floating after its tail with the colorful word "dahmane !!!" written on it, seen from the beach, many jumping women looking at it with raised arms, blue sky, sunny weather
🍀 What if... St Patrick's Day had a Fashion Show? 🤔 🇮🇪
still wish clothes would look better 😄
very uncomfortable - nice work 🙂
创造一只小狗
a puppy?
Beaches, beautiful women, bikinis, coconut trees, yachts
Can you tell me how it was made
Can you tell me how it was made
Beaches, beautiful women, bikinis, coconut trees, yachts
创造一个海滩
Prompt : photo of a (baby french bulldog:0.5) that resembles a [(frog:1.4):2], cinematic lighting, dramatic lighting, high contrast, low key lighting, high key lighting, chiaroscuro lighting, Rembrandt lighting, split lighting, rim lighting, back lighting, side lighting, top lighting, bottom lighting, natural light, artificial light
Hello,excuseme,where we write the prompt to make images here?
You should ask them, I'm not sure how to use it either
Ya,that i did,but no answer for now.
The bot has been down for a while, you have to run SD locally
You can use seaart or playground ai, just search on Google
Ok,thx,i cant do it cuz i havent a nvidia or a amd graphic card of 4gb,but thx anyway.
Ok thx
Nether do i, i run Comfy ui on kaggle, you get 30 hours every week, i also have other alternatives
Could you write link to that site pls?
Kaggle is the world’s largest data science community with powerful tools and resources to help you achieve your data science goals.
thx man
Which one you use most?
Comfy for best quality but i use my hours efficiently, seaart is also good
Diffu,i cant find sd xl on kaggle.
@round cargo you need a good notebook, just search for Comfy ui on kaggle and look for a good notebook I can't help you with that
Ok,thx
Search for this exact notebook in the code section, it's the one I'm using
I wrote comfy ui 823f37 at code- search public notebokk but nothing hapes
How is the VAE Encode & Inpaint Conditioning looking like? I always get a very harsh edge when doing inpainting.
Import it
Thx,now ho wi use it?
Upscale
On kaggle, make a new notebook then import the one i just sent, don't forget to enable the x2 T4 gpu in the accelerator, start a session then run all the code one by one from top to bottom, the last one is unnecessary, they'll give you a link to the Comfy webui
Yeah that was from a mobile app I'm using called Artful, limited customization, I'm saving my kaggle hours😁
And it actually uses Juggernaut
the x2 t4 gpu? i havent a grahic card.
What i do whit the comy ui link?
I meant on the kaggle website, have you used kaggle before, if not you need a tutorial, search for one on YouTube
Nope
Forget kaggle that's way too advanced for you, if you're using Android search for an app called Artful on your appstore, i donno if the app is on iOS
I made the notebook,now i choose new dataset to upload the file you sent me?
No, theses a drop-down called File somewhere at the top, click it and then click on import notebook then import the one i sent
Done,now?
See the panel to your right that says add input, scroll down to where it says accelerator and select the x2 T4 GPUs, make sure internet on is enabled (it should be somewhere below where it says accelerator), now you're ready to run
The examples are literally below the workflow.
...I scroll down but i dont see there some thing called x2 t4 gpu.
Bro there's a separate panel to your right, scroll down to where it says accelerator (it will show none because it's not enabled) then you just click on where they say none and then select GPU T4 x2
Yes bro,I see output sessions options add tags shedule a notebooks to run save code help,but i dont see nothing called acelerator.
Bro i really wanna help you but you're like completely new to all of this, just download the Artful app i mentioned you'll be able to run JuggernautXL, they even have dalle-3, Kandinsky also the base version of SDXL, i highly recommend you just get the app
Ok,but it is not mi fault if there dont appear anything called acelerator,i did all you said.
And i wanted use it on pc cuz my phone is really slow.
Lol
Ya i see all that but that acelerator dont appear on pyton option
You don't have much prompt control
Bro tbh I don't think you'll be able to run it, you need at least some coding knowledge to understand what's going on here
This
actually I have, i think they're running a1111 through cloud, all i really need is to be able to change the scheduler
Model In general
Silly me, you need to sign in and also verify with your phone number, sorry i forget to mention
Prompt was "egirl" ☠️
I will back later,thx for help.
I make use of Syntax, it gives me close to dalle level prompt control
To achieve the "Cute Collectible" style effect, you would describe the subject followed by the phrase to be included, for example: a Man Drinking a...
guys... 😔
/bangbande
ComfyUI session from this morning. Some Final Fantasy-inspired scenes of wizards defending their cities from a hoard of invading dragons.
Here is the image you requested.
A young anime Dalmatian dog, different scenes, different actions, white background
@sturdy kindle A young anime Dalmatian dog, different scenes, different actions, white background
boys play ice hockey in a ice rink
Located in the city's underground black market, Chinese style, here to sell a variety of things, strange, winding, lively and mysterious. From ancient spells and exotic herbs to legendary artifacts and mysterious exotic animal specimens
my DnD characters with your Lora:
great work as usual 😍
Through SV3D...
Wow that's really neat
Happy Spring you all wonderful beings! - https://twitter.com/HikariUchu/status/1770116278075945328
Im back man,now what i do? i verified acc and now yes i see that gpu4 t2
Ok select the GPU x2 T4 then you're ready to start running, the notebook i sent uses JuggernautXLv9, you can add any other checkpoint by copying the link from civitai or huggingface, run the code one by one by clicking the play button by the sections of code, as i said yesterday you don't need to run the last code, only the first 3 cells, they'll give you a link to comfy webui
Wich one i must click?
@round cargo bro not there, you need some coding knowledge for what comes next, go watch a tutorial on how to run code in kaggle
Bro where did you run SD previously
Just go to the seaart website it's beginner friendly, comfy ui isn't
Germs beach
Is Syntax an app or comfy node?
You need a node to use it in comfy, i think its the style prompts node you'll find it on git
Thanks
I hate this 🤣
yeah it's meant to make you uncomfortable
Sorry the name for the node is comfy ui prompt control i just checked
Cool thanks
that's an epic number of traffic lights 😄
looks like the intersection from hell
yeah lmao
Thanks brotha.
How does these make you feel
@gloomy lark btw check this out, regarding upscaling...
the latent upscale is indeed vastly worse than the other method even with the vae encode decode sequence... so i bet if you're liking the outputs more with the latent upscale, the reason might be the extra noise it's introducing
bet there's a better form of noise you could inject to get what you're looking for
(or maybe not, which would be interesting in its own right imo)
So here's a few from today using the 20 steps dpmpp-2m sampler to latent 1.5x area upscale to 20 step dpmpp-2m sampler with 0.50 denoise. I was reading online that it's pretty much the cleanest and fastest way to upscale. Anything better is going to take a LOT longer. That make sense if I'm in the interface doing 1 image that I'm working on, but for this particular purpose I'm doing 6 images 12 images against 4 models and want the results back quickly. Best bang for the buck.
yeah, very much depends on what you're doing
retaining the likeness of a person i've found def works better with tiled if you're using a lora etc
or ipadapter
but yeah the rest will take a hell of a lot longer
I also realized that I can do dpmpp-2m-side-gpu in the same time, but it bombs the api render. Works great in the comfyui interface but errors when doing it via the api where all other settings are identical. That better sample takes care of the occasional not perfectly round eyeball etc. I'm still working on figuring out why it's doing that.
that or just straigh tup using one of those 4x upscale models which really aren't too shabby
huh
Yeah this skips the upscale models which would be great, but it's just the speed thing. Perfect for figuring out prompts rapid fire while I'm out of the house. 🙂
yeah
what i'm curious about is alternate noise injection vs the area upscale
maaaybe that's law of first optimization stuff
Doing the latent needs high denoise value. It's what I figured out anyways, but found lots of people saying it's needed or it ends up looking pixelated from the latent upscale.
yeah, did you see those images in detail above
the latent area upscale one looks like hell
i did also check the NNLatentUpscale version, it looks a lot like the model upscale route with the vae vae sequence
except just a bit less detailed
so i'm guessing it's gotta be something to do with the shittiness introducing noise to denoise
Oh yeah you need a denoise ksampler after the latent upscale. Without it, it's just a mess. 🙂 but even with that, only 7 seconds total
yep but what i mean is...
the upscaler models really are very good these days, if all you're looking for is blowing the image up without pixelating... but...
we usually want to "upscale" by adding extra stuff to the image in the process
which usually requires adding a bit of noise after upscaling and then denoising
so that terrible latent upscale is the source of that noise
what i'm curious about is if the structure of that noise is better than other sources of it for these purposes
and if that's the case, i'm curious about finding a way to add some parameters onto that style of latent upscale
ah
i see a couple of "upscale latent with model" nodes, but then they don't have any options for upscale amount etc. obviously i'm not understanding how those are suppsoed to be used.
you sure that's not upscale image with model?
if there's a latent one i'm missing it and would def like to know lol
i've tried to grab every latent anything node and every sampler node, but i could've missed one
(not that i know wtf i'm doing with them)
ah, yep upscale image. still can't use it though even if i did vae decode/that node/encode. it's missing settings, or clearly needs some kind of other node to help it.
I tried adding more latent upscales and ksamplers and i was reminded of my first tests with this. one 1.5x upscale and it looks awesome. more than that, and it falls apart.
oh wait is that workflow not working?
or do you mean something else
upscale image with model?
this one. when I try to use the "load upscale model" node with it, it won't connect. so I'm clearly not using the right supporting node with it.
oh!
cascade first, put through 1.5x latent upscale and then dpmpp_2m with 0.5x denoise with sdxl model. does a great job and it's fast.
what scheduler?
lemme know if install missing custom nodes finds it, if it doesn't load the correct node here
yeah, you're right, that works. but how do I specify the scale_by?
you can't
I think I combined 2 different gripes into 1 with that. 🙂
those models are neural networks trained to do 4x only
lol
so you gotta follow with image scale by
ok let me try that.
i usually do lanczco s and 0.375 to get 1.5x
i haven't tested everything on the planet, but i did spend a while looking through some that were available and of the ones i tested i've found this to be one of the best
4xLSDIR and 4xLSDIRplus depending a bit on the image type, haven't been scientific about it
hah yeah ok, doing it with the upscale with model now and it's taking an insane amount of time
most of them tend to smooth the image
huh
it should noly be a couple seconds
oh waiiiit how big is your image 😄
let me remove the "upscale by" afterwards.
maybe the model is already upscaling to 4x.
it's definitley faster now, although still very slow. i'll see what the res is afterwards.
original is 1152x768, so if this is 4000+ res, then yeah...
wtf, of all the small irritations life has to offer... the order of the schedulers got flipped? lol
huh that shouldn't take long at all
what model are you suing for it
maybe some take longer, idk, i haven't timed them
hah yeah.. the upscale with model upscaled it by 4x.
that's why there's no resolution input.
it actualy looks worse than the regular upscale image by 1.5x
let me try a photo image, not a 2d drawing.
interesting
are you using LSIDRplus or something else
maybe the resolution does something wonky too, idk
now that i think about it i've generally just done 1024x1024, 512x, 768x
4k imaging will turn a 4090 into sludge.
yeah it totally borked it.
maybe i'm doing this wrong.
maybe i shouldn't be denoising...
at lal
all if i'm doing the upscale model.
actually yeah, need to do much lower denoise now that i'm not doing latent.
this took 6 seconds
yeah you need a lot less denoise
that or it's a good idea to use an ancestral sampler to get some noise back
there's a lil less smoothing with the base LSDIR (top was LSDIRplus)
so this is cascade, with the same method you're using, although with the siax upscale model.
the problem with cascade is that it has stuff that definitely needs denoising to fix. i don't just want it bigger with more skin texture.
those wonderful cascade swirls on faces.
yeah for sure
it's an oversampling issue
aimingfall figured that out
i've had some success with tweaking the denoise schedule
if you stop stage B after something like 6-7 steps it's generally better than any greater number unless you've modified the denoise schedule
cascade / upscale image by 1.5x - area method / 25 steps of dpm_2m_sde_gpu-karras / 0.5 denoise
hah even fixed the gap in her teeth
lol the original sdxl refine was the new proteus-rundiffusion that came out the other day, this is with dark arts images. everything is way more sinister!
awesome
the ones i tend to use the most are 0.45-0.6 denoise dpmpp_a, typcially 0.5
with karras
then sometimes dpmpp_3m_sde_gpu exponential when i want to preserve the original composition
with 0.5 or so
without the gpu version it changes it a bit more
i haven't been systematic in testing everything... so many fn permutations
hahaha this is with playground 2.5
loving that you're setting up a mini gpu farm to crank out shit like this 😄
btw since you're in comfy or were recently... i realized that the concat conditions with multiplier node is awesome for combining weird shit
its from the inspire pack
playground does really good bugs
yeah when playground 2.5 works, it's really impressive. be back in a few.
k
just play with the ratio in that multiplier concat node
if one prompt is dominating, then swio.h.pe
cat lol
then reduce its multiplier, and go back and forth till you reach equilibrium
yeah I'm assuming sd3 is gonna be a good amount more taxing, so waiting for higher render time images is gonna get old real fast.
and turbo is lame when the whole point is better prompt adherance.
*And the price is somewhat reasonable and accessible 💀
i also hope it turns out nvidia has already started production and accidentally mails me one
Imagine if Nvidia sells it for $4000
doubt they'll go that high
they still haven't announced specs on 5090 though right? it was just that rumor post of 1.5x memory speed and the same 24gigs of ram?
correct
also a rumor it's 32gb
also a rumor there's 50k cuda cores
all kinds of random guessing
I mean, the 4090 is like $2000 🤣 It's insanely overpriced in my humble opinion
well, we'll see what the speed increase is. if it's something really noticeably faster I'll get 1.
in terms of performance vs other cards, it's not, but they are price gouging
but it would have to be a lot faster than the 4090 for it to be worth it.
not like 30% or some such
I have to check the prices in Bulgaria 🤔 It's been a while since I've looked at GPUs
yep if it's double the speed and/or 32gb i'll be getting one
moores law hinted at 60% better
but if it's 30% with the same 24gb, meh
its all rumours though
yep
it's 47,000 stotinki per gig of vram over there.
So 470 leva? 🤔
give or take
That doesn't add up because the 4090 should cost around $5000 here then 😅
https://plasico.bg/komponenti/video-karti/filter-156846-156849 It will probably be difficult to read the Bulgarian but you can take a look at this store and compare the 4090 prices
yeah I'm just making stuff up. 🙂
😂 That flew over me
do you have your discord bot set up with comfy workflows? i forget
i wish comfy was even remotely usable on android
yep, it's all comfy at this point. took me a few days of troubleshooting, but I got the queueing system working. tonight I added cascade as well so it adds into the queue correctly. but yeah it's using comfy workflows for everything
just waiting on the hardware now
does it allow you to select a certain workflow, or alter any of the parameters?
nope. 🙂
you weren't kidding about PG2.5 and bugs...
yeah, that's playground 2.5 on comfy
res_momentumized with no momentum in ksampler
that is one of my new fav samplers, damn is that one interesting
it generates even wilder outputs than dpmpp_a
there are many things that cascade and playground have that are similar, so it just modifies it a little. but then there's stuff like this where it completely changes it.
although I guess for the better
the other thing that's interesting about res_momentumized i've noticed is tons of steps does actually help
i'm starting to use 50+ out of habit
notice anything interesting yet with the sigma values? i haven't palyed with them much
yeah for sure
a few times i've gotten amazing shit by going to 200+
it seems to keep adding details and complexity for a while
idk a damn thing about that sampler
that said, it's clear that some stuff doesn't work with the api that works with the regular interface, so i can't just take anything and throw it in there. I have to figure out why that is.
with 200 res_momentum
for the bot? yeah, been talking with someone who's been messing with that
apparently the api is a shit show
with sd3, we should be able to have a wide assortment of bugs
i'm eager to push the boundaries of clown, shark, bat, manta ray, samurai, insect, and arachnid themed images
is the token limit still 77-2?
i've barely touched pg2.5 tbh
this is pretty great
"me at the zoo", an 18 years old video from youtube reimagined:
it's my first attempt so it isn't particularly good
this was the video i used for it
400 steps
8 steps using the 4 step lightning lora
lykon said it's 512 now in sd3, but that's one of those things I'll wait to see. we can do really long prompts in sdxl as well, but we know it's not that simple
oh, yeah, i was wondering about pg2.5 too though
i'i'd imagine it's the same unless i'm overlooking something
Wow neat. Yeah playground still has the same engine limitations.
So these are all cascade put through dark arts images. I think playground is awesome by itself, but deviates too far from cascades native imagery. Dark arts is closer and adds that little dark arts flare if you go "dark".
Although I may have to go with juggernaut, dark arts might be still too overpowering for cascade e
yeah, playground has a hell of a style it wants to enforce for sure
Beep boop. Would you like an image? Provide a prompt!
yeah gotta hand it to em this is pretty fn great
it's too bad it's not supported on a1111 or forge yet
playground deserves more attention that they don't get for reasons of platform compatibility and sheer inertia
It has a very midjourney vibe to it. Lots of high contrast colors.
def
not as much control as you get with our fav sdxl models, but no denying it looks great
brownie points for being open source and beating MJ/dalle3 at their own game imo
it does have that mega trained feel to it - there's a lot of stability to the image outputs, not a lot of variation from seeds, and some prompts just MUST have a certain element
for example, not like my efforts were exhaustive, but it sure as hell thinks that a playground at night should have a light somewhere
Well, have to be able to see it. 🙂
sometimes though you might just want to barely see the outline of the image
at least i do lol
Because I still cannot manually add the DPM++ 3M SDE Exponential sampler+scheduler to the sample images, the image will show a sampler+scheduler I ...
i discovered this last night... lifesaver
Sure
?/dream
shit, even the lora didn't save me with that one, a damn light appeared
You're right about the lack of variation on seeds with pg. barely any. Have to use llm to create variation between generations.
yeah that's not really a good thing
but yeah what it's aiming to do i think it knocks outta the park when considering it's working within the limitations of the sdxl architecture
cascade respects my deepest desires
fuck actions, i just want SD3 to be able to create images where you can barely see anything lol
Same prompt but with llm varying it up.
Hah that's a black square
yup
prompt" a barely visible outline of a playground at night, pitch black"
i changed "very dark" to "pitch black" and got that
changed to "almost pitch black"
That looks like a barely visible comfy workflow
This is so cool, whats model generated
Friends, can anyone help me make an image, I don't know how to use it
which clip does the refiner use, L or G?
Pretty sure* it uses both like the base, but you could always check the huggingface page
dragon cyborg in gold and pink with H.R giger texture and oriental pattern
Are you using face reconstruction?
nope
WOW!!
just Highres-Fix
highresfix is amazing, SD3 will benefit from it the same way
before and after
before after
instead of reconstructing the faces only, I reconstruct the entire image
epic finger count
holy crap those reflections are so pristine
Cogito ergo sum right?
It seems I don't have enougth VRAM to use hi-res fix, I can only use it to 1.2 or so, is there some way to make it work with higher amounts? I have 12 GB VRAM
I have 12GB too
Turn on Tiled VAE in Forge/A1111 or use VAE Encode (Tiled) in Comfyui
this was I never run out of VRAM
Tiled VAEs don't have seams to my knowledge
I need to try that, ty
how much do you resize with Hi-res fix?
believe it or not like
1.2x first with kohya deep downsample -> 2x highresfix
so like 2.4x upscale on 12GB? 
wihtout Tiled VAE I am inching very very close to OOM
I can only imagine how it will be with SD3 8B (if at all)
I didn't know about that, kohya deep downsample
oh yeah
that seems like it
but you can go for 2.4X highresfix straight, idk why I even do it this way lol
I need to try that
Also there is this button in Auto1111 now, so you can hi-res fix something you already generate
how do I generate picture
Here's the image you requested
Cinematic.Redmond is here!
This is a Cinematic model fine-tuned on SD XL 1.0!
The model has a high capacity to generate Cinematic, artistic images, cars, people, and a wide variety of themes. It's a versatile model.
I really hope you like the model and use it.
I recommend generating it in cinematic proportion like 16:9, 2:1 etc.
If you like the model and think it's worth it, you can make a donation to my Patreon or Ko-fi.
Follow me in my twitter to get acess before for all new models:
~https://twitter.com/artificialguybr/~
You can use it for free here:https://huggingface.co/spaces/artificialguybr/CinematicRedmond-Free-Demo
Download it here:
HF;https://huggingface.co/artificialguybr/CinematicRedmond-SDXL
Civitai: https://civitai.com/models/359999
I hope you guys enjoy :)
nice
these are so epic
does this use any 3d animated content?
and did you standardize on 1344x768 or something else?
and did you use the film scripts at all for the contents of the captions?
your trained content is really excellent btw
did you use pivotal training? so many questions
daaamn
Who needs SD3 when we have SDXL? 😁
yessisrky
SD3 has the quality of a finetuned SDXL model with the prompt coherency of DALLE3

Better than dalle-3
well I consider the fact that I might use SD3 Turbo as a daily driver like SDXL Lightning
especially if somehow SD3 has to be CPU offloaded then 4 Steps will really help out
Yoda x Gollum
I thought SDXL Lightning was the best thing ever until I started putting lots of promtps through it side by side with the full version of the same model, and even though the visual quality was there, the prompt adherence was garbage. Great for portraits, not so much for other stuff.
So I see this stuff about SD3 Turbo, and I'm glad it's there, lots of uses, but I'll probably steer away from it.
Prompt adherence is still going to be above Midjourney v6 apparently for Turbo so you might actually consider it
though the failure examples do make me doubt it sometimes
I've got subscriptions to all the services, and I only occasionally put them through midjourney. Now that I see SD3's higher channel VAE, I suspect that Midjourney V6 has that already and that's one of the reasons why it looks so good, aside from their clearly higher res training set. The prompt adherence of MJ at this point is only marginally better than SDXL, maybe a little higher than Cascade. I still can't get mecha to actually be doing anything with MJ other than standing here with their arms at their sides. I have to use regional prompter in SDXL to get them to do anything as well.
can you roughly tell me what the 16 channels improve? higher quality VAE decoding? therefore, less of a need for HighresfixM
my 2 second understanding is that it's not a composition thing, it's a color quality thing
it's the differnce between shiny waxy skin, and a real photograph appearance for people.
yeah that one thing will probably have the single biggest impact other than prompt adherenace.
i could have done more fixing/upressing on the sdxl to fix it up, but i think the color difference between the 2 is obvious
midjourney v6 on left, sdxl on right
yeah
that said, midjourney goes way overboard on pastel colors and shallow depth of field, even when it makes no sense. with sdxl it's a choice, not a default.
Just to round out those 3, dall-e's version of it.
😔
hah putting that through sdxl is so much better
Dall e is being carried by the llm
yeah, if you look at what sdxl just made of it, it's so much higher fidelity
Sdxl isn't good at prompt following
yeah
sure it is. 🙂
hah nah, it's regional prompter. the prompt for that robot with the money is actually dirt simple.
smiling man's stone head, mecha hands in the air spewing money, dystopian future
ADDROW
chibi cute 3d mecha arms in the air , money everywhere, gritty, grimey, wires, sparking, dystopian future
ADDROW
worshiping crowds with raised fists, dystopian future
💀☠️
Ros getting dall e level prompts
@gloomy lark are u using Loras
Or just juggernaut
no loras.
It wouldn't do your username correctly. 🙂
wow midjourney actually did a really good job with this one, which is so hit or miss. sdxl is consistent, midjourney is luck of the draw as to whether it does what you ask
IT GIVES YOU A RANDOM ART STYLE EACH FUCKING TIME!!!!!!!
yeah exactly. I've tried messing with the style settings. it does nothing
I used MJ since it launched actually. The moment I jumped ship was when Stable was able to output consistent hyper realism. I was there up until the end of V5. V5 was shit too. fucking garbage mods and people too.
I think they saw what SD3 is going to do with text and they upped their game. text with them has been meh until recently
For me it was always really good at it.
💀
More spring magic - https://twitter.com/HikariUchu/status/1770589661251141657
Yoo I can only tell that´s AI because of the cat face and other minor details but it looks so good!
what model were you using? 👀
wait...is it ai, right?
😁
Hey everyone. I haven't been on in a few months. I was wondering what is new with SDXL? I keep hearing about Stable Diffusion 3 and Cascade. Are any of those out and available on ComfyUI?
cascade is
That cigarette one is so cursed, messed up anatomy
Bot is down bro they're busy with sd3
Is there a new solution now on how to upscale watercolor paitnings without actually loosing the texture of watercolor paintings ?
I was just working on something like that, I got good results with Ultimate SD Upscaler, SD XL model zavychromaxl_v50, Denoising strength: 0.16, upscaler: 4x-UltraSharp, upscaled x 4, Ultimate SD upscale tile_width: 1024, Ultimate SD upscale padding: 64
Got some little aberration (a guy head on a post) but other than that I was pretty pleased with the result, it respected the painting (I was trying to resize in img2img but it always messed the good details of the painting)
the result looks good but i was hoping to be able to keep those textures as of the wall on the left below the balcony, the upscaled ones allways kept losing those irregular patterns and became to clearly structured, was hoping that by now there is a solution
yes that's true, I haven't noticed that, and I'm in the same boat. I was more focused on the subjects and light poles/trees.
It seems with upscale you always have to compromised something. Sometimes with several tries, trying different denoising strength, one hits the spot.
Yes SD Ultimate Upscale usually kind of wipe some nice textures, result may be decent.
Wan Gog

Brute method I might think is to create several ones and in Photoshop merge the parts that has the desired textures, as sometimes some areas needs more detail, like the subjects, but other area needs less denoising so it keeps the original texture.
Then some inpaint if the merge isn't seamless... yeah some work. But afaik there is no other method than the denoising gamble.
Woskar Wowoschka
oh yes definitely it made a wan gog head there
juggernaut
Who's a good boy?
Dude..
These are great! Was it a lora or all prompt?
Improve your generation with Zest. Yes, this will work in combination with my other generational image enhancers. No trigger/activation word requir...
interior design TV background
Here is the image you requested.
A pod of dolphins playing.
Im training a style lora for sdxl, which preset should i use Im a beginner T-T
Here is the image you requested.
There is no one size fits all preset but I'd start with the sdxl now prodigy one. Prodigy is a good optimizer. Also, be prepared for a lot of failures in over and under training. There is a ton of heavily conflicting information in various guides you'll find, so be prepared to not take every guide as the best
The reason why the information is so conflicting and vague is because you have a lot of people that lucked out with some settings on a specific train, so they think it works for all. Oh and graphs, you'll do a lot of looking at tensorboard graphs lol
cascade?
This was sdxl dark arts images. Prompt was hurricane of gang members. Kind of a combination of the 2 big things happening to Haiti. It's always interesting to see what the model comes up with.
Very interesting
ah, yes interesting, thank you
i wish so much we could somehow lock these characters and make coherent stories with them they look great
做一个马
I've been away for a month, give or take, are there any big advances I missed? I'm using Radeon.
is that shaggy!
if there a prefered comfyui node that takes an image input and proportionally resizes an image so that the largest dimension stays under a maximum that I've set?
Use the Image Resize from ComfyUI Essentials.
Thanks, I got it going
prompt:a cat drinking coffee at a coffee shop. with and without clipvision, which I finally got setup for sdxl (only did cascade before). The difference in "prompt adherence" is night and day. incredible.
and this was the input image. it didn't take much to set it on the right path.
Very interesting
a 22 year old boy with black hair, from india, indian skin color, wearning a computer glasses, looking into camera, background is dark brown plain, 4k resolution profile photo
I'm figuring out that clipvision in sdxl is nothing like that in cascade.
for some reason the input image affects the final output WAY more in cascade.
/ca
openpose
then do you load that openpose skeleton into controlnet?
which one of these is it?
is this right?
sure you can choose All or select openpose.
then hit the little firework red thing to see the preview to make sure it worked.
that said, if you already have the open pose model, you don't technically need the preprocessor, because that's just creating that openpose mannequin out of whatever image.
so if you already have it....
Pixel Perfect?
does this mean it is not working?
something isnt working
am i using the wrong openpose model?
pixel perfect is when your openpose jpeg with the skeleton matches your render resolution you have set.
so it's a 1:1 pixel accurate
no preprosesor fixed it
all this stuff and regional prompter is finnicky. different models, different cfg and step counts. have to fool around with all those settings to see if that's what's stopping it from working the way you want. in a1111, there's an XYZ plot at the bottom. do your controlnet stuff, then set the number of steps and the CFG values to walk the ranges and see if there's one set that will do what you want.
thanks
any tips for consistency?
character consistency
to make an animation
same seed?
there's some ipadapter models like faceidplus v2 and instantID that can do repeatable faces, but they usually only work at very low CFGs, which is only possible with Turbo models.
other than that, you'd have to train a lora.
there's a ton of youtube videos that go over all that.
And even that is just 80% accurate. No 100% concistency yet.
We need total control! Like a 3D program. I have to be able to film my characters/objects/backgroudns from any angle and lightign condiiton concievable!
Until then AI video is a gimmick
Might as well go full manual then lol but I get you
heh
Its not that simple because AI could render things a million times better than other 3D programs and fiddling with photo editors and so on.
such a shame to get these amazing images but really we cant do anything with them yet
👏 Do you know if its possible to use clipvision on sd forge?
don't know, I've only used clipvision with comfy
😁
Chinese painting style drawing of a tortoise
rs image
Differential Diffusion (using zho_zho_zho's workflow)
They actually do work well with cfg = 6 with regular models
The issue is you tend to have body horror hell
I've found the best thing to do is use them on low strength for a first pass, then upscale the face, unsample and resample it, then patch it back in
I have some nice workflows that automate that
tried some different prompts fusing together
Link to workflow? Curious cuz I have some fancy ones I've come up with
Would be happy to share once my Internet connection comes back (on phone here)
All diff diff, workflows should be in the image still
wow, beautiful
I used a variety of different masks for these
I had these on my phone still so just pasted them in, the masks I can share once I get my Internet connection back
Diff diff is incredible
The one with the steampunk machine in the center, I was using diff diff for inpainting
Comfyui is just fine for inpainting, the stuff about needing a1111 for that is bs imo, you just need the workflow set up right
more diff diff, look close and see which way is up
So what's your process for these? Inpainting? Because there's more going on here than what regional prompter can give you. There's a level of chaos that I haven't been able to achieve normally. 🙂
Before and after upscaling
The upscale fucked up the sun rays.
Crazy sh!t
InstantID, with the right LoRA(s) included has incredible facial consistency. My LoRAs of choice for I.ID are Ethereal Grace, Rainbow, and Shaun Tan ... but that's not an exhaustive list.
Without LoRAs, faces can be weaker in their consistency.
yeah i was trying to see just how far i can push it, took about 4 min on a T4 gpu, a 3x upscale using pixelksampleupscaler with 0.3 denoising, i was using ultimateSDupscale before but couldn't get rid of those tile seems
Funny...I'm trying to push a PixelKSampleUpscalerProvider through Iterative Upscale right now on a 1152x2048 image just to see something. Taking forever on my 3080. 🤣
2x upscale over 3 steps. Probably gonna die again here soon and I may have to give up on it.
Lol yeah it can take forever, i always use 1 step but i try to at least get a 3x upscale
Well, I would do 3x, but I started at a 2MP resolution, so 2x is enough. 🙂
Upscaling a 2k x2 takes forever on the T4 im using, so i only upscale 1k images
What upscaler would you say is the best rn, i tried ultrasharp and i hate it, way too sharp, i found a website with loads of upscalers but im still not sure which to use
Honestly, it totally depends on your source image, if you're trying to add detail or not, and the style you're going for. I'm not as versed with upscaling as some of the others here, but my current favorite tool is SUPIR...but I was planning on trying something with this iteration that I'm working on with SUPIR after the iterative upscale.
I haven't tried SUPIR yet but it looks promising, I've only been using ultrasharp and some other lesser known models i found on that website
The power to bend light!
I find an upscale usually gets rid of most sun rays, so that one is actually not too bad.
?
old fashion lady
freaky one on the left is my kinda style 😄
you're so vain, you probably think this song is about you
Old Photo that brings some nice effects to gens from watermarks, fade effects, splash effects, etc... Use about 0.4 to 1.1 for the weight depending...
Using ICBINP
Is there a way to swap the model while generating? I'm using Forge.
I think no
That's similar to refining. Just set the denoise to whatever percentage you want each model to work on.
Not familiar with Forge, but if it can refine, you should be able to.
yeah, i think you can just use the refiner checkbox, and tell it to use a regular checkpoint instead of the refiner
Oh interesting
This grandma found a giant sized rat in her garden! 😲 I think it's a good idea! 💡
It's her best friend now
Mouse needs a cookie, that's all
Hush, he's sad now
what!
how do you generate images where is that dream menu?
i see
a big car
6
quagh quagh
thats a horse + duck, so we can call it a houck
Generate an illustration of a little girl in a forest style, characterized by watercolor transparency, aesthetic, fresh, and elegant features.
#1047610792226340935 bots currently offline, you can run it locally or use online services like https://seaart.ai
and stop posting it in all channels
How do you register via email at seeart? It gives no options?
there is a discord option
Dorse

Nobody expected this when they said the aliens have landed
No trigger. Play with strength. The "Happy World" style, as the name suggests, typically portrays a cheerful and optimistic view of life, embodying...
