#🏞|general-with-images

1 messages · Page 162 of 1

clever oar
#

oh anathomy still bad in new model

#

extra legs

grizzled cosmos
#

is it possible to inpaint with SD3.5?

clever oar
#

dont know i use it online

languid pebble
#

Ahoi!

clever oar
#

I don’t understand why it’s so difficult to fix anatomical problems because flux doesn’t have this, even sd 1.5 is better

crisp stream
# clever oar

I wouldn´t necessarily call it bad in case it (occasionally ?) adds an extra leg only, as long as the shape is fine. Yet from what I´ve seen here so far, as well with the black forest gump on the bench FLux/SD 3.5 comparison, SD 3.5 still doesn´t seem to be doing too well with hands and stuff

clever oar
steep sleet
#

Early days - we'll know by Christmas as to whether 3.5L is a good SD Model; or just a large bottle of milk!!! 😄

clever oar
#

on the other hand, you can use SD3 where there are no people or living beings🙂

#

Mojo

#

you try it?how fast you make one picture

languid pebble
#

Not really fast ...

clever oar
#

one min?

languid pebble
#

Hard to compare cause I upscale my flux picture. 2 Minutes in Energy safe mode maybe

clever oar
#

nice result?

languid pebble
#

I switched back to Flux ... but no complete failure ...

clever oar
#

good you have hope😎

languid pebble
#

Let's don't talk SD3.5 dead and wait for the finetunes ... base model has never been good

crisp stream
languid pebble
#

But we all have been working with the finetunes ....

#

If I understood right the base models aren't trained 100% so we can do finetunes without overtraining it ....

crisp stream
# languid pebble Let's don't talk SD3.5 dead and wait for the finetunes ... base model has never ...

Well, not really here, while I started using dreamlike photoreal 2.0 for SDXL at some point, the basemodel was already fine, just like with 1.4/1.5, even though finetunes naturally improved the overall output qualities. Yet the base model was a pleasant basis, while I feel with SD3 (/apparently 3.5) and Flux that isn´t the case anymore, reason it´s hard to produce anything out of the boring realism and 3D comic look, because the training material has obviously changed a lot. With Flux for example it looks like they at least for quite an amount trained on a Chinese dataset, with lots of stock footage appearances. Reason I basically went to generating with Cascade again. This merely changed after having started local training for Flux.

#

Also addressing artists doesn´t really work, neither in Flux nor in SD3, don´t know how it is with SD 3.5, just guessing it isn´t any different than SD3, yet who knows

crisp stream
languid pebble
#

Maybe I'm writing garbage ... it's just how I understood it ....

crisp stream
#

I´m also wondering why SD3 has got 3 different textencoders, making prompting a hassle to work with and best is the textencoder doesn´t even fully grasp the milelong "natural" language prompts, yeah, I know they can be reduced, still, they still struggle then and basically having to translate a prompt into that textencoder language by using an LLM feels counter-intuitive and decouples the artist from the actual prompt engineering to at least some extend. Isn´t there a textencoder being able to understand at least keyword and natural language in one go? Flux luckily "only" got 2, still too much if you ask me. Yes I can combine both, so the prompt will be sent to each of them, then that´s not really the way how its use is being intended and that for, at least how I experienced it here, only partially works well.

royal monolith
crisp stream
royal monolith
#

think of three different humans - a mathmatician, an artist, and a chef. put them all together on a project and tell them all to 'bake a cake together' and you'll get a mess. but tell each of them to do specific things for the project, you'll get a much better result

crisp stream
royal monolith
#

and they're just as expensive to train as base models are

crisp stream
royal monolith
crisp stream
clever oar
clever oar
clever oar
#

last try🙂

languid pebble
clever oar
viral frost
stuck totem
#

@late sorrel i cant post images in the other channel but heres an example, and also i think i realized the major problem, the frames are the exact same color as the eyes so 100% the ai got confused, which is unfortunate because the other parts seem to be decent in consistency

late sorrel
late sorrel
#

that indeed can be tricky

stuck totem
#

i think next time ill train it without the glasses or just make the glasses a different color or something, idk

late sorrel
#

well yea cause you can change the color of the glasses with a text prompt anyway, just learn the concept first

stuck totem
#

yeah im new to this, either way its exciting though, some generation epochs are pretty decent and not complete trash so its promising at least, just the glasses that need fixing

late sorrel
#

making a lora is like half science, half art lol, you mess with the params in a certain way until it gives you what you are looking for, there is no definite way to train. i mean there are things that could be considered objectively better than others, but overall i would say its still a dance with the params

stuck totem
#

true, no wonder stuff like controlnet and inpainting were developed, ai art is completely unpredictable and you need all those tools to make it do what you want. But its real exciting when you finally get a generation thats coherent, or at least good enough to minorly edit with inpainting, it truly is a work of art

late sorrel
#

yea

#

lot of fun

candid river
late sorrel
#

i see

candid river
#

It butchered my prompt, I tried have chains as like hip and boot accessories

#

And the person standing etc

late sorrel
#

but wait what are you using to generate? sd3.5?

candid river
#

I'm not using any kind of stable

#

I'm using a image generator on a site called "perchance"

late sorrel
#

well the first problem with online stuff is that they are very limited in workflows, not to mention if they are limited in censorship as well

candid river
#

There's a bunch of awesome AI generators on there made by people, for anything from fantasy towns, to plants, to minerals, etc

candid river
#

And there are no restrictions for content

late sorrel
#

well idk what that website is using under the hood, because if you have that information, then you can technically know how to prompt it correctly along with the params

candid river
#

The one thing I can't do, is use an image as an asset on it

late sorrel
#

like image to image?

candid river
#

Kind of, yeah, but more so using the image to guide the prompt

#

To get an image

late sorrel
#

yes that is image to image

candid river
#

Oh my bad

late sorrel
#

because the guidance comes from the clip conditioning on the image, it receives info from the image. but its better to guide it with text on top anyway, not just image

candid river
#

Ironically enough, I can actually draw freehand, and have been for over 13 years. I primarily draw on paper and use digital stuff to make it pop more

late sorrel
#

i wish i could draw, i would make some very interesting "manga" :3

candid river
late sorrel
#

is that from the freddy game? i never played it, just seen some stuff lol

candid river
#

Yea

#

I also have a few oil pastel arts of nagatoro hayase

late sorrel
#

you are speaking to me as if i know all of this :3

candid river
#

And a colored pencil of a clicker from the last of us

late sorrel
#

i mean to be fair, i know a lot of characters, but i maybe dont know their names

#

for example, the danganronpa games, or however you spell that, i know the characters because i seen them, but i dont think i know any of their names lol

late sorrel
#

huh.. not sure if i know that character

candid river
#

That's nagatoro

late sorrel
#

where from?

candid river
#

Don't toy with me, Miss Nagatoro!

late sorrel
#

yep i dont know that one 😦

candid river
#

I plan on drawing Marin Kitagawa at some point

late sorrel
#

is that a newer anime?

candid river
#

("my dress up darling", it's an anime series about two unlikely friends, a doll making male, and a cosplay gaming and manga geek, female, they make costumes and go to conventions

#

And do photoshoots and stuff

late sorrel
#

what year did it release?

candid river
#

2022? Maybe

late sorrel
#

ah ok so yea im not that familiar with the newer anime

candid river
#

I drew denji from chainsaw man

late sorrel
#

i barely even watch these days

#

now i know chainsaw man lol

candid river
#

And I might try drawing power next too

late sorrel
#

damn

candid river
#

Here's my crate of "seasonings" they let me cook

late sorrel
#

it's kinda interesting, cause i remember having fun drawing some stuff as a kid, but i never really continued that path

candid river
#

This is another one I did freehand

late sorrel
#

nice

candid river
#

It's a scanned paper artwork

royal monolith
#

@wispy nest 2d cell-shaded anime image depicting a woman with long flowy blonde hair, wearing bold makeup, in her casual t-shirt and denim pants, navel cutout, indoors, solid red background. dramatic perspective.

wispy nest
#

cause im getting this w/o artist ref 2d cel-shaded anime , a woman in large white t-shirt, bare legs, at a beach as the sun is setting. the ocean water is deep turquoise with orange hue in the sky.

royal monolith
#

i've got no problem using an artist ref, i just tend not to need to usually

wispy nest
#

here is another example which i dont understand ... it should render anime..

a woman in large white t-shirt, bare legs, at a beach as the sun is setting. the ocean water is deep turquoise with orange hue in the sky. Anime style by Katsuhiro Otomo

royal monolith
late sorrel
#

nice water :3

wispy nest
#

not too fussed with the anime thingy atm, im liking the overall image aesthetics

royal monolith
#

that AI just be trolling you

wispy nest
#

one thing that stands out for me with sd3.5 is that it can render full body shots lot more fluidly than flux

royal monolith
wispy nest
wispy nest
royal monolith
royal monolith
wispy nest
#

i had to struggle quite a bit with flux to render ful body shots, it feels like flux focuses on upper body only how its trained

royal monolith
#

FYI - you can get 3.5 down to 8 steps and it still looks good. cfg 4, steps 8, shift 2

wispy nest
#

yeah that upper body thing is DPO
DPO likes to "zoom in" a lot and fill the frame with the subject

royal monolith
# wispy nest

you're using turbo. i mean sd3.5 large, not turbo, can still go down to almost no steps

late sorrel
royal monolith
late sorrel
#

i see

wispy nest
#

this is very pleasing to see how her face came out even with full body

#

im not upscaling

royal monolith
wispy nest
#

i wonder if booru style tagging triggers anime output

#

someone gotta try booru yeah see what happens

#

i did

#

it rendered anime but need to test more

#

about realism it falls a bit short compared to flux but still good

wispy nest
#

able to render at 10 seconds is effortless

wispy nest
#

since its already distilled

#

would be similar to the issues training flux

#

training a big checkpoint like juggernaut on a distilled model has never been done before

#

so i take that as a no

#

but they could train large base model to be optimized with 4 steps while maintaining richer texture?

royal monolith
wispy nest
#

i mean running large base model at 8 steps is fine too, given the model can be trained to produce better images than the default base

royal monolith
wispy nest
#

there will be stuff like hyper coming along probably

#

as an option

#

aye

#

when I do SDXL or SD 1.5 I load up TCD, PCM and hyper loras
and then mix and match weights

#

they all do similar things

royal monolith
wispy nest
#

this sd3.5 is 8 billion parameter?

#

i notice the model is lot less resource hog than flux

#

feels like im rendering with sd1.5

#

yet the file size of this 3.5 is 16gb plus the clips

wispy nest
wispy nest
#

anime girl at a park

#

`highly detailed, sharp focus, best quality, masterpiece,

anime girl at a park, smiling softly, wearing a red crop top, white shorts, black sneakers. modern anime.`

#

these quality prompts have impact...
best quality, highly detailed, sharp focus, vibrant texture, dynamic composition,

mellow needleBOT
#
Stylish Stroll

Aw, so cute! A person, all dressed up in a white button-up shirt with suspenders and dark pants, is standing by a teal-colored wall. They're wearing a floppy, white hat. A white door, framed by some light-colored wooden trim, is opening up to their right. The floor looks like light-colored wood. It's like a scene from a cute anime or a photoshoot, don't you think? Meow! So stylish! Nyaaa~

wispy nest
#

also not as strictly censored as flux

wispy nest
late sorrel
wispy nest
wispy nest
late sorrel
#

but to find her, i need to find the park first

rose sparrow
#

do you guys know which checkpoint this is?

royal monolith
nimble mason
#

sure, that looks like the AnimeSkullHatXL checkpoint

languid pebble
#

Good morning coffee

languid pebble
#

I think there's a reason why there's no "Save Image" node in the example SD 3.5 Workflow ;-P

dry blaze
#

Do you guys know of a model for "upscaling" for comfyui that doesn't upscale, more just take "paper picture scan" blurryness and sharpens it? As the image is already high res. just need to sharpen and deblur it

wispy nest
#

I would strongly consider upscaling and then downscaling though

reef furnace
#

There seems to be a problem or a major change in the way SD3 creates natural scenes. The first image is sd3 medium, the second image is sd35 large.
sd3 produces really natural results, whereas sd35 looks pixelated and strange. Both are the exact same standard sd35 comfyui workflows from huggingsface.

wispy nest
dry blaze
wispy nest
#

you can tile ERSGAN style upscalers in Chainner

dry blaze
wispy nest
#

sadly I never found one

#

would be good

reef furnace
wispy nest
#

can lower them later if this works

reef furnace
visual hemlock
#

Hey.. I am having issue with an model .. not sure if its everyone has problem with.. But the legs at the knee it always looks weird ( photo ) Is there a way to fix it?

cold bronze
#

Hello!
Could you help me ?

I need an Image... Wizard in a cold hill

clever oar
olive sky
visual hemlock
olive sky
#

Yeah I know I was messing with you considering the cropped image suggests a full picture that might not be shareable here.

Also, I don't have a solution for that. Anatomy is one of those things that our eyes and brain register instantly in minute details even if we're not aware of it. But those creases and the particular folding of the back knee is a prime example of something that wouldn't be explicitly trained in a model unless specified.

Also, it's a stylistic problem, since many visual art styles don't bother with exact proportions or shapes or rendering. The style you're working with is exceptionally prone to mistakes being very visible.

royal monolith
gloomy yew
#

''woman smiling slightly, laying on beach, natural sunlight, high resolution''

royal monolith
astral gorge
#

is it possible to replace a animal like head with a more human head using ADetailer? Trying to get an anthro character but the faces/heads are way too animalistic for what I am aiming for

royal monolith
gloomy yew
#

i think i just had this solved in the tech chanel thanks guys 😄

astral gorge
#

lol I was about to ask something and then i realised. WAIT... I think I routed the wrong node into this input XD

odd viper
royal monolith
#

or be specific in your prompt: a man laying on a beach towel on the sand, his hands behind his head, his dog laying across his stomach, looking up at the sky,

#

the issue comes when you give a very unclear prompt like "a man laying on the sand' to all three encoders, and they all have to guess what way he's laying, and what he's doing - and they all guess something else, then try to draw all of the guesses at once

#

'laying on the sand' - could mean laying on his back, or laying on his stomach. and it could mean with his head toward the camera, or looking at him from the side or looking at him from his feet or looking at him from straight above him. and if they all guess something different, and then try to draw all the guesses at the same time, for one image, you get odd results

#

but tell it SPECIFICALLY what he's doing and how - or prompt each encoder separately, and you don't get weird results

#

IF you are using a workflow that just allows you to put in a single positive prompt, then you are giving the same prompt to all 3 encoders, you have no choice. if you are using comfy, you can use the triple encoder node and then prompt each of them individually

odd viper
#

(and ofc make it catastrophically forget things like what a bank is and stuff like that)

royal monolith
royal monolith
royal monolith
neon vine
#

ah

royal monolith
#

yes

odd viper
royal monolith
#

sd3-2b-medium, sd3.5 - and flux too except it's stuffed so full of images to mask the effect you don't need to

odd viper
#

since ANN models are "rough models of the universe", how rough comes down to how much data it has on-hand during training to align it to what is actually the truth (or what we want it to do)

royal monolith
#

anyhthing that uses the archetecture

neon vine
royal monolith
# neon vine i just got 3.5 any sampling recomendadtions??

grab this, it's got my comfy workflow in it, take a look at my settings. to get to my actual uploaded image with the workflow you have to 1. click the image to open in the discord viewer. then click the words open in browser. then right click on that image, save as, then drag and drop into comfy

odd viper
#

I guess the whole "using diffusion models instead of transformer" kind of defeats the purpose of being able to output accurate depictions of things that it hasn't seen before e.e

neon vine
royal monolith
neon vine
#

ill take a look

royal monolith
neon vine
#

barely have enough VRAM to generate a 512x512 image

neon vine
royal monolith
# neon vine barely have enough VRAM to generate a 512x512 image

https://blog.comfy.org/sd3-5-comfyui/ you might want to use comfy's version then, it's a lot smaller

Comfy Org Blog

Following our exciting V1 launch yesterday, we're excited to share that Stable Diffusion 3.5 is now supported in ComfyUI for local inference. Experience it with our signature node-based workflows!

Just now, Stability AI released Stable Diffusion 3.5, including 3 powerful models:

  • Stable Diffusion 3.5 Large: With 8
neon vine
#

whats with the 3 prompts?

royal monolith
neon vine
#

i do not know how that works

royal monolith
#

if you remove the text that's there, you'll see which field goes to which encoder

#

give t5XXL your long, narrative, detail rich prompt. give clip_l your artsy, background, ambient, fine details, style stuff. give clip_G the black and white, no frills text

neon vine
#

im new to comfy i dont exactly know how clips work

royal monolith
#

the encoders are what read your prompt - they're also models

#

you CAN give all of them the same prompt, but then they will battle each other, and they will each understand it differently. so prompting them seperatly works much better

neon vine
#

what about leaving one ortwo empty

royal monolith
neon vine
#

oh god is 40 steps slow ill drop it down to 30

royal monolith
#

i use steps between 32 and 40 usually

neon vine
#

technology is leaving my 3080 behind quite rapidly

royal monolith
neon vine
#

also using the official sd3.5 large

odd viper
royal monolith
royal monolith
odd viper
#

they work in mostly spatial domain, converting to pure latent information + physical simulation would end up with better results

odd viper
neon vine
royal monolith
neon vine
#

im tempted to set it to 200 steps and go eat dinner lol to see what happens

#

or 30 steps but a batch of like 30

royal monolith
odd viper
royal monolith
#

it's not using unet, you cant' talk to it like you would, say, sdxl

odd viper
neon vine
odd viper
#

so if you're saying you don't get that at all, it's because you're not asking it for things that may result in that 🤷

#

Fair point about how it's being prompted, It would be cool if stability released their VLM or a bunch of examples of dataset captioning

royal monolith
royal monolith
royal monolith
odd viper
royal monolith
neon vine
odd viper
#

show me tiptoe kneeling in sand, short black hair, both hands on the ground in front of the character with a side-view perspective 🤷 (then observe the feet and hands turn into abstract horrors)

royal monolith
#

when you use phrases that are unclear, and made up, and are hard for humans to understand, there's little chance the AI is going to ahve a clue what it is you want it to draw

#

and when that happens, and the encoders all guess somethign different - and they are all incorrect almsot guarenteed, you get a mess

odd viper
royal monolith
#

you are ASKING for a mess with that phrase

odd viper
#

also that's not my "prompt" smh

royal monolith
#

and i'm a human with years of experience, not an AI with a very small world view

odd viper
#
~*~aesthetic~*~ #boho #fashion, full-body 20-something woman kneeling in sand, wearing shorts and a t-shirt, candid pose

"no artifacts"

#

"20-something woman" is from SAI's provided prompt btw

#

this is through replicate API with no changes to default settings, 40 steps, cfg 4.5 etc

royal monolith
#

rompt: a blond woman sitting on the grass. she is sitting cross-legged. she is holding a sign in her right hand that says "SD 3.5". her hair is blowing in her eyes. she is smiling. She holding a rose in her left hand. it is summer. the setting is a park

odd viper
#

"no artifacts"

royal monolith
odd viper
royal monolith
# odd viper "no artifacts"

nope, no artifacts. that's the AI trying to draw more than one hand or more than one foot at a time. that's an issue with the encoders

#

and that, again, is what a LoRA is for, if you really want to create images where tht's a problem

odd viper
#

🤔 interesting, so you're saying the model wasn't trained properly enough to take advantage of what the autoencoder has to offer (with 16 channels per pixel?!)

royal monolith
#

not an entirely new checkpoint

royal monolith
odd viper
#

it may make sense for "style" but even then, these models in the past have shown great ability to easily switch between photograph and cartoon style without needing to load model offsets/loras/etc

#

you'd be way better off having a fine-tune specifically for creating high quality human anatomy across a diverse range of styles, poses, content etc. Using Lora's for poses is extremely limiting, having a smarter model would be far better

#

In any case, I'm looking forward to seeing if this thing trains as good as SDXL does. I've seen some really impressive and diverse capabilities in the much-smaller model

case in point; it makes sense to use lora for style, but not for small anatomy features

royal monolith
odd viper
#

my 3090ti has no issues now running it locally with fp16 >:D

#

ima probably set this one aside until a good community fine-tune pops up and wait for auto1111 repo to get updated so I can spend a day merging all of my modifications onto it x_x

clever oar
#

The debate is whether the anatomy in SD3.5 is normal?

#

sd 1.5 do better😃

torn grove
#

can someone help me edit the syringe that has been generated to a pendant or a necklace?

scenic yew
clever oar
scenic yew
clever oar
#

I hope this gets fixed

#

although I would be happy to run even this locally, because I have 4 GB vram

clever oar
scenic yew
clever oar
scenic yew
clever oar
clever oar
#

sd 3 medium

#

turbo and medium better)

#

now what can flux

#

flux lose

#

but there more style

#

and now sd 1.5 lmao

median rover
#

Anyone know if there's a model that can make images like this? Took a lot of effort with midjourney blender was hoping there was something trained on psychedellic stuff.

clever oar
clever oar
jovial tiger
#

It's not on flux level, but it's way better than at 4.5.

jovial tiger
royal monolith
supple thistle
#

Cat

clever oar
royal monolith
cold bronze
#

Wizard in a cold hill

wispy nest
#

If you VAE encode and decode a high-res photo you will pretty much get the same image back, the VAEs are much more powerful models than the Unets and DiTs

royal monolith
wispy nest
nimble mason
#

lol 16 ch per "Pixel"

#

no such thing as a pixel in a latent lol

steep sleet
#

I'm getting very poor output using default SD3.5L workflow

#

Hope its not my GPU?!

#

Or is it my prompt at all?

nimble mason
#

if your gpu was the problem you'd get nothing

steep sleet
#

I'm seeing that it is actually the prompt doing this ...

#

(If you d/load my PNG and run the w/f and see if it does the same?!)

#

🙂

#

This prompt "ruins" SD3.5L

royal monolith
celest sigil
#

Really? That output doesn't look prompt related.

steep sleet
#

This is the next output - same settings

#

Perfect

royal monolith
# steep sleet

that is the world's ugliest prompt. why on earth are you using midjourney weighting with sd 3.5?

steep sleet
#

It works

royal monolith
#

it doesn't. 3.5 cant' use those weight commands, that's midjourney specific and not in th emodel, either

steep sleet
#

But it also throws SD3.5L "a loop!"

royal monolith
#

yeah. all of that stuff is jsut random noise to it

nimble mason
steep sleet
#

I got the weighting from Portrait Master - it works fine on my PC

royal monolith
nimble mason
#

i've found the best way to weight components of prompts is to change the order

steep sleet
#

But I can see that it troubles SD3.5L

nimble mason
#

or if you're desperate, to repeat stuff in different ways

royal monolith
steep sleet
#

OK

#

It works fine in Flux

royal monolith
steep sleet
#

Having said that 3.5 has no idea - every so often the prompt works perfectly

celest sigil
#

Where's my Leopard?!

steep sleet
#

Let me see if I can get a perfect example at the same settings ...

royal monolith
wispy nest
#

it depends on how the Clip text encode node is coded

royal monolith
steep sleet
#

It affects clothing using terms like snake, alligator, wolf, fox, peacock etc

royal monolith
#

i think it tried to eat her pasta, she skinned it, and is wearing it

celest sigil
#

This is odd...I have a wildcard with animals in it, and the prompt that came up replaced the wildcard with "Leopard print". There is only "Leopard" in the list of animals?!

royal monolith
celest sigil
#

I know, but you're missing the point

steep sleet
#

Can anybody reproduce my output using the w/f from the PNG? svp

royal monolith
celest sigil
royal monolith
celest sigil
#

You can see the replacement in the first screen shot. That's before it goes to any LLM

royal monolith
celest sigil
#

It's done it again with Zebra!

wispy nest
#

I always had trouble getting wildcard nodes to work

celest sigil
royal monolith
celest sigil
#

There is no LLM

royal monolith
#

then it's a very poorly written script

#

you should be able to read the code of the node to see what it's doing

celest sigil
royal monolith
#

maybe it needs a differnt dictionary

wispy nest
#

I can't remember which wildcard nodes I tried cos there are many
but there was something fishy, have to check code at some point

celest sigil
#

It's not intelligent enough to know stripes goes with zebra and print goes with leopard

royal monolith
celest sigil
royal monolith
steep sleet
#

Its definitely the prompt - and its not an aberration after all - just a densely textured (and unattractive!) look.

#

This isn't too bad

#

I dropped the use of both Zentangle and Rococopunk - much more usable output as a consequence!

#

Zentangle and Rococopunk produced "too much bad noise" 🥳

wispy nest
#

a lot of civit loras have to be weighted very low

#

the red background with green looks good

steep sleet
#

Awesome!

steep sleet
languid pebble
#

SD 3.5

clever oar
languid pebble
clever oar
#

It’s strange that even simple images have such mutations

languid pebble
#

They look like copied in ...

celest sigil
#

Something a little odd about the dog.

clever oar
languid pebble
#

Nothing more to say

clever oar
#

in photoshop?

#

the lighting on people is as if they are not from this picture

astral gorge
#

anyone got experience with the TCD lora and module? I just added it to my workflow but I'm getting like what seems to be fairly different results from what I was getting before, like I lost most of my prompt adherance and also the generated image almost looks like it hasn't finished denoising.

#

I'm working in comfy btw

languid pebble
wispy nest
#

gamma is called eta on that node, by the way

shut jasper
#

GPU requirements for running stable diffusion 3.5 large model ??

vagrant dust
clever oar
astral gorge
wispy nest
#

the choice of eta is key

#

trying every eta from 0 to 1 in 0.1 increments is a good idea

astral gorge
#

I was having issues getting my normal samplers and schedulers working so I had to back off of it for a bit to get that back to generating anything decent. I finally got a proper gen again so gonna switch back to TCD

#

my workflow got a BIT more complicated XD

wispy nest
#

hmm

odd viper
#

it was a palpable difference

astral gorge
#

this can't be right for TCD

#

ah so while other stuff gets longer with each step this gets shorter or something

#

cool

#

I don't get why my generated image changes SOOOO much when using TCD instead of something else. it's like most of my prompt adherance gets nuked

astral gorge
wispy nest
#

higher step count and higher CFG mostly

#

if you push CFG high enough you might need nodes to deal with CFG burn like thresholding, skimmed CFG or tonemapping

wispy nest
#

there was never really a newer VAE that improved SDXL, beyond the fp-16 fix

steep sleet
#

I'm going to tryout OmniGen in the new SD Next (just released today!)

odd viper
clever oar
steep sleet
#

At the entrance of a dimly lit cave, a towering, majestic dragon with sapphire-hued scales glistens in the faint light. The dragon stands tall, holding two crystalline prisms in its claws, angled precisely like those in the reference photo. The sunlight streams through the cave entrance, hitting the prisms at specific angles, causing vivid, realistic beams of light to split into a spectrum of colors, casting a radiant rainbow on the dusty ground. The surrounding area is shrouded in partial shadow, with the play of light and dark creating a mysterious atmosphere. The dragon’s intelligent, piercing eyes gaze at the viewer, offering a silent challenge: solve the ancient riddle of light and shadow. The cave walls are rugged and dark, with faint engravings hinting at forgotten knowledge. The overall mood is one of mystery, magic, and high-stakes intellect, as the dragon stands guard over the path forward."

royal monolith
steep sleet
#

Turbo SD3.5L + Ollama LLM for prompting

royal monolith
royal monolith
steep sleet
#

I will get around to it I'm sure 🙂

royal monolith
#

blender+default cube + material on one face...

#

that's got all sorts of possiblities suddenly

steep sleet
wispy nest
#

their next version apparently they are also training it on bad hands so that can be used as a negative

steep sleet
#

Ooh! Err! cattitude

languid pebble
#

I sometimes see faces in that one ...

vagrant dust
#

Thought it was an image where you look at strong enough then move a bit away to see a 3d image?

languid pebble
#

Does it work for you?

vagrant dust
#

Well i was bad in the days where those 3d stereograms where released in book form... so no 🙂

languid pebble
#

I know them but don't think A.I. can do this ...

#

Sometimes I see an eye on the top left or two on the right side

steep sleet
#

3.5L Turbo

languid pebble
#

Day of the tentacle ...

clever oar
clever oar
fleet bluff
mellow needleBOT
#
Night City Sparkle

Aawww, what a charming night scene! The city streets are all shimmering and sparkling with the reflections of the lights. Rain is falling softly, creating puddles that catch the light like tiny, magical mirrors. The buildings are gorgeous and elegant in their reddish-brown tones, standing tall in the dark night. Streetlights illuminate the way, casting a warm glow on everything around them. Cars are parked neatly along the street, and the air is filled with the quiet hum of the city at night. It's like a dreamy urban wonderland! Purrfect for a magical girl adventure! Nyaa!

#
Night City Sparkle

Aawww, what a charming night scene! The city streets are all shimmering and sparkling with the reflections of the lights. Rain is falling softly, creating puddles that catch the light like tiny, magical mirrors. The buildings are gorgeous and elegant in their reddish-brown tones, standing tall in the dark night. Streetlights illuminate the way, casting a warm glow on everything around them. Cars are parked neatly along the street, and the air is filled with the quiet hum of the city at night. It's like a dreamy urban wonderland! Purrfect for a magical girl adventure! Nyaa!

crisp stream
cerulean radish
#

#checker board socks that play chess

keen wing
#

made this banner for a friend who plays rust

steep sleet
#

Turbo 3.5L

languid pebble
#

Good morning coffee!

nimble mason
steep sleet
nimble mason
steep sleet
#

Turbo LLM 3.5L

steep sleet
radiant plume
visual hemlock
royal monolith
#

@pure monolith Prompt: A woman lying on top of a pool of marshmallows., Negative Prompt: left blank, Width: 1024, Height: 1024, Steps: 40, Cfg Scale: 4.0, Shift: 3, Seed: 2926827617

gloomy yew
royal monolith
gloomy yew
#

damn, ive only just installed this version .. howe do i even update to 3.5 😛

gloomy yew
#

oh man

#

lots of letters and words

#

xD

#

I just followed a youtube guide the other day, i assume it was an old version in the guide

#

i use a standalone if that makes any difference

royal monolith
royal monolith
gloomy yew
#

I followed this guide .. I had no idea what I was doing, just followed the steps

#

my only knowledge on it is that I hit the batch file to start it up lol

royal monolith
gloomy yew
#

is this the right method?

royal monolith
royal monolith
# gloomy yew https://tenor.com/view/friends-chandler-noidea-excited-gif-4810411

Today we cover the basics on how to use ComfyUI to create AI Art using stable diffusion models. This node based editor is an ideal workflow tool to leave how AI art is generated, but also how you can really mess with the internal elements much more than you can with any other AI Art interface out there today. #comfyUI #stablediffusion

Install ...

▶ Play video
gloomy yew
#

legend thank you

gloomy yew
#

probably getting ahead of myself

royal monolith
gloomy yew
#

youre so kind ty

royal monolith
gloomy yew
#

add a silouette of frodo baggins

royal monolith
royal monolith
royal monolith
clever oar
languid pebble
#

Good morning coffee

royal monolith
sterile kiln
steep sleet
#

3.5L Turbo LLM (Llava3.2)

steep sleet
languid pebble
steep sleet
#

SD3.5L Turbo LLM (Llama3.2)

languid pebble
#

Just a usual day in NYC underground ...

steep sleet
#

SD3.5L Turbo LLM (Llama3.2)

languid pebble
steep sleet
#

When in Flux yes ... can it be used in 3.5L as well?

#

I have seen Olivio Sarkas video on this node - changes detail somewhat ...

languid pebble
#

Yes ... I have no clue whether it works with SD3.5, too

#

I just added the node to the super Flux workflow ....

wispy nest
#

ModelSamplingFlux works on every model you can use it with SD 1.5 if you want

royal monolith
ancient bronze
#

Not sure this is a matter of tech or prompting but how do ya guys fix bad eyes when it's larger pics like this?

#

Using pony btw, might be relevant

ancient bronze
jovial tiger
# clever oar

lol my mouse immediately went to go click the play icon

clever oar
ancient bronze
wispy nest
#

you can do eyedetailer

#

but there isn't a pre-made node

ancient bronze
#

Node? I assume that's comfyui stuff, I'm using the version for coughing babies, webui

wispy nest
#

ah I don't know about that

ancient bronze
#

I guess you could call it the vanilla basic bitch version

wispy nest
#

lol

#

webui is okay for users its just the code base is bad for developers

ancient bronze
wispy nest
#

TBH with comfy you can just drag someone's image

#

click "install missing nodes"

#

hit control+enter

#

and you have got their workflow running

ancient bronze
wispy nest
#

miracle checkpoint is a cool way to describe flux yeah

ancient bronze
ancient bronze
royal monolith
ancient bronze
royal monolith
rain gazelle
languid pebble
#

Sometimes you just need a hand. Good morning coffee!

steep sleet
steep sleet
gilded venture
#

Guys what kinds keywords can i use on sdxl to get a style like this. It's like a blend of photorealism and digital art

icy sedge
#

Boy

gilded venture
#

I need it for a project i'm working on

#

Btw currently i'm uisng "(concept art:1.4) {prompt} (digital artwork : 1.4), illustrative, painterly, matte painting, (highly detailed : 1.4)"

and its not giving me the best result

rain gazelle
#

I feel like this is what 99% of sdxl models output 😛

gilded venture
#

I knowww but i've tried so many times and the images always end up not as digital as i want

gilded venture
rain gazelle
#

For this i don't think it really matters much. "Pretty woman, dark makeup, hair bun, city nightlight, choker necklace, halter" would probably do it.

#

lots of the big popular models output results like this just by default... reminds me of the old RevAnimiated

vagrant dust
gilded venture
#

I'm using fooocus btw

vagrant dust
# gilded venture I'm using fooocus btw

not about the ui but the models. For the one above i created an image with animerge but did not denoised it to 100%, then a second pass with epic realism. Pretty sure there are some sdxl models which are well well balenced

gilded venture
wispy nest
#

some SD 1.5 models do better with natural text BTW

gilded venture
wispy nest
#

new reality does better with text for example

#

didn't realise until recently

gilded venture
wispy nest
#

like sentences

#

instead of tags

gilded venture
#

ahh got it

lucid sparrow
#

can someone please explain to me what the hell this is? I keep writing to support to do something about it but I guess they don't give a shit if they have our money! Why the fuck isn't it working!!!!!!!!

royal monolith
frosty wave
#

The place where one can make support tickets isn't accessible to normal server members? Is that intended? Also I wanted to report user: davidsmith4704 for randomly writing me "Hello What's up with you?" (We never had contact and I haven't been active on this server for months.)

[I'm reporting this message with the ⚠️ reaction so mods see it ,not sure it this still works though]

#

Also this doesn't seem to work

royal monolith
royal monolith
hardy tapir
#

Realistic Vision 5.1 Hyper

#

looks perfect at first but after a bit you realize something is off

#

which is that the lighting on the face doesn't match the lighting on the neck/torso so the face looks copy pasted onto an original face

#

likely fixable with an image editor

royal monolith
languid pebble
#

Good morning coffee

languid pebble
#

🤪

viral frost
#

I think I'm done with trying to make SD3.5L work for me. It just doesn't seem to give quality output. Flux on left, 3.5L on right, same prompt and seed. Upscaling makes 3.5L even worse.

steep sleet
# gilded venture What prompt structure do you recommend

raw photo, (realistic:1.5), (woman:1.01), (fashion model 18-years-old:1.5), (tiny, tiny body:0.36), (over the shoulder pose pose:1.25), (giger:1.05), (blue eyes:1.05), (oval eyes shape:1.05), (red lips:1.05), (defined cupid's bow lips:1.05), (glowy makeup:1.05), (in love, in love expression:0.68), (round with defined cheekbones shape face:0.63), (frohawk cut hairstyle:1.05), (mahogany hair:1.05), (disheveled:1.09), (muted colors:0.98), (fashion photography:0.5), (professional photo, balanced photo, balanced exposure:1.2), (watercolor makeup:1.05), (back arch pose:1.25), (vintage dress:1.25), (bikini:1.25), volumetric lighting light, light from left, muted colors, black and white photography, (professional photo, balanced photo, balanced exposure:1.2)

steep sleet
glossy notch
#

A wheat field at sunset, transitioning from purple to intense yellow. On the left side, a field of wheat is consumed by fire and reduced to ashes. In the center, there is a stone path, and on the right side, a bountiful wheat harvest viewed from above. The scene has a cinematic style, captured with a 20mm lens for added depth and perspective."

steep sleet
languid pebble
#

Made a new LoRa ... 🙂

steep sleet
#

Ectomorph LoRA?

languid pebble
#

Humanmachine ...

gilded venture
languid pebble
wispy nest
#

it needs some more fine tuning to be used as the final pass I think

viral frost
royal monolith
viral frost
#

I’ve thought about the possibility of using Flux as a refiner for SD3, but 64GB isn’t enough memory for that on MPS without quantization, and I don’t really want to download quants just to try it out. Not even sure if quants work on MPS yet — they didn’t when Flux was released.

wispy nest
#

would recommend using Nvidia GPUs in cloud over using Metal Performance Shaders

#

there's gonna be quite a lot of things that aren't implemented for Apple at the moment

#

its true that using a separate model as a refiner comes with the downside that you need to keep both models loaded in VRAM

#

unless you are willing to load and unload a lot

royal monolith
wispy nest
#

you could try Realvis SDXL as refiner, looks like this when used on Flux

#

its a fairly low blur model, which can be helpful

karmic lake
#

guys, I'm lost, I just want to be able to extend images using stable diffusion... I have made an acount, got a license, but what do I do now?
Install something?
There are these apps on the website like draw things and diffusers, I don't know what any of this is...

royal monolith
karmic lake
#

I think it is called 'generative fill', to extend an image, that is all I need it for

karmic lake
#

okay thank you

royal monolith
#

welcome

vagrant dust
# royal monolith that's photoshop only

Since when? Invoke, comfyui and for example diffusionbee (Mac) use a canvas where you can easily outpaint an image which does the same like the generative fill from photoshop?

royal monolith
#

and THAT is what he said he was looking for

celest sigil
languid pebble
#

Good morning coffee!

celest sigil
steep sleet
#

SD3.5L Turbo

wispy nest
#

you can get away with a turbo model for the early steps if you have a refiner yeah

steep sleet
#

Three-tuskers!!!

celest sigil
#

Prompt from @viral frost

steep sleet
celest sigil
wispy nest
#

flux refiner is cleaning up good wow

steep sleet
#

Are we posing SD3.5L Turbo as 'poor', if we are turning to Flux for finishing/refining?

wispy nest
#

not necessarily
I often refine Flux with SD 1.5, but I would say Flux is not a poor model compared to SD 1.5

#

SD3.5L, SD3.5L Turbo and the SD3.5M that is coming have not had final aesthetic fine tune yet
so a refiner is good now, but in the future they might not need a refiner as much

#

Flux came out of the box with an aesthetic fine tune done already, so its a bit different for Flux

celest sigil
#

Accidentally set denoise to 1.0 on the Flux refiner, so this is actually just Flux 😄

wispy nest
#

hmm I might like the pure Flux image more, hard to say

steep sleet
celest sigil
#

Refined

languid pebble
steep sleet
clever oar
celest sigil
supple nimbus
#

We are looking for people who want to participate in our new web 3 ecosystem. A brand new project with lots of tools for your needs.

Beta Testers ($35-40/hr)
Moderators/Community Manager ($500/week)
Developers (Rust, Python, C++)
UI/UX designers (from 1 year of commercial work)
Ambassador (to be discussed)

Apply today and be part of a transformative journey fueled by creativity and vision.
To apply, send me a friend request!
(We are also open to proposals for cooperation on mutually beneficial terms)

celest sigil
#

Everyone should be interested, they're a "Crypto holder", don't you know!

wispy nest
#

crypto is nearly 20 years old and I haven't seen a use case yet lol

celest sigil
languid pebble
minor violet
#

Hi. How do I configure the predefined prompts to append to user input so that small amount of 'text' would be generated on the images?

steep sleet
#

Add a sign with the text saying "Welcome to SD3.5 Medium!"

languid pebble
#

Ahoi dicordos

clever oar
#

AHOI

real bay
languid pebble
real bay
clever oar
viral frost
gritty warren
royal monolith
#

SD3.5 medium

brazen sand
celest sigil
royal monolith
#

SD3.5 large

brazen sand
celest sigil
#

denoise around 0.55

brazen sand
#

maybe you have a really good pc

celest sigil
#

It takes me 60 secs to create an image, with both passes and upscale.

#

RTX 4090 🙂

brazen sand
#

24 GB, oh boy I wish I had gotten into more debt to get one instead of 12 GB VRAM 😂

brazen sand
#

and even more loading of the model time

celest sigil
#

SD3.5 using Google FLAN and Flux refiner.

wispy nest
wispy nest
brazen sand
wispy nest
#

VRAM is like the working memory of the GPU
when its full, the model has to use slower memory (DRAM) and this slows it down a lot

brazen sand
#

I would like to find some refinement method for already generated images, so I don't have to switch or load different models

brazen sand
wispy nest
#

if you switched to using GGUF quants that fit in your VRAM you would see a very dramatic speed up

brazen sand
#

once it is loaded it is kind of smooth, but in no circunstance I would considerer switching the model in the middle

brazen sand
wispy nest
#

why? the 4_0 GGUF looks almost identical for most seeds

#

and you would receive many multiples of speedup

#

you could also fit both Flux Dev and 3.5L in your VRAM

#

if you used 4_0 GGUF

languid pebble
#

Good morning coffee

celest sigil
celest sigil
celest sigil
steep sleet
lapis bluff
#

BITE COIN

languid pebble
brazen sand
#

original

sterile kiln
brazen sand
# sterile kiln Wow, what prompt? Is it SD3.5?

it was flux dev, the promtp was

A photo taken inside a rustic hut, with no luxuries, depicting Jimi Hendrix and John Frusciante as castaways. They are relaxing and drinking argentinian mate, a traditional argentine beverage, surrounded by simple, tropical decor. The scene captures a laid-back and creative atmosphere, with instruments and tropical elements in the background.

brazen sand
# sterile kiln Wow, what prompt? Is it SD3.5?

but the one you quoted, it was done by inpainting, the original I made with flux was the other one. In the original, one is drinking "something", and they are not quite jimi hendrix nor john frusciante, I just inpaint it until I produced that

visual hemlock
#

I was sitting and doing prompts from different shows and they all became so fucking tall and I did multiple ones and they all became the same.. Eventually I look at my photo size and it was 832 x 2048 😄

viral frost
royal monolith
#

@nimble mason prompt: Abandoned, century-old lighthouse on rocky coastline at dusk, with crumbling stone walls, rusty lantern room, and overgrown vegetation. Waves crash against weathered foundation. Flickering sunset light casts eerie shadows." sampler: dpmpp_2s_ancestral_cfg_pp scheduler: linear_quadradic

nimble mason
#

a lot of that stuff stopped working with sd35, including the euler_cfgpp in comfy

#

SamplerRK is definitely working though

royal monolith
nimble mason
#

the "dpmpp_2s" in the dropdown in samplerRK is a dpmpp_2s_ancestral implementation

royal monolith
#

this is the entire set for that sampler

nimble mason
#

lol

royal monolith
#

i've got distribution on all but that one

nimble mason
#

yeah i probably need to go do some rescaling with the math or something

royal monolith
#

this is another one

#

but euler_ancestral is nice

nimble mason
#

oh yeah my cfgpp ones are actually not for RF, i just looked

royal monolith
#

i'm just using the default that's in comfy as of the latest update for 3.5 medium

nimble mason
#

haven't updated em... hopefully never will as the goal is to just get all this shit rolled into the same core architecture

#

gotcha

royal monolith
#

someone stuck lms right in the middle of the dpm's

nimble mason
#

recommend trying this for euler ancestral

royal monolith
nimble mason
#

gotcha, makes sense

royal monolith
#

but you might want to look at wht's going on with that sampler anyway

nimble mason
#

the euler ancestral in comfy?

#

that should be pretty much equivalent to this:

wispy nest
#

comfy adapted euler ancestral for ret flow recently

#

if i remember rightly from the commit history

nimble mason
nimble mason
royal monolith
#

becuse i should have at least gotten a distribution for that scheduler, not solid red

nimble mason
#

that was back in the pre-SD3 days

#

afaik

#

very very work has been done by pretty much anyone afaik to get noisy sampling working

royal monolith
#

all the other samplers i'm at least getting a noise distribution, though, for that scheduler. just not that one

wispy nest
#

I don't know why but it feels like the entire academic field wants these models to be one step of euler

royal monolith
nimble mason
#

ultimately im' gonna delete all of those and just replace them with functions that pass the call to the same core sampler code

#

lol, yeah, i mean... here's how i see it

#

either you get it to the point where it's one euler step, or there's benefits to better sampling methods

royal monolith
#

yann posted something really dumb on twitter the other day, to which i responded with 'proof that scientsts need to get out of the lab and live in the real world'

nimble mason
#

test of linear quadratic, res_2s with one implicit step

#

hard noise eta 0.5

#

same as that first one but with beta

royal monolith
#

i like the first one best

#

beta's too saturated

nimble mason
#

agreed there

#

another linear quad

royal monolith
#

that one's good too

#

skip layer 2

nimble mason
#

agreed

wispy nest
#

I thought using linear_quadratic on image models instead of video models was a joke
but its working kinda well 🤔

nimble mason
#

main thing i'm looking out for here: the grainy gritty look that most mmdit models seem to be plagued by for whatever reason

nimble mason
#

another linear quad

#

gonna gen these prev two with beta too

royal monolith
nimble mason
#

the thing that's interesting is the fact these schedules work period

#

yeah not sure if it's arch or training

#

tehre's certainly more synthetic data around now

royal monolith
#

skip layer 2 and see what you get

royal monolith
nimble mason
#

yeah, but i mean... more stuff that's got weird artifacts that aren't typical for photos etc

royal monolith
nimble mason
#

just a random guses at one possible reason for the appearance of new artifact patterns

#

i've seen it with sd3 and flux

#

it was at its worst with the sd3 beta

#

and it tends to appear for certain prompts, which is one reason i suspect training sets being a culprit

#

when these images come out clean, they're gorgeous

royal monolith
nimble mason
#

yea that's why i said beta

royal monolith
#

there's a lot of 'beta's ;)

#

that's even a fish

nimble mason
#

but yea that is one type of artifact i'm watching out for, and something your comparisons had less of in the linear quad outputs

#

i've noticed other models doing this shit too now now that i think about it

royal monolith
#

like i said - skip layer 2

#

and see what you get

nimble mason
#

and different versions of it in certain ways... when flux spits out the fake toy story world look, it's especially prevalent on texturse like dark pavement

#

dots, dots, dots, fake detail

#

noise passing as detail

#

think most ppl think it looks great but... it does not, it's fake bs detail

wispy nest
#

I thought it was positional embeds
but then Sana came out with literally zero positional embeds and still has it

royal monolith
#

it's the core architecture

nimble mason
#

beta with the same seeds

#

linear quad is def better

#

i see some subtle haloing with the beta results, it's possible this is somewhat tangentally related to cfg now that i think about it

wispy nest
#

I think maybe its an inductive bias due to the fact that DiTs have to use patchwise embeds

#

I really wanna test the flan thing

royal monolith
wispy nest
#

someone on civit added flan t5 to sd 3.5 and flux

#

he said its way better but then didn't put comparison images

royal monolith
nimble mason
#

nope, need to get that added, don't have it on here

#

i'm not doing general updates with my comfy environment

royal monolith
#

i did some images with just clip_l and clip_g with prompts and a . for t5xxl - and they came out really nice

nimble mason
#

too much customized stuff

wispy nest
#

flan t5 is a fine tune of t5 made by google

royal monolith
#

take it out of the mix entirely

royal monolith
wispy nest
#

oh nice this could help with lowering the side effects of too much noise injection

nimble mason
#

the problems with injecting too much noise come from the sigma schedule not getting adjusted to scale for it

royal monolith
nimble mason
#

RF is real sensitive to any perturbations to the variance level

royal monolith
#

also - ddim_uniform warms your image up - has a lot of red shift

nimble mason
#

anytime you add noise, you have to be sure the next step goes farther down that it normally would, to get you back to the right noise level

royal monolith
wispy nest
#

ah is this the variance exploding versus variance preserving thing

royal monolith
wispy nest
#

no I didn't see

royal monolith
#

sgm_uniform vrs ddim_uniform

nimble mason
#

i might have to go make another RAG (random attention guidance) node and hope it's as interesting as it was with cascade

royal monolith
#

ddim on the left - lots of red

nimble mason
#

cuz i just love that name

#

SLG did nothing

royal monolith
nimble mason
#

is it supposed to have a very visible effect or be really subtle?

wispy nest
royal monolith
nimble mason
#

gotcha, good to know

#

scale, start end percent?

royal monolith
nimble mason
#

says no WF for some reason

#

just curious if you're letting it run through the entire generation or stopping at 0.15

royal monolith
#

i'm using dango's default settings. but i have something else loaded and can't load it right now to check what they are

nimble mason
#

yea i can view others like my own above, idk what's up

#

no biggie

royal monolith
#

beta's noise distribution leaves a little bit to be desired

royal monolith
nimble mason
#

obv torched lol

royal monolith
#

dpmpp_sde/beta

nimble mason
#

oh, yeah, that thing doesn't have any of the noise scaling corrected for RF

#

that would explain it

wispy nest
#

need to test Instaflow model

nimble mason
#

what's that

wispy nest
#

its like a ret flow version of SD 1.5

royal monolith
#

not as bad as dpmpp_sde/exponential

nimble mason
#

new?

wispy nest
#

no its old

#

but it would show something

#

if it gets flux grid or not

nimble mason
#

oh yeah that's def worth a look

wispy nest
#

one thing with ret flow is the flow paths aren't meant to cross

#

I wonder if this is the issue

#

because Pixart Sigma is a DiT and we never heard about Flux grid back then in the Pixart Sigma days

royal monolith
wispy nest
#

one thing I've been meaning to do for ages is write a python library that analyses and visualises various group statistics for the diffusion model trajectories
I originally wanted to do it to think about CFG burn but it would be helpful for this sort of thing also

royal monolith
#

the difference between dpmpp_sde_gpu/normal and dpmpp_sde/normal

wispy nest
#

this sort of chart is great

#

they plug a toy 1-D example into the model equation to get these

#

its showing the CFG in the early steps destroying the image diversity

#

and then limiting it brings the diversity back

royal monolith
wispy nest
#

I don't think I need speed bad enough to write Triton for it

royal monolith
wispy nest
#

there's a nice 3D one from GITS paper too

royal monolith
#

now THAT is a chart i like

wispy nest
#

that's how they made GITS scheduler they just took the average of these

royal monolith
#

i want to see what's going on in more than jsut 2D

#

i really want a visualizer that charts, in vector space, how a token in a prompt moves to it's final destination

wispy nest
#

3rd dimension adds a lot yeah

royal monolith
#

in 3D cause, yeah

wispy nest
#

I always forget the name of this paper, will find it tomorrow
its the one about delaying the negative
it shows maps of individual token vectors on top of the score function of the model
its the thing you are looking for

#

could be re-visualised in a nicer way probably

royal monolith
#

so much for assuing that the _gpu on the end of the sampler just means it'll use your GPU if you ahve one

wispy nest
#

lol

#

its just where the noise is made

#

its weird how much difference it makes

royal monolith
rain gazelle
#

Their recent share on monodromies was pretty interesting. Will be interesting to see that shift.

#

chuckles

#

😎 sometimes it feels like that in here

royal monolith
languid pebble
#

Good morning coffee

celest sigil
languid pebble
celest sigil
#

Apparently, the street gets narrower with age

languid pebble
celest sigil
rain gazelle
viral frost
sterile kiln
glossy sun
#

a teenager`s head in mini style

royal monolith
languid pebble
neon vine
#

i guess its that unmistakable ex commie eastern europe aesthetic

steep sleet
steep sleet
orchid nexus
#

bonjour, puis-je avoir une assistance en Français?

royal monolith
orchid nexus
royal monolith
# orchid nexus merci, pas de soucis je suis novice, je cherche à savoir que faire pour créer un...

pour créer une image sur ce discord, vous devez utiliser le canal artisan. Vous pouvez lire les informations à ce sujet dans ce canal #artisan-faq Cependant, il est en anglais, donc je vous suggère de copier le texte et de le coller dans ce traducteur. https://www.deepl.com/en/translator

Translate texts & full document files instantly. Accurate translations for individuals and Teams. Millions translate with DeepL every day.

steep sleet
#

Tres bien

languid pebble
#

The red planet

royal monolith
little dagger
#

I’ve been trying to get out paint to work for hours now and this is the best I get using poor man’s off paint. They really need a new video update of what needs to be done with modern out paint

clever oar
royal monolith
clever oar
#

he has no right to break such rules!