#πŸ†•ο½œsd3

1 messages Β· Page 118 of 1

dusky thistle
real terrace
#

I really love when there is that kind of composition and it gets done perfectly, like sections with perfect spaces

dusky thistle
oblique parcel
sacred geode
sacred jewel
dusky thistle
sacred geode
#

not sure why civitai isn't showing the info :/

patent acorn
untold valley
dusky thistle
patent acorn
#

ok def large is better

sterile pendant
#

I wasn't really impressed with large, but medium has kind of hit the artistic sweet spot if you're into doing art styles and if you're not obsessed with making waifus or people

#

I think finetunes of medium will eventually replace sdxl

patent acorn
#

large is prompt coherence-adherence fun, medium is sdxl but 100x better

dusky thistle
patent acorn
#

i typed a 2 panel garfield comic, i think the rating tags wasnt supposed to be there but hey garfield works πŸ˜‚

dusky thistle
craggy crest
#

@dusky thistle Prompt: a young executive sitting in a tower office near a window. Through the window we see the tops of buildings and the city. There is a cartoon thought bubble attached to his head and in it, we see a picture of a puppy

muted dove
craggy crest
craggy crest
muted dove
#

Generating on an empty stomach

craggy crest
muted dove
craggy crest
#

ghost lighthouse

muted dove
muted dove
#

Love these waves 🌊

craggy crest
#

like the light streaks from the lighthouse in this one

#

prompt: Intricate, geometric, snake-scale patterns, tessellated in shimmering, metallic hues of polished silver, gold, and bronze, reflecting light with a dazzling, kaleidoscopic effect, evoking the ancient, symbolic language of reptilian textures, and the modern, technological allure of precision-crafted materials.

muted dove
craggy crest
#

oooo love that effect

muted dove
#

A slightly different take on the lighthouse theme

craggy crest
muted dove
craggy crest
#

ROFL!

muted dove
#

I'd rather be there right now. πŸ€”

craggy crest
craggy crest
#

Crabs!

muted dove
#

A few πŸ˜„

craggy crest
muted dove
craggy crest
muted dove
craggy crest
muted dove
#

I avoid the whole platform.

craggy crest
muted dove
#

I smell it already 🀒

craggy crest
muted dove
#

Not if it's associated with knob face

sterile pendant
craggy crest
muted dove
craggy crest
#

it's the silly time of the night

muted dove
#

When dinosaurs walked the earth...

craggy crest
sacred geode
winged seal
#

Hey people, anybody know how to merge LoRA's into a transformers model, and then save it in the transformers split format as seen on HF? I need to be able to merge my LoRA's into my models so I can train additional concepts in

#

I am very inexperienced with code, so I unfortunately do not understand a lot of the stuff I have seen

noble coyote
#

Flux RF Inversion - no Controlnet, no IPAdapter! Original, then RFI version - prompt = a boy in an angry rage cartoon style

sterile pendant
#

If you wanted to just merge a model with a lora using the non-diffusers format, you can easily do it in comfyui and save the model. It's three nodes

noble coyote
#

Flux RF Inversion - no Controlnet, no IPAdapter! Original, then RFI version - prompt = a boy in an angry rage cartoon style

turbid grotto
#

anybody tried to train sd35m?

winged seal
sterile pendant
noble coyote
fossil pagoda
icy drift
#

Console says Phi3 (OmniGen's base model) doesn't support SDPA attention or something.

muted dove
#

A hologram from the future

gusty trail
gusty trail
patent acorn
winged seal
gusty trail
winged seal
#

well no shit... if thats the case

#

you will be my hero if this works @gusty trail

gusty trail
winged seal
#

I need the full model saved in diffusers format

gusty trail
#

you might just copy other component

#

and replace the transformer part

winged seal
#

I am *extremely inexperienced with code, so the less work/chances to mess things up, the better πŸ˜…

#

I can use safetensors to merge in the LoRA's, and then convert to transformers, if I can figure it out

gusty trail
#

the unetsave would save the model as a unet. If you want to load the models using diffuser pipeline, you just need a diffusers repo copy and replace the original transformer with saved new transformer.

noble coyote
#

Flux RF Inversion - no Controlnet, no IPAdapter! Original, then RFI version - prompt = astronaut on a spaceship in the style of 3d melting gold render cleaning the toilet

winged seal
#

here's an outline of what I need:

I am training flux using AIT, which requires a diffusers/transformers format model

From there, I wat to merge in those LoRA's, and continue training new concepts in/on that merged model

in order to do that, the model I merge into or try to train on cannot be a safetensors, and instead needs to be a transformers style model.

All I need here s to be able to convert from safetensors to diffuers, or have a script to merge LoRA's into diffusers/transformers models

gusty trail
#

Just try it. There are many way to achieve the same result

#

"diffuser" style model just has different naming key with the same thing. there is no black magic

winged seal
#

My mind has a tendency to get overwhelmed and shut down when I have no idea what I am doing 😭

sterile pendant
patent acorn
sterile pendant
#

what would really help the diffusion community most would be a new text encoder model that can translate lazy prompts and fleshed out prompts. almost like running prompt expansion under the hood for the lazy prompts. the problem is that text encoders are expensive to train. T5 and clipg/l are ancient by ML standards, with the rate that things grow in this field

sacred geode
lavish sparrow
patent acorn
lavish sparrow
#

Have a LLM pick up your prompt -> split it in a t5, clipl,clipg prompt

patent acorn
#

but what llm?

lavish sparrow
# patent acorn but what llm?

i've tried various -> actually roleplay LLM's are surprisingly the better ones. They are usually more creative, and give more flavourful prompts

patent acorn
#

specifically llama 3 is the only llm i find to be creative

lavish sparrow
#

mistral nemo, or if you can run it, mistral small

pseudo owl
# patent acorn can an LLM be used as TE to do that tho?

Yes llms can be used directly to convert your text into embeddings for the unet or dit like the t5xxl , lumina next, sana, li-dit all do that.

What eface is saying is to expand your prompt/rephrase it and then put the llm enhanced prompts into clipg, clipl, t5xxl.

lavish sparrow
#

i've tried cydonia 22b, works really well too

noble coyote
#

Meta's Llama3.2 is good

#

Zephyr, Llava2

lavish sparrow
#

zephyr still exists?

patent acorn
noble coyote
patent acorn
#

thats why i said some hf spaces do it

bitter hearth
patent acorn
#

idk if yall seen lumina using an llm as TE

bitter hearth
#

we need a diffusion model that can handle a wider range of guidance vectors, rather than an LLM forcing us to use a different vector

pseudo owl
# patent acorn yeah ik the enhanced prompt

Yeah then you can honestly use any llm. It’s more about the system prompt then. some might be slightly more creative then others but the system prompt, few shot prompts matter much more.

lavish sparrow
noble coyote
#

Try these too minicpm-v:8b-2.6-q8
qwen2:0.5b

sterile pendant
# patent acorn can an LLM be used as TE to do that tho?

Yeah but they are slow as hell compared to the way clip/t5 encode. LLMs do the whole next token thing and clip/t5 output the whole output at once. Also, the word flower doesn't come out of the prompt encoder as flower. It's way to complicated for me to explain at 8am walking the dog lol...

noble coyote
#

gemma2:latest

patent acorn
#

true haha

patent acorn
bitter hearth
#

TBH the best text embeddings for diffusion would probably have to be over API

sterile pendant
#

Even if you're using a pre-stage with an llm to prompt expand, you're still at the mercy of clip and t5 and then encode it for the diffusion. It helps a ton though

pseudo owl
bitter hearth
#

I think a better direction is widening the range of prompt types that work for the model

sterile pendant
bitter hearth
#

this is Sana?

sterile pendant
#

Like maybe they had to train the last couple layers or something

bitter hearth
#

Sana is Nvidia so maybe its skill issue thing and they worked out a way to do it TBH

pseudo owl
patent acorn
#

ok can i ask something outside sd3 here? because im trying to change actor clothes frame by frame and its stressing me out due to the process is rly slow and im using krita ai diffusion

bitter hearth
#

I don't know this area well but there is an entire field of diffusion models called "VTON" or "Virtual Try On"

#

they work differently to our normal ones

patent acorn
#

im tryna find a way to change anyone clothes in vid and all i could find is inpaint the actor outfit then use ebsynth and if the next frame is liquidy then edit it again and so on

bitter hearth
#

they tend to fork the Unet like in control net or brush net and then do self attention and/or cross attention injections across the two unets

#

sadly your task is extremely hard its still experimental

#

VTON is the key word to search for though

patent acorn
#

im too nervous since its a school film project and it was past the deadline though the teacher doesnt even care

#

is this good tho?

#

but doin it every frame uhh

sterile pendant
bitter hearth
#

oh sorry my suggestion was not appropriate for school project
I am not sure there is an easy solution though

patent acorn
#

wonder if it can do shirtless

bitter hearth
#

ah yeah this is the sort of thing I meant

#

dedicated VTON model

#

they beat normal methods

patent acorn
#

wait but

#

what about some props to the actor

#

like loincloth

noble coyote
#

sackcloth and ashes

bitter hearth
#

you could use VTON for most of their clothes and then try img-to-img for small objects

noble coyote
#

Ollama/Flux i2i

patent acorn
bitter hearth
#

inpainting is the easiest
also compositing (place the object where you want it before the img-to-img)
hardest is stuff like noise inversion or edit models like CosXL

patent acorn
#

then i have to do that on few specific frames and do ebysnth

bitter hearth
#

I never got noise inversion to work personally its tricky

patent acorn
#

this was example i was stressed out btw

#

somehow more yellowish than the right pic

bitter hearth
#

not sure about ebsyth

patent acorn
#

i mean

#

im doing inpainting

#

the refine gen for some reason mroe yellowish than the before frame

bitter hearth
#

that's not a big deal as we have good colour match tools

#

many comfy nodes do the same colour match method called Reinhart or something

sterile pendant
#

Oh yeah, that's where I got the system prompt I've been using with qwen2.5 for prompt expansion lately lol... (Sana's paper)

bitter hearth
#

if you are inpainting I strongly recommend powerpaint v2.1 https://github.com/nullquant/ComfyUI-BrushNet

sterile pendant
#

I forgot I read parts of this paper when it came out lol

patent acorn
bitter hearth
patent acorn
#

comfyui as the remote

bitter hearth
#

I love Sana cos they made a DiT with no positional embeds

sterile pendant
#

that sysprompt works REALLY well I've found (obviously, you need to use an instruct version of a model, not a base version)

#

well they are the pixart team mostly and i loved pixart sigma

bitter hearth
# patent acorn im using krita ai diffusion rn

by pure coincidence I spent a lot of time yesterday researching the latest inpainting methods
my conclusion was that powerpaint v2.1 is the way to go, out of stuff that is currently fully released and working in common GUIs

#

the original brushnet paper explains why inpainting models and control nets don't work

#

inpainting models mix the text tokens in too early, and control nets are too sparse of a control

patent acorn
#

wait what i gotta check it out

bitter hearth
#

there is a really nice node pack with good examples https://github.com/nullquant/ComfyUI-BrushNet

bitter hearth
#

Sana is still very interesting but I wanted main model

noble coyote
sterile pendant
#

nvidia teamed up with them for RnD

bitter hearth
#

I hope not but maybe

noble coyote
#

PiXart Sigma was cool, agree

bitter hearth
#

hoping for new pixart also

sterile pendant
#

sana is likely a proof of concept before they make something big from the arch

patent acorn
#

need pixart omega smh

bitter hearth
#

the VAE from Sana will be good for other uses

patent acorn
bitter hearth
#

in some ways the VAE was the star of the show anyway

#

not sure license

patent acorn
bitter hearth
#

okay nice that looks not bad

#

if you like outpainting, thats where powerpaint dominated the other methods

#

powerpaint's preference score was like 600% of the score that inpainting model got

patent acorn
#

i havent seen one message about it in its discord

noble coyote
#

Ollama and SD3.5L

bitter hearth
#

yeah Krita is not the way to go for inpainting

#

dedicated networks are getting too good

noble coyote
#

I use Photoshop_SD_Plugin node inside Photoshop to enhance its Content Aware/Inpainting features ... bu that's just me!

sterile pendant
#

using that sysprompt from Sana, i got some nightmare fuel out of sd3.5 medium

noble coyote
#

Pigs WILL fly!!!

sterile pendant
#

(that ui is my gradio app i've been making for the family. trying to make it as idiot proof as possible)

#

it runs comfy api workflows

#

and that's using qwen2.5 7b IT for expansion

noble coyote
#

Comfy has had a makeover ...

#

I often use Qwen, Zephyr too, lately Meta's Llama3.2

bitter hearth
#

I'm a florence guy mostly

#

its not as strong these days though

noble coyote
#

I use Flux with Florence2 a lot

sterile pendant
#

yeah but its not idiot proof enough for techno illiterate family members. comfy lets me make custom workflows and it has the best optimization out of all the frontends

noble coyote
#

minicpm-v:8b-2.6-q8

sterile pendant
#

i use both florence2 and minicpm2.6

#

have a tab for it in my app

bitter hearth
#

sadly Diffusers is overtaking Comfy in speed

#

cos of Torchao and Sage Attention

sterile pendant
#

well it's not like those features can't be implemented into comfy

noble coyote
#

3.5L and Llama3.2

bitter hearth
#

there's an interesting convo about exactly that in comfy discord at the moment

noble coyote
#

Marvellous!!!

#

πŸ₯³

sterile pendant
# noble coyote 3.5L and Llama3.2

how is 3.2 vs 3.1? afaik, wasn't it just a distilled version of 3.1? like the 3b model is roughly on par with the 8b version, right?

#

haven't really bothered messing with it yet

bitter hearth
#

3.2 boosted the smaller ones a fair bit

#

to be honest though its the agent or chain framework that you use around the LLM that is more important at this point

sterile pendant
#

well i mostly only use instruct versions of models for stuff like prompt expansion and some of the newer models have been hit and miss. like for my app's prompt expansion, qwen2.5 is the only one that reliably follows the exact format and doesn't do a bunch of rambly verbose LLM stuff

bitter hearth
#

verbose is a big issue yeah

sterile pendant
#

3.1 is close though, but still whiffs it like 1/20 times

noble coyote
sterile pendant
#

yeah i feel you

bitter hearth
#

getting a second smaller LLM to check the results and force a re-roll for a bad one can help

noble coyote
#

Is art full of mistakes? Yes!
And it is often the bad mistakes which make the art successful!

#

... he said, rather enigmatically!

bitter hearth
#

πŸ€”

sterile pendant
#

i'll try out 3.2 since it's smaller and all. my pc only has 32gb ram. like for my t5 tenc, i use the q5km gguf to save a little ram. worst case scenario, i have q8 flux or sd3.5large/medium, q5km t5, clipg/l, qwen2.5 7b(q5km), minicpm2.6, florence 2 all offloaded into ram. it all fits without having to hit the page file lol

noble coyote
#

My 8GB VRAM is only competent due to my 64Gb RAM

sterile pendant
#

i only have 8gb vram as well

#

so i immedately offload after use at each stage

noble coyote
#

Plus 2 x SSDs for rapid LoRA and Checkpoint change

sterile pendant
#

yeah all my models are on a 5gb/s nvme

noble coyote
#

My 2Tb system SSD is overfull, so swapping out for 4Tb

sterile pendant
#

lmao... (sd3.5m)

bitter hearth
#

wow

sterile pendant
#

"A cucumber human hybrid creature stands in a whimsical scene inspired by Francisco Goya's style. Its torso resembles a green, lumpy cucumber with delicate, vine-like tendrils for arms and legs, while its head is humanoid with large, expressive eyes and a mischievous grin. The background features a dreamy, surreal landscape with floating clouds and ghostly figures, creating an eerie yet enchanting atmosphere." not quite goya, but i'll take it

noble coyote
#

Flux and Ollama minicpm2.6

sterile pendant
#

alright im done lol...

pseudo owl
noble coyote
#

LLama3.2 and 3.5L

lavish sparrow
finite osprey
muted dove
#

Hair in your soup?

flat oracle
#

sup guys maybe someone can help me out with that:

Im trying to generate detailed pixelimages from simple pixel images.
Therefore im resizing the images to 1024x1024, which is no problem at all with pixel art.

However for some images, its gives me weird outputs if i dont scale them down to 512x512.

Upper one is 1024x1024
below that is 512x512.

The initial image resolution is 964x464

Can someone explain this to me, or even tell me in which case i need to resize to what resolution?

dusky thistle
rapid pivot
#

Cool colors

lavish sparrow
dusky thistle
#

1920x1152

#

one shot no upscale (SD35M

lavish sparrow
#

nice coherence on the outside

dusky thistle
lavish sparrow
dusky thistle
#

yeah SD35M has a great sense of style

lavish sparrow
#

that's some crazy gen speed your making there @dusky thistle

cedar axle
#

SD3.5L

#

Fiddling with realism (SD3.5L)

#

Too bad 3PO gets all the gold - this armor could have been pretty sweet (SD3.5L)

noble coyote
#

Flux + Ollama

#

Ollama settings

pseudo owl
turbid grotto
#

sad that allegro and mochi do not support img2vid

remote holly
#

What is allegro ?

flat oracle
#

that was my question too

flat oracle
unique saffron
real terrace
#

I'm setting a img2img workflow for SD3.5 medium, should I change something here?

pseudo owl
#

@flat oracle @remote holly
Allegro is a text to video model, can generate videos from text.
Their official discord bot(3-4 min), can also run locally on as little as 8gb vram but takes 30mins(it’s horribly unoptimized right now).

If you have 24gb vram, I would recommend Mochi-1 text to video(also has apache2.0 license), that is considerably better and faster even though it’s 10b. Needs at least 24gb tho, can’t fit in 8gb.

Some vids mochi generated(from genmo, their official website).

real terrace
signal shuttle
craggy crest
craggy crest
#

@bitter hearth lcm+normal vrs lms+normal

cunning lintel
#

Best sword ever!

real terrace
untold valley
cunning lintel
#

The girl lying on the grass reassembled herself best as she could and went for a run πŸ˜‚ 😭

cunning lintel
craggy crest
cunning lintel
craggy crest
#

@bitter hearth @dusky thistle ipndm/linear_quadradic

untold valley
sacred jewel
untold valley
cedar vortex
untold valley
distant skiff
#

A close-up of cells and microorganisms, mostly white, with a few colorful elements, such as pink and yellow, on the background we see some small, organic, green, blue, and violet creatures. This is a macro photography image with a shallow depth of field.

dusky thistle
craggy crest
dusky thistle
#

sd3m? sd3L?

#

just about ready to drop a new sampler πŸ˜‰

#

this is what sampler dev looks like lol

#

got some new modes going thouugh

#

dpmpp_2m, dpmpp_3m, res_2m, res_3m with all noise modes and implicit sampling options

#

the right kind of noise can push res_3m toward some styles it normally doesn't want to do

#

more normal result for that one

#

got 25 samplers all using the same code, not too shabby

bitter hearth
#

25 is loads yeah

dusky thistle
#

prolly will add another half dozen or so by the weekend

#

only one major task left: get unsamlping and guide stuff working with samplerRK

#

oh, and add DEIS

#

then i can delete pretty much all of my samplers lol

#

that is going to feel greaaaatttt

#

thousands and thousands of lines of code, poof, no more

bitter hearth
#

the guides were cool yeah

dusky thistle
#

seemed like mode 8 was the real popular one that was essential to keep

craggy crest
craggy crest
#

Dont delete stuff. You never know when you might need it, or soneone else will find it usefull. Even if you are positive it can go away

#

Especially with things changing this fast

bitter hearth
#

I don't delete I just never save in the first place lol

craggy crest
craggy crest
dusky thistle
bitter hearth
#

at least the way cloud is currently, storage is priced really high relative to compute for some reason

dusky thistle
#

but i am def deleting most of this from my active version

#

it's gonna make this much easier to maintain and navigate

#

i'll have at least 4k lines of redundant code

craggy crest
bitter hearth
#

ye at some point I should put civit models on google drive or backblaze

#

its an issue also that hugging and civit don't have global servers

#

so if your GPU is in Australia or somewhere like that the download is slow

craggy crest
dusky thistle
#

yea i use mine as a landfill

bitter hearth
#

problem is getting them from hugging to vast

#

it seems that everyone has the same problem cos US servers go for a slight premium

craggy crest
bitter hearth
#

oh its just that vast have the cheapest compute

craggy crest
dusky thistle
hallow lion
#

We are not in Kansas anymore.

noble coyote
#

Flux RF Inversion - an astronaut 'keeping house!'

dusky thistle
#

so let me know when you get the chance if it works now so i can close the issues ppl have opened

#

deceptively simple looking little beast

#

25 samplers, unsampling, guides, multistep modes, buffer modes, legit implicit runge kutta sampling, 17 noise types, 6 noise scaling modes, and CFG++

#

and more (of course :P)

noble coyote
untold valley
noble coyote
dusky thistle
untold valley
#

2m then

dusky thistle
#

i have a 4090 so i don't feel the effects of the slower samplers as much, so tbh my fav is probably res_3s

untold valley
dusky thistle
#

yeah, res_2m is prolly a safe bet for that card

#

that runs as fast as euler

noble coyote
#

Cannot get RES4LYF to work πŸ˜„

dusky thistle
noble coyote
#

yes

dusky thistle
#

are you still getting an opensimplex error or something

untold valley
dusky thistle
noble coyote
#

I will get the error soon ... d/loading grounding-dino stuff

untold valley
#

it seems you are missing nodes?

dusky thistle
#

yea, trying to get my repo working on there

noble coyote
#

RES4LYF fails to load ... I need to disable clashing nodes

dusky thistle
#

oh there's a node naming conflict? weird

#

maybe you have an old version in another folder?

noble coyote
#

Mebbe - I shall delete that older version - or update it

dusky thistle
#

ohhhhh yeah if you have another one, remove it from your custom_nodes folder and stash it on your desktop or something lol

#

then git clone again from scratch

#

that should help

#

100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 39/39 [00:04<00:00, 8.32it/s]
100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 40/40 [00:08<00:00, 4.50it/s]
Prompt executed in 15.90 seconds

untold valley
noble coyote
#

RES4LYF loaded at last - no problems with Open Simplex (yet ...)

dusky thistle
#

thank god lol

#

yeah i took opensimplex out, i just commented it out

#

if anyone reallllly wants to use it they can just uncomment it

bitter hearth
#

without opensimplex how can we get Flux grid in SDXL

untold valley
dusky thistle
noble coyote
#

FIX NODE on both ClownSamplers helped!

untold valley
# dusky thistle

oh damn we need a skull and the with needs to have golf club then it will be perfect

#

wizard death golf club in mars

dusky thistle
#

they have workflows embedded

#

i have some really sophisticated unsampling on these

#

it was inspired by the RF inversion stuff, totally redid the math and reworked the algorithm so it's kinda new-ish

bitter hearth
#

I tried that token downsampling thing called Todo, its really good
its a 50% speed boost on any workflow that uses text encoders πŸ€”

dusky thistle
#

major upgrade

bitter hearth
#

I never got vanilla unsampling working

#

would be cool to try this new one TBH

untold valley
noble coyote
untold valley
dusky thistle
hallow lion
noble coyote
hallow lion
#

Geoffrey Hackumeriff: "I'd rather plug my butt permanently than have diarrhea in space again. Even Sumo-X couldn't save that pod from my ass."

noble coyote
#

Ewwwwwwwwwwwwwwwwwwwwwwwwwwwwwwwwwwwwwwwwwww

#

πŸ₯³

dusky thistle
noble coyote
hallow lion
# noble coyote

Don't take laxatives when you fly in orbit. I speak from experience.

#

Papa Musk banned me from his shuttle.

noble coyote
#

Norovirus in Space! You could do a very funny Graphic Novel on this!!! πŸ₯³

patent acorn
noble coyote
#

Flux RF Inversion Style Transfer

patent acorn
noble coyote
patent acorn
#

thats no fun

#

i requested you not expecting to be told to gen it myself

dusky thistle
noble coyote
#

RFI Style Transfer

fossil pagoda
noble coyote
dusky thistle
lavish sparrow
lavish sparrow
dusky thistle
dusky thistle
#

res_2m and res_3m are as fast as euler

fossil pagoda
lavish sparrow
dusky thistle
#

res_3m is really good at getting paint to pop

#

yep

lavish sparrow
dusky thistle
#

WF in the second image above btw

#

well, it'll be different

#

it's a different style of sampling

#

more approximate but it also has some advantages from that tradeoff

#

in that it borrows from previous steps

lavish sparrow
#

i mean, res3 should be pretty fckn slow by all means

#

it's accurate, but just... slow

dusky thistle
#

yep

lavish sparrow
#

if you manage to get 50% of the accuracy and no speed penalty, that's a win

dusky thistle
#

yeah it's still a signifiacnt boost

#

def give both the 2m and 3m a shot

#

3m is gonna be more sensitive to cfg and low step counts, which is pretty common for that type of sampling

lavish sparrow
#

yeah, am gonna have to fiddle around with it a bit. not bad @dusky thistle ❀️ loving ur sampler

dusky thistle
#

25 samplers, unsampling, latent image guides, multistep modes, buffer modes, legit implicit runge kutta sampling, 17 noise types, 6 noise scaling modes, and CFG++... all in one compact package

#

i'm sure there's some bugs to be worked out but finally got most of the features rolled in that i was looking to prioritize

lavish sparrow
#

your sampler works great with this

dusky thistle
lavish sparrow
#

you need to adjust your eta tho

#

if you use the detailer

dusky thistle
#

ahh yea i've got a way to do that by manipulating some of the params with noise

#

i was thinking about adding other stuff to just directly increase the denoise out of sync

lavish sparrow
#

but the combination of the detailer daemon and eta noise manipulation makes for some very crispy images

dusky thistle
#

best way to see it demoed is to set eta to... something.. and then lower the s_noise value to something like 0.95 or even 0.9

lavish sparrow
#

so much experimentation to be done by just adjusting values xD

#

... and each model has it's own preferences...

dusky thistle
#

yea for real

#

you dont even want to know how much code i've written that hasn't made it out of the lab so to speak

lavish sparrow
#

i can imagine

dusky thistle
#

i've tried damn near everything imaginable with the math with RF

#

which is why i've got six noise scaling modes now lol

lavish sparrow
#

3m fries with detailer, it seems

dusky thistle
#

and probably at least a thousand of images i burned to fucking death lol

lavish sparrow
#

trying 3s now

#

oh. 3s is slow xD

dusky thistle
#

try res_2m

#

then try res_3m and use a painting prompt as a comparison... something with impasto

fossil pagoda
dusky thistle
#

been wanting to get everything cleaned up first so i can actually document it

#

it got completely unmaintainable, just way too much to keep up with, which is what drove me to come up with this universal sampler mech

#

now i only have to deal with code in one place, not 18 different places over 5k+ lines of mess

lavish sparrow
#

res2m (110s) vs res3s (290s)

#

lets see if i can get res2m cleaned up

dusky thistle
#

are you using eta? you'll usually see more of a diff when you're doing SDE sampling

lavish sparrow
#

0.25 only

dusky thistle
#

here's the fun thing about faster sampling btw

lavish sparrow
#

can't have it too high because of detailer daemon

dusky thistle
#

you could always just increase the step count to eliminate your time savings πŸ˜„

#

try turning off the daemon and lowering or increasing s_noise a bit, might do what you're looking for (or maybe not)

lavish sparrow
#

yeah, just gonna experiment some further πŸ˜„

noble coyote
lavish sparrow
#

but res2m seems to be a very good sampler

dusky thistle
lavish sparrow
#

@dusky thistle -> if for some reason you're an idiot like me and you put ETA to 1.0 -> the image output becomes black. however. changing it back makes something bugged, and need to restart comfy to get the sampler working again

#

just so u know ❀️

dusky thistle
#

oh weird

#

always good to hear bug reports

#

yeah 1.0 with hard is by definition the breaking point for the math

#

anything less than 1.0 is theoretically doable even if it's horrendous

#

0.999999999999999999999 works

#

1.0 doesn't

zealous latch
#

A video editor focused intently at a cluttered desk, surrounded by multiple screens displaying a complex video editing timeline filled with clips. The room is dimly lit, emphasizing the glow of the monitors. Coffee cups and scattered papers create an atmosphere of urgency and creativity, capturing the essence of the editing process.

lavish sparrow
dusky thistle
lavish sparrow
#

might also be the combo with the detailer tho -> very often it's patcher nodes that fck things over

zealous latch
#

A video editor editing on his 3 monitor setup

lavish sparrow
noble coyote
radiant ledge
signal shuttle
radiant ledge
#

yeah, it's alnost surgical

radiant ledge
#

flux is funnier

#

clearly about to surgically remove that third leg

lavish sparrow
#

feeding random gibberish to 3.5

rapid pivot
#

Eface

cunning lintel
#

Gibberish is great; Accidental gibberish to the clip encoders.

lavish sparrow
lavish sparrow
# rapid pivot Eface

oh. i'm here to report that my path of exile campaign has had yet more absurd things happening...

#

how about dropping a reflecting mist + (the interrogation -> vaal orb -> corrupting blood) in a single map?

#

like those things aren't supposed to happen

craggy crest
unkempt compass
#

I'm pretty sad. There is no that much new Checkpoints on Civitai/HuggingFace; especially Turbo/GGUF ones :/

dusky thistle
craggy crest
dusky thistle
craggy crest
dusky thistle
#

hah

turbid grotto
#

I am training lora for sd3.5m already guys gonnabegood

#

it takes only 7.6/12gb vram at 1024px with batch size 2

craggy crest
#

yeah - one of the main focus's of 3.5 was to make sure it was very easily trainable

#

that it worked, worked right, created very good images even though it's a base model, that it shines, is flexible and fast - and most of all, very easy to train. no battling with it

turbid grotto
#

yea, and it is just beginning

#

I think sdxl took more vram, I might be able to bump quality up, but idk how yet

#

OneTrainer btw, very easy

bitter hearth
#

I can't quite get it working as well on the flow models that don't have PAG but
for the diffusion ones like SD 1.5, SDXL etc, you can use low PAG amounts with no CFG to browse the unconditional distribution, in a form where you can see what the images are actually like
and you can use this to see how overtrained a model is (does it show a range of images or just anime 1girl etc)
if you run this test with Flux then you see clear images, often with that Flux chin
but if you run it with SD 3.5L you see a range of images as you should

untold valley
#

have you guys felt sd3.5m responds better to word spaghetti or with actual sentences?

craggy crest
untold valley
#

thanks

bitter hearth
#

cutting out T5 might be good for low VRAM people tbh

gusty trail
#

It is what SD3 designed for

turbid grotto
bitter hearth
#

yea there's probably all sorts of degradations

#

some people don't seem to want to use big cloud GPU so it is what it is

#

stuff like NF4 isn't costless either sadly

turbid grotto
#

but it can be fine in ram

bitter hearth
#

yeah putting encoders into ram after use is better in my opinion

turbid grotto
#

i am with rtx3060 and 32gb ram can run sd3.5l and T5 both at fp16 with the same speed as quantized to hell

bitter hearth
#

I think an even better option would be encoding embeds in advance

#

but that would probably not be popular

turbid grotto
bitter hearth
#

not sure but it wouldn't be a very difficult node to make if its not existing already

turbid grotto
#

will check node manager later

craggy crest
bitter hearth
#

lol

#

another option would be using a cloud embedding service

#

Diffusers is actually set up with the embedding library entirely seperate

#

what you could do is cache embeddings within one session

  1. load text encoders 2. encode like 200 prompts 3. unload text encoders
#

instead people often load and unload everything for each new image

noble coyote
#

Flux RF Inversion

bitter hearth
#

looks good, can't see any borders or seams

noble coyote
craggy crest
craggy crest
noble coyote
#

Flux RF Inversion

#

2 stages of 28 iterations each

bitter hearth
#

sadly I found flux needed a weirdly high number of steps to finalise image
sometimes 60 and sometimes even 100
its a very expensive model

noble coyote
mortal mesa
#

i like to use dpm_adaptive sometimes and ya steps can be pretty broad, i recall from 33-66, SD3 seems similar

sacred jewel
bitter hearth
#

dpm_adaptive is really awesome yeah

noble coyote
bitter hearth
#

this year I used TCD Sampler for like 95% of my images

#

there is something slightly better than TCD out, but only in Diffusers

noble coyote
bitter hearth
#

I mostly don't save them lol
I've posted a few on here though, every now and then

sacred jewel
bitter hearth
#

Kubrick is awesome

signal shuttle
bitter hearth
#

I think I remember seeing Animagine XL on civit

signal shuttle
bitter hearth
#

ah okay nice

#

I read that SD 1.5 is better for anime

#

but not sure if that has changed

signal shuttle
bitter hearth
#

okay I see

turbid grotto
#

curious if Illustrious team will jump on sd3.5, they could actually take the lead from pony, however, Astralite has very sophisticated system and probably already in training

bitter hearth
#

auraflow has the best prompt adherence by a fairly long way

turbid grotto
turbid grotto
cunning lintel
noble coyote
#

Can anyone find me an Auraflow TensorRT at all? πŸ˜„

signal shuttle
turbid grotto
craggy crest
#

everyone and their dog does good anime now

noble coyote
craggy crest
bitter hearth
#

if there was an exception, that exception would be Ideogram V2 or the upcoming Playground V3 possibly

noble coyote
#

Yes, suggests I ask FAL

#

... gone to FAL ...

bitter hearth
#

Torcello got sucked into a black hole

#

my favourite model by far is still midjourney, I don't use it though

craggy crest
noble coyote
#

I am one of the few who truly appreciate the underused Auraflow

noble coyote
craggy crest
#

it's way too much fun

noble coyote
#

Like Steamed Jam Roly Poly?! πŸ₯³

bitter hearth
#

their project to train an LLM to make comfy workflows was cool

signal shuttle
craggy crest
#

and you can remix other peopel's glifs to make your own version

cunning lintel
#

Auraflow .3 was such a let down, before that it was magic, sadly it seems abandoned.

bitter hearth
#

do you know about Aurum

#

blend of .2 and .3

#

I totally agree though

#

.3 lost the prompt adherence magic

cunning lintel
#

read about it 2 weeks ago, should try it πŸ™‚

#

so much to try and so little time and compute 🀣

craggy crest
bitter hearth
#

lol

#

the SimpleTuner dev was saying on reddit that Auraflow doesn't train well

#

not sure

#

my viewpoint is autoregressive models will quickly overtake diffusion models in prompt adherence anyway

#

(but be much slower, more expensive and lower image quality)

cunning lintel
#

There's really a whole lot to win on prompt adherence

#

I had this bright idea to try sd3.5 by having gemini read an entire book and create 25 scenes from it

#

it was all too complicated for poor sd35 (and flux too), turns out i've gotten really good at writing prompts current image ai's somewhat kind of can work with 🀑

bitter hearth
#

I never really learnt prompt engineering but yeah there is a lot of skill to it

craggy crest
#

and using gemini just makes it worse. go to meta.ai - tell it about your scenes and then ask it specificly to craft prompts for stable diffusion 3

untold valley
#

prompt "engineering" is just tossing word spaghetti at it until you figure out what the model likes and dislikes. then using that to submit it to your will.

noble coyote
#

"I am a prompt wrangler!!!"

mortal mesa
#

ide think the possible knowledge of how it could respond is the engineering part

#

surprised we don't have small LLM and Image model pairs as a normal thing yet.

untold valley
bitter hearth
cunning lintel
#

Problem was more the kind of scenes, often multiple people (something simple like a man at the counter of a bank, while a woman sits in the waiting room watching around) or even seemingly normal scenes (two people in a car, understanding the inside of a car turned out hard and the road seen from the windows, placing the steeringwheel). there's so many edge cases for seemingly mundane scenes ai's still struggle with. Often I just prompt for 1 subject, interaction i try rarely as i know it's hard, but when you try to create "real life" scenes, by just prompting, it's not that easy yet.

craggy crest
craggy crest
#

this "a woman sits in the waiting room watching around" wouldn't even tell most humans what she's actually doing. what does 'watching around' mean?

noble coyote
#

Making a Dynamic TensorRT for Auraflow3

noble coyote
craggy crest
noble coyote
#

πŸ™ƒ

craggy crest
#

that's the biggest issue most people run into - you have to talk to the AI in clear, concise terms - but you are talking to a computer. you must think like it does and talk to it like it thinks. NOT like you think or you talk to a human

bitter hearth
#

it gets rough cos I mostly use highly distilled models for only 2-4 steps with CFG 1
you only get like 9 tokens (less than 9 words) that the model will attend to

untold valley
noble coyote
#

Oops Auraflow TensorRT operation c r a s h e d - compilation error in backend

untold valley
#

ok is it me or does 3.5m reallly really loves like 3/4 shots

noble coyote
untold valley
#

walked into that one

noble coyote
#

πŸ₯³

untold valley
#

ive just realized that cfg on 3.5m has a crazy amount of control of the gens.

bitter hearth
#

I used CFG on flux since day 1 TBH

#

never actually did the no CFG route

untold valley
#

4,7,13 13 totally cooks it but crazy the difference between 4 and 7 from real to a more "anime" style

craggy crest
bitter hearth
#

LOL

craggy crest
# bitter hearth LOL

seriously. there's a reason it doesn't use CFG. set CFG to 0 and then prompt it

untold valley
#

so flux is loosy goosy ur saying?

mortal mesa
#

if everyone followed "the rules" we would never have anything new

craggy crest
#

people turn it on because they HAVE to HAVE their negative prompt fix.

#

but flux isn't designed to use CFG OR use negative prompts

mortal mesa
#

you can possibly get MORE than what its designed to do, crazy

craggy crest
craggy crest
mortal mesa
#

stay in the cave my friend

craggy crest
icy drift
#

Still no OmniGen in Comfy... Hopefully this weekend...

mortal mesa
craggy crest
mortal mesa
#

yup cant expose the lies, ya sure you learn stuff, that's clear, but when you don't know you BS for some like internet points in a passive/aggressive way, its prety clear

craggy crest
mortal mesa
#

no, more lies, do what you need for yourself

#

you dont get to be the most blocked user out of nowhere

craggy crest
signal shuttle
mortal mesa
#

i certainly didnt ask if you cared and none of that happened regardless

lunar canopy
#

mooooooooving on

#

there is far too much halloween candy to eat

untold valley
lunar canopy
untold valley
craggy crest
untold valley
#

I was wondering why 3.5m seems like a really great base model, and been having fun with it. Decided to test its word that shall not be said but starts with N and ends in W, capabilities and gosh darn it can you push it far. no wonder, it all makes sense that when you don't purposely handicap, sandbag, and sensor something it starts working correctly. thanks SAI. ❣️ goodjob mikuwha

lavish sparrow
#

this one looks pretty decent, except for that finger...

lavish sparrow
bitter hearth
#

its uses hugging transformers and diffusers libs

cunning lintel
#

same prompt also gave this, such an interesting style for sd3.5l to do out of the box

craggy crest
lavish sparrow
lavish sparrow
lavish sparrow
craggy crest
craggy crest
lavish sparrow
# craggy crest the llm is going to be very confused ;)

The digital masterpiece showcases a cosmic beauty, a stunning vision of ethereal allure. Her skin, a canvas of iridescent hues, melts and drips like celestial paint, revealing a complex network of shimmering galaxies and nebulas. Long, flowing hair, composed of cascading stars, frames her serene face, where eyes, like twin portals, reflect the vast expanse. The figure gracefully poses, allowing the cosmic substance to drip from her form, creating a captivating contrast between heavenly beauty and the raw, visceral nature of the melting effect. The image exudes a sense of otherworldly tranquility, inviting viewers to immerse themselves in this captivating, hyper-detailed digital creation.

craggy crest
#

red panda is from recraft. go to their site and use it, see what you think

bitter hearth
#

I voted against red panda every time

#

in that blind trial

errant dust
#

I'm not sure what you mean by no since that's what the article says.

bitter hearth
#

I liked midjourney, ideogram and flux pro

craggy crest
errant dust
#

And I was just sharing but I actually have no opinion about it since I haven't used it. I plan to because of course I'm curious. But I'll be honest I'm very happy with both flux and stable diffusion 3.5 L

craggy crest
#

they heavily cherry picked the images that got voted on

bitter hearth
craggy crest
#

the link to their site is in that post on twitter. go play with it and see what you think

bitter hearth
#

its on Fal apparently

lavish sparrow
cunning lintel
craggy crest
#

red panda, to me, feels like they tried to make a couple flux loras, didn't do them well, and are trying to carve out some of the pie for themselves - get users to use their website to gen with.

craggy crest
bitter hearth
#

20B is a lot, Flux is 12B for comparison, so panda is a very chonky transformer

#

it does hands and text very well

#

and has strong blur effect abilities like Flux Pro

#

the aesthetic fine tune seems slightly off to me

icy drift
bitter hearth
#

ah okay thanks for trying

#

yeah its tricky adding things to comfy

craggy crest
bitter hearth
#

there is a Leonardo model that is strong also, apparently

#

although that was before this summer so maybe it hasn't kept up

#

companies can't just launch a 2B Unet any more any compete

craggy crest
lavish sparrow
errant dust
#

Well petapixel isn't exactly AI friendly. They aren't anti AI per se but I would hardly call them favorable. And the same goes for the majority of the readers if you look at the feedback their articles get.

#

As a photographer it's a very good site but they have their biases in some things

bitter hearth
#

if we wait a few months there will be papers that benchmark it
there's finally papers that talk about flux and ideogram

errant dust
#

Ideogram 2.0?

bitter hearth
#

ye I used to do photography and read petapixel

lavish sparrow
bitter hearth
#

yeah I think I saw Ideogram 2.0 in a paper

#

ah yeah I found it

#

the playground V3 paper has Ideogram 2.0 in the comparisons

#

https://arxiv.org/abs/2409.10695

lavish sparrow
bitter hearth
#

got flux in the paper too

errant dust
#

Never heard of playground

bitter hearth
#

they did a model that was not great called Playground v2.5
it was really overfit, came out around SDXL time

#

but their new one looks competitive

#

Flux hasn't been benchmarking that well, I think it might be the slightly overfit aesthetic that is harming it in benchmarks

untold valley
bitter hearth
#

no papers on SD 3.5 yet though

lavish sparrow
#

aight, time to go to bed ^^

bitter hearth
#

the glow effect is rly good

craggy crest
#

what else would you want in a paper other than what's already written?

short thicket
bitter hearth
craggy crest
#

i'll sit here and do those tests if you want

bitter hearth
#

haha sadly you need 30,000 images to do FID for example
I will pay to run myself at some point

short thicket
bitter hearth
#

there's also the case of human preference studies
which are quite expensive

#

there are standardised companies that do those now, the fees are fairly flat but it adds up

#

we just have to wait a bit more, there will be papers on SD 3.5 soon, there are a fair few papers about Flux now

short thicket
craggy crest
craggy crest
short thicket
bitter hearth
#

there seems to be a sort of center of gravity anyway

#

regarding human preferences on most subjects

#

independent attempts at human preference optimisations often end up with kinda similar results

craggy crest
craggy crest
#

but ask them individually - you get cats

short thicket
bitter hearth
#

yeah I do mostly use stuff like FID to judge things instead

#

cos the human element is removed

#

FID has some issues though

#

it can also be gamed, sadly, its known what to do to subtly raise FID score

craggy crest
bitter hearth
#

FID is best for like

#

papers that made a sampler and they want to test what settings are best

#

so they show FID score for the different settings

#

for an actual new model I think you've kinda gotta take in all the benches combined, along with human pref study

#

cos with new models the financial incentive to game benchmarks is higher

gritty steeple
craggy crest
dusky thistle
craggy crest
dusky thistle
craggy crest
untold valley
#

congrats @dusky thistle

craggy crest
#

and he promptly goes into hiding

minor lotus
#

sd3 output default 10241x1024, can I get higher

untold valley
#

you can try but it likes 1megapixel res, its better to gen at that res and then you go and upscale it

dusky thistle
#

credit to SAI for releasing two killer models in the last week or so

minor lotus
untold valley
#

what do you mean?

dusky thistle
#

1920x1152 works pretty well for a one-shot generation with SD35M

#

large is a bit more limited for initial latent size

#

you'll gain some and lose some with coherence when going outside of the most heavily trained resolutions

minor lotus
#

Should I generate a standard image first and then use upscaling to increase the pixels?

dusky thistle
#

you should try both πŸ™‚

minor lotus
#

thanks

dusky thistle
#

there's advantages and disadvantages to both strategies

#

which is better depends on the subject and model so it's good to experiment with it

craggy crest
#

just make a bunch of 15x15 images and tile them ;)

#

(mosaic tiles)

untold valley
#

are there any optimal settins for your sampler you have found yet Batwing?

dusky thistle
#

try these if you want something really fast

#

res_3m is fantastic with paint

#

res_2m is more moderate... both run at euler speed

#

res_2s and espec res_3s are really high quality

#

eta = the amount of noise added, try setting that at 0, 0.25, and 0.5 and compare

untold valley
#

many appreciations

dusky thistle
#

np

#

also leaving the WFs embedded in these

craggy crest
dusky thistle
sacred jewel
dusky thistle
short thicket
short thicket
untold valley
#

one day we will have proper hands. but composition and textures are improving

short thicket
runic tusk
#

No.

#

Stop it.

#

Get some help.

craggy crest
#

you can't generate in this channel. you have to use the artisan channels. start by reading the information here: #artisan-faq

runic tusk
craggy crest
runic tusk
craggy crest
dusky thistle
dusky thistle
dusky thistle
hallow lion
#

clownshark delivering the goods.

dusky thistle
hallow lion
#

Do a Friday themed one XD

hallow lion
#

yeah

#

everyone is lonely

#

until they connect to god

untold valley
dusky thistle
bitter hearth
#

maybe SD3.5M just needed some stochasticity after all

untold valley
winged seal
# dusky thistle

now this looks fantastic. One of the few SD3.5 images I have seen that I really like

dusky thistle
dusky thistle
untold valley
dusky thistle
dusky thistle
#

all still SD3.5M. this model is special

#

one shot 1920x1152 with medium

untold valley
dusky thistle
#

yeah, they obviously just felt compelled to release before it was ready, it really was a beta πŸ™‚

untold valley
#

Not going to start this all over again. I got ptsd lol. But yeah.

dusky thistle
untold valley
#

Some issues are it adores 3/4 shots, likes solid white or black backgrounds,

hallow lion
hallow lion
#

Dor Brothers very close to the future of what movies will be like soon.

noble coyote
icy drift
#

OmniGen in Comfy is working for me now! πŸ₯³ I had to update my transformers library to 4.45.

#

No need for person loras anymore. πŸ™‚

bitter hearth
#

oh nice

#

what have you found it is good for?

#

that face copying ability does look strong

icy drift
# bitter hearth oh nice

I also managed to speed up the node made by https://github.com/AIFSH/OmniGen-ComfyUI
So now it's as fast at the Pinokio / non-comfy install.
I'm sure there's some way to make it much faster, but I have no idea what I'm doing. https://github.com/0X-JonMichaelGalindo/OmniGen-ComfyUI

GitHub

Contribute to AIFSH/OmniGen-ComfyUI development by creating an account on GitHub.

GitHub

Keep model loaded. Contribute to 0X-JonMichaelGalindo/OmniGen-ComfyUI development by creating an account on GitHub.

icy drift
bitter hearth
#

okay awesome

#

speed ups are tricky to implement, might need to wait for support

#

particularly stuff like tensorrt

severe phoenix
#

please does anyone have this problem where flux loras work better on civit than on their comfy?? right is my comfy, left is civit, how the lora should actually look. b4 u ask i have tried all the scheduler combos, still same issue persists.

short thicket
#

Uploading Mangled Merge V1 Dedistilled currently. It's Mangled Merge Matrix and Magic, plus PixelWave, FluxBooru, and nyanko7's dedistilled model. The model works as a dedistilled model so flux guidance is useless but negative prompts and dynamic thresholding work great. It also get's the styles of PixelWave, and the booru knowledge of FluxBooru and Loras work fine on it too.

severe phoenix
short thicket
severe phoenix
unkempt compass
short thicket
severe phoenix
short thicket
short thicket
short thicket
icy drift
short thicket
severe phoenix
frank mural
#

anyone knows if 3.5 is available for forge?

short thicket
severe phoenix
icy drift
short thicket
unkempt compass
#

That is why I was asking the question

severe phoenix
short thicket
severe phoenix
short thicket
#

bf16 is it's original format, then it was quantized down from that hence fp8 being gguf.

severe phoenix
icy drift
#

@unkempt compass Changing the model config to int8 datatype did not change memory requirements, and changing datatype to fp8_e5m2 failed. I do not know what else to try, unless you have any suggestions.