#🆕｜sd3 | Stable Diffusion | Page 118

dusky thistle Oct 31, 2024, 4:18 AM

#

#

#

real terrace Oct 31, 2024, 4:26 AM

#

I really love when there is that kind of composition and it gets done perfectly, like sections with perfect spaces

dusky thistle Oct 31, 2024, 4:27 AM

#

oblique parcel Oct 31, 2024, 4:29 AM

#

sacred geode Oct 31, 2024, 4:40 AM

#

sacred jewel Oct 31, 2024, 4:46 AM

#

real terrace I really love when there is that kind of composition and it gets done perfectly,...

Yeah, I am always pleasantly surprised.

dusky thistle Oct 31, 2024, 4:52 AM

#

#

#

#

sacred geode Oct 31, 2024, 5:39 AM

#

https://civitai.com/models/904556?modelVersionId=1012233

#

not sure why civitai isn't showing the info :/

patent acorn Oct 31, 2024, 6:31 AM

#

#

untold valley Oct 31, 2024, 6:41 AM

#

dusky thistle Oct 31, 2024, 6:46 AM

#

#

patent acorn Oct 31, 2024, 6:50 AM

#

ok def large is better

sterile pendant Oct 31, 2024, 6:53 AM

#

I wasn't really impressed with large, but medium has kind of hit the artistic sweet spot if you're into doing art styles and if you're not obsessed with making waifus or people

#

I think finetunes of medium will eventually replace sdxl

patent acorn Oct 31, 2024, 6:55 AM

#

large is prompt coherence-adherence fun, medium is sdxl but 100x better

dusky thistle Oct 31, 2024, 6:57 AM

#

#

patent acorn Oct 31, 2024, 7:06 AM

#

i typed a 2 panel garfield comic, i think the rating tags wasnt supposed to be there but hey garfield works 😂

dusky thistle Oct 31, 2024, 7:10 AM

#

#

craggy crest Oct 31, 2024, 7:28 AM

#

#

@dusky thistle Prompt: a young executive sitting in a tower office near a window. Through the window we see the tops of buildings and the city. There is a cartoon thought bubble attached to his head and in it, we see a picture of a puppy

muted dove Oct 31, 2024, 7:35 AM

#

craggy crest Oct 31, 2024, 7:36 AM

#

craggy crest Oct 31, 2024, 7:47 AM

#

muted dove

always eat breakfast before generating

muted dove Oct 31, 2024, 7:53 AM

#

craggy crest always eat breakfast before generating

Yep, made that mistake!

#

Generating on an empty stomach

craggy crest Oct 31, 2024, 7:57 AM

#

muted dove Generating on an empty stomach

;) it was rather obvious that food was on your mind

muted dove Oct 31, 2024, 7:57 AM

#

craggy crest ;) it was rather obvious that food was on your mind

Not on these minds 😄

craggy crest Oct 31, 2024, 7:58 AM

#

muted dove Not on these minds 😄

they probably don't mind

#

ghost lighthouse

muted dove Oct 31, 2024, 8:02 AM

#

#

muted dove Oct 31, 2024, 8:18 AM

#

Love these waves 🌊

#

craggy crest Oct 31, 2024, 8:19 AM

#

like the light streaks from the lighthouse in this one

#

prompt: Intricate, geometric, snake-scale patterns, tessellated in shimmering, metallic hues of polished silver, gold, and bronze, reflecting light with a dazzling, kaleidoscopic effect, evoking the ancient, symbolic language of reptilian textures, and the modern, technological allure of precision-crafted materials.

muted dove Oct 31, 2024, 8:24 AM

#

craggy crest like the light streaks from the lighthouse in this one

I like how it illuminates the rain in this

craggy crest Oct 31, 2024, 8:25 AM

#

oooo love that effect

muted dove Oct 31, 2024, 8:29 AM

#

A slightly different take on the lighthouse theme

craggy crest Oct 31, 2024, 8:30 AM

#

muted dove A slightly different take on the lighthouse theme

needs a crab

muted dove Oct 31, 2024, 8:32 AM

#

craggy crest needs a crab

craggy crest Oct 31, 2024, 8:34 AM

#

ROFL!

muted dove Oct 31, 2024, 8:35 AM

#

#

I'd rather be there right now. 🤔

#

craggy crest Oct 31, 2024, 8:37 AM

#

muted dove

https://youtu.be/cE0wfjsybIQ?si=8XaL4__ok-Ebyfrf

YouTube

Noisestorm

Noisestorm - Crab Rave (Official Music Video)

My solo developed game "Crab Champions" is OUT NOW! 🦀 https://store.steampowered.com/app/774801/Crab_Champions/

My first ever music video, created entirely with Unreal Engine 4!

Support on Spotify: https://open.spotify.com/track/4qDHt2ClApBBzDAvhNGWFd

▶ Play video

muted dove Oct 31, 2024, 8:43 AM

#

craggy crest https://youtu.be/cE0wfjsybIQ?si=8XaL4__ok-Ebyfrf

craggy crest Oct 31, 2024, 8:44 AM

#

Crabs!

muted dove Oct 31, 2024, 8:45 AM

#

A few 😄

#

craggy crest Oct 31, 2024, 8:53 AM

#

muted dove

you can do that with just a prompt in 3.5

muted dove Oct 31, 2024, 8:53 AM

#

craggy crest you can do that with just a prompt in 3.5

That is just a prompt, in Flux.

craggy crest Oct 31, 2024, 8:54 AM

#

https://x.com/cubiq/status/1851273471503860212

Matteo Spinelli (@cubiq) on X

Guess what? it also works with SD3.5M

#

https://x.com/cubiq/status/1851187910923464916

Matteo Spinelli (@cubiq) on X

trying to recreate this "picture-in-picture" effect with just prompting in SD35L. considering no lora, no controlnet, no inpainting.... it's impressive

#

muted dove Oct 31, 2024, 8:55 AM

#

craggy crest https://x.com/cubiq/status/1851187910923464916

I know, was just chatting to him about it, so I gave it a try and that was first attempt.

craggy crest Oct 31, 2024, 8:56 AM

#

muted dove I know, was just chatting to him about it, so I gave it a try and that was first...

he's got some really interesting images on his twitter profile

muted dove Oct 31, 2024, 8:56 AM

#

I avoid the whole platform.

craggy crest Oct 31, 2024, 8:58 AM

#

muted dove I avoid the whole platform.

muted dove Oct 31, 2024, 8:58 AM

#

I smell it already 🤢

craggy crest Oct 31, 2024, 8:59 AM

#

muted dove I smell it already 🤢

you don't like musky smells?

muted dove Oct 31, 2024, 8:59 AM

#

Not if it's associated with knob face

sterile pendant Oct 31, 2024, 9:00 AM

#

patent acorn large is prompt coherence-adherence fun, medium is sdxl but 100x better

From my fairly limited testing with large vs medium, prompt adherence is roughly the same. Run 50 seeds of each and you'll see they average out. Depends on the prompt and/or how poorly it's worded though. Also, medium has a 256 token t5 limit

craggy crest Oct 31, 2024, 9:01 AM

#

muted dove Not if it's associated with knob face

prompt: Not if it's associated with knob face

muted dove Oct 31, 2024, 9:02 AM

#

craggy crest prompt: Not if it's associated with knob face

Much more pleasing

craggy crest Oct 31, 2024, 9:03 AM

#

muted dove Much more pleasing

Prompt: the brave little knob-face

#

it's the silly time of the night

muted dove Oct 31, 2024, 9:04 AM

#

When dinosaurs walked the earth...

craggy crest Oct 31, 2024, 9:12 AM

#

sacred geode Oct 31, 2024, 9:38 AM

#

winged seal Oct 31, 2024, 9:39 AM

#

Hey people, anybody know how to merge LoRA's into a transformers model, and then save it in the transformers split format as seen on HF? I need to be able to merge my LoRA's into my models so I can train additional concepts in

#

I am very inexperienced with code, so I unfortunately do not understand a lot of the stuff I have seen

noble coyote Oct 31, 2024, 9:51 AM

#

Flux RF Inversion - no Controlnet, no IPAdapter! Original, then RFI version - prompt = a boy in an angry rage cartoon style

sterile pendant Oct 31, 2024, 9:55 AM

#

winged seal Hey people, anybody know how to merge LoRA's into a transformers model, and then...

You're better off asking in huggingface diffusers groups if you're looking to make a split diffusers model where each chunk of the model is limited to like 2gb or w/e

#

If you wanted to just merge a model with a lora using the non-diffusers format, you can easily do it in comfyui and save the model. It's three nodes

noble coyote Oct 31, 2024, 9:57 AM

#

Flux RF Inversion - no Controlnet, no IPAdapter! Original, then RFI version - prompt = a boy in an angry rage cartoon style

turbid grotto Oct 31, 2024, 9:57 AM

#

anybody tried to train sd35m?

winged seal Oct 31, 2024, 9:58 AM

#

sterile pendant You're better off asking in huggingface diffusers groups if you're looking to ma...

I just need the output to be in diffusers/transformers format

sterile pendant Oct 31, 2024, 9:58 AM

#

winged seal I just need the output to be in diffusers/transformers format

Yeah I'm not sure. I want to say comfyui now supports diffuser based models but I could be wrong

noble coyote Oct 31, 2024, 10:02 AM

#

fossil pagoda Oct 31, 2024, 10:03 AM

#

241031110232_A_dark_rainy_cyberpunk_cityscape_with_neon_signs_in_Japanese_kanji_glowing_in_pi__00013_.png

icy drift Oct 31, 2024, 10:08 AM

#

Can anyone manage to get the ComfyUI OmniGen node working? It just errors for me.
https://github.com/AIFSH/OmniGen-ComfyUI

GitHub

GitHub - AIFSH/OmniGen-ComfyUI

Contribute to AIFSH/OmniGen-ComfyUI development by creating an account on GitHub.

#

Console says Phi3 (OmniGen's base model) doesn't support SDPA attention or something.

muted dove Oct 31, 2024, 10:11 AM

#

A hologram from the future

#

gusty trail Oct 31, 2024, 10:27 AM

#

turbid grotto anybody tried to train sd35m?

training

gusty trail Oct 31, 2024, 10:28 AM

#

winged seal I just need the output to be in diffusers/transformers format

comfyui save checkpoint is able to merge the applied lora into model and you might need to convert the saved checkpoint to diffuser format

patent acorn Oct 31, 2024, 10:31 AM

#

sterile pendant From my fairly limited testing with large vs medium, prompt adherence is roughly...

if its the same adherence then is it a dataset problem?

winged seal Oct 31, 2024, 10:31 AM

#

gusty trail comfyui save checkpoint is able to merge the applied lora into model and you mig...

I am not aware of any way to convert safetensors to diffusers, but if thats the case, that woud be amazing

gusty trail Oct 31, 2024, 10:33 AM

#

winged seal I am not aware of any way to convert safetensors to diffusers, but if thats the ...

https://github.com/huggingface/diffusers/tree/main/scripts

GitHub

diffusers/scripts at main · huggingface/diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX. - huggingface/diffusers

winged seal Oct 31, 2024, 10:34 AM

#

well no shit... if thats the case

#

you will be my hero if this works @gusty trail

gusty trail Oct 31, 2024, 10:35 AM

#

winged seal you will be my hero if this works <@331826740898824195>

I just remembered you could use unetsave which might do the job without convert

winged seal Oct 31, 2024, 10:36 AM

#

I need the full model saved in diffusers format

gusty trail Oct 31, 2024, 10:37 AM

#

you might just copy other component

#

and replace the transformer part

winged seal Oct 31, 2024, 10:38 AM

#

I am *extremely inexperienced with code, so the less work/chances to mess things up, the better 😅

#

I can use safetensors to merge in the LoRA's, and then convert to transformers, if I can figure it out

gusty trail Oct 31, 2024, 10:40 AM

#

the unetsave would save the model as a unet. If you want to load the models using diffuser pipeline, you just need a diffusers repo copy and replace the original transformer with saved new transformer.

noble coyote Oct 31, 2024, 10:41 AM

#

Flux RF Inversion - no Controlnet, no IPAdapter! Original, then RFI version - prompt = astronaut on a spaceship in the style of 3d melting gold render cleaning the toilet

winged seal Oct 31, 2024, 10:45 AM

#

here's an outline of what I need:

I am training flux using AIT, which requires a diffusers/transformers format model

From there, I wat to merge in those LoRA's, and continue training new concepts in/on that merged model

in order to do that, the model I merge into or try to train on cannot be a safetensors, and instead needs to be a transformers style model.

All I need here s to be able to convert from safetensors to diffuers, or have a script to merge LoRA's into diffusers/transformers models

gusty trail Oct 31, 2024, 10:46 AM

#

Just try it. There are many way to achieve the same result

#

"diffuser" style model just has different naming key with the same thing. there is no black magic

winged seal Oct 31, 2024, 10:48 AM

#

My mind has a tendency to get overwhelmed and shut down when I have no idea what I am doing 😭

sterile pendant Oct 31, 2024, 10:51 AM

#

patent acorn if its the same adherence then is it a dataset problem?

Moistly, yeah, but likely with the captioning itself. Since the community is split over caveman sd1.5 prompting and newer t5 style natural prompting, the captions have to have a mix of both types. I'm sure you see where I'm going with this

patent acorn Oct 31, 2024, 10:55 AM

#

sterile pendant Moistly, yeah, but likely with the captioning itself. Since the community is spl...

then the medium has that mixed captions dataset?

sterile pendant Oct 31, 2024, 10:59 AM

#

patent acorn then the medium has that mixed captions dataset?

All of the models do and have to really. We all make lazy prompts from time to time like a dog with a ball, evening time, grassy field, cinematic lighting. Not everyone wants to write novels with complete sentences

#

what would really help the diffusion community most would be a new text encoder model that can translate lazy prompts and fleshed out prompts. almost like running prompt expansion under the hood for the lazy prompts. the problem is that text encoders are expensive to train. T5 and clipg/l are ancient by ML standards, with the rate that things grow in this field

sacred geode Oct 31, 2024, 11:33 AM

#

lavish sparrow Oct 31, 2024, 11:38 AM

#

patent acorn Oct 31, 2024, 11:50 AM

#

sterile pendant what would really help the diffusion community most would be a new text encoder ...

can an LLM be used as TE to do that tho?

lavish sparrow Oct 31, 2024, 11:52 AM

#

patent acorn can an LLM be used as TE to do that tho?

taht's what i'm already doing

#

Have a LLM pick up your prompt -> split it in a t5, clipl,clipg prompt

patent acorn Oct 31, 2024, 11:54 AM

#

lavish sparrow Have a LLM pick up your prompt -> split it in a t5, clipl,clipg prompt

yeah thats for some huggingface spaces

#

but what llm?

lavish sparrow Oct 31, 2024, 11:55 AM

#

patent acorn but what llm?

i've tried various -> actually roleplay LLM's are surprisingly the better ones. They are usually more creative, and give more flavourful prompts

patent acorn Oct 31, 2024, 11:55 AM

#

specifically llama 3 is the only llm i find to be creative

lavish sparrow Oct 31, 2024, 11:55 AM

#

mistral nemo, or if you can run it, mistral small

pseudo owl Oct 31, 2024, 11:56 AM

#

patent acorn can an LLM be used as TE to do that tho?

Yes llms can be used directly to convert your text into embeddings for the unet or dit like the t5xxl , lumina next, sana, li-dit all do that.

What eface is saying is to expand your prompt/rephrase it and then put the llm enhanced prompts into clipg, clipl, t5xxl.

lavish sparrow Oct 31, 2024, 11:56 AM

#

i've tried cydonia 22b, works really well too

noble coyote Oct 31, 2024, 11:56 AM

#

Meta's Llama3.2 is good

#

Zephyr, Llava2

lavish sparrow Oct 31, 2024, 11:57 AM

#

zephyr still exists?

patent acorn Oct 31, 2024, 11:57 AM

#

pseudo owl Yes llms can be used directly to convert your text into embeddings for the unet ...

yeah ik the enhanced prompt

noble coyote Oct 31, 2024, 11:57 AM

#

At ollama.com

patent acorn Oct 31, 2024, 11:57 AM

#

thats why i said some hf spaces do it

bitter hearth Oct 31, 2024, 11:57 AM

#

patent acorn can an LLM be used as TE to do that tho?

yes this is possible even today, Hunyuan-DiT even supports multi-turn prompting where you have a conversation
however if a model baked in automatic prompt expansion like they suggested, I would never use it personally

patent acorn Oct 31, 2024, 11:58 AM

#

idk if yall seen lumina using an llm as TE

bitter hearth Oct 31, 2024, 11:58 AM

#

we need a diffusion model that can handle a wider range of guidance vectors, rather than an LLM forcing us to use a different vector

pseudo owl Oct 31, 2024, 11:58 AM

#

patent acorn yeah ik the enhanced prompt

Yeah then you can honestly use any llm. It’s more about the system prompt then. some might be slightly more creative then others but the system prompt, few shot prompts matter much more.

lavish sparrow Oct 31, 2024, 11:59 AM

#

pseudo owl Yeah then you can honestly use any llm. It’s more about the system prompt then. ...

There was one model that didn't have system prompt recently, it totally sucked because it just couldn't follow instructions very well, dunno which one it was

noble coyote Oct 31, 2024, 12:00 PM

#

Try these too minicpm-v:8b-2.6-q8
qwen2:0.5b

sterile pendant Oct 31, 2024, 12:01 PM

#

patent acorn can an LLM be used as TE to do that tho?

Yeah but they are slow as hell compared to the way clip/t5 encode. LLMs do the whole next token thing and clip/t5 output the whole output at once. Also, the word flower doesn't come out of the prompt encoder as flower. It's way to complicated for me to explain at 8am walking the dog lol...

noble coyote Oct 31, 2024, 12:01 PM

#

gemma2:latest

patent acorn Oct 31, 2024, 12:01 PM

#

true haha

patent acorn Oct 31, 2024, 12:01 PM

#

noble coyote gemma2:latest

license unfortunately

bitter hearth Oct 31, 2024, 12:03 PM

#

TBH the best text embeddings for diffusion would probably have to be over API

sterile pendant Oct 31, 2024, 12:03 PM

#

Even if you're using a pre-stage with an llm to prompt expand, you're still at the mercy of clip and t5 and then encode it for the diffusion. It helps a ton though

pseudo owl Oct 31, 2024, 12:03 PM

#

sterile pendant Yeah but they are slow as hell compared to the way clip/t5 encode. LLMs do the w...

Yeah but that’s not a problem if you use llm directly as a text encoder(need to train the dit/unet tho). For example, gemma 2b is actually faster then t5xxl as a text encoder.

But with enhancing prompts it is yeah.

bitter hearth Oct 31, 2024, 12:03 PM

#

I think a better direction is widening the range of prompt types that work for the model

sterile pendant Oct 31, 2024, 12:05 PM

#

pseudo owl Yeah but that’s not a problem if you use llm directly as a text encoder(need to ...

Yeah newer architecture, but again, it's not encoding the prompt for diffusion, it's making text. Though that one team managed to turn Gemma into a t5 alternative. I'd have to look at what they did to pull it off, but it's likely something hacky

bitter hearth Oct 31, 2024, 12:05 PM

#

this is Sana?

sterile pendant Oct 31, 2024, 12:05 PM

#

Like maybe they had to train the last couple layers or something

bitter hearth Oct 31, 2024, 12:05 PM

#

Sana is Nvidia so maybe its skill issue thing and they worked out a way to do it TBH

pseudo owl Oct 31, 2024, 12:06 PM

#

sterile pendant Yeah newer architecture, but again, it's not encoding the prompt for diffusion, ...

A few ones, sana(used li-dits advice), lumina, and li-dit(they use llama3 and qwen2 7b but not open source, paper tho).

It actually performs better then t5xxl I believe while being considerably faster and using less vram.

patent acorn Oct 31, 2024, 12:07 PM

#

ok can i ask something outside sd3 here? because im trying to change actor clothes frame by frame and its stressing me out due to the process is rly slow and im using krita ai diffusion

bitter hearth Oct 31, 2024, 12:07 PM

#

I don't know this area well but there is an entire field of diffusion models called "VTON" or "Virtual Try On"

#

they work differently to our normal ones

patent acorn Oct 31, 2024, 12:08 PM

#

im tryna find a way to change anyone clothes in vid and all i could find is inpaint the actor outfit then use ebsynth and if the next frame is liquidy then edit it again and so on

bitter hearth Oct 31, 2024, 12:08 PM

#

they tend to fork the Unet like in control net or brush net and then do self attention and/or cross attention injections across the two unets

#

sadly your task is extremely hard its still experimental

#

VTON is the key word to search for though

patent acorn Oct 31, 2024, 12:09 PM

#

im too nervous since its a school film project and it was past the deadline though the teacher doesnt even care

#

https://www.youtube.com/watch?v=FiIyV7jw4SU

YouTube

Sebastian Kamph

How to change clothes with AI.

Prompt styles for Stable diffusion Automatic1111, ComfyUI & Vlad/SD.Next: https://www.patreon.com/posts/sebs-hilis-79649068
Inpainting model https://civitai.com/models/25694?modelVersionId=134361

Get early access to videos and help me, support me on Patreon https://www.patreon.com/sebastiankamph

Chat with me in our community discord: https://d...

▶ Play video

#

is this good tho?

#

but doin it every frame uhh

sterile pendant Oct 31, 2024, 12:10 PM

#

pseudo owl A few ones, sana(used li-dits advice), lumina, and li-dit(they use llama3 and qw...

yeah i'm looking at sana's arxiv right now and it was them that i was thinking of that is using gemma

bitter hearth Oct 31, 2024, 12:10 PM

#

oh sorry my suggestion was not appropriate for school project
I am not sure there is an easy solution though

patent acorn Oct 31, 2024, 12:10 PM

#

bitter hearth VTON is the key word to search for though

looks similar to the vid

#

wonder if it can do shirtless

bitter hearth Oct 31, 2024, 12:11 PM

#

ah yeah this is the sort of thing I meant

#

dedicated VTON model

#

they beat normal methods

patent acorn Oct 31, 2024, 12:11 PM

#

wait but

#

what about some props to the actor

#

like loincloth

noble coyote Oct 31, 2024, 12:11 PM

#

sackcloth and ashes

bitter hearth Oct 31, 2024, 12:12 PM

#

you could use VTON for most of their clothes and then try img-to-img for small objects

noble coyote Oct 31, 2024, 12:12 PM

#

Ollama/Flux i2i

patent acorn Oct 31, 2024, 12:13 PM

#

bitter hearth you could use VTON for most of their clothes and then try img-to-img for small o...

confused with the img-img, inpainting you say?

bitter hearth Oct 31, 2024, 12:14 PM

#

inpainting is the easiest
also compositing (place the object where you want it before the img-to-img)
hardest is stuff like noise inversion or edit models like CosXL

patent acorn Oct 31, 2024, 12:15 PM

#

then i have to do that on few specific frames and do ebysnth

bitter hearth Oct 31, 2024, 12:15 PM

#

I never got noise inversion to work personally its tricky

patent acorn Oct 31, 2024, 12:15 PM

#

this was example i was stressed out btw

#

somehow more yellowish than the right pic

bitter hearth Oct 31, 2024, 12:16 PM

#

not sure about ebsyth

patent acorn Oct 31, 2024, 12:16 PM

#

i mean

#

im doing inpainting

#

the refine gen for some reason mroe yellowish than the before frame

bitter hearth Oct 31, 2024, 12:17 PM

#

that's not a big deal as we have good colour match tools

#

many comfy nodes do the same colour match method called Reinhart or something

sterile pendant Oct 31, 2024, 12:18 PM

#

Oh yeah, that's where I got the system prompt I've been using with qwen2.5 for prompt expansion lately lol... (Sana's paper)

bitter hearth Oct 31, 2024, 12:18 PM

#

if you are inpainting I strongly recommend powerpaint v2.1 https://github.com/nullquant/ComfyUI-BrushNet

sterile pendant Oct 31, 2024, 12:18 PM

#

I forgot I read parts of this paper when it came out lol

patent acorn Oct 31, 2024, 12:18 PM

#

bitter hearth many comfy nodes do the same colour match method called Reinhart or something

i didnt know that.. so i can just match the color with the frame before?

bitter hearth Oct 31, 2024, 12:18 PM

#

sterile pendant I forgot I read parts of this paper when it came out lol

ah nice LOL

patent acorn Oct 31, 2024, 12:18 PM

#

bitter hearth if you are inpainting I strongly recommend powerpaint v2.1 ``https://github.com/...

im using krita ai diffusion rn

#

comfyui as the remote

bitter hearth Oct 31, 2024, 12:19 PM

#

I love Sana cos they made a DiT with no positional embeds

sterile pendant Oct 31, 2024, 12:19 PM

#

that sysprompt works REALLY well I've found (obviously, you need to use an instruct version of a model, not a base version)

#

well they are the pixart team mostly and i loved pixart sigma

bitter hearth Oct 31, 2024, 12:20 PM

#

patent acorn im using krita ai diffusion rn

by pure coincidence I spent a lot of time yesterday researching the latest inpainting methods
my conclusion was that powerpaint v2.1 is the way to go, out of stuff that is currently fully released and working in common GUIs

#

the original brushnet paper explains why inpainting models and control nets don't work

#

inpainting models mix the text tokens in too early, and control nets are too sparse of a control

patent acorn Oct 31, 2024, 12:21 PM

#

wait what i gotta check it out

bitter hearth Oct 31, 2024, 12:21 PM

#

there is a really nice node pack with good examples https://github.com/nullquant/ComfyUI-BrushNet

bitter hearth Oct 31, 2024, 12:22 PM

#

sterile pendant well they are the pixart team mostly and i loved pixart sigma

yeah I spent half a year waiting for new Pixart instead of Sana though lol

#

Sana is still very interesting but I wanted main model

noble coyote Oct 31, 2024, 12:23 PM

#

sterile pendant well they are the pixart team mostly and i loved pixart sigma

Is PiXart dead, resurrected as Sana?

sterile pendant Oct 31, 2024, 12:23 PM

#

nvidia teamed up with them for RnD

bitter hearth Oct 31, 2024, 12:23 PM

#

I hope not but maybe

noble coyote Oct 31, 2024, 12:23 PM

#

PiXart Sigma was cool, agree

bitter hearth Oct 31, 2024, 12:23 PM

#

hoping for new pixart also

sterile pendant Oct 31, 2024, 12:23 PM

#

sana is likely a proof of concept before they make something big from the arch

patent acorn Oct 31, 2024, 12:23 PM

#

need pixart omega smh

bitter hearth Oct 31, 2024, 12:23 PM

#

the VAE from Sana will be good for other uses

patent acorn Oct 31, 2024, 12:23 PM

#

bitter hearth the VAE from Sana will be good for other uses

is the vae licensed?

bitter hearth Oct 31, 2024, 12:23 PM

#

in some ways the VAE was the star of the show anyway

#

not sure license

patent acorn Oct 31, 2024, 12:26 PM

#

bitter hearth there is a really nice node pack with good examples ``https://github.com/nullqua...

brushnet examples lookin interesting but sadly im tired of inpainting.. i found a video that would do the outfit transfer for me

bitter hearth Oct 31, 2024, 12:26 PM

#

okay nice that looks not bad

#

if you like outpainting, thats where powerpaint dominated the other methods

#

powerpaint's preference score was like 600% of the score that inpainting model got

patent acorn Oct 31, 2024, 12:32 PM

#

bitter hearth if you like outpainting, thats where powerpaint dominated the other methods

Looks like the creator of krita ai diff need to find abt

#

i havent seen one message about it in its discord

noble coyote Oct 31, 2024, 12:32 PM

#

Ollama and SD3.5L

bitter hearth Oct 31, 2024, 12:33 PM

#

yeah Krita is not the way to go for inpainting

#

dedicated networks are getting too good

noble coyote Oct 31, 2024, 12:34 PM

#

I use Photoshop_SD_Plugin node inside Photoshop to enhance its Content Aware/Inpainting features ... bu that's just me!

sterile pendant Oct 31, 2024, 12:34 PM

#

using that sysprompt from Sana, i got some nightmare fuel out of sd3.5 medium

noble coyote Oct 31, 2024, 12:34 PM

#

Pigs WILL fly!!!

sterile pendant Oct 31, 2024, 12:35 PM

#

(that ui is my gradio app i've been making for the family. trying to make it as idiot proof as possible)

#

it runs comfy api workflows

#

and that's using qwen2.5 7b IT for expansion

noble coyote Oct 31, 2024, 12:36 PM

#

Comfy has had a makeover ...

#

I often use Qwen, Zephyr too, lately Meta's Llama3.2

bitter hearth Oct 31, 2024, 12:37 PM

#

I'm a florence guy mostly

#

its not as strong these days though

noble coyote Oct 31, 2024, 12:37 PM

#

I use Flux with Florence2 a lot

sterile pendant Oct 31, 2024, 12:37 PM

#

yeah but its not idiot proof enough for techno illiterate family members. comfy lets me make custom workflows and it has the best optimization out of all the frontends

noble coyote Oct 31, 2024, 12:38 PM

#

minicpm-v:8b-2.6-q8

sterile pendant Oct 31, 2024, 12:38 PM

#

i use both florence2 and minicpm2.6

#

have a tab for it in my app

bitter hearth Oct 31, 2024, 12:38 PM

#

sadly Diffusers is overtaking Comfy in speed

#

cos of Torchao and Sage Attention

sterile pendant Oct 31, 2024, 12:39 PM

#

well it's not like those features can't be implemented into comfy

noble coyote Oct 31, 2024, 12:39 PM

#

3.5L and Llama3.2

bitter hearth Oct 31, 2024, 12:39 PM

#

there's an interesting convo about exactly that in comfy discord at the moment

patent acorn Oct 31, 2024, 12:39 PM

#

noble coyote Pigs WILL fly!!!

https://tenor.com/view/bad-piggies2-gif-26842263

Tenor

noble coyote Oct 31, 2024, 12:40 PM

#

Marvellous!!!

#

🥳

sterile pendant Oct 31, 2024, 12:40 PM

#

noble coyote 3.5L and Llama3.2

how is 3.2 vs 3.1? afaik, wasn't it just a distilled version of 3.1? like the 3b model is roughly on par with the 8b version, right?

#

haven't really bothered messing with it yet

bitter hearth Oct 31, 2024, 12:42 PM

#

3.2 boosted the smaller ones a fair bit

#

to be honest though its the agent or chain framework that you use around the LLM that is more important at this point

sterile pendant Oct 31, 2024, 12:44 PM

#

well i mostly only use instruct versions of models for stuff like prompt expansion and some of the newer models have been hit and miss. like for my app's prompt expansion, qwen2.5 is the only one that reliably follows the exact format and doesn't do a bunch of rambly verbose LLM stuff

bitter hearth Oct 31, 2024, 12:44 PM

#

verbose is a big issue yeah

sterile pendant Oct 31, 2024, 12:44 PM

#

3.1 is close though, but still whiffs it like 1/20 times

noble coyote Oct 31, 2024, 12:45 PM

#

sterile pendant how is 3.2 vs 3.1? afaik, wasn't it just a distilled version of 3.1? like the 3b...

It holds its head up against Llama3.1; but since my output is mainly artistic - "there is never one LLM being better than another" - as each 'mistake' or 'aberration' is often masked and "contributes to the artistic whole"

sterile pendant Oct 31, 2024, 12:46 PM

#

yeah i feel you

bitter hearth Oct 31, 2024, 12:46 PM

#

getting a second smaller LLM to check the results and force a re-roll for a bad one can help

noble coyote Oct 31, 2024, 12:46 PM

#

Is art full of mistakes? Yes!
And it is often the bad mistakes which make the art successful!

#

... he said, rather enigmatically!

bitter hearth Oct 31, 2024, 12:48 PM

#

🤔

sterile pendant Oct 31, 2024, 12:49 PM

#

i'll try out 3.2 since it's smaller and all. my pc only has 32gb ram. like for my t5 tenc, i use the q5km gguf to save a little ram. worst case scenario, i have q8 flux or sd3.5large/medium, q5km t5, clipg/l, qwen2.5 7b(q5km), minicpm2.6, florence 2 all offloaded into ram. it all fits without having to hit the page file lol

noble coyote Oct 31, 2024, 12:49 PM

#

My 8GB VRAM is only competent due to my 64Gb RAM

sterile pendant Oct 31, 2024, 12:50 PM

#

i only have 8gb vram as well

#

so i immedately offload after use at each stage

noble coyote Oct 31, 2024, 12:50 PM

#

Plus 2 x SSDs for rapid LoRA and Checkpoint change

sterile pendant Oct 31, 2024, 12:51 PM

#

yeah all my models are on a 5gb/s nvme

noble coyote Oct 31, 2024, 12:51 PM

#

My 2Tb system SSD is overfull, so swapping out for 4Tb

sterile pendant Oct 31, 2024, 12:54 PM

#

lmao... (sd3.5m)

bitter hearth Oct 31, 2024, 12:55 PM

#

wow

sterile pendant Oct 31, 2024, 12:55 PM

#

"A cucumber human hybrid creature stands in a whimsical scene inspired by Francisco Goya's style. Its torso resembles a green, lumpy cucumber with delicate, vine-like tendrils for arms and legs, while its head is humanoid with large, expressive eyes and a mischievous grin. The background features a dreamy, surreal landscape with floating clouds and ghostly figures, creating an eerie yet enchanting atmosphere." not quite goya, but i'll take it

noble coyote Oct 31, 2024, 12:55 PM

#

Flux and Ollama minicpm2.6

sterile pendant Oct 31, 2024, 1:04 PM

#

#

alright im done lol...

pseudo owl Oct 31, 2024, 1:15 PM

#

muted dove A hologram from the future

Tried something similar with mochi

noble coyote Oct 31, 2024, 1:15 PM

#

LLama3.2 and 3.5L

lavish sparrow Oct 31, 2024, 2:16 PM

#

#

finite osprey Oct 31, 2024, 2:41 PM

#

muted dove Oct 31, 2024, 2:51 PM

#

Hair in your soup?

#

flat oracle Oct 31, 2024, 3:06 PM

#

sup guys maybe someone can help me out with that:

Im trying to generate detailed pixelimages from simple pixel images.
Therefore im resizing the images to 1024x1024, which is no problem at all with pixel art.

However for some images, its gives me weird outputs if i dont scale them down to 512x512.

Upper one is 1024x1024
below that is 512x512.

The initial image resolution is 964x464

Can someone explain this to me, or even tell me in which case i need to resize to what resolution?

dusky thistle Oct 31, 2024, 3:50 PM

#

SD35M

rapid pivot Oct 31, 2024, 3:52 PM

#

Cool colors

lavish sparrow Oct 31, 2024, 3:56 PM

#

#

dusky thistle Oct 31, 2024, 4:05 PM

#

1920x1152

#

one shot no upscale (SD35M

lavish sparrow Oct 31, 2024, 4:11 PM

#

nice coherence on the outside

dusky thistle Oct 31, 2024, 4:12 PM

#

#

#

#

#

#

#

lavish sparrow Oct 31, 2024, 4:13 PM

#

dusky thistle Oct 31, 2024, 4:14 PM

#

#

yeah SD35M has a great sense of style

#

#

#

#

#

#

#

#

lavish sparrow Oct 31, 2024, 4:17 PM

#

that's some crazy gen speed your making there @dusky thistle

cedar axle Oct 31, 2024, 4:17 PM

#

SD3.5L

#

Fiddling with realism (SD3.5L)

#

Too bad 3PO gets all the gold - this armor could have been pretty sweet (SD3.5L)

noble coyote Oct 31, 2024, 4:24 PM

#

#

Flux + Ollama

#

Ollama settings

pseudo owl Oct 31, 2024, 5:16 PM

#

Allegro 2.8b(apache2.0 open source)

turbid grotto Oct 31, 2024, 5:26 PM

#

sad that allegro and mochi do not support img2vid

remote holly Oct 31, 2024, 5:47 PM

#

What is allegro ?

flat oracle Oct 31, 2024, 5:54 PM

#

https://huggingface.co/blog/RhymesAI/allegro

Allegro: Advanced Video Generation Model

#

that was my question too

flat oracle Oct 31, 2024, 5:55 PM

#

pseudo owl Allegro 2.8b(apache2.0 open source)

hardware? looks crazy good

unique saffron Oct 31, 2024, 6:01 PM

#

real terrace Oct 31, 2024, 6:06 PM

#

I'm setting a img2img workflow for SD3.5 medium, should I change something here?

pseudo owl Oct 31, 2024, 6:11 PM

#

@flat oracle @remote holly
Allegro is a text to video model, can generate videos from text.
Their official discord bot(3-4 min), can also run locally on as little as 8gb vram but takes 30mins(it’s horribly unoptimized right now).

If you have 24gb vram, I would recommend Mochi-1 text to video(also has apache2.0 license), that is considerably better and faster even though it’s 10b. Needs at least 24gb tho, can’t fit in 8gb.

Some vids mochi generated(from genmo, their official website).

real terrace Oct 31, 2024, 6:33 PM

#

real terrace I'm setting a img2img workflow for SD3.5 medium, should I change something here?

I wonder as in Automatic with less denoise, there was less steps

signal shuttle Oct 31, 2024, 7:35 PM

#

Any thoughts on this? https://civitai.com/models/904111/sd-35-large-modern-anime?modelVersionId=1011744 its a full finetune of 3.5 Large

craggy crest Oct 31, 2024, 7:39 PM

#

signal shuttle Any thoughts on this? https://civitai.com/models/904111/sd-35-large-modern-anime...

3.5 does anime very well without needing a lora or fine tune, but the description says it's for quality, and it looks interesting

craggy crest Oct 31, 2024, 8:28 PM

#

@bitter hearth lcm+normal vrs lms+normal

cunning lintel Oct 31, 2024, 9:36 PM

#

#

Best sword ever!

real terrace Oct 31, 2024, 10:14 PM

#

#

#

#

#

#

untold valley Oct 31, 2024, 10:20 PM

#

cunning lintel Oct 31, 2024, 10:28 PM

#

The girl lying on the grass reassembled herself best as she could and went for a run 😂 😭

cunning lintel Oct 31, 2024, 10:54 PM

#

craggy crest Oct 31, 2024, 10:54 PM

#

cunning lintel Oct 31, 2024, 11:18 PM

#

craggy crest Oct 31, 2024, 11:35 PM

#

@bitter hearth @dusky thistle ipndm/linear_quadradic

untold valley Oct 31, 2024, 11:43 PM

#

sacred jewel Oct 31, 2024, 11:46 PM

#

untold valley Nov 1, 2024, 12:01 AM

#

cedar vortex Nov 1, 2024, 12:03 AM

#

untold valley Nov 1, 2024, 12:14 AM

#

distant skiff Nov 1, 2024, 3:15 AM

#

A close-up of cells and microorganisms, mostly white, with a few colorful elements, such as pink and yellow, on the background we see some small, organic, green, blue, and violet creatures. This is a macro photography image with a shallow depth of field.

turbid grotto Nov 1, 2024, 3:32 AM

#

signal shuttle Any thoughts on this? https://civitai.com/models/904111/sd-35-large-modern-anime...

had no luck with it

dusky thistle Nov 1, 2024, 3:33 AM

#

craggy crest Nov 1, 2024, 4:01 AM

#

@dusky thistle #🆕｜sd3 message

dusky thistle Nov 1, 2024, 4:04 AM

#

sd3m? sd3L?

#

just about ready to drop a new sampler 😉

#

this is what sampler dev looks like lol

#

got some new modes going thouugh

#

dpmpp_2m, dpmpp_3m, res_2m, res_3m with all noise modes and implicit sampling options

#

the right kind of noise can push res_3m toward some styles it normally doesn't want to do

#

more normal result for that one

#

got 25 samplers all using the same code, not too shabby

#

bitter hearth Nov 1, 2024, 4:28 AM

#

25 is loads yeah

dusky thistle Nov 1, 2024, 4:48 AM

#

prolly will add another half dozen or so by the weekend

#

only one major task left: get unsamlping and guide stuff working with samplerRK

#

oh, and add DEIS

#

then i can delete pretty much all of my samplers lol

#

that is going to feel greaaaatttt

#

thousands and thousands of lines of code, poof, no more

bitter hearth Nov 1, 2024, 5:04 AM

#

the guides were cool yeah

dusky thistle Nov 1, 2024, 5:10 AM

#

seemed like mode 8 was the real popular one that was essential to keep

craggy crest Nov 1, 2024, 5:17 AM

#

dusky thistle sd3m? sd3L?

Large.

craggy crest Nov 1, 2024, 5:18 AM

#

dusky thistle thousands and thousands of lines of code, poof, no more

And 10 seconds later...

#

Dont delete stuff. You never know when you might need it, or soneone else will find it usefull. Even if you are positive it can go away

#

Especially with things changing this fast

bitter hearth Nov 1, 2024, 5:21 AM

#

I don't delete I just never save in the first place lol

craggy crest Nov 1, 2024, 5:21 AM

#

craggy crest Nov 1, 2024, 5:21 AM

#

bitter hearth I don't delete I just never save in the first place lol

I save, i just dont comment

dusky thistle Nov 1, 2024, 5:22 AM

#

craggy crest Dont delete stuff. You never know when you might need it, or soneone else will f...

i always back stuff up in zip files

bitter hearth Nov 1, 2024, 5:22 AM

#

at least the way cloud is currently, storage is priced really high relative to compute for some reason

dusky thistle Nov 1, 2024, 5:22 AM

#

but i am def deleting most of this from my active version

#

it's gonna make this much easier to maintain and navigate

#

i'll have at least 4k lines of redundant code

craggy crest Nov 1, 2024, 5:27 AM

#

bitter hearth at least the way cloud is currently, storage is priced really high relative to c...

Google's not bad. I pay almost nothing for several terrabytes of drive space

bitter hearth Nov 1, 2024, 5:28 AM

#

ye at some point I should put civit models on google drive or backblaze

#

its an issue also that hugging and civit don't have global servers

#

so if your GPU is in Australia or somewhere like that the download is slow

craggy crest Nov 1, 2024, 5:33 AM

#

bitter hearth its an issue also that hugging and civit don't have global servers

Just stick them on your huggingface space. Then you csn also easily share thrm out

dusky thistle Nov 1, 2024, 5:34 AM

#

yea i use mine as a landfill

bitter hearth Nov 1, 2024, 5:35 AM

#

problem is getting them from hugging to vast

#

it seems that everyone has the same problem cos US servers go for a slight premium

craggy crest Nov 1, 2024, 5:39 AM

#

bitter hearth problem is getting them from hugging to vast

Why do you need them on vast?

bitter hearth Nov 1, 2024, 5:40 AM

#

oh its just that vast have the cheapest compute

craggy crest Nov 1, 2024, 5:45 AM

#

#

#

dusky thistle Nov 1, 2024, 6:03 AM

#

hallow lion Nov 1, 2024, 6:04 AM

#

We are not in Kansas anymore.

noble coyote Nov 1, 2024, 6:40 AM

#

Flux RF Inversion - an astronaut 'keeping house!'

#

dusky thistle Nov 1, 2024, 6:47 AM

#

noble coyote Flux RF Inversion - an astronaut 'keeping house!'

btw, saw the issue report... i've had soooo many ppl have problems installing opensimplex, and the results with it haven't been particularly interesting or anytihng... just took it out from RES4LYF

#

so let me know when you get the chance if it works now so i can close the issues ppl have opened

#

deceptively simple looking little beast

#

25 samplers, unsampling, guides, multistep modes, buffer modes, legit implicit runge kutta sampling, 17 noise types, 6 noise scaling modes, and CFG++

#

and more (of course :P)

noble coyote Nov 1, 2024, 7:00 AM

#

untold valley Nov 1, 2024, 7:04 AM

#

noble coyote Nov 1, 2024, 7:06 AM

#

#

untold valley Nov 1, 2024, 7:10 AM

#

dusky thistle 25 samplers, unsampling, guides, multistep modes, buffer modes, legit implicit r...

what is your fav sampler?

dusky thistle Nov 1, 2024, 7:10 AM

#

untold valley what is your fav sampler?

depends, but i really like the "res" ones

untold valley Nov 1, 2024, 7:13 AM

#

2m then

dusky thistle Nov 1, 2024, 7:13 AM

#

noble coyote

#

i have a 4090 so i don't feel the effects of the slower samplers as much, so tbh my fav is probably res_3s

untold valley Nov 1, 2024, 7:14 AM

#

dusky thistle i have a 4090 so i don't feel the effects of the slower samplers as much, so tbh...

you just inadvertently called me a peasant, lol im struggling with a 1080ti

dusky thistle Nov 1, 2024, 7:15 AM

#

untold valley you just inadvertently called me a peasant, lol im struggling with a 1080ti

ouch

#

yeah, res_2m is prolly a safe bet for that card

#

that runs as fast as euler

noble coyote Nov 1, 2024, 7:15 AM

#

Cannot get RES4LYF to work 😄

dusky thistle Nov 1, 2024, 7:15 AM

#

noble coyote Cannot get RES4LYF to work 😄

still??

noble coyote Nov 1, 2024, 7:16 AM

#

yes

dusky thistle Nov 1, 2024, 7:16 AM

#

are you still getting an opensimplex error or something

untold valley Nov 1, 2024, 7:16 AM

#

noble coyote Cannot get RES4LYF to work 😄

jfc what is this monstrosity? whoa

#

dusky thistle Nov 1, 2024, 7:16 AM

#

noble coyote Nov 1, 2024, 7:17 AM

#

I will get the error soon ... d/loading grounding-dino stuff

bitter hearth Nov 1, 2024, 7:18 AM

#

noble coyote I will get the error soon ... d/loading grounding-dino stuff

Hey not again

untold valley Nov 1, 2024, 7:18 AM

#

it seems you are missing nodes?

dusky thistle Nov 1, 2024, 7:18 AM

#

yea, trying to get my repo working on there

#

https://github.com/ClownsharkBatwing/RES4LYF

GitHub

GitHub - ClownsharkBatwing/RES4LYF

Contribute to ClownsharkBatwing/RES4LYF development by creating an account on GitHub.

noble coyote Nov 1, 2024, 7:19 AM

#

RES4LYF fails to load ... I need to disable clashing nodes

dusky thistle Nov 1, 2024, 7:19 AM

#

oh there's a node naming conflict? weird

#

maybe you have an old version in another folder?

noble coyote Nov 1, 2024, 7:20 AM

#

Mebbe - I shall delete that older version - or update it

dusky thistle Nov 1, 2024, 7:20 AM

#

ohhhhh yeah if you have another one, remove it from your custom_nodes folder and stash it on your desktop or something lol

#

then git clone again from scratch

#

that should help

#

100%|███████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 39/39 [00:04<00:00, 8.32it/s]
100%|███████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 40/40 [00:08<00:00, 4.50it/s]
Prompt executed in 15.90 seconds

untold valley Nov 1, 2024, 7:28 AM

#

noble coyote Nov 1, 2024, 7:33 AM

#

RES4LYF loaded at last - no problems with Open Simplex (yet ...)

dusky thistle Nov 1, 2024, 7:34 AM

#

untold valley

#

thank god lol

#

yeah i took opensimplex out, i just commented it out

#

if anyone reallllly wants to use it they can just uncomment it

bitter hearth Nov 1, 2024, 7:34 AM

#

without opensimplex how can we get Flux grid in SDXL

untold valley Nov 1, 2024, 7:35 AM

#

dusky thistle

i love this so much, made me cackle 😂 mikuwha goodjob happemad

#

dusky thistle Nov 1, 2024, 7:38 AM

#

noble coyote Nov 1, 2024, 7:38 AM

#

FIX NODE on both ClownSamplers helped!

untold valley Nov 1, 2024, 7:38 AM

#

dusky thistle

oh damn we need a skull and the with needs to have golf club then it will be perfect

#

wizard death golf club in mars

dusky thistle Nov 1, 2024, 7:39 AM

#

noble coyote FIX NODE on both ClownSamplers helped!

btw, i recommend you load these images

#

they have workflows embedded

#

i have some really sophisticated unsampling on these

#

it was inspired by the RF inversion stuff, totally redid the math and reworked the algorithm so it's kinda new-ish

bitter hearth Nov 1, 2024, 7:39 AM

#

I tried that token downsampling thing called Todo, its really good
its a 50% speed boost on any workflow that uses text encoders 🤔

dusky thistle Nov 1, 2024, 7:39 AM

#

major upgrade

bitter hearth Nov 1, 2024, 7:40 AM

#

I never got vanilla unsampling working

#

would be cool to try this new one TBH

untold valley Nov 1, 2024, 7:43 AM

#

noble coyote Nov 1, 2024, 7:49 AM

#

dusky thistle it was inspired by the RF inversion stuff, totally redid the math and reworked t...

RFI gives me quasi-controlnet/ipadapter using Flux. 8Gb VRAM for the Xflux IPAdapter does not work! 🙃

untold valley Nov 1, 2024, 7:55 AM

#

dusky thistle Nov 1, 2024, 7:55 AM

#

noble coyote RFI gives me quasi-controlnet/ipadapter using Flux. 8Gb VRAM for the Xflux IPAda...

yup

dusky thistle Nov 1, 2024, 7:56 AM

#

bitter hearth would be cool to try this new one TBH

later i'm gonna mod the sigma node for it so it detects sdxl etc and uses that method instead

hallow lion Nov 1, 2024, 8:04 AM

#

noble coyote Flux RF Inversion - an astronaut 'keeping house!'

Going to the toilet in space sucks tremednously.

noble coyote Nov 1, 2024, 8:07 AM

#

hallow lion Going to the toilet in space sucks tremednously.

"Have you tried it?!" 🥳

hallow lion Nov 1, 2024, 8:08 AM

#

noble coyote "Have you tried it?!" 🥳

Off world dreams. Nosumer life.

#

Geoffrey Hackumeriff: "I'd rather plug my butt permanently than have diarrhea in space again. Even Sumo-X couldn't save that pod from my ass."

noble coyote Nov 1, 2024, 8:12 AM

#

Ewwwwwwwwwwwwwwwwwwwwwwwwwwwwwwwwwwwwwwwwwww

#

🥳

#

dusky thistle Nov 1, 2024, 8:23 AM

#

noble coyote Nov 1, 2024, 8:28 AM

#

hallow lion Nov 1, 2024, 8:29 AM

#

noble coyote

Don't take laxatives when you fly in orbit. I speak from experience.

#

Papa Musk banned me from his shuttle.

noble coyote Nov 1, 2024, 8:30 AM

#

Norovirus in Space! You could do a very funny Graphic Novel on this!!! 🥳

#

patent acorn Nov 1, 2024, 9:11 AM

#

noble coyote

do a superman pissing

noble coyote Nov 1, 2024, 9:15 AM

#

patent acorn do a superman pissing

Nothing to stop you from doing it?!¬ 🥳

#

Flux RF Inversion Style Transfer

patent acorn Nov 1, 2024, 9:20 AM

#

noble coyote Nothing to stop you from doing it?!¬ 🥳

what are uou tryna say i said gen superman pissing

noble coyote Nov 1, 2024, 9:23 AM

#

patent acorn what are uou tryna say i said gen superman pissing

Try it yourself, why not?

patent acorn Nov 1, 2024, 9:30 AM

#

thats no fun

#

i requested you not expecting to be told to gen it myself

#

sadcat

dusky thistle Nov 1, 2024, 9:35 AM

#

noble coyote Flux RF Inversion Style Transfer

The astronaut to clown shark image I posted has a wf that should also work for this kinda stuff with flux btw

noble coyote Nov 1, 2024, 9:40 AM

#

#

#

#

RFI Style Transfer

fossil pagoda Nov 1, 2024, 9:44 AM

#

241030124715_A_mechanical_anthropomorphic_Carnage_symbiote_fused_with_SpiderMan_rendered_in_c__00031_.png

noble coyote Nov 1, 2024, 9:51 AM

#

#

#

dusky thistle Nov 1, 2024, 9:54 AM

#

#

lavish sparrow Nov 1, 2024, 9:58 AM

#

lavish sparrow Nov 1, 2024, 9:58 AM

#

fossil pagoda

sd3.5?

dusky thistle Nov 1, 2024, 9:59 AM

#

dusky thistle Nov 1, 2024, 9:59 AM

#

lavish sparrow sd3.5?

you might wanna git pull my repo btw, added a lot of stuff to samplerRK

#

res_2m and res_3m are as fast as euler

fossil pagoda Nov 1, 2024, 9:59 AM

#

lavish sparrow sd3.5?

Yep

lavish sparrow Nov 1, 2024, 9:59 AM

#

dusky thistle you might wanna git pull my repo btw, added a lot of stuff to samplerRK

oooh ^^ spicy new toys?! 😄

dusky thistle Nov 1, 2024, 10:00 AM

#

res_3m is really good at getting paint to pop

#

yep

lavish sparrow Nov 1, 2024, 10:00 AM

#

dusky thistle res_2m and res_3m are as fast as euler

wait, you managed to get res3 that fast?!

dusky thistle Nov 1, 2024, 10:00 AM

#

WF in the second image above btw

#

well, it'll be different

#

it's a different style of sampling

#

more approximate but it also has some advantages from that tradeoff

#

in that it borrows from previous steps

lavish sparrow Nov 1, 2024, 10:00 AM

#

i mean, res3 should be pretty fckn slow by all means

#

it's accurate, but just... slow

dusky thistle Nov 1, 2024, 10:01 AM

#

yep

lavish sparrow Nov 1, 2024, 10:01 AM

#

if you manage to get 50% of the accuracy and no speed penalty, that's a win

dusky thistle Nov 1, 2024, 10:01 AM

#

yeah it's still a signifiacnt boost

#

def give both the 2m and 3m a shot

#

3m is gonna be more sensitive to cfg and low step counts, which is pretty common for that type of sampling

#

lavish sparrow Nov 1, 2024, 10:09 AM

#

yeah, am gonna have to fiddle around with it a bit. not bad @dusky thistle ❤️ loving ur sampler

dusky thistle Nov 1, 2024, 10:10 AM

#

25 samplers, unsampling, latent image guides, multistep modes, buffer modes, legit implicit runge kutta sampling, 17 noise types, 6 noise scaling modes, and CFG++... all in one compact package

#

i'm sure there's some bugs to be worked out but finally got most of the features rolled in that i was looking to prioritize

lavish sparrow Nov 1, 2024, 10:13 AM

#

you might want to fiddle around with this one https://github.com/Jonseed/ComfyUI-Detail-Daemon

GitHub

GitHub - Jonseed/ComfyUI-Detail-Daemon: A port of muerrilla's sd-we...

A port of muerrilla's sd-webui-Detail-Daemon as a node for ComfyUI, to adjust sigmas that control detail. - Jonseed/ComfyUI-Detail-Daemon

#

your sampler works great with this

dusky thistle Nov 1, 2024, 10:14 AM

#

lavish sparrow Nov 1, 2024, 10:15 AM

#

you need to adjust your eta tho

#

if you use the detailer

dusky thistle Nov 1, 2024, 10:15 AM

#

ahh yea i've got a way to do that by manipulating some of the params with noise

#

i was thinking about adding other stuff to just directly increase the denoise out of sync

lavish sparrow Nov 1, 2024, 10:15 AM

#

but the combination of the detailer daemon and eta noise manipulation makes for some very crispy images

dusky thistle Nov 1, 2024, 10:15 AM

#

best way to see it demoed is to set eta to... something.. and then lower the s_noise value to something like 0.95 or even 0.9

lavish sparrow Nov 1, 2024, 10:16 AM

#

so much experimentation to be done by just adjusting values xD

#

... and each model has it's own preferences...

dusky thistle Nov 1, 2024, 10:16 AM

#

yea for real

#

you dont even want to know how much code i've written that hasn't made it out of the lab so to speak

lavish sparrow Nov 1, 2024, 10:17 AM

#

i can imagine

dusky thistle Nov 1, 2024, 10:17 AM

#

i've tried damn near everything imaginable with the math with RF

#

which is why i've got six noise scaling modes now lol

lavish sparrow Nov 1, 2024, 10:17 AM

#

3m fries with detailer, it seems

dusky thistle Nov 1, 2024, 10:17 AM

#

and probably at least a thousand of images i burned to fucking death lol

lavish sparrow Nov 1, 2024, 10:17 AM

#

trying 3s now

#

oh. 3s is slow xD

dusky thistle Nov 1, 2024, 10:19 AM

#

try res_2m

#

then try res_3m and use a painting prompt as a comparison... something with impasto

fossil pagoda Nov 1, 2024, 10:20 AM

#

dusky thistle then try res_3m and use a painting prompt as a comparison... something with impa...

Do you plan to bring your nodes to comfyUI manager btw?

dusky thistle Nov 1, 2024, 10:20 AM

#

fossil pagoda Do you plan to bring your nodes to comfyUI manager btw?

yeah, def

#

been wanting to get everything cleaned up first so i can actually document it

#

it got completely unmaintainable, just way too much to keep up with, which is what drove me to come up with this universal sampler mech

#

now i only have to deal with code in one place, not 18 different places over 5k+ lines of mess

lavish sparrow Nov 1, 2024, 10:22 AM

#

res2m (110s) vs res3s (290s)

#

lets see if i can get res2m cleaned up

dusky thistle Nov 1, 2024, 10:23 AM

#

are you using eta? you'll usually see more of a diff when you're doing SDE sampling

lavish sparrow Nov 1, 2024, 10:23 AM

#

0.25 only

dusky thistle Nov 1, 2024, 10:23 AM

#

here's the fun thing about faster sampling btw

lavish sparrow Nov 1, 2024, 10:23 AM

#

can't have it too high because of detailer daemon

dusky thistle Nov 1, 2024, 10:23 AM

#

you could always just increase the step count to eliminate your time savings 😄

#

try turning off the daemon and lowering or increasing s_noise a bit, might do what you're looking for (or maybe not)

lavish sparrow Nov 1, 2024, 10:24 AM

#

dusky thistle you could always just increase the step count to eliminate your time savings 😄

xD

#

yeah, just gonna experiment some further 😄

noble coyote Nov 1, 2024, 10:24 AM

#

lavish sparrow Nov 1, 2024, 10:24 AM

#

but res2m seems to be a very good sampler

dusky thistle Nov 1, 2024, 10:26 AM

#

lavish sparrow Nov 1, 2024, 10:28 AM

#

@dusky thistle -> if for some reason you're an idiot like me and you put ETA to 1.0 -> the image output becomes black. however. changing it back makes something bugged, and need to restart comfy to get the sampler working again

#

just so u know ❤️

dusky thistle Nov 1, 2024, 10:29 AM

#

oh weird

#

always good to hear bug reports

#

yeah 1.0 with hard is by definition the breaking point for the math

#

anything less than 1.0 is theoretically doable even if it's horrendous

#

0.999999999999999999999 works

#

1.0 doesn't

zealous latch Nov 1, 2024, 10:30 AM

#

A video editor focused intently at a cluttered desk, surrounded by multiple screens displaying a complex video editing timeline filled with clips. The room is dimly lit, emphasizing the glow of the monitors. Coffee cups and scattered papers create an atmosphere of urgency and creativity, capturing the essence of the editing process.

lavish sparrow Nov 1, 2024, 10:30 AM

#

dusky thistle 1.0 doesn't

set a limiter so it won't break it? ^^

dusky thistle Nov 1, 2024, 10:30 AM

#

zealous latch A video editor focused intently at a cluttered desk, surrounded by multiple scre...

Here is the image you requested.

lavish sparrow Nov 1, 2024, 10:31 AM

#

might also be the combo with the detailer tho -> very often it's patcher nodes that fck things over

zealous latch Nov 1, 2024, 10:37 AM

#

A video editor editing on his 3 monitor setup

untold valley Nov 1, 2024, 10:43 AM

#

zealous latch A video editor editing on his 3 monitor setup

#artisan-faq and #artisan-1

lavish sparrow Nov 1, 2024, 11:00 AM

#

noble coyote Nov 1, 2024, 11:31 AM

#

dusky thistle Here is the image you requested.

Wow! "Almost 'surgical' prompt adherence!!!" 🙃

radiant ledge Nov 1, 2024, 2:06 PM

#

noble coyote Wow! "Almost 'surgical' prompt adherence!!!" 🙃

Here is the image you requested.

signal shuttle Nov 1, 2024, 2:08 PM

#

radiant ledge Here is the image you requested.

'Alnost'

radiant ledge Nov 1, 2024, 2:08 PM

#

yeah, it's alnost surgical

radiant ledge Nov 1, 2024, 2:48 PM

#

flux is funnier

#

clearly about to surgically remove that third leg

lavish sparrow Nov 1, 2024, 4:12 PM

#

feeding random gibberish to 3.5

#

rapid pivot Nov 1, 2024, 4:15 PM

#

Eface

#

waow

cunning lintel Nov 1, 2024, 4:32 PM

#

Gibberish is great; Accidental gibberish to the clip encoders.

lavish sparrow Nov 1, 2024, 4:40 PM

#

lavish sparrow Nov 1, 2024, 4:40 PM

#

rapid pivot Eface

oh. i'm here to report that my path of exile campaign has had yet more absurd things happening...

#

how about dropping a reflecting mist + (the interrogation -> vaal orb -> corrupting blood) in a single map?

#

like those things aren't supposed to happen

craggy crest Nov 1, 2024, 5:00 PM

#

dusky thistle which is why i've got six noise scaling modes now lol

part of me wishes that this was actualy physical somewhere and you could take photos of it

unkempt compass Nov 1, 2024, 5:02 PM

#

I'm pretty sad. There is no that much new Checkpoints on Civitai/HuggingFace; especially Turbo/GGUF ones :/

dusky thistle Nov 1, 2024, 5:03 PM

#

craggy crest part of me wishes that this was actualy physical somewhere and you could take ph...

it's like playing with a VST when you want some hardware with physical knobs to turn 😄

craggy crest Nov 1, 2024, 5:05 PM

#

dusky thistle it's like playing with a VST when you want some hardware with physical knobs to ...

i've just got this mental visualization of a large pyramid of playing cards mixed with an intricate pattering of standing domnios that range all over your entire house

dusky thistle Nov 1, 2024, 5:06 PM

#

craggy crest Nov 1, 2024, 5:06 PM

#

dusky thistle

and yet, the watch/clock is not melting like dali's do

dusky thistle Nov 1, 2024, 5:07 PM

#

hah

turbid grotto Nov 1, 2024, 5:09 PM

#

I am training lora for sd3.5m already guys gonnabegood

#

it takes only 7.6/12gb vram at 1024px with batch size 2

craggy crest Nov 1, 2024, 5:11 PM

#

yeah - one of the main focus's of 3.5 was to make sure it was very easily trainable

#

that it worked, worked right, created very good images even though it's a base model, that it shines, is flexible and fast - and most of all, very easy to train. no battling with it

turbid grotto Nov 1, 2024, 5:17 PM

#

yea, and it is just beginning

#

I think sdxl took more vram, I might be able to bump quality up, but idk how yet

#

OneTrainer btw, very easy

bitter hearth Nov 1, 2024, 5:19 PM

#

I can't quite get it working as well on the flow models that don't have PAG but
for the diffusion ones like SD 1.5, SDXL etc, you can use low PAG amounts with no CFG to browse the unconditional distribution, in a form where you can see what the images are actually like
and you can use this to see how overtrained a model is (does it show a range of images or just anime 1girl etc)
if you run this test with Flux then you see clear images, often with that Flux chin
but if you run it with SD 3.5L you see a range of images as you should

untold valley Nov 1, 2024, 5:20 PM

#

have you guys felt sd3.5m responds better to word spaghetti or with actual sentences?

craggy crest Nov 1, 2024, 5:20 PM

#

untold valley have you guys felt sd3.5m responds better to word spaghetti or with actual sente...

depends on if you're talking to the t5xxl encoder or you take it out of action and just talk to clip_l and clip_g

untold valley Nov 1, 2024, 5:21 PM

#

thanks

bitter hearth Nov 1, 2024, 5:23 PM

#

cutting out T5 might be good for low VRAM people tbh

gusty trail Nov 1, 2024, 5:24 PM

#

It is what SD3 designed for

turbid grotto Nov 1, 2024, 5:25 PM

#

bitter hearth cutting out T5 might be good for low VRAM people tbh

I noticed sd3.5m degrades at resolutions above 1024px if you don't have T5 encoder but fine at 1024

bitter hearth Nov 1, 2024, 5:25 PM

#

yea there's probably all sorts of degradations

#

some people don't seem to want to use big cloud GPU so it is what it is

#

stuff like NF4 isn't costless either sadly

turbid grotto Nov 1, 2024, 5:26 PM

#

but it can be fine in ram

bitter hearth Nov 1, 2024, 5:27 PM

#

yeah putting encoders into ram after use is better in my opinion

turbid grotto Nov 1, 2024, 5:27 PM

#

i am with rtx3060 and 32gb ram can run sd3.5l and T5 both at fp16 with the same speed as quantized to hell

bitter hearth Nov 1, 2024, 5:27 PM

#

I think an even better option would be encoding embeds in advance

#

but that would probably not be popular

turbid grotto Nov 1, 2024, 5:28 PM

#

bitter hearth I think an even better option would be encoding embeds in advance

is this possible in comfy? I would like this to make grid comparisons

bitter hearth Nov 1, 2024, 5:28 PM

#

not sure but it wouldn't be a very difficult node to make if its not existing already

turbid grotto Nov 1, 2024, 5:33 PM

#

will check node manager later

craggy crest Nov 1, 2024, 5:42 PM

#

bitter hearth I think an even better option would be encoding embeds in advance

ipndm_v+linear_quadratic

craggy crest Nov 1, 2024, 5:43 PM

#

bitter hearth I think an even better option would be encoding embeds in advance

bite your tongue

bitter hearth Nov 1, 2024, 5:45 PM

#

lol

#

another option would be using a cloud embedding service

#

Diffusers is actually set up with the embedding library entirely seperate

#

what you could do is cache embeddings within one session

load text encoders 2. encode like 200 prompts 3. unload text encoders

#

instead people often load and unload everything for each new image

noble coyote Nov 1, 2024, 5:57 PM

#

Flux RF Inversion

bitter hearth Nov 1, 2024, 6:01 PM

#

looks good, can't see any borders or seams

noble coyote Nov 1, 2024, 6:04 PM

#

craggy crest Nov 1, 2024, 6:05 PM

#

bitter hearth another option would be using a cloud embedding service

that sounds like a business idea no one's had yet - you should run with this

craggy crest Nov 1, 2024, 6:06 PM

#

noble coyote

don't look now, but she's growing an owl

noble coyote Nov 1, 2024, 6:09 PM

#

#

Flux RF Inversion

#

2 stages of 28 iterations each

bitter hearth Nov 1, 2024, 6:13 PM

#

sadly I found flux needed a weirdly high number of steps to finalise image
sometimes 60 and sometimes even 100
its a very expensive model

noble coyote Nov 1, 2024, 6:16 PM

#

#

mortal mesa Nov 1, 2024, 6:17 PM

#

i like to use dpm_adaptive sometimes and ya steps can be pretty broad, i recall from 33-66, SD3 seems similar

sacred jewel Nov 1, 2024, 6:18 PM

#

BL4C LoRA

bitter hearth Nov 1, 2024, 6:21 PM

#

dpm_adaptive is really awesome yeah

noble coyote Nov 1, 2024, 6:21 PM

#

bitter hearth Nov 1, 2024, 6:22 PM

#

this year I used TCD Sampler for like 95% of my images

#

there is something slightly better than TCD out, but only in Diffusers

noble coyote Nov 1, 2024, 6:23 PM

#

bitter hearth this year I used TCD Sampler for like 95% of my images

Where do we see your images? You're 'very coy' on here!!! 🥳

bitter hearth Nov 1, 2024, 6:24 PM

#

I mostly don't save them lol
I've posted a few on here though, every now and then

#

sacred jewel Nov 1, 2024, 6:42 PM

#

Kubrick LoRA

bitter hearth Nov 1, 2024, 6:44 PM

#

Kubrick is awesome

signal shuttle Nov 1, 2024, 6:50 PM

#

cagliostrolab (Creators of Animagine XL) plan on developing a new big anime model on SD3.5 (either large or medium no one knows yet). SD3.5 has a great future ahead of it https://cagliostrolab.net/posts/dev-notes-001-future-plans-and-beyond

Dev Notes #001: Future Plans and Beyond - CagliostroLab

Cagliostro Research Lab

bitter hearth Nov 1, 2024, 6:52 PM

#

I think I remember seeing Animagine XL on civit

signal shuttle Nov 1, 2024, 6:53 PM

#

bitter hearth I think I remember seeing Animagine XL on civit

Animagine XL was one of the biggest anime models for SDXL when it first came out

bitter hearth Nov 1, 2024, 6:53 PM

#

ah okay nice

#

I read that SD 1.5 is better for anime

#

but not sure if that has changed

signal shuttle Nov 1, 2024, 6:54 PM

#

bitter hearth I read that SD 1.5 is better for anime

Illustrious XL is pretty good at anime in my opinion better then any 1.5 model

bitter hearth Nov 1, 2024, 6:55 PM

#

okay I see

turbid grotto Nov 1, 2024, 6:57 PM

#

curious if Illustrious team will jump on sd3.5, they could actually take the lead from pony, however, Astralite has very sophisticated system and probably already in training

bitter hearth Nov 1, 2024, 6:57 PM

#

auraflow has the best prompt adherence by a fairly long way

turbid grotto Nov 1, 2024, 6:58 PM

#

turbid grotto curious if Illustrious team will jump on sd3.5, they could actually take the lea...

this is so cool, so much smart people doing smart things
all I can do is be exited

turbid grotto Nov 1, 2024, 6:59 PM

#

bitter hearth auraflow has the best prompt adherence by a fairly long way

yeaaa agree, that is why I am exited about it too

cunning lintel Nov 1, 2024, 6:59 PM

#

bitter hearth auraflow has the best prompt adherence by a fairly long way

Does it though? I'd not be surprised if up to .2 it was simply trained on much better captured images (read ideogram outputs) than other models

noble coyote Nov 1, 2024, 6:59 PM

#

Can anyone find me an Auraflow TensorRT at all? 😄

signal shuttle Nov 1, 2024, 7:00 PM

#

turbid grotto curious if Illustrious team will jump on sd3.5, they could actually take the lea...

The more competition the better, the more people creating finetunes on newer base models will result in better models for the user

turbid grotto Nov 1, 2024, 7:00 PM

#

turbid grotto yeaaa agree, that is why I am exited about it too

and if I understand correctly - he will have more sfw and realistic data, so this model could actually be used for general stuff (possibly)

craggy crest Nov 1, 2024, 7:00 PM

#

bitter hearth I read that SD 1.5 is better for anime

it's not any more

#

everyone and their dog does good anime now

noble coyote Nov 1, 2024, 7:00 PM

#

bitter hearth Kubrick is awesome

I know a guy who won an Oscar working on a Kubrick movie (the movie was Barry Lyndon)

turbid grotto Nov 1, 2024, 7:01 PM

#

signal shuttle The more competition the better, the more people creating finetunes on newer bas...

yes yes yes gonnabegood

craggy crest Nov 1, 2024, 7:01 PM

#

noble coyote Can anyone find me an Auraflow TensorRT at all? 😄

did you ask google?

bitter hearth Nov 1, 2024, 7:02 PM

#

cunning lintel Does it though? I'd not be surprised if up to .2 it was simply trained on much b...

yeah I think Auraflow v2 has best prompt adherence of anything at the moment

#

if there was an exception, that exception would be Ideogram V2 or the upcoming Playground V3 possibly

noble coyote Nov 1, 2024, 7:02 PM

#

Yes, suggests I ask FAL

#

... gone to FAL ...

bitter hearth Nov 1, 2024, 7:03 PM

#

Torcello got sucked into a black hole

#

my favourite model by far is still midjourney, I don't use it though

craggy crest Nov 1, 2024, 7:05 PM

#

bitter hearth Torcello got sucked into a black hole

not as deep a black hole as if he'd landed on https://glif.app/glifs

glif

glif - all prompts, no code AI sandbox • build AI workflows, apps, ...

all prompts, no code AI sandbox • build AI workflows, apps, chatbots & more

noble coyote Nov 1, 2024, 7:05 PM

#

I am one of the few who truly appreciate the underused Auraflow

noble coyote Nov 1, 2024, 7:06 PM

#

craggy crest not as deep a black hole as if he'd landed on https://glif.app/glifs

This'll let me make an Auraflow TRT?

craggy crest Nov 1, 2024, 7:06 PM

#

noble coyote This'll let me make an Auraflow TRT?

no, but it'll let you make AI memes - and other things

#

it's way too much fun

noble coyote Nov 1, 2024, 7:07 PM

#

Like Steamed Jam Roly Poly?! 🥳

bitter hearth Nov 1, 2024, 7:07 PM

#

their project to train an LLM to make comfy workflows was cool

signal shuttle Nov 1, 2024, 7:07 PM

#

craggy crest it's way too much fun

Can confirm Glif is indeed fun

craggy crest Nov 1, 2024, 7:07 PM

#

and you can remix other peopel's glifs to make your own version

cunning lintel Nov 1, 2024, 7:07 PM

#

Auraflow .3 was such a let down, before that it was magic, sadly it seems abandoned.

bitter hearth Nov 1, 2024, 7:08 PM

#

do you know about Aurum

#

blend of .2 and .3

#

I totally agree though

#

.3 lost the prompt adherence magic

cunning lintel Nov 1, 2024, 7:08 PM

#

read about it 2 weeks ago, should try it 🙂

#

so much to try and so little time and compute 🤣

craggy crest Nov 1, 2024, 7:09 PM

#

bitter hearth .3 lost the prompt adherence magic

when you live on the cutting edge, sometimes you get sliced

bitter hearth Nov 1, 2024, 7:11 PM

#

lol

#

the SimpleTuner dev was saying on reddit that Auraflow doesn't train well

#

not sure

#

my viewpoint is autoregressive models will quickly overtake diffusion models in prompt adherence anyway

#

(but be much slower, more expensive and lower image quality)

cunning lintel Nov 1, 2024, 7:17 PM

#

There's really a whole lot to win on prompt adherence

#

I had this bright idea to try sd3.5 by having gemini read an entire book and create 25 scenes from it

#

it was all too complicated for poor sd35 (and flux too), turns out i've gotten really good at writing prompts current image ai's somewhat kind of can work with 🤡

bitter hearth Nov 1, 2024, 7:20 PM

#

I never really learnt prompt engineering but yeah there is a lot of skill to it

craggy crest Nov 1, 2024, 7:23 PM

#

cunning lintel it was all too complicated for poor sd35 (and flux too), turns out i've gotten r...

it's not at all too complicated for sd3.5 - however you need to talk to the computer in a way it understands. and rambling on with a lot of text that is meaningless for anything but noise is not the way to get much of anything.

#

and using gemini just makes it worse. go to meta.ai - tell it about your scenes and then ask it specificly to craft prompts for stable diffusion 3

craggy crest Nov 1, 2024, 7:24 PM

#

bitter hearth I never really learnt prompt engineering but yeah there is a lot of skill to it

wanna learn?

untold valley Nov 1, 2024, 7:26 PM

#

#

prompt "engineering" is just tossing word spaghetti at it until you figure out what the model likes and dislikes. then using that to submit it to your will.

noble coyote Nov 1, 2024, 7:28 PM

#

"I am a prompt wrangler!!!"

mortal mesa Nov 1, 2024, 7:28 PM

#

ide think the possible knowledge of how it could respond is the engineering part

#

surprised we don't have small LLM and Image model pairs as a normal thing yet.

untold valley Nov 1, 2024, 7:30 PM

#

noble coyote "I am a prompt wrangler!!!"

sponging

bitter hearth Nov 1, 2024, 7:31 PM

#

mortal mesa surprised we don't have small LLM and Image model pairs as a normal thing yet.

yeah its weird
this is something open ai really got right

cunning lintel Nov 1, 2024, 7:31 PM

#

Problem was more the kind of scenes, often multiple people (something simple like a man at the counter of a bank, while a woman sits in the waiting room watching around) or even seemingly normal scenes (two people in a car, understanding the inside of a car turned out hard and the road seen from the windows, placing the steeringwheel). there's so many edge cases for seemingly mundane scenes ai's still struggle with. Often I just prompt for 1 subject, interaction i try rarely as i know it's hard, but when you try to create "real life" scenes, by just prompting, it's not that easy yet.

craggy crest Nov 1, 2024, 7:33 PM

#

bitter hearth yeah its weird this is something open ai really got right

meta does it with their AI - sometimes works, sometimes faily spectacularly

craggy crest Nov 1, 2024, 7:35 PM

#

cunning lintel Problem was more the kind of scenes, often multiple people (something simple lik...

not that hard if 1. you use the right model and 2. you construct the prompt correctly. and the prompt you give the AI frequentlly doesn't look at all like something you'd write in a book or short story, or say to a human

#

this "a woman sits in the waiting room watching around" wouldn't even tell most humans what she's actually doing. what does 'watching around' mean?

noble coyote Nov 1, 2024, 7:36 PM

#

Making a Dynamic TensorRT for Auraflow3

noble coyote Nov 1, 2024, 7:37 PM

#

craggy crest this "a woman sits in the waiting room watching around" wouldn't even tell most ...

She's not watching a square?! 🥳

craggy crest Nov 1, 2024, 7:37 PM

#

noble coyote She's not watching a square?! 🥳

or a pickle

noble coyote Nov 1, 2024, 7:38 PM

#

🙃

craggy crest Nov 1, 2024, 7:38 PM

#

that's the biggest issue most people run into - you have to talk to the AI in clear, concise terms - but you are talking to a computer. you must think like it does and talk to it like it thinks. NOT like you think or you talk to a human

bitter hearth Nov 1, 2024, 7:41 PM

#

it gets rough cos I mostly use highly distilled models for only 2-4 steps with CFG 1
you only get like 9 tokens (less than 9 words) that the model will attend to

untold valley Nov 1, 2024, 7:42 PM

#

noble coyote Nov 1, 2024, 7:43 PM

#

Oops Auraflow TensorRT operation c r a s h e d - compilation error in backend

untold valley Nov 1, 2024, 7:45 PM

#

ok is it me or does 3.5m reallly really loves like 3/4 shots

noble coyote Nov 1, 2024, 7:47 PM

#

untold valley ok is it me or does 3.5m reallly really loves like 3/4 shots

Of Jaegermeister?

untold valley Nov 1, 2024, 7:48 PM

#

walked into that one

noble coyote Nov 1, 2024, 7:48 PM

#

🥳

untold valley Nov 1, 2024, 7:55 PM

#

ive just realized that cfg on 3.5m has a crazy amount of control of the gens.

bitter hearth Nov 1, 2024, 7:56 PM

#

I used CFG on flux since day 1 TBH

#

never actually did the no CFG route

craggy crest Nov 1, 2024, 7:58 PM

#

untold valley ive just realized that cfg on 3.5m has a crazy amount of control of the gens.

how about that?

untold valley Nov 1, 2024, 7:58 PM

#

4,7,13 13 totally cooks it but crazy the difference between 4 and 7 from real to a more "anime" style

craggy crest Nov 1, 2024, 7:58 PM

#

bitter hearth I used CFG on flux since day 1 TBH

you broke it then. turn CFG off and then you'll see what flux really is

bitter hearth Nov 1, 2024, 7:58 PM

#

LOL

craggy crest Nov 1, 2024, 7:59 PM

#

bitter hearth LOL

seriously. there's a reason it doesn't use CFG. set CFG to 0 and then prompt it

untold valley Nov 1, 2024, 7:59 PM

#

so flux is loosy goosy ur saying?

mortal mesa Nov 1, 2024, 7:59 PM

#

if everyone followed "the rules" we would never have anything new

craggy crest Nov 1, 2024, 8:01 PM

#

untold valley so flux is loosy goosy ur saying?

nope. but if you turn on cfg, you're not going to get out of the model what it's designed to do

#

people turn it on because they HAVE to HAVE their negative prompt fix.

#

but flux isn't designed to use CFG OR use negative prompts

mortal mesa Nov 1, 2024, 8:02 PM

#

you can possibly get MORE than what its designed to do, crazy

craggy crest Nov 1, 2024, 8:02 PM

#

mortal mesa if everyone followed "the rules" we would never have anything new

if everyone let the air out of their tires and drove on flats, they'd have very interesting journies

craggy crest Nov 1, 2024, 8:02 PM

#

mortal mesa you can possibly get MORE than what its designed to do, crazy

nope. you just break it

mortal mesa Nov 1, 2024, 8:03 PM

#

stay in the cave my friend

craggy crest Nov 1, 2024, 8:03 PM

#

mortal mesa stay in the cave my friend

i program this stuff, friend. do you?

icy drift Nov 1, 2024, 8:03 PM

#

Still no OmniGen in Comfy... Hopefully this weekend...

mortal mesa Nov 1, 2024, 8:05 PM

#

craggy crest i program this stuff, friend. do you?

show me

craggy crest Nov 1, 2024, 8:07 PM

#

mortal mesa show me

pats you on the head nope. not falling for that trap

mortal mesa Nov 1, 2024, 8:09 PM

#

yup cant expose the lies, ya sure you learn stuff, that's clear, but when you don't know you BS for some like internet points in a passive/aggressive way, its prety clear

craggy crest Nov 1, 2024, 8:11 PM

#

mortal mesa yup cant expose the lies, ya sure you learn stuff, that's clear, but when you do...

you try this all the time, you realize that? you get a 'no' answer and then you try the manipulation tactics. not going to work.

mortal mesa Nov 1, 2024, 8:12 PM

#

no, more lies, do what you need for yourself

#

you dont get to be the most blocked user out of nowhere

craggy crest Nov 1, 2024, 8:13 PM

#

mortal mesa no, more lies, do what you need for yourself

you do realize i do not care what names you call me? or what other negative manipulation, bullying, tactics you want to try. maybe that works on your friends and family, but here it only makes you look like a fool

signal shuttle Nov 1, 2024, 8:18 PM

#

https://tenor.com/view/popcorn-eating-gif-26433511

Tenor

mortal mesa Nov 1, 2024, 8:21 PM

#

i certainly didnt ask if you cared and none of that happened regardless

lunar canopy Nov 1, 2024, 8:29 PM

#

mooooooooving on

#

there is far too much halloween candy to eat

untold valley Nov 1, 2024, 8:30 PM

#

lunar canopy there is far too much halloween candy to eat

call dibs on the recsess penut butter

lunar canopy Nov 1, 2024, 8:31 PM

#

untold valley call dibs on the recsess penut butter

no way how did you know

untold valley Nov 1, 2024, 8:31 PM

#

thomas

craggy crest Nov 1, 2024, 8:33 PM

#

lunar canopy there is far too much halloween candy to eat

how about an image contest on this theme?

untold valley Nov 1, 2024, 8:36 PM

#

I was wondering why 3.5m seems like a really great base model, and been having fun with it. Decided to test its word that shall not be said but starts with N and ends in W, capabilities and gosh darn it can you push it far. no wonder, it all makes sense that when you don't purposely handicap, sandbag, and sensor something it starts working correctly. thanks SAI. ❣️ goodjob mikuwha

lavish sparrow Nov 1, 2024, 9:35 PM

#

this one looks pretty decent, except for that finger...

#

lavish sparrow Nov 1, 2024, 10:03 PM

#

bitter hearth Nov 1, 2024, 10:04 PM

#

icy drift Still no OmniGen in Comfy... Hopefully this weekend...

you could copy paste the inference script into a node template and use it now in comfy if you want

#

its uses hugging transformers and diffusers libs

cunning lintel Nov 1, 2024, 10:05 PM

#

#

same prompt also gave this, such an interesting style for sd3.5l to do out of the box

craggy crest Nov 1, 2024, 10:22 PM

#

cunning lintel same prompt also gave this, such an interesting style for sd3.5l to do out of th...

try this prompt: melting hyperdetailed digital art, dripping stunning cosmic belle; drips a vision of heavenly beauty

lavish sparrow Nov 1, 2024, 10:40 PM

#

lavish sparrow Nov 1, 2024, 10:41 PM

#

craggy crest try this prompt: melting hyperdetailed digital art, dripping stunning cosmic bel...

imma feed that to my LLM too tho

errant dust Nov 1, 2024, 10:42 PM

#

https://petapixel.com/2024/10/31/mysterious-ai-image-generator-more-powerful-than-midjourney-breaks-cover/

PetaPixel

Mysterious AI Image Generator More Powerful Than Midjourney Breaks ...

The rumors are true.

lavish sparrow Nov 1, 2024, 10:43 PM

#

craggy crest try this prompt: melting hyperdetailed digital art, dripping stunning cosmic bel...

as interpreted by my LLM setup

craggy crest Nov 1, 2024, 10:44 PM

#

lavish sparrow imma feed that to my LLM too tho

the llm is going to be very confused ;)

craggy crest Nov 1, 2024, 10:44 PM

#

lavish sparrow as interpreted by my LLM setup

very nice :)

lavish sparrow Nov 1, 2024, 10:44 PM

#

craggy crest the llm is going to be very confused ;)

The digital masterpiece showcases a cosmic beauty, a stunning vision of ethereal allure. Her skin, a canvas of iridescent hues, melts and drips like celestial paint, revealing a complex network of shimmering galaxies and nebulas. Long, flowing hair, composed of cascading stars, frames her serene face, where eyes, like twin portals, reflect the vast expanse. The figure gracefully poses, allowing the cosmic substance to drip from her form, creating a captivating contrast between heavenly beauty and the raw, visceral nature of the melting effect. The image exudes a sense of otherworldly tranquility, inviting viewers to immerse themselves in this captivating, hyper-detailed digital creation.

craggy crest Nov 1, 2024, 10:44 PM

#

errant dust https://petapixel.com/2024/10/31/mysterious-ai-image-generator-more-powerful-tha...

yeah, no.

#

red panda is from recraft. go to their site and use it, see what you think

bitter hearth Nov 1, 2024, 10:45 PM

#

I voted against red panda every time

#

in that blind trial

errant dust Nov 1, 2024, 10:45 PM

#

I'm not sure what you mean by no since that's what the article says.

bitter hearth Nov 1, 2024, 10:45 PM

#

on artificialanalysis.com leaderboard

#

I liked midjourney, ideogram and flux pro

craggy crest Nov 1, 2024, 10:46 PM

#

errant dust I'm not sure what you mean by no since that's what the article says.

i know that's what it says. it's wrong. go to the recraft website and use the actual red panda generator. it's not that good

errant dust Nov 1, 2024, 10:46 PM

#

And I was just sharing but I actually have no opinion about it since I haven't used it. I plan to because of course I'm curious. But I'll be honest I'm very happy with both flux and stable diffusion 3.5 L

craggy crest Nov 1, 2024, 10:46 PM

#

they heavily cherry picked the images that got voted on

#

https://x.com/recraftai/status/1851706399631224939

Recraft (@recraftai) on X

Nice to meet you! We are red_panda, but for friends just Recraft.

#RecraftAI #red_panda

bitter hearth Nov 1, 2024, 10:47 PM

#

craggy crest they heavily cherry picked the images that got voted on

it got the highest ranking in the elo leaderboard though
and that's a blind test

craggy crest Nov 1, 2024, 10:48 PM

#

the link to their site is in that post on twitter. go play with it and see what you think

bitter hearth Nov 1, 2024, 10:53 PM

#

its on Fal apparently

cunning lintel Nov 1, 2024, 10:55 PM

#

craggy crest try this prompt: melting hyperdetailed digital art, dripping stunning cosmic bel...

nice drips

lavish sparrow Nov 1, 2024, 10:58 PM

#

cunning lintel Nov 1, 2024, 11:02 PM

#

errant dust And I was just sharing but I actually have no opinion about it since I haven't u...

it's not bad it seems more like a bunch of finetunes, and nice gimmicks svg (the flat outputs really are very good) and color palettes. The sad part is SAI gets anarticle "SD3.5 can do woman lying in grass" with only a picture of the horror woman, this model gets an article with nice images... SAI really needs to send better presskits out

craggy crest Nov 1, 2024, 11:03 PM

#

red panda, to me, feels like they tried to make a couple flux loras, didn't do them well, and are trying to carve out some of the pie for themselves - get users to use their website to gen with.

craggy crest Nov 1, 2024, 11:04 PM

#

cunning lintel it's not bad it seems more like a bunch of finetunes, and nice gimmicks svg (the...

press kits would have been ignored. the media is fickle - and they either go for the senssational "look, mysterious company!" or what the reader wants

bitter hearth Nov 1, 2024, 11:08 PM

#

20B is a lot, Flux is 12B for comparison, so panda is a very chonky transformer

#

it does hands and text very well

#

and has strong blur effect abilities like Flux Pro

#

the aesthetic fine tune seems slightly off to me

icy drift Nov 1, 2024, 11:09 PM

#

bitter hearth you could copy paste the inference script into a node template and use it now in...

I tried using a custom node https://github.com/AIFSH/OmniGen-ComfyUI that uses the diffusers, and there's some library version compatibility error. Definitely not that simple.

GitHub

GitHub - AIFSH/OmniGen-ComfyUI

Contribute to AIFSH/OmniGen-ComfyUI development by creating an account on GitHub.

bitter hearth Nov 1, 2024, 11:09 PM

#

ah okay thanks for trying

#

yeah its tricky adding things to comfy

craggy crest Nov 1, 2024, 11:10 PM

#

icy drift I tried using a custom node https://github.com/AIFSH/OmniGen-ComfyUI that uses t...

@dusky thistle has a new project ;)

bitter hearth Nov 1, 2024, 11:11 PM

#

there is a Leonardo model that is strong also, apparently

#

although that was before this summer so maybe it hasn't kept up

#

companies can't just launch a 2B Unet any more any compete

craggy crest Nov 1, 2024, 11:13 PM

#

bitter hearth although that was before this summer so maybe it hasn't kept up

they haven't published any updates for a couple months, but their last model release is very good

lavish sparrow Nov 1, 2024, 11:17 PM

#

errant dust Nov 1, 2024, 11:18 PM

#

Well petapixel isn't exactly AI friendly. They aren't anti AI per se but I would hardly call them favorable. And the same goes for the majority of the readers if you look at the feedback their articles get.

#

As a photographer it's a very good site but they have their biases in some things

bitter hearth Nov 1, 2024, 11:20 PM

#

if we wait a few months there will be papers that benchmark it
there's finally papers that talk about flux and ideogram

errant dust Nov 1, 2024, 11:20 PM

#

Ideogram 2.0?

bitter hearth Nov 1, 2024, 11:20 PM

#

ye I used to do photography and read petapixel

lavish sparrow Nov 1, 2024, 11:20 PM

#

bitter hearth Nov 1, 2024, 11:21 PM

#

yeah I think I saw Ideogram 2.0 in a paper

#

ah yeah I found it

#

the playground V3 paper has Ideogram 2.0 in the comparisons

#

https://arxiv.org/abs/2409.10695

#

lavish sparrow Nov 1, 2024, 11:23 PM

#

bitter hearth Nov 1, 2024, 11:24 PM

#

got flux in the paper too

errant dust Nov 1, 2024, 11:24 PM

#

Never heard of playground

bitter hearth Nov 1, 2024, 11:25 PM

#

they did a model that was not great called Playground v2.5
it was really overfit, came out around SDXL time

#

but their new one looks competitive

#

Flux hasn't been benchmarking that well, I think it might be the slightly overfit aesthetic that is harming it in benchmarks

untold valley Nov 1, 2024, 11:28 PM

#

bitter hearth Nov 1, 2024, 11:29 PM

#

no papers on SD 3.5 yet though

lavish sparrow Nov 1, 2024, 11:35 PM

#

#

aight, time to go to bed ^^

bitter hearth Nov 1, 2024, 11:39 PM

#

the glow effect is rly good

craggy crest Nov 1, 2024, 11:45 PM

#

bitter hearth no papers on SD 3.5 yet though

we talked about this. SD3.5 is SD3. we just fixed the issues. and the SD3 paper was written a long time back

#

what else would you want in a paper other than what's already written?

#

useful tool https://sd-tokenizer.rocker.boo/

Stable Diffusion Tokenizer

Informs you about how your prompt/words gets turned into tokens, privately. For Stable Diffusion models, CLIP models

short thicket Nov 1, 2024, 11:56 PM

#

2024-11-01_2024-11-01-195541_Mangled_Merge_Flux_Magic_Lora_Base_Merge_524_Mangled_Merge_Flux_Magic_Lora_Base_Merge_524_793022652513579_0_deis_beta_30_5.0_1.0.png

bitter hearth Nov 2, 2024, 12:02 AM

#

craggy crest what else would you want in a paper other than what's already written?

value of papers is mostly in testing, discussion, ablations, benchmarking etc

craggy crest Nov 2, 2024, 12:03 AM

#

bitter hearth value of papers is mostly in testing, discussion, ablations, benchmarking etc

so what sort of data are you wanting that's not in the original paper?

#

i'll sit here and do those tests if you want

bitter hearth Nov 2, 2024, 12:04 AM

#

haha sadly you need 30,000 images to do FID for example
I will pay to run myself at some point

short thicket Nov 2, 2024, 12:04 AM

#

craggy crest try this prompt: melting hyperdetailed digital art, dripping stunning cosmic bel...

Mangled Merge Flux V1 coming soon.

2024-11-01_2024-11-01-200415_Mangled_Merge_Flux_Magic_Lora_Base_Merge_524_Mangled_Merge_Flux_Magic_Lora_Base_Merge_524_370481366718364_0_deis_beta_30_5.0_1.0.png

bitter hearth Nov 2, 2024, 12:05 AM

#

there's also the case of human preference studies
which are quite expensive

#

there are standardised companies that do those now, the fees are fairly flat but it adds up

#

we just have to wait a bit more, there will be papers on SD 3.5 soon, there are a fair few papers about Flux now

short thicket Nov 2, 2024, 12:09 AM

#

2024-11-01_2024-11-01-200924_Mangled_Merge_Flux_Magic_Lora_Base_Merge_524_Mangled_Merge_Flux_Magic_Lora_Base_Merge_524_1098029388423724_0_deis_beta_30_5.0_1.0.png

craggy crest Nov 2, 2024, 12:09 AM

#

bitter hearth there's also the case of human preference studies which are quite expensive

human prefrences are very subjective however

craggy crest Nov 2, 2024, 12:10 AM

#

bitter hearth we just have to wait a bit more, there will be papers on SD 3.5 soon, there are ...

you might get one faster if you poke lykon

short thicket Nov 2, 2024, 12:11 AM

#

2024-11-01_2024-11-01-201125_Mangled_Merge_Flux_Magic_Lora_Base_Merge_524_Mangled_Merge_Flux_Magic_Lora_Base_Merge_524_329887323870826_0_deis_beta_30_5.0_1.0.png

bitter hearth Nov 2, 2024, 12:16 AM

#

craggy crest human prefrences are very subjective however

they are, but satisfying human preferences is often an objective so we can't really remove that part

#

there seems to be a sort of center of gravity anyway

#

regarding human preferences on most subjects

#

independent attempts at human preference optimisations often end up with kinda similar results

craggy crest Nov 2, 2024, 12:29 AM

#

bitter hearth they are, but satisfying human preferences is often an objective so we can't rea...

yeah, but they're so varied, that's an impossible task. the saying 'you can't please everyone' is the truest statement ever made

craggy crest Nov 2, 2024, 12:30 AM

#

bitter hearth independent attempts at human preference optimisations often end up with kinda s...

only because people are a herd animal

#

but ask them individually - you get cats

short thicket Nov 2, 2024, 12:30 AM

#

2024-11-01_2024-11-01-202947_Mangled_Merge_Flux_Magic_Lora_Base_Merge_524_Mangled_Merge_Flux_Magic_Lora_Base_Merge_524_695939129243932_0_deis_beta_30_5.0_1.0.png

bitter hearth Nov 2, 2024, 12:37 AM

#

yeah I do mostly use stuff like FID to judge things instead

#

cos the human element is removed

#

FID has some issues though

#

it can also be gamed, sadly, its known what to do to subtly raise FID score

craggy crest Nov 2, 2024, 12:39 AM

#

bitter hearth it can also be gamed, sadly, its known what to do to subtly raise FID score

smart people don't read reviews ;)

bitter hearth Nov 2, 2024, 12:41 AM

#

FID is best for like

#

papers that made a sampler and they want to test what settings are best

#

so they show FID score for the different settings

#

for an actual new model I think you've kinda gotta take in all the benches combined, along with human pref study

#

cos with new models the financial incentive to game benchmarks is higher

gritty steeple Nov 2, 2024, 12:44 AM

#

craggy crest Nov 2, 2024, 1:16 AM

#

dusky thistle Nov 2, 2024, 1:48 AM

#

#

#

craggy crest Nov 2, 2024, 2:03 AM

#

dusky thistle

the real red-lipped batfish

dusky thistle Nov 2, 2024, 2:03 AM

#

craggy crest Nov 2, 2024, 2:03 AM

#

https://www.thedodo.com/in-the-wild/red-lipped-batfish-walking

The Dodo

No One Can Believe This Fish Isn’t Wearing Makeup

The red-lipped batfish is one of many unusual fish who live near the Galapagos Islands.

#

https://x.com/stabilityai/status/1852501140174430557

Stability AI (@StabilityAI) on X

We ❤️ the customizations coming from the community with SD3.5!

Check out Clownshark Batwing’s textured oil painting styles, straight from the Stable Diffusion Discord.

You can join the Discord here: https://t.co/Brmd9dAGfr (1/3)

untold valley Nov 2, 2024, 2:06 AM

#

congrats @dusky thistle

craggy crest Nov 2, 2024, 2:17 AM

#

and he promptly goes into hiding

minor lotus Nov 2, 2024, 2:22 AM

#

sd3 output default 10241x1024, can I get higher

untold valley Nov 2, 2024, 2:24 AM

#

you can try but it likes 1megapixel res, its better to gen at that res and then you go and upscale it

dusky thistle Nov 2, 2024, 2:24 AM

#

credit to SAI for releasing two killer models in the last week or so

minor lotus Nov 2, 2024, 2:25 AM

#

untold valley you can try but it likes 1megapixel res, its better to gen at that res and then ...

I can only upgrade to get better, right?

untold valley Nov 2, 2024, 2:25 AM

#

what do you mean?

dusky thistle Nov 2, 2024, 2:25 AM

#

1920x1152 works pretty well for a one-shot generation with SD35M

#

large is a bit more limited for initial latent size

#

you'll gain some and lose some with coherence when going outside of the most heavily trained resolutions

minor lotus Nov 2, 2024, 2:27 AM

#

Should I generate a standard image first and then use upscaling to increase the pixels?

dusky thistle Nov 2, 2024, 2:30 AM

#

you should try both 🙂

minor lotus Nov 2, 2024, 2:31 AM

#

thanks

dusky thistle Nov 2, 2024, 2:31 AM

#

there's advantages and disadvantages to both strategies

#

which is better depends on the subject and model so it's good to experiment with it

craggy crest Nov 2, 2024, 2:32 AM

#

just make a bunch of 15x15 images and tile them ;)

#

(mosaic tiles)

untold valley Nov 2, 2024, 2:33 AM

#

are there any optimal settins for your sampler you have found yet Batwing?

dusky thistle Nov 2, 2024, 2:39 AM

#

untold valley are there any optimal settins for your sampler you have found yet Batwing?

there are many worth exploring

#

#

try these if you want something really fast

#

res_3m is fantastic with paint

#

res_2m is more moderate... both run at euler speed

#

res_2s and espec res_3s are really high quality

#

eta = the amount of noise added, try setting that at 0, 0.25, and 0.5 and compare

untold valley Nov 2, 2024, 2:41 AM

#

many appreciations

dusky thistle Nov 2, 2024, 2:45 AM

#

np

#

https://github.com/ClownsharkBatwing/RES4LYF?tab=readme-ov-file i dropped a couple of WFs on the readme here

GitHub

GitHub - ClownsharkBatwing/RES4LYF

Contribute to ClownsharkBatwing/RES4LYF development by creating an account on GitHub.

#

#

also leaving the WFs embedded in these

craggy crest Nov 2, 2024, 2:51 AM

#

dusky thistle also leaving the WFs embedded in these

scrapes discord, steals all of the shark's workflows

dusky thistle Nov 2, 2024, 2:55 AM

#

sacred jewel Nov 2, 2024, 2:57 AM

#

dusky thistle Nov 2, 2024, 2:57 AM

#

short thicket Nov 2, 2024, 3:00 AM

#

2024-11-01_2024-11-01-225935_Mangled_Merge_Flux_Magic_Lora_Base_Merge_524_Mangled_Merge_Flux_Magic_Lora_Base_Merge_524_153088742464838_0_deis_beta_30_5.0_1.0.png

short thicket Nov 2, 2024, 3:22 AM

#

2024-11-01_2024-11-01-232024_Mangled_Merge_Flux_Magic_Lora_Base_Merge_524_Mangled_Merge_Flux_Magic_Lora_Base_Merge_524_469814476738525_0_deis_beta_30_5.0_1.0.png

untold valley Nov 2, 2024, 3:23 AM

#

one day we will have proper hands. but composition and textures are improving

craggy crest Nov 2, 2024, 3:26 AM

#

untold valley one day we will have proper hands. but composition and textures are improving

put gloves on them

short thicket Nov 2, 2024, 3:34 AM

#

2024-11-01_2024-11-01-233250_Mangled_Merge_Flux_Magic_Lora_Base_Merge_524_Mangled_Merge_Flux_Magic_Lora_Base_Merge_524_1089465790533119_0_deis_beta_30_5.0_1.0.png

runic tusk Nov 2, 2024, 3:36 AM

#

No.

#

Stop it.

#

Get some help.

craggy crest Nov 2, 2024, 3:37 AM

#

you can't generate in this channel. you have to use the artisan channels. start by reading the information here: #artisan-faq

runic tusk Nov 2, 2024, 3:37 AM

#

https://tenor.com/bWx2M.gif

Tenor

craggy crest Nov 2, 2024, 3:42 AM

#

runic tusk https://tenor.com/bWx2M.gif

chill out, dude.

runic tusk Nov 2, 2024, 3:43 AM

#

craggy crest chill out, dude.

Relax. It's just a meme.

craggy crest Nov 2, 2024, 4:17 AM

#

https://youtu.be/wRvnEISJYFI?si=8XhUSQ-AWfp_FByS

YouTube

Rob Adams

Image 2 Image using SD 3.5 L in ComfyUI

My first SD 3.5 video. This is a method of improving image to image in SD3.5, I have had problems with the base workflow so here are a few work arounds. Also a tiled upscale to get around the image size limitations of SD 3.5.

Workflow is here: https://drive.google.com/file/d/1OFwgvutAcTvh6oTrR6iCDfKkMmFckhxO/view?usp=sharing

▶ Play video

dusky thistle Nov 2, 2024, 4:24 AM

#

dusky thistle Nov 2, 2024, 4:44 AM

#

dusky thistle Nov 2, 2024, 5:37 AM

#

hallow lion Nov 2, 2024, 5:38 AM

#

clownshark delivering the goods.

dusky thistle Nov 2, 2024, 6:06 AM

#

hallow lion Nov 2, 2024, 6:09 AM

#

Do a Friday themed one XD

hallow lion Nov 2, 2024, 6:34 AM

#

yeah

#

everyone is lonely

#

until they connect to god

untold valley Nov 2, 2024, 6:54 AM

#

dusky thistle Nov 2, 2024, 6:59 AM

#

#

bitter hearth Nov 2, 2024, 7:14 AM

#

maybe SD3.5M just needed some stochasticity after all

untold valley Nov 2, 2024, 7:26 AM

#

winged seal Nov 2, 2024, 7:44 AM

#

dusky thistle

now this looks fantastic. One of the few SD3.5 images I have seen that I really like

dusky thistle Nov 2, 2024, 7:56 AM

#

dusky thistle Nov 2, 2024, 8:15 AM

#

#

untold valley Nov 2, 2024, 8:43 AM

#

dusky thistle

Oh oh 😳

dusky thistle Nov 2, 2024, 9:00 AM

#

dusky thistle Nov 2, 2024, 9:20 AM

#

#

all still SD3.5M. this model is special

#

#

#

one shot 1920x1152 with medium

untold valley Nov 2, 2024, 9:36 AM

#

dusky thistle all still SD3.5M. this model is special

Agreed it’s what 3 should’ve been from the get go. Lessons were learned goodjob can’t wait for further trainings.

dusky thistle Nov 2, 2024, 9:36 AM

#

yeah, they obviously just felt compelled to release before it was ready, it really was a beta 🙂

untold valley Nov 2, 2024, 9:37 AM

#

Not going to start this all over again. I got ptsd lol. But yeah.

dusky thistle Nov 2, 2024, 9:39 AM

#

#

#

untold valley Nov 2, 2024, 9:42 AM

#

Some issues are it adores 3/4 shots, likes solid white or black backgrounds,

hallow lion Nov 2, 2024, 9:59 AM

#

dusky thistle

Friday Sofa.

hallow lion Nov 2, 2024, 10:22 AM

#

Dor Brothers very close to the future of what movies will be like soon.

noble coyote Nov 2, 2024, 11:12 AM

#

Flux RF Inversion

icy drift Nov 2, 2024, 11:24 AM

#

OmniGen in Comfy is working for me now! 🥳 I had to update my transformers library to 4.45.

#

No need for person loras anymore. 🙂

bitter hearth Nov 2, 2024, 11:25 AM

#

oh nice

#

what have you found it is good for?

#

that face copying ability does look strong

icy drift Nov 2, 2024, 11:27 AM

#

bitter hearth oh nice

I also managed to speed up the node made by https://github.com/AIFSH/OmniGen-ComfyUI
So now it's as fast at the Pinokio / non-comfy install.
I'm sure there's some way to make it much faster, but I have no idea what I'm doing. https://github.com/0X-JonMichaelGalindo/OmniGen-ComfyUI

GitHub

GitHub - AIFSH/OmniGen-ComfyUI

Contribute to AIFSH/OmniGen-ComfyUI development by creating an account on GitHub.

GitHub

GitHub - 0X-JonMichaelGalindo/OmniGen-ComfyUI: Keep model loaded

Keep model loaded. Contribute to 0X-JonMichaelGalindo/OmniGen-ComfyUI development by creating an account on GitHub.

icy drift Nov 2, 2024, 11:28 AM

#

bitter hearth what have you found it is good for?

All I did was experiment with it a week ago to see what it could do.
It's limited to photorealism and people as far as reposing behavior goes.
Now that I have it in Comfy, I'll try some more things.

bitter hearth Nov 2, 2024, 11:28 AM

#

okay awesome

#

speed ups are tricky to implement, might need to wait for support

#

particularly stuff like tensorrt

severe phoenix Nov 2, 2024, 12:29 PM

#

please does anyone have this problem where flux loras work better on civit than on their comfy?? right is my comfy, left is civit, how the lora should actually look. b4 u ask i have tried all the scheduler combos, still same issue persists.

short thicket Nov 2, 2024, 12:31 PM

#

Uploading Mangled Merge V1 Dedistilled currently. It's Mangled Merge Matrix and Magic, plus PixelWave, FluxBooru, and nyanko7's dedistilled model. The model works as a dedistilled model so flux guidance is useless but negative prompts and dynamic thresholding work great. It also get's the styles of PixelWave, and the booru knowledge of FluxBooru and Loras work fine on it too.

2024-11-01_2024-11-01-224323_Mangled_Merge_Flux_Magic_Lora_Base_Merge_524_Mangled_Merge_Flux_Magic_Lora_Base_Merge_524_616035756425607_0_deis_beta_30_5.0_1.0.png

severe phoenix Nov 2, 2024, 12:49 PM

#

short thicket Uploading Mangled Merge V1 Dedistilled currently. It's Mangled Merge Matrix and ...

cant wait, will it work with normal loras??

short thicket Nov 2, 2024, 12:50 PM

#

Yes. The ones I tested worked fine. You can try it out here:

https://civitai.com/models/788136/mangled-merge-flux?modelVersionId=1019621

severe phoenix Nov 2, 2024, 12:54 PM

#

short thicket Yes. The ones I tested worked fine. You can try it out here: https://civitai.co...

ok thaks!

severe phoenix Nov 2, 2024, 12:55 PM

#

short thicket Yes. The ones I tested worked fine. You can try it out here: https://civitai.co...

pls can u upload to hugginface?

unkempt compass Nov 2, 2024, 12:55 PM

#

icy drift I also managed to speed up the node made by https://github.com/AIFSH/OmniGen-Com...

I saw that there are compressed versions of Omnigen, but didn't figure out how to load these models in a OmniGen Workflow

short thicket Nov 2, 2024, 12:56 PM

#

severe phoenix pls can u upload to hugginface?

is there a specific quantiziation you are looking for?

severe phoenix Nov 2, 2024, 1:00 PM

#

short thicket is there a specific quantiziation you are looking for?

fp8 and 16 thanks!

short thicket Nov 2, 2024, 1:01 PM

#

severe phoenix fp8 and 16 thanks!

k. I'll post a link once they are done.

short thicket Nov 2, 2024, 1:08 PM

#

severe phoenix fp8 and 16 thanks!

Uploading now. Give it some time though HF is slow with uploads and I've had them fail on me halfway through.

https://huggingface.co/ManglerFTW/Mangled_Merge_Flux_V1_Dedistilled/tree/main

ManglerFTW/Mangled_Merge_Flux_V1_Dedistilled at main

short thicket Nov 2, 2024, 1:23 PM

#

severe phoenix fp8 and 16 thanks!

Looks like they are finished uploading. Quicker than I'de expected.

icy drift Nov 2, 2024, 1:37 PM

#

unkempt compass I saw that there are compressed versions of Omnigen, but didn't figure out how t...

I haven't seen any compressed versions of OmniGen.

unkempt compass Nov 2, 2024, 1:39 PM

#

icy drift I haven't seen any compressed versions of OmniGen.

https://huggingface.co/goodasdgood/OmniGen_quantization/tree/main
https://huggingface.co/sdyy/OmniGen_quantization2/tree/main

goodasdgood/OmniGen_quantization at main

sdyy/OmniGen_quantization2 at main

severe phoenix Nov 2, 2024, 1:41 PM

#

short thicket Looks like they are finished uploading. Quicker than I'de expected.

thanks!

short thicket Nov 2, 2024, 1:42 PM

#

severe phoenix thanks!

You're welcome!

severe phoenix Nov 2, 2024, 1:44 PM

#

short thicket You're welcome!

hello please the .gguf file, can it be converted to safetensor??

frank mural Nov 2, 2024, 1:46 PM

#

anyone knows if 3.5 is available for forge?

short thicket Nov 2, 2024, 1:46 PM

#

severe phoenix hello please the .gguf file, can it be converted to safetensor??

Hmm. I'm not sure how to do that. Is it possible?

severe phoenix Nov 2, 2024, 1:46 PM

#

short thicket Hmm. I'm not sure how to do that. Is it possible?

i dont know lool but i'm trying to load it nd its saying it cant, probably because its not a safetensor file

icy drift Nov 2, 2024, 1:47 PM

#

unkempt compass https://huggingface.co/goodasdgood/OmniGen_quantization/tree/main https://huggin...

I have no idea how to load these. The custom node code is loading a safetensors file, not a pth or a pt file. I don't know if I can just drop that in and point it to the new format.

short thicket Nov 2, 2024, 1:47 PM

#

severe phoenix i dont know lool but i'm trying to load it nd its saying it cant, probably becau...

what program are you using to load it?

unkempt compass Nov 2, 2024, 1:47 PM

#

That is why I was asking the question

severe phoenix Nov 2, 2024, 1:48 PM

#

short thicket what program are you using to load it?

ehh hugginface api for comfy. i'm using rented comfy cloud server

short thicket Nov 2, 2024, 1:50 PM

#

severe phoenix ehh hugginface api for comfy. i'm using rented comfy cloud server

I haven't used it before, but maybe this might help?

https://huggingface.co/docs/hub/en/gguf

GGUF

severe phoenix Nov 2, 2024, 1:50 PM

#

short thicket what program are you using to load it?

how come fp16 is safetensor but fp8 is gguf file. shouldnt they both be safetensors?

short thicket Nov 2, 2024, 1:50 PM

#

bf16 is it's original format, then it was quantized down from that hence fp8 being gguf.

severe phoenix Nov 2, 2024, 1:55 PM

#

short thicket bf16 is it's original format, then it was quantized down from that hence fp8 bei...

damn, ok thanks by the way. i'll see if i can look for a fix

short thicket Nov 2, 2024, 1:56 PM

#

severe phoenix damn, ok thanks by the way. i'll see if i can look for a fix

You're welcome

icy drift Nov 2, 2024, 2:00 PM

#

@unkempt compass Changing the model config to int8 datatype did not change memory requirements, and changing datatype to fp8_e5m2 failed. I do not know what else to try, unless you have any suggestions.