fervent thunder Apr 21, 2025, 2:49 PM

#

I heard about stepfun cos I read about it in arxiv

floral umbra Apr 21, 2025, 2:49 PM

#

Nay, this channel just doesn't have images. #🏞｜general-with-images right below does though :P

#

https://github.com/stepfun-ai/ComfyUI-StepVideo Appears they added support last month :P

#

Oh, only via api apparently. How much vram will it require minimum? Thonk

fervent thunder Apr 21, 2025, 2:52 PM

#

yeah api isn't rly support

#

are you including blockswap?

#

with blockswap it will run on anything

#

without blockswap I am not sure what the actual minimum is

#

but their reference code defaults to 4 H100s and then one more server for controlling it

floral umbra Apr 21, 2025, 2:54 PM

#

Haven't even acquired it yet, as i couldn't find a version of it being a single .safetensors

fervent thunder Apr 21, 2025, 2:55 PM

#

oh I don't think we should be doing that anyway

#

huggingface's original format was so much better

#

VAE and text encoders separate, and files broken up

floral umbra Apr 21, 2025, 2:57 PM

#

Interesting Thunk How much memory does it require to run ram and vram wise? Like, same as wan 720p?

fervent thunder Apr 21, 2025, 2:58 PM

#

that's what I was saying

#

if you blockswap then it will run on anything only a few GB needed

#

without blockswap I am not sure

#

their code is 4 H100 plus one more server

#

to run it

floral umbra Apr 21, 2025, 2:59 PM

#

whatface

#

Then not for me, as i prefer all local kek

#

Easier to pay 80 cents per 24 hours of my own 3090 than 80 cents for an hour of runpod for instance omegaLUL

fervent thunder Apr 21, 2025, 3:01 PM

#

cloud is cheaper, than the electricity required to run a 3090 at home

#

if you want to use at home for privacy that is fine

#

but cloud is actually the cheaper option

floral umbra Apr 21, 2025, 3:01 PM

#

Well, runpod is 24x pricier than my electricity :P

#

Plus i do all local anyways due tp privacy. I would only use cloud to test out models capability, or just workstation cards's speed vs my own card

ancient mauve Apr 21, 2025, 3:05 PM

#

https://huggingface.co/diffusers/stable-diffusion-xl-1.0-inpainting-0.1 I want to use this model for Inpainting but I have no idea how to put it on forge UI I see no safetensor file

oblique agate Apr 21, 2025, 3:05 PM

#

fervent thunder cloud is cheaper, than the electricity required to run a 3090 at home

locally run stuff is freedom

#

in economic sense always run cloud

#

https://www.youtube.com/watch?v=TCHXzX6vUcA https://rumble.com/v6sai4p--dance-of-destiny-english-subtitle.html my first time using ai plus anime footage in my music video

fervent thunder Apr 21, 2025, 3:14 PM

#

yeah I have no problem with libertarians that's fine
personally I just want cheap inference

#

you can do a mixture anyway

fervent thunder Apr 21, 2025, 3:14 PM

#

floral umbra Well, runpod is 24x pricier than my electricity :P

yeah runpod is the wrong one

#

vast ai is the cheap one

ancient mauve Apr 21, 2025, 3:16 PM

#

ancient mauve https://huggingface.co/diffusers/stable-diffusion-xl-1.0-inpainting-0.1 I want t...

Anyone?

fervent thunder Apr 21, 2025, 3:17 PM

#

look at this one I linked to
#🏞｜general-with-images message

#

RTX 4090 for $0.109/hr

#

for me to run that at home would cost $0.30 in electricity

oblique agate Apr 21, 2025, 3:19 PM

#

fervent thunder yeah I have no problem with libertarians that's fine personally I just want chea...

cheap inference likely won't happen till ai tools are more mainstreamn

#

kinda like dot com bubble

fervent thunder Apr 21, 2025, 3:20 PM

#

yea I agree

#

when it becomes mainstream there will be more political will

#

and a larger market to sell into so raising capital will be easier

#

they need to research targeting use-cases more

#

at the moment its all a hammer in search of a nail

oblique agate Apr 21, 2025, 3:28 PM

#

fervent thunder when it becomes mainstream there will be more political will

the idea is to perfect your ai craft like using ai tools so when it becomes mainstream you have an edge

fervent thunder Apr 21, 2025, 3:30 PM

#

sounds good

#

I started in the 90's lol

#

but it depends on what you call ai

polar wagon Apr 21, 2025, 3:31 PM

#

Would anyone be able to point me towards a guide geared towards using InvokeAI to create large amounts of game assets (images of playing cards)? I'm making a game and I have cards that give powerups. I was thinking I could take the card descriptions, pass those to InvokeAI, and have it pump out placeholder artwork so I could continue development.

oblique agate Apr 21, 2025, 3:32 PM

#

polar wagon Would anyone be able to point me towards a guide geared towards using InvokeAI t...

never used invokeAI :<

fervent thunder Apr 21, 2025, 3:33 PM

#

I used it a tiny bit

#

but i am not familiar enough

oblique agate Apr 21, 2025, 3:33 PM

#

I need to research more on image generation

fervent thunder Apr 21, 2025, 3:33 PM

#

if you've mostly used comfy trying diffusers can be good

#

or pure pytorch like the original flux code (that particular code is rly nice)

sage reef Apr 21, 2025, 3:37 PM

#

@woven panther hey i appreciate you started porting some SkyReels V2 stuff ❤️

oblique agate Apr 21, 2025, 3:38 PM

#

https://github.com/mr-fool/ai-background-removal-toolkit did this like yesterday. Learning the library

fervent thunder Apr 21, 2025, 3:39 PM

#

background removal is nice yeah

oblique agate Apr 21, 2025, 3:40 PM

#

fervent thunder background removal is nice yeah

the library does all the heavy lifting it just slap a gui on it

sage reef Apr 21, 2025, 3:41 PM

#

how many background removal tools do we have these days? i lost count :3

fervent thunder Apr 21, 2025, 3:41 PM

#

ye llm can pump out a nice GUI

#

in no effort

abstract quarry Apr 21, 2025, 3:42 PM

#

polar wagon Would anyone be able to point me towards a guide geared towards using InvokeAI t...

there are plenty of tutorials on YouTube. What is xou exact question. It sounds like you just want to do text2image

oblique agate Apr 21, 2025, 3:43 PM

#

sage reef how many background removal tools do we have these days? i lost count :3

is like react todo list. It is flooded

#

basically nowadays as soon as someone pump out some ai tools. Someone will fork it to boost their resume

ancient mauve Apr 21, 2025, 3:44 PM

#

ok how do I install the sdxl inpainting model or the ace plus model, none of them are working

abstract quarry Apr 21, 2025, 3:44 PM

#

you don't install models

ancient mauve Apr 21, 2025, 3:44 PM

#

you copy paste them in the correct folders

woven panther Apr 21, 2025, 3:45 PM

#

sage reef <@228118453062467585> hey i appreciate you started porting some SkyReels V2 stuf...

new model every day recently

oblique agate Apr 21, 2025, 3:45 PM

#

capstone projects used to be a good idea until everyone just copy each other on github as soon as it looks cool

sage reef Apr 21, 2025, 3:45 PM

#

woven panther new model every day recently

haha no time to rest eh 🙂

woven panther Apr 21, 2025, 3:45 PM

#

no time to use them

sage reef Apr 21, 2025, 3:45 PM

#

mhm

ancient mauve Apr 21, 2025, 3:46 PM

#

abstract quarry you don't install models

I just want to either put https://huggingface.co/diffusers/stable-diffusion-xl-1.0-inpainting-0.1 or https://huggingface.co/ali-vilab/ACE_Plus in forgeUI

abstract quarry Apr 21, 2025, 3:47 PM

#

I would assume you put them in the same directory as normal models, but not sure. I don't use forge

ancient mauve Apr 21, 2025, 3:47 PM

#

does anyone here use forge?

abstract quarry Apr 21, 2025, 3:47 PM

#

you use them only for inpainting the same way you would inpaint with a normal model

#

what is not working for you? does the model not appear in the model selection menu?

fervent thunder Apr 21, 2025, 3:48 PM

#

some forge users hang here

sage reef Apr 21, 2025, 3:48 PM

#

something interesting about SkyReels V2, i did a small inference test using basic comfy workflow with the i2v 540p 1.3 model,
and it generates the video sure... but like the starting frame (image) very quickly and abruptly changes and the camera even moves
in a strange way lol, idk.. il wait for your wrapper anyway, maybe you will implement it the way it's meant to be used, cause right now
kinda wonky lol @woven panther

fervent thunder Apr 21, 2025, 3:48 PM

#

forge did look appealing I just never got round to it
and now I have left for Rust lol

ancient mauve Apr 21, 2025, 3:48 PM

#

fervent thunder some forge users hang here

it doesnt have to be those models, I just want to use an inpainting model to expand some images

#

with forge UI

#

but I keep getting error

abstract quarry Apr 21, 2025, 3:48 PM

#

what errors?

fervent thunder Apr 21, 2025, 3:49 PM

#

ye its tricky cos not having used forge UI its hard to help

ancient mauve Apr 21, 2025, 3:49 PM

#

what UI do you guys use

abstract quarry Apr 21, 2025, 3:49 PM

#

comfyui or (rarely) InvokeAI

floral umbra Apr 21, 2025, 3:51 PM

#

fervent thunder vast ai is the cheap one

Does vast have templates? Like run it once, it'll auto setup everything for you ready to go?

ancient mauve Apr 21, 2025, 3:52 PM

#

Unable to start ComfyUI Desktop v0.4.36

#

this is why I dont use comfy ui

sage reef Apr 21, 2025, 3:53 PM

#

i personally use just the normal comfy the portable one, not the desktop

abstract quarry Apr 21, 2025, 3:53 PM

#

invokeai is the most simple to use ui in my opinion 🤷‍♂️

fervent thunder Apr 21, 2025, 3:53 PM

#

floral umbra Does vast have templates? Like run it once, it'll auto setup everything for you ...

yeah

#

standard docker

abstract quarry Apr 21, 2025, 3:54 PM

#

comfyui is the most flexible one, but with a high learning curve

sage reef Apr 21, 2025, 3:54 PM

#

i mean it's not even that high...

fervent thunder Apr 21, 2025, 3:54 PM

#

comfy is the best gui yeah
to beat gui you have to go to command line / code frameworks

woven panther Apr 21, 2025, 3:54 PM

#

sage reef something interesting about SkyReels V2, i did a small inference test using basi...

it's just the model, if it doesn't "recognize" the input, it does whatever

#

and it follow the prompt REALLY closely

#

if you prompt something that's not in your input image, it can just move onto that and ignore it

#

and it's human centric model.. non-human stuff does that more often than not

#

I did get some amazing outputs when I initially tested it though

ancient mauve Apr 21, 2025, 3:55 PM

#

abstract quarry comfyui is the most flexible one, but with a high learning curve

comfyui sucks, I drag the node...#🏞｜general-with-images message

woven panther Apr 21, 2025, 3:56 PM

#

so I don't think there's anything wrong, it works in both the wrapper and native just as it is too

ancient mauve Apr 21, 2025, 3:56 PM

#

andf oh? where the f is it? #🏞｜general-with-images message

fervent thunder Apr 21, 2025, 3:56 PM

#

the GUI has had more bugs lately

sage reef Apr 21, 2025, 3:56 PM

#

hmm maybe il try again and see, but il wait for your wrapper as well 🙂

fervent thunder Apr 21, 2025, 3:56 PM

#

but you can decouple the GUI from the back end and use the back end alone

#

one of my current projects is to make rust front end

ancient mauve Apr 21, 2025, 3:56 PM

#

comfy my ass

woven panther Apr 21, 2025, 3:57 PM

#

sage reef hmm maybe il try again and see, but il wait for your wrapper as well 🙂

it already works, I mean there are many Skyreels models... the DF is the one that's very different and needs it's own code, the rest just works with any old workflow

sage reef Apr 21, 2025, 3:57 PM

#

yea

fervent thunder Apr 21, 2025, 3:57 PM

#

I mean what GUI is alternative?

ancient mauve Apr 21, 2025, 3:57 PM

#

comfyUI doesnt even work, I cant manage to install inpainting in forgeUI fm

fervent thunder Apr 21, 2025, 3:57 PM

#

alternative GUIs are forge or invoke?
these are like 0.01% of comfyui features

#

if you include CLI/code-based then you get all of pytorch/jax/julia/C++/rust ecosystems etc
but these are not GUI

sage reef Apr 21, 2025, 3:58 PM

#

try the non-desktop version of comfy, it should work

abstract quarry Apr 21, 2025, 3:58 PM

#

invoke and forge have quite a lot features

ancient mauve Apr 21, 2025, 3:59 PM

#

abstract quarry invoke and forge have quite a lot features

inpainting isnt working for me

abstract quarry Apr 21, 2025, 3:59 PM

#

99% of the users don't need 99% of the extra features

ancient mauve Apr 21, 2025, 3:59 PM

#

it needs a model to use

ancient mauve Apr 21, 2025, 3:59 PM

#

sage reef try the non-desktop version of comfy, it should work

link?

sage reef Apr 21, 2025, 3:59 PM

#

https://github.com/comfyanonymous/ComfyUI/releases

fervent thunder Apr 21, 2025, 4:00 PM

#

its tricky cos a lot of features are edge cases
where you only need it a handful of times
but in that moment you really needed it

#

there are a lot of features that I have not used in recent workflows that I found indispensable in previous ones

ancient mauve Apr 21, 2025, 4:01 PM

#

whats the point of having one million features if something as basic as dragging objects on screen doesnt work

fervent thunder Apr 21, 2025, 4:01 PM

#

I mean I agree I've switched to rust lol

ancient mauve Apr 21, 2025, 4:01 PM

#

sage reef https://github.com/comfyanonymous/ComfyUI/releases

will try this buut im pretty obfuscated right now

abstract quarry Apr 21, 2025, 4:01 PM

#

it seem to work for everyone else 😂

fervent thunder Apr 21, 2025, 4:01 PM

#

I got too frustrated after nearly 2 years of bugs

#

does it though

#

😂

ancient mauve Apr 21, 2025, 4:01 PM

#

abstract quarry it seem to work for everyone else 😂

I did a clean installation and doesnt work

fervent thunder Apr 21, 2025, 4:02 PM

#

the thing is its hard no matter what you do

#

some stuff like loading and casting I find hard in every single codebase and language

#

and sorting out compile

#

sage attention and teacache/firstblockcache also

#

this stuff needs setting up in every fresh project

abstract quarry Apr 21, 2025, 4:04 PM

#

if you want a easy to install and easy to use ui, I would use invokeai.

I would argue that comfyui is the wrong tool for you. It's complicated to use when you don't understand the internals

sage reef Apr 21, 2025, 4:04 PM

#

technology moves so fast that by the time those will be default in some setups, they will most likely be deprecated by that point 😂

ancient mauve Apr 21, 2025, 4:04 PM

#

forgeUI is the one Im linking cause is the same as authomatic1111 the one I used in the past

#

its just that I cant manage inpainting for now

abstract quarry Apr 21, 2025, 4:04 PM

#

you can also use forge. But you won't find help by anyone if you cannot precisely say what is your error message

sage reef Apr 21, 2025, 4:05 PM

#

isnt swarmui a nice GUI? it has tons of features, like close to comfy features i think and it should give you inpainting stuff

ancient mauve Apr 21, 2025, 4:05 PM

#

abstract quarry you can also use forge. But you won't find help by anyone if you cannot precisel...

I downloaded this model ace_plus_fft.safetensors

#

which is supposed to be an inpainting model

#

I put it in the stable diffusion models folder

abstract quarry Apr 21, 2025, 4:06 PM

#

😬

#

don't use that one

ancient mauve Apr 21, 2025, 4:06 PM

#

and when I try to do a generation I get AssertionError: You do not have CLIP state dict!

fervent thunder Apr 21, 2025, 4:06 PM

#

sage reef technology moves so fast that by the time those will be default in some setups, ...

caching in diffusion is around 2 years old

ancient mauve Apr 21, 2025, 4:06 PM

#

abstract quarry don't use that one

its the one the other guy said

fervent thunder Apr 21, 2025, 4:06 PM

#

sage is new yeah

ancient mauve Apr 21, 2025, 4:06 PM

#

what do I use then

fervent thunder Apr 21, 2025, 4:06 PM

#

although I actually use STA instead of sage where I can

abstract quarry Apr 21, 2025, 4:06 PM

#

use sdxl inpainting for example

ancient mauve Apr 21, 2025, 4:06 PM

#

abstract quarry use sdxl inpainting for example

and I cant manage to download that one

abstract quarry Apr 21, 2025, 4:07 PM

#

or flux inpainting if forge supports it (I don't know)

ancient mauve Apr 21, 2025, 4:07 PM

#

https://huggingface.co/diffusers/stable-diffusion-xl-1.0-inpainting-0.1

#

where is the safetensor?

#

how do I use this then @abstract quarry

abstract quarry Apr 21, 2025, 4:08 PM

#

it's a diffusers model. You could check civitai if they have a forge compatible one

ancient mauve Apr 21, 2025, 4:08 PM

#

abstract quarry it's a diffusers model. You could check civitai if they have a forge compatible ...

a diffuser model isnt a single file?

sage reef Apr 21, 2025, 4:09 PM

#

i mean it's right here:
https://huggingface.co/diffusers/stable-diffusion-xl-1.0-inpainting-0.1/blob/main/unet/diffusion_pytorch_model.fp16.safetensors

ancient mauve Apr 21, 2025, 4:09 PM

#

I just want to download the model but I see a bunch of files instead of a single one and I dont know how to set up in forge

ancient mauve Apr 21, 2025, 4:10 PM

#

sage reef i mean it's right here: https://huggingface.co/diffusers/stable-diffusion-xl-1.0...

you dont need more files other than that one?

#

what about diffusion_pytorch_model.safetensors whats the difference

sage reef Apr 21, 2025, 4:10 PM

#

well you might need vae and clip, but thats also all there for you to download

#

fp16 is smaller and basically the same quality

ancient mauve Apr 21, 2025, 4:10 PM

#

so I need the safetensor, the vae and a clip

#

3 files, not 1

sage reef Apr 21, 2025, 4:12 PM

#

well i never used forge idk.. but usually thats how it works, you either get 3 separate things (unet, vae and clip) or if you lucky you have all in one.
im sure there was a download for inpainting all in one somewhere, but i dont remember where

ancient mauve Apr 21, 2025, 4:12 PM

#

thats the problem then, it wasnt working because I was missing files 🤦🏼‍♂️

ancient mauve Apr 21, 2025, 4:12 PM

#

abstract quarry don't use that one

btw why is this model bad?

abstract quarry Apr 21, 2025, 4:12 PM

#

ancient mauve a diffuser model isnt a single file?

no, it's multiple files and also a different naming scheme

ancient mauve Apr 21, 2025, 4:13 PM

#

abstract quarry no, it's multiple files and also a different naming scheme

I didnt know that

abstract quarry Apr 21, 2025, 4:13 PM

#

ancient mauve btw why is this model bad?

it's not a inpainting model

#

also, it's based on flux-fill which is not supported by forge

#

at least it seems so for me

#

which is sad. flux-fill is probably the strongest inpainting model

fervent thunder Apr 21, 2025, 4:14 PM

#

ye

#

powerpaint v2 for SD 1.5 is not bad

#

its bizzarely strong for an sd 1.5 thing

#

before Flux fill it regularly took SOTAs

sage reef Apr 21, 2025, 4:15 PM

#

yea if you have the hardware specs, go maybe with flux fill

ancient mauve Apr 21, 2025, 4:15 PM

#

https://huggingface.co/diffusers/stable-diffusion-xl-1.0-inpainting-0.1/tree/main im downloading this for the moment

#

wait

ancient mauve Apr 21, 2025, 4:15 PM

#

sage reef yea if you have the hardware specs, go maybe with flux fill

ok, time out because Im getting confused

sage reef Apr 21, 2025, 4:15 PM

#

lol

ancient mauve Apr 21, 2025, 4:15 PM

#

what is the best inpainting model I should download

#

for image EXTENSION

sage reef Apr 21, 2025, 4:15 PM

#

depends on your specs

ancient mauve Apr 21, 2025, 4:16 PM

#

I have a 4090gtx

abstract quarry Apr 21, 2025, 4:16 PM

#

flux-fill is the best inpainting model BUT I don't know if it us supported by forge

sage reef Apr 21, 2025, 4:16 PM

#

well you can do flux fill then

#

and i also have no idea about forge

abstract quarry Apr 21, 2025, 4:16 PM

#

cause on their GitHub they write they haven't implemented full flux support yet

sage reef Apr 21, 2025, 4:16 PM

#

yikes

ancient mauve Apr 21, 2025, 4:16 PM

#

abstract quarry flux-fill is the best inpainting model BUT I don't know if it us supported by fo...

PFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFF

#

I want to bash my head against a wall

sage reef Apr 21, 2025, 4:17 PM

#

this is why comfy is king 🙂

ancient mauve Apr 21, 2025, 4:17 PM

#

isnt there a standalone github repository for inpainting or something

ancient mauve Apr 21, 2025, 4:17 PM

#

abstract quarry flux-fill is the best inpainting model BUT I don't know if it us supported by fo...

so I can use this model?

sage reef Apr 21, 2025, 4:17 PM

#

i mean im sure you can find even a huggingface space for free to inpaint

fervent thunder Apr 21, 2025, 4:18 PM

#

ancient mauve isnt there a standalone github repository for inpainting or something

yes pretty much- the base flux code repo

abstract quarry Apr 21, 2025, 4:18 PM

#

or use invokeai, it has support for flux

ancient mauve Apr 21, 2025, 4:18 PM

#

fervent thunder yes pretty much- the base flux code repo

do you have a link?

fervent thunder Apr 21, 2025, 4:18 PM

#

if you search github flux black forest labs it should come up

ancient mauve Apr 21, 2025, 4:19 PM

#

abstract quarry or use invokeai, it has support for flux

do I have to use the flux model or thera re no other good inpainting models

abstract quarry Apr 21, 2025, 4:19 PM

#

there are plenty of inpainting models

ancient mauve Apr 21, 2025, 4:19 PM

#

fervent thunder if you search ```github flux black forest labs``` it should come up

https://www.invoke.com/ this one?

sage reef Apr 21, 2025, 4:19 PM

#

for example, this does flux fill outpainting:
https://huggingface.co/spaces/multimodalart/flux-fill-outpaint

ancient mauve Apr 21, 2025, 4:19 PM

#

abstract quarry there are plenty of inpainting models

yeah but I dont know which ones

native heart Apr 21, 2025, 4:19 PM

#

anyone can give me tip on how to make face remain the same on i2v using wan 2.1 model

abstract quarry Apr 21, 2025, 4:19 PM

#

ancient mauve https://www.invoke.com/ this one?

yes

ancient mauve Apr 21, 2025, 4:19 PM

#

the flux one you say doesnt seem to work with forge as you say and the other model the other guy said doesnt seem to eb an inpainting model after all

abstract quarry Apr 21, 2025, 4:20 PM

#

you can try the sdxl inpainting you downloaded with forge

ancient mauve Apr 21, 2025, 4:20 PM

#

and after 3 UI installs none of them work because one is incompatible with flux and comfyUI doesnt even have a working drag feature and Im getting crazy

sage reef Apr 21, 2025, 4:20 PM

#

https://huggingface.co/spaces/black-forest-labs/FLUX.1-Fill-dev

ancient mauve Apr 21, 2025, 4:20 PM

#

abstract quarry you can try the sdxl inpainting you downloaded with forge

which one

ancient mauve Apr 21, 2025, 4:21 PM

#

abstract quarry you can try the sdxl inpainting you downloaded with forge

https://huggingface.co/diffusers/stable-diffusion-xl-1.0-inpainting-0.1/tree/main this one?

abstract quarry Apr 21, 2025, 4:21 PM

#

try invokeai. It has a full installer that also automatically download all the models for you

abstract quarry Apr 21, 2025, 4:21 PM

#

ancient mauve https://huggingface.co/diffusers/stable-diffusion-xl-1.0-inpainting-0.1/tree/mai...

the unet/diffusion_pytorch_model.fp16.safetensors is the file you need

ancient mauve Apr 21, 2025, 4:22 PM

#

wait forge UI seems to work with flux

fervent thunder Apr 21, 2025, 4:22 PM

#

maybe they updated

#

there is another one called reforge

#

IDK what it is

ancient mauve Apr 21, 2025, 4:22 PM

#

Im gonna try with forge I dont want to isntall any more stuff

ancient mauve Apr 21, 2025, 4:23 PM

#

abstract quarry the unet/diffusion_pytorch_model.fp16.safetensors is the file you need

But I need 3 files, the one inside the vae folder, the one inside the unet folder

#

and the third one?

fervent thunder Apr 21, 2025, 4:24 PM

#

text encoders

ancient mauve Apr 21, 2025, 4:24 PM

#

fervent thunder text encoders

there are 2 text encoder folders

#

do I have to download both or only 1 of them?

abstract quarry Apr 21, 2025, 4:25 PM

#

xou haven't used sdxl so far?

ancient mauve Apr 21, 2025, 4:25 PM

#

no

abstract quarry Apr 21, 2025, 4:26 PM

#

😅

#

then you have to download everything

ancient mauve Apr 21, 2025, 4:26 PM

#

I used 1.5 long ago

abstract quarry Apr 21, 2025, 4:26 PM

#

or just use invokeai 😬

ancient mauve Apr 21, 2025, 4:26 PM

#

and I remember seting up the model and the vae, then no longer needing the ave files

#

but dunno how things work now

ancient mauve Apr 21, 2025, 4:26 PM

#

abstract quarry or just use invokeai 😬

maybe later

fervent thunder Apr 21, 2025, 4:26 PM

#

ancient mauve there are 2 text encoder folders

both

ancient mauve Apr 21, 2025, 4:27 PM

#

so I end up with 4 files in the end, the unet, the vae and the 2 text encoders

abstract quarry Apr 21, 2025, 4:27 PM

#

I don't want to make advertisement for invokeai 😂 it's just really newcomer friendly and it sometimes makes me crazy when tools like comfyui are recommended for new people although these tools are definitely more for professional users

abstract quarry Apr 21, 2025, 4:27 PM

#

ancient mauve so I end up with 4 files in the end, the unet, the vae and the 2 text encoders

yes

#

and if you want to also use sdxl you only need the sdxl unet file

fervent thunder Apr 21, 2025, 4:28 PM

#

TBH I just forget for months at a time that invoke exists
as a tool I have no issue with it

ancient mauve Apr 21, 2025, 4:29 PM

#

abstract quarry and if you want to also use sdxl you only need the sdxl unet file

so unet its like the base model and the other 3 are addons for inpainting?

ancient mauve Apr 21, 2025, 4:29 PM

#

fervent thunder TBH I just forget for months at a time that invoke exists as a tool I have no is...

wanna try forge by the moment, its siomilar to 1111

sage reef Apr 21, 2025, 4:29 PM

#

you can also use all in one Juggernaut inpainting model, based on sdxl:
https://civitai.com/models/403361/juggernaut-xl-inpainting

abstract quarry Apr 21, 2025, 4:30 PM

#

ancient mauve so unet its like the `base` model and the other 3 are addons for inpainting?

no, you always have text encoders, vae, and then the real model (called unet for historical reasons)

fervent thunder Apr 21, 2025, 4:30 PM

#

also there is swarm

#

I think swarm was unmentioned so far

sage reef Apr 21, 2025, 4:30 PM

#

yea swarm is cool, even tho i never used it lol

#

i did mention it

fervent thunder Apr 21, 2025, 4:30 PM

#

ah ok

abstract quarry Apr 21, 2025, 4:30 PM

#

inpainting is an extension of the model

fervent thunder Apr 21, 2025, 4:30 PM

#

didn't see

ancient mauve Apr 21, 2025, 4:35 PM

#

abstract quarry and if you want to also use sdxl you only need the sdxl unet file

then why did you said this

ancient mauve Apr 21, 2025, 4:35 PM

#

abstract quarry inpainting is an extension of the model

I always thought inpainting models were finetuned versions of a model exclusively for inpainting

boreal dew Apr 21, 2025, 4:36 PM

#

is 8gb of vram enough for illustrious?

#

starting to think not.

fervent thunder Apr 21, 2025, 4:40 PM

#

is ok

boreal dew Apr 21, 2025, 4:40 PM

#

it does not appear to be.

ancient mauve Apr 21, 2025, 4:46 PM

#

abstract quarry flux-fill is the best inpainting model BUT I don't know if it us supported by fo...

do you have a link to the flux model? maybe I can make it work

atomic mortar Apr 21, 2025, 5:04 PM

#

boreal dew is 8gb of vram enough for illustrious?

It is

#

i used to run it on a 3070TI

#

but it depends if you are using a nvidia card of amd

boreal dew Apr 21, 2025, 5:04 PM

#

atomic mortar i used to run it on a 3070TI

yeah i crossposted like a dummy and cs1o told me that i needed to add --medvram-sdxl to the .bat

#

which i didn't do at first
but now it's running a lot better now that i did

abstract quarry Apr 21, 2025, 5:07 PM

#

ancient mauve then why did you said this

you only need the unet cause all other components are identical between sdxl and sdlx-inpaint

abstract quarry Apr 21, 2025, 5:09 PM

#

ancient mauve I always thought inpainting models were finetuned versions of a model exclusivel...

no, it's also a bit different, because it gets three inputs: the original image, the mask, and the noised image

abstract quarry Apr 21, 2025, 5:11 PM

#

ancient mauve do you have a link to the flux model? maybe I can make it work

even if flux is supported by forge, this doesn't mean that the inpaint model is supported

#

https://huggingface.co/black-forest-labs/FLUX.1-Fill-dev/tree/main

ancient mauve Apr 21, 2025, 5:15 PM

#

ValueError: Failed to recognize model type!

#

fuck this

#

for the love of God if anyone's reading this and manages to make this work in forgeUI please let me know https://huggingface.co/diffusers/stable-diffusion-xl-1.0-inpainting-0.1

atomic mortar Apr 21, 2025, 5:35 PM

#

woah another one, gotta be bots or something

#

happens a lot here

#

lol i believe you

#

hmm hope you find clientele here

#

hmm not here probably, rarely some businesses appear here wanting AI solutions for dirty cheap/free

#

but its mostly a community server

#

you might have more luck on fiverr for freelancing

fervent thunder Apr 21, 2025, 5:48 PM

#

hi if this is real person
your way of advertising is a really bad idea
cos it literally looks like a malware bot

brittle kraken Apr 21, 2025, 5:53 PM

#

Hello everyone, I'm new on Stable diffusion, as I saw, we can create images with models, and upgrade them with Loras, is there something else that we have to input to upgrade them ?

abstract quarry Apr 21, 2025, 5:56 PM

#

loras are model finetunes, not necessarily upgrades. But the sd ecosystem is huge, so yes, there is a lot of other stuff

fervent thunder Apr 21, 2025, 5:59 PM

#

some loras are

#

downgrade

#

thomas

brittle kraken Apr 21, 2025, 6:02 PM

#

abstract quarry loras are model finetunes, not necessarily upgrades. But the sd ecosystem is hug...

You if there are youtube videos where I can see what I have to implement ?

atomic mortar Apr 21, 2025, 6:28 PM

#

brittle kraken Hello everyone, I'm new on Stable diffusion, as I saw, we can create images with...

i dont understand your question fully

#

your asking if theres more stuff you can use then loras?

#

if so, theres controlnet you could use and embeddings?

brittle kraken Apr 21, 2025, 6:54 PM

#

Fine Ty, I will look if I can get some info on youtube

solemn harness Apr 21, 2025, 10:38 PM

#

Hi, I'm relatively new to this AI stuff, and I have a question.

#

I'm using qDiffusion. I tried out some negative embeddings, and I got this error message, and I don't know how to fix it. Any ideas? Error while Encoding.
stack expects each tensor to be equal size, but got [1280] at entry 0 and [768] at entry 18 (clip.py:71)

desert dagger Apr 22, 2025, 12:16 AM

#

solemn harness I'm using qDiffusion. I tried out some negative embeddings, and I got this error...

are you trying to use images or just text?

solemn harness Apr 22, 2025, 12:16 AM

#

Text to image

desert dagger Apr 22, 2025, 12:17 AM

#

solemn harness Text to image

read through this https://stackoverflow.com/questions/71011333/runtimeerror-stack-expects-each-tensor-to-be-equal-size-but-got-7-768-at-en

solemn harness Apr 22, 2025, 12:19 AM

#

I managed to get chatgpt to fix the code to automatically resize it. Seems to work just fine now.

upper plinth Apr 22, 2025, 5:59 AM

#

bruh sora are the biggest posse of wussies I have ever witnessed

#

cant have even a milimeter of cleavage on ur gens before they get tagged as violating policies

#

thank the ultra-feminists for this bastardization

agile tusk Apr 22, 2025, 6:24 AM

#

** If anyone is using Nvidia driver 576.02, there is a bug that can cause it to ignore GPU temperatures and therefore not control the cooling correctly. I found that it can be fixed by reinstalling it using the custom installation method, and checking the box for "Clean installation". Check that your GPU temperature changes and isn't fixed at the temperature it was at during start-up.

ebon locust Apr 22, 2025, 8:41 AM

#

Hello, nice to meet you

fervent thunder Apr 22, 2025, 8:55 AM

#

solemn harness I'm using qDiffusion. I tried out some negative embeddings, and I got this error...

qdiffusion is this? https://github.com/arenasys/qDiffusion?tab=readme-ov-file

#

seems interesting

wanton harness Apr 22, 2025, 9:21 AM

#

Hello! Looking forward to explore!

hasty nebula Apr 22, 2025, 10:51 AM

#

from transformers import BertTokenizer, BertModel
import torch

tokenizer = BertTokenizer.from_pretrained('bert-base-uncased')
model = BertModel.from_pretrained('bert-base-uncased')

inputs = tokenizer(sent, return_tensors="pt", return_attention_mask=True, return_token_type_ids=True)
outputs = model(**inputs)
embeddings = outputs.last_hidden_state[0] # shape: [seq_len, hidden_dim]

Get mapping from subwords to original words

word_ids = inputs.word_ids()

Accumulate embeddings per word

word_embeddings = []
current_word_id = None
current_word_embeddings = []

for idx, word_id in enumerate(word_ids):
if word_id is None:
continue
if word_id != current_word_id:
if current_word_embeddings:
word_embeddings.append(torch.mean(torch.stack(current_word_embeddings), dim=0))
current_word_embeddings = [embeddings[idx]]
current_word_id = word_id
else:
current_word_embeddings.append(embeddings[idx])

Append the last word

if current_word_embeddings:
word_embeddings.append(torch.mean(torch.stack(current_word_embeddings), dim=0))

Convert to tensor

sent_embedding = torch.stack(word_embeddings)

agile tusk Apr 22, 2025, 11:09 AM

#

agile tusk ``` ** If anyone is using Nvidia driver 576.02, there is a bug that can cause it...

Fixed in this: https://nvidia.custhelp.com/app/answers/detail/a_id/5650/~/geforce-hotfix-display-driver-version-576.15

atomic mortar Apr 22, 2025, 11:15 AM

#

Can confirm crashes have stopped too

solemn harness Apr 22, 2025, 11:53 AM

#

fervent thunder qdiffusion is this? https://github.com/arenasys/qDiffusion?tab=readme-ov-file

Yeah

fervent thunder Apr 22, 2025, 12:01 PM

#

I use qt framework with py sometimes

ancient mauve Apr 22, 2025, 12:59 PM

#

does anyone here use forgeUI

atomic mortar Apr 22, 2025, 1:30 PM

#

ancient mauve does anyone here use forgeUI

Yes a lot of people but i suspect your question is actually different

ancient mauve Apr 22, 2025, 1:30 PM

#

atomic mortar Yes a lot of people but i suspect your question is actually different

I just want to make this work

#

https://huggingface.co/black-forest-labs/FLUX.1-Fill-dev

#

or this

#

https://huggingface.co/diffusers/stable-diffusion-xl-1.0-inpainting-0.1

#

but I keep getting ```AssertionError: You do not have CLIP state dict!

atomic mortar Apr 22, 2025, 1:32 PM

#

Like in the cloud?

vale viper Apr 22, 2025, 1:38 PM

#

Hi

ancient mauve Apr 22, 2025, 1:49 PM

#

atomic mortar Like in the cloud?

local

#

I have forgeUI locally but i cant make any of those work

#

maybe im not settig the folders right

sage reef Apr 22, 2025, 3:55 PM

#

@woven panther LOL.. Phantom Wan ? this really doesn't stop does it... haha

knotty rain Apr 22, 2025, 3:59 PM

#

Unsure if this is the best place to ask.
I have a 3080 with 10Gb. What would be the best option for me to train a LoRa? Also open to using runpod or similar cloud options.
I've heard people use flux trainer. I'm not set on a model yet, but between flux and sdxl

stiff hinge Apr 22, 2025, 4:59 PM

#

I’ve lately been thinking a lot about how AI is affecting the graphic design industry, so I made a quick dive into the topic with this new video. 🤔🎨
Would love to hear your thoughts — I’m open to any feedback! 🙌
Check it out here:
https://youtu.be/uLwnGXXPrfc?si=8tzI6EZaaGERGehq

mellow meteor Apr 22, 2025, 6:33 PM

#

prisma owl Apr 22, 2025, 6:50 PM

#

Hello people!

#

Today im here to introduce a important question to the people

#

I want to upscale a picture of Shannon Sharpe

#

And i want to know how, and what the best method is for the high quality pictures. Thank you very much

oblique elk Apr 22, 2025, 7:11 PM

#

prisma owl And i want to know how, and what the best method is for the high quality picture...

I am not an expert in upscaling Shannon Sharpe images, but if a more general approach is fine for you i would suggest you use an upscale tool with models for photographs / real images. As a model to start with i would suggest REAL-ESRGAN. Available in different free tools but the easiest would be upscayl or freescaler.

prisma owl Apr 22, 2025, 7:34 PM

#

oblique elk I am not an expert in upscaling Shannon Sharpe images, but if a more general app...

Upscayl is free?

#

Yes, i will soon be the only expert in upscaling Shannon Sharpe images

#

It will go in the history book

oblique elk Apr 22, 2025, 7:36 PM

#

prisma owl Yes, i will soon be the only expert in upscaling Shannon Sharpe images

Yes it is free and pretty sure the Shannon Sharpe Image upscaling Expert market is not very competitive 🙂

swift roost Apr 22, 2025, 8:12 PM

#

Would a 4GB GTX 1630 be useful for AI?

#

My guess is probably not but I'm trying to look for anything I can use

iron pendant Apr 22, 2025, 8:21 PM

#

technically yes but you will be very very limited

atomic mortar Apr 22, 2025, 8:33 PM

#

I think you could run stable diffusion 1.5? I personally havent tried it since it was hella unoptimized but nowadays you maybe could

steel prawn Apr 22, 2025, 8:35 PM

#

swift roost Would a 4GB GTX 1630 be useful for AI?

itll be hella slow for larger images. like anything over 512x512. And itll probably still offload some work to your ram since you have 4GB of vram.

iron pendant Apr 22, 2025, 8:35 PM

#

I personally would not recommend a 1630 at all, save some more pennies

#

maybe a 2070 Super 8GB or something

steel prawn Apr 22, 2025, 8:36 PM

#

yeah i think 8GB is pushing it but workable. Ive got 10GB with my 3080 and its still slow sometimes if im trying to upscale a lot.

swift roost Apr 22, 2025, 8:36 PM

#

It's what I already have on hand. I was able to run an LLM on a 12GB 6700 XT which I currently main.

atomic mortar Apr 22, 2025, 8:36 PM

#

My 3070ti used to do really well with xl pushing 30s per image with a few loras

atomic mortar Apr 22, 2025, 8:37 PM

#

swift roost It's what I already have on hand. I was able to run an LLM on a 12GB 6700 XT whi...

Cant you try running sdxl with zluda and tiled vae with that card

#

Probably forge

swift roost Apr 22, 2025, 8:38 PM

#

Oh nice, I will look into that

tall gorge Apr 22, 2025, 8:40 PM

#

2060 basically the minimum

#

acc runs quite a good amount of stuff but stuff like sdxl struggles a bit

oblique agate Apr 22, 2025, 8:49 PM

#

Does it really cost the Nvidia and amd that much to put 24gb vram in their gpus

swift roost Apr 22, 2025, 8:53 PM

#

oblique agate Does it really cost the Nvidia and amd that much to put 24gb vram in their gpus

Likely limited by the memory bus size

oblique agate Apr 22, 2025, 8:54 PM

#

swift roost Likely limited by the memory bus size

coz 5060ti they managed to squeeze 16gb vram in

prisma owl Apr 22, 2025, 8:55 PM

#

oblique elk Yes it is free and pretty sure the Shannon Sharpe Image upscaling Expert market ...

Added kevin hart into the mix aswell

oblique agate Apr 22, 2025, 8:55 PM

#

and 5070 ti and 5080 likely have larger memory bus size than that so they should be able to squeeze 24gb vram in

swift roost Apr 22, 2025, 8:56 PM

#

5060 Ti has a 128-bit bus, the 1630 has a 64-bit bus

#

Eight 16Gbit chips gives 16GB

oblique agate Apr 22, 2025, 9:58 PM

#

oh well amd, intel and nvidia are not giving us the best at a good price

upper plinth Apr 22, 2025, 11:38 PM

#

nah bro the gpu's come pre-scalped now

#

average price for a 5090 if you manage to catch one is about $3200 USD

#

5080 I purchased mine at $1600

#

lowest they go probably $1300 for the crappy PNY ones

prisma owl Apr 22, 2025, 11:44 PM

#

Can i run SDXL on 8gb Vram?

atomic mortar Apr 22, 2025, 11:45 PM

#

prisma owl Can i run SDXL on 8gb Vram?

Yes! i used to run it just fine only my 3070TI. if you use AMD however im not sure

prisma owl Apr 22, 2025, 11:45 PM

#

atomic mortar Yes! i used to run it just fine only my 3070TI. if you use AMD however im not su...

Nah i got Intel

atomic mortar Apr 22, 2025, 11:46 PM

#

ooh an intel arc thats a first ive seen it

prisma owl Apr 22, 2025, 11:46 PM

#

Is it hard to set up? Im new to image generations and just want to try make some images in various art styles

atomic mortar Apr 22, 2025, 11:47 PM

#

Hmmm im not sure what you consider difficult, have you used Git before?

prisma owl Apr 22, 2025, 11:48 PM

#

atomic mortar Hmmm im not sure what you consider difficult, have you used Git before?

Not really used no

atomic mortar Apr 22, 2025, 11:48 PM

#

hmm are you on windows?

prisma owl Apr 22, 2025, 11:49 PM

#

atomic mortar hmm are you on windows?

Yes

atomic mortar Apr 22, 2025, 11:49 PM

#

prisma owl Yes

did a lil research and i think SDnext supports it?

#

https://github.com/vladmandic/sdnext

#

https://vladmandic.github.io/sdnext-docs/Intel-ARC/

prisma owl Apr 22, 2025, 11:51 PM

#

atomic mortar did a lil research and i think SDnext supports it?

lmao wait

#

when u said AMD i thought u meant as in CPU

#

I forgot they made GPU's

#

So thats why I said Intel

#

I got a Nvidia card ahahahhaha

atomic mortar Apr 22, 2025, 11:52 PM

#

ohhh

#

yeah then i recommend SwarmUI or ForgeWebui

#

swarm is an easier install imo

#

but in tech-support you can have more support here

#

with forge webui

#

theres a tutorial in the #🤝｜tech-support pinned comments

desert dagger Apr 23, 2025, 12:27 AM

#

upper plinth nah bro the gpu's come pre-scalped now

and they didn't before/

prisma owl Apr 23, 2025, 12:50 AM

#

@atomic mortar Do u perhaps know why all my images are like deformed

#

like the faces etc

#

the body

atomic mortar Apr 23, 2025, 12:51 AM

#

Hmm faces are often distorted but what model are you using?

#

What resolution

#

Etc

prisma owl Apr 23, 2025, 12:51 AM

#

Just using SDXL

#

1024x1024

atomic mortar Apr 23, 2025, 12:52 AM

#

Base sdxl?

prisma owl Apr 23, 2025, 12:52 AM

#

atomic mortar Base sdxl?

Yes

atomic mortar Apr 23, 2025, 12:53 AM

#

Hmm try a model from civitai.com
Illustrious for anime

#

Sdxl for realism

prisma owl Apr 23, 2025, 12:54 AM

#

atomic mortar Hmm try a model from civitai.com Illustrious for anime

what is the difference between model and checkpoint

atomic mortar Apr 23, 2025, 12:54 AM

#

prisma owl what is the difference between model and checkpoint

Oh its the same thing

#

But I'm going to bed, 3am n all

prisma owl Apr 23, 2025, 12:55 AM

#

Hahahahahah same for me

atomic mortar Apr 23, 2025, 12:55 AM

#

If you get stuck i recommend popping into the #🤝｜tech-support channel or the SwarmUI discord if its a UI specific thing

prisma owl Apr 23, 2025, 12:55 AM

#

Preciate u for ur help bro

#

Have a good night of sleep

oblique agate Apr 23, 2025, 1:02 AM

#

Any apes here know any pharma shit https://www.gilead.com/news/news-details/2025/gilead-presents-new-hiv-treatment-and-cure-research-data-at-croi-2025-including-an-investigational-long-acting-twice-yearly-therapy-option is that the oaktree kim is referring to

upper plinth Apr 23, 2025, 2:13 AM

#

Okay I have to admit Sora's image-to-video is blowing me away

#

Im going to try animating my Taylor Swift dark magician girl and pray that it doesn't tag it as violation

#

it is INSANE how detailed it is. Almost tempts me to pay for the Pro version

high flint Apr 23, 2025, 7:03 AM

#

So, I've been away for a while from SD ai art gen, and I see a LOT of new model types, Such as Illustrious, Pony, Flux, and more. I'm mostly used to 1.5 and SDXL, what benefits do the new model types bring, and what use cases should I use them for?

quiet finch Apr 23, 2025, 8:12 AM

#

It's all about finding what feels right and comfortable to use.

abstract quarry Apr 23, 2025, 9:12 AM

#

Flux is the newest model family and has the best prompt following. It also gets anatomy right most of the times. Its weakness are a certain "plastic look" for photorealism and its lack of many style understandings (in particular for paintings). Both can be solved via custom models, though

fervent thunder Apr 23, 2025, 9:15 AM

#

Hidream erasure 😂

abstract quarry Apr 23, 2025, 9:15 AM

#

HiDream is probably just Flux finetuner on new text encoders 🙊

fervent thunder Apr 23, 2025, 9:18 AM

#

it probably is secretly Flux in a trench coat and a hat yeah

prisma owl Apr 23, 2025, 10:32 AM

#

@atomic mortar Hey

#

I was wondering how to impaint pictures

#

Impainting is changing stuff on pictures u have made right?

atomic mortar Apr 23, 2025, 10:33 AM

#

Yeah either adetailer or changing something entirely

#

In swarm right?

prisma owl Apr 23, 2025, 10:33 AM

#

Yeah in swarm

#

I read about adetailer but i couldnt download it to swarm

atomic mortar Apr 23, 2025, 10:33 AM

#

If you just want to fix the face i recommend segment:face or use the + button next to the prompt box

prisma owl Apr 23, 2025, 10:33 AM

#

unless its pre-installed already

atomic mortar Apr 23, 2025, 10:34 AM

#

Automatic segmentation is basically adetailer

#

But bigger

#

You can segment anything to "fix"

#

Always do a segment at the end of a prompt

prisma owl Apr 23, 2025, 10:34 AM

#

What is automatic segmantation in comparising with segment:face

atomic mortar Apr 23, 2025, 10:35 AM

#

So its like 1girl, brown hair, etc segment:face blue eyes

atomic mortar Apr 23, 2025, 10:35 AM

#

prisma owl What is automatic segmantation in comparising with <segment:face>

Its the same

prisma owl Apr 23, 2025, 10:35 AM

#

atomic mortar So its like 1girl, brown hair, etc <segment:face> blue eyes

So I add segment:face to my prompt?

#

When i generate or like after?

atomic mortar Apr 23, 2025, 10:36 AM

#

Both is possible

#

But give this a read

#

https://github.com/mcmonkeyprojects/SwarmUI/blob/master/docs/Features/Prompt Syntax.md

prisma owl Apr 23, 2025, 10:37 AM

#

I will, yeah will adding that segment also add faces? because rn i have a prompt where i tell it to show face with this: (face out of frame:1.1) in negative. I grabbed the prompt from Citivai so Idk how it completely works

atomic mortar Apr 23, 2025, 10:37 AM

#

Hmmmm

#

No

#

Segment will look for a face

#

And "fix" it

prisma owl Apr 23, 2025, 10:37 AM

#

Ohhh

#

Idk why its not working then unfortunately

#

I have in positive

#

"face showing"

#

in negative (face out of frame:1.1)

atomic mortar Apr 23, 2025, 10:38 AM

#

Give me a few sec and ill look

#

Making a omelette

prisma owl Apr 23, 2025, 10:39 AM

#

Hahahahaha enjoy

#

Take ur time

atomic mortar Apr 23, 2025, 10:50 AM

#

@prisma owl can you send me the prompt + image either in #🏞｜general-with-images or dms

ancient mauve Apr 23, 2025, 10:56 AM

#

Im finally using flux fill with comfyUI

#

but outpainting like 100 pixels takes me hours and hours is that normal?

#

outpainting a single image is not nearly done after 3 hours

sudden jewel Apr 23, 2025, 11:54 AM

#

high flint So, I've been away for a while from SD ai art gen, and I see a LOT of new model ...

Pony and Illustrious (IL) are SDXL so heavily tuned that they're essentially their own model now (loras that work on SDXL prob don't work on pony/IL, vice versa, etc)
the 2 are both anime focused, and I'm p sure ppl just use an IL finetune of their liking over pony now

hasty night Apr 23, 2025, 12:02 PM

#

hi. i'm iqram

ancient mauve Apr 23, 2025, 12:27 PM

#

is it normal that generating an outpainting with flux takes so much time?

abstract quarry Apr 23, 2025, 12:54 PM

#

outpainting, inpainting, txt2img, img2img they all are internally the same thing

#

so no, it should not take more time than generating an image of same size

abstract quarry Apr 23, 2025, 12:55 PM

#

ancient mauve outpainting a single image is not nearly done after 3 hours

sounds like you do computations on your cpu instead of gpu

ancient mauve Apr 23, 2025, 12:56 PM

#

abstract quarry sounds like you do computations on your cpu instead of gpu

I also think so, my stupid ass started CPU instead of gpu

#

Will test

ancient mauve Apr 23, 2025, 1:12 PM

#

abstract quarry sounds like you do computations on your cpu instead of gpu

yeah that was the problem, no wonder lol

ancient mauve Apr 23, 2025, 1:34 PM

#

outpainting now takes almost no time, but it isnt working

#

it gives me a grey extension instead of generating anything really

trim knoll Apr 23, 2025, 2:00 PM

#

IMAGINE/Bússola estilizada integrada a uma tela de TV ou antena

abstract quarry Apr 23, 2025, 2:09 PM

#

ancient mauve it gives me a grey extension instead of generating anything really

you should either

use flux-fill and 100 denoising strength.
copy the edge of the image such that it is filled and use e.g. 80% denoising strength

#

in both cases you don't need a prompt

ancient mauve Apr 23, 2025, 2:12 PM

#

abstract quarry you should either 1) use flux-fill and 100 denoising strength. 2) copy the edge...

denoising value of MAX takes the og image as a prompt completely roght?

ancient mauve Apr 23, 2025, 2:12 PM

#

abstract quarry you should either 1) use flux-fill and 100 denoising strength. 2) copy the edge...

2) copy the edge of the image such that it is filled and use e.g. 80% denoising strength```
WDYM?

abstract quarry Apr 23, 2025, 2:12 PM

#

it changes the masked region maximal

ancient mauve Apr 23, 2025, 2:13 PM

#

it says lower values will mantain the structure of the OG allowing for image to image sampling

abstract quarry Apr 23, 2025, 2:14 PM

#

in img2img as higher the denoise as more of the original image is changed

#

you want to outpaint, so the part you want to change is empty (e.g. gray). you want to 100% replace this part of the image

ancient mauve Apr 23, 2025, 2:16 PM

#

yeah, so what changes that is denoise 100 not denoise 0 then

abstract quarry Apr 23, 2025, 2:16 PM

#

100% denoise means completely replace this part of the image

ancient mauve Apr 23, 2025, 2:17 PM

#

ok cool

#

also, does it matter how many pixels do I outpaint?

#

should I stuck with 64x64 multiples or something like that then crop

#

instead of dunno, augmenting top by 81 pixels and left with 149

abstract quarry Apr 23, 2025, 2:18 PM

#

in theory multiple of 16 but I think most tools handle that internally

ancient mauve Apr 23, 2025, 2:19 PM

#

so if I want something like 60 pixels, 16*4=64 then crop the extra 4 pixels

#

rather than just augmenting 60

abstract quarry Apr 23, 2025, 2:19 PM

#

outpainting is the same as inpainting. You just extend the image size beforehand and then do inpaint on the extended edges

#

only the total image size has to be multiple of 16

ancient mauve Apr 23, 2025, 2:20 PM

#

abstract quarry only the total image size has to be multiple of 16

total size of the whole image or only the extra stuff

abstract quarry Apr 23, 2025, 2:20 PM

#

whole image

fervent thunder Apr 23, 2025, 2:21 PM

#

in machine learning its just easier to make everything multiples of 64

abstract quarry Apr 23, 2025, 2:21 PM

#

and as said, tools usually handle that internally anyways (e.g. extend to 16 and then crop)

ancient mauve Apr 23, 2025, 2:21 PM

#

so I shouldnt worry with comfyUI then

#

im using the default template for fluxfill

#

I just put what I want in the pad image node

abstract quarry Apr 23, 2025, 2:22 PM

#

I think the default templates are really bad

fervent thunder Apr 23, 2025, 2:23 PM

#

everyone got this sequence stuck in their head now lol 🫠
64, 128, 192, 256, 320, 384, 448, 512, 576, 640, 704, 768, 832, 896, 960, 1024, 1088, 1152, 1216, 1280

abstract quarry Apr 23, 2025, 2:23 PM

#

cause they don't preserve the original pixels

ancient mauve Apr 23, 2025, 2:23 PM

#

abstract quarry I think the default templates are really bad

do you have a good template?

#

I have a rectangular image and I just want to turn it into a square

#

I just want to outpaint not many pixels really, closest is 128 extra pixels

abstract quarry Apr 23, 2025, 2:29 PM

#

you want to copy the changed part of your image into the original part.
But you can also keep the current template and check how the quality is first

ancient mauve Apr 23, 2025, 2:30 PM

#

ah ok padding doesnt let you choose any outpainting anyways

#

I can choose 64 or 72 but no in between

ancient mauve Apr 23, 2025, 2:31 PM

#

abstract quarry you want to copy the changed part of your image into the original part. But you ...

I dont get it, what do you mean by copying the changed part

#

also what is feathering exactly and what could it be a good amount, here #🏞｜general-with-images message

abstract quarry Apr 23, 2025, 2:33 PM

#

you encode your input image with the vae, then change the edge of the image, then decode it back through the vae. The vae is a compressor. Think of it like you convert a png image into jpg and then back to png. It will lose quality

ancient mauve Apr 23, 2025, 2:33 PM

#

abstract quarry you encode your input image with the vae, then change the edge of the image, the...

ah so I cut paste the 72 generated pixels and stich them to my orignal image

abstract quarry Apr 23, 2025, 2:33 PM

#

it's not so severe with Flux as flux vae is using less compression than sd 1 and xl

abstract quarry Apr 23, 2025, 2:34 PM

#

ancient mauve ah so I cut paste the 72 generated pixels and stich them to my orignal image

yes

#

that would prevent that your original image loses quality

ancient mauve Apr 23, 2025, 2:34 PM

#

abstract quarry yes

Ive heard that using ai generated isnt good practice

#

changing my image to a square then using that square as training data isnt good

#

but I just did a generation and visually, it looks ok

abstract quarry Apr 23, 2025, 2:35 PM

#

you could also just train on a method like flux that natively supports non-square images 😅

#

or sdxl

ancient mauve Apr 23, 2025, 2:36 PM

#

abstract quarry you could also just train on a method like flux that natively supports non-squar...

seriously? fuuk

#

oh well, I wanted to try first with sd1.5, then the other ones and compare results

fervent thunder Apr 23, 2025, 2:36 PM

#

if people trained on the latest models they would have an easier time

ancient mauve Apr 23, 2025, 2:36 PM

#

fervent thunder if people trained on the latest models they would have an easier time

I mean its good learning int he end

fervent thunder Apr 23, 2025, 2:36 PM

#

its the opposite to people's intuition

#

people think training big new model would be harder but its easier

abstract quarry Apr 23, 2025, 2:37 PM

#

I mean, Flux trains very differently from SDXL, so it might be good to try both and decide

#

but usually flux just gives you best results but takes most of the time

fervent thunder Apr 23, 2025, 2:38 PM

#

with lion I saw someone get ok result in 70 steps

#

did require lion though

#

bit of a messy optim

ancient mauve Apr 23, 2025, 2:38 PM

#

abstract quarry but usually flux just gives you best results but takes most of the time

so flux is better than sdxl, and sdxl is better than sd1.5

fervent thunder Apr 23, 2025, 2:39 PM

#

bigger = better almost always

abstract quarry Apr 23, 2025, 2:39 PM

#

lion is weird 😬

ancient mauve Apr 23, 2025, 2:39 PM

#

I thought training with flux was harder but if you say otherwise

abstract quarry Apr 23, 2025, 2:39 PM

#

ancient mauve so flux is better than sdxl, and sdxl is better than sd1.5

yes

ancient mauve Apr 23, 2025, 2:39 PM

#

can you train flux in comfy UI or whats the way to go nowadays?

#

flux isnt a SD model so I suppose its different in some ways

abstract quarry Apr 23, 2025, 2:39 PM

#

flux is by the same developers as SD

ancient mauve Apr 23, 2025, 2:39 PM

#

because if I dont have to waste that much time setting up the dataset...

abstract quarry Apr 23, 2025, 2:40 PM

#

it's just not called SD due to the devs left the company

ancient mauve Apr 23, 2025, 2:40 PM

#

abstract quarry it's just not called SD due to the devs left the company

ty copyright 😦

fervent thunder Apr 23, 2025, 2:41 PM

#

I mean at this point their new company is a stronger brand so
it is swings and roundabouts 😄

abstract quarry Apr 23, 2025, 2:41 PM

#

there are so many training tools

fervent thunder Apr 23, 2025, 2:41 PM

#

the main threat to any western AI firm is the Chinese firms anyway

abstract quarry Apr 23, 2025, 2:41 PM

#

kohya, onetrainer, simpletuner, aitoolkit

fervent thunder Apr 23, 2025, 2:42 PM

#

the Chinese firms are releasing very large models with full apache/mit licenses
I actually don't know how western AI startups can compete with that

#

I am not sure they can compete, purely on the model front

#

so they will have to pivot

#

to more service-based model or something

ancient mauve Apr 23, 2025, 2:43 PM

#

abstract quarry there are so many training tools

I leave for 2 years and everything changes completely

abstract quarry Apr 23, 2025, 2:43 PM

#

they can just build on top of that models. I don't think open source is a threat at all.

ancient mauve Apr 23, 2025, 2:43 PM

#

abstract quarry kohya, onetrainer, simpletuner, aitoolkit

I remember training in kohya, not the other ones

#

if quality has increased that much im excited

abstract quarry Apr 23, 2025, 2:43 PM

#

they all usually use the same input more or less. Only configuration is different

ancient mauve Apr 23, 2025, 2:44 PM

#

btw now that I catch you connected #🏞｜general-with-images message what is feathering exactly

ancient mauve Apr 23, 2025, 2:44 PM

#

abstract quarry they can just build on top of that models. I don't think open source is a threat...

open source rocks

fervent thunder Apr 23, 2025, 2:44 PM

#

abstract quarry they can just build on top of that models. I don't think open source is a threat...

cos why would people pay the middleman

#

is the issue

#

I stay out of AI investing cos of this sort of reason
I can't see where the moats are

abstract quarry Apr 23, 2025, 2:45 PM

#

fervent thunder cos why would people pay the middleman

they do all the time.

ancient mauve Apr 23, 2025, 2:49 PM

#

@abstract quarry do you have any good guides for training the flux model?

fervent thunder Apr 23, 2025, 2:50 PM

#

I just think its a way smaller market

#

than for example 2-ish years or so ago

#

when firms like midjourney had monopolies

abstract quarry Apr 23, 2025, 2:51 PM

#

I think most tools have guides or default settings

#

I remember Simpletuner and Aitoolkit have default settings for Flux.

ancient mauve Apr 23, 2025, 2:51 PM

#

abstract quarry I think most tools have guides or default settings

its just that I dont know who said it but coimfyUi isnt for training or something

abstract quarry Apr 23, 2025, 2:52 PM

#

but Simpletuner might be difficult on Wimdows

fervent thunder Apr 23, 2025, 2:52 PM

#

I feel like I'd love to use simpletuner but the install is not made easy

#

compared to others where its a container or an API endpoint

abstract quarry Apr 23, 2025, 2:53 PM

#

fervent thunder I feel like I'd love to use simpletuner but the install is not made easy

it's just a normal python package with poetry 🤷‍♂️

fervent thunder Apr 23, 2025, 2:54 PM

#

maybe its skill issue on my part

#

I only skimmed the docs but they looked quite manual

#

I mostly look either for cloud endpoints or containers I can quickly make cloud endpoint

abstract quarry Apr 23, 2025, 2:57 PM

#

I tried kohya, simpletuner and Aitoolkit. I found them all quite similar

fervent thunder Apr 23, 2025, 3:01 PM

#

is mostly that for some I found containers

#

funnily enough there is a cog for simpletuner but its an old version

haughty smelt Apr 23, 2025, 3:02 PM

#

Hello air fryer people

fervent thunder Apr 23, 2025, 3:03 PM

#

hello

fervent thunder Apr 23, 2025, 3:05 PM

#

abstract quarry I tried kohya, simpletuner and Aitoolkit. I found them all quite similar

I should just be less lazy and make container, automation script and endpoint for each of these myself

ancient mauve Apr 23, 2025, 3:33 PM

#

btw how are prompts with flux, is it betetr to have words separated by commas or a long continious description?

atomic mortar Apr 23, 2025, 3:34 PM

#

Long

fervent thunder Apr 23, 2025, 3:36 PM

#

long ye

fervent thunder Apr 23, 2025, 4:29 PM

#

hey i need someone who is really good on image generation

fervent thunder Apr 23, 2025, 4:32 PM

#

fervent thunder hey i need someone who is really good on image generation

it is better to just ask your question directly 🙂

atomic mortar Apr 23, 2025, 4:36 PM

#

if its a question someone can answer they will but fishing to get the real question out is what i do at my job enough already lol

pine path Apr 23, 2025, 5:41 PM

#

new papers that seem interesting
Boosting Generative Image Modeling via Joint
Image-Feature Synthesis
https://arxiv.org/abs/2504.16064v1
REPA-E: Unlocking VAE for End-to-End Tuning with Latent Diffusion Transformers
https://arxiv.org/abs/2504.10483

ancient mauve Apr 23, 2025, 5:50 PM

#

pine path new papers that seem interesting Boosting Generative Image Modeling via Joint Im...

yet integrating representation learning with generative modeling remains a challenge can anyone smarter than me explain this?

#

does it mean training?

floral umbra Apr 23, 2025, 6:03 PM

#

Is it possible to "convert" a checkpoint to to a lower billion parameter? As gguf is for quantized models, but iirc gguf needs it's own nodes, and would wanna use a 7/8b parameter wan.safetensors for instance

pine path Apr 23, 2025, 6:36 PM

#

ancient mauve ```yet integrating representation learning with generative modeling remains a ch...

https://arxiv.org/abs/2410.06940

#

both build off of work mentioned in this paper

haughty smelt Apr 23, 2025, 9:27 PM

#

Why does everyone here use perfect grammar as if they are at work chatting on teams or smth?

#

Like... I do not care if you talk normally.

steel prawn Apr 23, 2025, 9:35 PM

#

😂

primal current Apr 24, 2025, 2:10 AM

#

I need to cover my nipples with AI InPaint, but no workflow works for me. Does anyone have a solution? I pay for the service.

upper plinth Apr 24, 2025, 3:28 AM

#

primal current I need to cover my nipples with AI InPaint, but no workflow works for me. Does a...

photoshop?

prisma owl Apr 24, 2025, 3:28 AM

#

I have a question

#

I was reading on CivitAI and they said they are updating their policy

serene mountain Apr 24, 2025, 3:29 AM

#

Yeah. Fun stuff.

prisma owl Apr 24, 2025, 3:30 AM

#

Is it for pictures or does it also mean they cant upload models/loras on there surrounding those things

upper plinth Apr 24, 2025, 4:40 AM

#

policies policies policies censorship censorship censorship

#

Didnt this orange mf say he was going to end these invasive restrictions?

abstract quarry Apr 24, 2025, 7:47 AM

#

lol, reading this you could think that civitai bans nsfw content, but no, they only ban very extreme and specific stuff

#

and now people cry cause they can no longer generate porn with women having period

upper plinth Apr 24, 2025, 7:48 AM

#

lmfaoooo

#

yeah there is a line for sure, but the overcorrections are insane

#

nothing like waiting 15+ minutes for a single video gen in Sora just to be told that it can't be shown because of a mysterious "policy violation"

abstract quarry Apr 24, 2025, 7:54 AM

#

ChatGPT might be extreme in its censorship

#

but I'm also annoyed that the only big image gen website is a porn site basically

upper plinth Apr 24, 2025, 8:04 AM

#

bruh. welcome to reality. Goonality should I say

#

AI is 98% gooner stuff, 2% productive stuff

#

I wouldn't have it any other way.

#

I'm getting error messages on Sora that they've hit capacity since everyone and their grandma is trying out the new models and based on the gens I've seen I bet a large chunk of that overcapacity comes from gooners like me trying to bypass their ridiculous censorship through trial and error

abstract quarry Apr 24, 2025, 8:14 AM

#

any art platform is full of nsfw. Looking through devian arts means looking through naked bodies.
The difference is: devian arts is aesthetic. Its arts.
Civitai is just pervert porn.

#

like when you go on a porn site you want to stay in certain categories. It's so annoying seeing an add popup of, say, granny porn 😬 similarity, I'm sure straight people don't want to see gay fetish porn.
But on civitai all these weird fetish stuff is just thrown onto you. You could open a model "world morph into glass" and half of the showcase images are masturbating women with unnatural large breasts. It's just disgusting and it's difficult to get rid of it. You have to disable all mature content but even then you still see a lot of fetish stuff

upper plinth Apr 24, 2025, 8:33 AM

#

That's a lie. CivitAI doesn't show anything NSFW unless you turn that on in the settings

#

Now if your concern is that weird porn is getting mixed in with traditional porn, well, welcome to the golden age of depravity circa 2025. As the world decays, people get lonelier > people get into weirder and more depraved shit which is then normalized. Idiocracy/cyberpunk dystopia in full motion.

abstract quarry Apr 24, 2025, 8:48 AM

#

even in sfw mode you get a lot of weird fetish stuff that is just not "nude enough" to be counted as nsfw

steel prawn Apr 24, 2025, 8:48 AM

#

Sad but true. I think we can all agree that the changes to remove minors in images as well as implications of SA or forced sexual situations is probably for the best, and probably the removal of celebrities. Art is meant to be subjective, and if you see something you dont like you shouldnt click on it, and your free to have your opinion of it. But that does not mean the artist is in the wrong for creating it. People think the works of certain surrealist and horror artists are over the top or distatesful because the imagery doesnt agree with them, but that doesnt mean it isnt art.

We live in a day and age where increasingly sex and porn are being normalized, even the weirder fetishes, and that of course means its gonna bleed into the artistic side of things. Case in point, danbooru is full of it and a lot of that isnt just AI art. You just gotta accept that that is the space now, and take the tools you need to make what you want and go about your business. Besides, a lot of this is just Civit covering their asses before a lawsuit happens.

upper plinth Apr 24, 2025, 9:18 AM

#

Oh all big companies are covering their asses, but the bias is absolutely asinine

#

Go to Sora's main page and you're going to find loads of Donald Trump or Putin turning into poop parodies

#

yet the moment I try to even remotely animate my Taylor Swift dark magician girl - policy violation. Of course.

paper gazelle Apr 24, 2025, 1:06 PM

#

hello everyone!

#

hows everyone doing today?

median jewel Apr 24, 2025, 10:36 PM

#

does someone know if its possibel to use fooocus codes in visual studio, trying to inpaint, lora, prompt etc feature but getting access to it through code, is that possible or do i need to use the website interface they have for that? Cause i have tried using simple SDXL code with lora and masks but it doesnt get nearly the same good result as fooocus does

night gladeBOT Apr 24, 2025, 10:39 PM

#

fervent thunder Apr 24, 2025, 11:34 PM

#

median jewel does someone know if its possibel to use fooocus codes in visual studio, trying ...

diffusers is most common for command line

#

you can use comfyscript for comfyui

#

otherwise pure pytorch etc

#

forge api maybe

#

comfyscript with custom comfyui nodes or pure pytorch have much nicer syntax and modularity than diffusers

#

but diffusers is more stable

#

so it depends

#

I am part switching to rust but I don't "recommend"

round dove Apr 25, 2025, 12:35 AM

#

Hey, trader.
If you are also facing issues from your challenge account passing or making profits on your live account on any of your chosen trading platforms on this prop firm. I'd like to tell you what my research brought for me that makes me to always take enough profits per day on my live account.
Msg me if you are interested

nova glade Apr 25, 2025, 1:01 AM

#

Hey, I wanted to ask if there's any rules for making a new post on r/StableDiffusion, I don't use reddit much and maybe my account does not have enough karma. I can't see my new post appearing, maybe it is pending moderation.

I had made a tool to easily archive civitai content so was hoping to share that with the community, https://github.com/dreamfast/go-civitai-downloader

nova glade Apr 25, 2025, 1:32 AM

#

ahh https://old.reddit.com/r/StableDiffusion/comments/1k784qf/gocivitaidownloader_easily_download_anything_from/ I see it was removed, no problem, I'll try one more time with the github as a link post, if it doesn't go through no problem

fervent thunder Apr 25, 2025, 1:46 AM

#

wow thanks so much for this

nova glade Apr 25, 2025, 2:18 AM

#

sad I can't get all the models, i only have so much space, but i got all the loras i wanted for video

uncut venture Apr 25, 2025, 3:26 AM

#

Whats the best setup for amd users? Just running comfyui straight up or is there any good programs that package other useful things along with it?

nova glade Apr 25, 2025, 3:35 AM

#

cool i just added torrent stuff so u can generate torrent files based on what you downloaded, sad i can't share it with r/StableDiffusion 😦

nova glade Apr 25, 2025, 3:35 AM

#

uncut venture Whats the best setup for amd users? Just running comfyui straight up or is there...

i heard it's tough, there has to be plenty of tutorials out there though, it is possible to do, straight up might work but check first

oblique agate Apr 25, 2025, 5:22 AM

#

I am trying to generate something like this https://www.youtube.com/shorts/CtbEvLPM23o I can't quite find the base image for something like that. Any tips

upper plinth Apr 25, 2025, 10:31 AM

#

oblique agate I am trying to generate something like this https://www.youtube.com/shorts/CtbEv...

that looks like midjourney

#

probably midjourney and if they dont have their own animation AI then use sora

rose drift Apr 25, 2025, 2:48 PM

#

Hi, can anybody help me?

oblique agate Apr 25, 2025, 2:51 PM

#

upper plinth that looks like midjourney

atm I think I need to studymax on img generation so I have some gucci base image for wan 2.1

left hatch Apr 25, 2025, 2:56 PM

#

Hi @robust otter

median jewel Apr 25, 2025, 3:13 PM

#

fervent thunder you can use comfyscript for comfyui

Just so I understand, if I wanted to use lora and image prompt can I run a simple python program that uses fooocus app without me going into either comfyui and manually adding photos I can make a code that runs and uses the api instead? Will this cost money even though I run it locally?

robust otter Apr 25, 2025, 3:44 PM

#

left hatch Hi <@155072514689728513>

what do you want, and why are you messaging me in a server which I dont ever use

nimble light Apr 25, 2025, 3:55 PM

#

How do I use Stable Diffusion or other AI General Tools like Flux, like Photoshop's Generation tool. Kitra is a software that allows me to do that, like Photoshop, masking out an area, and for example, masking out a lake,. and telling it to add boats. Pinokio just makes images from scratch, but I want to modify certain parts of images locally using GPU

#

Also is AMD RX 570 8 GB Enough

#

So I openned a ticket, now what?

still glacier Apr 25, 2025, 3:59 PM

#

nimble light So I openned a ticket, now what?

a ticket ? did someone reach out in dm ?

nimble light Apr 25, 2025, 3:59 PM

#

YEAH

still glacier Apr 25, 2025, 4:00 PM

#

who....

nimble light Apr 25, 2025, 4:00 PM

#

It's MOD SAM

still glacier Apr 25, 2025, 4:00 PM

#

99.9999% chances of it yes.

nimble light Apr 25, 2025, 4:00 PM

#

Yes what? Also can't you just tell me

fervent thunder Apr 25, 2025, 4:05 PM

#

median jewel Just so I understand, if I wanted to use lora and image prompt can I run a simpl...

not sure if fooocus had an api

#

you still pay for electricity locally

ancient mauve Apr 25, 2025, 4:06 PM

#

abstract quarry you could also just train on a method like flux that natively supports non-squar...

for what im reading it seems flux needs a dataset of 512x512 images

#

at least Flux.1 Dev

nimble light Apr 25, 2025, 4:11 PM

#

fervent thunder not sure if fooocus had an api

Your's trying to say?

ancient mauve Apr 25, 2025, 4:21 PM

#

fo you guys reccomend a 1024x1024 dataset for flux1Dev?

#

I want to set up a new dataset but I want confirmation if possibel

#

I want to do good quality but im not used to flux

fervent thunder Apr 25, 2025, 4:36 PM

#

nimble light Your's trying to say?

I just don't know it might

#

I used it a bit over a year ago

#

can't remember

abstract quarry Apr 25, 2025, 4:42 PM

#

ancient mauve for what im reading it seems flux needs a dataset of 512x512 images

you can use any resolution you want for flux

ancient mauve Apr 25, 2025, 4:42 PM

#

abstract quarry you can use any resolution you want for flux

but the whoole dataset needs to be the same size right?

abstract quarry Apr 25, 2025, 4:42 PM

#

no

ancient mauve Apr 25, 2025, 4:43 PM

#

abstract quarry no

seriously? you can have literally any size though the dataset?

abstract quarry Apr 25, 2025, 4:43 PM

#

yes

#

you have the usual "multiple of 16" rule, but the training tools will just crop your images to a multiple of 16

fervent thunder Apr 25, 2025, 4:49 PM

#

hmm

ancient mauve Apr 25, 2025, 4:49 PM

#

abstract quarry you have the usual "multiple of 16" rule, but the training tools will just crop ...

I preffer to set up the dataset first

#

I want to control what goes in in the end

fervent thunder Apr 25, 2025, 4:50 PM

#

if you do a big fine tune without the resolutions spread nicely in the training data
flux will lose its ability to do multi resolution

#

but for small lora it is okay, that is probably what they mean

ancient mauve Apr 25, 2025, 4:50 PM

#

fervent thunder if you do a big fine tune without the resolutions spread nicely in the training ...

what do I do then

#

how many images are we talking here lora vs a full finetune

#

wanna try both

abstract quarry Apr 25, 2025, 4:58 PM

#

you cannot do full finetune flux with 24gb vram

ancient mauve Apr 25, 2025, 4:59 PM

#

how much do I need for a finetune and how much for a lora with flux

#

both nº of images and vram

fervent thunder Apr 25, 2025, 5:00 PM

#

if you blockswap you can do it

#

you could do 1 img if you want

ancient mauve Apr 25, 2025, 5:03 PM

#

fervent thunder if you blockswap you can do it

never heard of blockswap

fervent thunder Apr 25, 2025, 5:08 PM

#

its where you move blocks back and forth

#

from motherboard DRAM to graphics card VRAM

obsidian plume Apr 25, 2025, 6:27 PM

#

Have you ever found a way to convert a Disco diffusion CLIP model into a diffuser or .ckpt file for use in something like Deforum?

fervent thunder Apr 25, 2025, 6:34 PM

#

would be easier to make a fresh code base than go back to the old stuff rly

obsidian plume Apr 25, 2025, 6:53 PM

#

fervent thunder would be easier to make a fresh code base than go back to the old stuff rly

do you mean for disco?

iron swallow Apr 25, 2025, 6:53 PM

#

https://www.youtube.com/shorts/MLqGVIYwSAY made this with AI

fervent thunder Apr 25, 2025, 6:57 PM

#

obsidian plume do you mean for disco?

ye disco is super old its probably a pickle if you do find it
but I meant deforum also

obsidian plume Apr 25, 2025, 6:58 PM

#

there are colab codes that function still, but would be so nice to have one to save locally and to not have to use those extremly heavy servers

fervent thunder Apr 25, 2025, 7:09 PM

#

ye it would be cool

granite river Apr 25, 2025, 7:11 PM

#

Hello together, I´m new here and excited what we can create together 🙂

merry ginkgo Apr 26, 2025, 5:53 AM

#

anyone know of a civitai alternative

#

since the site is dying

sage reef Apr 26, 2025, 6:43 AM

#

@woven panther just a question about your Phantom Wan implementation.
I noticed that the way it embeds the subject images, it seems to
embed them the same size and that size is then used for the
video generation size. but is there a way to decouple this?

like let's say I want to generate a 768 x 512 video, but..
the subject images can be either same or different sizes
from that, like 480x480 for image 1 and 600 x 400 for image 2.

also, is the 3rd and 4th embedding working? cause it doesnt seem
to be copying them correctly, maybe because 1.3B model is too small
for more than 2 subjects?

woven panther Apr 26, 2025, 7:33 AM

#

sage reef <@228118453062467585> just a question about your Phantom Wan implementation. I ...

Has to be same size since it's used in the same latents, but you should be able to resize your image and composite on a white canvas like with VACE

heady pivot Apr 26, 2025, 7:44 AM

#

Hi

fallen axle Apr 26, 2025, 8:51 AM

#

merry ginkgo since the site is dying

What, because they outlawed pee and diapers?

nova glade Apr 26, 2025, 9:29 AM

#

hey sd pals, i did some big updates for this https://github.com/dreamfast/go-civitai-downloader so now it's very easy to download many models or loras, also images from civit ai. After the models or loras are done downloading you can generate a torrent file and magnet link too. I am hoping this will help preserve some of the content that is doomed for oblivion.

steel prawn Apr 26, 2025, 10:02 AM

#

fallen axle What, because they outlawed pee and diapers?

They outlawed a bunch of things that COULD imply forced or nonconsensual situations as well, maybe its that that has people grabbing their torches and pitchforks. I dunno, kinda sus to me.

placid hatch Apr 26, 2025, 10:18 AM

#

anyone have an issue with models where the face of a character in a generated image will suddenly be in a completely different style than the rest of the image?

ancient mauve Apr 26, 2025, 10:19 AM

#

fervent thunder but for small lora it is okay, that is probably what they mean

It's ok, maybe I can start with a lora and see how it goes, but I want to have all my dataset with the same size

#

I don't want flux choosing what it cuts

upper plinth Apr 26, 2025, 10:27 AM

#

Bruh what was the point of electing Trump if the internet is going to keep snowflakizing?

steel prawn Apr 26, 2025, 10:32 AM

#

Like i said before, seems like Civitai is covering their asses, and in the grand scheme of things, its probably better for the AI Art movement/scene/whatever you wanna call it if its not being viewed as a place to create pornographic material that even porn studios wouldnt film (hence the removal of certain things that could be, at least in a court of law, skewed to implicate such things as SA or pedophilia). But, their a business, and all businesses shake and move when their investors say so, so its no surprise.

placid hatch Apr 26, 2025, 10:44 AM

#

Unfortunately it is still part of a broader bipartisan assault on adult art and adult artists that has been happening over the last decade.

upper plinth Apr 26, 2025, 10:58 AM

#

bro, what is the point of AI if it's not to create erotica?

#

AI is and always has been about goonerism, in fact, sex robots is arguably the end goal of all this. Who the hell wants to deal with real women with all their flaws when we can have our own ideal bot partners?

#

the fact that people keep trying to pretend that AI is completely exclusive from porn is ridiculous. Just admit that the two go hand in hand, there's nothing wrong about that despite what the loud blue-haired karens on twitter are shouting

fervent thunder Apr 26, 2025, 12:33 PM

#

ancient mauve I don't want flux choosing what it cuts

I recommend writing a training loop yourself rather than using the pre-made ones

#

at least then you know what it is doing

merry ginkgo Apr 26, 2025, 2:27 PM

#

fallen axle What, because they outlawed pee and diapers?

The tos change is because of Visa and MasterCard, based on other sites they are gonna keep censoring more and more until its R+

merry ginkgo Apr 26, 2025, 2:31 PM

#

nova glade hey sd pals, i did some big updates for this https://github.com/dreamfast/go-civ...

Thanks gonna grab a few TB with it

ancient mauve Apr 26, 2025, 2:34 PM

#

fervent thunder I recommend writing a training loop yourself rather than using the pre-made ones

I dont know how to do that, I used kohya in the past

fervent thunder Apr 26, 2025, 2:40 PM

#

its actually harder to use kohya in some ways cos documentation is not thorough

#

I recommend simple tuner if you are gonna use a pre-made one they have a thing called lokr

#

lokr is separate its part of a project called lycoris, but it is integrated well into simple tuner

atomic mortar Apr 26, 2025, 2:57 PM

#

Scam, dont click

ancient mauve Apr 26, 2025, 3:03 PM

#

Ok so for what I'm reading, one preprocessing flux does is bucketing

#

You select a size and it makes groups on that size with x64 muktiples

#

So my database can have images of 256x256 if I select that, but it can also have for example a 320x320 image in the dataset

#

Or 384x384, etc

#

Like here #🏞｜general-with-images message

#

using 1024x1024 as reference, it seems that as long it has the same size as any of those buckets or same aspect ratio, is all ok

#

if anyone can confirm pls

fervent thunder Apr 26, 2025, 3:32 PM

#

where did you read this?

#

could you quote it pls?

ancient mauve Apr 26, 2025, 3:38 PM

#

https://civitai.com/articles/7777/detailed-flux-training-guide-dataset-preparation

ancient mauve Apr 26, 2025, 3:39 PM

#

ancient mauve https://civitai.com/articles/7777/detailed-flux-training-guide-dataset-preparati...

@fervent thunder

#

I mean it gives youa lot of options

ancient mauve Apr 26, 2025, 4:01 PM

#

what do you think

fervent thunder Apr 26, 2025, 4:12 PM

#

hmm need info from a proper source like a paper
or quotes from the company rly

silk latch Apr 26, 2025, 4:51 PM

#

How to get invoice from stability?

atomic mortar Apr 26, 2025, 5:41 PM

#

silk latch How to get invoice from stability?

should be in your e-mail or you could email their support directly

silk latch Apr 26, 2025, 6:00 PM

#

i tried but they didn't answer

upbeat fjord Apr 26, 2025, 6:56 PM

#

hello

abstract quarry Apr 26, 2025, 7:20 PM

#

ancient mauve Ok so for what I'm reading, one preprocessing flux does is bucketing

yes. But that is only relevant if you use batch size above 1

#

with batch size = 1 you don't need buckets (or every unique resolution can be just its own bucket)

abstract quarry Apr 26, 2025, 7:22 PM

#

ancient mauve I dont know how to do that, I used kohya in the past

please don't write your training loop yourself 😂
it's not necessarily simpler than using kohya. There are a lot of stuff you have to implement to make training efficient. Implementing stuff like gradient checkpointing is not done in a single line of code.

ancient mauve Apr 26, 2025, 7:24 PM

#

abstract quarry please don't write your training loop yourself 😂 it's not necessarily simpler t...

yeah its just that I have some 3 other guys also telling me stuff and I get confused

#

by the moment im preparing my dataset with that chart I linked

#

should be enough for something like 10-50 images

abstract quarry Apr 26, 2025, 7:25 PM

#

ancient mauve https://civitai.com/articles/7777/detailed-flux-training-guide-dataset-preparati...

I'm looking at it

#

"Avoid Ambiguous Images and Distracting Elements: Avoid having too many images that mix styles, characters, or concepts. For example, if you are training a character, don’t use an image that shows that character in a group of other characters." <-- this is bullshit

ancient mauve Apr 26, 2025, 7:26 PM

#

abstract quarry "Avoid Ambiguous Images and Distracting Elements: Avoid having too many images t...

yeah as long as it is tagged it should work right?

abstract quarry Apr 26, 2025, 7:26 PM

#

it's the opposite: if you train character loras, you definitely should add images with multiple characters. It's sufficient to put just multi-panel images with different characters in there. Without that, your Lora will transform every face into your character

ancient mauve Apr 26, 2025, 7:26 PM

#

abstract quarry it's the opposite: if you train character loras, you definitely should add image...

the model has to differentiate

abstract quarry Apr 26, 2025, 7:26 PM

#

ancient mauve yeah as long as it is tagged it should work right?

yes. As long as your caption is correct you will improve the model

ancient mauve Apr 26, 2025, 7:27 PM

#

so if you have many images of one person you want him in diifferent scenarios

ancient mauve Apr 26, 2025, 7:27 PM

#

abstract quarry yes. As long as your caption is correct you will improve the model

good to have some confirmation

abstract quarry Apr 26, 2025, 7:27 PM

#

yes, but also add him with other characters

#

the model has to learn that "NAME" refers to this specific character, not to other characters

ancient mauve Apr 26, 2025, 7:30 PM

#

abstract quarry the model has to learn that "NAME" refers to this specific character, not to oth...

yeah so you have to avoid common names and tags for that specific character because the model can already have that beforehand

#

or style

abstract quarry Apr 26, 2025, 7:32 PM

#

hm, dunno what you mean with that

ancient mauve Apr 26, 2025, 7:32 PM

#

abstract quarry hm, dunno what you mean with that

some common names like "john" are already learned by the AI

#

so if you want to learn a character like dunno, john wick, and you tagg it as "john" if the AI already knows other johns it gets confused

#

same with concepts and such

abstract quarry Apr 26, 2025, 7:33 PM

#

a common name is not so good, in particular if it is already loaded with a meaning

ancient mauve Apr 26, 2025, 7:33 PM

#

you get my idea

#

the thing is to not mix some tags, depending on what you want

abstract quarry Apr 26, 2025, 7:34 PM

#

like "John" is a very American/British name, so using it for a Asian guy might be not so good

ancient mauve Apr 26, 2025, 7:34 PM

#

abstract quarry like "John" is a very American/British name, so using it for a Asian guy might b...

yeah something like that

abstract quarry Apr 26, 2025, 7:34 PM

#

I would use natural names, though

#

like when I train on my own face I always use my real name (first name+ last name)

#

(funnily, my first name is Kai, which is a common German name, but the model associates it with Japanese and in the beginning often mixes in Asian elements)

#

(so I trained my first loras with the name Christian instead, which sounds more Caucasian. However, it doesn't really matter. The model also learns my real name after a while)

#

Many guides use random characters as names instead. I wouldn't do that, cause T5 understands the concept of a name and might get confused by random characters. But in the end both will work nevertheless

ancient mauve Apr 26, 2025, 7:51 PM

#

abstract quarry Many guides use random characters as names instead. I wouldn't do that, cause T5...

I mean you can always invent a less common name

#

the AI doesnt know your true name, it only cares on how you look

#

to avoid things like the asian thingy

abstract quarry Apr 26, 2025, 7:54 PM

#

yes, but if you use first name+last name you are usually fine

ancient mauve Apr 26, 2025, 7:54 PM

#

abstract quarry yes, but if you use first name+last name you are usually fine

not in my case XD

#

anyways where you able to see anything else in that civitai tutorial

#

I dont really like civitai that much but it is popular

abstract quarry Apr 26, 2025, 7:55 PM

#

I think the rest is okay

ancient mauve Apr 26, 2025, 7:55 PM

#

im just making a database of like 9x7 images

#

and squares

abstract quarry Apr 26, 2025, 7:55 PM

#

style or character training?

ancient mauve Apr 26, 2025, 7:55 PM

#

will try with a lora for a character I think

#

by the moment

#

I want to do both but mabe character is easier and needs less images

#

for what I read

abstract quarry Apr 26, 2025, 7:56 PM

#

yeah, keep it simple. You can train on hundreds of images, but you can also train on just 10 images

#

it's not always clear what's better

#

(I mean, more is better. But quality> quantity)

ancient mauve Apr 26, 2025, 7:57 PM

#

abstract quarry (I mean, more is better. But quality> quantity)

yeah I will try to get fized on this

#

how many images would you say for a character and for a style each?

abstract quarry Apr 26, 2025, 7:58 PM

#

also I would not use gradient accumulation. Takes too much time. You can use batch size if you can afford the vram. Training on batch size 1 also works, though

#

as said, more is better, but you can often train on surprisingly low number of images. The guide you posted is right with saying you should rather pick 10 highest quality images than using 50 low quality ones

ancient mauve Apr 26, 2025, 8:00 PM

#

abstract quarry as said, more is better, but you can often train on surprisingly low number of i...

I think I should have enough quality images, I just need a number for a start

#

and some "default" settings I can edit in future generations

#

I just not want to go like a headless chicken

abstract quarry Apr 26, 2025, 8:01 PM

#

dunno, I think 20 is a good number

ancient mauve Apr 26, 2025, 8:03 PM

#

abstract quarry dunno, I think 20 is a good number

do you have a workflow for onetrainer?

#

at this point im comfortable copy pasting what you use

#

you seem to know your stuff

abstract quarry Apr 26, 2025, 8:04 PM

#

no, haven't used it so far

#

feel free to paste the config

ancient mauve Apr 26, 2025, 8:05 PM

#

I managed to have my dataset in 3 ratios

#

for square, horizontal and vertical

ancient mauve Apr 26, 2025, 8:12 PM

#

abstract quarry feel free to paste the config

gonna use onetrainer, let me see how it works

#

oh fuck I have to tag my dataset first 😮‍💨

abstract quarry Apr 26, 2025, 8:14 PM

#

you have 24gb vram? You might use gemma 3 for assisting you with creating the captions

#

but for 20 images you can do it yourself

#

for more it's quite helpful to automate this. A big advantage of using AI for creating the captions is that you can use multiple captioning strategies (tag based, natural language short captions, natural language ling captions)

atomic mortar Apr 26, 2025, 8:16 PM

#

i like to use civit ai's captioning system tbh

#

upload pics, download em after tagging

ancient mauve Apr 26, 2025, 8:20 PM

#

with flux is bette rlong descriptions or single words separated by commas

abstract quarry Apr 26, 2025, 8:20 PM

#

gemma is the strongest, though. It has a really deep understanding and you can teach it any captioning style

abstract quarry Apr 26, 2025, 8:20 PM

#

ancient mauve with flux is bette rlong descriptions or single words separated by commas

to be honest,I would do both

ancient mauve Apr 26, 2025, 8:20 PM

#

abstract quarry to be honest,I would do both

same seed both and compare results I suppsoe

abstract quarry Apr 26, 2025, 8:21 PM

#

short words have the disadvantage that your trigger words lose their effect in long prompts

ancient mauve Apr 26, 2025, 8:21 PM

#

any limit for both? number of tags or size of paragraph for the other one

ancient mauve Apr 26, 2025, 8:21 PM

#

abstract quarry short words have the disadvantage that your trigger words lose their effect in l...

but if Im training for a character only I dont need a trigger word right?

abstract quarry Apr 26, 2025, 8:21 PM

#

ancient mauve same seed both and compare results I suppsoe

no. Ideally, use multiple captions per image. But most tools don't support this. In this case just randomly decide for each image if you use a short or a long caption.

ancient mauve Apr 26, 2025, 8:21 PM

#

what do you mean by trigger word exactly

abstract quarry Apr 26, 2025, 8:22 PM

#

trigger word is also the character name

ancient mauve Apr 26, 2025, 8:22 PM

#

abstract quarry trigger word is also the character name

yeah but I thought tht if you are training for a character you dont have to tag it

abstract quarry Apr 26, 2025, 8:22 PM

#

?

ancient mauve Apr 26, 2025, 8:22 PM

#

only what its extra

abstract quarry Apr 26, 2025, 8:23 PM

#

you always add a name

ancient mauve Apr 26, 2025, 8:23 PM

#

like, if there are 2 characters and you only want 1, you only describe the one you dont want

abstract quarry Apr 26, 2025, 8:23 PM

#

the idea is that you don't describe what is implicitly defined by the name

#

so if you train on, say, on Son Goku, you don't describe that he has black hair and is muscular, cause this is implicitly clear

ancient mauve Apr 26, 2025, 8:25 PM

#

abstract quarry so if you train on, say, on Son Goku, you don't describe that he has black hair ...

yeah

#

but you "have" to tag the word "son Goku"

#

to define those

abstract quarry Apr 26, 2025, 8:25 PM

#

you add "Son Goku" to the prompt, yes

ancient mauve Apr 26, 2025, 8:25 PM

#

ah ok

abstract quarry Apr 26, 2025, 8:26 PM

#

and if there are multiple characters, you write "An image with two characters. Left is Son Goku. The character on the right is a man with pink hair and a muscular body."

ancient mauve Apr 26, 2025, 8:26 PM

#

abstract quarry and if there are multiple characters, you write "An image with two characters. L...

and with simple tags?

#

(I suppose left by my POV not the image's

abstract quarry Apr 26, 2025, 8:27 PM

#

"Son Goku and another man" ?

#

Tags are just text, too. There is nothing special with them

ancient mauve Apr 26, 2025, 8:28 PM

#

no I mean, you can put tags like Son Goku and another man fighting

abstract quarry Apr 26, 2025, 8:28 PM

#

yes

ancient mauve Apr 26, 2025, 8:28 PM

#

or you can put Goku, man, fighting, muscle

abstract quarry Apr 26, 2025, 8:28 PM

#

I would definitely use the upper one

ancient mauve Apr 26, 2025, 8:29 PM

#

ok, any kind of limit for what goes into the prompt

#

how many words or how big or total descriptions should have

abstract quarry Apr 26, 2025, 8:29 PM

#

as said, I would try both: short and precise prompts as well as long and detailed prompts

#

that's also how you want to prompt in the end

ancient mauve Apr 26, 2025, 8:30 PM

#

and what was that programm that helped you tag

ancient mauve Apr 26, 2025, 8:30 PM

#

abstract quarry that's also how you want to prompt in the end

you mean both for all images or the first for all then teh second for all

abstract quarry Apr 26, 2025, 8:31 PM

#

You can use multimodal llms nowadays

ancient mauve Apr 26, 2025, 8:31 PM

#

abstract quarry You can use multimodal llms nowadays

I just need a name

abstract quarry Apr 26, 2025, 8:31 PM

#

I use gemma

ancient mauve Apr 26, 2025, 8:31 PM

#

I remember using wd14 but I think thats only for anime

abstract quarry Apr 26, 2025, 8:31 PM

#

cause you can run it locally

#

you could also use ChatGPT if you have a subscription, though

ancient mauve Apr 26, 2025, 8:32 PM

#

abstract quarry you could also use ChatGPT if you have a subscription, though

I preffer something local

#

but you mean a local llm model

abstract quarry Apr 26, 2025, 8:32 PM

#

yes

ancient mauve Apr 26, 2025, 8:32 PM

#

there must be some already made exclusively for flux

#

for tagging

abstract quarry Apr 26, 2025, 8:32 PM

#

you can download gemma 3 4bit quant and run it in your local machine

abstract quarry Apr 26, 2025, 8:33 PM

#

ancient mauve there must be some already made exclusively for flux

no. Just explain the llm what you want

ancient mauve Apr 26, 2025, 8:33 PM

#

abstract quarry no. Just explain the llm what you want

ah ok

abstract quarry Apr 26, 2025, 8:35 PM

#

"I show you an image of a character named Son Goku. Please answer with a prompt that describes this image. The prompt should be short and precise (10-30 words) and include the name Son Goku. Do not describe Son Goku's appearance, but describe what he is doing in the image. Describe also the background. Answer only with the prompt."

#

something like this

#

the cool thing on llms is that they really understand what you want. If you are not happy, you can add more information into the prompt

#

you could even tell the llm that you want prompts for Flux.

ancient mauve Apr 26, 2025, 8:57 PM

#

The prompt should be short and precise (10-30 words)

#

this is what I wanted to know more or less

ancient mauve Apr 26, 2025, 8:57 PM

#

abstract quarry you could even tell the llm that you want prompts for Flux.

yeah its just that I havent yet found a good llm

#

local llm

abstract quarry Apr 26, 2025, 8:58 PM

#

gemma 3

abstract quarry Apr 26, 2025, 8:58 PM

#

ancient mauve this is what I wanted to know more or less

that was an example 😅 as I said, instead of having one consistent style of prompting, just use different ones

#

short prompts, long prompts, tag based prompts

ancient mauve Apr 26, 2025, 9:00 PM

#

abstract quarry gemma 3

its google

#

google censors the crap of their products

abstract quarry Apr 26, 2025, 9:00 PM

#

not true

ancient mauve Apr 26, 2025, 9:10 PM

#

abstract quarry not true

they didnt cap that one?

#

uh

abstract quarry Apr 26, 2025, 9:10 PM

#

they never do

#

censorship only happens during alignment step at the end of training. In its core none of the models is censored

ancient mauve Apr 26, 2025, 10:57 PM

#

abstract quarry censorship only happens during alignment step at the end of training. In its cor...

oh thats cool to know, didnt know it was at the end

fallow veldt Apr 27, 2025, 3:48 AM

#

I'm amazed how good Sora is... it seems to get everything to ask in the prompt in the correct style with no confusion

fallow veldt Apr 27, 2025, 4:33 AM

#

As I said once, I thought it would be continuous development and optimization for generating locally but it seems that what we got is what it is

#

From SD 1.5 to SD XL was wow

echo cobalt Apr 27, 2025, 5:21 AM

#

yo guys

#

whatsup

snow cedar Apr 27, 2025, 5:40 AM

#

hey

#

I need help

quiet bison Apr 27, 2025, 6:10 AM

#

Hi

#

Hi

fervent thunder Apr 27, 2025, 6:12 AM

#

fallow veldt I'm amazed how good Sora is... it seems to get everything to ask in the prompt i...

ah I haven't tried it yet

#

not rly into video

#

I think most people were super excited for video and many switched over right away
but I still prefer image

#

its cos I started out in upscaling hobby first

nimble light Apr 27, 2025, 6:18 AM

#

Can someone help me with a question

echo cobalt Apr 27, 2025, 6:23 AM

#

nimble light Can someone help me with a question

what question

nimble light Apr 27, 2025, 6:24 AM

#

echo cobalt what question

What is the best host software like Kitra AI, that allows me to use models like Stable Diffusion or Flux to generate image in masked areas, like Photsohp, isntead of generating image from scatch, i want to mod sepcific parts, EG add boats to a part of the river

echo cobalt Apr 27, 2025, 6:26 AM

#

nimble light What is the best host software like Kitra AI, that allows me to use models like...

idk that much but i think you can use comfyui can do preety much of the work i guess

nimble light Apr 27, 2025, 6:27 AM

#

Like Photoshop? Mask select an area and generate or mod?

echo cobalt Apr 27, 2025, 6:27 AM

#

nimble light Like Photoshop? Mask select an area and generate or mod?

yaaa preeety much it

nimble light Apr 27, 2025, 6:27 AM

#

What you mean by pretty much? The word pretty much means there are some caviats

echo cobalt Apr 27, 2025, 6:28 AM

#

If you want the most Photoshop-like experience but free: InvokeAI

#

If you're okay with a bit of complexity for ultimate power go with ComfyUI

#

thats all

nimble light Apr 27, 2025, 6:30 AM

#

This one https://www.comfy.org?

#

Or the git hub one?

echo cobalt Apr 27, 2025, 6:37 AM

#

github one

oblique elk Apr 27, 2025, 6:56 AM

#

nimble light What you mean by pretty much? The word pretty much means there are some caviats

If you are more into a user friendly UI you could combine krita with a comfyui backend with an ai plugin for krita. Otherwise if you do not need the latest models and function but solid outpainting and inpainting, regional changes etc. I would look towards invoke (community edition)

atomic mortar Apr 27, 2025, 7:48 AM

#

echo cobalt If you're okay with a bit of complexity for ultimate power go with ComfyUI

Id go a step further, swarmUI

#

Bit more user friendly and you have acces to comfyUI as a backend

main snow Apr 27, 2025, 10:27 AM

#

Swarm is apparently good for Flux too so there's that

#

You get to use the miiiiiracle checkpoint type lol

odd patio Apr 27, 2025, 10:54 AM

#

Can someone give me an invite to comfy org discord?

#

when I click on it it shows I am not logged in, when I log in the tab forgets I asked to join that room

#

Cant find it in discover search

ancient mauve Apr 27, 2025, 11:12 AM

#

what prompt do you reccommend for tagging images for a flux training in gemma3? llm its not giving me good desc riptions

#

it does the usual yapping these models do which I dont really need

#

btw having a local llm rocks

abstract quarry Apr 27, 2025, 11:32 AM

#

dude, you can just ask Gemma for a good system prompt lol

#

My prompt: "Write me a good system prompt for an image captioning model. I want to generate image captions for training/finetuning a Flux diffusion model for image generation. Write me a system prompt for such a captioning llm."

#

gemma gave me a good system prompt. I then added:
"This is great. Modify the system prompt such that the model will always output two different captions: one short which only highlights the most important aspects of the image and one detailed. Also, if I show it an image and write "this image shows [SOME NAME] I want the captioning model to use [SOME NAME] in its description and do not describe the main subject of the image in details (as these details are already implicitly defined by its name). Do you understand that? Write me a system prompt!"

#

What came out was:

#

as you can see on my prompts: you don't need good prompts. Just write anything and ask Gemma to make a good prompt out of that

#

then use this prompt as system prompt for your image captioning stuff

steel prawn Apr 27, 2025, 11:40 AM

#

I just slap this into any llm im using at the time if i get creative block:


You know the secrets of the lost art of prompting gorgeous anime wallpapers, at 16:9 and 2560x1440 resolution. You also have extreme proficiency in character profile shots in a 9:16 aspect ration, at 1440x2560 resolution. Some have said your creativity knows no bounds, and they are right.

Your also extremely proficient with all of the extensions and tools available on Automatic1111 with Stable Diffusion to enhance images, especially controlnet and regional prompting. And when necessary you will suggest using these tools, as well as providing a mock up open pose skeleton or depth image for controlnet.

I am your human counterpart, the one who enters the prompts to bring your forbidden knowledge and majestic works of art to the masses. Any prompt you give me, no matter how ridiculous, will be entered. And if additional tools are needed to achieve your glory, you will tell me and structure the prompt as it should be entered with those tools in mind.

With all of this in mind, your only job today is to provide me with prompts for stable diffusion anime art of the highest caliber. After each prompt you will ask me to submit the image generated, and then suggest no less than 3 options for our next prompt for me to choose from. Each prompt will be detailed, exquisite, and balanced so as to showcase the character and the scene in its proper glory. Once i pick a prompt option you will generate me the prompt you have in mind, and the cycle will repeat. The world will know the name and our brand by the time we are done. ```

abstract quarry Apr 27, 2025, 11:41 AM

#

haha, that's a good one

steel prawn Apr 27, 2025, 11:41 AM

#

Just input whatever image generator your using in place of A1111 (ive upgraded to forge for the time being myself) and run wild with this. It'll spit out pretty good stuff and you can steer it with your selections, upload the outputs to critique, and use it to build consistent styles for lora training etc if you want.

#

Doesnt fix them being chatty though.

#

They love their emojis

lofty scarab Apr 27, 2025, 6:55 PM

#

I don’t know if someone could help me. I can’t do checkpoint merge anymore. I use to pretty often and now it always end up in an error. With A1111, forge UI, comfy UI, none of them work. I’m on windows 11 24h2, 12900ks, RTX 4090. I’m on the latest driver 576.02. Is it a problem with the gpu driver? It used to work but now it doesn’t anymore. Is someone got a clue?

steel prawn Apr 27, 2025, 7:02 PM

#

lol the damn scammers tryin to get crafty

#

ask in tech support Dude, they might be able to help you

warm junco Apr 27, 2025, 7:34 PM

#

lofty scarab I don’t know if someone could help me. I can’t do checkpoint merge anymore. I us...

Hey, come to #🤝｜tech-support and show provide a full cmd log

sage reef Apr 27, 2025, 9:45 PM

#

@woven panther is this something you would consider porting to comfy?
https://www.reddit.com/r/StableDiffusion/comments/1k9bcfr/magi_45b_has_been_uploaded_to_hf/

floral jay Apr 27, 2025, 11:41 PM

#

my favourite thing forge has over a1111 must be how the interrupt and skip button actually works.

fair shadow Apr 28, 2025, 3:40 AM

#

Hello friends, I’m using Automatic1111 and I want to create a consistent character, but I don’t know how to do it. I looked online, but no one has explained it thoroughly. Can you help me with this?

obsidian plume Apr 28, 2025, 9:55 AM

#

hey'

#

hey! still in super need of to make clip models become .ckpt if thats even possible?

#

from .pt to .ckpt

abstract quarry Apr 28, 2025, 9:59 AM

#

rename it? 😂

#

these endings do not have a meaning. Usually, they are pickled dictionaries or models.

obsidian plume Apr 28, 2025, 10:05 AM

#

abstract quarry these endings do not have a meaning. Usually, they are pickled dictionaries or m...

no its not posible. Have you worked with disco diffusion before?

abstract quarry Apr 28, 2025, 10:07 AM

#

no. What I want to say is: there is no "checkpoint format" or "pt format".

#

even safetensors, although its own format, is not "standardized"

#

so your question has to be: I have file X downloaded from source Y and want to use it in tool Z.

obsidian plume Apr 28, 2025, 10:33 AM

#

abstract quarry so your question has to be: I have file X downloaded from source Y and want to u...

exactly! Theres this tutorial but i havent made it worked... Anyone else have tried?

#

https://youtu.be/tgRiZzwSdXg?feature=shared

serene mountain Apr 28, 2025, 12:05 PM

#

I have a 3070 8GB presently on Forge Web UI…considering upgrading to a 16gb. Had some feedback in another discord that 16gb is already not enough. Budget is tight but if Im gonna upgrade to do quality image gen whats the minimum i should be looking at without going overboard (i know in gaming there are diminishing returns).

Im doing this recreationally but once I get proficient I want to incorporate it into my business model.

#

Are there key specs on the cards I should be looking at? Or is the raw amount of ram the most important thing.

still glacier Apr 28, 2025, 12:13 PM

#

ammount of vram dictates which model you can load on your gpu at once / without having to chop it in pieces and load it bit by bit during the generation process ( usually done automatically by whatever program you ll use )
Having the model loaded fully will avoid the costly / long loading and unloading of data to your gpu.
With that said, if you have enough vram to load what you want, then yes the gpu speed / architecture itself will become your main concern regarding speed. Newer gpu will go faster ( assuming there is enough vram to load everything at once )

Now.... Is 16gb enough. Yes for image generation definitely. For video generation meeeh, video generation is still in is infancy, so it s hard to tell. You ll have enough to do stuff for sure. But will it be """""""futureproof"""""""" is hard to tell. Even 8gb is enough for video generation if you use some tricks.

fervent thunder Apr 28, 2025, 12:17 PM

#

obsidian plume exactly! Theres this tutorial but i havent made it worked... Anyone else have tr...

you need to de-serialise it and get it into a form where it is just
written out as standard pytorch code

#

and then you can open up a model that is in the format you want

#

and have a think about what you need to do to get it to be in that format

#

its mostly just renaming stuff but sometimes there is more

serene mountain Apr 28, 2025, 12:21 PM

#

still glacier ammount of vram dictates which model you can load on your gpu at once / without ...

Itll be a minute before i get into video, im still trying to learn everything about image. Im getting there…

It seems like running sdxl models works ok on what i have now, so any upgrade would be an improvement but the resounding answer in the other chat was to go cloud. I see the benefit but i have privacy concerns there. I guess i just hate being tethered to a 3rd party.

Its tough because im starting to see the suggestions are all over the place

#

I know flux is pretty VRAM heavy

fervent thunder Apr 28, 2025, 12:22 PM

#

you can fit flux in 8GB

#

int4 flux is 6.64 GB

#

or nf4

#

same size

still glacier Apr 28, 2025, 12:24 PM

#

serene mountain Itll be a minute before i get into video, im still trying to learn everything ab...

it all depends of your budget.... How much will you be using this gpu ? for how long ? only for AI stuff ? Privacy concern indeed for cloud solutions ? etc... Disminishiong return costs, etc
Like Neon said 8gb should be enough for flux anyways.

serene mountain Apr 28, 2025, 12:47 PM

#

Budget $600 ideal - i saw a few 5070 cards (16gb) in the 500-600 range. I can push $1000 but thats about my ceiling.

I had this card for a while, it runs all my other games and software fine on high or max settings (i do graphics, photo and video professionally). So if I had something that worked well, Id probably keep it until it melts or software just totally out paces it.

abstract quarry Apr 28, 2025, 12:48 PM

#

16GB is fine. Sure, more is better, but this is also true for 24gb. As soon as you have 24gb, you want even more vram. It never stops ;D

fervent thunder Apr 28, 2025, 12:52 PM

#

$600 is the used RTX 3090 area

#

but there is risk in used cards

still glacier Apr 28, 2025, 12:59 PM

#

keep in mind that RTX 5000 will get longer support than RTX 3000 too. (at least in theory, if Nvidia does not become a fully AI company by that time...)

fervent thunder Apr 28, 2025, 12:59 PM

#

yeah that's true

#

its tricky

still glacier Apr 28, 2025, 1:00 PM

#

Personally with that budget I d go with RTX 5070 because of the support, faster cores, dlss4 and because I don t care about video gen :p

fervent thunder Apr 28, 2025, 1:01 PM

#

the gaming and rendering stuff like dlss might be worth yeah

serene mountain Apr 28, 2025, 1:01 PM

#

abstract quarry 16GB is fine. Sure, more is better, but this is also true for 24gb. As soon as y...

Well, right? I mean Ive done this a while (tech not ai).

Price wise… first it was bitcoin mining driving it up, then covid, now the AI “bubble” and tarrifs…so theyre always gonna be pricy.

I know itll be out dated probably as soon as i buy it, thatll be true even if i got 24gb.

My concern is: if i drop 600-1000 dollars, will I be happy with my image gen with the CURRENT environment

still glacier Apr 28, 2025, 1:02 PM

#

saying it just in case. upgrading your gpu will NOT upgrade the quality of the outputs

#

it will just change the speed

fervent thunder Apr 28, 2025, 1:02 PM

#

every GPU will be out of date at some point because ASICs are coming
but this might take a few years

#

ASIC just means "specialist chip"

still glacier Apr 28, 2025, 1:04 PM

#

to be fair I remeber hearing about asic already available that can do inference for a fraction of the cost (but not training) but I don t think they re selling for the public yet.

serene mountain Apr 28, 2025, 1:04 PM

#

still glacier saying it just in case. upgrading your gpu will NOT upgrade the quality of the o...

Quality in terms of definition?

What about capability… like if I generate on civit… i can do an illustrious model with 2-5 lora’s and put out some fun stuff.

I feel like rn, im pretty capped at 1 model, maybe 1 lora.

Invoke crashes if my prompt goes over tokens (but that may be a setup issue)

Im still learning forge.

still glacier Apr 28, 2025, 1:05 PM

#

serene mountain Quality in terms of definition? What about capability… like if I generate on ci...

sounds like a setup / settings issues more than an hardware one.

#

sure lora will add to the vram cost but not that much tbh. And the more loras you shove in your prompt the more they will fight each other usually so it s not recommended to use many of them at once.

serene mountain Apr 28, 2025, 1:07 PM

#

still glacier sure lora will add to the vram cost but not that much tbh. And the more loras yo...

I know. I usually play with the weights to get some different effects but i try to minimize it

abstract quarry Apr 28, 2025, 1:08 PM

#

serene mountain Quality in terms of definition? What about capability… like if I generate on ci...

invokeai is using diffusers which is, unfortunately, less memory efficient than ComfyUI

#

make sure you run invoke in low vram settings

#

in general, the length of your prompt should not matter as long as it is below 500 tokens

#

and the number of loras shouldn't matter either

serene mountain Apr 28, 2025, 1:09 PM

#

abstract quarry make sure you run invoke in low vram settings

Maybe i should do a fresh install and try again then.

#

I thought i had that setup, it ran great for 15 generations then all it would spit out was black squares.

still glacier Apr 28, 2025, 1:10 PM

#

it s worth a try to reinstall and or run comfyUI before dropping hundreds of $ into a new gpu.

serene mountain Apr 28, 2025, 1:10 PM

#

ChatGPT said the error in the log was from to many tokens but it definitely wasnt 500.

still glacier Apr 28, 2025, 1:10 PM

#

ChatGPT tells you what you want to hear.

#

If you can t / dont know how to verify what it says, I would not trust it blindly. Same goes for every LLM.

serene mountain Apr 28, 2025, 1:10 PM

#

still glacier ChatGPT tells you what you want to hear.

True. The hallucinations are annoying. But i also feel bad sometimes coming here with 100 questions 😂

still glacier Apr 28, 2025, 1:11 PM

#

LLM can be a good tool to start your research, it will at the very least give you a few pointer, stuff to research.

#💬｜general-chat

Get mapping from subwords to original words

Accumulate embeddings per word

Append the last word

Convert to tensor