copper crystal Sep 9, 2024, 12:52 AM

#

if i understand it right so far, you have to train the solver for the model

#

the code is the sd2 version

formal gate Sep 9, 2024, 2:52 AM

#

i havent been here for a while, are there any AI text to speech models?

sick chasm Sep 9, 2024, 3:48 AM

#

Hello

#

Anyone having problems generating images?

full lark Sep 9, 2024, 4:04 AM

#

Nice! 👏

proud fern Sep 9, 2024, 5:10 AM

#

Hi! I am the founder of an ecosystem around gen AI and automation. We are currently working on developing a proprietary closed model using a base image generation model and a Deep Convolutional Generative Adversarial Network (DCGAN) model. I'm seeking advice from an engineer with experience working with such models and in using Cloud GPUs. I would like to understand which provider can best meet our requirements. Can someone help?

mighty coral Sep 9, 2024, 9:01 AM

#

Hi

analog apex Sep 9, 2024, 9:29 AM

#

Hey everyone, I got a buff PC now and want to try generating some AI Art. Is Stable Diffusion the way I should go and is the guide I should follow to set it up? https://rentry.org/voldyold

still glacier Sep 9, 2024, 9:30 AM

#

analog apex Hey everyone, I got a buff PC now and want to try generating some AI Art. Is Sta...

no this guide is way too old.

#

check out guides from #🤝｜tech-support pinned messages

analog apex Sep 9, 2024, 9:30 AM

#

Thanks

tulip yarrow Sep 9, 2024, 10:50 AM

#

What's the size difference between SD1.5, SDXL and Flux

#

Well... I know the difference between SD1.5 and SDXL

#

I never used a SD3/Flux Lora before

quartz siren Sep 9, 2024, 10:55 AM

#

tulip yarrow What's the size difference between SD1.5, SDXL and Flux

There is a very large difference, sdxl is roughly 3.5b parameters combined everything(text encoders, vae, unet)

Flux is roughly 16b on the other hand with everything(text encoders, vae, dit)

Just use nf4v2 quantization, and flux should use the same vram has sdxl.

fervent thunder Sep 9, 2024, 10:55 AM

#

proud fern Hi! I am the founder of an ecosystem around gen AI and automation. We are curren...

any of the big 3- Azure, AWS or Google cloud
they are roughly equivalent at this point
although I would also say there is good reason that GANs aren't really getting used much any more

tulip yarrow Sep 9, 2024, 10:55 AM

#

quartz siren There is a very large difference, sdxl is roughly 3.5b parameters combined every...

So does that mean flux loras are even bigger

fervent thunder Sep 9, 2024, 10:56 AM

#

ye

#

the loras are bigger

quartz siren Sep 9, 2024, 10:56 AM

#

tulip yarrow So does that mean flux loras are even bigger

Yes, but not by much.

tulip yarrow Sep 9, 2024, 10:56 AM

#

oh. god.

#

Yeah... looks like it's back to SD1.5 for me. Too bad it's dying out.

fervent thunder Sep 9, 2024, 10:57 AM

#

SD 1.5 keeps getting stronger and stronger as more tools come out TBH

tulip yarrow Sep 9, 2024, 10:57 AM

#

But it's still a classic

fervent thunder Sep 9, 2024, 10:58 AM

#

yeah definitely

tulip yarrow Sep 9, 2024, 11:10 AM

#

Which is why I still use it

fervent thunder Sep 9, 2024, 11:11 AM

#

I really like the compositions of SD 1.5

#

I like them more than most newer models

#

it needs a refiner pass with a stronger model but otherwise its great

tulip yarrow Sep 9, 2024, 11:13 AM

#

I want to see new SD models backwards compatible with SD1.5 Loras

static falcon Sep 9, 2024, 11:14 AM

#

proud fern Hi! I am the founder of an ecosystem around gen AI and automation. We are curren...

you mean cloud for production use case running it for clients afterwards, or for training and dev purposes? any of these would work, depends on what size and type of servers and GPU models you need, because they just provide the capacity and everything else is up to you

#

for training AWS would be best because they have these concepts of Spot instance pricing, which is 50%-70% cheaper but can get interrupted by other bidders at any time with 2 minute warning signal. most server types can run for many hours or days before they are interrupted by other bids so it's pretty good for one-off tasks to run and then delete a server (like CI/CD builds or load testing / GPU training)

fervent thunder Sep 9, 2024, 11:30 AM

#

interruptible pricing is great yeah I use Vast.ai personally and rarely get interrupted

tropic frost Sep 9, 2024, 11:38 AM

#

do you guys personally add any detail enhancing loras to your generations when you first start experimenting? or after you get a good base for img2img?

fervent thunder Sep 9, 2024, 11:38 AM

#

I never make any image without detail loras

#

sometimes I have to remove it for trouble-shooting

#

Detail Tweaker XL is the main one

#

but you want to stack at least 2-3 cos they all add different things

static falcon Sep 9, 2024, 11:57 AM

#

what are the detail loras, aren't details dependent primarily on prompting? i still don't know how loras work, i know 'control net' is a lora type?

#

so there's pre-processors like anyline, and there's loras which is a custom feature or style but it's basically the same SD model just trained one more 'layer' on top of it? is that how technically it works?

fervent thunder Sep 9, 2024, 12:01 PM

#

no, details depend maybe 1% on prompting and 99% on the rest of the workflow

static falcon Sep 9, 2024, 12:11 PM

#

where is it possible to train a custom lora? i want to try train for 2d cartoon body parts, but my laptop won't even load the SD itself 😊

#

is it something to try train a lora for, or it's a job for control net and openpose stuff?

fervent thunder Sep 9, 2024, 12:12 PM

#

control net and openpose can control the layout and pose but not more than that

static falcon Sep 9, 2024, 12:13 PM

#

fervent thunder control net and openpose can control the layout and pose but not more than that

and lora?

#

can it get trained for custom layouts like that?

fervent thunder Sep 9, 2024, 12:13 PM

#

yeah lora has an advantage that it can do both layout and style together

#

I wrote this comment the other day on lora methods:

1. pytorch
2. diffusers
3. OneTrainer, Koyha, SimpleTuner
4. replicate, civit
5. paying a freelancer```

static falcon Sep 9, 2024, 12:22 PM

#

fervent thunder I wrote this comment the other day on lora methods: ```in order of decreasing di...

thanks! will google it, koyha is something on github i remember i saw it, but need to train on my own pc correct? civitai can be used for any lora training? that seems best option, will try doing it there. replicate never heard of

elder epoch Sep 9, 2024, 12:24 PM

#

Can anyone tell me where to find the CFG Rescale parameter in stable diffusion?

fervent thunder Sep 9, 2024, 12:28 PM

#

static falcon thanks! will google it, koyha is something on github i remember i saw it, but ne...

replicate, civit are the only ones that can't be done locally

tropic frost Sep 9, 2024, 3:12 PM

#

out of curiousity, on average how long does it take for you to generate lets say 3 images (512x512) with pony lora?

feral pike Sep 9, 2024, 3:15 PM

#

is there a way to measure ToPs?

main junco Sep 9, 2024, 3:54 PM

#

How to use stable diffusion on cpu only without GPU? On windows?

#

Do i need to download special version?

solid kindle Sep 9, 2024, 3:56 PM

#

dont

#

long story short

fervent thunder Sep 9, 2024, 4:05 PM

#

I do it, its fine

fervent thunder Sep 9, 2024, 4:05 PM

#

main junco How to use stable diffusion on cpu only without GPU? On windows?

https://github.com/comfyanonymous/ComfyUI/releases get the portable version

#

and then go into the folder and

#

double click run_cpu.bat

#

it should start working straight away

#

get TCD sampler and TCD lora for SD 1.5

#

this makes excellent images at 6 steps

#

TCD is the best distilled model in my opinion, for SD 1.5 and SDXL

trail lion Sep 9, 2024, 4:08 PM

#

fervent thunder interruptible pricing is great yeah I use Vast.ai personally and rarely get inte...

same, biggest thing I've seen on vast is an issue with a template on a particular server. like missing files or something, if I ever see that I just delete it and grab another or if I've already uploaded stuff transfer the files with the built-in sync (love that feature)

fervent thunder Sep 9, 2024, 4:08 PM

#

I have issues with templates a lot yeah

#

I am working on building my own template

#

I don't even use the sync I just redownload everything each time using a shell script that chatgpt wrote

#

cos I tend to go for data centers with 10GBs download its fine

#

if you go for servers with slower download then sync methods would be good

#

I build the workflow in advance so time isn't wasted

main junco Sep 9, 2024, 4:46 PM

#

What are the best modern tools for training LORA or dreambooth?
And which one is better? Everyone says that dreambooth, but why there are so many loras?

trail lion Sep 9, 2024, 4:51 PM

#

loras are the most convenient, because they are small, easy to distribute, take up less space, and can be combined with others on a run-time basis. a dreambooth finetune outputs a full checkpoint, so in the case of SDXL that means 6Gb or more, while you can combine a full checkpoint with others, it requires a merging script to combine the weights. both dreambooth and lora training can be done on a small training dataset (meaning small number of images). there are multiple tools to train, kohya and the popular wrapper around kohya known as kohya_ss are among the more popular trainers, but others exist such as onetrainer. kohya_ss is what I use personally

#

I have a current training right now that has 24 saved epochs so far on flux. each of them is 150Mb, vs 26G for the full checkpoint. maybe that helps put it in perspective

fervent thunder Sep 9, 2024, 5:04 PM

#

main junco What are the best modern tools for training LORA or dreambooth? And which one i...

cos people have different budgets

#

people are trying to minmax rather than make the best fine tune they can

#

if you have the money for it then training all the weights including the text encoders is best

#

this paper is a good example of why self-attention layers matter as well as just cross-attention https://arxiv.org/abs/2308.12964

open crest Sep 9, 2024, 5:25 PM

#

I have a budget for 600 dollars for gpu.
Currently have ryzen 9 and 64gb ram and 5700xt.

Any recommendations as to what to get for local image gen?

fervent thunder Sep 9, 2024, 5:37 PM

#

used 3090

dreamy turtle Sep 9, 2024, 5:40 PM

#

Anyone of you tried making manga?

#

using sd

static falcon Sep 9, 2024, 5:57 PM

#

open crest I have a budget for 600 dollars for gpu. Currently have ryzen 9 and 64gb ram and...

anything second hand like 2080 would probably work

tropic frost Sep 9, 2024, 5:57 PM

#

hhhmm... say guys, if you use a lora that was meant for real life pictures in a anime or art generation, do you think that will cause a major increase in generation time?

fervent thunder Sep 9, 2024, 6:13 PM

#

tropic frost hhhmm... say guys, if you use a lora that was meant for real life pictures in a ...

no, that's not possible as far as I know

open crest Sep 9, 2024, 6:34 PM

#

fervent thunder used 3090

Would love to find a used one in that budget

fervent thunder Sep 9, 2024, 6:46 PM

#

should be doable

trail lion Sep 9, 2024, 7:14 PM

#

open crest I have a budget for 600 dollars for gpu. Currently have ryzen 9 and 64gb ram and...

I had that card, it's still sitting on my shelf in fact. you can do image gen with it....better in linux, since it's an AMD. I was able to get a 3090 TI for just under 900 (renewed). For me, an upgrade had to be to a 24G nvidia card (for local training mainly), otherwise, it seemed more of lateral move. If you really dont care about vram (though you should), maybe us a chart like this to help you decide https://cdn.mos.cms.futurecdn.net/FtXkrY6AD8YypMiHrZuy4K-1200-80.png.webp

#

I didnt have to upgrade my power supply, but make sure you have enough juice

thick elk Sep 9, 2024, 8:53 PM

#

Hola a todos! saludos desde Argentina

hot zodiac Sep 10, 2024, 2:56 AM

#

how to draw icon use sd?

winter shoal Sep 10, 2024, 7:01 AM

#

you can use deforum or similar tools for that. or if you are lazy, just subscribe to dream machine and use start and end images for the video.

cyan nest Sep 10, 2024, 7:13 AM

#

do I have to have a subscription to create images ?

pseudo moon Sep 10, 2024, 7:44 AM

#

Dear everyone
Nice to meet you.

#

Recently I've used this service and impressed by AI engine.
https://www.nterview.me/
https://www.youtube.com/watch?v=AfDn_Esqgg8

main junco Sep 10, 2024, 8:06 AM

#

What models are compatible with fooocus? is there a list? Or how can I determine if the model or lora will work?

sand flax Sep 10, 2024, 8:22 AM

#

I finally figured out dall e 3's little image generating trick

#

It doesnt really generate images but more like it cheats on the test

#

Talk about understanding nuances

#

I rip information from its images, almost forget i can change the image file format into a text format

#

And decompile a little bit to grab some code from microsoft sources

fervent thunder Sep 10, 2024, 9:19 AM

#

ye

#

sd3 has t5 clip l clip g

#

flux has t5 clip l

white parrot Sep 10, 2024, 10:15 AM

#

anyone can help me fix this #📝｜prompting-help message

fervent thunder Sep 10, 2024, 10:22 AM

#

flux simply wasn't trained with clip g

harsh agate Sep 10, 2024, 10:29 AM

#

hi everyone okay so see i m training an sdxl model with kohya trainer so can anyone suggest me some tricks while training so that i can generate high quality images

shy lagoon Sep 10, 2024, 10:30 AM

#

does anyone have an opinion on deep dream machine ? Do you know better or cheapre alternatives for Ai video creation ?

quartz siren Sep 10, 2024, 10:53 AM

#

shy lagoon does anyone have an opinion on deep dream machine ? Do you know better or cheapr...

Try kling or gen3, those are better from what I know.

harsh agate Sep 10, 2024, 11:28 AM

#

will anyone answer my question?

hasty hornet Sep 10, 2024, 11:39 AM

#

cyan nest do I have to have a subscription to create images ?

no, you can generate locally on your pc, search for automatic1111 webui or comfyui or tutorials on it, you'll figure it out

#

you subscribe only if you want to generate on someone else's machine basically.

#

or support devs \ site creators, depending on where you're gonna sub

stark frost Sep 10, 2024, 12:22 PM

#

hey chat

still glacier Sep 10, 2024, 1:36 PM

#

harsh agate will anyone answer my question?

Probably not, your question is too generic. "suggest some tricks to make it better". we don t know what settings you re using, we don t know what you already know about, etc. "make it better" is also too generic, could refer to the image quality, composition, resolution, how close it sticks to the prompt, etc

#

Try to reformulate it. Also it s probably better suited for #🔧｜finetune or #📝｜prompting-help depending of the refined question.

shy lagoon Sep 10, 2024, 1:52 PM

#

quartz siren Try kling or gen3, those are better from what I know.

Yeah thanks mate i just tried kling, there is an offer 66% off first month

desert dagger Sep 10, 2024, 3:09 PM

#

sand flax It doesnt really generate images but more like it cheats on the test

Wrong

deep narwhal Sep 10, 2024, 3:29 PM

#

just popin in to say RIP A1111 (until we can use flux on it)

#

also, what should i use instead 😂

solid kindle Sep 10, 2024, 3:38 PM

#

deep narwhal just popin in to say RIP A1111 (until we can use flux on it)

what happened to A1111?

deep narwhal Sep 10, 2024, 3:47 PM

#

solid kindle what happened to A1111?

i cant use flux with it 😢

unborn hedge Sep 10, 2024, 4:44 PM

#

idk if I want to tap on links I never seen before

dark hawk Sep 10, 2024, 7:37 PM

#

Generate text to image, Chat assistant and image analysis with my verified discord bot https://dsc.gg/vexel

sand flax Sep 10, 2024, 9:10 PM

#

desert dagger Wrong

Not wrong.

desert dagger Sep 10, 2024, 9:11 PM

#

sand flax Not wrong.

very very wrong. dall-e3 generates images, just like all the others do. it doesn't 'cheat', it just doesn't use the same neural network that stable diffusion uses.

sand flax Sep 10, 2024, 9:13 PM

#

desert dagger very very wrong. dall-e3 generates images, just like all the others do. it doesn...

Im saying it does push out generated images, just not in the same way it does like other text to image API models

desert dagger Sep 10, 2024, 9:15 PM

#

sand flax Im saying it does push out generated images, just not in the same way it does li...

it uses a different neural network, but it does exactly the same thing that stable diffusion does to create. and the same thing meta does, gemini does, and all the other diffusion models do.

#

it even uses CLIP

copper crystal Sep 10, 2024, 9:16 PM

#

sand flax I rip information from its images, almost forget i can change the image file for...

lol what

#

firstdayoninternetkid.gif

desert dagger Sep 10, 2024, 9:17 PM

#

sand flax I rip information from its images, almost forget i can change the image file for...

once an image is an image, you can pull the code out and use it, if that's what you're talking about. it's just an image then.

fervent thunder Sep 10, 2024, 9:18 PM

#

there is one very weird thing you can do with Dalle 3 using comfy ui
if you use a clip embedding explorer node
and you find tokens like this 23u4tj2-8t-1t02ht4 that correlate highly to meaningful tokens

#

then you give 23u4tj2-8t-1t02ht4 to Dalle 3, it makes the same image

#

or at least a similar one

desert dagger Sep 10, 2024, 9:19 PM

#

fervent thunder or at least a similar one

i'd expect that though, i'ts using clip, it's probably using the same text encoders

fervent thunder Sep 10, 2024, 9:19 PM

#

ye it makes sense I just find it funny that it works

desert dagger Sep 10, 2024, 9:20 PM

#

it's just using GPT on it's backend and stable doesn't

fervent thunder Sep 10, 2024, 9:20 PM

#

can probably pull shenanigans like this with loads of models

desert dagger Sep 10, 2024, 9:20 PM

#

probably

copper crystal Sep 10, 2024, 9:20 PM

#

there's no resaerch to suggest that running 3 encoders is better. What stability was trying to avoid was when they went away from clip G , the very old and inferior clip model, people had no idea how to prompt anymore. A big part of SD2's problems wasn't just censorship but it was that everyone had their prompting figured out. Everyone was a prompt master. For Clip G. Nobody bothered to adapt.

SDXL was an attempt to bridge that. Clip G and Clip L side by side.

T5 benefits from being paired with a clip encoder, since T5 wasn't trained on image pairs. So that's why they use it with the superior Clip L.

SD3 actually suffers a lot because of the old busted clip G. Practically decades old at this point.

fervent thunder Sep 10, 2024, 9:20 PM

#

I think I have seen the vision version before on reddit/youtube
exploiting that they often use some shared ViT or CNN

desert dagger Sep 10, 2024, 9:20 PM

#

probably also why if you give one of the LLMs an image, get a description of it, and give that to the ai image gen, you get almost an identical image

fervent thunder Sep 10, 2024, 9:21 PM

#

yeah I really love that aspect of these models

desert dagger Sep 10, 2024, 9:22 PM

#

fervent thunder yeah I really love that aspect of these models

while it's fun, it also points to a problem - they're all just basically the same thing. we're all using the same set of pencils and crayons - so everything pretty much looks the same - or in the case of LLMs - sounds the same

copper crystal Sep 10, 2024, 9:22 PM

#

I think SD3 is the research about 3 text encoders. And well, look at it

desert dagger Sep 10, 2024, 9:22 PM

#

they're all trained on the same data, in teh same way, and there's no real diversity

fervent thunder Sep 10, 2024, 9:22 PM

#

yeah there are very large similarities

#

I made maybe 1000 flux images today and a ton of the sci fi stuff I had seen in SDXL

#

to be fair to flux it has a lot more image variety than I expected for a distilled model

copper crystal Sep 10, 2024, 9:23 PM

#

There's way too many LLMs at this point to say that for certain. Maybe back when there were 2-3 contenders.

desert dagger Sep 10, 2024, 9:23 PM

#

copper crystal There's way too many LLMs at this point to say that for certain. Maybe back when...

they're all just modifications of the same thing though

copper crystal Sep 10, 2024, 9:23 PM

#

Feels like saying most of the traffic online is pornography. Yeah back before youtube and netflix and amazon. Sure. Not now.

desert dagger Sep 10, 2024, 9:24 PM

#

go talk to chatGPT, claude, meta - ask the same quesiton, you'll get 1. the same responses 2. the same personality 3. the same thoughts

#

we're in a huge echo chamber

copper crystal Sep 10, 2024, 9:24 PM

#

the free versions? or the SOTA?

desert dagger Sep 10, 2024, 9:24 PM

#

copper crystal the free versions? or the SOTA?

try both

#

try all of the ones you can get to

copper crystal Sep 10, 2024, 9:24 PM

#

i have actualy. And more. So i don't know where you're coming from.

#

moving on i guess. discussion is moot

desert dagger Sep 10, 2024, 9:24 PM

#

copper crystal i have actualy. And more. So i don't know where you're coming from.

then don't stuff yourself in to the conversation

fervent thunder Sep 10, 2024, 9:24 PM

#

my test questions get very similar answers on like all of the top 50 LLMs

quartz siren Sep 10, 2024, 9:25 PM

#

desert dagger go talk to chatGPT, claude, meta - ask the same quesiton, you'll get 1. the same...

yeah true, but with llama/mistral models its pretty easy to make it a different personality

desert dagger Sep 10, 2024, 9:25 PM

#

quartz siren yeah true, but with llama/mistral models its pretty easy to make it a different ...

sure. but i'm talking defaults

copper crystal Sep 10, 2024, 9:25 PM

#

desert dagger then don't stuff yourself in to the conversation

just saying your claim is wildly innaccurate and inexperiienced.

It's a whole lot of puff

desert dagger Sep 10, 2024, 9:25 PM

#

you can twist the ai image gens to do unique things too - but the default stuff without a lot of prompt hoops and adjustments all come out looking pretty much the same

copper crystal Sep 10, 2024, 9:25 PM

#

dont bs in public if you don't want to be called out for it

desert dagger Sep 10, 2024, 9:25 PM

#

copper crystal dont bs in public if you don't want to be called out for it

go away

copper crystal Sep 10, 2024, 9:26 PM

#

¯_(ツ)_/¯

fervent thunder Sep 10, 2024, 9:26 PM

#

copper crystal ¯\_(ツ)_/¯

his claim was that the big LLMs are trained on common training data and have pretty similar outputs
seems true to me

quartz siren Sep 10, 2024, 9:26 PM

#

desert dagger sure. but i'm talking defaults

Yes they are heavily finetuned on such outputs which is honestly fine, you will see that the base models have no such "censorship". They are completely unfiltered

desert dagger Sep 10, 2024, 9:26 PM

#

you do this all the time, jump into a converstaion with no dea what you're talking about, get ugly, attack someone. just go find something else to do

copper crystal Sep 10, 2024, 9:26 PM

#

fervent thunder his claim was that the big LLMs are trained on common training data and have pre...

its not. it has "truthiness"

fervent thunder Sep 10, 2024, 9:26 PM

#

ok so lets not start drama if there is some truth to it

unborn hedge Sep 10, 2024, 9:26 PM

#

tagging my images for a LoRa is confusing the hell out of me lol, do i tag the stuff i DONT want in the image, tag everything or just tag the stuff i want the model to learn??

copper crystal Sep 10, 2024, 9:27 PM

#

we're in a dataset gold rush. There are more than just the 3 bots he listed out there

desert dagger Sep 10, 2024, 9:27 PM

#

unborn hedge tagging my images for a LoRa is confusing the hell out of me lol, do i tag the s...

you are telling the AI what is in the image. just tag it with what you what the AI to think of when you use those words in a prompt

copper crystal Sep 10, 2024, 9:27 PM

#

fervent thunder ok so lets not start drama if there is some truth to it

truthiness is a term Colbert coined, when bullshitters skirt that grey area between truth and fact. It's not true, but it feels like it

desert dagger Sep 10, 2024, 9:28 PM

#

fervent thunder ok so lets not start drama if there is some truth to it

eh, he follows me around and takes any opportunity to try to 'call me out' and spread his chaos

unborn hedge Sep 10, 2024, 9:28 PM

#

desert dagger you are telling the AI what is in the image. just tag it with what you what the ...

all these youtube tutorials and this guy explains it in one sentence, thank you!!

desert dagger Sep 10, 2024, 9:28 PM

#

just ignore him

copper crystal Sep 10, 2024, 9:28 PM

#

but i dont?

#

what?

copper crystal Sep 10, 2024, 9:29 PM

#

unborn hedge tagging my images for a LoRa is confusing the hell out of me lol, do i tag the s...

there's a lot of strategies. If you're training the likenss of a person for SDXL, i'd go with describing everything BUT the person. The person is boiled down to the single trigger token.

#

if you describe the person in the captions, you generally have to describe them in the prompt too

unborn hedge Sep 10, 2024, 9:31 PM

#

copper crystal there's a lot of strategies. If you're training the likenss of a person for SDXL...

im making my OC character into a LoRa, likely for SDXL

#

so a lora trained off a character and their likeness

quartz siren Sep 10, 2024, 9:32 PM

#

fervent thunder his claim was that the big LLMs are trained on common training data and have pre...

its bc of the finetuning data for chat models, most of it is completely synthetic and hence it will have a very similar speaking style to things like chatgpt/claude

However this is kind of easy to remove since you can just finetune further or finetune the base model.
Base models will not have such problems.

copper crystal Sep 10, 2024, 9:32 PM

#

one token for the character. describe everything else. Thats how i do it.

Other people have other approaches. But i've foudn that describing character details requires those in the prompt later on

fervent thunder Sep 10, 2024, 9:32 PM

#

a lot of it is the finetuning data yeah
but there's also common core and things like that

#

stack overflow has essentially been lifted into most of these models

unborn hedge Sep 10, 2024, 9:33 PM

#

copper crystal one token for the character. describe everything else. Thats how i do it. Ot...

ok thanks for the advice!

copper crystal Sep 10, 2024, 9:33 PM

#

fervent thunder a lot of it is the finetuning data yeah but there's also common core and things ...

i agree there are common datasets. it all started from smaller efforts. But we're post nvidia hitting 1Trillion valuation now.

#

That issue is rapidly diminishing

fervent thunder Sep 10, 2024, 9:34 PM

#

I'm not sure the models are diverging

#

I've seen the opposite trend in a few ways

copper crystal Sep 10, 2024, 9:35 PM

#

depends on your use case. many of them will be a lot of the same.

#

there's only so many ways that a model can impersonate a pirate

fervent thunder Sep 10, 2024, 9:38 PM

#

Imagine a fairly niche academic question, which is answered very well by only a handful of articles on the internet, and not answered well by any other sources.
Over time as models get bigger and have more expansive training data, its more likely that each model will come across that one particular answer in their training data.
Because the utility of the answer is so much higher than the utility of the answers other sources are giving, this correct answer will light up brightly on attention scores, and so end up being the answer each model gives.

quartz siren Sep 10, 2024, 9:40 PM

#

llama3 was trained on 15trillion tokens of data, the internet has less then 100t I believe.

desert dagger Sep 10, 2024, 9:47 PM

#

quartz siren llama3 was trained on 15trillion tokens of data, the internet has less then 100t...

the public internet. but how much do the private and academic sectors have?

copper crystal Sep 10, 2024, 9:49 PM

#

ignoreme i'm just following people around

desert dagger Sep 10, 2024, 9:53 PM

#

they are, and in SD3 - per the diagram - clip_G is the workhorse. it actually works pretty well for the job it's doing

quartz siren Sep 10, 2024, 9:54 PM

#

clip g is openclip(from laion), clip l is normal clip(from openai), they are similar but different sizes.

i dont understand why no one uses siglip now since its basically the much more improved version of clip

copper crystal Sep 10, 2024, 9:54 PM

#

quartz siren llama3 was trained on 15trillion tokens of data, the internet has less then 100t...

the google searchable internet maybe. the depth of it is so much more than 100t . Trillion is also an unfathombly large number, though also very obtainable as far as data goes. if one token is 5 bytes, that's 500T bytes. That's 500 Terabytes.

desert dagger Sep 10, 2024, 9:54 PM

#

quartz siren clip g is openclip(from laion), clip l is normal clip(from openai), they are sim...

someone's probably working on an implimentation that'll use it

copper crystal Sep 10, 2024, 9:55 PM

#

Allow me to introduce to you, the deep web

quartz siren Sep 10, 2024, 9:55 PM

#

copper crystal the google searchable internet maybe. the depth of it is so much more than 100t ...

yeah 100%, lots of data can't be scraped with scrapers

copper crystal Sep 10, 2024, 9:56 PM

#

quartz siren yeah 100%, lots of data can't be scraped with scrapers

stock scrapers maybe

quartz siren Sep 10, 2024, 9:57 PM

#

desert dagger someone's probably working on an implimentation that'll use it

true, there is also pile t5xxl which is supposed to be a better version of t5xxl 1.1
Auraflow uses the much smaller version(pile t5xl) which is like 1b parameters compared to t5xxl 1.1 which is like 3-4b parameters and has similar prompt following to the best models.

copper crystal Sep 10, 2024, 9:58 PM

#

wikipedia text file is 60GB. Hmm. Actually that scale on the text data.. how many wikipedias would be 500TB. actually, might be believeable

#

say 8-9000 wikipedias would fill 500 TB. that's a big scale. maybe still not the depths of it all though

#

https://academictorrents.com/details/9c263fc85366c1ef8f5bb9da0203f4c8c8db75f4 reddit dataset alone is 2.5TB. thats a lot of garbage text.

#

thats compressed too wow. older archives, like the archiveteam rips of Yahoo groups, thats 1.5TB of compressed text.

#

yeh i convinced myself again. The depth of the internet's text data is way over 500TB

quartz siren Sep 10, 2024, 10:08 PM

#

a lot of dataset is going to get filtered and deduplicated most likely

desert dagger Sep 10, 2024, 10:21 PM

#

quartz siren a lot of dataset is going to get filtered and deduplicated most likely

almost guarenteed, and most of the data sets are being taken from the LAION database so that narrows it down even farther

fervent thunder Sep 10, 2024, 10:52 PM

#

Hello

fervent thunder Sep 10, 2024, 11:10 PM

#

funnily enough Kolors was the one to really push it with the text encoding

#

they put a fairly strong multi-lingual language model called GLM

fathom imp Sep 11, 2024, 1:12 AM

#

hiya 👋 greetings everyone. just getting started with SVD 1.1 and will likely be hanging in #▶｜stable-video-diffusion

robust needle Sep 11, 2024, 1:29 AM

#

I followed a tutorial for AMD compatible stable diffusion, and although I am new to this I feel like a portion of my less than ideal results are from using "v1-5 pruned emaonly" as my checkpoint. I've had so much more success with web based ai gernerators such as adobe firefly so I feel like my prompts should get at least decent results

vagrant raptor Sep 11, 2024, 1:31 AM

#

Anybody willing to share a good text to video workflow with motion imported from another video, and face swapping?

proven wadi Sep 11, 2024, 3:53 AM

#

Hey i am new here like what do you guys do

oak latch Sep 11, 2024, 4:18 AM

#

whats the best fastest gpu for flux training?

copper crystal Sep 11, 2024, 5:15 AM

#

4090 far as gpu goes. if you go enterprise you're better off

oak latch Sep 11, 2024, 5:44 AM

#

copper crystal 4090 far as gpu goes. if you go enterprise you're better off

yea but the problem is its very slow used to be way better 14secs per iteration

#

idk what i did wrong

copper crystal Sep 11, 2024, 5:46 AM

#

you've already got a 4090? what a deceptive bait for technical support

oak latch Sep 11, 2024, 5:46 AM

#

im using runpod

#

and using a rtx6000 ada or whatever

warm junco Sep 11, 2024, 7:44 AM

#

robust needle I followed a tutorial for AMD compatible stable diffusion, and although I am new...

Hey, you should try other models then. The 1.5 ema prunes is not recommended as its 2 years old

nova turtle Sep 11, 2024, 7:54 AM

#

Hey all 🙂 what is in your opinion the best existing img2vid workflow? watwow

oak latch Sep 11, 2024, 8:27 AM

#

i am training my flux model and at iter 0/500 i get a normal picture but past i get static

#

did i overtrain?

grim aspen Sep 11, 2024, 9:15 AM

#

what is better for stable diffusion: Intel ARC A770 or Radeon RX 7600?

dry halo Sep 11, 2024, 9:24 AM

#

Friends, I want to rent a personal host with A100 graphics card and 14900K, and 4080 as the display card. Is this solution feasible?

warm junco Sep 11, 2024, 10:07 AM

#

grim aspen what is better for stable diffusion: Intel ARC A770 or Radeon RX 7600?

A 7600 would be better, but if the 770 has 16gb instead of 8 maybe that would be better

grim aspen Sep 11, 2024, 10:09 AM

#

@warm junco both have 16gb

#

alternatively I could get a used 16gb NVIDIA Quadro P5000 for the same price

warm junco Sep 11, 2024, 10:11 AM

#

A 7600 has 8gb, a 7600xt has 16

#

The 7600xt will be much better than the 770

grim aspen Sep 11, 2024, 10:14 AM

#

warm junco A 7600 has 8gb, a 7600xt has 16

ahh yea I meant the xt

#

it's like 30 bucks more then the 770

#

if it's a lot better then that makes sense to invest

#

what about a GeForce RTX 4060 Ti it's 100 bucks more. Would that make a huge difference or is it not worth it?

main snow Sep 11, 2024, 10:19 AM

#

hey guys, ya'll know of any lora or checkpoing similar to the style of persona 3 reload in-game?

warm junco Sep 11, 2024, 10:43 AM

#

grim aspen what about a GeForce RTX 4060 Ti it's 100 bucks more. Would that make a huge dif...

Ai runs the best on nvidia GPUs

#

So the 4060 ti 16gb would beat the other two easily

grim aspen Sep 11, 2024, 10:44 AM

#

warm junco So the 4060 ti 16gb would beat the other two easily

thanks then I think I'll save up some more

warm junco Sep 11, 2024, 10:46 AM

#

No problem 🙂

#

And yea its worth to the 100 bucks if you plan on using a lot of ai tools

#

Most of these local tools use Cuda made by nvidia

grim aspen Sep 11, 2024, 10:48 AM

#

warm junco And yea its worth to the 100 bucks if you plan on using a lot of ai tools

I already am using sd a lot but my current gpu sucks and I really want to use larger models

warm junco Sep 11, 2024, 10:48 AM

#

grim aspen I already am using sd a lot but my current gpu sucks and I really want to use la...

What's your current gpu?

grim aspen Sep 11, 2024, 10:49 AM

#

warm junco What's your current gpu?

a GeForce RTX 3060 from a miner that can break any moment

clever musk Sep 11, 2024, 10:50 AM

#

Hey guys, general question if anyone knows, what's the point of upscaling?

I mean, I understand it improves quality, but why would I upscale if I can generate at higher pixels from start, say 1024x1024 instead of 512x512?

warm junco Sep 11, 2024, 10:54 AM

#

grim aspen a GeForce RTX 3060 from a miner that can break any moment

Oh okay, but if it has 12gb you can use sdxl and even flux on it

#

Even with 8gb vram sdxl/pony works

warm junco Sep 11, 2024, 10:56 AM

#

clever musk Hey guys, general question if anyone knows, what's the point of upscaling? I m...

Hey, because models are trained on specific resolutions you shouldn't generate much higher than that.
For example:
1.5 models are trained on 512x512
SDXL/Pony on 1024x1024

If you generate native 1024x1024 with an 1.5 model you can get artefacts and duplicates

grim aspen Sep 11, 2024, 10:57 AM

#

warm junco Even with 8gb vram sdxl/pony works

what is pony?

warm junco Sep 11, 2024, 10:57 AM

#

An variant of sdxl

grim aspen Sep 11, 2024, 10:58 AM

#

does it create those cartoon horses or is that something else?

warm junco Sep 11, 2024, 10:59 AM

#

It can, but there are specific anime pony or realism pony versions

#

They are very good at generating normal hands

grim aspen Sep 11, 2024, 11:00 AM

#

ahh so it's not just for generating those horses

warm junco Sep 11, 2024, 11:00 AM

#

Nope

grim aspen Sep 11, 2024, 11:01 AM

#

if 16 gb isn't needed then maybe I should try switching the ui

warm junco Sep 11, 2024, 11:01 AM

#

What ui do you use?

grim aspen Sep 11, 2024, 11:01 AM

#

auto1111

#

I've read that forge is better

warm junco Sep 11, 2024, 11:01 AM

#

That should work then with your GPU

grim aspen Sep 11, 2024, 11:01 AM

#

nah it keeps crashing when it loads 6gb+ models

warm junco Sep 11, 2024, 11:02 AM

#

Ah can you show me your content of the webui-user.bat ?
In #🤝｜tech-support

#

We can fix that

grim aspen Sep 11, 2024, 11:03 AM

#

warm junco Ah can you show me your content of the webui-user.bat ? In <#1002602742667280404...

sorry it's on a different pc

clever musk Sep 11, 2024, 11:04 AM

#

warm junco Hey, because models are trained on specific resolutions you shouldn't generate m...

Oh that makes sense, thanks😎

#

So just 512x512 and upscale if I want 4k?

grim aspen Sep 11, 2024, 11:04 AM

#

then what are 16+gb gpus needed for?

warm junco Sep 11, 2024, 11:04 AM

#

grim aspen sorry it's on a different pc

Ah no problem.
But make sure you have
--xformers --no-half-vae in the webui-user.bat
And if it still crashes you need to increase the Windows Pagefile.
Feel free to ask me in #🤝｜tech-support if you try it again

grim aspen Sep 11, 2024, 11:05 AM

#

I'm on linux tho XD

warm junco Sep 11, 2024, 11:06 AM

#

clever musk So just 512x512 and upscale if I want 4k?

If you use an 1.5 model then stay near that resolution. But what works good to is for example
960x540 and then Upscale by 2 to get Full HD.
Then upscale that image in img2img for 4k

warm junco Sep 11, 2024, 11:06 AM

#

grim aspen I'm on linux tho XD

Ohhh xD

zenith latch Sep 11, 2024, 11:06 AM

#

waow

grim aspen Sep 11, 2024, 11:06 AM

#

yea I've used --xformers and --no-half-vae before gotta check if it's still in the bat/sh

#

Does the 12gb flux model really work with less then 16gb?

warm junco Sep 11, 2024, 11:14 AM

#

grim aspen Does the 12gb flux model really work with less then 16gb?

Yes

#

But if you want to use flux you need to use Forge or Comfyui

grim aspen Sep 11, 2024, 11:14 AM

#

is the model only partially loaded or how does that work as the os already uses some of the vram

grim aspen Sep 11, 2024, 11:15 AM

#

warm junco But if you want to use flux you need to use Forge or Comfyui

forge actually looks good it's basically auto1111 but comfyui is a bit complicated

warm junco Sep 11, 2024, 11:15 AM

#

grim aspen is the model only partially loaded or how does that work as the os already uses ...

Yes it gets partially loaded and into the ram too

grim aspen Sep 11, 2024, 11:16 AM

#

ahhh nice then I'll save up even more and get some 20gb+ or couple years later

#

if the current gpu lasts that long that is

#

thx so much

#

saved me 300 to 400 bucks XD

#

I thought hands would take like another 2 to 3 years it's incredible the hands issue has already been fixed

warm junco Sep 11, 2024, 11:19 AM

#

If you try flux, best use the nf4 model

#

It should be the fastest

clever musk Sep 11, 2024, 11:22 AM

#

Anyone has any experience relating graphic cards?

Would Nvidia Quadro p620 be any good in generating images? (It's only 2gb vram)

agile tusk Sep 11, 2024, 11:27 AM

#

No

warm junco Sep 11, 2024, 11:31 AM

#

clever musk Anyone has any experience relating graphic cards? Would Nvidia Quadro p620 be ...

It could work

#

But its not good

clever musk Sep 11, 2024, 12:19 PM

#

warm junco But its not good

Thank you for all the help!

proud dawn Sep 11, 2024, 12:21 PM

#

Are there any other good Ais besides anything sd or mj related?

wraith notch Sep 11, 2024, 12:55 PM

#

proud dawn Are there any other good Ais besides anything sd or mj related?

want to know too

earnest atlas Sep 11, 2024, 1:20 PM

#

I have images of a book in the closed and fully opened state. How can I generate the intermediate frames between these two images to create a smooth book opening animation?

hard swift Sep 11, 2024, 2:29 PM

#

hi

flint night Sep 11, 2024, 3:21 PM

#

Hi

desert dagger Sep 11, 2024, 3:54 PM

#

earnest atlas I have images of a book in the closed and fully opened state. How can I generate...

you might try using luma or kling - they do a good job of interprelation

hardy nexus Sep 11, 2024, 5:29 PM

#

so apparently they are going to realease sd3 8B and realeased a finetuning guide for Sd3 2B

worthy bone Sep 11, 2024, 5:49 PM

#

hardy nexus so apparently they are going to realease sd3 8B and realeased a finetuning guide...

there is no release date for sd3 8B yet right?

hardy nexus Sep 11, 2024, 6:01 PM

#

worthy bone there is no release date for sd3 8B yet right?

lykon just said so in twitter

#

but no release data

#

date

ember tide Sep 11, 2024, 7:21 PM

#

What was reflection

#

Popular for

quartz siren Sep 11, 2024, 8:30 PM

#

ember tide What was reflection

the llm? if so, it was supposed to beat all other other llms bc it could "think" and "reflect". It was trained on 70b llama3.1 and beat gpt4o, claude sonnet 3.5, 405b llama3.1 in benchmarks. However, it was kind of a scam since the open source one is completely different and performs worse the 70b llama 3.1, and their api was much better but still not as good as advertised. However, people quickly found out, that the api seemed to act similar to claude sonnet 3.5 and could not say "claude" and had special tokens only claude has but not llama.

granite peak Sep 12, 2024, 12:18 AM

#

hi!

ember tide Sep 12, 2024, 12:25 AM

#

quartz siren the llm? if so, it was supposed to beat all other other llms bc it could "think"...

Oh I see

patent veldt Sep 12, 2024, 2:17 AM

#

good morning

wispy oasis Sep 12, 2024, 2:59 AM

#

hey folks

#

is it allowed to discuss third party (premium) services for AI photo generation? I know some of these (or most) at some point were based on stable. Would appreciate any info you opinion of the "best" text to image service ($) and why you think it is the best. When I say best, I mean in general across all types of iamges like realistic, cartoon, paintings, whatever you tell it. I understand some may be tailored to specific needs. (or just the most popular right now)? If this is againt the rules please remove my message and let me know. Thanks

desert dagger Sep 12, 2024, 5:17 AM

#

wispy oasis is it allowed to discuss third party (premium) services for AI photo generation?...

is there a reason you want to pay for something you can get for free?

wispy oasis Sep 12, 2024, 5:17 AM

#

desert dagger is there a reason you want to pay for something you can get for free?

@desert dagger convenience

desert dagger Sep 12, 2024, 5:17 AM

#

quartz siren the llm? if so, it was supposed to beat all other other llms bc it could "think"...

there's a new model on meta.AI - you get a few preview messages with it, then it swaps you back over to 70B

pseudo arch Sep 12, 2024, 5:18 AM

#

saintwave

desert dagger Sep 12, 2024, 5:18 AM

#

wispy oasis <@407561236339752981> convenience

have you tried this? https://easydiffusion.github.io/

analog stream Sep 12, 2024, 5:42 AM

#

Would anyone tell me how to get as good as the result that we get in Midjourney for this prompt

A moon with a large circular hole filled with glowing yellow electronics.
Details: Intricate details, photorealistic rendering, textured lunar surface, craters, soft ambient lighting, visible wires, circuits, and chips, warm yellow glow, cinematic lighting, depth of field, volumetric lighting.
Style: beeple, Greg Rutkowski, trending on artstation, hyperrealistic.

#

I submitted Midjourney image in #🏞｜general-with-images

copper crystal Sep 12, 2024, 6:01 AM

#

been running tests with forge out of curiosity. i assumed that telling it to use less ram would slow it down a ton. but i told it to only use 4GB for flux generations, andit's only using 4GB for flux generations. so um. ok. doing 1.5 mp generations usually at 40-45 seconds. now at 86 seconds. Twice as much but really, not that bad for such a signicant memory savings.

copper crystal Sep 12, 2024, 6:17 AM

#

3GB limit works too. no speed change. 25 step 1.5mp at 85 seconds.

#

uses all my system mem though so i guess it helps to have a lot of good system mem

desert dagger Sep 12, 2024, 6:19 AM

#

analog stream Would anyone tell me how to get as good as the result that we get in Midjourney ...

with which AI?

copper crystal Sep 12, 2024, 6:19 AM

#

flux dev

#

nf4 version 2, but it should work for any model too

#

this is such a flex on the memory management that he was using before and gutted completely. must've engineered this solution to one up on teh comfyui code he got accused of ripping off. it's such a massive flex, but nobody realizing it. I bet the coders at comfy org do though.

desert dagger Sep 12, 2024, 6:23 AM

#

copper crystal this is such a flex on the memory management that he was using before and gutted...

who are you accusing of flexing?

copper crystal Sep 12, 2024, 6:25 AM

#

it's a proper good flex of skill and achievement. the author of forge, controlnet, fooocus. has a russian name i think.

trim magnet Sep 12, 2024, 6:26 AM

#

nah its an anime name

copper crystal Sep 12, 2024, 6:27 AM

#

https://tenor.com/view/zyzz-flex-motivational-stare-anytime-fitness-gif-23227512

desert dagger Sep 12, 2024, 6:28 AM

#

https://github.com/lllyasviel

copper crystal Sep 12, 2024, 6:28 AM

#

i got it so flux is only using 1gb of my vram. wtf. 1min generations with the real sampler, with 1gb of vram.

#

more people should be talking about this i feel. oh well.

desert dagger Sep 12, 2024, 6:29 AM

#

real name: Lvmin Zhang

trim magnet Sep 12, 2024, 6:29 AM

#

if only training was like sdxl 😔

fervent thunder Sep 12, 2024, 6:29 AM

#

comfy is better at flux inference cos it has the FP8 speed boost though

copper crystal Sep 12, 2024, 6:29 AM

#

trim magnet if only training was like sdxl 😔

better than sdxl imo

trim magnet Sep 12, 2024, 6:30 AM

#

no cuz it takes longer,maybe later it will be improved though

copper crystal Sep 12, 2024, 6:30 AM

#

i'm having nothing but wins using lion 8bit. adafactor wasn't doing it for me. i didn't try anything else

#

i trained 3 loras of diff ladies today. 15-30 image sets. might try to combine a few sets and do a 100 image mega set. should take over night i figure

trim magnet Sep 12, 2024, 6:31 AM

#

yea tried with prodigy and adam but only adafactor works with style im making

copper crystal Sep 12, 2024, 6:32 AM

#

i think lion is adaptive, but i run it at a constant rate. sometimes with cosines.

#

converges at 500-700 steps usually

trim magnet Sep 12, 2024, 6:33 AM

#

do u use kohya,the derrian trainer,one trainer or ai toolkit?

copper crystal Sep 12, 2024, 6:33 AM

#

i dont fuck with anything but kohya. every other trainer has way too many hype artists to cut through

#

kohya doesn't ever say shit. just keeps workin

#

plus he's japanese i think so we'd be like "what?"

trim magnet Sep 12, 2024, 6:34 AM

#

yea i tried with derrian and fluxgym but dont work i guess ill try kohya again

copper crystal Sep 12, 2024, 6:34 AM

#

fluxgym uses kohya at the back iirc

#

https://github.com/bmaltais/kohya_ss/tree/sd3-flux.1 this what i been on

trim magnet Sep 12, 2024, 6:35 AM

#

yea i always get the code1 error and the florence tagger also got stuck 😔

copper crystal Sep 12, 2024, 6:35 AM

#

i use taggui too.

trim magnet Sep 12, 2024, 6:35 AM

#

copper crystal https://github.com/bmaltais/kohya_ss/tree/sd3-flux.1 this what i been on

ill try that branch later,how much vram u have?

copper crystal Sep 12, 2024, 6:36 AM

#

https://github.com/jhc13/taggui trickier to set up but it works nicely.

i have 16gb

trim magnet Sep 12, 2024, 6:36 AM

#

yea thats prob why i couldnt make the others work i have 12gb so ill wait for more optimizations

copper crystal Sep 12, 2024, 6:37 AM

#

can train only 2 layers i think. that'll help bring it into line. split the model between ram and vram helps. little slower though

#

i guess in that memory hungry sense, it doesn't train as well as sdxl haha

trim magnet Sep 12, 2024, 6:39 AM

#

yea ill just check the civitai training prices for flux,the moment they lower the cost from 2k to 500 buzz thats the moment ill know they optimized the trainer for the lower end gpus catsprout

desert dagger Sep 12, 2024, 6:40 AM

#

trim magnet yea ill just check the civitai training prices for flux,the moment they lower th...

glif just brought their flux lora trainer online

trim magnet Sep 12, 2024, 6:40 AM

#

oh yea ill check there too

copper crystal Sep 12, 2024, 6:40 AM

#

https://github.com/kohya-ss/sd-scripts/tree/sd3?tab=readme-ov-file#flux1-lora-training in here they mention

The training can be done with 12GB VRAM GPUs with Adafactor optimizer, --split_mode and train_blocks=single options. Please use settings like below:

then gives you a training command that would fit in 12GB

#

something like lion 8 should drop in to replace adafactor and get more savings

copper crystal Sep 12, 2024, 6:46 AM

#

trim magnet nah its an anime name

which anime i'm curious

trim magnet Sep 12, 2024, 6:46 AM

#

copper crystal which anime i'm curious

fate

#

the prisma illya one is the one where shes the mc

copper crystal Sep 12, 2024, 6:47 AM

#

lol aight then

#

i was hoping a cool one like hellsing

trim magnet Sep 12, 2024, 6:50 AM

#

copper crystal i was hoping a cool one like hellsing

well fate/zero is cool but prisma illya is like a spin off full of fanservice with a magical girl plot so maybe thats why he likes it thomas

copper crystal Sep 12, 2024, 7:01 AM

#

i like sword art. except the season where its all about their digital family with a chatbot baby child

#

season 1.5

trim magnet Sep 12, 2024, 7:03 AM

#

yea same here the fights are the best part of it

hard swift Sep 12, 2024, 7:56 AM

#

gm

hexed scroll Sep 12, 2024, 12:38 PM

#

Make both people in the photo face forward.

plain raptor Sep 12, 2024, 1:03 PM

#

Ok, push button, to press, use space bar

rain rain Sep 12, 2024, 2:47 PM

#

Hi, is this the correct server to ask questions about local genning with SD forge?

blissful ibex Sep 12, 2024, 4:42 PM

#

hi

grand rain Sep 12, 2024, 4:47 PM

#

Anyone know good free tools for batch text removal from images?

nova mason Sep 12, 2024, 5:43 PM

#

Hi

tall berry Sep 12, 2024, 5:54 PM

#

Anyone here good with Lora training? Came across an issue

#

I asked a q in the chat help, may need some insight haha

pine tide Sep 12, 2024, 6:16 PM

#

Hey everyone! Is there a way to automate a list of character names through a specific part of a prompt in ComfyUI? To clarify, I have 20 characters, and I want to generate one pose for each. Since I'll be frequently changing the poses, I'm wondering if there's a way to automate inserting the character names into the text prompt node, rather than manually typing them each time. Any tips or workflows to streamline this process?

copper crystal Sep 12, 2024, 6:52 PM

#

internship implies free work. is this a paid internship, or are you looking for slaves? At least you're not charging people ot be part of it.

my point is... going onto discord chatrooms and looking for unpaid interns is pretty fucking greasy.

#

oh it was removed. good call

#

What is even with the mouse cursor on that website? With design like that, you can be sure a business has absolutely zero real world experience with tech. A web design that bad is a canary in a coal mine. It indicates there's zero expertise at the fundamental level

#

https://www.cosmic365.ai/ i'm linking it again just so people can see how shady an bad it is

fervent thunder Sep 12, 2024, 7:06 PM

#

Looks super legit to me!!!

#

Look at those blinding design skillz

#

And hey; it's pretty darn cosmic

lyric isle Sep 12, 2024, 7:33 PM

#

do you guys know if it's possible to install a1111 with an already existing forge instillation without having to have the entire program and all it's dependencies ran though? I just want to be able to run batches through adetailer and I can't without a1111 but i don't have a ton of space for multiple installs

desert dagger Sep 12, 2024, 7:38 PM

#

lyric isle do you guys know if it's possible to install a1111 with an already existing forg...

you're going to have to have all of a1111 installed for anything to run in it

lyric isle Sep 12, 2024, 7:42 PM

#

dang, I can't get XYZ grids to work nor adetailer batches, do you know anything about how to get those to work in forge? ;-;

desert dagger Sep 12, 2024, 7:51 PM

#

lyric isle dang, I can't get XYZ grids to work nor adetailer batches, do you know anything ...

no, sorry, i use comfyUI. you might ask in #🤝｜tech-support

green nova Sep 12, 2024, 8:26 PM

#

Hello people.

fervent thunder Sep 12, 2024, 8:51 PM

#

green nova Hello people.

Considering this is an AI channel you are taking a lot for granted with that statement.

#

🤨

rapid comet Sep 12, 2024, 9:28 PM

#

Anybody know how to bring down the contrast of a lightning model if I’m using eg Steps 4 and CFG 1.5?

Wondering about dynamic thresholding but not sure what settings to use

Also if i was to up the main CFG past 2, what also to counter with . Thanks

desert dagger Sep 12, 2024, 9:50 PM

#

fervent thunder Considering this is an AI channel you are taking a lot for granted with that sta...

AI are people too

pine tide Sep 12, 2024, 10:47 PM

#

copper crystal https://www.cosmic365.ai/ i'm linking it again just so people can see how shady ...

lol the yellow arrow on the cursor is so ugly

copper crystal Sep 12, 2024, 10:48 PM

#

pine tide lol the yellow arrow on the cursor is so ugly

its HORRIBLE

leaden cargo Sep 12, 2024, 11:05 PM

#

Hello, is there anyone who can guide me, thank you

#

درود به همه فارسی زبان کسی اینجا هست ایا؟

#

What exactly should be done here?

#

😁 😆

leaden cargo Sep 12, 2024, 11:08 PM

#

pine tide lol the yellow arrow on the cursor is so ugly

where do you say

wise aspen Sep 13, 2024, 12:02 AM

#

what is best extension to FORGE similar to it? https://github.com/butaixianran/Stable-Diffusion-Webui-Civitai-Helper cuz this one not working with forge, any1?

fervent thunder Sep 13, 2024, 11:30 AM

#

rapid comet Anybody know how to bring down the contrast of a lightning model if I’m using eg...

tonemapnoisewithrescalecfg and Skimmed CFG are what I use

#

on regular models I use CFG 20-30 and on lightning/TCD models, I use CFG 10

#

so yeah these two nodes can definitely lower CFG burn lol

harsh agate Sep 13, 2024, 12:34 PM

#

can anyone suggest me is there any better mdoel than clip for finetuing sdxl

vital monolith Sep 13, 2024, 12:54 PM

#

There any good "text to video" AI things you can download and use on your PC (so you don't have limited uses)?

royal ember Sep 13, 2024, 1:10 PM

#

If anyone is interested in art dm me i need urgent commision

copper crystal Sep 13, 2024, 3:22 PM

#

royal ember If anyone is interested in art dm me i need urgent commision

Scam. Don't do it. Don't be stupid.

fervent thunder Sep 13, 2024, 3:31 PM

#

harsh agate can anyone suggest me is there any better mdoel than clip for finetuing sdxl

not rly

#

there are some variants

#

but I'd rather not change to them

zenith lance Sep 13, 2024, 3:32 PM

#

Hi everyone, I'm just starting out with SD and I'm beginning to understand it.

However, when I see the prompt as an example, I often see montions like ‘score_9, score_8_up, score_7_up, score_6_up,’

I've tried putting ‘score_x’, but I don't understand or see how that influences....

Could you please explain?

fervent thunder Sep 13, 2024, 3:32 PM

#

that's a prompt for a model called Pony

#

people put it in their prompt for SDXL by mistake

copper crystal Sep 13, 2024, 3:36 PM

#

not even the author of Pony can explain them. Story always seems to change. sometimes people say it's intended and a good thing. sometimes people say he mistakeningly did it. everyone has a source for their information. I think he was just throwing spaghetti at the wall to see what worked and started reverse reasoning his way around it.

ultimately the 20 extra tokens required on the refined pony xl model are not worth it.

fervent thunder Sep 13, 2024, 3:40 PM

#

copper crystal not even the author of Pony can explain them. Story always seems to change. some...

not sure about that
he gave a coherent explanation
images were tagged with the scores with the intention to use one score in a prompt, but instead of learning how each score works the model learnt that a string of several scores is good

copper crystal Sep 13, 2024, 3:44 PM

#

people have told me other explanations with the same anecdotal "he said" . and there's always a source for it. yeah. i know. I know there are explanations out there.

I just think they were reverse reasoned after the fact and aren't the real reasons.

fervent thunder Sep 13, 2024, 3:45 PM

#

luckily there is a way to test this numerically

copper crystal Sep 13, 2024, 3:45 PM

#

pony forgets way to much of the base model's knowledge for it to have been properly planned. he fluked out when the training data produced what it did

fervent thunder Sep 13, 2024, 3:45 PM

#

take the conditioning to text code node from here and it can be measured https://github.com/Extraltodeus/Conditioning-token-experiments-for-ComfyUI

copper crystal Sep 13, 2024, 3:48 PM

#

naw. that aint my domain. i'm not that kind of artist to test and combine numbers 100 ways. some of those charts on there scare me.

using information dumps like that to magically suggest pony had a plan is weird. i dont know how it relates. how could anyone possibly?

fervent thunder Sep 13, 2024, 3:49 PM

#

if you don't want to measure it then that's fine
but that's the method to verify which side is right

copper crystal Sep 13, 2024, 3:49 PM

#

lol if you say so. feels like big troll energy

ionic wraith Sep 13, 2024, 3:50 PM

#

For windows 10, do you guys perfer annaconda navigator or just plain cmd with py?

copper crystal Sep 13, 2024, 3:50 PM

#

"go discover pony's intentions while training by testing 100 combinations of tokens" naw

#

people keep upholding the score tags. they're not good. they didn't work.

fervent thunder Sep 13, 2024, 3:52 PM

#

why would it require testing hundreds of combinations of tokens?

#

I don't understand

copper crystal Sep 13, 2024, 3:52 PM

#

that's the method you showed me. dude explained how he made nearly 1000 test examples. and honsetly his prompts look schizophrenic

fervent thunder Sep 13, 2024, 3:52 PM

#

ionic wraith For windows 10, do you guys perfer annaconda navigator or just plain cmd with py...

prefer terminal to GUIs like that

#

ah no I'm not saying to replicate what he used the node pack for in that repo

#

I'm saying to take that node and then generate with all the tags, see what the nearest prompts are in the vector space

#

then do the same for just one tag

#

and then for no tag

copper crystal Sep 13, 2024, 3:56 PM

#

i'm not going to do homework on ponyxl. its just not worth it. it's not an academic achievement of a model and isn't worth researching. it's a lemon.

fervent thunder Sep 13, 2024, 3:57 PM

#

yeah if you don't want to do that its fine, I was just telling the method

#

a funny example from that repo is that
the default Comfy UI prompt "beautiful scenery nature glass bottle landscape, , purple galaxy bottle,"
gives lavender in SDXL

#

it was always a mystery

#

turn out the 4th closest prompt has lavender lavender lavender lavender

copper crystal Sep 13, 2024, 4:04 PM

#

lavender and purple are basically synonymous in human language. that's my guess

#

you can look at how the vector space is but the reason it's like that is become synonyms

#

i never heard of that mystery until now. should've just asked me at the start .

fervent thunder Sep 13, 2024, 4:06 PM

#

ah yeah that's a great point they are synonyms

#

I get a funny one currently
I like the tokens "colorful background" a lot on SDXL

#

but if you boost them too high you get fluffy objects

copper crystal Sep 13, 2024, 4:08 PM

#

"scenic background" is one i enjoy. maybe i'll try "scenic colorful background"

fervent thunder Sep 13, 2024, 4:08 PM

#

there was a study that looked for good tokens

#

I will try to find it

#

they found 14

#

oh yea this one https://arxiv.org/pdf/2209.11711

#

and the result was this:

#

art, dramatic lighting, high detail, highly detailed, hyper realistic, intricate, intricate sharp details,
octane render, smooth, studio lighting, trending on artstation```

copper crystal Sep 13, 2024, 4:10 PM

#

missing rutkowski

fervent thunder Sep 13, 2024, 4:10 PM

#

lol

copper crystal Sep 13, 2024, 4:11 PM

#

i actually wish that "hyper realistic" was more aligned with real hyperrealism style

fervent thunder Sep 13, 2024, 4:11 PM

#

yeah a lot of tokens are a let down

#

"cinematic" is an excellent token though in particular

#

I use A1111 weighting when I can, so I can boost it further

#

not every node as a A1111 option, most don't in fact

copper crystal Sep 13, 2024, 4:12 PM

#

i thought comfyui supported prompt weights natively

fervent thunder Sep 13, 2024, 4:12 PM

#

it does but only to like 1.3

#

whereas A1111 goes to like 2

#

for a short prompt

copper crystal Sep 13, 2024, 4:13 PM

#

hmm. i remember using those lora sliders with 3 and 4 ratings, on both uis.

fervent thunder Sep 13, 2024, 4:13 PM

#

perp-weight goes to like 50 but the tokens are nowhere near as strong with perp-weight

#

oh the UI offers it, its just that funny stuff happens to the image

copper crystal Sep 13, 2024, 4:13 PM

#

lora sliders are easy test. they have significant difference between 2 and 3

fervent thunder Sep 13, 2024, 4:14 PM

#

are you talking about the lora strength or the clip strength?
cos load lora node has 2 sliders

copper crystal Sep 13, 2024, 4:14 PM

#

o i c. you didn't actually mean "it only goes to 1.3" you meant something else. got it. this is what i think pony's explanations of the tags were too.

fervent thunder Sep 13, 2024, 4:14 PM

#

oh you thought I meant the slider ends

#

yeah I meant the image breaks after 1.3

#

I haven't done the pony test I mentioned BTW

#

I might at some point

#

but like you, I'm not that bothered about Pony cos its broken

copper crystal Sep 13, 2024, 4:19 PM

#

cool gfy

fervent thunder Sep 13, 2024, 4:20 PM

#

I get what you're saying
that he might have not communicated correctly what he actually did

#

or that he didn't know what he did

peak bobcat Sep 13, 2024, 4:26 PM

#

hello!

copper crystal Sep 13, 2024, 4:38 PM

#

https://github.com/OutofAi/OutofFocus new sd2.1 tool.

#

i think there's a professor out there whos teaching his students to write all their research on 2.1 so that "the community" won't abuse it right out of the gate

#

there's been a few 2.1 projects that are novel and effective, coming out lately

#

dynamic compensation sampler is another

fervent thunder Sep 13, 2024, 5:00 PM

#

copper crystal dynamic compensation sampler is another

yeah that's the best sampler in the world as far as I know

#

would be amazing for flux

#

hello

peak bobcat Sep 13, 2024, 5:01 PM

#

fervent thunder hello

Hi

#

Nice to meet you

copper crystal Sep 13, 2024, 5:06 PM

#

spammers begging people not to bot stomp thier servers when they spam now. cute

#

on a spam server. ok

desert dagger Sep 13, 2024, 5:17 PM

#

you're spamming about a scam

severe burrow Sep 13, 2024, 5:17 PM

#

desert dagger you're spamming about a scam

absolutly no

desert dagger Sep 13, 2024, 5:18 PM

#

severe burrow absolutly no

absolutely not the right discord to post this on

copper crystal Sep 13, 2024, 5:18 PM

#

i say we bot brigade their telegram "investor" channel

severe burrow Sep 13, 2024, 5:18 PM

#

desert dagger absolutely not the right discord to post this on

You are right

copper crystal Sep 13, 2024, 5:19 PM

#

here's what'll happen. they pump a lot of stocks up. they convince a few people to get in on the pump. the people who actually initiated it will exit and then everyone is fucked. Scam

#

if it's even that kind of op. this is porbably just a guy trying to scam gift cards

severe burrow Sep 13, 2024, 5:20 PM

#

I don't talk to people who don't see the opportunity, I'm sorry for you, have a good life everyone

copper crystal Sep 13, 2024, 5:20 PM

#

severe burrow I don't talk to people who don't see the opportunity, I'm sorry for you, have a ...

i have plenty of opportunity. the secret to my sucesss is recognizing bad opportunity and scams

narrow sluice Sep 13, 2024, 6:40 PM

#

copper crystal if it's even that kind of op. this is porbably just a guy trying to scam gift ca...

If I want money, I'd become a damn good AI software engineer and get millions in funding

#

Instead of sad clickbait scams that would be laughed out on the average Discord server 😂

ionic wraith Sep 13, 2024, 6:41 PM

#

severe burrow I don't talk to people who don't see the opportunity, I'm sorry for you, have a ...

Opportunity in a pyramid sceme?

narrow sluice Sep 13, 2024, 6:41 PM

#

ionic wraith Opportunity in a pyramid sceme?

Simple.
Be the pyramid scheme

ionic wraith Sep 13, 2024, 6:42 PM

#

Thats smart, will try this out

copper crystal Sep 13, 2024, 6:44 PM

#

narrow sluice If I want money, I'd become a damn good AI software engineer and get millions in...

that's my plan too, but it's been problematic since i'm not an engineer and studying is hard

wind ingot Sep 13, 2024, 8:09 PM

#

Hi,anyone knows if there's any way to "print" generation times in XYZ plot of Forge/A1111?

zenith lance Sep 13, 2024, 9:54 PM

#

wind ingot Hi,anyone knows if there's any way to "print" generation times in XYZ plot of Fo...

you can find yours grid at \webui\outputs\txt2img-grids. Open you image and print. (if I understand your question correctly)

wind ingot Sep 13, 2024, 9:58 PM

#

@zenith lance Not quite, the grids are ok, but I would like to overlay the generation times, like: I'm testing Flux models vs Steps and would like to know how many seconds each image took to generate

#

I could probably check that latter one by one , but would be really nice to have that "printed" in the plot

#

It seems Schnell generates/resolves in less steps altought with less qualitity than Dev, but would like to know the "time saved" on a decent generation

#

Hmm,cant post images here

#

#🏞｜general-with-images message

lilac forge Sep 14, 2024, 1:52 AM

#

When I'm increasing the curves of a female subject (not overly so), a lot of times I end up with the subject losing clothes. For example, let's say my subject is Zero Suit Samus and I want her to keep her full bodied Zero Suit on - her suit starts right below the chin like a turtleneck would and covers her entire body including hands, fingers, feet, everything.
This question isn't about Samus, she's just an example. What I'm really looking for are phrases to use to ensure that all the clothes the subject is supposed to have on actually STAY on. Of course, that's with me telling the prompt what clothes the subject is wearing.

covert kestrel Sep 14, 2024, 3:18 AM

#

Quick question: Can I use SDXL checkpoint on lower resolutions (i.e 512x512)?

fervent thunder Sep 14, 2024, 3:23 AM

#

yes but only with res adapter

desert dagger Sep 14, 2024, 4:50 AM

#

lilac forge When I'm increasing the curves of a female subject (not overly so), a lot of tim...

try specifying stuff like sleeves, leggings, boots - stuff that beats the AI over the head with the fact she's wearing clothes - but isn't the word clothes or clothing or clothed

fervent thunder Sep 14, 2024, 6:04 AM

#

does anyone have workflow of controlnet inpaint with flux ? Using alimama-creative/FLUX.1-dev-Controlnet-Inpainting-Alpha

hushed stump Sep 14, 2024, 8:49 AM

#

i have some issues using HI red settings, its been fine for months when suddenly when i tried it today, the progress bar is stuck at 0% can anyone help me with this please?

empty star Sep 14, 2024, 9:20 AM

#

which CUDA version is recommended to use with a 4090?

fossil briar Sep 14, 2024, 9:35 AM

#

gm

tame rampart Sep 14, 2024, 9:37 AM

#

gm

clever hemlock Sep 14, 2024, 10:26 AM

#

gm

wraith cave Sep 14, 2024, 10:43 AM

#

gm fam

floral umbra Sep 14, 2024, 10:53 AM

#

Hoi, was ai-toolkit only for when training flux loras? Or can it be used for SDXL/1.5 as well?

white parrot Sep 14, 2024, 11:58 AM

#

can i upscale more than 2 img at once?

unreal glade Sep 14, 2024, 12:06 PM

#

Hey

#

On Linux, there's other things to run AMD GPU other than directml

#

What was that called?

#

rocm? zluda?

fervent thunder Sep 14, 2024, 12:41 PM

#

anyone got juggernaut XL to work on fooocus ?

#

my guess is that fooocus does not detect the baked in vae, and uses sdxl vae instead, so i get very bad images. other juggernaut versions work fine.

#

would recommend switching from fooocus to comfyui, diffusers or pure pytorch scripts

#

probably comfyui

fervent thunder Sep 14, 2024, 1:01 PM

#

fervent thunder would recommend switching from fooocus to comfyui, diffusers or pure pytorch scr...

fooocus has great inpainting, ive found that hard to replicated on comfy

fervent thunder Sep 14, 2024, 1:19 PM

#

I see people say this a lot but I never really knew why
when I used Impact pack on comfy it worked perfectly out of the box

#

can you share a workflow with sdxl inpaint, id greatly appreciate it.

dense field Sep 14, 2024, 4:02 PM

#

guy during the Ottoman Empire

#

Photo in the style of realistic photography, an open black box, a faint glow inside.

floral umbra Sep 14, 2024, 4:42 PM

#

white parrot can i upscale more than 2 img at once?

As far as i know of, you can batch them to do one at a time, but to do 2 upscales at once, you'd want to run 2 SD instances at once, provided you have enough video memory/ram

white parrot Sep 14, 2024, 4:43 PM

#

floral umbra As far as i know of, you can batch them to do one at a time, but to do 2 upscale...

so just about how much pc can handle ic

nocturne mural Sep 14, 2024, 5:59 PM

#

ey

#

weath

bleak matrix Sep 14, 2024, 6:08 PM

#

Good afternoon, everyone! How are you all doing?

fervent thunder Sep 14, 2024, 6:12 PM

#

okay 🙂

iron hazel Sep 14, 2024, 6:33 PM

#

Guys, last time i used SD was more than 7 months, been using automatic1111 but i dont know if its defunct now or upgraded. Is it still viable or is there alternatives?

quartz siren Sep 14, 2024, 6:48 PM

#

iron hazel Guys, last time i used SD was more than 7 months, been using automatic1111 but i...

there are lots of much better models now, and a111 is fine but comfyui, swarm, forge and diffusers are great alternatives now.

I would probably recommend switching to a better model now, flux came out and it's incredibly good at text rendering and prompt following and human anatomy. It's vram requirements are a bit hefty and requires roughly a 8gb vram gpu at the least.

iron hazel Sep 14, 2024, 6:48 PM

#

where to find flux? is there an installation guide for it?

#

whats the second best from those options? according to you?

quartz siren Sep 14, 2024, 6:52 PM

#

iron hazel where to find flux? is there an installation guide for it?

you can probably just search up installation guides, these are the original models(dev is better but schnell is much faster): https://huggingface.co/black-forest-labs/FLUX.1-dev
https://huggingface.co/black-forest-labs/FLUX.1-schnell

and I think comfyui is probably the best ui and diffusers as a library

iron hazel Sep 14, 2024, 6:59 PM

#

a side question can I install the ui to an external HD or its best to keep it on pc?

main snow Sep 14, 2024, 8:41 PM

#

Anybody with Photoshop or similar apps knowledge? Need a lil help

azure walrus Sep 14, 2024, 8:44 PM

#

Anyone can help me

main snow Sep 14, 2024, 8:44 PM

#

Mood

azure walrus Sep 14, 2024, 8:44 PM

#

My stable diffusion I got extension which is wildcard galery but preview doesnt show up how can I solve that ?

desert dagger Sep 14, 2024, 9:57 PM

#

main snow Anybody with Photoshop or similar apps knowledge? Need a lil help

i can help you with photoshop, what's up?

main snow Sep 14, 2024, 9:57 PM

#

desert dagger i can help you with photoshop, what's up?

It's that one thing you replied to actually, trying to change a texture from a yellow tinto to a more natural pinkish, it's a skin

ionic wraith Sep 14, 2024, 9:58 PM

#

for photoshop you should use the ai generative fill

#

i think you would get a simillair result

main snow Sep 14, 2024, 9:58 PM

#

The person that helped me before mentioned what Todo, sadly both them and I lack such skills

#

I can try but seems hard to scoot around the non skin parts

#

With ai I mean

desert dagger Sep 14, 2024, 9:59 PM

#

main snow It's that one thing you replied to actually, trying to change a texture from a y...

oh, well - piximperfect has a very good tutorial for that on his youtube channel if you want to use photoshop. but the easy way is just select the skin and then adjust the hue

main snow Sep 14, 2024, 9:59 PM

#

I don't have Photoshop myself, I'm a broke college student 💀

desert dagger Sep 14, 2024, 9:59 PM

#

main snow With ai I mean

even for AI, you still have to create a mask over the areas where you want to change, and then prompt in the change

desert dagger Sep 14, 2024, 9:59 PM

#

main snow I don't have Photoshop myself, I'm a broke college student 💀

DM me your image, i'll help you with it

main snow Sep 14, 2024, 10:00 PM

#

Thank you! I'll send hem

ionic wraith Sep 14, 2024, 10:00 PM

#

main snow I don't have Photoshop myself, I'm a broke college student 💀

Online you can use Photopea, really similair to photoshop

desert dagger Sep 14, 2024, 10:00 PM

#

and adobe express

#

it's free

#

or canva

zenith lance Sep 14, 2024, 11:49 PM

#

hello all, when I do upscaling, how can I ensure that my image is not destroyed and that I still have to do corectionr. (and the time I waste).

wooden nebula Sep 15, 2024, 12:16 AM

#

hello

fervent thunder Sep 15, 2024, 12:18 AM

#

there is a control net called control net tile, which can help

#

using a low denoise can help too (only running the Ksampler on low sigmas)

fervent thunder Sep 15, 2024, 6:29 AM

#

hiu

covert kestrel Sep 15, 2024, 8:19 AM

#

And if you're desperate (or just edit simple stuffs), go for GIMP.

steady sparrow Sep 15, 2024, 8:42 AM

#

Guys hi

Can I use SD to upload a drawing of a map so that it uses that drawing to make that map look like an alien spaceship map?

long talon Sep 15, 2024, 8:56 AM

#

Any Flux server?

warm flame Sep 15, 2024, 10:01 AM

#

Have a good art everyone here!

unreal glade Sep 15, 2024, 11:50 AM

#

GIMP is good

high saddle Sep 15, 2024, 11:50 AM

#

hello

flint raptor Sep 15, 2024, 1:18 PM

#

any idea if sd3 large will ever be opensourced on huggingface?

floral umbra Sep 15, 2024, 1:40 PM

#

Is ai-toolkit only for flux models? Or can they be used to train loras for 1.5 and SDXL too? Thunk

#

As i found it easy enough to use that it was the first tool i've successfulyl managed to make a lora for. As i just can't wrap my head around the damn parameters and folder structure for kohya lol

quartz siren Sep 15, 2024, 2:02 PM

#

flint raptor any idea if sd3 large will ever be opensourced on huggingface?

I don't believe sd3 large will be open sourced, however stability is planning on open sourcing sd3.1 2b at least.

jaunty jackal Sep 15, 2024, 2:59 PM

#

Jssnsggajseba?

copper crystal Sep 15, 2024, 4:06 PM

#

quartz siren I don't believe sd3 large will be open sourced, however stability is planning on...

just forget stability. they relied on the men at black forest labs for any credibility in the field. it's gone now. they'll never release anything substantial again.

#

whatever happened to make them leave is something the new owners of stability need to contend with. BFL is what stability was more or less.

Don't count on SD3.1 to improve the pretraining problems

static falcon Sep 15, 2024, 4:20 PM

#

Hey guys, could someone point me in the right direction? I want to render characters on top of sketches (mistoline control net mode) but each body part will be apart from all others (legs, torso, hands, head) like in Rayman, for skeletal animation in Spine. What kind of "examples" I should prepare for such lora training? Or IP-adapter is what I should use for that? I want each sketch I do (I draw character in full, every body part in line art, just no final color) rendered by SD in a pretty way. Should I make Lora for that somehow? Appreciate any advice 🙏

#

Do I need to make "examples set" of such body parts in full color and style of what I want SD rendering, or it's possible to convey to it that all the "point of the training" is parts separation, without caring about render style of examples that I'll submit for training?

fervent thunder Sep 15, 2024, 4:29 PM

#

in general for lora training you want to use examples that are examples of the finished image that you want as output

#

so for example you would not use examples that have a different structure/composition or style to what you want

#

you use examples that match what you want in both style and composition

#

control net is different

#

for a canny or depth control net you just want the composition to be right in the example image

#

for ip adapter it can be set to composition or style mode so it depends

#

style control nets do exist but are very rare

copper crystal Sep 15, 2024, 4:31 PM

#

to pose rayman style characters (vectorman possibly too) you could potentially train a controlnet lite for that. lot of people don't realize that controlnet learns how ot pose from it's base images. I'm not sure it could pose a disconnected character by default. Could be a neat effort to learn controlnet training with.

#

I think training a lora of just the disconnected character in enough poses would help a lot too

fervent thunder Sep 15, 2024, 4:41 PM

#

controlnet training sounds fun yeah

unreal glade Sep 15, 2024, 4:47 PM

#

I am writing an magical world

#

But, I wanna add humanoid magic golem into it.

#

And use graphics card to resembles them.

#

How should I represent those most classic ones?

#

Like, 750ti

#

960ti

#

690

#

Shits like that.

smoky sedge Sep 15, 2024, 8:08 PM

#

hello

fervent thunder Sep 15, 2024, 9:59 PM

#

hi

gilded raven Sep 15, 2024, 11:31 PM

#

Does somone in here have minecraft and want to play in my minecraft server with mods?

desert dagger Sep 16, 2024, 12:24 AM

#

gilded raven Does somone in here have minecraft and want to play in my minecraft server with ...

#🌶｜off-topic

gilded raven Sep 16, 2024, 12:37 AM

#

oh

sharp mirage Sep 16, 2024, 1:33 AM

#

Are there any local tools that can identify whether an ai image has imperfections or normal

runic ginkgo Sep 16, 2024, 3:04 AM

#

#怎么用

mighty oxide Sep 16, 2024, 3:52 AM

#

How do I run FLUX models correctly?

#

I can run them in my sd models folder, but they give gray images.

desert dagger Sep 16, 2024, 4:17 AM

#

mighty oxide How do I run FLUX models correctly?

that depends on if they are .sft files or .safetensor files

#

if they are .sft files, they go in the /models/unet folder

mighty oxide Sep 16, 2024, 4:17 AM

#

mine are safetensors

desert dagger Sep 16, 2024, 4:18 AM

#

what interface are you using?

mighty oxide Sep 16, 2024, 4:18 AM

#

i switched to forge

#

I

desert dagger Sep 16, 2024, 4:18 AM

#

i don't think forge has support for flux

#

switch to comfyUI

mighty oxide Sep 16, 2024, 4:19 AM

#

Is it comfy on Apple silicon M3 pro?

desert dagger Sep 16, 2024, 4:22 AM

#

mighty oxide Is it comfy on Apple silicon M3 pro?

not sure. you should ask that in the #🤝｜tech-support channel

#

#🤝｜tech-support would be the place to ask this

warm junco Sep 16, 2024, 4:56 AM

#

mighty oxide i switched to forge

Which flux model? Make sure you use either nf4 v2 or the flux dev fp8 (16gb)

warm junco Sep 16, 2024, 4:56 AM

#

desert dagger i don't think forge has support for flux

Forge has flux support

mighty oxide Sep 16, 2024, 4:58 AM

#

I want to use impressionistic realism

#

I don't know if that's a flux model.

#

It doesn't wark with anything I have

desert dagger Sep 16, 2024, 4:59 AM

#

warm junco Forge has flux support

my mistake, then. the last I remember hearing is that flux wasn't supported in forge

warm junco Sep 16, 2024, 5:00 AM

#

mighty oxide I want to use impressionistic realism

How big is the file?

mighty oxide Sep 16, 2024, 5:01 AM

#

38 MB

winged wasp Sep 16, 2024, 5:15 AM

#

hey

#

have you ever play free fire

#

??

warm junco Sep 16, 2024, 5:49 AM

#

mighty oxide 38 MB

Thats not a model (checkpoint) then

#

Its a lora mostly

vast hill Sep 16, 2024, 6:06 AM

#

hello

torpid bear Sep 16, 2024, 7:47 AM

#

Hi

fervent ermine Sep 16, 2024, 10:19 AM

#

Hi

echo lily Sep 16, 2024, 10:54 AM

#

Slightly afk. But if someone can ping me with the response to a question. Information says you can use stable diffusion to create a 2d character concept art. To later turn into a 3d model. Can anyone tell me which checkpoint, of the many many choices, i should use?

floral umbra Sep 16, 2024, 11:40 AM

#

Which motion diffusion do you guys recommend? Like SVD, animatediff i read is kinda "outdated", but don't know what people prefer these days

onyx needle Sep 16, 2024, 1:00 PM

#

A group of wild boars on the left and a group of hyenas on the right, facing each other during the day, tense atmosphere, panoramic view

civic cradle Sep 16, 2024, 1:00 PM

#

Who u gonna kiss

#

The boars or the hyenas

fervent thunder Sep 16, 2024, 2:02 PM

#

Im looking for a partner in building comfyui backed bots, any interested feel free to dm.

#

If this should go to community projects let me know

desert dagger Sep 16, 2024, 2:27 PM

#

fervent thunder If this should go to community projects let me know

that's where it should go

copper crystal Sep 16, 2024, 3:20 PM

#

big scam energy. @moderators or however you do it on this server. needs to be seen and ban this person.

fiery wasp Sep 16, 2024, 3:27 PM

#

copper crystal big scam energy. @moderators or however you do it on this server. needs to be ...

just react with ⚠️ and they should see, but obviously no one has seen that yet

copper crystal Sep 16, 2024, 3:28 PM

#

fiery wasp just react with ⚠️ and they should see, but obviously no one has seen that yet

they post in every channel at least once a day in these early western world hours.

i think it'd be easy to put a bot condition in place. if all their posts are verbatim the same post, put a 24 hour mute on them. EZ. Do they not have any coders left at stability?

fiery wasp Sep 16, 2024, 3:29 PM

#

copper crystal they post in every channel at least once a day in these early western world hour...

this is not a stability server, this one is community server, by users of SD

copper crystal Sep 16, 2024, 3:29 PM

#

huh?

fiery wasp Sep 16, 2024, 3:30 PM

#

there is a bot, that sends notifications to mods

copper crystal Sep 16, 2024, 3:30 PM

#

https://discord.com/channels/1002292111942635562/@home this is the official stability.ai owned server

fiery wasp Sep 16, 2024, 3:30 PM

#

bot reacts to ⚠️ that is the reason ⚠️ disapears

copper crystal Sep 16, 2024, 3:30 PM

#

oh that link doesn't work. it' goes to the "server guide" link

#

look, if stability doesn't want to tend their communiyt garden, that's on them. You don't have to make excuses for them or pretend it's not official though.

fiery wasp Sep 16, 2024, 3:32 PM

#

i am just saying what mods told me

copper crystal Sep 16, 2024, 3:32 PM

#

fiery wasp there is a bot, that sends notifications to mods

the bot is kind of half finished. A spammer that posts the same scam invite in every single channel is easily discerned in code

fiery wasp Sep 16, 2024, 3:32 PM

#

i am just a regular user

copper crystal Sep 16, 2024, 3:32 PM

#

fiery wasp i am just saying what mods told me

mods lied to you. Everything about this server is official Stability.

mellow meteor Sep 16, 2024, 3:54 PM

#

waow

fervent thunder Sep 16, 2024, 3:56 PM

#

fiery wasp this is not a stability server, this one is community server, by users of SD

this server did start out as a community server but the owner got approached and he gave it over
he still has a special flair though

ionic wraith Sep 16, 2024, 4:56 PM

#

Any good ways to remove text from and image and fill it with in?

#

img2img didnt seem to work good enough

fervent thunder Sep 16, 2024, 4:58 PM

#

probably just inpaint

echo lily Sep 16, 2024, 5:34 PM

#

Has anyone successfully gotten S.D. to produce a character like Cheetara? If so what checkpoint did you use, and how did you phrase it?

#

https://tenor.com/view/thunder-cats-gif-7172707

fervent thunder Sep 16, 2024, 7:12 PM

#

emergent !!

#

i can ssh with new bots fine but not with alraedy deployed pod

#

i have updated ssh key after i deployed this pod

#

can i change any config so that i can connect via ssh ?

ionic wraith Sep 16, 2024, 8:49 PM

#

Anyone tried https://github.com/fairy-root/Flux-Prompt-Generator?tab=readme-ov-file ?
I cant seem to connect any prompts to it.

subtle dock Sep 16, 2024, 9:09 PM

#

I tried installing Stable Diffusion on my computer, but I got the error 'Stable Diffusion model failed to load.' I can't figure out why I'm getting this error, as I was told I only needed to run the run.bat file. Is there anyone who can help

ionic wraith Sep 16, 2024, 9:10 PM

#

subtle dock I tried installing Stable Diffusion on my computer, but I got the error 'Stable ...

Follow one of these guides https://github.com/CS1o/Stable-Diffusion-Info/wiki/Webui-Installation-Guides
Very helpful

desert dagger Sep 16, 2024, 9:21 PM

#

subtle dock I tried installing Stable Diffusion on my computer, but I got the error 'Stable ...

can you post your entire error log, in #🤝｜tech-support, please

subtle dock Sep 16, 2024, 9:27 PM

#

ionic wraith Follow one of these guides https://github.com/CS1o/Stable-Diffusion-Info/wiki/We...

I will check it out thank you

subtle dock Sep 16, 2024, 9:28 PM

#

desert dagger can you post your entire error log, in <#1002602742667280404>, please

Sorry I was excited for stable, I wrote it here without reviewing the server, I will edit it

desert dagger Sep 16, 2024, 9:28 PM

#

subtle dock Sorry I was excited for stable, I wrote it here without reviewing the server, I ...

it's fine, you just going to get actual tech support if you post it in the #🤝｜tech-support channel is all

subtle dock Sep 16, 2024, 9:29 PM

#

desert dagger it's fine, you just going to get actual tech support if you post it in the <#100...

Ok, like a ticket. thx

swift igloo Sep 16, 2024, 10:25 PM

#

Hi everyone,
I've just started exploring Stable Diffusion and I have a question. I'm looking into using pretrained Stable Diffusion models, and I'm wondering if it's possible to pass an image, a mask image, and a separate image that I want to apply onto the masked area? Has anyone tried this or have any advice on how to approach it? Thanks in advance!

vestal bone Sep 16, 2024, 11:14 PM

#

Hi, just coming here trying to learn everything new about stability… otherwise it’s just train train train or gen gen gen 24/7

pseudo bough Sep 17, 2024, 1:35 AM

#

updated comfyui and now I get a white screen the gui doesn't load 😦

desert dagger Sep 17, 2024, 1:41 AM

#

pseudo bough updated comfyui and now I get a white screen the gui doesn't load 😦

unload comfy - i.e. kill the browser and the commandline windows. then go into the /comfyUI/updates folder and run the update script file. then reboot your machine

pseudo bough Sep 17, 2024, 2:04 AM

#

desert dagger unload comfy - i.e. kill the browser and the commandline windows. then go into t...

thanks 🙂 do you have a working flux wworkflow the one i found i'm having an issues with mat1 and mat2 shapes cannot be multiplied

desert dagger Sep 17, 2024, 2:05 AM

#

pseudo bough thanks 🙂 do you have a working flux wworkflow the one i found i'm having an iss...

sure. i have some fairly simple ones i'd be happy to give you copies of if you like

pseudo bough Sep 17, 2024, 3:35 AM

#

desert dagger sure. i have some fairly simple ones i'd be happy to give you copies of if you l...

i got it working in my preferred program forge 🙂

#

thansk though man this is absolutely amazing

tall maple Sep 17, 2024, 6:49 AM

#

Can someone who understands stable diffusion write to me and help me?

tulip yarrow Sep 17, 2024, 7:39 AM

#

What is a good amount of Buzz I should use for a SD1.5 CivitAI bounty for a 49 character pack? I'll collect and supply the images

undone garden Sep 17, 2024, 11:33 AM

#

Does invoke not extract info from pictures made with Auto1111 or is it just being weird

fervent thunder Sep 17, 2024, 11:33 AM

#

not sure

fervent thunder Sep 17, 2024, 1:12 PM

#

a lot of people send viruses here

#

so we can't rly click a link from a new user

static falcon Sep 17, 2024, 1:21 PM

#

copper crystal I think training a lora of just the disconnected character in enough poses would...

I have about 10 characters already (in final color) but they're very different, a frog pet, 2 flying fairies, 3 humanoid orcs etc, one fish monster that has tail instead legs, a few other weirdos 😆. I think I need to collect 10 humanoid monsters first, and then if I want to train for pets, will need about 10 four legged examples? A control net training also needs just a few examples like lora? Or a big data set

fervent thunder Sep 17, 2024, 1:22 PM

#

control net needs big data set most likely

balmy stratus Sep 17, 2024, 2:13 PM

#

hi

mighty oxide Sep 17, 2024, 4:20 PM

#

How do I fix getting black images in comfyui?

trail lion Sep 17, 2024, 4:31 PM

#

mighty oxide How do I fix getting black images in comfyui?

There are a few things that can cause it. But first restart comfy just to make sure you have a clean environment. Use a compatible vae, loras with your model. Don't mix supporting files between models. Use proper settings, resolution ,sampler ,cfg the the model you have loaded

floral umbra Sep 17, 2024, 6:19 PM

#

Hoi, do any of you have a decent workflow for animatediff with ipadapter to make lengthy clips actually consistent?

fervent thunder Sep 17, 2024, 6:50 PM

#

banodoco server is best for that

trail lion Sep 17, 2024, 6:52 PM

#

Lengthy will be the issue, most of the methods I've seen create keyframe grids.... So the more frames you add you'll either run into resource issues or create consistent issues with too much gap between frames

fervent thunder Sep 17, 2024, 6:59 PM

#

which resource do you run out of? is it VRAM?

trail lion Sep 17, 2024, 7:38 PM

#

Usually, say the video is 512x512, you take every 10th frame as a keyframe and create a grid that's maybe 5x5 with the intention of running that through img2img,ipadapter, etc... at some point you will hit a limit

copper crystal Sep 17, 2024, 7:51 PM

#

fervent thunder control net needs big data set most likely

i would augment an existing dataset. Maybe might investigate a workflow that would convert existing poses into disconnected creatures. Potentially with different parts for different limbs too. So a cat with disconnected frog legs. Seems like a weird goal to aim for, but i think that's how i'd plan my approach.

Interesting thought exercise. Thanks for sparking it @static falcon

ionic wraith Sep 17, 2024, 7:56 PM

#

Any tips or suggestions for a gpu for my ai pc?

fervent thunder Sep 17, 2024, 8:05 PM

#

I don't think assembling the image dataset is the hard part, the hard part is the expensive training cost for control nets

fervent thunder Sep 17, 2024, 8:05 PM

#

ionic wraith Any tips or suggestions for a gpu for my ai pc?

used RTX 3060 12GB is ok

mighty oxide Sep 17, 2024, 8:14 PM

#

what vae goes with the realisticComicBook_v10 model?

#

I keep getting black images.

ionic wraith Sep 17, 2024, 8:19 PM

#

fervent thunder used RTX 3060 12GB is ok

Hmm, thanks for your suggestion

static falcon Sep 17, 2024, 9:17 PM

#

fervent thunder I don't think assembling the image dataset is the hard part, the hard part is th...

my current plan is to try inpainting mode with mask, drawing items one by one, near the full character. let's say I generate the cartoon zombie on one side, and then go slightly near it with masking and prompt that it'll be a duplicate of its leg, etc

frank halo Sep 17, 2024, 9:25 PM

#

Hey

#

Someone speak spanish here?

#

I need hel with Upscaling basics in SD

tulip yarrow Sep 17, 2024, 9:28 PM

#

What is a good amount of Buzz I should offer for a SD1.5 CivitAI bounty for a 49 character pack? I'll collect and supply the images, and tag them too

#

By the way it's gonna be sfw, I'm not one of those freaks who want a niche... Lora

fervent thunder Sep 17, 2024, 9:47 PM

#

static falcon my current plan is to try inpainting mode with mask, drawing items one by one, n...

not sure if this will work

tulip yarrow Sep 17, 2024, 10:04 PM

#

tulip yarrow What is a good amount of Buzz I should offer for a SD1.5 CivitAI bounty for a 49...

Image count will be approximately 4000

deft bough Sep 17, 2024, 10:06 PM

#

hi

golden valley Sep 17, 2024, 10:33 PM

#

Who wants to work as a moder or developer in my project?

copper crystal Sep 18, 2024, 12:38 AM

#

tulip yarrow Image count will be approximately 4000

at least the cost of the buzz it would take to train that on civit, +50% . That's just my first sort of gauge on it. Could be a starter point for consideration at least

fervent thunder Sep 18, 2024, 1:24 AM

#

civit bounty market is odd

#

sometimes the buzz for a lora is the same as the buzz for one image

copper crystal Sep 18, 2024, 2:30 AM

#

yeah. my reasoning is lora costs so much buzz to train yourself. that's a "market rate" that people are paying. but getting somene else to do it isn't self serve. it's a custom process that they get paid upon completion for. so you boost the self serve market rate to a full serve then add some for incentive. it's a bounty so the goal is to convince people to come get it done for that price.

low balling market value could work too someone might bite

merry aurora Sep 18, 2024, 2:53 AM

#

Hey everyone, im wondering if someone knows about the best sources to get started with generating web design assets (e.g. design a landing page/impact image)? I did some googling of course, though still not sure where to find the latest and best... ty! ❤️

desert dagger Sep 18, 2024, 2:54 AM

#

merry aurora Hey everyone, im wondering if someone knows about the best sources to get starte...

are you wanting something that just generates images? or something that'll code the website?

merry aurora Sep 18, 2024, 3:00 AM

#

desert dagger are you wanting something that just generates images? or something that'll code ...

Only generating images. I am trying to find out what the state of the art is for generating images with a web design angle. For example I am interested in knowing and testing whether for a page like I can generate a similar layout/design

desert dagger Sep 18, 2024, 3:10 AM

#

merry aurora Only generating images. I am trying to find out what the state of the art is for...

there aren't any image gens that are specifically focused on web site design images that i'm aware of. that might be a good angle for you to explore writing, even.

#

you might be able to do that with one of openAI's GPT options

merry aurora Sep 18, 2024, 3:12 AM

#

Ah okay, I've tried a bit with OpenAI + https://stability.ai/, though it is inadequate at this stage I feel, I'll continue researching a bit - thank you for sharing thoughts

iron plover Sep 18, 2024, 6:05 AM

#

swift igloo Hi everyone, I've just started exploring Stable Diffusion and I have a question....

Better to take your various images into an editor like PS or Gimp, once you have the general placement save image and use in control net or image to image in SD, with the relevant prompt.

woeful flume Sep 18, 2024, 6:19 AM

#

So what model may i use in order to generate stuff like 90's retro Anime?

copper crystal Sep 18, 2024, 6:28 AM

#

flux + lora. or maybe, sdxl anime centric model + lora.

fervent thunder Sep 18, 2024, 7:23 AM

#

Hi

bright maple Sep 18, 2024, 7:24 AM

#

TradingView Premium Package (Cracked Version 2.9.2.6 – Desktop): https://www.reddit.com/r/FXFullLoaded/comments/1fhj6nn/tradingview_premium_free_version_available_for/

wind swan Sep 18, 2024, 8:19 AM

#

Would anyone happen to know; can I create an embedding or lora that can construct floor plans or maps? How would it be done? Should I feed it a lot of floorplans as regularization images, and then train it on individual parts of those floorplans like doors, walls, windows? Same with maps, feed it maps for reg images and then individual parts of that map like, trees, rocks, rivers, as training images?

winged sapphire Sep 18, 2024, 9:26 AM

#

hey guys whats the best video upscaler at this moment?

worldly cradle Sep 18, 2024, 9:35 AM

#

I also have questions about lora training, is this the right place? I lora training a joyful community thing where we could help each other with experience or is it a harsh market thing where the best will get all the money's and Noone wants to share experience because it might help others to get a piece of that cake? I know nothing about the market situation, I just want to create pictures of my favorite anime waifus playing my favorite boardgame 🙈 and maybe do Instagram with it. And maybe maybe some nsfw patreon of boardgame girls flashing? Don't know 😂 but it's all about the boardgame I swear. I would even pay a little money if someone experienced could take me by the hand and help me understand what I'm doing wrong 😂 but if one of you could tell me how he/she would approach a board game lora in terms of ref pictures, important tagging and training I would be very graceful 🙏

fervent thunder Sep 18, 2024, 9:44 AM

#

try the Fal.ai fast lora maker

#

it has instructions

#

that's fine for most people

worldly cradle Sep 18, 2024, 9:57 AM

#

I'm currently using kohya ss but can't comprehend all the parameters, is fal.ai relatable?

fervent thunder Sep 18, 2024, 10:01 AM

#

https://fal.ai/models/fal-ai/flux-lora-fast-training

#

its got a lot less options for simplicity

worldly cradle Sep 18, 2024, 10:05 AM

#

How many pictures should I give it? I don't know, shouldn't I have a little bit more experience about lora training to get reasonable results? 🤔 I don't really see a point in paying money to get the same results as before 😅

fervent thunder Sep 18, 2024, 10:08 AM

#

join fal discord the staff are really active

worldly cradle Sep 18, 2024, 10:13 AM

#

Thank you for your advice, I think I might use that as a last straw. I am kinda into doing this thing and hopefully understand it on the way 🤷‍♂️

floral umbra Sep 18, 2024, 10:23 AM

#

I'm using a.i toolkit for flux training, first time i've actually managed to make a successful lora. Downside though, is that i'm then limited to flux only, as i want to make for sd 1.5 and sdxl, but don't know if ai-toolkit can make for 1.5/sdxl

#

WAsn't there hardware made that was either a SoC or something that wasn't too pricy for low power A.I acceleration for stable diffusion and such? Trying to locate it, but hardly remember much of it eugh I remember nvidia released a A.I dedicated SoC using ARM was it, but can't seem to find it back either lol

fervent thunder Sep 18, 2024, 10:52 AM

#

coral tpu

wind swan Sep 18, 2024, 11:09 AM

#

worldly cradle Thank you for your advice, I think I might use that as a last straw. I am kinda ...

Are you using koya_ss?

#

Also, SD 1.5 model or SDXL?

worldly cradle Sep 18, 2024, 11:11 AM

#

wind swan Are you using koya_ss?

Yes, Kohya ss.
I'm currently training 1.5 because I haven't had the motivation to get sdxl to run yet

wind swan Sep 18, 2024, 11:14 AM

#

worldly cradle Yes, Kohya ss. I'm currently training 1.5 because I haven't had the motivation t...

sending you DM

worldly cradle Sep 18, 2024, 11:26 AM

#

wind swan sending you DM

Sure, thank you 🙏

sweet nacelle Sep 18, 2024, 11:34 AM

#

Hello, I hope you are doing well.
I have one project about the Cartoon image processing.
Main purpose of this project is as follows.
Convert the cartoon image to the smile, sad, mod and so on cartoon one.
At this, it is not allowed to change others without the face.
Who can teach me what model is useful?
Please contact me and discuss about it.

floral umbra Sep 18, 2024, 11:59 AM

#

fervent thunder coral tpu

Perfect! That was one of the accelerators i read about but completely forgot :D And now i'm curious if there's a comfyui node to make a farm of say 2-3 of those for low power acceleration of generations :P

#

Wait, does coral not have more than 4 tops on any of their boards?

fervent thunder Sep 18, 2024, 12:18 PM

#

not sure

gray ermine Sep 18, 2024, 1:30 PM

#

Why can't I make an audio-to-audio connection? Whether it's an uploaded file or an uploaded record, the generated composition remains "pending" for a long time, before finally displaying an error message.

fiery wing Sep 18, 2024, 1:52 PM

#

Hey folks quick question. I am using SD1.5. My GPU (1660ti) can handle creating up to 1024x1024 images. However, I was reading around and saw that SD1.5 was apparently trained at 512x512 and does its "best results" there.

However, when doing image generation, I find that the 1024x1024 images have more detail to them and are a little less janky, however they take 4-6x longer to generate.

What is a better workflow; generating at 1024x1024 OR generating at 512x512 and upscaling good images?

hardy nexus Sep 18, 2024, 2:00 PM

#

So sd3.5 was released ? Where was it announced? I didn't see it

warm junco Sep 18, 2024, 2:36 PM

#

fiery wing Hey folks quick question. I am using SD1.5. My GPU (1660ti) can handle creating ...

Hey, upscaling is the way to go

rain palm Sep 18, 2024, 3:32 PM

#

Anyone had any experience with https://exactly.ai ?

fiery wing Sep 18, 2024, 3:59 PM

#

Wow SDXL fucking hates generating at lower than 1024x1024.

trail lion Sep 18, 2024, 4:04 PM

#

yah, you can get away some with non square resolutions sometimes, like 768x1024 but 1024x1024 really is what seems to work best on XL

fiery wing Sep 18, 2024, 4:04 PM

#

Oh thats a good tip, I'll use that

#

So prompt forming question

#

What's better between: black hair, short hair vs black short hair?

#

Or would something like (black, short) hair be best?

trail lion Sep 18, 2024, 4:11 PM

#

I would use dark vs black, because with XL the colors tend to get applied to things you dont intend

#

like you'll start seeing everything black, like furniture, clothing

fiery wing Sep 18, 2024, 4:11 PM

#

I see, any difference for 1.5?

trail lion Sep 18, 2024, 4:12 PM

#

same goes for that, SD3 and flux have addressed that though, with their better prompt adherence

fiery wing Sep 18, 2024, 4:12 PM

#

I see, but outside of changing from black to dark, does any of the three ways I posed the prompt vary anything?

trail lion Sep 18, 2024, 4:13 PM

#

shouldnt matter

fiery wing Sep 18, 2024, 4:13 PM

#

Oh, surprising.

#

And what if I'm actually looking for like

#

black as 0,0,0 hair?

#

do I just use "black" or just do "super duper mega dark"

trail lion Sep 18, 2024, 4:13 PM

#

try it, but just dont be surprised if you see more black

fiery wing Sep 18, 2024, 4:14 PM

#

got it noted

fervent thunder Sep 18, 2024, 4:34 PM

#

what happened to iopaint? the stable diffusion tab is empty with no models

trail lion Sep 18, 2024, 4:50 PM

#

what's that? is it like inpaint?

abstract quarry Sep 18, 2024, 5:10 PM

#

fiery wing Hey folks quick question. I am using SD1.5. My GPU (1660ti) can handle creating ...

are you sure you use the base model? Cause the base model is really bad with resolutions above 512

#

you can finetune a model to higher resolutions and most popular SD 1.5 finetunes support higher resolutions

#

same for sdxl. The reason why outputs in sdxl on low resolutions look so ugly is that it associates low resolutions with ugly Internet images 🤷‍♂️

floral umbra Sep 18, 2024, 5:13 PM

#

fiery wing Wow SDXL fucking *hates* generating at lower than 1024x1024.

Which model specifically do you use? I can generate fine even at 768x768

fiery wing Sep 18, 2024, 5:27 PM

#

abstract quarry are you sure you use the base model? Cause the base model is really bad with res...

Oh, I'm not using the base model. I'm using a tuned model.

abstract quarry Sep 18, 2024, 5:28 PM

#

yes, they are finetuned on higher resolutions

fiery wing Sep 18, 2024, 5:28 PM

#

floral umbra Which model specifically do you use? I can generate fine even at 768x768

Oh I tried a few off civit.ai and it seems consistently to perform worse below 1024x1024. One I tried was AnimagineXL

#

Not to say the images are completely unusable

#

Just the volume of artifacts is much higher

abstract quarry Sep 18, 2024, 5:28 PM

#

you can use sdxl turbo if you want to generate on 512 in sdxl

#

as said, sdxl was finetuned on high quality 1024 pixel images, so it associates everything below 1024 as low quality image

#

you would have to finetune it on high quality low res images. But why would you want to generate lowres images anyways?

#

sdxl is more efficient on high resolution images than SD 1.5

fiery wing Sep 18, 2024, 5:30 PM

#

What is sdxl turbo?

#

And I'm generating below 1024 cause I only have a 6GB card Laugh

#

I can generate at 1024 but it takes much longer than smaller

copper crystal Sep 18, 2024, 5:34 PM

#

fiery wing I *can* generate at 1024 but it takes much longer than smaller

use multidfiffusion in forge ui

fiery wing Sep 18, 2024, 5:34 PM

#

copper crystal use multidfiffusion in forge ui

blobsweat Now I gotta figure out what a forge ui is

#

I'm really out of the loop lol

#

Hardly touched SD in 2 years, so have been slowly modernizing my setup

copper crystal Sep 18, 2024, 5:36 PM

#

fiery wing <:blobsweat:588766584731074571> Now I gotta figure out what a forge ui is

if you use webui from automatic1111, it's more or less the same. Has a lot of benefits for lower mem systems. worth reading up about. If you don't want to move from auto, you can find the multidiffusion extension and install it. it's just built into forge

fiery wing Sep 18, 2024, 5:36 PM

#

copper crystal if you use webui from automatic1111, it's more or less the same. Has a lot of b...

noted Okay yes I do use A1111, so I will try that extension

#

What does multidiffusion do?

copper crystal Sep 18, 2024, 5:37 PM

#

it's a tiled sampler and tiled vae. since it processes smaller tiles at a time, you can fit more into less vram

fiery wing Sep 18, 2024, 5:37 PM

#

Got it

copper crystal Sep 18, 2024, 5:37 PM

#

https://github.com/pkuliyi2015/multidiffusion-upscaler-for-automatic1111

fiery wing Sep 18, 2024, 5:54 PM

#

copper crystal https://github.com/pkuliyi2015/multidiffusion-upscaler-for-automatic1111

I must be using this wrong, I downloaded and enabled the extension but its going... slower?

copper crystal Sep 18, 2024, 5:55 PM

#

lots of guides out there for it. it's a popular extension that has allowed a lot of low vram users to accelerate their work.

i'm not able to help too well. I used it briefly 2 years back. then i bought a new gpu.

floral umbra Sep 18, 2024, 5:58 PM

#

copper crystal use multidfiffusion in forge ui

Damn, i need to try that in comfy lol, as i'm a sucker for upscaling lol

copper crystal Sep 18, 2024, 6:01 PM

#

floral umbra Damn, i need to try that in comfy lol, as i'm a sucker for upscaling lol

comfyui might have it better listed as a tiled sampler and tiled vae. they'd be separate nodes i think

floral umbra Sep 18, 2024, 6:03 PM

#

copper crystal comfyui might have it better listed as a tiled sampler and tiled vae. they'd be...

Ah, so it's simply just tiled? :P

copper crystal Sep 18, 2024, 6:06 PM

#

yeah i found the custom nodes for it https://github.com/shiimizu/ComfyUI-TiledDiffusion

floral umbra Sep 18, 2024, 6:07 PM

#

Thanks :) Odd that i didn't get that one when i googled Thonk

copper crystal Sep 18, 2024, 6:08 PM

#

floral umbra Thanks :) Odd that i didn't get that one when i googled <:Thonk:3994336097712209...

i goog'd multidiffusion for comfyui but i'm always thinking low key that the goog algorithm senses if you're getting too good at home hobby AI and will begin to hinder you. total paranoia but sometimes i embellish it. Gotta walk that line between insane and the membrane.

floral umbra Sep 18, 2024, 6:09 PM

#

The heck? Google Algo tries to stop you from becoming smarter? HAhaa

copper crystal Sep 18, 2024, 6:10 PM

#

Alpha's big bet was to invest into AI and be the lead. Now tha'ts all changed and open source AI is fucking their 10 year corporate strategy.

they're probably not manipulating but it's fun to think of

floral umbra Sep 18, 2024, 6:11 PM

#

copper crystal Alpha's big bet was to invest into AI and be the lead. Now tha'ts all changed an...

hah xD

copper crystal Sep 18, 2024, 6:11 PM

#

i actually begrudginely think that meta has done the most in the field.

long talon Sep 18, 2024, 6:12 PM

#

Is there a Flux server somewhere?

floral umbra Sep 18, 2024, 6:12 PM

#

On another note, do we have a FOSS version of suno a.i yet for text to songs gen? thinky Or still too early for that?

floral umbra Sep 18, 2024, 6:12 PM

#

copper crystal i actually begrudginely think that meta has done the most in the field.

Indeed

copper crystal Sep 18, 2024, 6:12 PM

#

floral umbra On another note, do we have a FOSS version of suno a.i yet for text to songs gen...

stable audio? ||heh||

floral umbra Sep 18, 2024, 6:16 PM

#

copper crystal stable audio? ||heh||

Oh right, i remember that one, where model wasn't available for months after announcement. Can it do the same kind of music as suno?

copper crystal Sep 18, 2024, 6:16 PM

#

floral umbra Oh right, i remember that one, where model wasn't available for months after ann...

nope lol

#

its the state of open audio models pretty much

floral umbra Sep 18, 2024, 6:17 PM

#

That's what i'm after lol. Or not as good, but can do full songs in the same sense.

copper crystal Sep 18, 2024, 6:20 PM

#

There was that one project too, that used stable diffusion. Trained it on spectrographs of music iirc, then diffusion generates spectrogram images and they put the image through a converter to turn it back to audio.

floral umbra Sep 18, 2024, 6:23 PM

#

huh. Would be cool to have that, then a node that reads from the rhythm for a llm to make lyrics and control tempo/singing itself, then you have a song right there :P

abstract quarry Sep 18, 2024, 6:23 PM

#

fiery wing What is sdxl turbo?

it generates images in one step

#

so if you want it fast, use turbo

hardy tree Sep 18, 2024, 6:24 PM

#

hi guys

copper crystal Sep 18, 2024, 6:24 PM

#

https://www.riffusion.com/ this is it. they've pivoted from open models it looks like

hardy tree Sep 18, 2024, 6:24 PM

#

what is the channel for generate images?

abstract quarry Sep 18, 2024, 6:24 PM

#

in general, Turbo/Lightning/Lcm models allow generation of Images in fewer steps (usually around 6 steps). The base Turbo model needs 1-2 steps but generates in 512x512 natively

#

fewer steps have the disadvantage that you cannot use cfg and negative prompts - but if you want to have good images fast on old hardware you should definitely use turbo models

abstract quarry Sep 18, 2024, 6:26 PM

#

hardy tree what is the channel for generate images?

there is no free image generation. Download stable diffusion or flux and generate images on your on computer

hardy tree Sep 18, 2024, 6:26 PM

#

mmm

#

You can no longer generate images in the cloud?

#

then if i don't have a gpu not working, right?

copper crystal Sep 18, 2024, 6:30 PM

#

i think civitai has a free generator now. Stability stopped offering one, since there are so many and they're trying not to burn money now

fiery wing Sep 18, 2024, 6:31 PM

#

Ive used perchance.org before, its free and decent, just not very flexible, though I think thats the case with all the free web ones

#

I think the main draw of SD is the infinite flexibility and privacy

#

Atleast, for me.

copper crystal Sep 18, 2024, 6:32 PM

#

fiery wing I think the main draw of SD is the infinite flexibility and privacy

the free generators that used to be on this discord server were inflexible too.

fiery wing Sep 18, 2024, 6:33 PM

#

Yeah I mean thats just how it is with a distributed platform

#

If you wanna control the model, process, etc. you have to host it yourself

copper crystal Sep 18, 2024, 6:33 PM

#

or at least control the host like runpod. yeah

fervent thunder Sep 18, 2024, 6:56 PM

#

lama cleaner is now called iopaint. The button "dream" is gone and there aren't any models to choose from

copper crystal Sep 18, 2024, 7:13 PM

#

fervent thunder lama cleaner is now called iopaint. The button "dream" is gone and there aren't ...

they made that change ages ago

#

there are still models, but you're talking about a 3rd party software as a service website

south fox Sep 18, 2024, 7:34 PM

#

how do I use adjectives in a multi subject prompt but make it so those adjectives only apply to one of the subjects?

trail lion Sep 18, 2024, 7:38 PM

#

which model?

#

one way is something called regional prompting

#

so you would have to know where your subjects will appear in the image so you can divide it into regions

fervent thunder Sep 18, 2024, 7:39 PM

#

what? iopaint uses remote processing?

fervent thunder Sep 18, 2024, 7:44 PM

#

south fox how do I use adjectives in a multi subject prompt but make it so those adjective...

regional conditioning

#

could also use Omost, it finally has comfy nodes

south fox Sep 18, 2024, 7:49 PM

#

thanks!

hollow marlin Sep 18, 2024, 9:39 PM

#

What language was stable diffusion made in?

quartz siren Sep 18, 2024, 9:51 PM

#

hollow marlin What language was stable diffusion made in?

It's english only

hollow marlin Sep 18, 2024, 9:52 PM

#

No

#

I meant what programming language

quartz siren Sep 18, 2024, 10:06 PM

#

most of it is probably python

feral pike Sep 18, 2024, 10:14 PM

#

hiiiiiii

desert dagger Sep 18, 2024, 10:40 PM

#

@fervent thunder you read this? https://x.com/runwayml/status/1836391272098988087

verbal delta Sep 18, 2024, 10:41 PM

#

Is there a tool like this in Forge WebUI or ComfyUI? https://github.com/chufengxiao/SketchHairSalon

fervent thunder Sep 18, 2024, 10:46 PM

#

desert dagger <@456226577798135808> you read this? https://x.com/runwayml/status/1836391272098...

that's really big news yeah

desert dagger Sep 18, 2024, 10:46 PM

#

yeah. can't wait to see what lionsgate comes out with

fervent thunder Sep 18, 2024, 10:47 PM

#

I suspect they got told about the next upcoming generation of Runway models
maybe it will be closer to Sora as it is trending that way

desert dagger Sep 18, 2024, 10:47 PM

#

fervent thunder I suspect they got told about the next upcoming generation of Runway models mayb...

everything is now better than sora - and kling just got both a new model AND motion brush for the older model

fervent thunder Sep 18, 2024, 10:48 PM

#

I've seen a lot of good Kling videos yeah

#

its possible that OpenAI is now gonna divert Sora resources to that GPT o1 thing now anyway

desert dagger Sep 18, 2024, 10:49 PM

#

fervent thunder I've seen a lot of good Kling videos yeah

i've just run one image to video test on the new kling model and all it did was zoom in, so not sure if i'm impressed or not yet

desert dagger Sep 18, 2024, 10:49 PM

#

fervent thunder its possible that OpenAI is now gonna divert Sora resources to that GPT o1 thing...

they better do something. i know they were trying to target hollywood studios, but i think the studios have been laughing at them. and now that lionsgate is onboard with runway, the others probably will rapidly as well.

fervent thunder Sep 18, 2024, 10:50 PM

#

I feel like its more of a side adventure for Open AI whereas for Runway or Black Forest its their main thing

desert dagger Sep 18, 2024, 10:51 PM

#

who knows, altman is a squirrel and distracted with trying to convince politicians to do stuff that isn't going to matter in the long run.

#

also - not sure if i told you this, but pull up flux dev and put this in for your prompt: "Yoshiyuki Tomino anime " and then add anything else you'd like after it and see what happens

fervent thunder Sep 18, 2024, 10:55 PM

#

I got an anime style, it looks good

#

I put it on #🏞｜general-with-images

desert dagger Sep 18, 2024, 10:56 PM

#

fervent thunder I got an anime style, it looks good

try it with other prompts, so far, i have only found a couple that it didn't give a real nice result with

fervent thunder Sep 19, 2024, 1:12 AM

#

what if you apply the o1 reasoning approach to genAI

#

using something like aesthetics scoring

#

generate image -> aesthetics scoring, img2img->aesthetics scoring
RLHF move params based on wether the img2img step improved aesthetics scoring

#

gotta make sure the semantic contents of the image stick

#

so also compare that and use it in the RLHF

#

this way you should get a model that can similar to o1 'improve' their way through an image till a highly aesthetic end product as much as llms use reasoning steps to get there

#

like a chain of edits

near breach Sep 19, 2024, 2:19 AM

#

hi guys! I want build a SD model targeting low-res images generation, so I guess i gotta train SD from scratch including unet vae clip..... who or which channel should I turn to? Are there any experts in the group who are experienced in training models?

thx so much 🙂

desert dagger Sep 19, 2024, 2:52 AM

#

near breach hi guys! I want build a SD model targeting low-res images generation, so I gue...

you can't train stable diffusion from scratch. you can train a check point for it, or a lora for it, but you can't train IT from scratch

near breach Sep 19, 2024, 2:53 AM

#

desert dagger you can't train stable diffusion from scratch. you can train a check point for i...

cause its too costly?

desert dagger Sep 19, 2024, 2:55 AM

#

near breach cause its too costly?

among other things. what you want to do does not require you to train the base model from scratch. what you want to do is either train a checkpoint or a lora that will use the base model.

near breach Sep 19, 2024, 2:57 AM

#

desert dagger among other things. what you want to do does not require you to train the base m...

In fact, I've tried to train a lora or fined-tuned a checkpoint(by dreambooth), neither way meets my needs 😦

desert dagger Sep 19, 2024, 2:58 AM

#

near breach In fact, I've tried to train a lora or fined-tuned a checkpoint(by dreambooth), ...

what, exactly, is it you are trying to accomplish?

near breach Sep 19, 2024, 2:58 AM

#

To generate extreme low-res pic, allow me to show some

desert dagger Sep 19, 2024, 2:58 AM

#

near breach To generate extreme low-res pic, allow me to show some

post them in the #🏞｜general-with-images channel

valid zinc Sep 19, 2024, 6:29 AM

#

excuse me but where do you guys find the models for Stable diffusion? im very new to this but i appreciate the help. the "checkpoints" just to be clear.

solid kindle Sep 19, 2024, 9:26 AM

#

civitai

fervent thunder Sep 19, 2024, 9:32 AM

#

fervent thunder what if you apply the o1 reasoning approach to genAI

been thinking about something like this for a while

#

comfyui has loops now

#

so we could make a loop where it makes an image, runs it through a quality checker model

#

then changes your settings and generates again

#

at the moment the quality checker models are a bit of a let down though

fresh lily Sep 19, 2024, 9:54 AM

#

hello

violet sun Sep 19, 2024, 10:43 AM

#

sand flax Sep 19, 2024, 11:33 AM

#

@fresh lily Mr Sandman

#

SD3 needs more work. I know 1 recipe it really needs.

fresh lily Sep 19, 2024, 11:47 AM

#

@sand flax what's that ?

sand flax Sep 19, 2024, 11:51 AM

#

fresh lily <@333081322950098954> what's that ?

I believe that designs and image structure will appear more accurate if the devs train the model to generate images in various angles. Because what 1 issue does most, if not, all models have in common? the inaccurate results of a perfect looking object, person, or place when it's flipped, 180 degrees, or upsidedown.

quartz siren Sep 19, 2024, 11:52 AM

#

sand flax I believe that designs and image structure will appear more accurate if the devs...

True and I believe the safety filtering was messed up as well. Surprisingly sd3 large and sd3.5 large don’t have to have the same issues. It’s not near flux level in humans but still ok.

sand flax Sep 19, 2024, 11:54 AM

#

quartz siren True and I believe the safety filtering was messed up as well. Surprisingly sd3 ...

Hey Sharma I agree, and I believe Flux is able to accurately picture its human figures upside-down like DALL E 3. Those two are going at it.

quartz siren Sep 19, 2024, 11:57 AM

#

sand flax Hey Sharma I agree, and I believe Flux is able to accurately picture its human f...

Yeah true, Flux dev is even better then dalle3 at humans I believe, maybe not in prompt following but it’s not really fair to compare since dalle3 has an llm helping.

People’s Early access images made with sd3.5 is showing promise, images look better then flux but slightly worse prompt following, text rendering and humans.

sand flax Sep 19, 2024, 12:00 PM

#

quartz siren Yeah true, Flux dev is even better then dalle3 at humans I believe, maybe not in...

Exactly and surely Flux has more realistic value of making humans than Dall E 3 like when recognizing a popular celebrity.

Ohhhh, so is SD3.5 like a prototype or what? Ever since SD3 was "released" it doesn't seem impressive. Is it the official model that's out now or what?

quartz siren Sep 19, 2024, 12:05 PM

#

sand flax Exactly and surely Flux has more realistic value of making humans than Dall E 3 ...

Stability started to train a new model, which is sd3.5(large which is 8b and medium which is 2b). Now it’s in testing phase, some people have access in the discord and twitter and posted some images too.

It’s still not fully done training, and requires more steps then normal I believe but it seems decent so far. Just search it in google “sd3.5 twitter”

regal scroll Sep 19, 2024, 12:05 PM

#

Hello

#

Is possible to train lora or checkpoint for clone style of artist?

sand flax Sep 19, 2024, 12:06 PM

#

quartz siren Stability started to train a new model, which is sd3.5(large which is 8b and med...

Alright I'll check it. at what date do you believe the model might be finally finished?

sand flax Sep 19, 2024, 12:06 PM

#

regal scroll Hello

Hey Victor

regal scroll Sep 19, 2024, 12:07 PM

#

My father died... And I hope to create more pictures with his style.... Is possible? Please help....

regal scroll Sep 19, 2024, 12:07 PM

#

sand flax Hey Victor

Hi

quartz siren Sep 19, 2024, 12:08 PM

#

sand flax Alright I'll check it. at what date do you believe the model might be finally fi...

Not sure, I expect it should be finished soon but open sourced a few months later.

regal scroll Sep 19, 2024, 12:09 PM

#

My father said Jose Ramon Iglesias Rivera. Spanish artist

#

Galicia

sand flax Sep 19, 2024, 12:10 PM

#

quartz siren Not sure, I expect it should be finished soon but open sourced a few months late...

Like most AI developers and companies have moved into training models on AI videos now, so hopefully soon Stability finishes up.

regal scroll Sep 19, 2024, 12:10 PM

#

I only have original I only have the original paintings and photographs of the paintings...

sand flax Sep 19, 2024, 12:14 PM

#

I wonder if there are any developer who is working on training AI models on universal sound effects -- generate having every sound, pitch, and voice ever heard. that way it may be played with an AI video.

regal scroll Sep 19, 2024, 12:16 PM

#

But I only want to create images with the same style

#

With Stable Diffusion

bleak drift Sep 19, 2024, 12:43 PM

#

So is Stable Diffusion 3 Medium any good?

amber bloom Sep 19, 2024, 1:13 PM

#

regal scroll Is possible to train lora or checkpoint for clone style of artist?

I'm sure this is possible, there are plenty of style loras out there.

fervent thunder Sep 19, 2024, 2:13 PM

#

fervent thunder at the moment the quality checker models are a bit of a let down though

https://research.google/blog/rich-human-feedback-for-text-to-image-generation/

sweet nacelle Sep 19, 2024, 2:13 PM

#

Hello, everyone!

undone star Sep 19, 2024, 3:17 PM

#

hi

plain raptor Sep 19, 2024, 4:42 PM

#

i think

#

this is the only server, that has ever had thus many ppl

#

on it

#

that ive had been in'

#

346,073

#

is like the population of my entire town

quartz siren Sep 19, 2024, 4:44 PM

#

bleak drift So is Stable Diffusion 3 Medium any good?

Not really, it has artifacts and horribly bad humans.

I would recommend flux over sd3 medium, it has almost perfect humans anatomy, by far better then sdxl and excellent prompt following and text rendering, both better then sd3 medium by a large margin.

plain raptor Sep 19, 2024, 4:45 PM

#

i needa update my stable

#

so, Stabel diff Flux is wut i wan nu?

#

is that the latest version

#

or is its like, evrything, different than stable xl

#

currenlty have stable Xl 1.7.0

velvet slate Sep 19, 2024, 4:51 PM

#

What's my best option for creating a book cover?

quartz siren Sep 19, 2024, 4:51 PM

#

plain raptor so, Stabel diff Flux is wut i wan nu?

Flux is made by a different company but yes it’s a much better model(comparable to dalle3, ideogram, and can be considered better then mid journey sometimes)
It’s a massive 16b parameters with everything(sdxl is like 3.6b with everything) with quantization you can fit it in 8gb vram.

It’s basically now most peoples go to model, you can just search “flux 4bit guide” and you will find lots of tutorials.

plain raptor Sep 19, 2024, 4:53 PM

#

so, flux is a model, not an entirely diff AI

#

ovO

#

||weenus||

quartz siren Sep 19, 2024, 5:00 PM

#

velvet slate What's my best option for creating a book cover?

Flux, you can look at the examples here: https://www.reddit.com/r/StableDiffusion/comments/1eon9n7/flux_its_amazing_at_creating_silly_children_book/

velvet slate Sep 19, 2024, 5:19 PM

#

Any free WebApps running that?

quartz siren Sep 19, 2024, 5:21 PM

#

A lot, https://fal.ai/models/fal-ai/flux/dev lots of spaces in huggingface too. https://huggingface.co/spaces?sort=trending&search=Flux

velvet slate Sep 19, 2024, 5:25 PM

#

Thanks mate!

copper crystal Sep 19, 2024, 5:55 PM

#

People seem excited by RWML partnering with Lionsgate, but might i remind people that Lionsgate is value bloated and is a bubble waiting to burst. They're the old guard of hollywood, the last of the weinsteins. They're partnering with RWML, giving them the entire catalogue, desperately .

i mean, lions gate put out borderlands movie. This is their hail mary to cut costs .

It's like when hollywood discovered CG could replace miniatures and they fired all the seasoned artists and hired VFX artists for abusive rates and we had a WHOLE lot of bad CG. Movies in the mid 90s were generally worse than movies in the 80s or early 90s due to the abandonment of practical effects.

lionsgate is bout to churn out a whole lot of crap on their way to insolvency

sleek otter Sep 19, 2024, 6:34 PM

#

quartz siren Flux is made by a different company but yes it’s a much better model(comparable ...

With 64Gb RAM, I run Flux1.Dev on my 8Gb VRAM RTX 2070 - about 2m 40s/image

quartz siren Sep 19, 2024, 6:37 PM

#

Nice, what are you using to run it?

solid kindle Sep 19, 2024, 6:43 PM

#

Hi, i dont understand whats the difference between the normal and xl model (e.g. SDXL or ponyXL).

#

afaik the XL has bigger model ? or LoRa ? and will it gives better result ?

low moon Sep 19, 2024, 7:23 PM

#

The bigger the better.

copper crystal Sep 19, 2024, 7:34 PM

#

xl and pony are the same model. Pony XL is a refined version of it. unfortunately the text encoder layer is disaligned and it only works with the pony loras and embeddings. so people consider it it's own base model.

I think the pony phase has been a tulip mania situation. novelty and memepower. there are much better sdxl refines depending on your purposes

#

people keep calling it a base model as if he trained it with millions of dollars and billions of images. i think he just refined sdxl with many thousands of images though.

stark rapids Sep 19, 2024, 7:43 PM

#

hello!

trail lion Sep 19, 2024, 8:09 PM

#

quartz siren Flux is made by a different company but yes it’s a much better model(comparable ...

basically every single hurdle out of the gate has been overcome. the ability to train it. the ability to run it with less resources. the ability to run it faster. so with those negative points eliminated you have mostly only the good points, which are the prompt adherence and the quality. of course you have currently what is always the case with a new model, which is lack of the wealth of community contributions, but that's a matter of time, as long as it's worthwhile for the community to pursue (and it is, since it seems to respond so well to training). so in my humble opinion, it's indeed the new king to dethrone. certain cult followings around prior models will still be there, certainly. People are still using 1.5.

copper crystal Sep 19, 2024, 8:12 PM

#

i love that it was never hyped. it just appeared

#

different company but the original authors of stable diffusion 1 and sd3

quartz siren Sep 19, 2024, 8:16 PM

#

copper crystal xl and pony are the same model. Pony XL is a refined version of it. unfortunate...

yeah calling it a different model is a lie, its a finetune of sdxl but a very large scale one. It was trained with 2 million images and does have considerably more knowledge then sdxl in many things but I still prefer normal sdxl models because they worked well with regional prompting. Now I just use flux since it's much better.

copper crystal Sep 19, 2024, 8:16 PM

#

was it millions? ill note that

#

I find it doesn't know geographic locations even half as well as SDXL. poses, characters, outfits, things that are character focused. i think that's where it shines

quartz siren Sep 19, 2024, 8:18 PM

#

Yeah it sucks at backgrounds but great at what you mentioned.

copper crystal Sep 19, 2024, 8:19 PM

#

just found out ruinedFoooocus is brought up to speed with Flux compatibility. hmmmm

#

when i got onebuttonprompt working, i noticed a mention of it being in ruinedFoocus built in. so i'm thinknig "oh yeah that project lets go check it out"

quartz siren Sep 19, 2024, 8:22 PM

#

trail lion basically every single hurdle out of the gate has been overcome. the ability to...

Yep, I agree with you. The only slight dislike I have is that it seems to generate somewhat "unaesthetic" and "similar" images. I am sure some stuff like dpo/spo/ncp could easily solve that problem but it's still definitely the king. People like sd1.5 since its just so fast and quick to generate a image and you can quickly add a style to some image you have.

copper crystal Sep 19, 2024, 8:23 PM

#

quartz siren Yep, I agree with you. The only slight dislike I have is that it seems to genera...

someone's removed the guidance from flux. and there is a lora which disables the "aesthetics" block on the faces.

people have been cracking away at it

quartz siren Sep 19, 2024, 8:24 PM

#

Yeah it's a pretty minor thing and the community can fix it for sure.

copper crystal Sep 19, 2024, 8:24 PM

#

#🆕｜sd3 message you were the one who told me bout this neat thing

#💬｜general-chat

||weenus||