#💬｜general-chat | Stable Diffusion | Page 187

fervent thunder Apr 28, 2025, 1:12 PM

#

was wondering what diffusers does that is less memory efficient?
I've started using it but cos I use big servers I don't notice memory efficiency
is it like an issue of casting or an issue of loading?

serene mountain Apr 28, 2025, 1:13 PM

#

Im not sure, Id have to reload it, try to duplicate the problem and come back.

Closing out invoke and restarting would help but not solve it. Matter of fact it got worse, Id get 1-2 generations and then black squares.

#

I have comfyUI installed and it runs, Ive just put off really digging into it. Felt like if I wasnt fine tuning the “easier” programs then Id be more lost in Comfy.

#

I have forge installed as well, had the most “luck” there. The images tend to look soft but it stays consistent at least

fervent thunder Apr 28, 2025, 1:21 PM

#

there's different ways to mess up memory management, is why I was asking them

median jewel Apr 28, 2025, 1:26 PM

#

i have been trying to download comfyscript for 2 days buy cant get it to work at all, i have followed the instructions on the github page and tried to google, if anyone knows how to download it, please tell me :D

oblique agate Apr 28, 2025, 4:30 PM

#

https://www.tomshardware.com/pc-components/gpus/nvidia-may-release-the-rtx-5080-and-5070-super-with-boosted-memory-configurations-according-to-leaker if this is true ppl should hold out on buying gpu till the release of 5080 super

verbal notch Apr 28, 2025, 6:24 PM

#

Hey guys,

is kohya_ss still state of the art when I want to train my own LoRa or are there some other usable alternatives out there?

The reason I am asking is because kohya_ss is gving me a ton of dependencies error messages after clicking the Button "Start Training" which I cant seem to fix

RuntimeError: operator torchvision::nms does not exist
20:22:47-499620 INFO Training has ended.

vagrant crater Apr 28, 2025, 6:41 PM

#

verbal notch Hey guys, is kohya_ss still state of the art when I want to train my own LoRa o...

fluxgym 100x easier

left terrace Apr 28, 2025, 6:48 PM

#

is api pricing flat fee for stable video?

atomic mortar Apr 28, 2025, 9:29 PM

#

vagrant crater fluxgym 100x easier

thats only for flux no?

desert dagger Apr 29, 2025, 5:21 AM

#

atomic mortar thats only for flux no?

it's only for flux. luca taco has good trainers on his replicate

verbal notch Apr 29, 2025, 5:30 AM

#

Thanks for the suggestion, I actually got kohya working yesterday and trained my own LoRa successfully, yay thomas

Do you guys have suggestions for good tutorials on how to make the bust use out off stable diffusion?

desert dagger Apr 29, 2025, 6:29 AM

#

verbal notch Thanks for the suggestion, I actually got kohya working yesterday and trained my...

yeah, start with scott's https://www.youtube.com/@sedetweiler

hardy hawk Apr 29, 2025, 12:20 PM

#

I have issue with stable diffusion running on AUTOMATIC1111

#

Everyone facing same issue or its only me?

candid light Apr 29, 2025, 12:37 PM

#

hi

near seal Apr 29, 2025, 12:39 PM

#

I do not currently have a computer, only an ipad. I am assuming I can use artisan? And only artisan? Is that correct? It looks like much of the site is devoted to people running stuff on their own computer systems, and that artisan is the only part of the site that is for creating on discord? Please ping me.

atomic mortar Apr 29, 2025, 1:10 PM

#

near seal I do not currently have a computer, only an ipad. I am assuming I can use arti...

Probably but you could look into other cloud services like civitAI, that allows you to use more tools and models then just stable-diffusion

#

However these are paid services

#

But using sdxl and 5 dollars should get you about 900-1500 images depending on you use loras or not

storm shard Apr 29, 2025, 2:44 PM

#

near seal I do not currently have a computer, only an ipad. I am assuming I can use arti...

You can try Draw Things. It's a free app, you can find it in the app store. You can join their discord server for help.

tribal glade Apr 29, 2025, 4:26 PM

#

storm shard You can try Draw Things. It's a free app, you can find it in the app store. You ...

I recommend but, more than likely, one will indeed need help getting it set up and going... lots of moving parts but I'm glad to see something like that available.

chilly heron Apr 29, 2025, 4:32 PM

#

hello i was wondering how good is stable diffusion at converting sketch to a nice looking image

#

in an accurate way chat seem to change it how it want it to look like not what i skatched

storm shard Apr 29, 2025, 4:43 PM

#

tribal glade I recommend but, more than likely, one will indeed need help getting it set up a...

There's nothing to set up, it has a complete interface. You do need to start with downloading the base models to get things working.
Next to their discord there also are helpful videos on yt (although due to the ongoing development of the app, a few of them are already outdated). @near seal

tribal glade Apr 29, 2025, 4:51 PM

#

storm shard There's nothing to set up, it has a complete interface. You do need to start wit...

I just happened upon the app last night, I had only minutes to try it out but something's not right because everything that generates looks like food that only cooked halfway. Lmao

sinful lantern Apr 29, 2025, 5:22 PM

#

Okay I have some doubt so see in my training my sdxl model the images which i have used to train the model aree all of same size 1216x832 and when I used trained model to generate images previously I was using output image size of 1024 X 1024 but images were not that much good but when I used exact output size of 1216X 832 then the generated images were amazing why is it like this if anyone can guide me in this why is it like this

abstract quarry Apr 29, 2025, 5:27 PM

#

resolution is part of the input the model might overfit on

#

if all your training images have same resolution, then the model associates the resolution with your training data (same way as you would associate a trigger word in the prompt)

#

just crop some of your training images to 1024x1024

jovial shoal Apr 29, 2025, 6:21 PM

#

Any recomendations on where to start getting into generative AI art? Video by Shadiversity got me interested in more complex use of ai to get better results than with using just one prompt at some site.

oblique elk Apr 29, 2025, 6:31 PM

#

chilly heron hello i was wondering how good is stable diffusion at converting sketch to a nic...

Well it depends on you sketch and what are the important features of your sketch. The object and their lineart, the color, size and composition,.... Depending on the used input and tools you have much control over the conversion.

sinful lantern Apr 29, 2025, 6:34 PM

#

abstract quarry just crop some of your training images to 1024x1024

Another thing what I observed is fir example if I try to generate my image of Suze 2048x 2048 than what it does like it makes bad image like for example if i want to generate image of pizza than in the generated image I will have multiple pizza just thrown away but if i generate exact size image than only one pizza comes up

#

Why is it like this

abstract quarry Apr 29, 2025, 6:36 PM

#

that's an artifact of the convolutions/unet. But in general you have to train on similar settings as you want to do inference

sharp cairn Apr 29, 2025, 6:50 PM

#

HYEEEAAAAAAAAAAAAAAAAH

sinful lantern Apr 29, 2025, 7:03 PM

#

abstract quarry that's an artifact of the convolutions/unet. But in general you have to train on...

So let me tell u what I m actually doing so my input image is actually of size 6969 X 4640 okay then when previously i was doing some experiments and making buckets with max resolution to 1024 x 1024 then the bucket size which is made is of size 1217X 832 which is 6 times smallee than my original image so what I did was i increased my max resolution to 4096X4096 so now the bucket which I got was 4096X 3792 which is better than previous ones and not in the training config file I m using exactly the same resolution of which is in my bucket because all of my images are of same size so what do u think is this correct way of getting high-resolution images at inference and not getting multiple pizzas

#

So i did wrong or right because for now my training is running

#

Wdyt

tribal glade Apr 29, 2025, 7:13 PM

#

👀 gotta let all the channels know bout that fiddy dollaz huh

nimble light Apr 29, 2025, 8:44 PM

#

What are the best AI Image to Video generators that accept watching videos to earn tokens

tribal glade Apr 29, 2025, 9:32 PM

#

nimble light What are the best AI Image to Video generators that accept watching videos to ea...

Well there are some that have a tradeoff like that... I'm not familiar with them tho. You could check em out.

main snow Apr 29, 2025, 10:48 PM

#

Is the stable diffusion or is the diffusion stable? Hence the question.

-Shakespear probably

main snow Apr 29, 2025, 10:49 PM

#

nimble light What are the best AI Image to Video generators that accept watching videos to ea...

Just use that wai one locally, results I've seen seem pretty goooood

nimble light Apr 30, 2025, 12:59 AM

#

main snow Just use that wai one locally, results I've seen seem pretty goooood

Wai?

main snow Apr 30, 2025, 12:59 AM

#

nimble light Wai?

Ye

nimble light Apr 30, 2025, 12:59 AM

#

What is that

main snow Apr 30, 2025, 1:01 AM

#

nimble light What is that

Not sure if checkpoint is the correct word for this, but it's basically that to make videos

nimble light Apr 30, 2025, 1:02 AM

#

So a Local AI Generator? Like Comfgy UI or Stable Diffusion? I have an RX 570, not the best GPU

main snow Apr 30, 2025, 1:04 AM

#

nimble light So a Local AI Generator? Like Comfgy UI or Stable Diffusion? I have an RX 570, n...

Stuff like those yea, I imagine even a1111 can do it

Dunno, I use Nvidia gpus

#

No clue if that one's good or not

nimble light Apr 30, 2025, 1:04 AM

#

What is the name of the Loca AI you said?

main snow Apr 30, 2025, 1:05 AM

#

Ya mean the app itself? It's really just called a1111

#

Serves me well for pictures, haven't tried videos just yet

#

Still, if a1111 can't do it I'm sure confy or swarm can

nimble light Apr 30, 2025, 1:07 AM

#

How mean like how do I run it? COmfy UI or what?

main snow Apr 30, 2025, 1:09 AM

#

nimble light How mean like how do I run it? COmfy UI or what?

depends on which of these ya wanna use lol

nimble light Apr 30, 2025, 1:09 AM

#

Comfy UI doesn't work for me? Any good alternative?

main snow Apr 30, 2025, 1:10 AM

#

nimble light Comfy UI doesn't work for me? Any good alternative?

try swarm, you can do stuff like this locally, this one was made locally https://civitai.com/images/73141789

nimble light Apr 30, 2025, 1:10 AM

#

So swarm uses my GPU, but is web based?

main snow Apr 30, 2025, 1:11 AM

#

idk man, never used swarm, i use a1111, ya gonna have to ask that in tech-support

#

i simply heard it was good

nimble light Apr 30, 2025, 1:11 AM

#

Gimme link to a1111

main snow Apr 30, 2025, 1:12 AM

#

nimble light Gimme link to a1111

you should probably make sure it can make videos first lol

#

as i said i'm not sure, never tried

nimble light Apr 30, 2025, 1:12 AM

#

Just gimme

main snow Apr 30, 2025, 1:12 AM

#

https://github.com/AUTOMATIC1111/stable-diffusion-webui

#

make sure to donwload the version

#

for amd gpus

nimble light Apr 30, 2025, 1:12 AM

#

main snow make sure to donwload the version

Ay Ay CAPTAIN

main snow Apr 30, 2025, 1:13 AM

#

happy to help

nimble light Apr 30, 2025, 1:14 AM

#

Wait

#

What do I do now? Idownloaded it,

#

Do I run Windows batch file

main snow Apr 30, 2025, 1:15 AM

#

now ya install it, i don't remember the exact steps and it's probably a bad idea to tell ya from memory lol

#

read it's page, i'm sure the instructions

#

are there

sinful lantern Apr 30, 2025, 3:58 AM

#

sinful lantern So let me tell u what I m actually doing so my input image is actually of size 6...

@abstract quarry please reply this

pseudo forge Apr 30, 2025, 4:00 AM

#

i have downloaded stable diffusion for the first time and have downloaded a model off civitai but i am getting an error when generating, it says the model is for Pony not sd1.5 like in the tutorials i'm watching, could someone give me some insight/tips? thank you

faint timber Apr 30, 2025, 12:01 PM

#

Revisit CivitAI and check the upper right-hand corner on the models page. There is a filters icon. From there, you can filter which model type to use. Skip Pony for now and try out the SD 1.5 first.

ancient mauve Apr 30, 2025, 12:51 PM

#

Guys is there any way of increasing training speed without affecting the model's quality?

#

I have enough vram

#

Wtf is that link

#

I just want to increase the training speed it's just that I don't know what values do I have to change to use more of the vram

lean timber Apr 30, 2025, 1:13 PM

#

Does anyone have recommendation for a SD model that is good at creating things other than characters? There's virtually an infinite amount on Civit.ai that are trained to be good at making people, but I want one that can create interesting spaceships, buildings and the like.

atomic mortar Apr 30, 2025, 1:21 PM

#

ancient mauve Wtf is that link

Spam links, just ignore

ancient mauve Apr 30, 2025, 1:22 PM

#

atomic mortar Spam links, just ignore

thought so

fervent thunder Apr 30, 2025, 1:48 PM

#

hey new here

#

does anyone here run stable diffusion locally?

ancient mauve Apr 30, 2025, 1:51 PM

#

fervent thunder does anyone here run stable diffusion locally?

I do

fervent thunder Apr 30, 2025, 1:52 PM

#

ancient mauve I do

Im in the market of getting a custom built PC and thinking of getting a GTA 3000 or 4000 graphics card series with a good number of VRAM so I can generate something like 90s cel anime like sailor moon or berserk or evangelion, or even something like aqua teen hunger force style

#

what are your recomendations?

ancient mauve Apr 30, 2025, 1:53 PM

#

fervent thunder Im in the market of getting a custom built PC and thinking of getting a GTA 3000...

I personally have a 4090 but Ive seen people with lesser GPUs having good generations

#

Idk what is your budget

fervent thunder Apr 30, 2025, 1:54 PM

#

ancient mauve Idk what is your budget

something less than $1000 if possible for good quality

ancient mauve Apr 30, 2025, 1:54 PM

#

But aqua teen it's extremely simple artatyle so you shouldn't need that much

#

There must be someone here better suited for answering this

fervent thunder Apr 30, 2025, 1:54 PM

#

ancient mauve But aqua teen it's extremely simple artatyle so you shouldn't need that much

what about cel anime like from the 90s?

ancient mauve Apr 30, 2025, 1:55 PM

#

It should work

#

If I'm not mistaken having realistic generations is harder than cartoons in general

fervent thunder Apr 30, 2025, 1:56 PM

#

ancient mauve If I'm not mistaken having realistic generations is harder than cartoons in gene...

so a 3000 series should do the trick I presume

#

my current computer is using a gtx 1060 6GB graphics card from 2016

ancient mauve Apr 30, 2025, 1:57 PM

#

I see people within the 3000 series making good generations

#

But I'm not sure

#

You should aim for the Max cost-perfomance gpu

#

But dunno where does it fit that limit today

fervent thunder Apr 30, 2025, 1:58 PM

#

ancient mauve I see people within the 3000 series making good generations

suppose I get my custom pc with that kind of graphics card and I set it up, what do I download to get the locally hosted stable diffusion to start and operate?

ancient mauve Apr 30, 2025, 1:59 PM

#

fervent thunder suppose I get my custom pc with that kind of graphics card and I set it up, what...

ForgeUI it's ok for a start

#

It's an user interface that helps you to make generqtions

fervent thunder Apr 30, 2025, 2:00 PM

#

ancient mauve It's an user interface that helps you to make generqtions

so I download forgeUI for that computer and its ready to go just using the GPU? can I access it over wifi from my android phone when Im away?

ancient mauve Apr 30, 2025, 2:00 PM

#

I like forgeUI for generations

ancient mauve Apr 30, 2025, 2:01 PM

#

fervent thunder so I download forgeUI for that computer and its ready to go just using the GPU? ...

You download the AI models you want to use and put it in the corresponding folder

#

Start the program and works at least for me

#

There are some YouTube videos with simple guides

fervent thunder Apr 30, 2025, 2:01 PM

#

ancient mauve You download the AI models you want to use and put it in the corresponding folde...

and those models I can select from forgeUI after downloading it right?

ancient mauve Apr 30, 2025, 2:02 PM

#

fervent thunder and those models I can select from forgeUI after downloading it right?

Dunno if you can directly download from forgeUI

fervent thunder Apr 30, 2025, 2:02 PM

#

ancient mauve Dunno if you can directly download from forgeUI

where do you usually download your models for yours?

ancient mauve Apr 30, 2025, 2:02 PM

#

I just take one model I like from civitai, put it inside the stable diffusion folder and it's usually good to go

fervent thunder Apr 30, 2025, 2:03 PM

#

ancient mauve I just take one model I like from civitai, put it inside the stable diffusion fo...

ah ok

ancient mauve Apr 30, 2025, 2:03 PM

#

fervent thunder where do you usually download your models for yours?

https://civitai.com

ancient mauve Apr 30, 2025, 2:04 PM

#

fervent thunder ah ok

Try to look for more opinions about what gpu is good for you

#

But if you have the money try to get something good

chilly heron Apr 30, 2025, 2:22 PM

#

oblique elk Well it depends on you sketch and what are the important features of your sketch...

what chat can i send the scatch in

#

@oblique elk i sent it in the other general chat can you take a look

quiet mason Apr 30, 2025, 3:43 PM

#

quick question
i want to use the hunyuan v2v model where i upload a video of my own and give a prompt which it then changes accordignly and gives back to me

but to install hunyuan do i just install it normally as a model and then get a workflow for v2v or is there a specific v2v model i need to install?

atomic mortar Apr 30, 2025, 4:01 PM

#

fervent thunder so a 3000 series should do the trick I presume

I used to run 3070ti (8gb vram) illustrious and sdxl is absolutely no problem for that model

#

Higher vram however allows you to run it faster

#

My 8gb vram back then was about 20-30s per image with 1-2, loras

small citrus Apr 30, 2025, 4:09 PM

#

(I am unable to send any image in this chat) Hello everyone I am technical officer at genotek, a product based company that manufactures expansion joint covers. Recently I have tried to make images for our product website using control net ipadapters chatgpt and various image to image techniques. I am giving a photo of our product. This is a single shot render of the product without any background that i did using 3ds max and arnold render.
I would like to create a image with this product as the cross section with a beautiful background. ChatGPT came close to what i want but the product details were wrong (I assume not a lot of these models are trained on what expansion joint cover are). So is there any way i could generate environment almost as beautiful as (2nd pic) with the product in the 1st pic. Willing to pay whoever is able to do this and share the workflow.

fervent thunder Apr 30, 2025, 4:09 PM

#

int4 flux is 6.64 GB so 8GB is ok

atomic mortar Apr 30, 2025, 5:08 PM

#

small citrus (I am unable to send any image in this chat) Hello everyone I am technical offic...

i recommend reposting in either #🏞｜general-with-images or #🌶｜off-topic to include images

vale furnace Apr 30, 2025, 6:09 PM

#

Does anyone have the detail++ Overall Detail SD1.5 embedding? It was removed from civitai and was curious where I can find it

oblique elk Apr 30, 2025, 7:20 PM

#

quiet mason quick question i want to use the hunyuan v2v model where i upload a video of my ...

Will use the basic generation model but within the Video sampler you would use images from the input video as samples input.

exotic sphinx Apr 30, 2025, 7:39 PM

#

ancient mauve I see people within the 3000 series making good generations

I've seen someone gen with a 2060

#

It's pretty nuts the optimizations some webuis do

#

I myself used to use a laptop GPU but that was with SD1.5

#

2060 was with Illustrious

#

Tho admittedly no clue about Forge, I use ComfyUI

fervent thunder Apr 30, 2025, 9:51 PM

#

I use CPU a lot even its fine

#

its doesn't have to be fast

shell peak Apr 30, 2025, 9:58 PM

#

Hi guys, not sure if I’m allowed to post this but I am looking to hire a LoRA trainer

tardy swan May 1, 2025, 3:20 AM

#

Yo

#

How can I get in contact with Ronaldo

opaque horizon May 1, 2025, 5:08 AM

#

Hi

#

how can I make spiderverse style images with dreamstudio ai

#

??

#

and animate them

ashen sleet May 1, 2025, 10:03 AM

#

yo

ancient mauve May 1, 2025, 10:09 AM

#

do you guys have a good to go formula depending on the number of images?

sinful lantern May 1, 2025, 10:16 AM

#

does anyone of you know how to generate image of width = 4096 and height = 2752 from sdxl model, and if it is then how i can do it and how to multi gpu infernce in sdxl

still glacier May 1, 2025, 10:20 AM

#

sinful lantern does anyone of you know how to generate image of width = 4096 and height = 2752...

can't natively SDXL is trained for 1024x1024, you have to upscale using hires.fix/img2img/outpaint/etc
no software (to my knowledge) does multi gpu inference. At best you can run one instance per gpu.

sinful lantern May 1, 2025, 10:21 AM

#

seee i generated the image of 2048 x 2048 with sdxl

#

liike i was using kohya inference sd-scripts tpo generate imag of this size

#

actuakky i generated image 2752 x 2752 too

#

so its not like its just limited to 1024 x 1024

#

have anyone of u saw inferernce scipt given in kohya repo

oblique elk May 1, 2025, 10:48 AM

#

sinful lantern does anyone of you know how to generate image of width = 4096 and height = 2752...

Generate an image with 688 height and 1024 width and upscale it. Either with some hires / tile controlnet etc. to add further details to the image or without to keep the amount of details from the first generation.

fervent thunder May 1, 2025, 11:23 AM

#

sinful lantern does anyone of you know how to generate image of width = 4096 and height = 2752...

you can do multi-gpu in diffusers

#

these days the models are getting so big that the larger models like cosmos or stepfun come with their own multi-gpu scripts as well

sinful lantern May 1, 2025, 11:29 AM

#

can u give me documenation or provide some link to it

fervent thunder May 1, 2025, 12:06 PM

#

yeah its this

#

https://huggingface.co/docs/accelerate/index

placid hatch May 1, 2025, 12:15 PM

#

What is the current stable diffusion experience for supported amd cards? Trying to get an idea of whether or not the 9070 is worth it for me when it gets full support

fervent thunder May 1, 2025, 12:27 PM

#

if you are willing to write custom kernel, driver and compiler code then amd can be ok

#

otherwise rly not

atomic mortar May 1, 2025, 12:56 PM

#

iirc amd is useable upto sdx

#

sdxl but you gotta pull some tricks for upscaling etc

placid hatch May 1, 2025, 1:23 PM

#

atomic mortar sdxl but you gotta pull some tricks for upscaling etc

What sort of stuff for upscaling and why?

atomic mortar May 1, 2025, 1:23 PM

#

tiled upscaling, zluda etc

weary socket May 1, 2025, 2:30 PM

#

Umm

atomic mortar May 1, 2025, 2:30 PM

#

scam dont click

weary socket May 1, 2025, 2:30 PM

#

Yes

vagrant meadow May 1, 2025, 4:11 PM

#

Hello, I'm new to SD and want to buy a GPU. Would it be better to buy a 12GB RTX 3060 or would it be worth spending $250 more for a 16GB RTX 5060 TI?

atomic mortar May 1, 2025, 4:12 PM

#

More vram the better generally

#

And considering the 5000 series is newer you'd get warranty too

vagrant meadow May 1, 2025, 4:14 PM

#

I don't live in USA and won't be able to claim warranty if anything goes wrong.

Would 4GB more be worth the $200-$250 more?

#

@atomic mortar

atomic mortar May 1, 2025, 4:25 PM

#

Depends on how much 200 dollars is worth in your country

vagrant meadow May 1, 2025, 5:05 PM

#

atomic mortar Depends on how much 200 dollars is worth in your country

Well 1 month rent for a studio apartment for a college student is like $300 USD here

placid hatch May 1, 2025, 6:23 PM

#

16GB can make a big difference, but may still not be enough depending on what you intend to do

vagrant meadow May 1, 2025, 7:39 PM

#

placid hatch 16GB can make a big difference, but may still not be enough depending on what yo...

I plan on using it for inpainting and text to image inference mainly. May also play around with audio but I would probably be using APIs for that

slim jacinth May 1, 2025, 10:07 PM

#

hello 😄

tribal crown May 1, 2025, 11:38 PM

#

what is the fastest workflow for wan? i am running it through comfyui and the least i can get is 5 minutes, i have a 4090. any faster workflow please? and maybe same quality. thanks 🙂

merry ginkgo May 2, 2025, 12:06 AM

#

Is there a quality program for downloading models off CIVITAI in Bulk? Looking to fill an 8TB hard drive with at risk models

ancient mauve May 2, 2025, 12:06 AM

#

anyone up?

atomic mortar May 2, 2025, 12:10 AM

#

tribal crown what is the fastest workflow for wan? i am running it through comfyui and the le...

Tea cache maybe but you'll lose quality for like a 20s improvement, video ai just takes that long

atomic mortar May 2, 2025, 12:10 AM

#

merry ginkgo Is there a quality program for downloading models off CIVITAI in Bulk? Looking t...

Hmm elaborate? Like in bulk by providing a link and click download? And it fetches all metadata?

#

swarmUI has a downloader integrated with civitAI apikey support

merry ginkgo May 2, 2025, 12:47 AM

#

atomic mortar swarmUI has a downloader integrated with civitAI apikey support

Noted, was looking to download all models with safetensor and XYZ tag

atomic mortar May 2, 2025, 12:48 AM

#

Well if you want to download all models you'd have a gigantic task ahead

merry ginkgo May 2, 2025, 12:52 AM

#

atomic mortar Well if you want to download all models you'd have a gigantic task ahead

Hey I got almost a month

heavy yacht May 2, 2025, 5:39 AM

#

Does anyone know what happened to the batch processing support added to SD Ultimate Upscale? - https://www.reddit.com/r/StableDiffusion/comments/11ul4t8/ultimate_sd_upscale_update_announce/

sinful lantern May 2, 2025, 6:22 AM

#

do u people really trust comfyui?

#

i really feel that the comfyui generated results are not goood because when i used to generated images with the help of comfyui its results are not good at alll rather than if u use raw code to generate images they are very amazing thats what i have observed

#

we cant blindly trust comfyui at all

atomic mortar May 2, 2025, 6:48 AM

#

sinful lantern we cant blindly trust comfyui at all

Its opensource no? if you don't trust it just dive into it or compile it your self

#

Forge however uses obfuscated code to hide they used code from comfy

sinful lantern May 2, 2025, 6:51 AM

#

yes ofcourse i m just saying like we cant blindly trust comfyui for genrating images, for example if someone is finetuuning a sdxl or sd3 model investing so much and they generate results from comfyui and if the results not come good they will think that training didnt went well

#

because thay happened with me

#

*that

#

but when i used raw code to generate images results changed so much

atomic mortar May 2, 2025, 6:51 AM

#

Just a case of samplers and schedulers

sinful lantern May 2, 2025, 6:52 AM

#

dis agreee

#

it cant change that much

atomic mortar May 2, 2025, 6:54 AM

#

You know comfy is just a bunch of nodes (code blocks) transmitting data to one node to another

#

You just made your own workflow

#

Tedious but workable for you

sinful lantern May 2, 2025, 7:11 AM

#

its not tedious i guesss, i just mean that we cant trust it blindly

valid aurora May 2, 2025, 7:24 AM

#

hi guys does anyone know any checkpoint that goes well with Vroid models? im still noob and my loras suck, and when i use different check points for Vroid the generated image style changes so much. also how imporant are regularization images for loras? ive been at it for a week but all my loras look like deformed monsters especially the eyes xd

#

sd 1.5 btw

ancient mauve May 2, 2025, 9:15 AM

#

can anyone please help me with some settings, Im getting crazy

#

and peopel instead of trying to help they just start yapping at you

#

I just need to understand somet things Im having trouble with

#

no one?

oblique elk May 2, 2025, 9:58 AM

#

ancient mauve no one?

If it is a technical problem feel free to ask your questions in the support channel if you have problems with prompting or prompt settings ask in the prompt channel. …. If someone can answer your question they will.

oblique elk May 2, 2025, 9:59 AM

#

sinful lantern its not tedious i guesss, i just mean that we cant trust it blindly

No blind trust every line of code is visible. Even people write their own sampler and nodes.

ancient mauve May 2, 2025, 10:10 AM

#

oblique elk If it is a technical problem feel free to ask your questions in the support chan...

may I assk you what training program do you use?

#

I just want to compare some training settings

oblique elk May 2, 2025, 10:17 AM

#

ancient mauve may I assk you what training program do you use?

Great question and would fit perfectly into #🔧｜finetune
And pretty sure others already recommended you Kohya, onetrainer and fluxgym.
The parameters are not comparable as it depends on so many different factors. Amount source images, concept / style or object. Close to the source model or far away. Overfitting useful for later generation etc.

ancient mauve May 2, 2025, 10:18 AM

#

oblique elk Great question and would fit perfectly into <#1026382406279770152> And pretty s...

yeah, Im in a million discord servers and I confuse the channels sorry

ancient mauve May 2, 2025, 10:18 AM

#

oblique elk Great question and would fit perfectly into <#1026382406279770152> And pretty s...

yeah parameters change between training programs, thats teh problem

#

I find settings that dont work for another trainer

#

and its a bit confusing

oblique elk May 2, 2025, 10:26 AM

#

ancient mauve I find settings that dont work for another trainer

Yes, copy and paste does not really work for training. At least if you do not want the exact same result as the tutorial, blog,… and you use the same input files and descriptions.
So no other choice by reading and understanding the effects of the different parameters. And then start with trial and error with a small and good labeled dataset.

ancient mauve May 2, 2025, 10:27 AM

#

oblique elk Yes, copy and paste does not really work for training. At least if you do not wa...

assuming everything is correctly tagged, if you an I have the same number images in more or less the same style, shouldnt I get good results with the same settings?

placid hatch May 2, 2025, 11:10 AM

#

Does anyone know why refine/upscaling uses up so much more vram on rocm compared to normal image generation or the equivalent process on nvidia cuda?

faint timber May 2, 2025, 12:49 PM

#

I would guess because internally it is upscaling the latents, which uses more ram.

tight yacht May 2, 2025, 1:10 PM

#

Anyone proficient with Stability Matrix and in general running models locally? I need some guidance on a couple of issues I'm encountering with certain models...

main snow May 2, 2025, 2:21 PM

#

tight yacht Anyone proficient with Stability Matrix and in general running models locally? I...

not with matrix but i do run sd locally

main snow May 2, 2025, 2:22 PM

#

placid hatch Does anyone know why refine/upscaling uses up so much more vram on rocm compared...

no clue, kinda somwthing we just gotta live with if we want to fix eyes and artifacting lol

fervent thunder May 2, 2025, 3:49 PM

#

placid hatch Does anyone know why refine/upscaling uses up so much more vram on rocm compared...

rocm is less optimised

#

especially the typical consumer rocm setup

placid hatch May 2, 2025, 6:36 PM

#

it seems to fill up my vram even if i am only upscaling by a really small amount. It makes me wonder if its some kind of bug

fervent thunder May 2, 2025, 6:41 PM

#

possibly yeah

#

only a tiny % of AI code is memory safe

somber gale May 3, 2025, 7:59 AM

#

Anyone good with hunyuan?

#

I wanna make a 3d model but ive had no luck

plush gale May 3, 2025, 11:43 AM

#

My name is Ziliah and I had a few breakdowns and pretty much decided to rebuild my life by being fit and healthy, yk

I like listening to music and reading novels
I currently own an online store that brings in some amount per week because I am passionate about helping my parents retire and achieving financial freedom.
I'm open to friendly conversations

sudden karma May 3, 2025, 2:05 PM

#

does anybody know a easy way to ai generate images for a book? it doesn't even need to bee the whole szene, one object theoretically would be enough, although the whole scene would be more impressive.

#

it would need to be some kind of text2text generation first (book2prompt), afterwards prompt2image

#

i would need one picture per page

atomic mortar May 3, 2025, 2:16 PM

#

hmm that would just be text to image though?

#

like you describe the object or scene

oblique elk May 3, 2025, 2:31 PM

#

sudden karma does anybody know a easy way to ai generate images for a book? it doesn't even n...

You might need to use a llm (ChatGPT, Gemini,..) to summarize the page and to focus on main subjects / main objects. Then let the llm create a prompt and your favorite image gen ai creates the image.
BUT for a bit of consistency I would add some style tags to avoid each image looking different (comic, real, anime, sketch,…).
Additionally you would need to remember the last object to avoid repeating images on each page if the text main object overlaps over many pages.
Another BUT if your book has a main object/ topic/ character that appears on multiple pages you will not get similar images. So a reader might be confused why the brunette woman from the start is now blonde. …

slim cove May 3, 2025, 2:35 PM

#

habby

sudden karma May 3, 2025, 2:49 PM

#

oblique elk You might need to use a llm (ChatGPT, Gemini,..) to summarize the page and to fo...

exactly those two BUTs are the issues i face. additionaly i don't have the money to pay for gemini / openai APIs and my local ressources are limited to a quadro p4000 (8gb), 32gb RAM, ryzen 7 3800x

#

flux.1-dev onnx fp4 (for scenes) and ssd-1b (for single objects) are the models i use

#

Would it be easier—or even possible—for an AI to pair a scene from a book page with the corresponding sharp frame from the movie adaptation? If so, how could that even be achieved?

sudden karma May 3, 2025, 2:55 PM

#

atomic mortar hmm that would just be text to image though?

that would be way to easy. just giving the book scene to sd or flux results in shitty images. it's text2text2image. i mean if you prompt an ai to generate an image, you would never write like an author of a book

atomic mortar May 3, 2025, 2:57 PM

#

sudden karma that would be way to easy. just giving the book scene to sd or flux results in s...

I mean if your scene has a lets say a rusty knife, just prompt for a rusty knife in said setting?

sudden karma May 3, 2025, 2:57 PM

#

i don't want to manually prompt 300+ images

atomic mortar May 3, 2025, 2:57 PM

#

Ahh thats the gist, yeah you need a llm for that to somewhat get a prompt out of it

sudden karma May 3, 2025, 2:57 PM

#

that's what i'm asking for. an ai generating the prompt from a given book scene

atomic mortar May 3, 2025, 2:58 PM

#

Chatgpt or a local one could do the trick yeah

sudden karma May 3, 2025, 2:58 PM

#

atomic mortar Ahh thats the gist, yeah you need a llm for that to somewhat get a prompt out of...

exactly

atomic mortar May 3, 2025, 2:58 PM

#

Even copilot could somewhat manage if its below 3k letters

sudden karma May 3, 2025, 2:59 PM

#

atomic mortar Chatgpt or a local one could do the trick yeah

gemma 3, bart, pegasus, t5, llama3.2... I've tried a lot, they just make it shorter, but nothing you could use as a prompt for an image generation model

atomic mortar May 3, 2025, 2:59 PM

#

Hmm youd need a solid system prompt

#

What kind of style of image would you use?

sudden karma May 3, 2025, 3:00 PM

#

not even when you include a manual description of every person/setting

atomic mortar May 3, 2025, 3:00 PM

#

Realistic? Anime?

sudden karma May 3, 2025, 3:00 PM

#

atomic mortar What kind of style of image would you use?

colored scratchbook drawing

sudden karma May 3, 2025, 3:02 PM

#

atomic mortar Hmm youd need a solid system prompt

i had one including description of every person/setting and a description of the style i want, output prompt was in sentences and too complex for flux

atomic mortar May 3, 2025, 3:02 PM

#

Hmm flux and sd could work

#

But hmm extremely long prompts arent image gens strong suit

sudden karma May 3, 2025, 3:03 PM

#

that's what i'm struggling with

atomic mortar May 3, 2025, 3:03 PM

#

As it will just ignore certain elements after a certain point

sudden karma May 3, 2025, 3:03 PM

#

book page 2 short image comma seperated gen prompt

atomic mortar May 3, 2025, 3:04 PM

#

Afaik theres no solution for that yet as far as im aware

#

Ahhh thats an xl prompt

sudden karma May 3, 2025, 3:04 PM

#

atomic mortar As it will just ignore certain elements after a certain point

or just add something to make the scenery more "realistic"

sudden karma May 3, 2025, 3:04 PM

#

atomic mortar Afaik theres no solution for that yet as far as im aware

fck...

#

thanks for your help anyway : )

atomic mortar May 3, 2025, 3:05 PM

#

Well the only way would be manual work

#

Hmmmm if its single item/people prompts however

#

Then you could maybe, ask the llm to highlight the most important person or object in the page

sudden karma May 3, 2025, 3:06 PM

#

or just a simple szene, just needs to be a background image for every single book page

atomic mortar May 3, 2025, 3:06 PM

#

Yeah you could do that

sudden karma May 3, 2025, 3:06 PM

#

atomic mortar Then you could maybe, ask the llm to highlight the most important person or obje...

the issue is the additional unnecessary stuff the llm will output. sth like "here is the highlight of the given text snipped"

atomic mortar May 3, 2025, 3:09 PM

#

sudden karma the issue is the additional unnecessary stuff the llm will output. sth like "her...

The user will show you a page of a book and you will generate an ideal prompt for generating a colored scratchbook photo for that model on Flux/FAL. You will confirm whether the model is an object/product, a person, a pet animal, or an art/photography style.

safe cliff May 3, 2025, 3:09 PM

#

Ironically for complex things I found it was easier to 3d model what I wanted manually then img2img to change the style

atomic mortar May 3, 2025, 3:09 PM

#

Your prompt should start with the first sentence setting overall context and containing the model name mentioned by the user. The overall prompt should account for location, model overview, expression and pose, angle of the shot, placement when the model is an object/product, lighting, colour palette, and styling.

#

Could be a decent start as a base prompt for gpt

sudden karma May 3, 2025, 3:11 PM

#

the scene thing... someway to get every different place/setting shown in the movie, image2image generation to remove everything in the centre of the image, and a simple python script to use those images as pdf background

sudden karma May 3, 2025, 3:11 PM

#

atomic mortar Could be a decent start as a base prompt for gpt

tysm!

#

i'll try later

atomic mortar May 3, 2025, 3:12 PM

#

Youd have to edit it though a little but i hope that gives you a way further

oblique elk May 3, 2025, 4:52 PM

#

sudden karma tysm!

Still thinking you could look tools like dfiy. They allow you to chain up different ai tasks (llm, scrape, image gen ). So you would create on ai agent to determine the key element of a given text (system prompt etc. ) then pass the result to another llm agent creating outstanding sketch image prompts with the input of the first agent. (Could even be 2 different llm ). Finally send the prompt to the third agent which uses for example a predefined comfyUI workflow to create the image.

iron flax May 3, 2025, 7:57 PM

#

Hello

alpine hazel May 3, 2025, 11:38 PM

#

Hello

#

If you have the option to work at home, please contact me. I understand how things can be challenging now and I want to help you find an appropriate opportunity. There are many vacancies to apply 💌

pulsar crane May 4, 2025, 2:42 AM

#

Hello

charred wadi May 4, 2025, 8:47 AM

#

Hello

ancient mauve May 4, 2025, 9:43 AM

#

is there any channel here to discuss trainings?

#

or another server for it

#

I dont want to spam this channel

atomic mortar May 4, 2025, 9:43 AM

#

Fine tune channel got removed a while ago

ancient mauve May 4, 2025, 9:44 AM

#

atomic mortar Fine tune channel got removed a while ago

where do I go then

atomic mortar May 4, 2025, 9:44 AM

#

And tbh even if you spam here nobody would mind

#

Its not like theres a big discussion going on

ancient mauve May 4, 2025, 9:46 AM

#

atomic mortar And tbh even if you spam here nobody would mind

what a difference with other servers, where they start crying because people ask questions

atomic mortar May 4, 2025, 9:47 AM

#

ancient mauve what a difference with other servers, where they start crying because people ask...

I mean I don't know if you have noticed, 300k members yet this morning a 6 hour gap between "hello"s

#

The only big active ones are anime and techsupport lmao

main snow May 4, 2025, 9:58 AM

#

atomic mortar The only big active ones are anime and techsupport lmao

Nothing wrong with us animu lovers 😭

atomic mortar May 4, 2025, 9:58 AM

#

I know, its one of the few channels im active in lol

main snow May 4, 2025, 9:59 AM

#

ancient mauve what a difference with other servers, where they start crying because people ask...

We got the opposite here once, dude was out here talking like we the webui CEO, demanding to make stuff work for him lmao

atomic mortar May 4, 2025, 9:59 AM

#

Once? All the time deadbread

main snow May 4, 2025, 10:00 AM

#

Well I only saw it live once

atomic mortar May 4, 2025, 10:00 AM

#

I only wish that people like donny just use off topic for their vague resume job search

#

Instead of posting it twice a week everywhere including techsupport

main snow May 4, 2025, 10:01 AM

#

Yes, not sure the chat is a good place for Job begging

atomic mortar May 4, 2025, 10:01 AM

#

Just use fiverr or community's made for it ngl

main snow May 4, 2025, 10:02 AM

#

Or apply to McDonalds

#

Put the fries in the bag as they say

atomic mortar May 4, 2025, 10:04 AM

#

Oh cool literally what we just talked about

ancient mauve May 4, 2025, 10:10 AM

#

atomic mortar The only big active ones are anime and techsupport lmao

Wait tech support is a genuine server? I thought it was spam

atomic mortar May 4, 2025, 10:10 AM

#

no?

atomic mortar May 4, 2025, 10:10 AM

#

ancient mauve Wait tech support is a genuine server? I thought it was spam

the channel #🤝｜tech-support

#

'>??

#

not the server

#

the sever is a scam

ancient mauve May 4, 2025, 10:10 AM

#

Ahh ok XD

atomic mortar May 4, 2025, 10:10 AM

#

the stable diffsion channel is not lmfao

ancient mauve May 4, 2025, 10:11 AM

#

I just need some place where I can discuss training settings with people

#

I got good generations but they could be better I think

atomic mortar May 4, 2025, 10:11 AM

#

iirc werent you banned in the one trainer server?

#

ah i see we dont share this one server

#

imma link it to youi

ancient mauve May 4, 2025, 10:13 AM

#

atomic mortar iirc werent you banned in the one trainer server?

I was very annoying to be honest

#

but I just need to ask some questions

#

something with some back and forth

oblique elk May 4, 2025, 10:16 AM

#

ancient mauve I was very annoying to be honest

Let me guess your question: can anyone share their working parameters for training. And best they work with resolution 2048x2048….

ancient mauve May 4, 2025, 10:16 AM

#

oblique elk Let me guess your question: can anyone share their working parameters for traini...

no

#

this is the kind of thing that pisses me off

main snow May 4, 2025, 11:41 AM

#

ancient mauve Ahh ok XD

If you join the tech support server they tell you to please wait a moment and DO NOT REDEEM

ancient mauve May 4, 2025, 11:46 AM

#

main snow If you join the tech support server they tell you to please wait a moment and DO...

That's what sucks about AI, it's full of those

#

Im gonna ask here too if its ok for you now that Im here

#

do you know that some games put the same exact image but with different face expressions for dialogue? like a visual novel
I was wondering if feeding all of those images helps the training
because they are the same images but not really; if I tag the images all the same except for those face expresions, the model will learn to differentiate?```

main snow May 4, 2025, 11:47 AM

#

ancient mauve That's what sucks about AI, it's full of those

What? That was a tech support scam joke lmao

ancient mauve May 4, 2025, 11:47 AM

#

main snow What? That was a tech support scam joke lmao

yeah thats what Im sayiingai servers are full of scams links

#

more than others

main snow May 4, 2025, 11:47 AM

#

ancient mauve ```Guys Im trying to train a character from a game do you know that some games p...

Should work, even cutting the original multiple times should work

ancient mauve May 4, 2025, 11:48 AM

#

main snow Should work, even cutting the original multiple times should work

I've heard that using the same image counts as data poisoning but maybe they are wrong dunno

main snow May 4, 2025, 11:48 AM

#

Not what heard, but hey the best way to really find out is try it

ancient mauve May 4, 2025, 11:48 AM

#

because otherwise, if I can use "variation" images as completely new ones, my dataset increases from couple of dozens of images to hundreds

main snow May 4, 2025, 11:49 AM

#

I remember hearing like 20 or so

#

Should be enough

ancient mauve May 4, 2025, 11:49 AM

#

main snow I remember hearing like 20 or so

that for a lora

#

if you want to do a full finetune you need more

main snow May 4, 2025, 11:49 AM

#

ancient mauve that for a lora

Yes

#

Not sure(?) my stuff comes out perfectly well

ancient mauve May 4, 2025, 11:50 AM

#

main snow Not sure(?) my stuff comes out perfectly well

Im getting good generations now after some tries, now Im just nitpicking

main snow May 4, 2025, 11:51 AM

#

Tagged you on the chat with images for example

ancient mauve May 4, 2025, 11:51 AM

#

so I suppose I have to change some tags and images

main snow May 4, 2025, 11:51 AM

#

Far as I remember reading, that char's Lora had about 20 or so pics

main snow May 4, 2025, 11:51 AM

#

ancient mauve so I suppose I have to change some tags and images

That does matter indeed yes

ancient mauve May 4, 2025, 11:51 AM

#

main snow Far as I remember reading, that char's Lora had about 20 or so pics

10-50 Ive heard

#

like 10-20 the bare minimum

main snow May 4, 2025, 11:51 AM

#

ancient mauve 10-50 Ive heard

This specific one is 20

ancient mauve May 4, 2025, 11:52 AM

#

main snow This specific one is 20

yeah and you also have to change the number of repetitions depending on dataset

main snow May 4, 2025, 11:52 AM

#

Ofc the more variety the better in Theory

main snow May 4, 2025, 11:52 AM

#

ancient mauve yeah and you also have to change the number of repetitions depending on dataset

I use the technique of just cutting pics with mine

#

Works fairly well

#

Not flawless I'm sure

ancient mauve May 4, 2025, 11:53 AM

#

main snow I use the technique of just cutting pics with mine

I mean yeah you can always rotate thgem and the ai will think of them as new imageS i SUPPSOE

main snow May 4, 2025, 11:53 AM

#

ancient mauve I mean yeah you can always rotate thgem and the ai will think of them as new ima...

Any small edit to the picture should do it, cut a corner or smth

ancient mauve May 4, 2025, 11:53 AM

#

I was thinking more something like this #🏞｜general-with-images message

main snow May 4, 2025, 11:54 AM

#

Not optimal, but it's a way to do it if you got low sample numbers

#

What about those?

ancient mauve May 4, 2025, 11:54 AM

#

like in these cases its clear that the artist amde 1 image and edited a bunch of face expresions with layers

main snow May 4, 2025, 11:54 AM

#

Yes, that should work perfectly fine

#

In fact a lot of facial expression variation is good

ancient mauve May 4, 2025, 11:55 AM

#

main snow Yes, that should work perfectly fine

what if they show the whole body/more stuff

#

and its like 90% of the image the same, except for the faces

main snow May 4, 2025, 11:55 AM

#

ancient mauve what if they show the whole body/more stuff

In theory should be fine too, don't see why it wouldn't

ancient mauve May 4, 2025, 11:56 AM

#

dunno let me search for some quick examples

main snow May 4, 2025, 11:56 AM

#

If it's a known char I suggest getting fan art, screenshots and much more

ancient mauve May 4, 2025, 11:56 AM

#

#🏞｜general-with-images message this for example

main snow May 4, 2025, 11:56 AM

#

Preferably in different styles

ancient mauve May 4, 2025, 11:56 AM

#

not only teh face but a whole body

main snow May 4, 2025, 11:56 AM

#

So it doesn't super glue to 1 style

ancient mauve May 4, 2025, 11:56 AM

#

is it better to crop and get the face only or just feed the whole image

main snow May 4, 2025, 11:57 AM

#

I'd feed it as is

#

I think it should be fine

ancient mauve May 4, 2025, 11:57 AM

#

main snow So it doesn't super glue to 1 style

At the moment im trying a character in an specific style so it shouldnt be a problem

main snow May 4, 2025, 11:57 AM

#

Ofc, assuming each of those is 1 pic

#

The same char multiple times like that in the same pic... Not sure that's a good idea

ancient mauve May 4, 2025, 11:57 AM

#

then the thing is tagging so they have the exact same tags except maybe 1 or 2

ancient mauve May 4, 2025, 11:57 AM

#

main snow The same char multiple times like that in the same pic... Not sure that's a good...

not in the same same pic, its just an example

main snow May 4, 2025, 11:58 AM

#

ancient mauve not in the same same pic, its just an example

Oh, should be fine then yea

ancient mauve May 4, 2025, 11:58 AM

#

just imagine that in the example I gave you they are separated in 4 diff images

main snow May 4, 2025, 11:58 AM

#

Yes, that should be fine

ancient mauve May 4, 2025, 11:58 AM

#

main snow Yes, that should be fine

that opens a lot of possibilites then

main snow May 4, 2025, 11:58 AM

#

Indeed it do

ancient mauve May 4, 2025, 11:58 AM

#

do you have any tips for tags

#

I just use WD14 then filter some things that could be wrong

main snow May 4, 2025, 11:59 AM

#

Well, best tip I can give you is making the trigger a unique word

ancient mauve May 4, 2025, 11:59 AM

#

main snow Well, best tip I can give you is making the trigger a unique word

I still dont get this trigger thing, do people use triggers or not, which is better

main snow May 4, 2025, 11:59 AM

#

Also the most words possible.

Some peeps only use 1 word for the whole thing and then it's a mess if you wanna change literally anything

ancient mauve May 4, 2025, 11:59 AM

#

for a character

main snow May 4, 2025, 11:59 AM

#

Like removing a hat

main snow May 4, 2025, 12:00 PM

#

ancient mauve I still dont get this trigger thing, do people use triggers or not, which is bet...

I do, so I advise doing so

ancient mauve May 4, 2025, 12:00 PM

#

main snow Like removing a hat

Imagine a character like Son Goku which always has the exact same hair

main snow May 4, 2025, 12:00 PM

#

ancient mauve Imagine a character like Son Goku which always has the exact same hair

You could just give the hair "Goku hair" tag to avoid any confusions

#

Or if you're brave lol short hair

ancient mauve May 4, 2025, 12:00 PM

#

do I remove black hair and spiky hair then?

main snow May 4, 2025, 12:00 PM

#

Do whatever you find better

ancient mauve May 4, 2025, 12:00 PM

#

do you have some examples you trained on?

main snow May 4, 2025, 12:01 PM

#

Trial and error

main snow May 4, 2025, 12:01 PM

#

ancient mauve do you have some examples you trained on?

Not sure I understand the question?

ancient mauve May 4, 2025, 12:01 PM

#

as a general rule,you have to remove the tags you want the model to learn right?

#

if you are training dunno, a goku character, you remove all the tags that define "Goku"

#

am I correct?

main snow May 4, 2025, 12:02 PM

#

Remove? I don't remember doing so no

#

Honestly, ya should just try the same tutorial I use lol

#

Used*

#

No expert by any means, simply had a guide to make mine

#

Or use Civitai to train it

#

There it's not free costs buzz

#

But it has an amazing in-depth guide

ancient mauve May 4, 2025, 12:05 PM

#

main snow Or use Civitai to train it

I preffer training locally

#

btw is there any reason characetrs eyes look weird?

#

like the bidy usually looks ok but the AI seems to have problems with the eyes, I see this with many models

main snow May 4, 2025, 12:06 PM

#

ancient mauve btw is there any reason characetrs eyes look weird?

lack of using hires fix i'd assume

#

it's amazing to fix eyes, give a lil more detail and fix artifacting

atomic mortar May 4, 2025, 12:08 PM

#

Yeah lack of hires or adetail

ancient mauve May 4, 2025, 12:08 PM

#

whats exactly highres fix

atomic mortar May 4, 2025, 12:08 PM

#

And if it's isn't a close-up it's not gonna look great without it

main snow May 4, 2025, 12:09 PM

#

ancient mauve whats exactly highres fix

idk the tech mambo jambo terms but for normal peeps like me: seems to make the picture twice but fixes eyes, adds detail and fixes artifacting

#

assuming you're using a111 like me there's a lil box you can tick to use it

#

there's many upscalers tho

#

i suggest remacri

#

never disapointed me

ancient mauve May 4, 2025, 12:12 PM

#

main snow idk the tech mambo jambo terms but for normal peeps like me: seems to make the p...

its just a high resolution upscaler then

main snow May 4, 2025, 12:12 PM

#

ancient mauve its just a high resolution upscaler then

guess so yea

ancient mauve May 4, 2025, 12:13 PM

#

main snow guess so yea

so you tag everything and add a trigegr word then?

main snow May 4, 2025, 12:13 PM

#

ancient mauve so you tag everything and add a trigegr word then?

to use the hires fix? it doesn't use any trigger word

ancient mauve May 4, 2025, 12:14 PM

#

no I mean for a chgaracter lora

main snow May 4, 2025, 12:14 PM

#

ancient mauve no I mean for a chgaracter lora

if you asking if you should tag everything in the char? i'd say so yes

#

for example if a char has a hat and there was no word for it... that shit ain't comming off lol

ancient mauve May 4, 2025, 12:14 PM

#

main snow if you asking if you should tag everything in the char? i'd say so yes

I heard both things, tag everything and add a trigger word or remove tags that define the charcater

main snow May 4, 2025, 12:15 PM

#

ancient mauve I heard both things, tag everything and add a trigger word or remove tags that d...

well, just telling ya my process, i don't remember removal

#

but ya can try both and see

#

which works better

#

sounds like a case of trial and error to me

fallow sky May 4, 2025, 12:25 PM

#

Hi

main snow May 4, 2025, 12:38 PM

#

Hello

abstract quarry May 4, 2025, 1:08 PM

#

ancient mauve ```Guys Im trying to train a character from a game do you know that some games p...

just train on a single image, that's fine

ancient mauve May 4, 2025, 1:08 PM

#

abstract quarry just train on a single image, that's fine

wdym

#

you cant train on a single image only

abstract quarry May 4, 2025, 1:08 PM

#

you can

#

you can also train iteratively: build a initial model by training on a single image. use this model to generate a larger set of training images (using e.g. controlnet to increase variety) and then train again

ancient mauve May 4, 2025, 1:40 PM

#

abstract quarry you can also train iteratively: build a initial model by training on a single im...

thats cool

#

I just wanted to know if I coul do this #🏞｜general-with-images message

#

I have tons of those and I f it helps if I tagg them correctly then Ill do it

abstract quarry May 4, 2025, 1:42 PM

#

it might help to teach the model which prompt stands for which expression

#

but I don't think it helps much with training a single face

ancient mauve May 4, 2025, 1:42 PM

#

abstract quarry but I don't think it helps much with training a single face

I have tons of images, its just that some are cloned like that

abstract quarry May 4, 2025, 1:43 PM

#

looks like rpgmaker face sets 😅

ancient mauve May 4, 2025, 1:43 PM

#

abstract quarry looks like rpgmaker face sets 😅

yeah its basically that

#

many Vns and games do that

#

if I feed it woth more different images it should help the training right as long as tags are good

#

soI increase my dataset like 10 times

abstract quarry May 4, 2025, 1:47 PM

#

what do you want to achieve?

#

I don't think that these face pics are complicated to generate with flux. I don't think you need a lot of training data

ancient mauve May 4, 2025, 1:56 PM

#

abstract quarry what do you want to achieve?

at this moment create a character lora

#

I already have good results but I want to improve it

#

its just that I dont have that many images but I do have that many images if I include teh face expressions avriations

viscid hollow May 4, 2025, 3:03 PM

#

hey

ancient mauve May 4, 2025, 3:05 PM

#

abstract quarry you can also train iteratively: build a initial model by training on a single im...

Gona try to train on a single image and all of its variations and see what goes

ancient mauve May 4, 2025, 4:12 PM

#

Btw does anyone know the purpose of image repeats in a dataset?

#

I know this option makes an image be processes that X amount of times during an epoch

#

But I don't know what does that really mean for the training

#

Why is it a thing?

oblique elk May 4, 2025, 4:23 PM

#

ancient mauve Why is it a thing?

You can use it for weighting. Let’s say 10 of your images are great you might repeat them 4 times. 10 are good you repeat them 3 times. And 20 image are for reference but not your training goal you might repeat them only one time. So you make sure the training is more biased towards your images.

ancient mauve May 4, 2025, 4:28 PM

#

oblique elk You can use it for weighting. Let’s say 10 of your images are great you might re...

So it's a way of giving more influence to some images and less to others

#

I've heard this is also good to increase if you have a small dataset vs a big one, to compensate

#

What I don't really get is that this logic points to a standard N° of steps threshold everyone is trying to imitate

#

But I thought there was no formula for this stuff

abstract quarry May 4, 2025, 4:33 PM

#

let's say you train a model on a specific style and your training images are men and women. You have 30 images of men but 90 images of women. So you set repeat on men to 3 such that in average the model is seeing the same amount of men as women

#

if your model would see much more women than men then it would get biased towards women. You see this very often in anime models where the model generates a women even if your prompt asked for a man

#

however, anime models are extremely biased. Having a 1:3 ratio would not induce such a strong bias

livid gale May 4, 2025, 4:48 PM

#

guys that the hell is DPM++ SDE CFG++

#

i cant fint it anywhere

#

krita diffusion seem to not have this sampler

#

there is dpm++ 2m cfg++ and dpm++ 2s a cfg++ but not sde

#

ohhhh its only comfy and reforge thing

ancient mauve May 4, 2025, 4:53 PM

#

abstract quarry let's say you train a model on a specific style and your training images are men...

That makes sense thanks

ancient mauve May 4, 2025, 4:55 PM

#

abstract quarry let's say you train a model on a specific style and your training images are men...

I can also set repeats to 1 and just copy paste the images in my folder

#

I think in some GUIs you can only set repeats to ALL the dataset, not just some images

#

Is that the same time basically or the logic behind it doesn't exactly work like that?

abstract quarry May 4, 2025, 5:01 PM

#

ancient mauve Is that the same time basically or the logic behind it doesn't exactly work like...

yes, it's the same, just needs more preprocessing time and disk storage

ancient mauve May 4, 2025, 5:06 PM

#

abstract quarry yes, it's the same, just needs more preprocessing time and disk storage

So as you said, if I have repeated images like those RPGmaker sheets or visual novels, even if I tag the poses and the elements in those images, the model will be more biased to show those at generation with neutral prompts

#

So I should have that in consideration while prompting and while tagging because it will have a bias with those repeated elements, like character poses

#

I'm only assuming, but I had some generations copy the poses of my database if not specified

#

For example, if I say "looking at front" sometimes looks from a diagonal, because my dataset has those, it is *technically looking at front (dunno if I'm explaining it well)

buoyant oracle May 4, 2025, 7:00 PM

#

is 4070 fine for creating lora?

floral umbra May 4, 2025, 7:39 PM

#

buoyant oracle is 4070 fine for creating lora?

Absolutely. The only restrictions you have is vram depending on the dataset and which checkpoint training is targeted for.

main snow May 4, 2025, 8:34 PM

#

If you're willing to spend a lil you can also train on Civitai, that way you're not dependant on hardware.

Think it's about 5 euros to train a Lora

#

And far as I'm aware they do come out well (?) dunno, never done myself

atomic mortar May 4, 2025, 9:41 PM

#

main snow And far as I'm aware they do come out well (?) dunno, never done myself

works pretty good imo

mortal bison May 5, 2025, 7:48 AM

#

Morning

#

What kind of systems are you guys running? Because mine crashes on wan video generating (3080TI)

atomic mortar May 5, 2025, 9:33 AM

#

5080, takes bout 10-12min

fickle flicker May 5, 2025, 10:36 AM

#

Hi, when I try to use any name for example "GLOBE" the ai generates an image with incorrect letters. eg. it would make it "GLLOBE". any idea how to correct this?

atomic mortar May 5, 2025, 11:41 AM

#

fickle flicker Hi, when I try to use any name for example "GLOBE" the ai generates an image wit...

Assuming your using XL, luck

vestal jetty May 5, 2025, 12:02 PM

#

Sharing this in case anyone else is deep into messing with GPT model and figuring out jailbreaks prompts. Its a list of prompts and models that the prompts can be used on. uncensored/jailbroken.

Not mine. Just figured someone else might want to check it out:https://shrinke.me/Z6zVWp
If the link dies, I’ll try to reupload it later. No idea how long it’ll stay up.

fickle flicker May 5, 2025, 12:04 PM

#

atomic mortar Assuming your using XL, luck

What changes do I need to make.. i was using Foocus

atomic mortar May 5, 2025, 12:04 PM

#

fickle flicker What changes do I need to make.. i was using Foocus

Xl isnt the greatest at text

#

Flux dev /sd3.5 large is better at it but still not perfect

fickle flicker May 5, 2025, 12:05 PM

#

atomic mortar Xl isnt the greatest at text

ok... looking to create a logo

atomic mortar May 5, 2025, 12:05 PM

#

I recommend a textless logo or photoshopping after ngl

fickle flicker May 5, 2025, 12:05 PM

#

atomic mortar I recommend a textless logo or photoshopping after ngl

ok... and what is ngl?

atomic mortar May 5, 2025, 12:06 PM

#

Ngl = " not gonna lie"

#

It's internet language

fickle flicker May 5, 2025, 12:06 PM

#

atomic mortar Ngl = " not gonna lie"

ok, but what is photoshopping after ngl

atomic mortar May 5, 2025, 12:07 PM

#

Do you know photoshop?

fickle flicker May 5, 2025, 12:07 PM

#

yes

atomic mortar May 5, 2025, 12:07 PM

#

Well youd need to photoshop said result to try and fix the text probably

fickle flicker May 5, 2025, 12:08 PM

#

atomic mortar Well youd need to photoshop said result to try and fix the text probably

ok... thats sounds good! thanks for the suggestions... appreciate it!

fickle flicker May 5, 2025, 12:08 PM

#

atomic mortar Flux dev /sd3.5 large is better at it but still not perfect

isn't there any sd model that is good at text?

atomic mortar May 5, 2025, 12:08 PM

#

What gpu do you have?

fickle flicker May 5, 2025, 12:11 PM

#

atomic mortar What gpu do you have?

3060 12GB

atomic mortar May 5, 2025, 12:12 PM

#

Hmm im not sure if you could use flux dev, Maybe someone else could chime in for better text results

oblique elk May 5, 2025, 1:56 PM

#

fickle flicker 3060 12GB

Well depending on how you want to integrate your text within the logo. Flux and SD3.5 are ok for generating images with short text even short sentences.
Another way would be a simple depth controlnet with the text as source and for example sdxl as output.

fickle flicker May 5, 2025, 2:03 PM

#

oblique elk Well depending on how you want to integrate your text within the logo. Flux and ...

yes its a short text for logo, is there a specific way to prompt it to avoid the spelling mistakes it makes?

#

@oblique elk thanks for the alternatives

ancient mauve May 5, 2025, 3:17 PM

#

do you use basic and specific tags or only specific tags for prompts?
like eyes, blue eyes

#

with automatic taggers I get many clothing tags

#

and I dont know if I have to leave coat, white coat, white long coat or only white long coat

ancient mauve May 5, 2025, 4:24 PM

#

@oblique elk you know when you use wd14 you get tags

#

Or while prompting, you can get tags that include other tags

#

Maybe eyes is a bad example

#

For example if a character has a white coat, do you put coat, white coat

#

Or do you only put white coat

#

What is better?

atomic mortar May 5, 2025, 4:34 PM

#

Ahh bottom one

#

Personally

#

Both work

#

Its trail and error ngl

#

No one size fits all solution

ancient mauve May 5, 2025, 5:02 PM

#

atomic mortar Its trail and error ngl

Did you get to see any patterns?

#

Maybe if it's a very complicated images, too many tags is not good

#

Or maybe more detail is better, but if a model is already trained it should at least have some kind of knowledge right?

atomic mortar May 5, 2025, 5:07 PM

#

ancient mauve Did you get to see any patterns?

I got a white dress yeah

#

Or coat

ancient mauve May 5, 2025, 5:07 PM

#

XD

atomic mortar May 5, 2025, 5:09 PM

#

But honestly tag based prompting is nice for smaller prompts to keep it accurate as possible

#

But once you get complex prompts, cant beat natural language

#

Illustriousv3 will support it

#

Illustriousv2 works sometimes with it but also does tags

#

Illustrious v0.1 does only tags

nimble slate May 5, 2025, 6:03 PM

#

Hey

#

i need a bit of help

#

ded chat awww

#

😦

atomic mortar May 5, 2025, 6:05 PM

#

nimble slate i need a bit of help

why not post the question while you wait for someone to come online eh?

nimble slate May 5, 2025, 6:06 PM

#

i wanna stream

#

i just wanna know how workflow works

#

🤷

atomic mortar May 5, 2025, 6:06 PM

#

you wanna stream AI?

nimble slate May 5, 2025, 6:07 PM

#

no i need help with setting up workflow

atomic mortar May 5, 2025, 6:07 PM

#

well if you wanna stream comfyUI workflows im afraid your gonna need to dive into it a little

nimble slate May 5, 2025, 6:07 PM

#

stable diffusion

atomic mortar May 5, 2025, 6:07 PM

#

though if you use forge its pretty much working straight out of the box

#

yes thats a model

nimble slate May 5, 2025, 6:07 PM

#

i just wanna make some spaceships 😭

atomic mortar May 5, 2025, 6:07 PM

#

comfy/forge/swam/foooocus are the US's using the model

#

what GPU do you have?

nimble slate May 5, 2025, 6:08 PM

#

uhh

atomic mortar May 5, 2025, 6:08 PM

#

its important to know before i recommend anything really

nimble slate May 5, 2025, 6:08 PM

#

NVIDIA GeForce RTX 2060 SUPER

#

and i have 16 gigs

#

i can run stable diffusion

#

but idk how to make images 😭

atomic mortar May 5, 2025, 6:09 PM

#

a 8gb vram is working for xl

#

https://github.com/CS1o/Stable-Diffusion-Info/wiki/Webui-Installation-Guides i recommend Forge (since theres more guides )

nimble slate May 5, 2025, 6:10 PM

#

i just need help with one thing 😭

#

its not that serious

atomic mortar May 5, 2025, 6:10 PM

#

its not just one thing lol

nimble slate May 5, 2025, 6:10 PM

#

oh

atomic mortar May 5, 2025, 6:10 PM

#

you need help installing the UI it seems

nimble slate May 5, 2025, 6:10 PM

#

i wish i could post pictures

#

can u vc?

atomic mortar May 5, 2025, 6:11 PM

#

https://www.youtube.com/watch?v=zqgKj9yexMY&list=PL-pohOSaL8P_VxpGxcay1EJFtqX4m8WqZ this video will help the way you describe it

#

you can post images in #🏞｜general-with-images

nimble slate May 5, 2025, 6:11 PM

#

k

atomic mortar May 5, 2025, 6:11 PM

#

atomic mortar https://www.youtube.com/watch?v=zqgKj9yexMY&list=PL-pohOSaL8P_VxpGxcay1EJFtqX4m8...

i DO recommend following the installation guide from CS1o however in the github link i posted above

#

yeah i see the image

#

the video will help in this case 👍

nimble slate May 5, 2025, 6:13 PM

#

posted

chrome jacinth May 5, 2025, 8:54 PM

#

why is stable diffusion only capable of generating one image per prompt?

atomic mortar May 5, 2025, 9:27 PM

#

chrome jacinth why is stable diffusion only capable of generating one image per prompt?

Depending on the UI you can do multiple images at once but theres generally no speed improvement

#

I can make 3 images concurrently but i save about 0.7s or cost 1.5s on my 5080

#

So its not really worth it to do it like that on consumer hardware

thin helm May 5, 2025, 10:24 PM

#

what settings are great for flexible character illustrious lora?

flexible as can handle more loras, styles, concepts, etc? and not getting baked..

#

im using civitai's trainer btw

noble tiger May 6, 2025, 3:39 AM

#

Hello everyone!! So i have some questions about stable diffusion models. What model do you think is the best and can you recommend me one?

atomic mortar May 6, 2025, 4:02 AM

#

noble tiger Hello everyone!! So i have some questions about stable diffusion models. What mo...

It completely depends on your usecase and hardware

#

You want anime? Illustrious type models, you want realism? 3.5 large/flux

#

Dont got the vram? Sdxl

#

If you got less then 6? Well i think sd 1.5 is pretty cool still

olive lintel May 6, 2025, 5:59 AM

#

hey guys!

I'm akash and i head bd at Hive intelligence. We're building infrastructure for AI agents

have a few synergies in mind and would love to connect w the bd team at Stability AI to discuss this in detail

can anyone from the team help me connect to the right POC?

ancient mauve May 6, 2025, 2:52 PM

#

I think someone here said it before, but tags are "aware" right? if I make a custom tag in my training or prompting likw potatocamp the AI knows about potatos and camps and it will have somme kind of influence over that tag

#

talking about wd14 tagging

#

so some tags dont have to resemble what is only shown on screen, but also concepts

floral umbra May 6, 2025, 3:45 PM

#

ancient mauve I think someone here said it before, but tags are "aware" right? if I make a cus...

Itĺl be more like a trigger word. to achieve this direct result in flux, i used ¨sp1ky¨ for examplehttps://discordapp.com/channels/1002292111942635562/1004159122335354970/1362174882330316870

ancient mauve May 6, 2025, 3:46 PM

#

floral umbra Itĺl be more like a trigger word. to achieve this direct result in flux, i used ...

but it will have that context integrated?

floral umbra May 6, 2025, 3:47 PM

#

Yep. In this case, the trick is to ¨tell¨ the training that X thing is actually Y, so instead of saying itś actually porcupine needles, i tell it that the handfull of needles are ¨hairstraws¨, and the training learns that.

ancient mauve May 6, 2025, 3:53 PM

#

floral umbra Yep. In this case, the trick is to ¨tell¨ the training that X thing is actually ...

what I mean imagine Im tagging something that has nothing to do witrh potatos or camps, nothing. If I use a custom trigger word like potatocamp, even if its all in a single word and not 2 separated words, it will "learn" some context about potatos and camps?

#

so maybe randomly in one of my generations I get a random potato generated?

floral umbra May 6, 2025, 5:03 PM

#

ancient mauve what I mean imagine Im tagging something that has nothing to do witrh potatos or...

Yep, hence triggerword. A all in one unique word/¨password¨ if you will to unlock the loraś full training trigger

ancient mauve May 6, 2025, 5:04 PM

#

floral umbra Yep, hence triggerword. A all in one unique word/¨password¨ if you will to unloc...

how am I sure then that Im using a trigger word that only focuses on the things I want to train on

#

with characters I suppose I could use a triggerword that already has the name of the character on it

#

like duno CustomGoku

#

it osnt really Goku but it will get characteristics from it right?

desert dagger May 6, 2025, 6:03 PM

#

ancient mauve so some tags dont have to resemble what is only shown on screen, but also concep...

you want your keyword for your lora to be something that's not going to be normally used in a prompt so it's not triggered accidently

#

so you wouldn't use potatocamp - you would use p0t@toc@mp

ancient mauve May 6, 2025, 6:06 PM

#

desert dagger so you wouldn't use potatocamp - you would use p0t@toc@mp

so its context aware then

ancient mauve May 6, 2025, 6:07 PM

#

desert dagger so you wouldn't use potatocamp - you would use p0t@toc@mp

in this case to confuse the AI and not trigger potato generations or concepts related to that

#

ty

desert dagger May 6, 2025, 6:08 PM

#

ancient mauve so its context aware then

very, yes.

#

even a . in your prompt has an effect on what the AI draws. or a , or a ; <--- those are noise, but it's aware that: this is an apple. and this... is an apple aren't really the same phrase and don't really mean the same thing

ancient mauve May 6, 2025, 6:12 PM

#

desert dagger even a . in your prompt has an effect on what the AI draws. or a , or a ; <--- t...

and If you have something liek a mix of colors what tag do you use?

#

I have like a white greyis background for some images

#

do I use white background, gtey background, both?

desert dagger May 6, 2025, 6:15 PM

#

ancient mauve and If you have something liek a mix of colors what tag do you use?

i use phrases like "red to gold gradient"

ancient mauve May 6, 2025, 6:15 PM

#

desert dagger i use phrases like "red to gold gradient"

im using wd14 tags

desert dagger May 6, 2025, 6:15 PM

#

remember that the AI wasn't trained on color charts, or pantone colors. so try to stick with the common terms you'd find out there on the net

ancient mauve May 6, 2025, 6:16 PM

#

using white background grey, background make sthe model mix concepts or does it confuse it

desert dagger May 6, 2025, 6:16 PM

#

ancient mauve im using wd14 tags

run those tags as the only thing in your prompt to see what the AI understands them to be. you might be surprised

ancient mauve May 6, 2025, 6:22 PM

#

desert dagger run those tags as the only thing in your prompt to see what the AI understands t...

I dont really get a clear answer

#

it also generates more stuff in a simple model like sdxl

#

I dont know what to do, iare models able to mix colours

#

or is it better to stick to the color that resembles it more

desert dagger May 6, 2025, 6:23 PM

#

ancient mauve it also generates more stuff in a simple model like sdxl

the thing is, those tags aren't giving you what you think they are. to the AI they are probably jsut noise. unless you train a lora on those specific colors with those specific tags, the AI isn't gonna have a clue.

#

stick with common color names that you'd find thousands of times in the google webscrape database

ancient mauve May 6, 2025, 6:23 PM

#

but tags have information on their own as yous aid right?

desert dagger May 6, 2025, 6:24 PM

#

ancient mauve but tags have information on their own as yous aid right?

not if they aren't in the data training set for the model, or a lora you are using with it

ancient mauve May 6, 2025, 6:24 PM

#

if the model you train on already knows what a white background or a grey background is

desert dagger May 6, 2025, 6:24 PM

#

ancient mauve if the model you train on already knows what a white background or a grey backgr...

sure. because if you go search google for 'white background" you get millions of hits

ancient mauve May 6, 2025, 6:24 PM

#

like these are wd14 tags which mean there are already images with those tags trained on

desert dagger May 6, 2025, 6:24 PM

#

and so that's likely a term in the database

ancient mauve May 6, 2025, 6:25 PM

#

so if I have a greyiss white background, do I use white, grey or both

desert dagger May 6, 2025, 6:25 PM

#

ancient mauve like these are wd14 tags which mean there are already images with those tags tra...

neither. you use teh term "light grey"

ancient mauve May 6, 2025, 6:25 PM

#

if I want to recreate the background in my images as close as possible?

desert dagger May 6, 2025, 6:25 PM

#

ancient mauve if I want to recreate the background in my images as close as possible?

then you use photoshop

ancient mauve May 6, 2025, 6:25 PM

#

desert dagger then you use photoshop

its just an example, imagine more concepts like that

#

concepts that are mixes of wd14 tags

desert dagger May 6, 2025, 6:26 PM

#

think of the AI as a fancy sort of camera - you get raw footage out of it. you don't have perfect control and you'll need to do post production work

#

color matching like that is post production work

ancient mauve May 6, 2025, 6:26 PM

#

you dont really have a tag to define them or the tags are too abstcat, in this case, I cant really knwo what grade of white or grey the tags reffer to

ancient mauve May 6, 2025, 6:26 PM

#

desert dagger think of the AI as a fancy sort of camera - you get raw footage out of it. you d...

at the moment im just tagging, I just want to get as close as possible now that im doing it

desert dagger May 6, 2025, 6:27 PM

#

ancient mauve you dont really have a tag to define them or the tags are too abstcat, in this c...

that would be why you trained a lora on that specific concept. so that when you used a specific tag for a color, the AI knew exactly what color to create

#

give me one of those tags that you're using please.

ancient mauve May 6, 2025, 6:27 PM

#

so while training, the model "changes" the meaning of tags a bit then

desert dagger May 6, 2025, 6:28 PM

#

ancient mauve so while training, the model "changes" the meaning of tags a bit then

no, it doesn't.

ancient mauve May 6, 2025, 6:28 PM

#

so if for example you use white background in the base sdxl model, you get white backgrounds, but if in a training all images have a more greyiss background and then you train on those and you generate images with white background prompt, you will get more greyiss backgrounds

desert dagger May 6, 2025, 6:28 PM

#

ancient mauve so if for example you use white background in the base sdxl model, you get white...

can you please tell me one of those tags you're using

ancient mauve May 6, 2025, 6:29 PM

#

desert dagger can you please tell me one of those tags you're using

white background XD

#

and white hair

#

do I use white hair, silver hair

desert dagger May 6, 2025, 6:29 PM

#

what do you consider a "wd14 tags"

ancient mauve May 6, 2025, 6:29 PM

#

there are 2 ways of doing prompts

#

doing descriptions

#

or, doing, tags, like, I, am, doing

desert dagger May 6, 2025, 6:30 PM

#

what do you consider a "wd14 tags"

desert dagger May 6, 2025, 6:30 PM

#

ancient mauve or, doing, tags, like, I, am, doing

those aren't tags. that's just you adding noise injection by the use of commas, into your prompt

#

that's not a phrase, with proper punctuation, so the ai won't look at is as a concept. it'll use the commas as noise

ancient mauve May 6, 2025, 6:31 PM

#

desert dagger what do you consider a "wd14 tags"

you dont know what wd14 is?, its an auto tagger for anime

desert dagger May 6, 2025, 6:32 PM

#

ancient mauve you dont know what wd14 is?, its an auto tagger for anime

and is there any reason you think a model not trained to understand those tags would have any idea what they mean?

ancient mauve May 6, 2025, 6:33 PM

#

desert dagger and is there any reason you think a model not trained to understand those tags w...

there are models trained with those right?

#

like, its something very common with AI image generations

#

for a while now

desert dagger May 6, 2025, 6:35 PM

#

ancient mauve there are models trained with those right?

probably the pony models

atomic mortar May 6, 2025, 6:36 PM

#

pony and illustriousmodels are trained on danbooru etc no

desert dagger May 6, 2025, 6:38 PM

#

forgot about illustrious - yeah, those are the models you want to use with those tags

ancient mauve May 6, 2025, 6:49 PM

#

desert dagger forgot about illustrious - yeah, those are the models you want to use with those...

Ah ok

#

Was asking because that tag system isn't something fluid like a long descriptions

#

So you can't say greyish white if there is no such tag in the first place

#

So maybe mixing was the correct way to do it, if anyone knows please let me know

desert dagger May 6, 2025, 6:58 PM

#

ancient mauve So maybe mixing was the correct way to do it, if anyone knows please let me know

if you want the tags to actually work, use the models that are trained to understand them. those will be the pony models and the illustrious models. no other model is going to ahve a clue what you mean when you use the tag

hybrid linden May 6, 2025, 7:29 PM

#

Hello! In AI image generation field, what can you do with 24GB of VRAM you can't with 20GB? I have a choice between RTX 3080 Ti 20GB for 45.000 rubles (550$~) and RTX 3090 for 65.000 rubles (800$~). They are almost the same in performance, so the difference is in VRAM capacity and price.

desert dagger May 6, 2025, 8:24 PM

#

hybrid linden Hello! In AI image generation field, what can you do with 24GB of VRAM you can't...

you can load models that require 24 gig of vram into 24 gig of vram, and you can't load them into 20g

fervent thunder May 6, 2025, 8:31 PM

#

hey everyone how are you

neat solstice May 6, 2025, 9:33 PM

#

desert dagger you can load models that require 24 gig of vram into 24 gig of vram, and you can...

no you cant

keen parcel May 6, 2025, 10:40 PM

#

fervent thunder hey everyone how are you