#💬|general-chat

1 messages · Page 137 of 1

low moon
#

its way sharper and cleaner images compared to sdxl still

#

it matured nicely

#

sdxl is more refined tho like the lighting and compotition is more "sophisticated" if that makes sense

#

cascade woiuld be kick ass if people did somethign with it but whatever

#

lets wait for SD3 instead and complain even when it drops that it snot as good as promised XD

rich kestrel
#

i mean its already out

#

u can test it. and it obviously isnt as good

#

its all a matter of how easily finetunable it is

#

I havent trained anything outside 1.5

trail lion
#

sure 1.5, win95 is king too. trumpet winsock all the way

fervent thunder
#

Photoshop is awesome

#

AI art can also be duplicated using GIMP photo editor manually

#

Creating art manually will take time obviously

trail lion
#

rotary phones, dont forget those, still king

#

anyhoo...back to reality

dreamy acorn
#

How do I get started?

cursive magnet
pearl ocean
#

same

#

you need more VRAM not RAM

rich kestrel
#

ram really stops being useful after 32gb

#

vram on the other hand

#

games and AI are only going to get more powerful

visual hill
#

Is anyone familiar with Ultimate SD Upscale? I'm trying to see if I can use it to resize and outpaint additional width to an image. I keep running out of memory when trying to to use inpaint to outpaint since the image is 3456x5120, would like to resize to 4000x5120 and outpaint the new dimension

narrow kernel
#

look into using tiling and tiled vae

rich kestrel
#

ultimate sd upscale =/= outpainting

#

outpaint first and then upscale

teal pagoda
#

exactly, like I said

#

money talks in this world, unfortunately

#

buy HDDs and store as many good models/ControlNets as you can :))

#

if you really know the settings, you can rent a good GPU for like 0.7$ per hour and train with it.

#

won't cost you very much in total

raven fractal
#

hi, everyone

wind arrow
#

How do I create an image , what do I have to write after# to create an image on this discord of stable difusion, and aslo which chat do I have to go in to

ivory willow
#

谁来告诉我这里怎么用SD的机器人

unkempt hatch
#

Что ты говоришь, здесь наверное нельзя русский

#

Anyhoo, hear me out. LLM trained almost exclusively on the entire bible.

#

That's an LLM that would be fun to talk to

fervent thunder
#

hello where can i request stuff to be made ?

last kraken
fervent thunder
last kraken
#

civitai has a free generator

small prism
#

Hello can anyone suggest same feature that Runway uses for Erase and replace (ai-tools/erase-and-replace) in Stable Diffusion sdxl? I have used inpainting but i cannot replicate the same through prompt which runway does.

fervent thunder
last kraken
fervent thunder
teal pagoda
#

the bots are not working anymore

floral umbra
#

Hoi, do you guys know of a upscale model trained on enhancing blurry/low res images with barely readable text?

wind arrow
red kiln
#

Is there a starting guide for noob how to use SD?

static cape
#

Please don't let us dwindle away without news for another week...

trail lion
#

Why not?

gloomy yarrow
#

Hi

#

Can anyone help? We signed up for stability.ai membership but didn’t get anything? What should we do?

trail lion
#

What are you expecting to get?

gloomy yarrow
#

Like any email from lol

#

We haven’t received any changes on our account 👀

trail lion
#

fwiw this is the community area, they have a contact link on their website

tepid osprey
#

GM☀️😎

hard orbit
#

what is the going price for a single image that is very specific to be created. professional image, not adult.

unkempt hatch
#

what's the best upscaling model now?

#

I don't mean workflow, just a .pt file to put inside my upscale node. Is 4xultrasharp still the king?

sage reef
#

best upscaling model for what? they are not all for the same purpose anyway, some are better at anime specifically, others for restoration, for photorealistic, etc

unkempt hatch
#

photrealistic people

loud solar
#

Playing with RealESRGAN_x4plus.pth at the moment

dense sable
#

what is the diffrence between r/StableDiffusion & "Green Check" Stable Diffusion server?

charred mesa
#

Well technically they are both under stability's control (if I recall correctly)

#

otherwise, I don't know about difference between just the fact that discord isn't reddit and vise versa

trail lion
loud solar
trail lion
#

There's tons of them out there. I must say that there's seemingly no substitute in xl for what tile resample did in 1.5.

loud solar
#

4xUltrasharp is pretty good, too.

#

There's the idea for an A.I. choosing fitting Loras for your prompts. I hope they will choose the right upscaler in future, too. To much cool stuff out there to know it all.

unkempt hatch
trail lion
#

I don't know the specific technical mechanics, but yes you need to download upscale models not already bundled by default.

loud solar
#

AFAIK Supir as a workflow combines both ...

trail lion
#

I tried installing that, it was too slow and resource intensive for my needs

#

I'll wait for a cloud implementation

loud solar
trail lion
#

If I had a 4090 maybe, but I just have a slow card with 16g vram

loud solar
#

Not sure whether I'd buy a 4090 nowaday ..

trail lion
#

I can't pay 1k for a video card, let alone double that... So I'll stay with what I have until I'm forced to do otherwise

loud solar
#

Yeah ... and maybe better wait for the 50xx series and buy a used one cheaper ... or wait for special A.I. Hardware ...

broken cave
#

i think it's pretty well documented no? i mean it's not in a "rentry"

#

fine tuning isn't scientifically robust right now. so it is all sort of heuristic

trail lion
#

Trial and error, lots of it

loud solar
#

Errors let you learn the most 😄

loud solar
#

It can change faster than youtube create new videos ...

trail lion
#

Someone always makes a video, but I would acknowledge there's lots of conflicting info out there. captioning techniques, using reg images or not, optimal settings for net dimensions, etc

loud solar
#

But you don't have to start at 0. When I started with A.I. it was like pushing a stick into a black hole ...

teal pagoda
#

some say captions are good, some say they're not

#

some say 0.0001 is a good LR, some not

#

:)))

trail lion
#

And that can vary between models btw, also

broken cave
teal pagoda
#

😦

#

but not even from civitai/google

broken cave
#

i will help you right now to do it

teal pagoda
#

are you a trainer or something?

trail lion
#

"I refuse to be rescued in such filth" -Princess Vespa

teal pagoda
broken cave
teal pagoda
#

what answer is this?

broken cave
loud solar
#

A good answer this is ... 🙂

broken cave
#

i guess, this is social media

#

so maybe answer the question

teal pagoda
#

I meant if you are a stable diffusion trainer

broken cave
#

yes

#

so what are you trying to fine tune? what is a concrete example?

teal pagoda
#

more like people fine-tuning

#

and the 2nd fine-tune for a style

broken cave
#

do you have any visual arts education?

#

like can you formally describe what a face looks like, as though it were a picture? or is it more of a, "i know when i see it" sort of thing for you?

#

and then, why do you want new faces? what is your goal? what are you trying to make?

#

you say faces, but do you really mean, faces and heads? what specifically?

#

@teal pagoda am i making sense?

teal pagoda
#

I just want to add more people into sd like from social medias and so on, so when prompting for like "a portrait of a woman", you won't get the generic face of a woman (which is the same in 90% of the checkpoints).

#

if you understand what I mean

broken cave
#

i wouldn't skip any of them

teal pagoda
teal pagoda
teal pagoda
trail lion
#

I would think the way you do that would be a large data set of people, like 1 or 2 thousand

broken cave
#

okay

broken cave
#

but why do you need the faces?

teal pagoda
#

:))))))

broken cave
#

okay lol

#

that's fine

teal pagoda
#

to make the SDXL 1.0 more "complex"

broken cave
#

so you don't need to fine tune

#

at all

#

have you tried clip vision?

teal pagoda
trail lion
#

Sd knows what a person is, a man, a woman, because of prior training.... You want something new, you have to put it in

broken cave
stark hornet
#

what sampler do you have to use with 1.5 models ?

teal pagoda
broken cave
#

focus on my questions

#

so think about it

#

you haven't tried clip vision

teal pagoda
#

nope

broken cave
#

clip vision is like prompting with images

#

from clip's point of view, an image and text are the same thing

#

does that make sense?

#

it's not "img2img"

stark hornet
#

@broken cave can you tell me what node to use in comfyUI to sample with a 1.5 model ?

#

KSampler doesnt seem to work

teal pagoda
#

meh, I give up, I'll remain at merging models and generating

broken cave
#

you can download the example workflow directly from comfyui

teal pagoda
#

wasted 8 hours today trying to "train"/combine 2 different faces or kinda with the same activation word and the results aren't really good

broken cave
#

okay

#

but why are you doing that?

teal pagoda
#

to test if this is possible at all with dreambooth

#

because it doesn't write anywhere

#

no one tried

broken cave
#

okay so i have posted you an examplke

#

of doing what you want to do

#

which doesn't have to use fine tuning

#

is this helpful?

#

you don't need to merge any models at all

#

you don't need to fine tune

#

does that make sense?

teal pagoda
broken cave
#

no!!

#

it's not instant id or ip adapter faceid

#

you don't need to use those

#

you don't want to

teal pagoda
#

yes, but it's similar

broken cave
#

no

#

it's not similar

teal pagoda
#

because I only use A1111

broken cave
#

they are totally different!

teal pagoda
#

is this clip vision in a1111 too?

broken cave
#

they don't produce similar results either

teal pagoda
#

never heard of it

broken cave
#

well

#

you're hearing about it now

stark hornet
#

you use an image as prompt instead of text, yes ?

broken cave
#

it is like adding an image as a word to your prompt

stark hornet
#

but what are you getting out of it ? like what will happen when you put a image of a dog and a cat ?

broken cave
#

i suppose you can go and try that lol

#

it would be completely different than what ip adapter does

stark hornet
#

how would i try ?

#

and how do you "steer" it ?

broken cave
#

it is like a word in the prompt. one way to use it is by pretending it is the first word in the prompt.

stark hornet
#

do you have an example json comfyui workflow that i could run ?

#

like i made this image today:

#

hmm,cant send it here 🙂

#

but would be curious what it'd do

broken cave
#

they are all here

stark hornet
#

ah, it's the stable-cascade thing

broken cave
#

there is a clip vision approach for any model that uses clip

#

@teal pagoda it sounds like what you want is something that will augment your prompts with a random mix from a collection of portraits. you can sort of do this as a workflow in comfyui. you do not need to fine tune anything

stark hornet
#

so it doesnt have to be stable cascade, no ?

#

Error occurred when executing CLIPVisionEncode:

'NoneType' object has no attribute 'encode_image'

#

File "D:\AI\ComfyUI_portable\ComfyUI\execution.py", line 151, in recursive_execute
output_data, output_ui = get_output_data(obj, input_data_all)
File "D:\AI\ComfyUI_portable\ComfyUI\execution.py", line 81, in get_output_data
return_values = map_node_over_list(obj, input_data_all, obj.FUNCTION, allow_interrupt=True)
File "D:\AI\ComfyUI_portable\ComfyUI\execution.py", line 74, in map_node_over_list
results.append(getattr(obj, func)(**slice_dict(input_data_all, i)))
File "D:\AI\ComfyUI_portable\ComfyUI\nodes.py", line 880, in encode
output = clip_vision.encode_image(image)

#

in CLIP Vision Encode

broken cave
broken cave
teal pagoda
#

thanks

stark hornet
#

anyone got an answer to my question by any chance?

broken cave
broken cave
stark hornet
#

what node do i use instead of KSampler to work with a 1.5 model in comfyUI

broken cave
stark hornet
#

i got everything running with 2.0 models

#

but not with a 1.5

broken cave
#

this is too vague

#

what is your actual issue?

stark hornet
#

maybe you have an idea

broken cave
#

that are superior

#

i don't know. it's all very relative

fallow veldt
#

Cascade is no more?

#

so when SD 3? I didn't receive any message for getting to test it

rich kestrel
#

I get the feeling ppl who are anti-A1111 are likely the least creative of the bunch

#

a bad artisan blames his tools

#

Funny that the comfy nutters are always the ones who come crawling back to A1111 cuz they cant even inpaint in their unnecessarily convoluted interface

drifting hearth
#

Dream

broken cave
loud solar
#

😄

dawn oak
#

ey i‘m an absolute noob in fooocus and just starting. I wanted to ask if it‘s possible to make nsfw content on fooocus ?

Upvote
25

Downvote

15
comments

Share
Share
Add a comment
Sort by:

Best

Search Comments
Expand comment search
u/Conscious_Lion_6825 avatar
Conscious_Lion_6825

4mo ago
More detailed reply. Go to Civit Ai website. Click on the model tab and then on the left side click filter and select sdxl 1.0. Now you have a list of all the models and loras suitable for fooocus. Inside the models page look at the description. It might say Lora or checkpoint. This is important. Click on download. When you file dled you gotta open your fooocus folder and find folder models. Now, drag your dled model or Lora into one of the 2 folders inside fooocus/models:

Checkpoint folder for any model that's said chekpoints on Civit Ai

loras for loras.

Here comes the interesting part, get to testing. When you launch fooocus web ui click advanced box and go to models. You will see drop down menus for loras and models. Fooocus can use only sdxl models so no sd or sd 1.5-2 are suitable.

The checkpoint models you've dled are the general models. The loras are something like add-ons.

The weight on loras matter a lot. Try going from 0.5-0.9.

In prompts you can write a prompt, select it and ctrl+ keybord arrow up and down to weight this particular word/phrase/etc more or less. Aka fooocus will consider this word or phrase more or important. Play with it.

#

Found this on reddit

#

What does it mean by "file dled"?

#

Like how do I get install this new model I downloaded and use it with foooocus

#

?

fallow veldt
shell tendon
shell tendon
#

i find comfyui to be MUCH better for creative work than a1111

tepid finch
#

hello, is sd 3 avail for comfyui?

rich kestrel
#

oh yeah I remember someone saying some shit about laying masks for context

shell tendon
#

you can't use differential diffusion for one, which is better than soft inpainting

rich kestrel
#

proof?

shell tendon
#

and at least as of a1111 v1.8, the tools within comfyui for editing masks were better

#

go use differential diffusion and you'll see what i mean

#

i know you're dead set on a1111 and i won't try to change your mind

#

this is more aimed at ppl on the fence that might be lurking

#

the idea that comfyui is extremely difficult to use, and tedious and only good for technical work, is 100% false

rich kestrel
#

im watching the vid on it, im not saying no

#

I do raise an eyebrow anytime someone says "better"

#

cuz a1111 inpainting already got hella buffed

shell tendon
#

plus, according to @neon oriole, a1111 is deprecated 🤣

#

yeah, i've tried soft inpainting in a1111

#

it's an improvement over the old limitations but it's not as seamless as differential diffusion inpainting

#

and you can use latent sender and latent receiver nodes to build a pretty simple workflow that allows iterative inpainting without any vae encoding steps

#

comfyui's mask editor is absolutely fine, and there's great nodes like groundingdino ones that can generate a lot of masks for you better than the yolo etc stuff, much better than clip, etc

#

the one big thing a1111 had that comfyui was lacking was the reference controlnet, but that's been added in the last month

#

well, one other thing too, not really big but i find mildly annoying, comfyui lacks support for lycoris-ia3

pearl ocean
#

New iPads!!!

fallow veldt
fallow veldt
shell tendon
#

what's creative? what's meaningful?

#

i could say the same about your stuff, lol

fallow veldt
shell tendon
#

exactly

fallow veldt
shell tendon
#

fact is, you can do wildly unusual shit with comfyui, it's extremely flexible, that is what opens the door to creativity

rich kestrel
#

i never shat on comfy

#

i shat on mfs saying to veer clear of a111

#

thts misinformation

#

I have comfy g

shell tendon
#

whether something is meaningful or whether you like someone's output doesn't say anything about the capacity of the tools

#

the ability to do insane shit you can't easily get from a prompt does(

#

that's the key

rich kestrel
#

a111 is still the jack of all trades. good ux. no bs. if ppl wanna use comfy go ahead. but dont take a sheeit on a111

shell tendon
#

a1111 is slow, vram hungry, and feature limited, and relatively inflexible

#

it's kinda like having a set of default comfyui workflows

#

if you're cool with just doing a couple types of things it's fine, but i'd still at least recommend forge cuz it's so much faster

shell tendon
fallow veldt
#

SD XL images of 832x1216 with DPM 3M SDE sampler created with 1.04s/it (RX 6700 12 GB VRAM gpu), it is running very smooth now, it is extremely optimized

honest mica
#

My respect to all the people who come in here and start with: "I am using the diffusers pipeline and this is my code..."

tight mason
#

are there ever too many steps?

pearl ocean
#

to think, the new iPads have amazing power, amazing new features and design.
But it doesn't do anything my M1 iPad Pro can't do.

shell tendon
#

Cascade, with stage b, you generally want exactly 7 steps

#

Pixart looks burned if you go too far with the step count

tight mason
narrow kernel
#

that being said, there is rarely ever a reason to go above 50 steps really(assuming you're using common sampler/schedulers like dpm++ 2m karras) for any model, not just pixart

#

and pixart seems to prefer euler normal from what i've seen

shell tendon
#

I've seen a couple cases where the sde ones were slightly better at 60 but I do mean the tiniest bit

#

Res_momentized is the only one I've seen dramatic improvements at 100 or even more steps

narrow kernel
astral goblet
#

i wonder if in the futuer we'll use gigasteps per second and no one will flinch at it. "you'll never need more than 64kb of ram" moment y'know?

shell tendon
#

absolutely love that sampler

narrow kernel
narrow kernel
astral goblet
narrow kernel
#

Nah not gears, just analog circuits

#

I think the tech in the fallout games is centered around the same concept where they never made a digital processor

#

And all computing is done via analog

#

Other sci-fi stories use it as well

shell tendon
#

like a TB303

astral goblet
narrow kernel
#

It's been an engineering meme since like the 40s or something. There's been a million different versions throughout the years

sudden ruin
sudden ruin
#

Wow More stuff to try with pixart sigma

shell tendon
#

it's crazy how no one seems to use res except me around here

#

but it's fn amazing

#

best at everything? nah, but really, really interesting and frequently gives the best outputs for whatever weird shit i'm usually working on

#

the supreme sampler is also outstanding for upsampling espec combined with aligned sampler and with both steps and substeps set at RES

#

i've got some nodes i added to the pack if you're into manipulating the sigmas

#

x**((x+1)*phi)*sigmax+y**((x+1)*phi)*sigmin

#

that's the formula you wanna use with res_momentumized more often than not

narrow kernel
# shell tendon it's crazy how no one seems to use res except me around here

I use it sometimes, it's just slow and tends to yield results with some minor margin of error vs other samplers. 90% of diffusion is just pecking through seeds and tweaking settings. Anything that speeds the trial and error process up, while producing reasonable results, is usually what most people gravitate toward

#

Which is why most people just stick with dpm++ 2m karras

#

For 20-50 step workflows, not talking about turbo/lightning workflows

#

Res is good for resampling though once you find what you like

sudden ruin
sudden ruin
narrow kernel
#

pixart uses much higher sigmas than models like sdxl. i think pixart uses like 140 or 180? sdxl uses ~14

#

so that equation probably scales really poorly when the numbers are so large at the start

honest mica
narrow kernel
#

or a polyexponent with a rho of ~.28-0.3. polyexponent is actually the closest to that equation from what I can see. anyways, they are close enough that you don't need to use that equation really. i mean obviously, you do you and all, these are just simple and close enough to save accidents and headaches

sudden ruin
sudden ruin
narrow kernel
sudden ruin
#

I really like the clyb 4m sde momentumized

#

Its the first that gives me decent anime results with it

#

As well as doing good photo stuff

sudden ruin
hasty ferry
#

hello all. New to discord, have a few questions about stable diffusion for Ollama.

narrow kernel
#

ollama is an llm server

hasty ferry
#

Correct but im running openwebui and stable diffusion is for the AUTOMATIC1111 image generation.

narrow kernel
#

yeah that combo works, i've used it. you can also interface with comfyui now

hasty ferry
#

I dont have a GPU and i have hunted the net for this runtime error on Ubuntu. I want to render using the CPU only as its still a test enviroment. The runtime error i keep getting -> runtimeerror: torch is not able to use gpu; add --skip-torch-cuda-test to commandline_args variable to disable this check

#

I cannot find a solution to stop the check for the GPU

narrow kernel
#

add "--use-cpu all --precision full --no-half --skip-torch-cuda-test" to the launch arguments

honest mica
narrow kernel
hasty ferry
#

I appreciate it, thank you. I will give it a try and see if it works. One last thing i see the issue seems to be from an updated released recently and affects all platforms. But there is so much information for windows and little for Linux, most of the information like i said targets windows directory locations and doesnt cover where in Linux "Mainly" Ubuntu the config file that needs editing . Any ideas?

narrow kernel
narrow kernel
hasty ferry
#

no worries, but i appreciate your help. Thanks again ill give this a shot.

soft iron
#

SD3 will never come out, yay 🥳

pearl ocean
soft iron
pearl ocean
soft iron
pearl ocean
shut snow
#

Guys need someone for help and who know the Naruto universe

#

Pm me pls

loud solar
#

Apple 😄

#

But hey ... it's less than half the price of my notebook ^^

soft iron
loud solar
#

Never 🙂

unkempt hatch
#

it seems to me SD cannot understand the concept "on"

#

for example, if I train someone "standing on a plate" and also "standing in front of a couch".

But then later I prompt "standing on a ouch".

I'll get him standing in front of the couch.

Maybe this isn't true of SD3 with the more transformers though?

soft iron
#

SD doesn't understand anything, he's not human 💀

teal pagoda
#

let's build our own SD3 :))))

#

to show stability ai that they are clowns 🤡

azure nebula
#

Is it possible to create sprite animation with consistent characters in stable diffusion?

#

Like for using as a character in 2D game 🎯🎮

soft iron
azure nebula
#

How about non pixelated

teal pagoda
#

like SD3

unkempt hatch
#

pony is just a well trained fine tune. Once the community fine-tunes SD3 it'll be incredible

unkempt hatch
#

compare SD1.5 base and SD3 base. Huge difference.

Now consider that SD1.5 finetunes are still kinda better than SD3

unkempt hatch
#

they should at least give a commitment, and give some training info so we can get ready to tune it

unkempt hatch
#

Stability should be talking to Kohya, One Trainer, and Think Diffusion; releasing the weights to them exclusively, letting them fine-tune it. They can still be making money as an online thing, but at least it'll get tuned a little

unkempt hatch
teal pagoda
#

imagine all the good devs uniting to make something awesome

#

like camenduru + illyasviel + lykon + mikubil + many more

teal pagoda
#

they will beat MidJourney for real if they'll do that

#

but that "union" will never happen anyway

#

because the "open-source" community is like a chaos

#

everyone for himself

unkempt hatch
#

just releasing the weights will already beat Midjourney

#

but ya, that collab would be insane

teal pagoda
#
  • cagliostro (linaqruf - the trainer of Animagine)
unkempt hatch
#

if just a few more would collab with ThinkDiffusion

teal pagoda
#

yea

#

and with bmaltais (from kohya ss)

unkempt hatch
#

why haven't the weights been released yet; is stability giving some kind of answer?

teal pagoda
#

I mean if that collaboration will ever be possible I'm really thinking about donating monthly to them

#

better to open-source than to MJ

unkempt hatch
#

if they release the weights we should all subscribe to support them

teal pagoda
#

they try to monetize SD3 through their joke API now

#

that's why they don't release them

#

they need money

#

they're bankrupt

unkempt hatch
#

is that what they're saying? Or just speculation

soft iron
unkempt hatch
abstract musk
teal pagoda
#

or wished to say

#

don't click

#

@sudden ruin or @bleak matrix, permaban this

soft iron
teal pagoda
#

just keep the cursor on that link to see another link :)))

#

"masked"

soft iron
sudden ruin
teal pagoda
#

until what?

#

:))

hollow frost
#

why is this so complicatedpepe_old

soft iron
hollow frost
#

stable diffusion...cant find a good nora model for anime

#

were the both of you planing to jump on me together?

charred mesa
#

no they just wanted to know your problem so they could help

fiery roost
#

Hai I have a question

hollow frost
charred mesa
#

ah 😌

fiery roost
#

How if my comfy UI have more than I work flow , then how I stop one work flow and only let the second work flow working

charred mesa
#

do you have comfyui manager?

fiery roost
#

Yes

charred mesa
#

use this plugin

fiery roost
#

How I use this to stop either one

charred mesa
#

oh you mean that

#

well my guess would be to just mute them or whatever its called

#

select all nodes you don't want to function

#

and then shift+b

#

so they become purple/red and then they will not be excecuted

fiery roost
#

Oh tQ

#

By the way shift b is not working for me I using Ctrl b

#

Thank

charred mesa
#

oh that then,

#

you are welcome

fiery roost
#

Any other else shortcut

charred mesa
#

hmm I don't know...

fiery roost
#

Ok fine hahahah that's okay

hollow frost
#

is there any beginners guide here?

sudden ruin
soft iron
#

then gradually you will understand how everything works

fiery roost
#

Is pony model any more improvement compare to sdxl

teal pagoda
fiery roost
#

I don't really get it , what that's mean

hollow frost
teal pagoda
#

yea

#

pony is more inclined to NSFW stuff

#

just to know

hollow frost
#

then i will go with it

#

thanks for the heads up

teal pagoda
#

:)))

hollow frost
#

but is it useful for anime or just western cartoons?

soft iron
#

best nsfw model now

teal pagoda
#

yea, look for fine-tuned models

#

for anime

#

western cartoons etc.

hollow frost
#

Pony Diffusion V5?

#

this one?

soft iron
#

immediately look at the prompts and copy

hollow frost
#

can a kind soul also recommend me LoRA models?

teal pagoda
#

what nora models?

#

:))

#

you mean LoRA?

hollow frost
#

fu-

#

sometimes even i'm astounded by my retartedness

soft iron
hollow frost
#

The pics are coming out like oil painted...

soft iron
#

sd1.5 only for sd1.5

#

sdxl only for sdxl

soft iron
hollow frost
fervent thunder
#

hi guys

#

guys any app similar to stable diffusion?

hollow frost
#

you really can to the discord server of stable diffusion to ask for the name of a same type of program?

shell tendon
echo peak
#

oh

raven agate
shell tendon
#

i think i prolly just approximated them and went back to the equation, admittedly may have been laziness

sudden ruin
shell tendon
#

what i've noticed that's so interesting to me with res is the effect on composition

shell tendon
#

with cascade (RIP), res was incredible with stage C, especially if you ran high step counts... 250, etc

shell tendon
#

interesting about clyb... man, there's just so many permutations of possibilities it's impossible to ever figure out the best approach haha

flint ingot
soft iron
#

no other model can do this

#

only with lora and additional plugins

flint ingot
#

never really had luck with loras. they'll be hit or miss. Having great success with ReActor though

#

so that solves faces I suppose. Just other details not so much

#

Though I notice Reactor doesnt like to use loras on the faces it does

#

Im still learning though

soft iron
#

find a model with faces that suit you and use facedetailer

#

the main model creates everything you need, and the face model draws the necessary faces

flint ingot
#

adetailer.. thats a new one.. and it fixes hands.. have to try it.. cuz my god do I get goood results.. but the hands are worse than a comic book artist

#

Let me tell you how inexperienced i am.. been using since last year? And Im only learning about xformers yesterday

soft iron
teal pagoda
teal pagoda
#

tried all the optimizers and they don't really decrease the quality

#

--opt-sdp-attention is the best xformers alternative

soft iron
soft iron
#

pony can be made to do any realism if there is a desire

#

the only thing pony can't do is make realistic faces

#

for this you just need to use a different model as I wrote earlier

teal pagoda
#

man, photorealism is not achieved with pony atm

flint ingot
#

cuz honestly CyberRealistic has been the best ive used thus far

#

sucks with hands.. and chance you get body amputation

teal pagoda
flint ingot
#

its a 1.5 I believe

teal pagoda
#

lightning is better

flint ingot
#

with an XL version

teal pagoda
#

yea

flint ingot
#

wait lightning vs xl?

teal pagoda
#

Lightning is a version of XL

flint ingot
#

Ahh okay

teal pagoda
#

which uses less steps

#

like 4-6 steps

#

and very good images

mossy shell
#

so suppose i have images of 2 characters that i generated, is there a way to put both of them together in the same environment?

teal pagoda
#

maybe with outpainting + controlnet

#

but these kinds of things take too much time

teal pagoda
#

to get a good result

#

many tries

flint ingot
#

I dont understand the difference in steps.

sudden ruin
gloomy bison
#

How do people train loras from like 100k images, do they seriously have a script that just grabs thousands of images

teal pagoda
#

and they caption them with LLMs

#

because no one in a lifetime can manually caption 100k images, let's be real here

#

let alone billions like the ones used by stability

gloomy bison
#

manually captioning 300 images took hours out of my life

surreal jay
#

which LLM is good for captioning? Can it be run with 16GB VRAM?

trail lion
#

most of my loras are based on like 30 images, so I dont bother with LLMs. having said that, I do bookmark them for the day when I decide to be that ambitious. I know peolpe use COGvlm, LLaVa, GPT4, I'm sure there are others, and there are various front ends for those on github

surreal jay
trail lion
#

I've played with LLMs a while back, well several months back, I got oobabooga or whatever going and I had an insta-local ai assistant, that was like a handicapped version of what chatgpt is. I played with it for a week or so, and moved on...didnt really have a use for it

surreal jay
#

how large was your model?

gloomy bison
#

how good at 30 image loras, i feel like they would be bad

full ingot
#

What is the best online service to run stable diffusion with an web interface? Automatic 1111 on colab Pro has long loading times...

trail lion
#

that's what dreambooth was literally made to be able to do, to train with a small number of images.

#

and of course lora, which isnt exactly dreambooth, but for the sake of this conversation accomplishes the same thing

ember citrus
#

Hey guys, tried posting on reddit, but I guess it won't let me. I was wondering my laptop can my laptop still use stable diffusion, even though it can handle top tier games such as COD? I've never used AI before and was really intrigued with the cool art that it can create. I wanted to try it but saw on starters guide that I should be using a GPU not an integrated one. I don't have the means to afford a PC right now. I did see there was the alternative to use A1111 services, but I don't really want to pay.

Processor AMD Ryzen 9 4900HS with Radeon Graphics 2060 3.00 GHz
Installed RAM 16.0 GB (15.4 GB usable)
System type 64-bit operating system, x64-based processor
(these are my laptops specs)

teal pagoda
loud solar
ember citrus
loud solar
teal pagoda
ember citrus
ember citrus
teal pagoda
teal pagoda
loud solar
teal pagoda
#

it's insanely slow while using CPU

#

I tried it myself

#

but you can try if you don't believe

#

and you won't be able to use hires. fix or controlnets

#

just to know

loud solar
#

Some told me they do well with the Hyper SD or Lightning or Tubo Models on CPU ...

teal pagoda
#

yes, for less steps, but no upscale :)))

#

so the quality will be bad anyway

ember citrus
loud solar
#

Yeah ... but maybe it's a good start to get an idea how interesting it is?

teal pagoda
#

Imagine that an upscale from 1024x1024 to 1536x1536 will take 30 minutes

#

minimum

#

only for a single image

#

these are all from my tryings

#

so I don't talk without knowing

teal pagoda
#

CPUs are just not there yet for AI

#

maybe in the future they'll implement some revolutionary technologies to compete with the GPUs

loud solar
#

A few days ago I would have been 100% with Bullseye ...

ember citrus
teal pagoda
#

Heard that Intel is preparing something

loud solar
ember citrus
loud solar
teal pagoda
#

All this AI thing is relying on CUDA technology for now

#

as far as I know

ember citrus
teal pagoda
#

that's why every single company like meta just buys A100s/H100s

surreal jay
teal pagoda
#

hundreds of thousands of them

loud solar
#

Notebooks often just share RAM ...

teal pagoda
ember citrus
teal pagoda
#

out of the box in the "utilities" tab

teal pagoda
#

it's what AI is accessing from a GPU

#

the GPUs with more CUDAs are better for AIs

ember citrus
ember citrus
loud solar
#

There are models trained to use a pretty few steps to calculate a picture ... normal use arounfd 30 ... they only use 1-10 ... so in case of emergency they might also run on CPU

teal pagoda
ember citrus
loud solar
ember citrus
loud solar
loud solar
teal pagoda
ember citrus
loud solar
#

And I have a 5000 € coupon for the ASUS store ... but at the moment ... nothing to buy for me ...

loud solar
loud solar
ember citrus
loud solar
#

AMD has often more VRAM ... but A.I. wasn't really made to work with them ...

ember citrus
teal pagoda
#

you have an insider

#

legit

loud solar
loud solar
tacit tiger
#

bro that's the best contest prize ever

#

I won art contest awhile ago an all I got was a frisbee an a notebook 😦

ember citrus
broken cave
#

what's going on with upscaling

#

why do people care so much

loud solar
ember citrus
loud solar
ember citrus
teal pagoda
#

Cloud GPUs services

ember citrus
loud solar
teal pagoda
#

yea, but not 2000$ for like a 4090

#

:))

#

you pay like 0.3$ per hour

loud solar
#

Bull has a good point ... if you don't want to care about software to much and buy hardware ... services might be a good idea ...

ember citrus
#

Hmm. I saw that they recommend Colab pro or A1111 service. I didn't look at the prices since I saw that I had to pay for it. Why would this be a good alternative? Could this be used on my laptop?

loud solar
ember citrus
loud solar
#

Technology is the same ... just the way is different ...

teal pagoda
#

of course they are good

#

there are some GPUs even better than 4090

#

on which you can rent for like 0.7$ per hour

#

with 40 GB of VRAM

loud solar
#

That's why I think starting low and looking for better solutions is the best idea ...

teal pagoda
#

yea

#

and even @loud solar with his consumer GPU won't be able to beat the servers'/datacenters' GPUs like A100

#

because he has a consumer GPU

#

it can't compete with the ones from the server racks

loud solar
#

And you won't need to beat me 🙂

teal pagoda
#

:))))))

#

really wish to see how fast is H100

#

but the price...

#

you can buy an apartment with these money

#

:)))))

ember citrus
#

Hmmm. to pay or not to pay. That is the question isn't it? From what you guys are telling me, it would be better to use paid services for now. Then when I get my PC build, then I can try transition into using my own build

loud solar
#

TBH my Computer is running in Energy Saving Modus most of the time ^^

teal pagoda
#

the electricity bill goes boom

trail lion
loud solar
teal pagoda
#

exactly

#

if you need it for more like "amateur" stuff

#

go with cloud GPUs

#

if you're a professional designer/artist, you can buy a GPU later

#

if you really wish to be "helped" by AI

ember citrus
#

Right... I'll have to consider my options. What would be considered "amateur". They all seem really good and decent. Well I'm no artist, but I just wanted to get into this for fun. Not quite like a hobby, just something to do on the side If I feel like it

trail lion
#

think I'd get a rx 7900 xtx if I bought right now

loud solar
trail lion
#

has the 24G vram, which I care about more than the raw compute, and about $500 cheaper than a 3090

loud solar
teal pagoda
trail lion
#

so nvidia obviously occupies all the top 10 spots in benchmarks, but ....you pay through the nose for it

teal pagoda
#

the AI art

trail lion
#

and like I said, I really just want more vram

ember citrus
teal pagoda
#

I can't believe this

#

I just tested Forge on a T4 (with 16 GB VRAM) with 6 controlnets activated at the same time with XL model and it works

#

:))))))))

#

6 XL controlnets

#

are you kidding me?

#

who said the cloud GPUs aren't good :)))

#

gonna try with hires. fix on

loud solar
#

We are at a point where everyone who is saying he is the god of A.I. and knows everything ... can only be pretty stupid 😄

trail lion
#

I use cloud exclusively for training right now, otherwise I'd have to give up my GPU for large portions of the day

loud solar
pearl ocean
loud solar
#

BTW where's my personal SD3 download link??

regal breach
regal breach
loud solar
#

😄

regal breach
#

Let him do something on his own

#

If you're gonna wipe his ass all time

#

He isn't gonna be able to do anything

#

That's not the point

#

No

soft iron
#

I want SD3 but it will never come out, I'm depressed agony

hot vortex
trail lion
#

well, not just that, your workflow will change, since now you will need to use the developer api

soft iron
#

You can already use SD3 locally

#

exists pixart

hot vortex
#

You can? I only watched one video about how to use it, but it wasn't locally.

#

it was through API key

trail lion
#

not sure about that pony thing, but I agree the community tends to augment the base model until eventually I dont ever load the base model, it's happened with each release so far

hot vortex
#

Does anyon know what do people use to remove background from a short video?

#

or any quick tutorial how to get started

trail lion
#

there's an rmbg extension I think

hot vortex
#

I think the most used term is AnimateDiff or what was it called

#

rmb is for pictures only I think

trail lion
#

or is it rembg, something like that, I'm not really a video guy

hot vortex
#

me neither

trail lion
#

yah but when you take a video and rip it to frames, gues what those are...pictures

hot vortex
#

ik

#

but I have to rip it somehow in comfyui, well, time to do some research

trail lion
#

I'd just use ffmpeg if it were me, but everyone has their own way

sudden ruin
gloomy bison
#

is 37 training images enough

#

for a small lora

rich kestrel
#

ya

#

even for big loras if the quality is good

pearl ocean
#

have you even met Lora?

azure nebula
#

gpus are so expensive

#

can someone recommend me a cheaper gpu for lora training?

#

or someone make me loras based on my art stuf

heavy lark
azure nebula
#

i see

#

yeah i got a 4 gb gpu on my laptop

#

i do see amd gpu are way cheaper for more GB too

#

man gpu costing like 500 USD+ agony

trail lion
normal kelp
#

Hey everyone! 🆘 I need help fine-tuning stable diffusion models for product enhancements. If anyone has experience, I’d greatly appreciate your input.

azure nebula
#

is 12 gigs of vram good enough to train or no

#

basically a set of 512x512 sprites for games

errant yacht
#

is SD3 slated for a public release like the other models?

polar skiff
#

hello

golden gale
#

Bots it's permanently down or will return? Who knows? is the any update?

opal spade
shell tendon
#

don't bother

#

get nvidia

azure nebula
#

Give me $$$ lol. Cloud is cheaper it seems

shell tendon
azure nebula
#

Making game assets

shell tendon
opal spade
#

RTX 4060 Ti 16gb looks to be a pretty good deal for hobbyist game design and such. Not sure what it's limitations would be besides just being a bit slower though.

narrow kernel
opal spade
shell tendon
#

slower

opal spade
#

Sweet! Thank you for the answer.

trim magnet
#

later

unkempt hatch
#

I wonder uif the new re-lighting could be used to fix the look of badly inpainted images

#

like, when you inpaint it gives this off-look because the lighting isn't entirely consistent

pearl ocean
#

SD3 RTX when

narrow kernel
# opal spade Where do they start showing their limits if you don't mind me asking

Yeah like clown said, it can limit you if you're into stuff like Lora training, but outside of that, 16gb is more than enough for regular workflow related stuff and has plenty of room to store whole models, plus things like controlnets and whatnot, without having to constantly shuffle models back and forth between the vram and ram. Oh and more room for larger images without running into vae decoding issues where you'd need to resort to tiles vae decoding

#

And if you're into llms, you can use higher quants and/or larger context windows without running out of vram

fervent thunder
#

guys is SD availabke on macmini?

#

or macbook pro?

storm shard
oblique elk
wise coral
# fervent thunder guys is SD availabke on macmini?

Yeah, draw things should work fine on a MacBook since it runs well on iOS devices. But tweaking the models might not be as convenient. If it's just for small images, no problem. But handling larger ones could be tricky due to heat.

storm shard
#

Generating big images is no problem.

#

It even works on my intel mac, slow... but it works haha

#

Btw if you're concerned about battery heat, you just use Low Power Mode which runs slower but keeps your device cool.

wise coral
storm shard
fervent thunder
#

is macmini more powerful than macboook pro ?

#

also i heard it has inbuilt fans right?

ripe bane
#

does anyone know how to get deforum to morph back into the INIT image by the end of the video?

storm shard
teal pagoda
#

SD3 tomorrow or good luck?

#

:))

trim magnet
#

srry guys it was actually may 10th 2029

#

did an oopsie

teal pagoda
#

🤡

golden gale
#

Bots it's permanently down or will return? Anyone knows? I'm asking because bots status it's from February. Any update? Thanks

violet turret
#

So, trying a tutorials to create a LORA of my face, getting ok results with SD 1.5 w/o regularization images, but poor results when retraining on SDXL. For just a face, are regularization images recommended when training a LORA?

fervent thunder
#

Sd3 impressions? So far ive seen better stuff of civitai

trim magnet
#

its a model

weak pebble
#

Anyone knows what I am supposed to do with a bin file? i downloaded ip adapter, added it into a1111, and i see no preprocessor show up, just under model

abstract quarry
#

you don't need reg images, it also works without

#

it can help to train text encoder, though. But it's difficult finding the sweet spot of text Encoder training where it does not overfit

narrow kernel
#

just use prodigy and be done with it

violet turret
abstract quarry
#

Nah, AdamW is totally finr

narrow kernel
#

Yeah prodigy really is a set it and forget it optimizer. It's hard to go wrong with it unless you really try. AdamW is still decent, you're just going to find yourself doing a lot of testing and a lot of following bad guides that worked for X person on Y thing and they act like they have some magic combination of settings and that it's applicable to all training sets.

violet turret
narrow kernel
#

All learning rates get set to 1, if you want to adjust things, you do it within the optimizer

#

but you'll have to google around for more information, i don't have a convenient list of the exact numbers for all the knobs

#

the problem with lora training is that there are so many different things that can speed up or slow down the learning rate, without actually touching learning rates

abstract quarry
#

AdamW has also just a single parameter (plus maybe warmup steps). Not that difficult

narrow kernel
#

like you might have some learning rate that will train in 2000 steps, but then if you change the dim/alpha from like 16/16 to 16/8, now it will take 2x longer, since the alpha acts as a scaler essentially. Same with batch size/accum, etc etc

#

other settings like dropout rates can slow things down a ton

abstract quarry
#

I have the feeling that Prodigy with default settings is always too strong. But in the end it depends on what you want to achieve

narrow kernel
#

just make sure to use the tensorboard and save every so many steps or epochs depending on how you pace it

#

if you let it spend too much time where it's basically flattened out to around 10%, it will overcook it. it really doesn't take all that long after it reaches that point either. remember, you can always lower lora weights in comfyui/a1111/etc or you can edit the lora after you're done to rescale them

#

training sdxl with text encoders is a pita and most people don't bother doing so

#

but anyways, google, youtube and trail+error are going to be your best friend. practice small before wasting a bunch of time

violet turret
narrow kernel
#

Watch your task managers GPU page and make sure it doesn't go into shared memory

#

If it does, you need to either disable text encoder training or lower the network dimension size

#

(usually)

#

Shared memory usage of even a hundred megabytes can make training turn to a crawl. Like it can slow it down by 10-100x

fervent thunder
fervent thunder
fervent thunder
#

to generate 1 image

#

on ur ipad

storm shard
pearl fog
# fervent thunder on ur ipad

I estimate it would probably be like ~20 seconds for a 512x512 image, and maybe like 2 minutes, 43 seconds for an SDXL image

sudden ruin
unkempt hatch
#

Stability should partner with Stock image companies

#

like, SD3 powers image variations. At the same time, SD3 is trained on the stock image datasets.

Even have something like, with a subscription fee, you can use the stock images for training.

#

like, a huge collection of human-tagged good quality images... how is that not the perfect marriage?

pearl fog
unkempt hatch
#

but I imagine their models must also be super good in that case, since they've got perfect datasets

fiery roost
#

My sd upscale for comfy UI

#

Should have a window to show me where is the process until right how I setting it

azure nebula
#

hey what is the best value cloud service to buy btw?

#

google collab?

#

I think i will just run SD on collab or something

sudden ruin
storm shard
sudden ruin
#

I will when im home, how fast is it on the m2 ipad pro

storm shard
#

896x1152 with TCD sampler and hyper lora 32 seconds @fervent thunder

#

but it depends on the amount of steps, (8 steps tcd, 20 steps euler a or dmp++ 2m karras)

#

121 sec for 1792 × 2304 8steps TCD

#

that's with an upscale script

#

just give it a try

valid cliff
#

Heu

#

How i create images

#

What si the promomt

fervent thunder
valid cliff
#

What si the promomt to geberate images

#

Please bro

fervent thunder
#

also 512 images work for thumbnails?

#

youtube thumbnails?

valid cliff
#

Bro

storm shard
fervent thunder
#

does it speed it up?

#

i m thinking of taking an ipad for SD coz how hands on it is with pencil and hands

#

i mean touchscreen

#

is 4060ti faster than this?

storm shard
fervent thunder
#

u have mac right?

#

M1?

storm shard
#

M2 ipad pro

fervent thunder
#

no no u said u also had a mac right

#

along with ipad

storm shard
#

i also have an imac, but that's intel based

fervent thunder
#

also how is ur expirience

#

with m2 ipad pro

#

with sd

#

is it buggy or crashy?

#

also have u tried editing with ipad

#

in da vinchi or final cut pro?

storm shard
#

No, it's not buggy at all, it works great and rwgular updates with new feautures

fervent thunder
#

ooh cool

#

also have u tried editing with ipad

#

in da vinchi

#

is it buggy or crashy

#

i heard lots of people say da vinchi is crashy or buggy in ipad

storm shard
#

The ipad is next to my imac, i'm using universal control so i can use my keyboard and mouse for Draw Things and the image genertion is while I'm doing other stuff on my imac

#

I haven't used davinci on my ipad yet, just on my mac

storm shard
#

only for masking or drawing

fervent thunder
#

how is the expirience

#

latency

storm shard
#

flawless

fervent thunder
#

really?

storm shard
#

yes

fervent thunder
#

masking and drawing huh pretty cool then

#

before i take m4 ipad pro i just wanted to know users expirience

#

how is ur expirience vs pc

#

which is imac

storm shard
#

you need to know that you're not able to use other apps whike generating images

fervent thunder
#

did u get any frustrations like ooh u cant do this in heree where as in my imac its so easy

fervent thunder
#

i wish they gave it mac already

storm shard
#

Myimc is my main computer, the ipad is for the creative stuff

fervent thunder
#

can u plz do me a favour

#

can u plz install da vinchi resolve on ipad try editing videos ansd share ur expirience of edting

#

çoz i m mainly taking ipad m4 for editng and SD

#

u can try editng 1 minute 1080p@60 fps video

#

coz i really want to know the da vinchi timeline expirience on ipad

#

or have u tried editing in final cut pro?

storm shard
static cape
#

Tomorrow we'll know if this was yet another week without news.

soft iron
#

Give me access to the site news and I will post the main news - SD3 will never come out coolcow

charred mesa
rich kestrel
#

Breaking news: OG diffusers give 0 cares about SD3 - "We are perfectly content with 1.5"

sudden ruin
#

Im more of a pixart/XL user smugshroom

rich kestrel
#

SD community is basically split into 3 groups

#

the OG 1.5 group = chad

sudden ruin
#

With a new category soon dogsmile

rich kestrel
#

the XL/2.1 group = meh

charred mesa
#

blud grouped 2.1 and XL together 💀

sudden ruin
#

Dont compare me with a 2.1 user bruh

rich kestrel
#

and the ones who ask every day when SD3 out = caveman skull emoji

#

no not 2.1

#

I mean pony

#

or whatever that thing is that is parallel to XL

charred mesa
#

yeah it was trained on top of XL

rich kestrel
#

I would also include cascade but that thing was the shortest lived fad I have ever seen in my life

charred mesa
#

epic announcement

#

anyhow

#

😴

#

🥱

potent spire
#

hmm discord bot SD

trim oriole
#

sadcat i want model weights

hazy cypress
#

I wanna use my 4090 I bought

charred mesa
#

heh

fervent thunder
#

nice announcement, not gonna spend money tho

charred mesa
#

you'll have to wait till the end of may tbh

trim oriole
#

yep..

fervent thunder
#

looks like SD3 wont be open source afterall

charred mesa
#

it will be

fervent thunder
#

time to crowdfund a model?

naive bear
#

They want to make SD closed and paid, when the whole appeal of SD is that it's open source, has all of these open source tools and the flexibility of running locally, and without all of that, they're just a worse alternative to other better paid AI services already out there and they'll just no longer have the same tools, so a lot less appeal than their older versions.

charred mesa
#

it's still training kek

#

alex (mcmonkey) said that the model is still undertrained

#

and they are trying to solve issues with the model

fervent thunder
#

all models are always undertrained

charred mesa
#

so it aint gonna be may 10th

charred mesa
frozen helm
#

Is artisan the new bot or what

soft iron
#

and I said

fervent thunder
#

its not free

soft iron
#

SD3 never release

potent spire
#

dunno

naive bear
#

I understand that, but this seems like it'd only put them further down the pit.

trim oriole
#

its frustrating because id like to support them but the only thing that is actualyl useful to me that they create is the model weights and if i cant run them locally and fine tune there is no advantage

gray fern
#

we want sd3 and got a discord bot instead 💀

potent spire
naive bear
#

A local tool that could be sold as software for a one time fee with user extensions would be way better than just resorting to doing a worse version than everyone else and stopping the thing that got you to where you are now.

potent spire
#

and for those that think crowdfunding would do the job, thats simply...naive

fervent thunder
#

it works

potent spire