#💬|general-chat
it matured nicely
sdxl is more refined tho like the lighting and composition is more "sophisticated" if that makes sense
cascade would be kick ass if people did something with it but whatever
lets wait for SD3 instead and complain even when it drops that it's not as good as promised XD
i mean its already out
u can test it. and it obviously isnt as good
its all a matter of how easily finetunable it is
I havent trained anything outside 1.5
sure 1.5, win95 is king too. trumpet winsock all the way
Photoshop is awesome
AI art can also be duplicated using GIMP photo editor manually
Creating art manually will take time obviously
How do I get started?
#🤝|tech-support pinned messages
ram really stops being useful after 32gb
vram on the other hand
games and AI are only going to get more powerful
Is anyone familiar with Ultimate SD Upscale? I'm trying to see if I can use it to resize and outpaint additional width to an image. I keep running out of memory when trying to use inpaint to outpaint since the image is 3456x5120, would like to resize to 4000x5120 and outpaint the new dimension
look into using tiling and tiled vae
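A rough sketch of the tiling idea, assuming a tile size of 1024 px and an overlap of 128 px (both values are illustrative, not taken from the chat). It only computes the tile rectangles that would cover the 4000x5120 target. It is not how Ultimate SD Upscale or tiled VAE are actually implemented, and `tile_starts` and `tile_boxes` are hypothetical helper names.

```python
from typing import List, Tuple

Box = Tuple[int, int, int, int]  # (left, top, right, bottom)


def tile_starts(length: int, tile: int, overlap: int) -> List[int]:
    """Start offsets so that tiles of size `tile` cover `length` pixels."""
    if length <= tile:
        return [0]
    stride = tile - overlap
    # Regular grid of starts, then one last tile pinned to the far edge.
    starts = list(range(0, length - tile, stride))
    starts.append(length - tile)
    return starts


def tile_boxes(width: int, height: int, tile: int = 1024, overlap: int = 128) -> List[Box]:
    """All tile rectangles, row by row, for an image of width x height."""
    boxes = []
    for top in tile_starts(height, tile, overlap):
        for left in tile_starts(width, tile, overlap):
            boxes.append((left, top, min(left + tile, width), min(top + tile, height)))
    return boxes


# Target size from the question above: 4000x5120.
boxes = tile_boxes(4000, 5120)
print(len(boxes), "tiles, first:", boxes[0], "last:", boxes[-1])
```

Each tile is processed on its own, so peak memory should track the tile size rather than the full canvas. The overlap is there so seams can be blended.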
exactly, like I said
money talks in this world, unfortunately
buy HDDs and store as many good models/ControlNets as you can :))
if you really know the settings, you can rent a good GPU for like 0.7$ per hour and train with it.
won't cost you very much in total
hi, everyone
How do I create an image, what do I have to write after # to create an image on this discord of stable diffusion, and also which chat do I have to go in to
Can someone tell me how to use the SD bot here?
What are you saying, Russian probably isn't allowed here
Anyhoo, hear me out. LLM trained almost exclusively on the entire bible.
That's an LLM that would be fun to talk to
hello where can i request stuff to be made ?
what u want?
i wanna make an image , like a simple cross in the middle of clouds
civitai has a free generator
Hello can anyone suggest same feature that Runway uses for Erase and replace (ai-tools/erase-and-replace) in Stable Diffusion sdxl? I have used inpainting but i cannot replicate the same through prompt which runway does.
where do i go to get it ?
ok thank you
you can't create images here
the bots are not working anymore
Hoi, do you guys know of an upscale model trained on enhancing blurry/low res images with barely readable text?
Ok thank you.
Is there a starting guide for noob how to use SD?
Thanks
Please don't let us dwindle away without news for another week...
Why not?
Hi
Can anyone help? We signed up for stability.ai membership but didn’t get anything? What should we do?
What are you expecting to get?
fwiw this is the community area, they have a contact link on their website
GM☀️😎
what is the going price for a single image that is very specific to be created. professional image, not adult.
what's the best upscaling model now?
I don't mean workflow, just a .pt file to put inside my upscale node. Is 4xultrasharp still the king?
best upscaling model for what? they are not all for the same purpose anyway, some are better at anime specifically, others for restoration, for photorealistic, etc
photorealistic people
Playing with RealESRGAN_x4plus.pth at the moment
what is the difference between r/StableDiffusion & "Green Check" Stable Diffusion server?
Well technically they are both under stability's control (if I recall correctly)
otherwise, I don't know of any difference beyond the fact that discord isn't reddit and vice versa
I'm a fan of lollipop for sdxl, and 4xUltrasharp for 1.5
Yes, you have to try to find your own solution ...
There's tons of them out there. I must say that there's seemingly no substitute in xl for what tile resample did in 1.5.
4xUltrasharp is pretty good, too.
There's the idea of an A.I. choosing fitting Loras for your prompts. I hope they will choose the right upscaler in the future, too. Too much cool stuff out there to know it all.
But the upscale model is totally separate, right? It's not working with any base model when upscaling
I don't know the specific technical mechanics, but yes you need to download upscale models not already bundled by default.
AFAIK Supir as a workflow combines both ...
I tried installing that, it was too slow and resource intensive for my needs
I'll wait for a cloud implementation
Pretty slow, hard to understand but worth the work sometimes. Someone showed me an easier workflow some days ago ...
If I had a 4090 maybe, but I just have a slow card with 16g vram
Not sure whether I'd buy a 4090 nowaday ..
I can't pay 1k for a video card, let alone double that... So I'll stay with what I have until I'm forced to do otherwise
Yeah ... and maybe better wait for the 50xx series and buy a used one cheaper ... or wait for special A.I. Hardware ...
i think it's pretty well documented no? i mean it's not in a "rentry"
fine tuning isn't scientifically robust right now. so it is all sort of heuristic
Trial and error, lots of it
Errors let you learn the most 😄
not well documented
It can change faster than youtube create new videos ...
Someone always makes a video, but I would acknowledge there's lots of conflicting info out there. captioning techniques, using reg images or not, optimal settings for net dimensions, etc
But you don't have to start at 0. When I started with A.I. it was like pushing a stick into a black hole ...
exactly
some say captions are good, some say they're not
some say 0.0001 is a good LR, some not
:)))
And that can vary between models btw, also
it's not something you can learn from youtube videos
kinda
😦
but not even from civitai/google
what do you want to fine tune? what's a concrete example?
i will help you right now to do it
are you a trainer or something?
"I refuse to be rescued in such filth" -Princess Vespa
are you a trainer?
🥺
what answer is this?
yes, i know a lot about this. but what you are really asking is if i am a social media content creator, which i am not
A good answer this is ... 🙂
No, man
I meant if you are a stable diffusion trainer
I wish to add more new faces to SD, because I'm tired of the AI generic ones
more like people fine-tuning
and the 2nd fine-tune for a style
do you have any visual arts education?
like can you formally describe what a face looks like, as though it were a picture? or is it more of a, "i know when i see it" sort of thing for you?
and then, why do you want new faces? what is your goal? what are you trying to make?
you say faces, but do you really mean, faces and heads? what specifically?
@teal pagoda am i making sense?
yes
I just want to add more people into sd like from social medias and so on, so when prompting for like "a portrait of a woman", you won't get the generic face of a woman (which is the same in 90% of the checkpoints).
if you understand what I mean
i am not trying to be rude, but i am trying to give you help, and you "just" gotta answer my questions
i wouldn't skip any of them
nope
I'm using captions for this
I already said. More "options" for faces
even the whole body shots
I would think the way you do that would be a large data set of people, like 1 or 2 thousand
okay
kinda
but why do you need the faces?
:))))))
to make the SDXL 1.0 more "complex"
nope, I captioned with Blip2 and wd
Sd knows what a person is, a man, a woman, because of prior training.... You want something new, you have to put it in
no. i'm saying you don't need to fine tune at all
what sampler do you have to use with 1.5 models ?
but what to do then?
nope
clip vision is like prompting with images
from clip's point of view, an image and text are the same thing
does that make sense?
it's not "img2img"
@broken cave can you tell me what node to use in comfyUI to sample with a 1.5 model ?
KSampler doesnt seem to work
more like IP Adapter?
meh, I give up, I'll remain at merging models and generating
no
you can download the example workflow directly from comfyui
wasted 8 hours today trying to "train"/combine 2 different faces or kinda with the same activation word and the results aren't really good
to test if this is possible at all with dreambooth
because it isn't written anywhere
no one tried
okay so i have posted you an example
of doing what you want to do
which doesn't have to use fine tuning
is this helpful?
you don't need to merge any models at all
you don't need to fine tune
does that make sense?
Yes, I understood, this can be done even with Instant ID or IP Adapter FaceID plus v2
no!!
it's not instant id or ip adapter faceid
you don't need to use those
you don't want to
yes, but it's similar
because I only use A1111
they are totally different!
is this clip vision in a1111 too?
they don't produce similar results either
never heard of it
you use an image as prompt instead of text, yes ?
it is like adding an image as a word to your prompt
but what are you getting out of it ? like what will happen when you put a image of a dog and a cat ?
i suppose you can go and try that lol
it would be completely different than what ip adapter does
it is like a word in the prompt. one way to use it is by pretending it is the first word in the prompt.
do you have an example json comfyui workflow that i could run ?
like i made this image today:
hmm,cant send it here 🙂
but would be curious what it'd do
ah, it's the stable-cascade thing
there is a clip vision approach for any model that uses clip
@teal pagoda it sounds like what you want is something that will augment your prompts with a random mix from a collection of portraits. you can sort of do this as a workflow in comfyui. you do not need to fine tune anything
so it doesnt have to be stable cascade, no ?
Error occurred when executing CLIPVisionEncode:
'NoneType' object has no attribute 'encode_image'
File "D:\AI\ComfyUI_portable\ComfyUI\execution.py", line 151, in recursive_execute
output_data, output_ui = get_output_data(obj, input_data_all)
File "D:\AI\ComfyUI_portable\ComfyUI\execution.py", line 81, in get_output_data
return_values = map_node_over_list(obj, input_data_all, obj.FUNCTION, allow_interrupt=True)
File "D:\AI\ComfyUI_portable\ComfyUI\execution.py", line 74, in map_node_over_list
results.append(getattr(obj, func)(**slice_dict(input_data_all, i)))
File "D:\AI\ComfyUI_portable\ComfyUI\nodes.py", line 880, in encode
output = clip_vision.encode_image(image)
in CLIP Vision Encode
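For context on the traceback above: `'NoneType' object has no attribute 'encode_image'` is the message Python gives when a method is called on `None`. So the `clip_vision` input that reached the node was `None`, which usually points to a CLIP vision model that was not loaded or not connected (an assumption, the log does not say why). A minimal reproduction, where `encode` is a hypothetical stand-in that only mirrors the failing line and is not ComfyUI's code:

```python
def encode(clip_vision, image):
    # Mirrors the failing call in the traceback: clip_vision.encode_image(image)
    return clip_vision.encode_image(image)


try:
    # Passing None simulates a CLIP vision input that was never loaded or wired up.
    encode(None, "portrait.png")
except AttributeError as exc:
    message = str(exc)
    print(message)
```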
no it doesn't
so was this helpful?
anyone got an answer to my question by any chance?
which question
the most important thing you can do is stop using A1111
what node do i use instead of KSampler to work with a 1.5 model in comfyUI
KSampler is part of the correct workflow for SD1.5. try using one of the comfyui example workflows.
i could show you the workflow in #🏞|general-with-images and the error, too
maybe you have an idea
good joke
well that's how you get better at this, and do things simpler
that are superior
i don't know. it's all very relative
Cascade is no more?
so when SD 3? I didn't receive any message for getting to test it
I get the feeling ppl who are anti-A1111 are likely the least creative of the bunch
a bad artisan blames his tools
Funny that the comfy nutters are always the ones who come crawling back to A1111 cuz they cant even inpaint in their unnecessarily convoluted interface
Dream
it's pretty hard to generalize about anyone's creativity. if you feel strongly about it you should teach art, if you could find creative people you could become a very successful curator
😄
ey i‘m an absolute noob in fooocus and just starting. I wanted to ask if it‘s possible to make nsfw content on fooocus ?
Conscious_Lion_6825 • 4mo ago
More detailed reply. Go to Civit Ai website. Click on the model tab and then on the left side click filter and select sdxl 1.0. Now you have a list of all the models and loras suitable for fooocus. Inside the models page look at the description. It might say Lora or checkpoint. This is important. Click on download. When you file dled you gotta open your fooocus folder and find folder models. Now, drag your dled model or Lora into one of the 2 folders inside fooocus/models:
Checkpoint folder for any model that says checkpoint on Civit Ai
loras for loras.
Here comes the interesting part, get to testing. When you launch fooocus web ui click advanced box and go to models. You will see drop down menus for loras and models. Fooocus can use only sdxl models so no sd or sd 1.5-2 are suitable.
The checkpoint models you've dled are the general models. The loras are something like add-ons.
The weight on loras matter a lot. Try going from 0.5-0.9.
In prompts you can write a prompt, select it and ctrl+ keyboard arrow up and down to weight this particular word/phrase/etc more or less. Aka fooocus will consider this word or phrase more or less important. Play with it.
Found this on reddit
What does it mean by "file dled"?
Like how do I install this new model I downloaded and use it with fooocus
?
Because I use Automatic1111 and dislike Comfy I would like to agree, but a good friend of mine works pretty much in Comfy and it is very creative, I would say one of the most creative AI artists I've seen. He inpaints in Krita.
my friend https://twitter.com/UnoParticular
inpainting in comfyui is better than in a1111
yeah search for my username plus "has:image" my shit is wildly creative and i pretty much exclusively use comfyui now
i find comfyui to be MUCH better for creative work than a1111
hello, is sd 3 avail for comfyui?
prove it
oh yeah I remember someone saying some shit about laying masks for context
you can't use differential diffusion for one, which is better than soft inpainting
proof?
and at least as of a1111 v1.8, the tools within comfyui for editing masks were better
go use differential diffusion and you'll see what i mean
i know you're dead set on a1111 and i won't try to change your mind
this is more aimed at ppl on the fence that might be lurking
the idea that comfyui is extremely difficult to use, and tedious and only good for technical work, is 100% false
im watching the vid on it, im not saying no
I do raise an eyebrow anytime someone says "better"
cuz a1111 inpainting already got hella buffed
plus, according to @neon oriole, a1111 is deprecated 🤣
yeah, i've tried soft inpainting in a1111
it's an improvement over the old limitations but it's not as seamless as differential diffusion inpainting
and you can use latent sender and latent receiver nodes to build a pretty simple workflow that allows iterative inpainting without any vae encoding steps
comfyui's mask editor is absolutely fine, and there's great nodes like groundingdino ones that can generate a lot of masks for you better than the yolo etc stuff, much better than clip, etc
the one big thing a1111 had that comfyui was lacking was the reference controlnet, but that's been added in the last month
well, one other thing too, not really big but i find mildly annoying, comfyui lacks support for lycoris-ia3
Some images I like, some are pretty good, I found many to be more far-fetched images than creative or meaningful ones. Aside from the ones I like, most I find to be generic glossy AI "hi-quality" imgs which I don't like, I always try to get another style. If it serves you.
Nah that process is totally hell. Automatic is perfect for the task.
yes totally subjective, but well
exactly
yes you can, I don't mind
fact is, you can do wildly unusual shit with comfyui, it's extremely flexible, that is what opens the door to creativity
i never shat on comfy
i shat on mfs saying to steer clear of a1111
thts misinformation
I have comfy g
whether something is meaningful or whether you like someone's output doesn't say anything about the capacity of the tools
the ability to do insane shit you can't easily get from a prompt does
that's the key
a1111 is still the jack of all trades. good ux. no bs. if ppl wanna use comfy go ahead. but dont take a sheeit on a1111
a1111 is slow, vram hungry, and feature limited, and relatively inflexible
it's kinda like having a set of default comfyui workflows
if you're cool with just doing a couple types of things it's fine, but i'd still at least recommend forge cuz it's so much faster
Not in getting updates though 
yeah been two months...
In my case my computer always crashed and stuttered while Comfy was working, whereas I run Automatic1111 on Linux and it runs really smooth: since the last good installation, no crashes and just a little mouse stuttering here and there.
SDXL images of 832x1216 with the DPM++ 3M SDE sampler are created at 1.04s/it (RX 6700 12 GB VRAM gpu), it is running very smooth now, it is extremely optimized
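To put the 1.04 s/it figure in perspective, sampling time is roughly seconds per iteration times the step count. The sketch below assumes 30 steps, which is not stated in the message, and ignores model loading and VAE decode time.

```python
def seconds_per_image(seconds_per_iteration: float, steps: int) -> float:
    """Rough sampling time: one iteration per step, nothing else counted."""
    return seconds_per_iteration * steps


# 1.04 s/it comes from the message above; 30 steps is an assumed value.
estimate = seconds_per_image(1.04, 30)
print(f"about {estimate:.0f} seconds of sampling per image")
```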
My respect to all the people who come in here and start with: "I am using the diffusers pipeline and this is my code..."
are there ever too many steps?
to think, the new iPads have amazing power, amazing new features and design.
But it doesn't do anything my M1 iPad Pro can't do.
Sometimes. Usually it's just a waste of time
Cascade, with stage b, you generally want exactly 7 steps
Pixart looks burned if you go too far with the step count
ok this is very helpful thank you
from my limited testing on the matter, i think it has more to do with the sampler/scheduler and cfg; when it comes to pixart
that being said, there is rarely ever a reason to go above 50 steps really(assuming you're using common sampler/schedulers like dpm++ 2m karras) for any model, not just pixart
and pixart seems to prefer euler normal from what i've seen
Yeah def the case with the standard samplers
I've seen a couple cases where the sde ones were slightly better at 60 but I do mean the tiniest bit
Res_momentumized is the only one I've seen dramatic improvements at 100 or even more steps
Well that one does a lot of things different than regular samplers under the hood and even does the whole half step thing if you watch the command line window
i wonder if in the future we'll use gigasteps per second and no one will flinch at it. "you'll never need more than 64kb of ram" moment y'know?
yeah that is NOT even remotely near a normal sampler haha
absolutely love that sampler
There are also ways to do everything using analog computing and have infinite step resolution, but I won't torture people with engineering turbo encabulator jargon
Yeah it's a neat one for sure
thats a neato concept. weights encoded into gears
Nah not gears, just analog circuits
I think the tech in the fallout games is centered around the same concept where they never made a digital processor
And all computing is done via analog
Other sci-fi stories use it as well
like a TB303
turbo encabulators is a good throwback
It's been an engineering meme since like the 40s or something. There's been a million different versions throughout the years
Havent seen that before? Is that a custom node or is it in comfy already? 
More stuff to try with pixart sigma
it's crazy how no one seems to use res except me around here
but it's fn amazing
best at everything? nah, but really, really interesting and frequently gives the best outputs for whatever weird shit i'm usually working on
the supreme sampler is also outstanding for upsampling espec combined with aligned sampler and with both steps and substeps set at RES
https://github.com/Extraltodeus/sigmas_tools_and_the_golden_scheduler you also really want this
i've got some nodes i added to the pack if you're into manipulating the sigmas
x**((x+1)*phi)*sigmax+y**((x+1)*phi)*sigmin
that's the formula you wanna use with res_momentumized more often than not
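A sketch of what that formula could produce as a sigma schedule. The chat does not say what each variable means, so this assumes `x` runs linearly from 1 down to 0 over the steps, `y = 1 - x`, and `phi` is the golden ratio. `sigmax = 14.6` and `sigmin = 0.03` are illustrative values, and `formula_sigmas` is a hypothetical name, not a function from the linked node pack.

```python
import math

PHI = (1 + math.sqrt(5)) / 2  # golden ratio (assumed meaning of "phi")


def formula_sigmas(steps: int, sigmax: float, sigmin: float) -> list:
    """Evaluate x**((x+1)*phi)*sigmax + y**((x+1)*phi)*sigmin at each step."""
    sigmas = []
    for i in range(steps):
        x = 1 - i / (steps - 1)  # assumed: 1 at the first step, 0 at the last
        y = 1 - x                # assumed: complement of x
        exponent = (x + 1) * PHI
        sigmas.append(x ** exponent * sigmax + y ** exponent * sigmin)
    return sigmas


sigmas = formula_sigmas(40, sigmax=14.6, sigmin=0.03)
print([round(s, 3) for s in sigmas[:3]], "...", [round(s, 3) for s in sigmas[-3:]])
```

Under these assumptions the schedule starts at `sigmax` and ends at `sigmin`, because the `x` term vanishes at the last step and the `y` term vanishes at the first.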
I use it sometimes, it's just slow and tends to yield results with some minor margin of error vs other samplers. 90% of diffusion is just pecking through seeds and tweaking settings. Anything that speeds the trial and error process up, while producing reasonable results, is usually what most people gravitate toward
Which is why most people just stick with dpm++ 2m karras
For 20-50 step workflows, not talking about turbo/lightning workflows
Res is good for resampling though once you find what you like
Its difficult to compare those samplers to the others cause the results differ a lot
That gives me really bad results with pixart at least
pixart uses much higher sigmas than models like sdxl. i think pixart uses like 140 or 180? sdxl uses ~14
so that equation probably scales really poorly when the numbers are so large at the start
How many steps did you use?
Oh and just as an fyi, if you use the karras scheduler node, you can get roughly the same shaped graph just by taking rho down to like ~2.8, instead of the default 7 that it normally has
or a polyexponent with a rho of ~.28-0.3. polyexponent is actually the closest to that equation from what I can see. anyways, they are close enough that you don't need to use that equation really. i mean obviously, you do you and all, these are just simple and close enough to save accidents and headaches
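The effect of lowering rho can be checked with the standard Karras schedule formula. This is a plain-Python sketch, not the scheduler node's own code. `sigmax = 14.6` and `sigmin = 0.03` are assumed illustrative values, and the sketch leaves out the trailing zero that some implementations append.

```python
def karras_sigmas(steps: int, sigmax: float, sigmin: float, rho: float = 7.0) -> list:
    """Karras et al. (2022) schedule: interpolate linearly in sigma**(1/rho) space."""
    max_inv = sigmax ** (1 / rho)
    min_inv = sigmin ** (1 / rho)
    return [
        (max_inv + i / (steps - 1) * (min_inv - max_inv)) ** rho
        for i in range(steps)
    ]


default_rho = karras_sigmas(40, sigmax=14.6, sigmin=0.03, rho=7.0)
low_rho = karras_sigmas(40, sigmax=14.6, sigmin=0.03, rho=2.8)

# Same endpoints, but the lower rho keeps sigma higher through the middle steps.
print(round(default_rho[20], 3), round(low_rho[20], 3))
```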
40 Steps right now
Yeah I know, im still trying to figure out the best setup for pixart
yeah i'm still trying to figure it all out as well
I really like the clyb 4m sde momentumized
Its the first that gives me decent anime results with it
As well as doing good photo stuff
Yep
I can share my current setup if you need inspiration 
hello all. New to discord, have a few questions about stable diffusion for Ollama.
ollama is an llm server
Correct but im running openwebui and stable diffusion is for the AUTOMATIC1111 image generation.
yeah that combo works, i've used it. you can also interface with comfyui now
I dont have a GPU and i have hunted the net for this runtime error on Ubuntu. I want to render using the CPU only as its still a test environment. The runtime error i keep getting -> runtimeerror: torch is not able to use gpu; add --skip-torch-cuda-test to commandline_args variable to disable this check
I cannot find a solution to stop the check for the GPU
add "--use-cpu all --precision full --no-half --skip-torch-cuda-test" to the launch arguments
That's the most watching paint dry - version of launch arguments if i have ever seen one.
yeah, but he said he wants to use it in CPU only mode
I appreciate it, thank you. I will give it a try and see if it works. One last thing: i see the issue seems to come from an update released recently and affects all platforms. But there is so much information for windows and little for Linux, most of it targets windows directory locations and doesnt cover where on Linux (mainly Ubuntu) the config file that needs editing lives. Any ideas?
oh and as far as i know, cpu modes are usually picky about precisions and don't tend to like doing half precision stuff and whatnot
no clue, i dont use linux. last time i used linux was like 10 years ago for a class in college.
no worries, but i appreciate your help. Thanks again ill give this a shot.
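On the Linux question: the error message itself names the `COMMANDLINE_ARGS` variable. In a standard AUTOMATIC1111 install on Linux that variable is normally set in `webui-user.sh` in the install's root folder (assumed here, since the chat does not confirm the setup). The sketch below only builds the line to put there from the flags suggested above. `commandline_args_line` is a hypothetical helper name.

```python
# Flags exactly as suggested in the chat for CPU-only rendering.
CPU_ONLY_FLAGS = [
    "--use-cpu", "all",
    "--precision", "full",
    "--no-half",
    "--skip-torch-cuda-test",
]


def commandline_args_line(flags: list) -> str:
    """Build the shell line that sets COMMANDLINE_ARGS for the launcher."""
    return 'export COMMANDLINE_ARGS="' + " ".join(flags) + '"'


line = commandline_args_line(CPU_ONLY_FLAGS)
print(line)  # paste into webui-user.sh (assumed location), then relaunch
```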
SD3 will never come out, yay 🥳
New iPads came out though
The mobile phone industry is cool, in 10 years there has been exactly 0 development
The iPad Pro is the same price as a MacBook
no matter how much it costs, it always deserves its price since it's Apple 
just buy and don't ask any questions 
Never 🙂
it seems to me SD cannot understand the concept "on"
for example, if I train someone "standing on a plate" and also "standing in front of a couch".
But then later I prompt "standing on a couch".
I'll get him standing in front of the couch.
Maybe this isn't true of SD3 with the more transformers though?
SD doesn't understand anything, he's not human 💀
Is it possible to create sprite animation with consistent characters in stable diffusion?
Like for using as a character in 2D game 🎯🎮
There is already such a person, the creator of pony model.
How about non pixelated
yea, but I don't think that pony is really that much of an "advancement"
like SD3
pony is just a well trained fine tune. Once the community fine-tunes SD3 it'll be incredible
:)))
if it'll ever launch
compare SD1.5 base and SD3 base. Huge difference.
Now consider that SD1.5 finetunes are still kinda better than SD3
ya, it's infuriatingly slow....
they should at least give a commitment, and give some training info so we can get ready to tune it
but it will never come out 
Stability should be talking to Kohya, One Trainer, and Think Diffusion; releasing the weights to them exclusively, letting them fine-tune it. They can still be making money as an online thing, but at least it'll get tuned a little
if that's true there's really no point to it... it'll just be a worse version of Midjourney
imagine all the good devs uniting to make something awesome
like camenduru + illyasviel + lykon + mikubil + many more
yes, it's sora
they will beat MidJourney for real if they'll do that
but that "union" will never happen anyway
because the "open-source" community is like a chaos
everyone for himself
just releasing the weights will already beat Midjourney
but ya, that collab would be insane
- cagliostro (linaqruf - the trainer of Animagine)
if just a few more would collab with ThinkDiffusion
why haven't the weights been released yet; is stability giving some kind of answer?
I mean if that collaboration will ever be possible I'm really thinking about donating monthly to them
better to open-source than to MJ
if they release the weights we should all subscribe to support them
they try to monetize SD3 through their joke API now
that's why they don't release them
they need money
they're bankrupt
is that what they're saying? Or just speculation
all they say is "will be out in 2 weeks" every time
that's why they should release to some of the paid online devs at least. Like, ThinkDiffusion trains their own fine-tunes and charges for an online service. Stability should make a deal with them
We would love to talk about it, If Stability AI has any plans for collaboration to make Stable Diffusion better. 😁
that's from what emad said
or wished to say
don't click
@sudden ruin or @bleak matrix, permaban this
but i want
this is a link to Rick Astley
Appreciate the heads up, btw you can also right click a message and click apps and then report to staff
Alright
Thanks!
why is this so complicated
what's difficult?
what?
stable diffusion...cant find a good lora model for anime
were the both of you planing to jump on me together?
no they just wanted to know your problem so they could help
Hai I have a question
ik i was just joking
ah 😌
What if my ComfyUI has more than 1 workflow, how do I stop one workflow and only let the second workflow run
do you have comfyui manager?
Yes
How do I use this to stop either one
oh you mean that
well my guess would be to just mute them or whatever its called
select all nodes you don't want to function
and then shift+b
so they become purple/red and then they will not be executed
Any other shortcut?
Ok fine hahahah that's okay
is there any beginners guide here?
Havent seen this before, looks awesome 
all you need to get started is download the automatic, download pony, copy other people's prompts and see what results it will lead to
then gradually you will understand how everything works
Is the pony model any improvement compared to sdxl
don't go with pony if you're in the 100% SFW area
I don't really get it, what does that mean
you mean no horny?
:)))
but is it useful for anime or just western cartoons?
useful for all
best nsfw model now
can a kind soul also recommend me LoRA models?
download them, put them in the lora folder, select them in webui
The pics are coming out like oil painted...
copy completely someone else's prompt
alright
you really came to the discord server of stable diffusion to ask for the name of the same type of program?
Yeah it is pretty close... I have tried vs Karras and i swear I remember something wasn't quite right with the result but I could be hallucinating that
oh
I put them in groups and set up the fast group bypass node. Then you just turn on and off the ones you want to use and don't want to use.
i don't think i ever tried those exact values though
i think i prolly just approximated them and went back to the equation, admittedly may have been laziness
Got my ipad pro refurbished for like half the price
what i've noticed that's so interesting to me with res is the effect on composition
with cascade (RIP), res was incredible with stage C, especially if you ran high step counts... 250, etc
i've found pixart tends to burn pretty bad with res_momentumized, especially as the step count goes up
interesting about clyb... man, there's just so many permutations of possibilities it's impossible to ever figure out the best approach haha
better than cyberrealistic? Thats what ive been using and dang is it good
pony can do hard poses and hard compositions with people
no other model can do this
only with lora and additional plugins
never really had luck with loras. they'll be hit or miss. Having great success with ReActor though
so that solves faces I suppose. Just other details not so much
Though I notice Reactor doesnt like to use loras on the faces it does
Im still learning though
faces are the easiest thing
find a model with faces that suit you and use facedetailer
the main model creates everything you need, and the face model draws the necessary faces
adetailer.. thats a new one.. and it fixes hands.. have to try it.. cuz my god do I get goood results.. but the hands are worse than a comic book artist
Let me tell you how inexperienced i am.. been using since last year? And Im only learning about xformers yesterday
I turn them off, why would I spoil the picture with 1% acceleration?
but pony is not really realistic anyway
nah, the quality isn't decreasing with xformers
tried all the optimizers and they don't really decrease the quality
--opt-sdp-attention is the best xformers alternative
lol?
pony can be made to do any realism if there is a desire
the only thing pony can't do is make realistic faces
for this you just need to use a different model as I wrote earlier
man, photorealism is not achieved with pony atm
what do you recommend
cuz honestly CyberRealistic has been the best ive used thus far
sucks with hands.. and chance you get body amputation
for photorealism, use a SDXL lightning model or a realistic 1.5 model
its a 1.5 I believe
lightning is better
with an XL version
yea
wait lightning vs xl?
Lightning is a version of XL
Ahh okay
so suppose i have images of 2 characters that i generated, is there a way to put both of them together in the same environment?
.
to get a good result
many tries
I dont understand the difference in steps.
Yeah it's a frkn lottery running this model
How do people train loras from like 100k images, do they seriously have a script that just grabs thousands of images
yea, or bulk downloaders
and they caption them with LLMs
because no one in a lifetime can manually caption 100k images, let's be real here
let alone billions like the ones used by stability
manually captioning 300 images took hours out of my life
which LLM is good for captioning? Can it be run with 16GB VRAM?
most of my loras are based on like 30 images, so I dont bother with LLMs. having said that, I do bookmark them for the day when I decide to be that ambitious. I know people use CogVLM, LLaVA, GPT4, I'm sure there are others, and there are various front ends for those on github
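A minimal sketch of what such a bulk captioning script could look like, assuming the common convention of one `.txt` caption file next to each image with the same base name. `write_captions` and `placeholder_captioner` are hypothetical names. The placeholder stands in for whatever model actually does the captioning (BLIP2, a WD tagger, CogVLM, LLaVA and so on), and none of their real APIs are shown here.

```python
from pathlib import Path
from typing import Callable

IMAGE_SUFFIXES = {".png", ".jpg", ".jpeg", ".webp"}


def write_captions(folder: Path, captioner: Callable[[Path], str], overwrite: bool = False) -> int:
    """Write one sidecar .txt caption per image in `folder`; return how many were written."""
    written = 0
    # sorted() takes a snapshot of the folder before any caption files are added.
    for image_path in sorted(folder.iterdir()):
        if image_path.suffix.lower() not in IMAGE_SUFFIXES:
            continue
        caption_path = image_path.with_suffix(".txt")
        if caption_path.exists() and not overwrite:
            continue  # keep captions that were already written or hand-edited
        caption_path.write_text(captioner(image_path), encoding="utf-8")
        written += 1
    return written


def placeholder_captioner(image_path: Path) -> str:
    # Hypothetical stand-in: a real script would run a captioning model here.
    return f"photo, {image_path.stem}"
```

Calling `write_captions(Path("dataset"), placeholder_captioner)` on a folder of images would leave `name.txt` beside each `name.png`. Existing captions are skipped by default, so manual fixes survive a rerun.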
yeah I'll probably just start with 30 lol, don't know the first thing about running an LLM anyway
I've played with LLMs a while back, well several months back, I got oobabooga or whatever going and I had an insta-local ai assistant, that was like a handicapped version of what chatgpt is. I played with it for a week or so, and moved on...didnt really have a use for it
how large was your model?
how good are 30 image loras, i feel like they would be bad
What is the best online service to run stable diffusion with an web interface? Automatic 1111 on colab Pro has long loading times...
they suit my needs, but I guess it depends on what you're doing, a likeness? a pose? some other kind of concept? a complex concept may in fact need more images, but for most of what I'm doing...I've gotten by at times with like 20 images even and it's fine
that's what dreambooth was literally made to be able to do, to train with a small number of images.
and of course lora, which isnt exactly dreambooth, but for the sake of this conversation accomplishes the same thing
Hey guys, tried posting on reddit, but I guess it won't let me. I was wondering, can my laptop run stable diffusion, given that it can handle top tier games such as COD? I've never used AI before and was really intrigued with the cool art that it can create. I wanted to try it but saw in the starter guide that I should be using a dedicated GPU, not an integrated one. I don't have the means to afford a PC right now. I did see there was the alternative to use A1111 services, but I don't really want to pay.
Processor AMD Ryzen 9 4900HS with Radeon Graphics 2060 3.00 GHz
Installed RAM 16.0 GB (15.4 GB usable)
System type 64-bit operating system, x64-based processor
(these are my laptops specs)
guess the best one for captioning is gpt4, but it's not free. The free alternative would be kosmos2, wdtagger etc..
SD Forge (or ComfyUI) with special Models to generate with a few steps using CPU maybe ...
Interesting. I'll give this method a go then. Thanks!
Not my way cause I have the hardware. But could work for your needs.
nope, you have no dedicated GPU
Right. I'm in the slow process of buying parts here and there, but didn't want to get NVIDIA cards. I saw best buy has AMD and thinking of utilizing the AMD Radeon RX6700 XT. Would this still work?
Dang. that sucks. Why is that? Do AI models or program not run as smooth?
You can contact me in private for the best alternative to colab pro
You'll wait 10 mins to generate a 768x768 pixels image
First try and get an idea whether it's important and interesting for you. Second get an idea how much it's worth to you ...
it's insanely slow while using CPU
I tried it myself
but you can try if you don't believe
and you won't be able to use hires. fix or controlnets
just to know
Some told me they do well with the Hyper SD or Lightning or Turbo Models on CPU ...
Damn that slow?!
Yeah ... but maybe it's a good start to get an idea how interesting it is?
Imagine that an upscale from 1024x1024 to 1536x1536 will take 30 minutes
minimum
only for a single image
these are all from my tryings
so I don't talk without knowing
yep
CPUs are just not there yet for AI
maybe in the future they'll implement some revolutionary technologies to compete with the GPUs
A few days ago I would have been 100% with Bullseye ...
I guess you're right. Might be jumping the gun too soon. I just asked, because the starter page said to use a "discrete Nvidia video card (GPU) with 4 GB VRAM or more". So I wanted to know if AMD would work
Heard that Intel is preparing something
AMD can work ... but without VRAM no real fun if you don't use that special models ...
That's really interesting considering how much better a lot of the PC parts have come. But I guess, because AI is new and randomly generated, it's not fast enough to keep up with the processing.
CPUs don't have CUDA cores
For a real quick trial you can use an online service like: https://www.craiyon.com/ to check ... not really offering much but giving an idea for free ...
I'm not versed in pc tech. but what is VRAM? what are special models?
that's why every single company like meta just buys A100s/H100s
thanks for some free alternatives :)
hundreds of thousands of them
Dedicated Graphic Cards have their own (VRAM) ...
Notebooks often just share RAM ...
If you install Kohya SS GUI, you'll get some good free captioners there
I have never heard of CUDA before. is that like a program that AI uses?
out of the box in the "utilities" tab
Nope
I'll take a look. Thanks
I see. then what did you mean by use of special models?
Wow. I never knew that. This is definitely all new to me
There are models trained to use pretty few steps to calculate a picture ... normal models use around 30 ... they only use 1-10 ... so in case of emergency they might also run on CPU
yea, but when they'll swap to CPU, the generation will take longer
Ah ok. that makes a lot of sense. But why does it need around 30? Doesn't what you put in chatGPT and the few provided pics be enough to generate an AI?
But some say they are fine with that. To be honest I just don't know. I have a 4090 ... so I don't really care ... but it seems to work for some ...
I don't remember if best buy now has the 4090, but I saw they have the 4070 super. Is that a good GPU?
A picture is generated from text input and noise. Every step removes some noise and makes the picture better ... to use a pretty easy explanation ...
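A toy sketch of the step idea, not a real sampler: start from random noise and close part of the gap to the final picture on every step. The "model" here is faked by peeking at a known target, and `toy_denoise` is a made-up name for illustration only.

```python
import random

def toy_denoise(target, steps, seed=0):
    """Toy loop: start from noise, move closer to `target` on every step."""
    rng = random.Random(seed)
    x = [rng.gauss(0.0, 1.0) for _ in target]  # pure noise to begin with
    errors = []
    for i in range(steps):
        remaining = steps - i
        # Stand-in for the model's prediction: the gap between x and the target.
        x = [xi + (ti - xi) / remaining for xi, ti in zip(x, target)]
        errors.append(max(abs(xi - ti) for xi, ti in zip(x, target)))
    return x, errors

final, errors = toy_denoise([0.2, 0.5, 0.9], steps=30)
```

`errors` shrinks as the steps go by, which is the only point of the toy. A real model has to predict the noise instead of looking at the answer, which is why step count matters in practice.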
I've bought it 6 months ago ... today I'd wait for the 50series and decide whether to buy it or a cheap used 4090 or other one
Yea, just go with this advice, @ember citrus
So basically. the more pictures provided, the more accurate it will be to create a better picture
And I have a 5000 € coupon for the ASUS store ... but at the moment ... nothing to buy for me ...
I am using A.I. for a long time but I don't really know completely how it works ... but noise will be used to work on your prompt again and on details ...
The video I showed you is telling a lot about those quick A.I. models ... they do work pretty good ... I just want max ...
Hmmm. Aren't newer NVIDIA GPU nearly double that of AMD? Even i'm not sure I want to spend 1k alone on just the GPU. I just checked best buy website with the NVIDIA it seems the 4070 is better price going at $629.99. I didn't really want to spend 3K plus for my entire set up. Maybe 2K with a few accessories
AMD has often more VRAM ... but A.I. wasn't really made to work with them ...
really? I would've thought it be usable across any GPU as long as it can handle it
what? how?
you have an insider
legit
In the early days there have only been bad workarounds to make A.I. work with AMD ... I can't say how it is today cause I only have my own hardware ... I can't say how it is for Apple users ...
Won an A.I. contest ... 😄
bro that's the best contest prize ever
I won art contest awhile ago an all I got was a frisbee an a notebook 😦
Man. This kinda suck on my end. Oh well, I'll just use what you suggested and get the feel of it. See if I like it or not
I think it's more clever to start from the bottom and get ideas what you need more ... cause you learn stuff people starting with top hardware won't learn ...
That's a good look at it. Build a better foundation to build upon
I'm using technologies richkiddies wouldn't think about it and I can beat them ...
lol, I bet. You seem to have good set-up if you were able to win an AI contest
You have another solution though
Cloud GPUs services
but I have to pay.... 😦
I am just a creative guy learning and doing stuff ... but I am learning ... worth the most ...
Bull has a good point ... if you don't want to care about software to much and buy hardware ... services might be a good idea ...
Hmm. I saw that they recommend Colab pro or A1111 service. I didn't look at the prices since I saw that I had to pay for it. Why would this be a good alternative? Could this be used on my laptop?
That is using the server's GPU and RAM ... can work on your mobile ...
But is it as good?
Technology is the same ... just the way is different ...
Man, even Stability AI is training their models on servers with H100s/A100s GPUs
of course they are good
there are some GPUs even better than 4090
which you can rent for like 0.7$ per hour
with 40 GB of VRAM
That's why I think starting low and looking for better solutions is the best idea ...
yea
and even @loud solar with his consumer GPU won't be able to beat the servers'/datacenters' GPUs like A100
because he has a consumer GPU
it can't compete with the ones from the server racks
And you won't need to beat me 🙂
:))))))
really wish to see how fast is H100
but the price...
you can buy an apartment with that money
:)))))
Hmmm. to pay or not to pay. That is the question isn't it? From what you guys are telling me, it would be better to use paid services for now. Then when I get my PC build, then I can try transition into using my own build
TBH my Computer is running in Energy Saving Mode most of the time ^^
imagine how much would a H100 consume
the electricity bill goes boom
cheap 4090 - now there's a contradiction in terms 😄
The most important question is: What do you need? And none of us can answer that for you
exactly
if you need it for more like "amateur" stuff
go with cloud GPUs
if you're a professional designer/artist, you can buy a GPU later
if you really wish to be "helped" by AI
Right... I'll have to consider my options. What would be considered "amateur". They all seem really good and decent. Well I'm no artist, but I just wanted to get into this for fun. Not quite like a hobby, just something to do on the side If I feel like it
think I'd get a rx 7900 xtx if I bought right now
The https://www.craiyon.com/ might be giving you a first idea ...
has the 24G vram, which I care about more than the raw compute, and about $500 cheaper than a 3090
I have no idea what is better 🙂
just for you to know that this "market"/"domain" is so competitive and even oversaturated right now
so nvidia obviously occupies all the top 10 spots in benchmarks, but ....you pay through the nose for it
the AI art
and like I said, I really just want more vram
I get the oversaturation. Competitive wise, I never saw anything
I can't believe this
I just tested Forge on a T4 (with 16 GB VRAM) with 6 controlnets activated at the same time with XL model and it works
:))))))))
6 XL controlnets
are you kidding me?
who said the cloud GPUs aren't good :)))
gonna try with hires. fix on
We are at a point where everyone who is saying he is the god of A.I. and knows everything ... can only be pretty stupid 😄
I use cloud exclusively for training right now, otherwise I'd have to give up my GPU for large portions of the day
I'd think you have learned pretty early
Talking about the new iPads
BTW where's my personal SD3 download link??
Shut up bro
here you go
https://creations.mtdv.me/sd3
Why are you helping him
😄
Let him do something on his own
If you're gonna wipe his ass all time
He isn't gonna be able to do anything
That's not the point
No
I want SD3 but it will never come out, I'm depressed 
you can have it, all you need to do is swipe your credit card
well, not just that, your workflow will change, since now you will need to use the developer api
If it's not modified by the pony's creator, it's trash
You can already use SD3 locally
exists pixart
You can? I only watched one video about how to use it, but it wasn't locally.
it was through API key
"budget" version of SD3
not sure about that pony thing, but I agree the community tends to augment the base model until eventually I dont ever load the base model, it's happened with each release so far
interesting, maybe I'll get to it after the exams
Does anyone know what people use to remove the background from a short video?
or any quick tutorial how to get started
there's an rmbg extension I think
I think the most used term is AnimateDiff or what was it called
rmb is for pictures only I think
or is it rembg, something like that, I'm not really a video guy
me neither
yah but when you take a video and rip it to frames, guess what those are...pictures
I'd just use ffmpeg if it were me, but everyone has their own way
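Rough sketch of the "rip to frames, then treat them as pictures" idea, assuming ffmpeg is on PATH and the third-party `rembg` and Pillow packages are installed. The folder names and helper names are made up for illustration.

```python
import subprocess
from pathlib import Path

def extract_cmd(video, frames_dir):
    """Build the ffmpeg command that writes one numbered PNG per frame."""
    return ["ffmpeg", "-i", str(video), str(Path(frames_dir) / "%05d.png")]

def remove_backgrounds(frames_dir, out_dir):
    """Run rembg over every extracted frame and save the cutouts."""
    from rembg import remove  # third-party, imported only when needed
    from PIL import Image
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    for frame in sorted(Path(frames_dir).glob("*.png")):
        remove(Image.open(frame)).save(out / frame.name)

def run(video, frames_dir="frames", out_dir="cutout"):
    """Extract frames with ffmpeg, then strip the background from each one."""
    Path(frames_dir).mkdir(exist_ok=True)
    subprocess.run(extract_cmd(video, frames_dir), check=True)
    remove_backgrounds(frames_dir, out_dir)
```

Calling `run("clip.mp4")` would leave transparent PNGs in `cutout/`. Putting them back into a video is a separate ffmpeg step and needs a codec that supports an alpha channel.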
Its a little weird to use, but I made a decent workflow for it 
gpus are so expensive
can someone recommend me a cheaper gpu for lora training?
or someone make me loras based on my art stuff
Civitai has easy website based Lora training.
i see
yeah i got a 4 gb gpu on my laptop
i do see amd gpu are way cheaper for more GB too
man gpu costing like 500 USD+ 
yah, just do cloud training. I do that even and I have a 16G card
Hey everyone! 🆘 I need help fine-tuning stable diffusion models for product enhancements. If anyone has experience, I’d greatly appreciate your input.
is 12 gigs of vram good enough to train or no
basically a set of 512x512 sprites for games
is SD3 slated for a public release like the other models?
Are the bots permanently down or will they return? Who knows? Is there any update?
I thought that AI still lagged heavily with support for AMD gpu's even on linux?
amd arent great for stable diffusion
don't bother
get nvidia
Give me $$$ lol. Cloud is cheaper it seems
really depends how much you plan on using it
Making game assets
I mean more... Is it a use a few min a day deal? Or hours
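Back-of-the-envelope way to answer that, using the numbers already floating around in chat (about $0.7 per hour for a rented GPU, about $500 for a card). It ignores electricity, resale value and storage fees, and the function names are just for illustration.

```python
def break_even_hours(gpu_price_usd, cloud_rate_usd_per_hour):
    """Hours of cloud rental that cost as much as buying the card."""
    return gpu_price_usd / cloud_rate_usd_per_hour

def months_to_break_even(gpu_price_usd, cloud_rate_usd_per_hour,
                         hours_per_day, days_per_month=30):
    """How long it takes to hit break-even at a given daily usage."""
    hours = break_even_hours(gpu_price_usd, cloud_rate_usd_per_hour)
    return hours / (hours_per_day * days_per_month)

hours = break_even_hours(500, 0.7)                         # roughly 714 hours
months = months_to_break_even(500, 0.7, hours_per_day=2)   # roughly a year
```

So at a few minutes a day the cloud stays cheaper for a very long time, while at several hours a day the card pays for itself within months.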
RTX 4060 Ti 16gb looks to be a pretty good deal for hobbyist game design and such. Not sure what it's limitations would be besides just being a bit slower though.
Yeah they are good for general purpose use and with 16gb of vram, they work well for SD and llms
Where do they start showing their limits if you don't mind me asking
harder to do a lot of sdxl training but can still be done
slower
Sweet! Thank you for the answer.
later
I wonder if the new re-lighting could be used to fix the look of badly inpainted images
like, when you inpaint it gives this off-look because the lighting isn't entirely consistent
Yeah like clown said, it can limit you if you're into stuff like Lora training, but outside of that, 16gb is more than enough for regular workflow related stuff and has plenty of room to store whole models, plus things like controlnets and whatnot, without having to constantly shuffle models back and forth between the vram and ram. Oh and more room for larger images without running into vae decoding issues where you'd need to resort to tiled vae decoding
And if you're into llms, you can use higher quants and/or larger context windows without running out of vram
You can use local app Draw Things on those, it is great and keeps getting better.
https://drawthings.ai/
There's a discord for help and extras (like scripts, feedback, help and suggestions)
Also a few video tutorials by https://www.youtube.com/@CutsceneArtist/search?query=draw things
Since the support of the mps backend from PyTorch comfyui works on Mac (intel and apple silicon).
Yeah, draw things should work fine on a MacBook since it runs well on iOS devices. But tweaking the models might not be as convenient. If it's just for small images, no problem. But handling larger ones could be tricky due to heat.
I'm using it on my ipad, no problem here.
Generating big images is no problem.
It even works on my intel mac, slow... but it works haha
Btw if you're concerned about battery heat, you just use Low Power Mode which runs slower but keeps your device cool.
You're right, it does run, just a bit slow. But using Low Power Mode is a good workaround for sure.
The fast loras are doing a good job in speeding things up.
how about macmini
is macmini more powerful than macbook pro ?
also i heard it has inbuilt fans right?
does anyone know how to get deforum to morph back into the INIT image by the end of the video?
The more cores the faster, the more memory the more powerful.
You need 16 GB to do local lora training.
I have an m2 iPad pro and intel imac, not an expert in macbook or macmini.
You could go to their discord and ask over there.
🤡
Are the bots permanently down or will they return? Anyone know? I'm asking because the bots status is from February. Any update? Thanks
So, trying a tutorial to create a LORA of my face, getting ok results with SD 1.5 w/o regularization images, but poor results when retraining on SDXL. For just a face, are regularization images recommended when training a LORA?
Sd3 impressions? So far ive seen better stuff of civitai
its a model
Anyone knows what I am supposed to do with a bin file? i downloaded ip adapter, added it into a1111, and i see no preprocessor show up, just under model
I found lora training on face works much better on sdxl than on 1.5
you don't need reg images, it also works without
it can help to train text encoder, though. But it's difficult finding the sweet spot of text Encoder training where it does not overfit
just use prodigy and be done with it
Prodigy as the optimizer, yes?
Nah, AdamW is totally fine
Yeah prodigy really is a set it and forget it optimizer. It's hard to go wrong with it unless you really try. AdamW is still decent, you're just going to find yourself doing a lot of testing and a lot of following bad guides that worked for X person on Y thing and they act like they have some magic combination of settings and that it's applicable to all training sets.
Is there somewhere I can see what settings I should use with prodigy for learning rate, etc, etc...?
All learning rates get set to 1, if you want to adjust things, you do it within the optimizer
but you'll have to google around for more information, i don't have a convenient list of the exact numbers for all the knobs
the problem with lora training is that there are so many different things that can speed up or slow down the learning rate, without actually touching learning rates
AdamW has also just a single parameter (plus maybe warmup steps). Not that difficult
like you might have some learning rate that will train in 2000 steps, but then if you change the dim/alpha from like 16/16 to 16/8, now it will take roughly 2x longer, since alpha/dim acts as a scaling factor essentially. Same with batch size/accum, etc etc
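Quick sketch of the dim/alpha point, assuming the common LoRA convention where the learned update is multiplied by alpha / dim before it is added to the base weights. The helper names are hypothetical and only there to show the arithmetic.

```python
def lora_scale(alpha, dim):
    """Multiplier applied to the LoRA update under the alpha / dim convention."""
    return alpha / dim

def effective_batch(batch_size, grad_accum_steps):
    """Samples that contribute to each optimizer step."""
    return batch_size * grad_accum_steps

full = lora_scale(alpha=16, dim=16)   # 1.0
half = lora_scale(alpha=8, dim=16)    # 0.5, the update is half as strong
```

Halving the scale means each step moves the result half as far, so reaching the same strength takes about twice the steps or a higher learning rate. Batch size and accumulation shift the pace in a similar hidden way because they change how many samples feed each step.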
other settings like dropout rates can slow things down a ton
I have the feeling that Prodigy with default settings is always too strong. But in the end it depends on what you want to achieve
just make sure to use the tensorboard and save every so many steps or epochs depending on how you pace it
if you let it spend too much time where it's basically flattened out to around 10%, it will overcook it. it really doesn't take all that long after it reaches that point either. remember, you can always lower lora weights in comfyui/a1111/etc or you can edit the lora after you're done to rescale them
training sdxl with text encoders is a pita and most people don't bother doing so
but anyways, google, youtube and trial+error are going to be your best friend. practice small before wasting a bunch of time
Yea, that's where I'm at now, using 20 images and 10 epochs/20 steps, it takes a couple of hours for each run
Watch your task managers GPU page and make sure it doesn't go into shared memory
If it does, you need to either disable text encoder training or lower the network dimension size
(usually)
Shared memory usage of even a hundred megabytes can make training turn to a crawl. Like it can slow it down by 10-100x
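If you'd rather script the check than stare at Task Manager, here is a rough sketch that polls `nvidia-smi` and flags when dedicated VRAM is nearly full, which is usually the point where spilling into shared memory starts. It only sees dedicated memory, so treat it as an early warning and not as proof. The 500 MiB headroom is an arbitrary assumption, and the helper names are made up.

```python
import subprocess

# Query flags supported by nvidia-smi; values come back in MiB with nounits.
QUERY = ["nvidia-smi", "--query-gpu=memory.used,memory.total",
         "--format=csv,noheader,nounits"]

def parse_usage(line):
    """Turn a line like '15900, 16384' into (used_mib, total_mib)."""
    used, total = (int(value) for value in line.split(","))
    return used, total

def near_limit(used_mib, total_mib, headroom_mib=500):
    """True when less than `headroom_mib` of dedicated VRAM is left."""
    return total_mib - used_mib < headroom_mib

def check_gpus():
    """Return one near-limit flag per GPU reported by nvidia-smi."""
    out = subprocess.run(QUERY, capture_output=True, text=True, check=True).stdout
    return [near_limit(*parse_usage(line)) for line in out.strip().splitlines()]
```

Run `check_gpus()` in a loop while training. If it starts returning `True`, that is the cue to drop text encoder training or lower the network dimension.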
what resolution images?1080p?
m4 ipad pro will run smoothly SD?
Thanks
how much time it takes
to generate 1 image
on ur ipad
I will let you know later. Too busy right now.
I estimate it would probably be like ~20 seconds for a 512x512 image, and maybe like 2 minutes, 43 seconds for an SDXL image
How 
I have one too and wanna know more
Stability should partner with Stock image companies
like, SD3 powers image variations. At the same time, SD3 is trained on the stock image datasets.
Even have something like, with a subscription fee, you can use the stock images for training.
like, a huge collection of human-tagged good quality images... how is that not the perfect marriage?
Stock image companies like Getty Images or Adobe have already trained their own models on their own stock
but there are other smaller companies still
but I imagine their models must also be super good in that case, since they've got perfect datasets
My sd upscale for comfy UI
Should have a window to show me where is the process until right how I setting it
hey what is the best value cloud service to buy btw?
google collab?
I think i will just run SD on collab or something
Sick, gotta check that out
lol just download it 😁
and check the draw things discord, tons of info and handy stuff
I will when im home, how fast is it on the m2 ipad pro
it's not fast two to three minutes for 1024x1024
but i'm using the fast loras and then it is around 30 seconds
896x1152 with TCD sampler and hyper lora 32 seconds @fervent thunder
but it depends on the amount of steps, (8 steps tcd, 20 steps euler a or dpm++ 2m karras)
121 sec for 1792 × 2304 8steps TCD
that's with an upscale script
just give it a try
what resolution is that image?
ooh
so double that time for 1920x1080P?
also 512 images work for thumbnails?
youtube thumbnails?
Bro
if you have an iphone, ipad or a mac you could find out yourself, it's free
fastest is on the M chips
also what is lora
does it speed it up?
i m thinking of taking an ipad for SD coz how hands on it is with pencil and hands
i mean touchscreen
is 4060ti faster than this?
i don't know, think so
M2 ipad pro
i also have an imac, but that's intel based
also how is ur experience
with m2 ipad pro
with sd
is it buggy or crashy?
also have u tried editing with ipad
in davinci resolve or final cut pro?
No, it's not buggy at all, it works great and gets regular updates with new features
ooh cool
also have u tried editing with ipad
in davinci resolve
is it buggy or crashy
i heard lots of people say davinci resolve is crashy or buggy on ipad
The ipad is next to my imac, i'm using universal control so i can use my keyboard and mouse for Draw Things and the image generation runs while I'm doing other stuff on my imac
I haven't used davinci on my ipad yet, just on my mac
u dont use pencil?
only for masking or drawing
flawless
really?
yes
masking and drawing huh pretty cool then
before i take m4 ipad pro i just wanted to know users experience
how is ur experience vs pc
which is imac
you need to know that you're not able to use other apps while generating images
did u get any frustrations like ooh u cant do this in heree where as in my imac its so easy
ooh ipad os
i wish they gave it mac already
My imac is my main computer, the ipad is for the creative stuff
can u plz do me a favour
can u plz install davinci resolve on ipad try editing videos and share ur experience of editing
coz i m mainly taking ipad m4 for editing and SD
u can try editing 1 minute 1080p@60 fps video
coz i really want to know the davinci resolve timeline experience on ipad
or have u tried editing in final cut pro?
try search: "youtube davinci ipad" there's a lot of them
Tomorrow we'll know if this was yet another week without news.
Give me access to the site news and I will post the main news - SD3 will never come out 

Breaking news: OG diffusers give 0 cares about SD3 - "We are perfectly content with 1.5"
Im more of a pixart/XL user 
With a new category soon 
the XL/2.1 group = meh
blud grouped 2.1 and XL together 💀
Dont compare me with a 2.1 user 
and the ones who ask every day when SD3 out = caveman skull emoji
no not 2.1
I mean pony
or whatever that thing is that is parallel to XL
yeah it was trained on top of XL
I would also include cascade but that thing was the shortest lived fad I have ever seen in my life
hmm discord bot SD
i want model weights
I wanna use my 4090 I bought
nuh uh!!! here's a bot
heh
nice announcement, not gonna spend money tho
you'll have to wait till the end of may tbh
yep..
looks like SD3 wont be open source after all
it will be
time to crowdfund a model?
They want to make SD closed and paid, when the whole appeal of SD is that it's open source, has all of these open source tools and the flexibility of running locally, and without all of that, they're just a worse alternative to other better paid AI services already out there and they'll just no longer have the same tools, so a lot less appeal than their older versions.
it's still training kek
alex (mcmonkey) said that the model is still undertrained
and they are trying to solve issues with the model
all models are always undertrained
so it aint gonna be may 10th
he meant like severely
Is artisan the new bot or what
its just an access point to their **paid** API service
its not free
SD3 never release
Bruh.
yeah but they are also financially in trouble
dunno
I understand that, but this seems like it'd only put them further down the pit.
its frustrating because id like to support them but the only thing that is actually useful to me that they create is the model weights and if i cant run them locally and fine tune there is no advantage
we want sd3 and got a discord bot instead 💀
well they are kinda late now with it, investors are according to the latest information refusing to invest in the company as Emad wanted it before he left
A local tool that could be sold as software for a one time fee with user extensions would be way better than just resorting to doing a worse version than everyone else and stopping the thing that got you to where you are now.
and for those that think crowdfunding would do the job, thats simply...naive
open source does crowdfunding
it works
but it would help SAI

