#💬|general-chat
1 messages · Page 134 of 1
Hi. Does anyone know of a generative cloud API provider that supports controlnet?
Google finds nothing. Perplexity tells me it does not exist. There must be something though, right?
Here is my gift to the Forge users
https://github.com/lllyasviel/stable-diffusion-webui-forge/pull/692
You just need to switch models, that's all.
And make sure you are using appropriate settings, for instance most SD1.5 models are trained on 512x512 images while SDXL are trained on 1024x1024
no because I can give StableDiffusion nonsense words and it still generates images based on its best guess about what those words mean. If I ask it to draw me a fleeble blorpo it doesn't have a collection of pictures labeled as "fleeble" to work from.
ya and did you know that it can also gen from completely blank prompts? no fleeble gorpo needed
what parameters it uses to diffuse idk, but what Ive seen is that it tends to veer into what the model is most familiar with
it's good design to force it to diffuse despite bad prompts/no prompts tbh
make fleeble gorpo great again
i dont mind if you can send me some configs or settings, i like to learn stuff :3
ayooo
kinda new. downloaded Stable diffusion through automatic111 a couple of months ago and forgot about it so i basically forgot how SD even works
it's magic
it's a simple combination of 1s and 0s :3
There are some that host ComfyUI, but they're typically all paid services.
Or do you mean direct http txt2img APIs that also support controlnet?
Things about to get wonky in the world of AI lol. The big players in the corporate sectors are basically all on the AI safety and security board.
I believe the regulation to push out open-source may have begun lol. My next software tools and those I help with in the future may have an updated license depending on how things play out. Future releases of open-source tools may be unmonetizeable beyond the output itself going forward. Others have already started updating licenses to prevent companies from utilizing their tools so double check licenses to make sure their haven't been changes before trying to monetize tools incorporating them.
fuck monetization.
open source and monetizable shouldnt be used in the same sentence
crowdfunding is the name of the game baby
It's an advisory board. Chill out lol
Ah, just passing along what was passed on to me. I'm rarely the first to get these updates 🤷♂️. But do check licenses lol. Many have changed.
because donations to maintainers isn't a form of monetization whatsoever
oh....for real...ok that's serious, someone better speak up in this forum that has no official staff participating in
They released the papers way too early
In other words, pump fake
I really liked watching scott but I quit because this community isn't up to speed. This is some Alibaba release paper stuff, where they release it but it never gets to open source, you just hear about it. My thought is they think they had to release something so they released a paper that was not ready since open_sora went live, sd3 papers came right after with sv3d
Did they get rid of the bot for ai image generator
Only sv3d is real 😂
Maybe this community will learn from it's lesson, maybe not. I work for a tech company that makes development tools for developers -- we don't do this crap because it'd make us look bad
They should have held on to their papers. 😏
I mean it is availble via api - what are they gaining by holding onto the weights? Possibly some api income at the expense of a loss of hype and pissing off the open source community
I won't make fun of them too badly though, I still like SD, I just dont like groups that jump the gun to keep support/popularity when in fact, the longer you take to release the new version you've amped up for over a month now, the more people you run off
Open-source community is kind of on their own now. We've been abandoned lol. There's not even any of the open-source team members on the advisory committee lol. Even Zuck got excluded and it seems it may be because he didn't hold back on AI and the metaverse lol. But officially its because he runs a social media company unlike Google, Microsoft, and the others 😅.
eh would you like to have a rushed unoptimized product or delayed but optimized product
seriously, you can see how r/StableDiffusion peeps react to this.
A tiny drop in the ocean of actual users. It's an echo chamber of the same couple thousand people, just like it is here
The majority of users are silent
hm.
at the end of the day who cares whether it is open source or not open source, they just want to get their job done.
like hell they just want to generate a logo design as reference. Most of them are either getting into DALL E 3 subscription or wasted couples of hours just to try to installing A1111
plus figuring up where to put the checkpoint and ControlNet stuff
Anyone know any free ai video to text tools
A ComfyUI host would be perfect and amazing!!! I'm expecting it to be paid. Obviously if it was free it wouldn't be around for long.
Please tell me if you know of one. I can't find it by searching.
Civitai advertises ThinkDiffusion on the buzz dashboard. I haven't tried it myself, but that's one I know exists offhand
I think juggernaut XL also advertises RunDiffusion from its huggingface repo, which I believe is similar
If anyone is interested in a free Alternative to Midjourney, I've been working on a bot that can generate images at a similar level if not better. The images generate as fast as 8 seconds. LMK if you want to test it out, we are looking for feedback and suggestions from anyone who is willing to help out.
ThinkDiffusion isn't at all what I mean, unfortunately. I don't want a virtual machine to install my own ComfyUI instance, I want an always-available ComfyUI API endpoint. Think like StabilityAI's always-available API where you pay per API call.
(Running a private virtual machine 4090 equivalent is always going to cost you over $30/day just to keep it available, regardless of how much you use it.)
I'm looking for someone hosting ComfyUI as an API (really just ControlNet!!!), not someone providing private virtual machine hosting. (If I wanted a VM, Amazon is way cheaper. But still way too expensive.)
What ya do is use AWS Sage maker, or use Terraform to build your a bunch of spot instances for a short time which are cheaper to do X for a short time

Ah, I haven't vetted much of anything about them, but ModelsLab might be something to check out. They were doing something pretty close to a1111 as an API when I last looked close at them before a name change (can't remember the old name)
I ended up avoiding them because they couldn't mark their models' licensing terms in a way that worked for my use case at the time.
And they started publishing paid sdxl APIs before licensing would allow it.
video to text? what does that do? im not familiar with all the ai tools these days. is that like a tagger but for videos?
or a video speech transcription?
in confyui when im generating a batch of images do they all have different seed numbers? if im generating like 4 image how do i know which image has what seed?
i think it's sequential, first one is the seed you see on ksampler and the others are +1
but not sure
😭
big boys dont cry
Drop the picture into your Comfy and you should see ...
If anyone is interested in a free Alternative to Midjourney, I've been working on a bot that can generate images at a similar level if not better. The images generate as fast as 8 seconds. LMK if you want to test it out, we are looking for feedback and suggestions from anyone who is willing to help out.
rip bozo
Anyone know any website that can fixed the bad hand ?
I try to fix by using inpaint and depth liabary also but no have good results
Pretty sure the main comfyui dev would have thrown it in already if it were quick to implement. He's usually fast as hell about stuff. From my understanding, HiDiffusion would essentially have to patch/hijack the entire sampling process for maximum compatibility. Though I'm sure someone could throw together a wrapper node to experiment with it in the meantime, but you'd basically only be able to use it to output an image data type and wouldn't be able to anything fancy inside the blackbox.
👍
hi guys
is there anyone whos really good with ai
if so can i pleasee ask you something
what is it
bro pretty please, if you know how tell me how liberxx0 makes his videos/pics
what tool does he use
🙏🙏🙏🙏
i wanna learn everything about ai just because i saw his videos
it seemed to have Midjourney and SD tag with all of their post though.
Much likely this workflow?
Midjourney -> SDXL -> SVD/AnimateDiff
with SDXL part including some ControlNet
BRO THANK YOU VERY MUCH!!!!! if i could i would kiss you! ive been lookin all day for thisss ❤️
god bless you bro
yeah their art/video is pretty creative
hell yeah
it could included DaVinci / Premier / AfterEffect in video for special effects
or like what, Photoshop?
This is amazing!!! Wow their site is buggy though. I have so many questions their docs don't answer. But thank you thank you thank you. 🙏 ❤️ This is the first glimmer of hope I have seen. I was getting seriously afraid I would need to provision a local server to avoid coldstart times and reserve an on-demand vm from aws.
O sweet mercy ModelLabs want $5/day for ControlNet access. I mean, that's 85% cheaper than self-hosting a VM. Hmm. I think I need to at least try it... But they don't have a demo...
python判断excel文件数据,在A-F列相同的情况下,再判断L列是否相同,如果相同然后再继续判断O列的数值是否互为相反数,
Hello
Yeah, I'm still on the lookout for a cheaper and still reliable GPU-enabled VM. There are a few services out there that try, but I haven't had much luck as far as repeatable infrastructure with them (akashML might work for you, since it doesn't sound like you need to dynamically scale as quickly as I'm aiming for)
If anyone is thinking about hosting SDXL, I've read the OpenRail license very carefully (as a non-lawyer). If you have a very small userbase, you can hope 🤞 lawyers will ignore you. (They want money, and little guys don't have money.)
https://huggingface.co/stabilityai/stable-diffusion-xl-base-1.0/blob/main/LICENSE.md
Basically, the license:
- requires SDXL to stay freely and openly available.
- requires you not to use it for certain restricted use cases
- requires you to pass on those restricted use cases to your users
Here's a breakdown of that restricted use.
Again, you're just passing these restrictions on to your users. So, for example, you cannot sell a service offering to help police upscale security camera shots with SDXL; but you can offer an upscaling service, and warn police they can't use it on security camera shots. You're passing on the restrictions.
Restrictions. Don't use SDXL to:
- break the law.
- hurt minors.
- hurt people with disinformation. (Wording matters here. Fiction is fine.)
- dox or harass anyone.
- inform legal decisions.
- discriminate against any group or legally-protected persons category.
- scam or bully anyone.
- inform medical decisions.
- inform the justice process in any way.
Because ControlNet and distillations like LightningXL are initialized from SDXL's weights and/or Complementary Material, this license also applies to ControlNet and SDXL Lightning, and all SDXL models available on Civitai.
Anyone can add stricter use-case limits to the list above, another entry after #9. But, no one can contradict the license's clause that grants: "a perpetual, worldwide, non-exclusive, no-charge, royalty-free, irrevocable copyright license to reproduce, prepare, publicly display, publicly perform, sublicense, and distribute the Complementary Material, the Model, and Derivatives of the Model".
In other words, Stability granted you a license to use derivatives of SDXL in every way. You can use them.
Thanks. What I really need is a very low-cost way to provide on-demand access to zero or more users, while avoiding cold-start time. It really is a tricky problem. I think the way to go might be: self-hosting a low-power machine + cold-starting a cloud VM on demand. The self-hosted machine handles API calls while the VM spins up, and might not spin up the VM as long as we only have 1 user.
(But AkashML is just reselling AWS. There's no point in going to a middle-man like that.)
why can"t speak in the bot
bot关了
broken heart.
yea, I hate when people do this. Like they did with Cyberpunk 2077 :)))
like welp you had a thousand reason to believe those paper is just for show-and-tell
and so do other paper
Willing to help/test. But I can't really believe you saying that your bot generate better images than MJ. 🙂
Yea, but what do they win just by showing off something which is not even nearly finished?
Just for the hype?
alright
do you want me to invite you via link, or you can search it up on disboard
not even nearly finished? Come on you didn't touch #🆕|sd3 didn't you.
To be honest, you know team there in Stability are pro-decentralization. Of course it is much complicated than that with the unknown relationship between investors and staff. Plus there is Lykon and other beta tester going out there.
I meant that for when they released the papers in February :)))
invite
Anyway, I think that this will be the last checkpoint we'll get from Stability for "free", so the open-source scene will be on its own from now on
sure it got some scandals ( Emad resignation, financial burden ) during these months and it is fair to suspect that they would become Midjourney-similar.
Like really, are they supposed to do so if they are a for-profit organization
they still need money for fund
I mean there was reasons why they release paper first before proper weight release.
"Hold on to your papers!"
WHERE IS THE BOT
What bot?
what's its name?
Fooocus AI
there's no bot on this server for SD3 like it was for SDXL, you got lied
:)))
You should stop while your ahead
Man, there isn't any bot for generating SD3 images on this server. For real now.
I never said you was lying so keep that nonsense
exactly
So you have like 10 "free" generations on their site
with the SD3
it's more like a testing
It's clear that we won't get things for free anymore especially from this AI field
the "open-source" things will be in small groups of AI trainers who will probably monetize the models on sites like Civitai/mage.space
I'm not for it, All.this talk of scaling humanity
LIES
No different than ChatGPT
Bullshit AI
You don't scale humanity by greed
Especially when this technology is so new
I'm not paying these fools to make doodles of celebrities and nonsense that you do t even have right to own
How many of you bumped your head
Man, the Artificial Intelligence is not good or bad. It's like with a knife - you can buy one to cut the bread or you can buy one to kill someone. The only one who is good/bad is the person using it. You can use it in good ways or in bad ways. It's that simple.
But of course that the greed is taking over
like always
When it's gate keeped its all bad
yes, but you can't do anything to them
Who said I need to
You are making assumptions
A.d you must of bumped your head
Too powerful?
Please man
Man, even Zuck bought 500K H100 GPUs
What does that mean for us?
Zuckerberg also stole an election
Lol can you see the wood through the trees
Or are you just riding the hype train?
Another "artist" who joined to tell us that all we do is pure "evil"
why would you argue with someone with such a broken world view and the name TheTruth, like at best they'll be a conspiracy theorists
:)))))))))))
yea, right
Tenoke prove it wrong or you are the broken one
All I see is bullshit from both of you and then you act like I can not have my own opinion
Please.
Think he's more like an artist who will lose his "job" because of "AI"
It's all good to ride the hype train but be realistic
I'm actually a programmer if you must make lame assumptions
I could care less if you agree or not lol I don't need confirmation, If you don't understand that is on you.
:)) gl in life
Ty 😊
lol
This seems to be the best alternative to InstantID - listening the prompts the best: https://github.com/ToTheBeginning/PuLID
code monkeys go down first
I have a question, How do I train a model that keeps changing but maintains 90% similarity? For example, I want to train it on different kitchen setups, such as modern, traditional, and old, with various colors, etc. How should I go about training such a model? Should I input all the modern, traditional, and old training images into one model and train it? If yes, how does the model discern between a modern and a traditional kitchen photo? How does it know what to generate when asked for a modern or traditional kitchen? Also, how many training images should I include for different kitchen setups like modern, traditional, and old?
The more I look at Nvidia's Consistory and Adobe's ActAnywhere, the more suprised I am that they haven't been reverse engineered already. Consistory literally solves the consistency problem, while ActAnywhere allows subject aware video backgrounds. So what gives?
https://proximacentaurib.notion.site/e28a4f8d97724f14a784a538b8589e7d?v=ab624266c6a44413b42a6c57a41d828c Hey does anyone know if there's a more easily downloadable version of basically this website. Like I just want a Google docs or something of all of the artists with their art next to their name. It doesn't literally have to be the exact same site thing as this site but I need it downloadable for offline rendering.
ai is bad
another one :))))
until SD3 comes out everything is bad
Then just have to wait a couple of months until a new pony comes out and the world becomes beautiful

Pony Diffusion showed us that a small group of people can really achieve great things.
in AI fine-tuning
The pony is so good that it is on a completely different level compared to all other models
anyway, there'll be only small groups of trainers in the future and the community will help them with donations

What amazes me at Pony Diffusion XL is the prompt listening/following
and no blur
like in 99% of the SDXL cases
pony is mostly anime characters tho which kind of seems like an easier more constrained task
pony in realism is better than any other model lol
maybe
Nah, that's really a joke
it isn't very good at realism, at least for now
if you take people
I'm not talking about backgrounds and abstraction
all that all models can do is a standard easy pose and standard clothes
you have to ask them to change their clothes and position and they can’t do it
the pony looks a little less realistic than models designed for realism, but it works 1000 times better
yea, the pose is better, but the overall realism is not there yet
Pony XL is really a leap forward
I'm looking forward to v7, when we don't have to use that strange pre-Prompt anymore... and hopefully Controlnets and IPA work better.
Is there something like a Pony-Jug9 Mix to get the best of both worlds?
What do you mean, man? The controlnets work best on Pony Diffusion compared to XL 1.0.
canny, openpose, everything
don't think so
all pony modifications are shit
it stops being so flexible
😕
it can be incredibly realistic like this https://huggingface.co/deadman44/SDXL_Photoreal_Merged_Models#potest2
but ceases to be flexible
Huh? Guess I gotta give it another go then. Maybe that it doesn't handle IPA well scared me off.
Confetti is held in high regard.
And in my experience it brings some (SFW) prompt comprehension back that vanilla Pony isn't strong with in my experience.
Like I had Confetti create a character (don't remember the exact specifics and can't check rn) with different colored hair, shirt, pants, socks and shoes and there was almost no prompt bleed and it worked 3/4 or even 4/4 times.
pony+sd3 must be something incredible
but the only problem will be the requirement of 24 GB of RAM.

RAM is no problem 🙂
arent there different versions being released, different sizes i mean
it's about T5
then less then 24gb is fine
it is it who requires so much memory, not the model
psst
t5 can be 4bit'd
the DiT can also be quantized afaik
lol no
why
no
it sometimes explodes
I don't even use scores
correct style lora, a few words and you're done
like all other models only more flexible
score_9, source_anime, score_8_up, score_7_up
oh, lol
still doesnt change the fact that the clips are deep fried
they left them at 1e-5 iirc
I'm still hoping we'll be able to substitute T5 with other (smaller & more efficient OR more use case-specific) LLM's.
Or rather VLM's
Like there will supposedly be a Llama-3 based VLM in some weeks.
It's definitely... full of Ponies. I'd love to have a version without all that Furry content... but since the creators are especially doing it for that... not gonna happen.
We can probably just be happy it's so good with other things as well.
4bit t5
you can also just, not, use t5
t5 good for text
source on that? 😮
llava3 is out iirc
Can't remember. Heard it somewhere, but more than once.
oh its yoinked yay
yoinked do you think longclip-l and (especially) longclip-g will affect prompt adherence
which llm is both uncensored and unalligned
that can run local on 24gb or less
and can pass the racist joke test
?
on first try without prompt engineering
you think im fucking stupid?
I only saw that coming from 100 miles away
except I thought it would be the classic exorcist screamer
code monkeys will just evolve ot use these new code completion systems. Managers aren't going to know how to use them, since they can't code to begin with and can't vet the results. Managers are going to be the first to truely face the wrath of AI replacement. Laborer type people will just move onto other laborer jobs, since manual labor isn't going to dry up anytime soon. Managers though... their employement opportunities are going to dry up VERY quickly
Anyone have gradio problems
possibly
Pony is only good for uhm... anatomy.
Hi group! New here and I do most of my work using ComfyUI RTX 4090... Any tips appreciated. xoxo
wishful thinking, I already told all recruiters trying to sell me code monkeys to f-off
You think this is a game?? This is SD :))))
😦
Maybe 24 GB of VRAM
Anyone know why the batch folder or multiple image upload does not work with open pose in Control Net?
Man, you can't say that pony diffusion is bad. Maybe you used a bad checkpoint.
You get good hands on pony diffusion. Idk what they did when they trained, but they did a good job anyway. Even being a small group of trainers.
DPO and strong captions
Devin is just the beginning. We'll reach a point in this evolution in which AI will start "coding" itself if it makes sense. Like the "Stable Diffusion Infinity Z" will have the capabilities to continuously learn from what it finds on the entire internet and you won't even have to train it anymore (it'll reach a level of prompt/image understanding that will shock the trainers). :))
You a manager? You should have fear.
Let's not even talk that 2 experiments from top-notch companies went wrong and now there are 2 bots freely "coding" themselves and "running" on the internet
You might be relegated to being the one code monkey who can produce a lot more work
hahahahahaha
we can talk about that. They'll cludge themselves around and their "freely coded" modality will go no where
:)) What if they'll reach next level? I saw Emad saying on a tweet that they used images from a human retina impregnated in the brain to train a model or something (or at least they are researching this). Imagine taking a human brain and integrate it into an AGI
self improving systems are going to be a thing. I just don't think we accidentally stumbled into it yet
Musk is already trying some things with his chips implanted in the human's brain
Augmented intelligence is the only future in which humans are still involved
Nah
I was talking about a brain which is taken from a dead human
neurolink is very simple right now. 100 wires i think? the guy with it installed is able to move a mouse around a screen with thoughts so far. pointing and clicking with the cursor to select things.
If the goal is to improve the human computer interface's bandwidth, this aint it
hello
dpo is good
This thing won't be really good for people anyway. Imagine that these implants are "injected" into your brain or connected to your brain and a hacker will know how to break the safety system.
It legit means that he can control that person mostly like a robot
it actually uses BT which i think is very secure for the time being
it's not really implanted into your brain either. the device rests outside of it, and electrode filaments are implanted into the tissue. like hairs
when neurolink was first starting, they said the best approach they want is to have something inside your brain's veins, like a net that self assembles through your brain, and reads signals that way. interesting science. have you seen the wait but why article about neuralink?
Guess there will be something like an USB connected to your brain that can access memories like in Cyberpunk. This way you can save all your memories for the time being
This thing is good and bad at the same time. Good for people who are sick with Alzheimer for example. Bad, because hackers can get the most sensitive information about your whole existence
doubt. any kind of digital storage won't be in your brain. the interface will send it to a device, like your phone. the interface is one thing. the brain learns how to use the interface for now. THe brain can learn to speak to the system. But, extracting memories? like the system having native brain input? that's a few milestones down the road yet. Going to need a neuroscience breakthrough for that level of use.
stability has a funded a research path where they show someone an image, catscan their brain and use diffusion to reconstruct that original image. weird stuff, but also requires high resolution imaging
true neuromancer decks, i think are a while away yet. fusion 10 years kind of timeline
Anyway, the corporations will be the #1 enemy in the long term.
just like neuromancer
when used effectively. not just something that can turn on for any model i'm afraid.
my ai got deleted lmao how do i download again
I hate it when that happens
I for one am glad to see the coding workfield collapse bc of AI
not only was that market oversaturated (especially with questionable overseas crowds)
but coders are some of the smuggest most disagreeable people on earth
if anyone deserves to lose their job its them hands down
Anyone know if its possible to mask part of an image so nothing is generate on it?
Sorta like blank padding
in img2img? just mask it and select inpaint not masked, it'll paint everything else
Works fine ty!
Hello, I'm new to this, is there a way to use stable dffusion on Android?
not for localized image gen, but you can access a portal I guess. it would be frustrating to do anything meaningful on an android device I can tell you
Just coming back after a 6+ month hiatus. Whats the currently most used UI and models? Saw youtube videos talking about Comfy Ui and Forge Ui, and then SDXL being the best SD model currently?
sd3 on hugging face when
i would also like to know that
Wasn't this a thing because they made a mistake, during training?
correct
can someone link me to where i can dl SD
People here still either use ComfyUI and A1111
i need a newer one cuz when i try to launch it it runs with 3.11 but it asks for 3.10.6 and if i uninstall 3.11 it asks for 3.11
Illyasviel is pretty slow at upgrading/push his Forge
3.11 of what?
A1111 should not differ this though.
what is A1111
or possibly you got two or more Python in your system
abandoned
a webui
yes i have 3.10.6 which it asks for and 3.11 which it asks for
so what webui are you downloading?
im just running the one on my pc ive had for like a year
idk where to get a newer stable
A1111 is just git push in command prompt
u do realize i dont know what that is
yeah sure. Both ComfyUI and A1111 provide installation guide in their README which you can checking up.
https://github.com/AUTOMATIC1111/stable-diffusion-webui
https://github.com/comfyanonymous/ComfyUI#installing
A1111 did clarify that it need 3.10.6
i wanna make cards for my game with SD
@arctic lantern maybe follow this: https://www.youtube.com/watch?v=ylHTojkioWY
thinking of it I would recommend you A1111
do it
thats the one i was using before i think
i treid it but there are issues
save you an ease from node-phobia
what kind of issues?
i sent u a screenshot
i have 3.10.6 and it doesnt open
im not sure there is a model that can do it
i tried before and made some cute ones but still need some work
but now i can't do it again
it is usually SDXL/SD1.5 -> Logo LORA
the game will be open source
i remember a while back using an rpg model to generate some tcg card art, it did a good job, also depends what type of art you want i guess
do you think it is possible?
Isn’t Forge UI basically identical to A1111 but slightly “faster” ?
yeah optimized A1111
That’s what I thought okay. Thank you. And the main model people use are the ones running with SDXL as the base? Or is 3.0 now a thing?
nah SD3 is still not release yet
SDXL finetunes it is
Ah perfect. I’m on track then haha thanks for your updates!
any news on when SD3 will be available?
few weeks
sometimes in the recent auto1111 update images just stop working. not sure what's going on. it seems completely random
Then eventually it starts working again
The world is not ready yet for it, it;s like the messiah. The world isn;t ready.
May 10
The day paper Mario comes out
Mario is evil.
i did not see that announcement 😮
👀 👀 👀
Released on huggingface, yes. It will be the moment we've all been waiting for.
The gods will bestow upon us, their latest creation.
hopefully no one dies before it releases 🙂
i hugged a whole body
hi, is the 402 error for sd3 api normal or am i messing up the curl command somehow?
i mean im following the example, have credits, using a valid key, etc... weird
😭
Guys Ive installed roop for Stable Diffusion but i cant find the damn roop drop down menu
that is HTTP Error ( like the 404 Not Found )
you didn't make a payment for it to generate image for you
probably run out of credits
where ? 
how long does it take sd 1.5 to finish installing
depens on your distro/how you install it/internet conection
how long is a piece of bandwidth
dide, thanks for edit, i've almost spilled my beer on my screen, : ) I will use your phrase for sure. : )
avarege, lets stick with theat LMAO
My CPU is just a tip LMAO. thank you my man : )
just a tip of 4090, i'm on I'm on NV 1030 : )
I just have a classic old 4080
im on a laptop, still does the s*&t that i do : ) but nowt quite. TI is done, i wish there was a support for it.
man this was fkn funny. im here in the voice chate some dude joins the chat. tells us why are you joiining the chat and leave it. Dude, do you need a manuel how voice chat wokr? : )
Same thing when people joice the voice chant and act as if they are DEAF? Why the hell do you doin it in the first plaze? It's not a Publicly clamed Weimar that you have to attend. ehh.
hmm... I'm kinda puzzled about the samplers and checkpoints
have a lot of ideas with characters in mind (all of them written down in a document with more to come) and wonder if I need to let every checkpoint on on a couple of sampling methods in order to get the best combination out 
怎么生成图片
Have a look at the model description on civit.ai
Roop is abandoned, you need to use the Reactor extension
roop unleashed is a thing
Kinda true. That air of superiority and they don't even pay taxes on some countries. :)))
seaart ai
Did he really abandon forge? I thought that he took a break. Kinda same happened with Fooocus.
am i understanding correctly that a Lora Can be based on a specific character or based on a specific art style, or concept like specific outfits, robotic features etc.
hiiii
its hard to keep up with 2 repos at once (a1111 and comfyui)
/othello image ai
People said forge was the future, now it's abandoned? Glad I didn't waste time setting it up
You come at the king, you best not miss
just use comfy and never worry about missing out on model support
it is about if it's working
I had problems iwth A111 and A111 has a problem with SDXL models if you don't have x GB VRAM and won't load them so you have to activate additional setting
quite sad, forge is really good for people that don't have good gpus to run SDXL
also, is there a post explaining those samplers?
DPM++ 2M SDE SGMUniform
DPM++ 2M SGMUniform
Euler A SGMUniform
Euler SGMUniform
LCM Karras
DPM++ 2M SDE Turbo
DPM++ 2M Turbo
Euler A Turbo
i downloaded it and it saying it needs 1.5 sd and its "watiing for file to be created" and its spamming it and id did that yesterday for 8 hours withouth anythin happening
faceswap goes fuzzy around the face, even at high res. any tips? thnx
When SD3 weights?
near the end of May, according to CivitAI newsletter
how did you install a1111?
did you use git clone?
or you installed some app to install a1111?
can you show the link to that newsletter
Hello everyone, is it possible to use 2 GPUs at once for faster speeds?
Flexing. XD
Yea, but it seems that there's no one out there like him (or atleast for now) - he really "made" 2 good WebUIs ("optimized" a1111).
key word, "made"
foocus is alright; forge is just auto with comfy code
right, but no one achieved this or made something like this public available + all the controlnets for 1.5
It seems that he's more like a "starter" projects person
he starts them, but won't go in the long run
like he gave up training controlnets for XL
Yea, forge is better at this
just google
I was usually using dpm instinctually
or karras
What, are you kidding me? It was expected at the end of April... No wonder if they'll say at the end of September
in webui.bat file
right click - edit with notepad
an indian woman with a side parted wavy lob hairstyle https://s.mj.run/piFqzUCzJvI wearing a black checkered shirt and grey jeans, in a basement, frightened by the smoky figure standing infront of he, thinking it is a ghost, cinematic, atmospheric, suspenseful, Eerie Black + Timberwolf + Platinum color palette, shot on Arri Alexa LF with Arri Prime DNA lens --ar 19:10
you aren't in the midjourney server
🙂
are people fine tuning other non-sd models at all? like pixart models, kandisnky, etc or they dont bother because most people dont use them anyway?
bot is still down?
the 2nd option
even if pixart has some potential, people still think it's not worth it
let's not even talk than 50% of the users or more are still using 1.5 models.
The others waiting for SD3 😄
so is forge abandoned or what
its so much faster than A1111 but there's some bugs in it that make it hard to keep using
how do I use poses
How to open access to the chat to create a picture?
:)) you mean the others looking for methods to make sd3 to run on their PCs
Still no way!
yea i kinda figured, no wonder you dont see anything
which model are you using
no it was just a guess luke
I mean if it's lcm or not
We’ve heard that the weights for local usage will be significantly improved from the current version available on the API, and will drop at the end of May!
so if it's really the end of may then I do expect a big bump in anatomicaly quality or something
but this was not at all what I thought would happen
I thought it'd be beginning of may
wait, where is that info coming from?
we've heard
which is an extremely trustable source
also where did the may 10 come from though
also I love how the newsletter goes:
This week, we’ll cover some extremely exciting new Civitai developments including
- the imminent release of Stable Diffusion 3!
why do they keep mentioning "soon" or their synonyms
another month of waiting isn't quite soon
it was an estimate based on previous release "info", from Emad and the new lead dude, which puts it around May 10, based on what they said and the date they said it
ohhh
Civitai depends so much on SD models, so it makes sense they include news about it
What are the chances that some of these characters were done partially with AI? https://www.youtube.com/watch?v=_oI_B0OBgVw (if the embed doesn't work, it's a Coca-Cola ad with Marvel characters animated with their drawn comic style)
Imagine a site which makes money from trainers/creators and pay them with "jokes" - 1000 buzz = 1$ :)))
i like the idea of bounties on civitai tho, you can basically pay people to make something for you
Ugh... another month. That's really stressing me out considering that Altman and co. won't stop scheming against OpenSource.
sites like civitai?
Yeah, I think there's at least a small chance that open weights won't release at all for SD3
Yeah, the whole buzz thing seems stupid, like they made their own crypto. There has to be at least one better system for motivating creators.
Maybe they could get rid of buzz, and limit download speeds to 5MB/s for people who don't pay for a civitai membership, which is some low price for a month that mostly goes to creators based on the popularity of their checkpoints and loras
Could be a bad idea though, I don't know much about business
that's quite a lot, even I don't have such a fast internet
maybe 3MB/s then, 50% higher than nexusmods
mine is 1Gbit :3
actually its 1.5Gbit, but my ethernet card is limited to 1, have to upgrade
that's plenty
but yea
yea it usually gives some very weird thing lol
well, when I was experimenting with connections and DW poses
and encoded a generated image as a latent image to ksampler
a sec
I'm kinda jealous that without a prompt I get a normal hand, lol
How to download graphics card
i tried so many sd3 with text and compared to ideogram..but ideogram performed much better in many aspects...just wish SD3 official release gets better
like the way i wanted the image to be, ideogram could capture it...sd3 could also do it, but after lots of repromting
i also did magic prompt off and tried as well btw
When will people get this thru their thick skull
I dont care how good your favorite diffuser is
If its not local its TRASH
Us gamers dont spend thousands on a good PC just to pay to use someone else's GPU
SD3 isn't fine-tuned, it's a jack of all trades right now which makes it not great
Once people fine-tune it it'll be probably better than anything we have now
Is sd 3 still only invite only?
Can someone help me with inpainting?
no, it's api access now
but should be free in like mid-May to end May
i wonder how the finetuners will call their models, cause the ones based on 1.5 were just a name, like juggernaut, epic realism, etc, then when sdxl came out, those same models just added XL to the end, so now, when sd3 comes out, how will they name the models? :3
Maybe just something like juggernaut3, dreamshaper3, etc. I think when you talking about SDXL, SD3, ..., a n"SD" model naturally refers to an SD1.5 model.
yea prob just a 3 at the end
Can anyone give me a brief definition of how a chat bot gets its information
Graphics cards are physical. You have to go to a store and buy it.
you can use it online
Best way to do text to speech out there?
The best one I've found so far is https://github.com/JarodMica/ai-voice-cloning
It's not perfect, but it's fairly fast and sounds ok. You need to find the voice models.
Voice-models.com is a good place to start
SD4 when and where
btw to anyone worrying if sd3 releases or not, it will, if not officially, some torrent will eventually appear
😭
No crying!
SD 3 will never come out 😭
like Naruto said in the dub: Believe it!
Something for Ripley's Believe it or not Museum ^^
anyone here use SD to make money?
i think there is multi lora stacker node
No worries
Got a good one from rgthree
Clip skip
Where is clip skip?
in comfy its called set last clip layer
dont be confused by the negative number, -1 is 1, -2 is 2
haha i forgot rgthree had a multi lora, i was thinking of something else
you can also stack controlnets :3
what should i do
That's a good idea
that's a bad idea in my opinion
you know they used to have Clubs right?
where checkpoint owner can make a club of their own releasing early version of their model and development stuff
but you know what?
that's immediately get backlash by peoples who think this may caused paywalls on final model and making Civitai feel more like a model marketplace
Buzz is honestly a compromise I will say
Well it is a paywall with a multi tier system, which is thoroughly displaced on an open source platform. Users aren't helping creators with Buzz Donations, Creators can't cash out, it is nothing but a gesture. Gestures don't buy food. The only people making money out of this are CIVITAI while tricking users into believing they'd do something nice for their favorite creators. Suggesting Buzz Donations instead of actual money donations takes the cake. While it is not mandatory and creators and users can chose their own direction, I will fully boycott this because it's not only ripping off the people that like my work, it is also a stab in the back to creators. It would make a lot more sense to make Buzz a real currency with value and cash-out and allow people to directly buy particular creations from their favorite creators with buzz, creators can cash out and everyone involved is treated fairly, in a system where Buzz only benefits the owners of the platform, I call this a rip-off.
By Tower13Studios in Civitai
Tried and wasn't successful. The money makers are the ones from the NSFW scene.
Because no one is paying nowadays for AI generated art while it's so easy to generate it themselves
On sites like seaart
another point is that downloading in 5MB/s is painfully slow especially SDXL models
Welcome to Australia
Makes sense
even with my 8MB-10MB/s connection it would take like 4-6 minutes
and it will push the creators to move back into huggingface
Believe me, I tried many SFW methods (etsy, fiverr etc.), but nothing worked
you can use it to increase your popularity though
and don't even believe those youtubers with titles like "This AI influencer makes 10.000$ a month"
all lies
I see some in Instagram that made super creative photos / SVD videos
Just usual click bait
If that influencer makes 10K per month, then it's 100% from fanvue or other NSFW platforms
:))
Man, the competition is too big and the most hired artists are the NSFW ones
And you don't have chances against a real (manual) artist which uses even AI while you're only trying to fix the image with inpainting/controlnets
it's easier for them (at least for now)
i am kinda a real artist
i dont use AI to be my assistance or reference though
( which can be pretty helpful for some people )
yep, but it's easier for you to draw exactly what a customer wishes
while with SD it's really hard to get that
I mean... furry commission is popular you know

forgor gif not embed
Gif moment
heard of people that make up to 100K$ per month from furry/NSFW/3D renders art
imagine that
tbh
Vtuber commission can take even more money
Some people would quit their job to be a furry
you know like one high-quality avatar can cost you 300$ for half-body
not saying it is expensive though, because it is a lot of work
especially emotes, effect stuff.
Omg lol, this guy has the worst luck
and if anyone wonders how aitana lopez (that ai influencer) is making money, they're from fanvue (which is an alternative to OF for ai models).
so no whitehat/greyhat ways of making money with AI art
you kinda have to go fully blackhat/immoral
er,,,
the ones making banks right now are the ones who make AI girlfriends apps
Good morning, everyone! How are you all today?
Using the self inflicted loneliness of the people against them 
/video
Exactly
the same as OF
can anyone recommend a getting started guide?
hi guys, anyone knows how to train stable diffusion without using automatik1111?
depending on what you would like to train. You can simple use dreambooth it runs e.g. as a script from the shell / console
so im doing a project, which generates image->validates and -> retrain with stable diffusion
i want it to be automatic and it can only be done with coding it manually or adding codes to do that
does dreambooth allows that?
yes you could create a small python or shell script to automate the process of model creation.
Use kohya_ss, it's solid
Or OneTrainer. Wildly different GUI, very good in it's own way.
Is there anyone using Sdxl model for Api interaction, i want help!!
Not sure what this means, the SDXL model makes images, it's not some sort of LLM-ish agent that uses things for you
it is the third post, 2 earlier this day in the right channel "tech-support". He seems trying to use the openAI API to create images but the code seems not working.
No bro it'sd key
And sd api endpoint
Either way, #🤝|tech-support is the right channel for it probably
Will the bot ever be back in here 😭
@shrewd jasper yo buddy, how's your experience in stable diffusion with Intel ultra processor. Also which model do u have i.e. ultra 5,7 or 9?
Is there an optimal number of steps for fine tuning a model? Like for loras the rule of thumb is usually around 1500 total steps, more than that it's often overtrained and less it's often undertrained. Is there such a thing for onetrainer fine tunes as well?
I hope that crypto thing Amad won;t blow up
like evyr other crypto venture so far
He sjoudl just stick around and make models for us instead, more money even in open source than crypto
crypto just means jail time lol or bankrupcy
he be better of puttign google ads in the next model :)))
Why are there no rooms to create images?
only the patsies ever go to jail. The organizers of the ponzi schemes/tokenonomies pin it all on one guy and they take the fall or fake their death
no free bot preview anymore. they're charging for access to sd3 this time. Kiboshed the invite signup list. Only a few industry players got invited for free access and the main testing preview is through their API
running a generation bot actually costs a significant penny i imagine
My favorite crypto scam exposure so far. Either Quadriga, the coin exchange based in Vancouver Canada, or Bitconnect out of Mexico. I think BC has better memes. Hardly anyone ever heard of Quadriga, but the president pulled off such a crazy heist and it's pretty clear he faked his death
oh rayos. thanks 😦
that lady is the biggest crypto scammer, forgot her name
speaking of my kind of humor and what amuses me, this article is really great exploration of the pony xl training process. In my head, it vibes like if hustler published an article detailing the technical difficulties of creating what they do. Not a bad article at all for a pony model. Even more amusing to me that the most capable model in the scene right now is a MLP rule34 model. https://civitai.com/articles/5069
if she's the face of the scam, she's just the patsy. 100 other people in her circle benefited as much, if not more, from the scam
i don't think it's very capable at all
it is certainly capable in people's imaginations
and my pipeline, which involves saving images at 95% quality twice, likely exacerbates the problem.
i don't know what this is about
The community has clearly demonstrated the need for improved style control (see the exceptionally popular collection of style LoRAs by prgfrg23). In response, for V7, I am developing a concept called style grouping, or "super artists", as part of the base model. The aim is to use human feedback on style differences to automatically cluster images by style. I plan to expand on this in a separate article, but the general approach involves using artists as a ground truth for initial training, followed by refining the process through human queries asking whether two images share a similar style. The outcome will introduce special tags like "anime_1", "smooth_shading_48", and "sketch_42", which can be used during training and in model prompts to enhance style fidelity.
i am also not sure this achieves what the author thinks it does
it actually has really great prompt comprehension and posing in my experience. i've only explored the safe realms in my generations, but then i look at the civit example gallery and people getting some far out interactions between 2 bodies if u catch my drift.
his goal is to give a language of style that isn't "artist names"
:))
like many of these community workflows and models, i only care because
(1) it's not done scientifically
(2) so at worst, it's misdirecting a lot of resources
hard idea to communicate. it involves his very opinionated ethics about training artist names into models
this is also all like, trust-me-broism
those ethics don't make any sense
i mean, they make sense to many ill informed people, in their imaginations
it has caused this big distraction in the community
if people cared about prompt comprehension, they would fine tune stable cascade
like NeriJS did, a guy who is actually very scientific and did achieve some unique results
pony team might very well do that. Also, there's a lack of tooling and youtuber video tutorials and succesful examples on the main stage for cascade. i don't think that ultimatum works well tbh. Frankly, all ultimatums are bad (heh)
wow
man, the "rule" is simple - 90% of civitai users are there for NSFW "fantasies" and pony is the best at this, that's why no one will ever start to fine-tune cascade
i think people like the idea of there being some kind of naughty community model that is "better" than something done by professionals, because of the vibes
can you show the stable cascade finetune
:))))
it doesn't mean that is actually achieved
It's not that cascade isn't good, but they hung it out to dry announcing sd3 4 days later
before compatibility with a1111/comfy was achieved
that's kind of an imaginary thing though, isn't it?
you're basically agreeing with me
cascade being 3 different models and not knowing which of those to train, that's probably a big problem with it. Same reason why there's not more lightning models. it's complicated relatively to 1.5 training
Exactly. The people thought that Cascade is somehow the SD3
that the average user isn't really thinking critically about any of this stuff
why'd they make cascade 3 different files? it's bad product design for sure
segacd + 32x over here
yeah.. I am agreeing with ya - it wasn't that people thought they were one and the same, but just that why bother with cascade when sd3 is literally coming soon
Average user doesn't think with the brain if you know what I mean
technically sdxl is also 3 files - model + refiner + vae
i don't know if it is the best at anything.
if it's good at something, like go and publish a paper
no one uses the refiner lol
the best from all the models out there
for NSFW stuff
that got dropped by the community so fast once they realized you didn't really need it
it's a good romantic story that the NSFW stuff matters, but tough cookie, it does not
it could all go away in an afternoon and no meaning would be lost
and like, still, how do you really know? vibes? it's all vibes
yep, and if cascade was the new thing on it's own, people would have trained stage C to the point that stage B was no longer required (and A is just the VAE)
at least the author is migrating towards better training for something that matters
hii i tried the stability api to get realistic human but the neck is basically too long 💀 anyone know how to resolve this (im using the core version) my pre prompt : Make a realistic woman who the neck is not long (very important) based on this description
however, you know, there are already models that take cinematic content and label them automatically with cogvlm
Yeah i agree. While NSFW gets a huge crowd, it doesn't matter. Same way it never mattered back in the VHS vs betamax days.
one of them is called "midjourney." it suddenly becomes a lot harder to compete
but it didn't matter
that was a huge misconception
put the resolution back to square, use "long neck, giraffe" in the negative prompt
lol
thats what i said. they jumped onto vhs exclusivity at the end of the battle
i want to be vertical xd but i will try negative prompt thanks
lol okay i wasn't sure if you were being sarcastic
generate square, crop later?
SD won't be ever capable to beat the ALL-IN-ONE model MidJourney at the creativity - not even Dall-E 3.
no you are right. It's abig romantic notion that they bring market dominance with them. That's just industry marketing really. like the coalition coming together doing the "got milk?" or "Beef, it's whats for dinner!" ad campaigns
The only thing that makes SD a "good" choice is the possibility to generate uncensored stuff
stable diffusion is a good choice because it can be fine tuned
sd is good for more reasons than that
which is a lot more impactful for art than text
Yea, but that requires money
and time
and effort
the core version is well done anyway great job ! looks awesome
and there are less and less people who are willing to do that for free
for "open-source" :)))
no. it's the open weights. the uncensored stuff is just a consequence of that. there's very little money to be made in the field for that stuff. it's just novelty
just need to wait
actually one of the best models to have fine tuned would have been deepfloyd
Most of them have paywalls on Patreon nowadays
another complicated release. bad product design. good research platform though maybe
the align your steps example for it shows how big its potential was
Yea, but overall for an average user, Dall-E 3 and MidJourney can produce more amazing results out of the box
For the regular Joe human being
i don't know why @neon oriole never wrote support for IF. normally there's a ton of enthusiasm for scientifically interesting results
there's definately some money, but very little.
is it really all about "the number of files"? that makes people sound extremely dumb
even if i agree with you, in principle, there is either 0, 1, or many
most people are already embarking on an IT-heavy journey doing this. it is already more than "1" file that gets involved
ease of onboarding more likely i'd say.
maybe you're right, but that would be a real shock
a real disappointment
Are you really not aware of the people from this planet?
if it's really that people can only contain "1" file in their heads
Like legit
they already deal with more than 1 file!
i mean dealing with stable diffusion webui is like repeatedly poking your ass with hot file coals
More than 50% of the population doesn't even think with their brain about their action
The licensing i think prevents easy tooling. Needs people to go into hugging face and agree to all the terms. At the relase, that might've been less automatable for end users than it is today
just open your eyes
it really is that
simple
Think of average intelligence. Now realize how averages are derived and come to the truth that at least half of all peopler are dumber than average
Good ol Rufus wisdom
works relatively well
thanks !
yeah deep floyd isn't on civit ai. likely because of the very restrictive license
yep, the average Joe goes on youtube and is looking for "beautiful indian woman ai lookbook", then he's "thinking" how to smash the keyboard, so he can generate some "w@ifus"
if you need a reality check, just look at civitai for example
more than 70% of the content is NSFW
deepfloydd doesn't let you take off nsfw filter without breaking the EULA
sometimes too theres a catalyst event. Like a model won't be popular for months and months. The community meta is steadfast for a period. then a post or workflow gets crazy word of mouth attention and the whole meta shifts completely.
df had a lot of weight coming in to launch. there was a ton of hype for it. Then it landed with a massively restrictive license that prevented it from making a splash
also it was all complicated to install into existing pipelines
I'm surprised there isn't more hype about pixart-sigma
even then, the popular tubers and other influencers in the space, like tiktok artists, they won't cause a shift event with all of their content. I dont' even think they can plan it. Every tutorial video is like "THIS DESTROYS EVERTYHING ELSE. BEST EVER TECHNIQUE. NEVER GO WRONG" and then it's just some menial content that barely does anything new, and they're just toying around with a new workflow thye found. See: Every Supir video that came out in that first week
Checked the sample images and they look exactly like the xl ones - with blur
pixart to me looks a little bit too "aesthetic" for my taste
maybe if someone could fine tune it
tangential: saw a discussion about cropping. I think the bucketing during training is useful, but square ratios are still king for testing prompts and guiding results. bucketed resolutions can affect the guidance a lot too. When prompt engineering, my best approach is build in square, then either crop a final result, or take that prompt into different aspect ratios after you build it.
Generating with the idea of a bleed to the design that you're going to crop out, really elevates the possibilities. A little bit of a post process like crop goes a long way. Changes how you evaluate results when you're trying to make something specific
they have a very small dataset, so it would've been chopped down by an aesthetic classifier model most likely. Which would have a lot of bias to it of course. That's what you're seeing. That very obvious bias thats trained into these aesthetic classifier models all over the place lately
Why is New York a toilet?
Because i shit in it
Whole day every day
I'm just joking, new york is my dream city
I love it deeply with my whole heart
people
Does anyone here know of a local llm for chatting?
(Preferably something open source and that has a ai)
Hey, checkout GPT4all
Its open source and works on any gpu/cpu
More advanced would be the oobaboga-webui
Haven't seen any info on whether or not it has an api, do you know?
Oh I think gpt4all doesn't have one
Oop nope I think it does
"Enable api server, api server port 4891"
I guess it does have a api I'll have to read into that in the docs
Ah nice
Trying to combine it into a chatbot / moderation bot by making different prompts
Hopefully it works out
all the fine tuning approaches are more or less the same. it's the same 100 or so lines of code
really thanks to Meta more than anyone else
For llama 3?
for pytorch
Where can I report an error in the stable diffusion API documentation?
its working ig. getting abt 1nit/s. im on Ultra 7
I created the AIGC channel on Warpcast, which regularly shares good AIGC open source projects & models! You can also share your own AIGC works and exchange techniques.
If you don't have a Warpcast account, you can sign up by clicking this link: https://warpcast.com/~/invite-page/404899?id=fd0fd839
If you already have a Warpcast account, you can join the AIGC channel directly: https://warpcast.com/~/channel/aigc
Which method is best in terms of getting the likeness of a non celebrity person for commercial use? Is it LoRA, Dreambooth or other?
There is no good, current method. A lora with SD3 might work well in a month. Dreambooth is almost completely dead.
Is there a good way to get a 3d game style output with Pony? I'm thinking GTA V/Horizon Zero Dawn/Modded Skyrim 3d style. 3d tag looks too high poly.
what happens when i unzip connor.zip? :3
I contain 200TB of rust code
😮
maybe a popular 3d game from mid 2010s that has a lot of screenshots online?
HOLA!
or mid 2000s
Oh great
thanks
Hello Everyone!
I have a Dream Studio account that I have been using for about a year, I regularly buy credits. However, I have come along an issue after last recharge, that my account has been disabled though I bought credits of $100 but now there are 0 credits and account has been disabled. I am not able to use any models, nor have they mailed me any warnings or alerts. This is the error that keeps on appearing while I try to buy more credits:
"Something went wrong! Your account has been disabled. Contact mailto:support@stability.ai to appeal if you believe this was done in error."
I have also mailed the customer support, but there has been no response from their side yet.
Does anyone know of any app that offers stable diffusion 3 in Play Store
Check your spam folder in your email, that or they don't even bother letting you know why it got disabled. You probably broke their ToS and there's likely a clause that states they can close your account without disclosing the reason.
Oh and here's the tos: https://dreamstudio.ai/terms-of-service
Hello, my ComfyUI stoped working after trying to install OOTDiffusion. It just doesn't start 😦 Any advice? Please... Thank you
What does sd do when it does not understand a word? It ignores it and continue with the rest of the prompt or it try to integrate in the generation that unknown word as a random mess?
I believe SD will either ignore the word or try to generate it as text, depending on where it fits in the prompt.
I see, thanks
you can try it on set seed, try with something it know nothing about or has no visual representation of and then without it, but I Believe every little thing affects image in one way or another
😭
lol
Hey! Im trying to train a lora in Kohya SS, installed it but struggleing with how to start training. I have my images, could anyone help me with the start? Thanks
I just dont understand the preparation, like what folders i need, the videos i watched(pretty much everythingin yt) include that there should be "class. img, model, log" etc folders, and everyone says these are important, but i dont know what to put into them, so im kinda lost rn
there is an option there which will set up the folder system for you
thats where i had my issues as well when i tried to set it up by hand
@bleak matrix, are you sleeping?
so now everyone can post ads in the general chat of this server :)))
legit
@low moon Can i dm you?
I just cant find the option you mentioned, id be happy just to start training anything at this point
it legit seems that the mods are nowhere to be found
I t w a s
Yet I am now awake!
If it's been awhile, please e-mail support on the website again!
I will now use SD3 to generate world peace.
I can approve this message!
#🧣|comfy-ui question for those who have good comfyui workflows
Looks like SD3 won't be released today 
is it supposed to be released today?
if I'm not mistaken they were talking about a few weeks and the end of april
hello, sorry to send this on multiple channels, but i didnt no one is replying,
is stable video working?
on the website or somewhere else?
api and website stablevideo
i'm running a test on the website now. hang on a second
nope. images generate, the video crashes
thank you for running the test
welcome. i'm going to guess that the backend is having issues
Anyone know an extremely fast way to generate images with python?
Like in python without an UI?
yep. im rewriting my discord bot, and need to load the model into the vram and then generate the image fast. my old method wasnt slow, but im sure it can be done faster
here's my old code:
global imagen_pipe
await unload_llm() # unloads the text model to free up vram
imagen_pipe = AutoPipelineForText2Image.from_pretrained("stabilityai/sdxl-turbo", torch_dtype=torch.float16, variant="fp16")
#imagen_pipe = AutoPipelineForText2Image.from_pretrained("runwayml/stable-diffusion-v1-5", filename="*emaonly.safetensors",torch_dtype=torch.float16, variant="fp16")
imagen_pipe.to("cuda")
imagen_pipe(prompt=prompt, num_inference_steps=1, guidance_scale=0.0).images[0].save("image.png")
print("image generated")
await message.channel.send(file=discord.File("image.png"))
del imagen_pipe
torch.cuda.empty_cache()
await load_llm()
Im not much of a coding guy, maybe try asking in #🤝|tech-support, im sure youll find some fellow code dwellers there 
ill try. its a bit dead there tho. ty
hi
Try building off of the LCM architecture?
are you running this on the cloud or ur literal pc?
my pc
Trying that rn. 7-10 seconds from start to image
Guys, what if ControlNet doesn't show up after installation? Or do I need to press something?
the diffusers implementation is inefficient. you can use comfyui as a library
please help
ik. Im still using diffusers, but with the UniPCMultistepScheduler, and pretty fast now
🥲 guys help
diffusers doesn't do good memory management either
i think you should click the link i sent you
yea. that wont work for me, because it needs the comfyui server running.
it does not
read what it says
it uses comfyui as a library
it is completely embedded. it does not start processes
oh. sorry
exactly like diffusers
before you give me
another no*
just read it from start to finish
ill try
where can i get reviews on generated images?
https://github.com/hiddenswitch/ComfyUI/blob/0862863bc00165b9ba0607595f304f93ca995887/tests/distributed/test_embedded_client.py#L32 this snippet shows running an sdxl pipeline with refiner.
okay so does this make sense?
yeah
as long as you pip install git+https://github.com/hiddenswitch/ComfyUI.git, you're done. you could run the example verbatim in a python prompt
it installs the best accelerated version of torch for you too
is this helpful?
its a lot faster than base diffusers, so thank you.
yep
this async version has queueing for you. you don't have to maintain your own queue. so if you use an async fastapi or aiohttp based discord bot (or whatever, anything that supports asyncio), you can "just" await queue_prompt in the bot event handler and you're done
it will Just Work
thank you
hmm... idk why but 1024 on SD model kinda works
Hey is there any way I can run stable diffusion online I don't have good pc
I think there are options such as Google Collab, I haven't looked into online stable diffusion yet.
What are your PC specs?
4gb ram i3 Intel graphic 🥲
It's stuck while creating
yeah, you need a dedicated graphics card with 4 GB VRAM minimum and 16 GB RAM, 32 would be better along with a better GPU
Yup but I don't have that much budget write now to buy any discord server or website that provide?.
https://www.craiyon.com/ you can use for free
Not really that good but maybe for a start ...
Cool but is there any way to host bot for free so I can create bot of stable diffusion model as host on discord
civitai, you can use daily points to generate images
Thanks buddy
Is it worth posting stable diffusion images to civtai or are there better places, is it even worth doing?
it is worth doing to help showcase a model
or showcase a lora's capabilities
thank you for reminding me I have a pending dark magician girl I wanted to post on civitai lol
More like to showcase ones love for a models and out of boredom and curiosity.
Is there a relatively easy way to generate batches of images from one prompt via the SD3 api?
What UI or tool would be best for that
Other than just rawdogging a script
not quite sure where i should but this, but is there a way to have prompt S/R relplace a sting of promps not just one?
If anyone is interested in a free Alternative to Midjourney, I've been working on a bot that can generate images at a similar level if not better. The images generate as fast as 8 seconds. LMK if you want to test it out, we are looking for feedback and suggestions from anyone who is willing to help out.
How I can refer to an unknown object in my prompt?
I have an ebay store where I am uploading images of products, masking them, and then changing the background using SD.
The product is always different, it can be a stapler, a car tire, a skincare product, etc..
I want to automate the changing of the background without having to specify every time what the object is.
Ideally, let's say the object is a stapler, I can just prompt: "object on a desk, in an office, etc.."
Essentially, I need a way to refer to the masked object as some arbitrary word so the prompt understands I am referring to the masked object.
really, is there anything better than krita + AI for fixing hands. 
that's what I use :( ...Krita slaps tho!
guys can anyone explain how this model is 3.55GB big? (i know i am the maker of the model, but i literally have no idea how it is 3.55GB...)https://civitai.com/models/428813
(it's a pony model)
