#💬|general-chat
There are many ways to install and use Stable Diffusion, which one are you trying? I'm driving, it'll take me a while to respond
I know nothing, I have no experience, I haven't downloaded anything yet, i want a way that is simple and straightforward because I want to keep the integrity of my pc intact
Do you have a good graphic card? Yes, install Stable Diffusion. No? Use Webservices ...
yes I do, its a 5700xt
It's AMD ... can work with workarounds but A.I. mostly made for nvidia ...
god dammit, will it do the job fast and well?
Never tried with AMD ... but if you wanna make first experiences you might wanna start with: https://www.craiyon.com/ it's free. Then you can decide whether it's worth the work to install locally ...
that's exactly why I'm unsure about doing it, it's an unknown risk, I'd rather stick to dezgo
It's not the perfect solution to run it local ... you will have to learn and spend time ... and you have an AMD ... when you get the idea the Webservices aren't enough any longer I would install A.I. local ... as long as it works using Webservices ... it might be a good solution.
funny you say that, I'm due an upgrade so I'm searching for a 30% uplift in performance
It's a photoshopped 3090 ti pcb. T
A local install has become pretty easy for nvidia users. I don't know how it is for AMD users. But the Webservices are doing a lot to make the handling easier for you. That way you have less possibilities ...but I can't really say what's the best solution for YOU
I think you can try SD3 for free with 10 images or something on Stability.
Check it for yourself
I think that MJ still wins
in most of the cases
In that gallery there's a statement from "panzerlied" (stupid name) ...
really low...
they should have doubled the amount
like 48 GB VRAM at least
so we can play for 2-4 years with AI
32 will be good enough for max 2 years from now :))))))
It can't be a 5090, the power connector is missing sensor pins. The voltage controller is also the outdated 3090 ti voltage controller, which has been discontinued
They want us to buy a new one in less than 2 years 😄
too many variables ?
exactly :)))
The pictures are not from a 5090 ... they are just showing how the layout could be ... the statement was the important point for me...
Imagine that they add only 8 GB to 4090
and they call it 5090
which will be like 2000-3000 euros
We're getting 36GB at most, but most likely it'll be 24 GB
It's OK for me ... it's OK if you have enough time ... you can do it! Sure! But do you need it? I have no idea!
its not that I need it, I just like experimenting from time to time and I want the images to look good and to load with a decent speed
Because Nvidia is making pennies from their consumer gpus and putting more VRAM on them will just take sales away from their corporate hardware
Unless AMD steps up and makes a decent graphic card it'll probably just be 24GB on the 5090
I'm getting the idea you like to experiment ... so install SD local. It never harmed my system ...
:)))
that last part is what scares me
Never really heard about serious problems ...
Worst thing that happened here was that I broke my A.I. 🙂
I think I killed my ex AMD GPU with a combination of SD + Call of Duty
So I won't ever try running SD on an AMD
cloud GPUs are the best solution for AMD users
thanks
Who would try to use two GPU-intensive programs at the same time?
I kept only sd opened and it crashed my entire PC
blackscreen gg
I tried with my GPU in the past and it worked
but not too excessive
like 2 games at the same time
like League of Legends very high + lobby in Call of Duty Warzone
You are funny ... and I mean that in a positive way!
🥳
I'm a gamer
what do you expect? :))
or was*
To be concentrated on one game 😄
I was.
Yes ... but by mistake I also started Topaz Video A.I. while running ComfyUI 😄
I was playing League of Legends while I was in the lobby for a Call of Duty game (not in the map)
And what happened?
Both programs started to tell me I'll have to wait hours 🙂
so you said you have Topaz
how much do you wait to upscale a video of like 1 min to 4K 60FPS?
because on AMD is a pain
Too many unknowns in that question. Point is ... mostly I use it to improve SVD videos ... that's no problem.
And I only use FHD and 25 FPS ...
I see
Because I thought about upscaling old cartoons, but they would take me weeks, so I said no
It's pretty OK for 8-second videos and maybe for stuff you love or get money for ... otherwise too much time needed just for fun ...
🇸 🇩 3️⃣ 
🙏 
maybe if you have A100
are we really gonna go deep into June and still no release? 😦
really now
did anyone test topaz video on commercial GPUs like H100/A100?
ever?
cuz I never heard about this
maybe it doesn't even work on server GPUs
deep into life
There's an A.I. Artist in San Francisco who has his own server ... damn ... I'd like to use it for some days with his help 😄
and what are you waiting for?
:)))
His OK ... I can buy the tickets ... 😄
Right, so I looked through the 5090 leaks and all we know so far for sure is that the 5090 will be made from three different pcbs
It has a 512 bit memory bus which means it can have up to 36GB vram but no word on that
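A quick way to sanity-check rumored VRAM figures against a bus width. This is only a sketch, assuming each GDDR memory chip uses a 32-bit interface and comes in 2 GB or 3 GB densities; the function name and parameters are made up for illustration. Under those assumptions a 512-bit bus works out to 32 GB or 48 GB, while 36 GB would match a 384-bit bus with 3 GB chips.

```python
def vram_capacity_gb(bus_width_bits, chip_density_gb, bits_per_chip=32, clamshell=False):
    """Estimate total VRAM from the memory bus width.

    Assumption: every memory chip occupies `bits_per_chip` bits of the bus.
    In clamshell mode two chips share one channel, doubling the chip count.
    """
    if bus_width_bits % bits_per_chip != 0:
        raise ValueError("bus width must be a multiple of the per-chip width")
    chips = bus_width_bits // bits_per_chip
    if clamshell:
        chips *= 2
    return chips * chip_density_gb


# Compare the bus widths and chip densities discussed in the rumors.
for bus in (384, 512):
    for density in (2, 3):
        print(f"{bus}-bit bus, {density} GB chips: {vram_capacity_gb(bus, density)} GB")
```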
Rumors ... we will see ... might have been a bad idea to drop them here 🙂
There's a million different 5090 rumors. All we know is that Jensen Huang will only put in as little VRAM as he can
If AMD doesn't manufacture a competitor we're getting 24GB vram... again
I'll be a special guest for ASUS at the GamesCon ... maybe I can get some more information ...
Would be nice to get some insider knowledge
I'm personally hoping for 36GB vram so I can run two cards at once, but I'm not expecting anything so kind-hearted from Nvidia
They want people to buy data center cards which are like 20x the price
unfortunately
but hey, zuckerberg bought like 500K H100s
:))))))
Maybe we should start flirting with him 🙂
nope, no homo
Llama 3 is a thing, too
yea, but sometimes is bad
idk, I really think that they just have different versions available and they use/present the best one in the first days of the launch and after that they use inferior versions to cut the prices for energy/GPUs
kinda the same with Claude 3 - free version
Could be a right idea ....
because then how else can the quality decrease so much with the exact same model?
there's something shady
with all these LLMs
hosted online
I don't know much about llms, but what if they just do a lower step count to save on compute
I know llms don't use steps. Could be quantization
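For anyone wondering what quantization means here: the weights are stored with fewer bits, which saves memory and compute but adds a small rounding error. The sketch below is a toy symmetric int8 scheme in plain Python, with made-up weight values. It only illustrates the idea and says nothing about what any hosted service actually does.

```python
def quantize_int8(values):
    """Map floats to integers in [-127, 127] using one shared scale."""
    scale = max(abs(v) for v in values) / 127 or 1.0  # avoid a zero scale
    quantized = [max(-127, min(127, round(v / scale))) for v in values]
    return quantized, scale


def dequantize(quantized, scale):
    """Turn the stored integers back into approximate floats."""
    return [q * scale for q in quantized]


weights = [0.5, -1.27, 0.003, 1.0]  # hypothetical example weights
q, scale = quantize_int8(weights)
restored = dequantize(q, scale)

for original, approx in zip(weights, restored):
    print(f"{original:+.4f} -> {approx:+.4f} (error {abs(original - approx):.4f})")
```

Small values such as 0.003 collapse to zero, which is the kind of precision loss that can degrade output quality.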
🤷♀️
maybe
but they do something, that's for real
And I hope there are still a few German boys training it ... so we don't get americanized 100% ... 🙂
You'll be eating mystery meat hotdogs before long
USAUSA
I'm Homer Simpsons ... disappearing in a bush ...

MP3 was a German thing ... we gave it away for free ... we are not good in selling 😄
At least you're selling out them cars
I don't think being better than Tesla is a high bar
lol
The idea everybody has his own car might work in the USA ... but not in big cities in the EU ...
And it even might not work in New York ...
Trump closing many streets ... 😄
What's funny is that I live in the middle of a big city. My family has 4 people and three have cars. I'm the only one who prefers to ride the subway lol
Saving money on car insurance is goated
I'm not saying that I don't want people to have their own car ... I only think it's not a concept for the future ... so if Germany invests too much money in the car industry ... maybe not a good idea ...
I think we'll still need cars for the next decade or two. With AI, who knows what's after that?
Personally I don't think any country is prepared for the future
It's very likely we'll all be out of work and on our asses
I think you are right ... because politics is just reacting these days ...
Yup. World's crazy rn
Well ... I'm a songwriter now and released my first song ... wtf ... I was bad at music and art at school 🙂
It's damn crazy ... but the positive point is: we are part of it and not just watching.
It's been really exciting with all these AI models coming out
On the one hand the future looks bright, on the other a lot will have to change to get there
We are the people saying: Let's learn it and let's get an idea about it
hey guys i was gonna get a 3060 but then i found out about nvidia Tesla 24gb, is that a better deal?
under $400
no
tesla doesn't run games or other apps, it's a data center card. It's also super slow
Be careful not to buy a graphics card for servers only ...
ah
I don't play games on pc, is that the only issue? or are server GPUs just not compatible with regular desktops?
I did a little research, tesla is used for AI only
which is what i want
That's up to you. Though to be honest, I wouldn't recommend a data center card to the average person
nvm they're all refurbs on amazon. all the new ones are over $5000
I don't really know enough about it ...
alright, 3060 16gb is the only option right now under $400 just making sure
or a used 3090
Definitely get the 3090
more vram got it
Both sound pretty fine today ... but nobody really knows about SD3
3090 should run sd3. As far as we know, the 8B param model will run on 16GB Vram
But the exact requirements are still a mystery
i'm so out of the loop lol
We all are 🙂
It's a bit of a rabbit hole. But I do think the 3090 is a great choice for AI shenanigans
this seems like another rabbit hole next to the one for mechanical keyboards. why do people keep digging these things.
jk it's all fun
The Matrix does have some surprises ...
My brother's into mechanical keyboards. I personally don't get it
On the same token he doesn't get why AI is so exciting for me
I don't understand why people built 60% keyboards they're just not comfortable to type on. But I do understand why we build split keyboards, I did.
Girls, guys, whoever (not in a negative way) it's been a nice time with you. I identify as a car ... not older than 25 ... 😄 Have a good day, night or whatever!
When will the bot be available
supra or mustang?
Do we have to pay for it?
Ahmmm ... no idea ... but my speedo stops at 25 years ...
Nini
There's a stable diffusion api and you do have to pay for it
Not very expensive but it is what it is
what the hell was all that LOL
yeah. you need to have stability membership to use the sd3 preview. the signup list was a total farce and they decided to charge for access for sd3 instead of offering free preview like sdxl
Does Friday apply to the SD servers?
well ,there were people that got it, but certainly it was a small subset. I like the way it turned out. Everyone was so fuming about it, it was amusing
it's literally no different than demanding google release a new update to gmail or something
how does realvis outrank juggernaut
that's the problem with crowd polling
realvis is one of the least flexible models I've used, so I guess if you only want a portrait, it's amazing
I know there are better models
but its a secret which ones...
Hidden gems stay hidden.
hey anyone can help me out in tech support
this isnt tech support
how do i run stable diffusion on my pc
thats why i asked to help me there
you install a bunch of software packages, and run a python script basically, the end
its a little. if sergey was like "Come sign up for Gmail 3!" and then never spooled out invites for it, i think people might've been frustrated. I remember when you needed to wait for an invite to gmail to use it and people were frustrated then. Understandably so.
thts not working for me
would've been even more frustrating if those invites weren't getting sent out
I guess that's possible, but they did wait and they did use it (almost certainly)
A1111 not working for me
I imagine the invite thing here, either they underestimated the demand, or simply changed strategy mid-flight to do this api thing
which is the right business decision honestly
If you just want to be a casual user you can also try Fooocus or Defooocus
the way they rolled out gmail was meant to make buzz. Google didn't give people invites. They allowed gmail users to invite new gmail users. It was a hype strategy. Which is what Stability's sign up sheet was. But since they never actually sent out invites to those who signed up, and only sent to them to people who they vetted, it was a lot more frustrating and just felt like fake hype
if you have big plans go for ComfyUI
i already use fooocus i need something bigger
what's the difference
and users have no recourse because it's not like there is a contract or anything other than a loose verbal commitment
Forge is faster
and more streamlined, you get many features by default with forge
like controlnet
Well if you want something bigger it sounds like you already have some experience in which case I really recommend Comfyui...
the way I look at it, I'd rather they do something that keeps them around than keep the software delivery cycle as it was and potentially close the doors at some point due to bankruptcy
Do u mean comfy ui
Yes comfyui.
so if that's the case, all you're left with is a bunch of whiny users that just want the newest shiny iphone...pffft. they've released so much already
mark my words if they released weights tomorrow it'd only be a couple weeks before people would start asking about sd4
Huh... where did that suddenly come from?
Wdym? In what sense?
Mobius - is that "just" another SDXL finetune? If it's so good... how... where does it come from?
I made it. It's neither a base model nor a fine tune. 🙂
frankenstein
We will be releasing a paper on it soon
Nope
Human nature.
Always looking for the next paper.
pony sux
no offense but it does
It's not even perfect at anatomy
still has crap hands
Nope. It's something completely novel
The title of the paper will be "Constructive Deconstruction: Domain-Agnostic Debiasing of Diffusion Models"
I'm testing it on imgsys atm - it's better about half the time for me... and it is in almost every A v B, so no wonder it gets so many points.
Pony models have horrendous hands
nice, cheers
How could that be?
community contribution is something I can get behind
I thought they were meant to make anatomy great again.
Here is the main take away from our method and upcoming paper
I will go all in.
***The constructive deconstruction method presents a novel and effective approach to debiasing diffusion models without compromising their performance and versatility. By inducing a controlled noisy state and retraining the model using advanced techniques, we achieve a new base model, Mobius, that performs well across various domains and styles. This method's domain-agnostic nature and adaptability make it a promising solution for addressing biases in generative models. Importantly, our method provides a means to create new base models without having to design them from scratch, leveraging existing models and enhancing them through a systematic and controlled process.***
I am sorry.
I am a sad end user who barely makes this work.
Let alone understand the magic underneath the hood.
maybe, but at some level, even if like me you dont use any of them, it's impressive how many spinoff models there are, it's like a whole sub-culture
Mobius
Wow (heh)
No they are impressive yes but come on... hands!
Please,
AI
So we could Mobius-retrain existing finetunes?
get it!? mobius!? https://youtu.be/-CyupSdXfI0
Btw - on Imgsys I can spot Mobius half the time because it's basically "crisper" or rather has more defined edges in artsy images.
mobius huh. sounds one sided to me.
Truth hides behind a paradox
It enables making entirely new base diffusion models without needing to extensively pretrain a new model from scratch. We can, in a controlled way, break all the quality and composition associations without damaging the baseline understanding from the preexisting training, so we can completely train those associations without having to start from scratch
Basically we just lowered the cost to entry for making new base diffusion models 50 fold
I have no idea what this mobius is all about, but if it's about creating something like juggernaut or better, all for it
barriers to entry. amirite?
Mobius is about a movie.
No.
thats morb
we don't talk about morb
Any Hungarians here?
Hungarian SD?
Damn it, I'm all alone.
😢
Actually its a spanish movie
is it nsfw?
Well I hope that's true and can somehow also trickle-down to local fine-tuning.
i think you underestimate the community
what is the diffusion image model training speed difference between a 3090 and a 4090, and how much would a 5090 (with a 512-bit bus and 32GB VRAM, 50% faster than a 4090) speed things up?
relies on number of cuda cores more than anything
4090 has 60% more roughly. 5090 will have 50% ish
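The "60% more roughly" figure can be checked against the published CUDA core counts (10,496 for the RTX 3090 and 16,384 for the RTX 4090). The projection for a 5090 below just applies the "50% ish" guess from the chat and is hypothetical. Core count is also only a rough proxy: real training speed depends on clocks, memory bandwidth, precision and software too.

```python
# Published CUDA core counts for the two released cards.
CUDA_CORES = {"RTX 3090": 10496, "RTX 4090": 16384}


def percent_more(new, old):
    """How many percent larger `new` is than `old`."""
    return (new / old - 1) * 100


gain = percent_more(CUDA_CORES["RTX 4090"], CUDA_CORES["RTX 3090"])
print(f"4090 vs 3090: {gain:.1f}% more CUDA cores")

# Hypothetical: applies the "50% ish" rumor from the chat, not a confirmed spec.
rumored_5090_cores = int(CUDA_CORES["RTX 4090"] * 1.5)
print(f"Rumored 5090 at +50%: about {rumored_5090_cores} cores")
```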
CUDA will confuse you with red nonsensical messages and will lead you down confusing, frightening and frustrating path. CUDA does not love you.
is the bot service no longer free? 😦
AMD user found
CUDA hates you.
Why don't my embedded files show up in Forge stable webui
This place has changed so much since last I popped in aaaaaaa
Thats just humans in general lol
mobius? never heard about it
Is anyone available to voice chat? I am really struggling with image generation.
seems promising, but imgsys just ends up being more of an aesthetic tester than anything. after rating dozens of images, most of the prompts are garbage sd1.5 style tag prompts and 99% of people aren't going to actually read all of the prompt to make sure it's being followed. also, their prompt window doesn't show the entirety of the prompt since it's only one line, so you have to scroll through to see it all. you'll also see a ton of random artists thrown into the prompts and i'd bet money most people don't bother looking them up to see if the generated images match the styles. basically, it's a pretty flawed testing setup with a lot of inherent aesthetic bias. While that is definitely important as well, it just ends up being like 80% of the reason why most users will push left or right.
of the ones that have popped up that your model has been in, it's usually been really good though, but it is still usually a coin toss between the other four models in the top 5. you'd have to then spend a bunch of time actually paying attention to the prompts to truly decide which was better. but aesthetically, it's definitely up there and if the new method of training is as cheap as you're claiming, that's a big win either way. look forward to checking the paper out
Like how the hell am I supposed to rate A/B on a prompt that just showed up: "cat seahorse fursona wearing headphones, autistic bisexual graphic designer and musician, attractive androgynous humanoid, coherent detailed character design, weirdcore voidpunk digital art by artgerm, akihiko yoshida, louis wain, wlop, noah bradley, furaffinity, cgsociety, trending on deviantart"
I have no clue wtf I'd be looking for other than "weft pikchur wook pwettyr den wite pikchur"
Hello,
hi everyone
Hello, @crude cosmos
hello
hi everyone
I just have an old vintage 4080

hiya
Hello
that'll apply if SD3 didn't fix most of the bad things from SDXL. And I really doubt that they fixed anything at all (maybe the blur/depth of field problem).
hope it'll be better than XL.
i can't share images so i can't really share it
sadly
there's not a single model which figured out to make almost perfect hand, nor MJ or Dall-E 3 (but at least dall-e has decent ones).
not all of them
🥺
can anyone explain why sigma pixart is so slow compared to cascade but their results are very similar
@narrow kernel Apparently, of all things it was..time? The entire day gens worked fine. About the same time as yesterday I started getting blurs. Also around the same time that chatgpt becomes a blubbering fool. So I suppose it has to do with the traffic.
Might be, that or they're still doing heavy tuning on things to be baked into the model. I imagine they are just using some vision llm to check for NSFW content before giving you the result and if it flags it, it just blurs it. Then on their end, they probably have people reviewing the images that triggered it and then make adjustments to things from there
cat
dog
Just now I subscribed on stability.ai website for image generation. But unable to generate images here, what's the drill guys?
Panic and ask them to release the weights /s
(I've only used the API, so I can't actually help on this front)
There's no reason for nvidia to put 32gb of vram on 5090
RTX series aims for gamers mostly
And they have $40k cards that have huge vrams for datacenters
Putting 32gb of vram would hurt h100-a100 sales
Datacenters would still get h100's cus there are some other factors to consider
But some people and most of the small companies would buy 5090 to train or run their models instead of renting/buying h100's
And current games' memory consumption is nowhere near 24gb
AMD might (and I hope) put that huge amount of memory into their new cards
there are new b100/b200 coming with 192gb/288gb VRAM
and HBM2/3 is much better for AI than GDDR
oh boy
amd's mi300x has 192gb vram tho
i know some people buying cheap ass p100 with only 16gb VRAM and telling that they outperform any RTX cards cuz of 4096bit memory. Idk if its true, maybe we have some proud owner of this card on this server who can confirm
does anybody know if i can stop A1111 from creating folders for dates in the "output folder"? i want it to just get put in the output folder, not for it to make another folder in output and then put that there
there is a checkbox for subdirectory in settings
thanks i'll try that
i got that vintage nvidia equipment too!
What do Hyper, Lightning and Turbo mean in a model name
faster generation for slight reduction in quality
so it's up to you to decide what you value more, speed or quality
right, but having said that, you can always upscale later, so I can get why people like them
So turbo is fast what about the other two
because when do you ever get the perfect generation the first time...never
Also anyone know how to stop fooocus from auto downloading checkpoints
or from accumulating extra o's
Also will there be any problem if I rename the checkpoints and lora
probably not, definitely not with auto1111, so probably not with focus + extra o's
Didn't receive a clear answer on the model names Lightning, Turbo and Hyper
i never used foocus but it would be real weird if you cant rename models, you can rename models in all other ones, the name doesnt matter, its all about the "loading part"
is 6GB VRAM enough to train loras and models?
and what are good resources for training
doubtful, might be able to do 1.5 models
sorry to hear that
even with 1.5 you'll probably struggle with OOMs
but you can probably get training to work
i hope so
worst case just use cloud, that's what I do, and I have a 16G card
mainly because I dont want to tie up my local compute for training
wish cloud was an option. cant pay for anything ATM. otherwise i would be using cloud.
ok, well just use every memory saving trick in the book
im still learning on the resource and will give it a try on my 6gb
like use adamw8bit, gradient checkpointing, keep training resolution at 512, xformers, batch size 1
those are the main things for your case
assuming you have nvidia
otherwise use sdpa
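The checklist above, collected in one place. This is a minimal sketch: the key names are hypothetical and do not correspond to any particular trainer's real option names, so they would need mapping onto whatever training tool is actually used.

```python
def low_vram_training_settings(has_nvidia_gpu: bool) -> dict:
    """Gather the memory-saving choices listed in the chat.

    Key names are illustrative only (not a real trainer's config schema).
    """
    return {
        "optimizer": "adamw8bit",          # 8-bit optimizer state uses less VRAM
        "gradient_checkpointing": True,    # trades extra compute for memory
        "resolution": 512,                 # keep training images small
        "batch_size": 1,                   # one image per step
        # xformers attention on NVIDIA, otherwise PyTorch's built-in SDPA
        "attention": "xformers" if has_nvidia_gpu else "sdpa",
    }


for name, value in low_vram_training_settings(has_nvidia_gpu=True).items():
    print(f"{name}: {value}")
```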
yeah i have a rtx 4050
I meant in focus they make them auto download when choosing pre styles
if i make a lora for hands and feet, and label images with 5 fingers as 5 fingers, and 5 toes as 5 toes, and anything less or more as less or more for fingers and toes, would that address the more than or less than 5 fingers or toes, or is that not how it works at all?
that said i haven't been noticing the more than or less than 5 fingers or toes too much lately with stable diffusion models.
That's what I meant if it's ok to change name
Does anyone have any idea
Basically yes, but in practice no. You will still get random numbers because SD isn't good enough to get the right number of fingers. Also it doesn't know what 11 or 3 fingers look like. You would have to train them too. Then you could prompt for 11 fingers and would get hands that tend to have more fingers, and for 4 you would get hands that tend to have fewer fingers.
what is the best upscaler for 3d renders? Specially vintage nintendo 64 types where you need to keep the vivid colors and the plastic look
R-ESRGAN 4x+ Anime6B almost gets it right but it's too anime and it kinda removes the 3d aspect
upscaling Mario 64? :3
maybeee, not really, but it's based on the old mario 64 renders
I would also like to upscale mario 64 renders if I find a good method
well anything 5 fingers or 5 toes would be labeled as 5, and anything more or less would be labeled more or less without any specific number. and yes the idea is to provide the proper images for having 5 and also for having more or less.
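The labeling rule described here could be sketched like this. The exact tag wording is an assumption (the chat only says "more or less" with no specific number), and whether such captions actually fix digit counts is the open question discussed above.

```python
def digit_tag(count: int, part: str = "fingers") -> str:
    """Return a caption tag for a hand or foot with `count` digits.

    Exactly 5 gets a numeric tag; anything else gets a tag without
    a specific number. The wording of the tags is an assumption.
    """
    if count < 0:
        raise ValueError("count cannot be negative")
    if count == 5:
        return f"5 {part}"
    direction = "more" if count > 5 else "less"
    return f"{direction} than 5 {part}"


for n in (4, 5, 6):
    print(n, "->", digit_tag(n), "|", digit_tag(n, part="toes"))
```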
if i had more than 6gb vram i would just be doing endless tests but with just 6gb, i have to be extra cautious. i probably can't even train anything.
sd1.5 should work. Or reducing maximum resolution when using SDXL
sounds like there is hope.
but I think you can run free versions on google colab. So your hardware doesn't matter then. Googled A bit and 6GB seems pretty low for training ><
I've tried Colab for SD generations, it has a gpu limit issue. but I'll give it another look for training smaller loras.
^ likely just a business guy with an idea who will want people to work for free until it starts making a level of money he's comfortable enough with sharing. Same old story. If they wanted to pay people they'd be recruiting properly.
Just a regular old bot scam message
What would be the scam? They send you money to get started but need you to send some back? Dudes looking for free work isn't uncommon too. I guess it's also a scam but feels less bottish
any ideas friends
I guess this is not for me, I am not able to other things.
Hello everyone, if i purchase credits for stability.ai for api usage does it have any expiration time?
I don't think so, Fatih. I had a few creds sitting in my account for quite a long time before SD3 released as an API
Thanks @mint fiber 🙂
sd3 wen
When did that happen
Probs when GTA7 comes out
He doesn't even know how to write his ad
And not all of the free work is a scam, some are called "joint ventures". But you really need trustworthy people for these things and maybe even some contracts signed.
free work is pretty scammy
the 3090 from 2020 also had 24gb and it was too much for the time. it's insane that they have not upgraded vram since 2020
and they can also just increase the vram of the datacenter gpus
Probably when its too late and gpt4o drops their image modality. Then they'll panick drop but itll be too late cuz everyone is using the better product. The ecosystem will never develop as well as SD1.5 cause of their fault
So since refined models just get better with time is it safe to assume that just more refining and cheaper better GPUS eventually will lead to perfect images? Nothing really seems to beat a good model. no amount of sigma and llama and T5...
Yes, Cascade is the victim here...
when sd3??
by asking that, you now added another month 🙂
noooooooooo
this is firsttime asking this month lolol
youre saying sd is getting less gpu intensive instead of more?
the truth is we have no idea, as far as we know, it could drop tomorrow, it can drop in a week, or in a month idk
gpt 4o is censored, it's useless
ok
Not everyone is making ch!ld p0ŕn
Is that the first thing that comes to your mind?
Lil bro is projecting
What none of the Stability AI staff are going to tell you is that SD3's training is going to take until August potentially. They laid off their best employees who could figure out how to do it faster, and can't afford a larger GPU cluster either
I produced like 20 images and I've already reached my credit limit
The other elephant in the room is that the for profit company masquerading as a charity known as Thorn seems to exert some level of influence over Stability AI. Thorn recently failed in their supervillain style plot to take over the world under the guise of "stopping csam", and their efforts were only stopped by the European Court of Human Rights (Chat Control). Thorn had the head of the EU, many of its ministers, the head of Interpol and others doing whatever Thorn wanted (including breaking the law, which they did to try and help Thorn), and I seriously doubt their corrupt cronies are suddenly free from their leashes
Any ideas why I can't produce more than 20 images on the free trial?
actually it seems like Thorn's plot to ban encryption in the EU may still be in motion unfortunately
I mean what are we to think when you say its censored? That you're making pictures of cute puppies?
so the woke thought police are here ,thank goodness
Who was it that said stay woke, was it Nancy Pelosi.
If it's off topic then disregard
By the way, are fake artists basically artists who are focused on money (i know i might get a lot of negativity from this), are they still having a crisis with AI generated images or have they all sort of secretly started to use AI themselves. Or are they not there yet. I haven't been seeing anyone complaining about AI art much so i assume its because many of them have secretly started to embrace it.
OpenAI censors blood, gore, fighting of any kind as well as nudity
Anyways im trolling you i hate censorship
doesn't microsoft's copilot image generator use the same tech as openAI but supposedly less censored.
Both will be using the latest dall-e model, not sure on the levels of censorship
Within five years, there will be sweeping changes to the internet you know. Microsoft and Intel have been hinting at it for years with an almost DRM style verification system for everything. Wouldn't be surprised to see GPUs and ISPs getting on board as well. Basically, everything would be fingerprinted by the CPU down at the IME level and/or they'll implement something similar on GPUs, that can't be bypassed at the OS level. ISP servers and other servers will have to authenticate requests to do certain things like using CUDA features on a GPU. This would cut down drastically on things like bad actors and botting.
So they dropped the idea of just forcing everyone to use cloud services where no one has a computer but instead an interface and everything is processed and done through the US government.
Are there any SD models that focus on making political art or images? This way normal people can start making their own propaganda.
threats to political and economic stability due to botting and AI>>>your freedumz to sit around making furry porn and questionable waifus in private all day. pretty much every country would throw it into some patriot act equivalent that supersedes personal liberties.
[Attached to my previous post]
any news on sd3?
how do you know?
stable diffusion
It is possible to buy credits as company and dont pay tax ?
you mean "more censored"
can anyone guess what checkpoint this IG profile uses? @silva_florina
What steps can we take to ensure that the images generated by Stable Diffusion resemble realistic human appearances?
Avoid terms like animal, creature, cartoon, livestock
Is there any way to get a token to just have a really minimal effect in a prompt? For example, "a (red:0.01) cat" isn't much different from "a red cat", especially compared to the contrast with say "a (red:2) cat"
Have you tried mixing colors with prompt substitution? [white:reddish:0.5]
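The `[white:reddish:0.5]` form is the prompt-editing syntax `[from:to:when]` from the AUTOMATIC1111 web UI: sampling starts with the first text and swaps to the second partway through. A rough sketch of the idea, assuming a fractional `when` means "switch after that fraction of the steps" (the exact boundary step may differ in a real UI, and the helper names are made up):

```python
def active_text(step: int, total_steps: int, from_text: str, to_text: str, when: float) -> str:
    """Pick which text is active at a 1-based sampling step.

    Assumption: a `when` below 1 is a fraction of the total steps,
    otherwise it is an absolute step number.
    """
    switch_step = int(when * total_steps) if when < 1 else int(when)
    return from_text if step <= switch_step else to_text


def prompt_schedule(template: str, from_text: str, to_text: str, when: float, total_steps: int):
    """Expand a template such as 'a {} cat' into one prompt per step."""
    return [
        template.format(active_text(step, total_steps, from_text, to_text, when))
        for step in range(1, total_steps + 1)
    ]


schedule = prompt_schedule("a {} cat", "white", "reddish", 0.5, total_steps=20)
print(schedule[0])    # prompt used at step 1
print(schedule[-1])   # prompt used at step 20
```

Since the early steps settle composition and the later ones add detail, moving `when` changes how strongly each color comes through.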
Weights are the way if you do not want to use style transfers, ip adapters, controlnets etc.
The important thing is to have more than one weight. So if you have green:0.1 and blue:0.4 the weights are calculated in a different way. It is about the total weight, and if I remember right comfyui and auto1111 have different ways to calculate the weights
Yeah, I'm aware comfyui doesn't normalise the weights by default
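To make the `(text:weight)` syntax concrete, here is a small parser sketch that splits a prompt into (text, weight) pairs. It only handles the explicit, non-nested form and ignores brackets and escapes. How the weights are then applied or normalized differs between UIs, as noted above, and is not modeled here.

```python
import re

# Matches "(some text:1.2)" with no nested parentheses.
_WEIGHTED = re.compile(r"\(([^():]+):(\d*\.?\d+)\)")


def parse_weights(prompt: str):
    """Split a prompt into (text, weight) pairs; unweighted text gets 1.0."""
    parts = []
    pos = 0
    for match in _WEIGHTED.finditer(prompt):
        if match.start() > pos:
            parts.append((prompt[pos:match.start()], 1.0))
        parts.append((match.group(1), float(match.group(2))))
        pos = match.end()
    if pos < len(prompt):
        parts.append((prompt[pos:], 1.0))
    return parts


print(parse_weights("a (red:0.01) cat"))
print(parse_weights("a (green:0.1) and (blue:0.4) cat"))
```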
And I didn't realise this worked, hm
Sometimes you have to play around, or use like cnet reference with a photo
Using words to describe something specific can be hard, like if you wanted a red stripe
Yeah, I have messed around with these advanced tools. I've been using the generators available on perchance lately though, which doesn't give anywhere near that kind of customisation, so I've been wondering what's possible with prompt weights alone.
And assuming the model knows how to draw that
Yeah, I'm aware it's kind of a crapshoot with specifics
I've had some luck with inpaint sketch also
You can insert colors you want and rerender that
That's for sure useful, I'm just working with a limited toolset in this instance
Yeah if you hang out in the prompt help channel for a while you'll inevitably see people trying to make something very specific with just prompting, which has limitations that sometimes can't be overcome
I'll give the substitution a go though, I think now that you mention it, one of the really bizarre and powerful generators on the perchance site is the fusion generator, which tends to construct prompts like this:
"[many photography abstractart black topaz champagne sheer striped abstractart with warm hyper maroon Naturallightsources incadescent on grape dimly-lemon mildy chartreuse Dreamy:giant enemy crab:0.35]"
In this case, I input "giant enemy crab" as the main prompt, and then I believe it has a (by default) random "style" prompt it uses, which basically just manipulates SD with tokens in really unpredictable and usually interesting ways. But I see it uses that blending method, so, might give that a try in more controlled circumstances.
Yeah for sure, only so much a really basic prompt interpreter can do for you.
Yo, I'm new to making LoRAs and am about to finish my first one on forked tongues (body modification). I have a variety of pictures ready. I heard it's good to include similar but not identical images for better learning through captions, so I've added some pictures of normal tongues too.
I'm unsure about naming and captioning. I've named the LoRA 'fork3d_tongu3' and want it to trigger with 'fork3d tongu3' or 'split tongu3' or something in that manner. Do I need to put these tags first to make them the main tags, or is it the name that determines this?
Also, how do I teach the LoRA to distinguish between a normal tongue and a forked one? I've included normal tongues in the dataset but am unsure how to tag them.
Any advice would be appreciated 
Lol this guy is actually mad about an organization that fights CSAM? holy fuck.
luckily these people's opinions are feeble and enforcement against CSAM and associated predators will continue
Which would be absolutely stupid, considering that the world is becoming a darker and more war-hungry place every day. Yeah, give all european data away to them for free... the other actors won't be as stupid.
No, I'm mad that Thorn wastes their time and resources on trying to maximize profits from CSAM at the expense of human rights. They don't actually care about stopping the issue, and they have a very disturbing 'ends justify the means' worldview. They're a tech company that has EU politicians in their pocket: https://balkaninsight.com/2023/09/25/who-benefits-inside-the-eus-fight-over-scanning-for-child-sex-content/
you can look anywhere theres money and find corruption. even in the most altruistic seeming places.
that doesn't disqualify the entire organization's efforts though. that's so absolutist and feels like ulterior motives
There are better organizations in the field, and while they may be doing some good, the harm they are trying to cause certainly outweighs that. Thorn was behind the anti-encryption campaign in the US against Apple. That 'Heat Initiative' group behind the anti-encryption campaign is part of a web of fake charities and advocacy groups to promote their harmful worldviews, and they had government ministers break the very laws that they were meant to enforce by using illegal microtargeting in the EU.
I don't believe thorn was ever "anti encryption" . That was just qanon hyperbole nonsense.
End to end encryption has a LOT of implications. It's a giant buster claymore sized double edged sword. The concern is about E2E encryption on services that children use
"I should be able to close the door with the child and me alone in the room!" is the argument being made against their concerns
They want encryption backdoors mandated on all communication services, and they don't care about the consequences. Its just about getting rich for them as their AI mass surveillance tools would basically be a legal requirement to use
They're registered as a non profit. I'm not sure how that's a get rich quick scheme. There are a lot of other ways of getting rich that are much easier.
If you've followed the news on the subject, it's been playing out for a couple of years, and Thorn had plenty of time to correct the record but chose not to, even when confronted by journalists from reputable sources
Open a casino. EZ
Lots of groups are listed as "non profit" even though they really aren't, and nothing stops them from changing after they get their way (see openai)
The point is there are better groups for companies like Stability AI to partner with, ones that aren't trying to undermine human rights
I think outlandish accusations shouldn't need to be corrected all the time. Same way Kevin Hart was like "no. i will not apologize a third time in order to host the oscars" and instead didn't do what critics demanded of him. Why jump when told to?
https://www.un.org/en/about-us/universal-declaration-of-human-rights point to the one that says adults have a right to private interactions with other people's children
https://www.un.org/en/udhrbook/ maybe the picture book will be easier for you
It refused to disclose Cordua’s emailed response to Johansson’s May 2022 letter or a ‘policy one pager’ Thorn had shared with her cabinet, citing Thorn’s position that “the disclosure of the information contained therein would undermine the organisation’s commercial interest”.
Yeah and? Even non profits have expenditures
They need a business plan
if it were all a matter of money they would've just got into arms trading instead
Experts and journalists are saying that is not normal, especially in the EU. I trust them more than some random person
theres 10000 other ways to make money more easily. they have a different mission
They seem to be experts in their own field. They've been effective in the past
Are you implying that supporters of encryption and privacy want to harm children? And that that justifies attacks on encryption and privacy?
No of course not. The world is so much more complex than 1 or 0
Remember earlier how i mentioned e2e encryption specifically? and not all encryption?
and called it a double edged sword, figure of speech typically implying one effect is good and the other is not as good
The corrupt people with ulterior motives are more likely to be the ones flinging hyperbole. I wouldn't trust those journalists.
CSAM and abuse are a global problem that's worse today than it ever has been before. I don't think it's the time to lighten up on it. From Canada, i'm cheering on the efforts of this US based company. Hurray.
but the evidence seems to point to it being more than a few bad apples, if you trust the journalists of course. Thorn's products also seem to be regarded by customers as overly expensive and not as reliable as competitor products (as of a few years ago). Thorn also claims to have over 99% accuracy with their AI model, but it doesn't take an expert to know that such high accuracy is a straight up lie or the result of messing up training/testing
I did a deep dive into this industry because I wanted to see if there were tools that people here can use on their scraped data to help eliminate and report such content. And there was basically nothing available the open source community can use. I also would appreciate it if you amend your first reply to me so that its not making baseless accusations
guys please i need comfyui help. how do you increase batch size for img2img workflow??? empty latent image node which has batch size doesnt work for img2img
The model is obviously still under trained and the devs have been saying that. Rumors have been circulating that it will take at least a couple months to fully train it, especially if they decided to change some of the model's architecture (like the devs mention on reddit). This is compounded by the financial issues that the company currently faces
why dont they just come out and say that. instead of next 2weeks, after every 2weeks have passed lool
🤷♂️ I don't work for them
I dont believe it's baseless. When you come out and call efforts to hinder CSAM a plot to take over the world, my opinions get built on a pretty time tested foundation. If you want to share controversial opinions with hyperbole, try not to be offended when people point out that it's controversial and you might be exaggerating some things
if their product isn't as effective as others, that's something wholly different
I got absolutely no problem with that... that's why I propose a "Beta-Model" that's obviously labeled "v0.6" or something like that. Nobody expected SDXL v0.9 to be perfect.
I called out fake efforts to hinder CSAM driven by greed, but yes it was hyperbole to say that they were trying to take over the world. They are just trying to get rich by building the surveillance system that would make China jealous, and that system could certainly be used to help groups or individuals take over the EU as end to end encryption could no longer protect democracy
at the end of the day, things usually come down to money and people seeking power who shouldn't be allowed any power
what a big chat
If it were only about money, they would've gone an easier route that made a lot more money. Like insider trading or something. Clearly there is more to it than money.
if it were that easy everyone would be making their fortune that way
End of the day, i have no reason to retract anything i've said
not really. You need capital first. I believe most people have the ethics NOT to commit fraud. Same way that most people don't do armed robbery even though it's so easy
Getting rich by armed robbery isn't "easy" in this day and age unless you live somewhere like Haiti. GTA isn't like the real world
It's just quick and easy money. Ethics get in the way though
since most people have them
Yea, how can you even escape with the money from a robbery from a city like New York?
it's not like in gta
you have to invest thousands of dollars just to do this trick to somewhat work (not even 100%)
you have to look for a good team, you need a good hacker who can take out the cameras, you need a good driver, you need motorcycles/cars for swapping everything, you need an insider and the list goes on
so you have to invest tens of thousands of dollars into this and you're not even 100% sure that it'll "work"
armed robbery happens multiple times a day in nyc. and if everyone was doing it, the cops would have no hope of keeping up.
this isn't even considering the OBVIOUS analogy. If it were just about money they'd exploit children instead of protecting them
motives beyond money do exist
yea, even worse ones
it's pure logic
no one from this entire planet which has power is a 100% good person
because you can't even achieve that power by doing the right things
good and evil are entirely subjective concepts
presidents, politicians, CEOs
:))))))))
morality has always been a hairy topic
so let's say that because of some substances from a factory, some people from a village unexpectedly died. The CEO knew very well that those substances can affect the health of other people who don't have protective equipment, yet he just chose to do nothing but exploit them
can you ever say that this is good?
if yes, you're legit out of your mind
The CEOs have their heads full of "killings" and innocent lives taken
but they seem to not care at all
I typically don't like loaded questions. Of course i wouldn't and i dont know what i said to prompt that inquiry.
how many people would say the CEO is good if the company is selling products they claim help people?
Why all the focus on the CEO? They may not have founded the company or be on the board, and depending on the size may not even know about certain decisions
Just titles and context matters a lot
often the CEO is just a patsy meant to take all the heat. Like say Youtube
Don't you guys have police that shoot random people in the US? 🤔
Canada got those too
the CEOs? please
wdym CEOs
oh analogy
btw I kinda remember seeing video of a robbery of an apple store while no one reacting to it as if it were a daily thing
it was in california iirc
san fran. shit is wild there. people are pissed because of the skyrocketing costs of living caused by the tech industry and the city does nothing for the people growing up there to make things affordable
In two weeks.
rip dodge dog
you guys need to build a Silicon Valley v2 type of thing somewhere else
yeah my dog died
i'm canadian. i'm the other american
america the sequel
well I mean you're technically an American
same way a brazilian is
yup
actually, i change that. peruvian. cause of lake titicaca.
same way as a peruvian. final answer
One can be a lot of "x-ian" at the same time
you could be [Name of the hospital you were born]ian
imagine that in my country, half of the salary is going to taxes (pure taxes, not food, insurance or something else)
50% from everyone's income
:))))
how can you ever become rich as a worker? you cannot
isn't that common
unfortunately
there are only bad roads here
they steal all the money from the taxes
and the judges do nothing
at least we're not paying 27 grand for a frickin childbirth
they are all in
most of it goes for weapons and bunkers for the illuminaters who want to kill most of us and ride out the nucular holocaust there
allegedly
some goes for roads too
:))))
what? :))
where?
outside of the US
uninsured americans
unlucky for real
to think that at some point the usa fought the brits to refuse to pay taxes...
the shitty thing about US healthcare costs is that the insurance companies don't have to pay the same costs. they negotiate a 10th of the price generally
how ironic theyre worse now
only uninsured people get the full price
so if you get sick in the US, you better k1ll yours3lf than paying these sums
I'm joking, but still...
27k for a birth
thats nuts
no wonder birthrates are plummeting
cant afford not only to raise a brat
even to give birth is already too much!
:)))
wars are messy why dont we make it too expensive for them to exist that way theyll die out quietly
hahaha lolrothchild illuminater laughing in the bohemian grove
allegedly
saw a really interesting reel the other day which showed why the 2000s generation can't really afford too much on their own nowadays (I'm talking about the normal people, not about of apes, people doing something illegal and so on). He compared the prices from 1990 with the prices from today and everything is 75% more expensive today. That's why you can't afford to buy even an apartment with all the cash down (you have to take credits from banks) and most of the people are living paycheck to paycheck nowadays
a really bad situation
Well AI will definitely fix the economy.
how?
sarcasm
imagine that now it's even harder for all the manual/AI artists to make money from art
AI will probably be the thing that sends this zombie of a monetary system down the pit
because everything is oversaturated
i was about to be like "AI IS TRAINED BY THE HUMANS WHO HAVE BIASES AND WONT FIX SHIT" but i didn't catch that /s
Does anyone know for loading clip vision with an sdxl model what model to use?
i just want nice hands so i can render myself some heaven while the storm rages
please fix the hands
theyre atrocious
even in these sd3 images
AI is a simple tool which can be used to do many good things, but the problem is that only the richest have the true power to shape AI nowadays, since it's so expensive to train. In other words, AI is in bad hands (openai, microsoft and so on).
and as you can see, the ones trying to do something "good" (open-source) are going bankrupt now
stability ai
```
Error(s) in loading state_dict for Resampler:
    size mismatch for proj_in.weight: copying a param with shape torch.Size([1280, 1280]) from checkpoint, the shape in current model is torch.Size([1280, 1664]).
  File "E:\ComfyUI_windows_portable\ComfyUI\execution.py", line 151, in recursive_execute
    output_data, output_ui = get_output_data(obj, input_data_all)
                             ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "E:\ComfyUI_windows_portable\ComfyUI\execution.py", line 81, in get_output_data
    return_values = map_node_over_list(obj, input_data_all, obj.FUNCTION, allow_interrupt=True)
                    ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "E:\ComfyUI_windows_portable\ComfyUI\custom_nodes\ComfyUI-0246\utils.py", line 381, in new_func
    res_value = old_func(*final_args, **kwargs)
                ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "E:\ComfyUI_windows_portable\ComfyUI\execution.py", line 74, in map_node_over_list
    results.append(getattr(obj, func)(**slice_dict(input_data_all, i)))
                   ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "E:\ComfyUI_windows_portable\ComfyUI\custom_nodes\ComfyUI_IPAdapter_plus\IPAdapterPlus.py", line 711, in apply_ipadapter
    work_model, face_image = ipadapter_execute(work_model, ipadapter_model, clip_vision, **ipa_args)
                             ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "E:\ComfyUI_windows_portable\ComfyUI\custom_nodes\ComfyUI_IPAdapter_plus\IPAdapterPlus.py", line 340, in ipadapter_execute
    ipa = IPAdapter(
          ^^^^^^^^^^
  File "E:\ComfyUI_windows_portable\ComfyUI\custom_nodes\ComfyUI_IPAdapter_plus\IPAdapterPlus.py", line 70, in __init__
    self.image_proj_model.load_state_dict(ipadapter_model["image_proj"])
  File "E:\ComfyUI_windows_portable\python_embeded\Lib\site-packages\torch\nn\modules\module.py", line 2189, in load_state_dict
    raise RuntimeError('Error(s) in loading state_dict for {}:\n\t{}'.format(
```
Does anyone know the fix?
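A hedged reading of that size mismatch: `proj_in.weight` has shape (out_features, in_features), so the second number is the width of the CLIP vision features each side expects. Assuming the usual pairing where 1280 is CLIP ViT-H/14 and 1664 is CLIP ViT-bigG/14, the IPAdapter file was made for ViT-H features while the workflow loaded a bigG CLIP vision model. The helper below is hypothetical and only decodes the two shapes from the error message.

```python
# Hidden widths of the CLIP vision encoders usually paired with IPAdapter
# models (assumption: no other encoder is involved in this workflow).
CLIP_VISION_WIDTHS = {1280: "CLIP ViT-H/14", 1664: "CLIP ViT-bigG/14"}


def explain_proj_in_mismatch(checkpoint_shape, model_shape):
    """Decode the (out_features, in_features) shapes of proj_in.weight."""
    ckpt_in = checkpoint_shape[1]   # feature width the IPAdapter file expects
    model_in = model_shape[1]       # feature width of the loaded CLIP vision model
    wanted = CLIP_VISION_WIDTHS.get(ckpt_in, f"unknown encoder ({ckpt_in})")
    loaded = CLIP_VISION_WIDTHS.get(model_in, f"unknown encoder ({model_in})")
    if ckpt_in == model_in:
        return f"No mismatch: both sides use {wanted}."
    return (
        f"IPAdapter checkpoint expects {wanted} features, "
        f"but the loaded CLIP vision model looks like {loaded}."
    )


if __name__ == "__main__":
    # Shapes copied from the error message above.
    print(explain_proj_in_mismatch((1280, 1280), (1280, 1664)))
```

If that reading is right, the likely fix is to load the ViT-H CLIP vision model with this IPAdapter file, or to pick an IPAdapter file that was made for bigG.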
yeah but i mean im talking about a more fundamental change
man, go into the tech channel
kinda like a transition from slavery to non slavery
just like slavery had to end at some point maybe wage slavery gotta go now
:))
kinda obvious the way things are going theres no future in this
their plan is to make 15 mins cities
ive lived thru the fall of commies
so now i get to see this global whatever go down
once u live thru the collapse of a regime its not so hard to imagine how it can happen
and it happens fast too
like booom
imagine that microsoft will launch an AI that takes screenshots with everything you do on windows and will send them to microsoft
recall it's the name of that tool
it'll be on windows 11 out of the box
that's the way
Is C++ build tools gone for anyone else in visual studio? As i'm testing out a image to 3d model comfyui workload, but suddenly build tools is just gone
someone said that people better try to keep themselves healthy, because in the next 10 years something revolutionizing will appear and the life can be longer with some implants/medicine/interventions.
:))
you cant be happy in time, the fear of tomorrow and memories of the past ruin any chance for real joy
only outside of time can there be joy
so in reality no one is happy
not 100%
if there is a soul then we can engineer it
:))
you really have something with those hands
they are onto your nerves
it seems

theyre by far the worst in AI image gen right now
As far as the body goes anyway
electronics, mechanical parts, writing uhm also are not stellar
but hands are worse
trying my first LLM captioning tool, I seem to lag like 6 months behind with most new tech. I get stuck in my ways
it's called internlm, we'll see how this goes
stability cant catch a break can they
Maybe Emad will lead the revolution.
We need like the American Civil War or the French Revolution. but globally. at this point.
Money has become a hindrance rather than an incentive.
A lot of bitter people out there and that means only one thing: wars.
I have a feeling the system will collapse before I finish my game.
that's my luck.
but it was worth it

for the SUPIR Comfy workflow, do I only need these models? https://huggingface.co/Kijai/SUPIR_pruned/tree/main
I want to do some face-swap stuff with Stable Diffusion. I have a video of someone dancing and want to face-swap faces onto this video. However there is a catch: I want a new face to swap onto the dance video every couple of seconds and most importantly I want a smooth transition to appear when swapping from one face to another, just like can be done in AnimateDiff. Does anyone happen to know if there is any way to do this? 🙂
reactor?
hello people, whats the correct channel for help please ?
I guess it depends on what you need help with, if it's technical support there's a techsupport channel, if it's psychological, doubtful anyone is qualified
if it's moving a couch, I guess it depends on your location
nah, it's with my command prompt, I got an error, and ChatGPT got to a point where it keeps repeating the only solution it thinks there is
(spoiler alert : it doesnt work and it keeps giving me the same procedure)
sounds like you need the tech-support channel. and as a tip, maybe describe what you're doing...you installing something?
well, something-something "torch" apparently
but I did install it, and I also did a "force reinstall" as it calls it
torch, ok yah that's pretty much the glue that makes all this work
it gives me this : File "C:\Users\swatb\AppData\Roaming\Python\Python312\site-packages\torch\__init__.py", line 141, in <module>
raise err
OSError: [WinError 126] The specified module could not be found. Error loading "C:\Users\swatb\AppData\Roaming\Python\Python312\site-packages\torch\lib\shm.dll" or one of its dependencies.
ive checked the PATH thingy, and its normally the correct one, so I dont know whats the problem by this point
well if anyone got an idea to help me u can ping me
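A small diagnostic sketch for that WinError 126, using only the standard library. It assumes the site-packages path from the error message, and `check_torch_dll` is a hypothetical helper, not part of torch. WinError 126 means a module could not be found, so if `shm.dll` is present on disk, the missing piece is more likely one of its dependencies than the file itself.

```python
import sys
from pathlib import Path


def check_torch_dll(site_packages: str) -> dict:
    """Report the running Python version and whether torch's shm.dll is on disk."""
    dll = Path(site_packages) / "torch" / "lib" / "shm.dll"
    exists = dll.is_file()
    return {
        "python": f"{sys.version_info.major}.{sys.version_info.minor}",
        "dll_path": str(dll),
        "dll_exists": exists,
        "hint": (
            "shm.dll is present, so one of its dependencies is probably missing"
            if exists
            else "shm.dll is missing, so the torch install looks incomplete"
        ),
    }


if __name__ == "__main__":
    # Path taken from the error message; adjust for your own machine.
    report = check_torch_dll(r"C:\Users\swatb\AppData\Roaming\Python\Python312\site-packages")
    for key, value in report.items():
        print(f"{key}: {value}")
```

Posting that output in the tech-support channel along with the full traceback would give people more to go on than the error alone.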
If the VCs could see the bigger picture and think long term, they'd be thanking him. SD being open source prevented government from heavily regulating the technology and thus has saved billions of dollars across the AI industry
He cleared a path for everyone
I'll try that, thanks 👍
new Qualcomm chips say they can inference 13b on mobile
seems pretty good
does anyone here know much about the fusion mixed modality models? Like 4o, I was wondering if they could dethrone diffusers for image gen
no
there is no problem running it, the problem is the speed, it will be incredibly slow
anyone tried the krita AI with comfyui? i tried the animation option it has, and while it doesn't seem to be able to do auto animation on top of an animation, it can do frame by frame where you have to generate one for each frame...
Sawmill song
character IP, designer toys, 3D, trendy, street dance, denim, c4d,
anyone know of a frontend for stable diffusion that can work on 10gb ram and use things like real vision I remember being able to run the realistic models on my machine with medvram now it just instantly crashes
you mean like Automatic1111?
yeah it was crashing for me trying to use them even with med
I used it with an 8 GB card and at that time it crashed from time to time, but it worked
maybe you should troubleshoot, check if you need another flag for your system, or reinstall
also consulting about this in https://discordapp.com/channels/1002292111942635562/1002602742667280404
might be many things
has anyone already tried sd3?
Sure ... you can use API or webservices ...
can you suggest some of web ones?
https://glif.app/@Oliveira/glifs/clw44qfbl0000m0zztwqk2tnf can be used for free ... sometimes they run out of api tokens ...
Guys SD3 release in May?
good, thanks
hello my friends, is there a way to keep the transparent background when you upscale with Stable Diffusion?
I use automatic1111
just download comfyui and don't need to use shit anymore
it will never release
hi people
anyone know of a good model to take an illustration or anime and output it to a photo like image with good accuracy for keeping the same facial features
Not even sure how that should work. Most anime face expressions consist of a few line strokes. Keeping this in photo realistic images might look weird?
Hi guys, just saw that I am in r/SD and the SD discord server. Is one more active, or is one archived? or are both relevant?
if I may ask my lil noobie question here also :
Hi guys, I have SD installed with ultraspiceXLTURBO_v15.safetensors [069737f0a7].
I am wondering what video or website I should use to get a better grasp of all the configs please? Thank you
any game devs here using SD AI imagery in their game? I'm really paranoid about using it as I want to release my game on steam.
I'm using a custom civit ai model
Well here you need some luck and faith. Even if the custom model is fine for commercial use of the results, it might contain merged models which are limited to personal/research use.
In addition, when an artist decides that his artwork was mainly used to create that specific style,...
On the other hand, I am pretty sure there will be ways ($$$) to settle such a thing, and it will be a hell of a lot of work to get real proof for a copyright lawsuit
But all depends on your local laws etc.
I'd say never. SAI is dead, their employees (if any) are silent. It doesn't look good, quite honestly. Open source is dead.
I don't understand, even if they go bankrupt, what's to stop them just releasing it?
just release the 8b model. Even if it's not fully cooked, the community will fine-tune it
again the "community will just finetune it if you just release half-baked" mindset
you know that base is extremely important for finetunes
like how you choose potatoes to make baked potato
if you decide to bake a half-grown potato, will it be good? Yes, but certainly not as delicious as a full-grown potato
sure, in the past few weeks people have been dreading that SD3 won't be released. And well, SAI needs to find the balance between time taken and completion
seriously even some of them went through stages of grief
but like, Pony was basically a completely new fine-tune, right? There exist folk who can do some extreme fine-tuning?
and they've been using the API online, so it can't be that undertrained, right?
the online API is working ok?
Even if SAI goes bankrupt and dies, with its scientists moving elsewhere, there are always open competitors against closed source, like, welp, HunyuanDiT made by the Tencent team.
It just get replaced, that's pretty simple.
if SAI dies, couldn't the workers create a new open source co-op together?
some observers in #🆕|sd3 said it is pretty much undertrained. But from what I see it seems like SAI has actually swapped the model version behind the API a couple of times
they should just release the 8b model then, and stop worrying about all the others. Why make like, 4 models that are unfinished when they could just finish 1
AstraliteHeart (the person who makes Pony) literally has like 6-8 A100s stacked together to train Pony, just look at its dataset size.
You remember when SDXL released, everyone was literally saying they couldn't run it on their GPU, right
except those who have 4090 / 24 GB VRAM
sure, but for example, if it's undertrained, then maybe AstraliteHeart could finish it off
so optimize it. Make lightning and quantized versions
better than nothing
Quantization behaves very differently compared to LLMs and it very much affects quality (which is why you never see quantization on diffusion models)
and folks without enough vram can still use online services or SD1.5
Lightning though, well. It just needed time
ok, but a distilled 8B might be comparable to the 800 million, right?
and we already have SD1.5 for people with weak GPUs who don't want to use online
welp...
releasing a bunch of new models just divides the community even more... and SAI doesn't have the resources to finish one
like, again, 1 finished model is better than 4 that never get released
I rather have 4 released
well actually that one is subjective
ok, but 1 is still better than 0
I'd rather 1 than 4. But everyone would rather 1 than 0
I know of one personal project to make a model from scratch (or from the paper)
LavenderFlow, by Simo Ryu (the person who introduced LoRA into the diffusion space)
he used 1% of SD1's budget to get LavenderFlow, as he actually went to FAL to borrow 2 H100s
the result of the model though... will remind you of the days of 2018 AI art
but it is deemed possible to actually train as an individual or group.
SAI could also partner with some kind of stock images site, get free already-captioned images to train on, make money through AI use on their site
because the stock image sites are using AI already, but not very well
There's a goal to get Qualcomm / Mediatek phones running SDXL-quality AI art models.
with current scale of NPU in their processor

Where are the "in two weeks" jokes?
Wait there is less than two weeks left of May.
could this mean...
that aliens exist?
Just bake in good hands this time.
Of course when GPUs get faster and cheaper everyone will train their own stuff from scratch. that is the future.
the models we have now, sdxl, sd15, midjourney etc, they are like the early huge clunky computers that could only be built and run by huge companies and were very expensive
now you have more power in your pocket
not too far down the line you will train sd3 in 3 days on your wristwatch
with quantum computers in minutes
Qubit SD3
and even the hands will be good
reinstall python 3.10.6
but how can someone even detect that you used a certain checkpoint to generate an image if you erase all the metadata?
it's legit impossible
so they can't sue you for anything, because they don't really know what model you used to generate
their last hope is that a bigger company will buy them and they keep this "secret sauce" (SD3) just to attract that company.
and live with what money?
don't be so sure about it :))
Steam Gift 50$ - https://u.to/rWuqIA
@sudden ruin can you remove this grifter?
When do you guys think HunyuanDiT will be on Civit?
How long have the weights been available? 2 days? I'm not sure.
it's not super good at detailed things
https://colab.research.google.com/github/camenduru/HunyuanDiT-jupyter/blob/main/HunyuanDiT_jupyter.ipynb - here you can try it on google colab for free
run the cell and open the gradio link when it'll show
tried it with an advanced prompt and it's not really good at that
and it launched on 14th of May or something
so not only 2 days old
Same as OpenAI: as They obviously got unique content from books (game of thrones) and specific newspaper content. So still a small chance could remain. And I am not talking about a watermark which would be easy to integrate in generated images.
I've never tried to upscale an image with transparency, but you could take the original image into Photoshop, make a mask and then apply it to the upscaled version (scaling accordingly), and it should crop it, I guess
Depends, in theory they could watermark it in latent space to where it shifts some pixels and clusters of pixels around the image like an invisible QR code. There is also invisible watermarking that you can use with SD, but that's something you intentionally do yourself. Both methods can survive certain amounts of post processing people might try to remove them. Like technically, modern QR codes can still be readable even if up to around 30% of the code is damaged (at the highest error-correction level).
So no, it's definitely not impossible. But they'd have to train the model around it and do some tinkering to pull it off
Is it okay if i just read stuff, I am just starting with Stable Diffusion on a very, very elementary level; so don't have much to participate with this at this time?
Help? Create a prompt?
hi
Ah thx m8. Kind of sad that it's not a viable alternative to what we currently have. SD is the only good open source model that is actually usable. From what i can see at least, i don't know of anything else that competes that isn't closed source.
I guess emad jumped ship because he saw the model management issues coming, sd3 nowhere in sight
I mean I could, but how does that help you?
just be creative
for example, say i want to make a kitty, i would say, fluffy kitty, white fur, blue eyes, laying on pillow, sunset, sunlight through window
Hi people ! How can I try to generate an image in discord?
and anything u don't want in the picture, type that in the same form in the negative prompt
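A rough sketch of the advice above: both the prompt and the negative prompt are just comma-separated descriptive tags. The helper name `join_tags` and the negative tags are made up for illustration; only the kitty tags come from the chat.

```python
def join_tags(tags):
    """Join descriptive tags into one comma-separated prompt string."""
    return ", ".join(tag.strip() for tag in tags if tag.strip())


# Things you want in the picture (the kitty example from the chat).
prompt = join_tags([
    "fluffy kitty", "white fur", "blue eyes",
    "laying on pillow", "sunset", "sunlight through window",
])

# Things you do NOT want, written in the same form (hypothetical examples).
negative_prompt = join_tags(["blurry", "extra tail", "dog"])

print(prompt)
print(negative_prompt)
```

The two strings go into the prompt and negative prompt fields of whatever UI or service you end up using.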
you can read about stability-ai artisan, I dont think there's a free bot any more or ever will be
there are some server owners who run their own bots powered by a local install
too bad u cant download Tripo Ai like u can stable diff
kinda sucks tbh
but ig it's fine, Tripo is their own thing, and only partnered with Stability AI
that's good, go community. but as far as sai, when your problem is debt, giving away stuff for free is kind of a bad idea
leaving money on the table is a bad business plan.
i find it kinda cheeky tbh to make ppl pay to use stable diff, when stable is free, it's like the ppl who make u pay are taking advantage of those who dunno how to do that
as far as i can tell, they're not partnered at all. they're just using xl models. maybe they have a stability membership for commercial rights, but that would make them a customer not partners
they arent, you can still use all their released models free, it's just the new one that you can pay for the privilege to use in lieu of a free release (assuming that ever happens)
such is the nature of open source. Google doesn't pay to use the linux kernel and samsung doesn't pay to use android
"TripoSR is a state-of-the-art open-source model for fast feedforward 3D reconstruction from a single image, collaboratively developed by Tripo AI and Stability AI" is wut is plastered on the demo repo
xD
i dunno all the hand shakes tho that were prolly made
and honestly, it's hard to fault them when they are struggling financially
but, seems like they worked together
That was probably just marketing speak. maybe they just emailed a question to one of the staff members
https://stability.ai/news/triposr-3d-generation oh no it's actually released. i was confused and thought tripo was a different 3d thing. yeah that model is widely available for people
i've never touched their website for instance.
but i've used that model a lot
I have zero interest in the api, or any of their base models. I just want to see what the community does with the weights, so if they never release it, meh
i'm more interested in the tech instead of brands. Stability weren't the first researchers doing this. Some of the other openly available models are far better than SD's original release
this is wut sum1 said on the reviews on github
"They are never going to give away their latest SOTA model as it is a commercial product, if they will update the model in TripoSR with something a little better that depends on the competition, at the moment TripoSR is the best open source model, unless a competitor releases a better open source model for a different project, then they have no incentive to release another model for TripoSR. Even so, they would just release a model a little better than their competitor had and nothing close to their commercial model."
and that made me come to the conclusion that it's all property of Tripo
which sucks u-u
the model that is there is not really all that developed
and it prolly won't be built on any further, and the way the UI is designed doesn't really allow u to intuitively install another model
like gemini? heh, closed-source woke diffusion
there are other things out there, but I'm not seeing a bunch of open sourced weights
elon released his grok or whatever, that was cool
I just hate the idea of closed source because ultimately you'll just be spoon-fed someone else's agenda, that's the end game
it's up to us, the community, to rise above the robber barons, and make like a linux community of open AI, for both txt to image and txt to 3D models. When i say linux-like community, i mean open source and free
:3
microsoft just released Phi-3
it's funny to me that these days, MS is actually quite the champion of FOSS
Github has remained pure since the acquisition. They've given so much
this is the most I've ever seen goog threatened, they have the doj monopoly thing with their play store, they have the threat of ai assistants such as chatgpt that threaten their search business, they have all the news publishers lobbying to get link laws like cali and canada and australia
And Meta just released Llama 3
Grok models are a pile of crap that was never meant for release. He only did it because not doing it really weakened his "open ai needs to open their tech" arguments. I don't think anyone has managed to rig up their own deployment
yah, I have no opinion on the quality of grok, but even if it's garbage, maybe someone can do something with it
doubt. llama makes more sense for community hacking. grok is something like 80gb
oh wait no. the 770 files that make up grok weights are 296GB
Phi-3 models are 4-8gb each, stand alone, and have 128k context length vs 8k. grok is just not the cheese
As a startup model, I'd say that's just fine. Obviously everyone will just move on and continue exploring LLaMa 3
the response given is decent but LLaMa 3 can do the same, if I'm not wrong.
And Grok isn't even supposed to run on home hardware.
Llama 2 is more effective than Grok on all the common metrics if i'm not mistaken.
Even as a datacenter level model, it's far less efficient than all other options available. it's just far behind the research curve and is more of a historical record of what grok was than something that will help future development.
like if microsoft open sourced directx 7 today. Even that might be more interesting though.
ok more like directx 3 when things were really bad and they hadn't found good solutions yet
All these LLMs and VLMs are basically using the same datasets, and the only differences are the size of the model and small differences in architecture
If you want better open source models, we need better datasets. It's that simple
would be cool if an LLM were trained largely on biographies / true story narratives as well as high quality fiction
ok detailed illustrations comics stuff like that
Only 4 a100 unfortunately.
how do you calculate it/s that people are posting? is it just the it/s from a single image that's listed in the comfyui terminal output, or do i have to run a specific script to get an average?
[Image content] A weary tiger curled up on the grass, eyes dull, body exhausted, fur disheveled, surroundings dim, giving off an atmosphere of exhaustion and helplessness.
[Positive prompt] A weary tiger curled up on the grass, with a dull gaze, tired body, disheveled fur, and a dim environment. This high-quality image captures the vulnerability and exhaustion of the tiger, evoking a sense of fatigue and helplessness. Perfect for projects that aim to showcase the emotional depth and fragility of wild animals.
[Negative prompt] blurry, low quality, pixelated, (unattractive), (small size), (angry), (vicious), (unnatural colors), (out of focus), (unclear), (dirty), ((extra tail)), ((extra paw)), ((extra ear)), ((extra eye)), (out of frame), (bad composition), (too bright), (too dark), ((extra stripes)), ((extra teeth)), (poor lighting), (bad color grading), (red-eyed), (morphed face), (unnatural posture), (awkward pose), (frozen animation), (low-res), (bad framing), (insipid)
[Parameters] Sampling method: DPM2; Sampling steps: 20; CFG Scale: 7; Seed: 3856492; Optimal aspect ratio: 3:2
Exactly
but it's not the model itself that is good (like SDXL), it's the versions fine-tuned by the community
the base sdxl model is a joke
like the base 1.5
you can't compare an OS with AI trained models. You can develop that OS even in a basement, but for training an AI the costs are 1000x higher.
Musk is a fraud anyway, he even stole the Tesla idea from 2 good researchers without even crediting them. So he's really not as smart as the people think.
if you've got an image with someone upside down in your data, is SD able to use that info to better draw right-side-up people?
Ive installed python like 3 days ago, I think its the right one lol
from your very own log message Python312
so no, you didn't install the correct python
install any python from 3.10.6 to 3.10.11
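A quick way to check which interpreter is actually running against the range suggested above. This is only a sketch: the bounds are copied from the chat message, and the function name `is_supported` is made up here.

```python
import sys

# Range recommended in the chat: 3.10.6 through 3.10.11, inclusive.
MIN_VERSION = (3, 10, 6)
MAX_VERSION = (3, 10, 11)


def is_supported(version):
    """Return True if a (major, minor, micro) tuple is inside the range."""
    return MIN_VERSION <= tuple(version[:3]) <= MAX_VERSION


if __name__ == "__main__":
    current = sys.version_info[:3]
    status = "OK" if is_supported(current) else "outside the suggested range"
    print(f"Python {'.'.join(map(str, current))}: {status}")
```

Run it with the same `python` command you use to launch your scripts, since several Python installs can coexist on one machine.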
oh ok, so is there a problem with recent versions?
still the same goddam useless f*king problem with Pytorch
I uninstalled it and installed it at least 50 times each, im getting tired of this sh*t
yeah. I'm hoping it turns up on Civit as a finetune at some point.
Ok can anyone help me on this please
Traceback (most recent call last):
  File "C:\Users\swatb\Documents\essaiNSFWredhead.py", line 1, in <module>
    from diffusers import StableDiffusionPipeline
  File "C:\Users\swatb\AppData\Roaming\Python\Python310\site-packages\diffusers\__init__.py", line 5, in <module>
    from .utils import (
  File "C:\Users\swatb\AppData\Roaming\Python\Python310\site-packages\diffusers\utils\__init__.py", line 97, in <module>
    from .peft_utils import (
  File "C:\Users\swatb\AppData\Roaming\Python\Python310\site-packages\diffusers\utils\peft_utils.py", line 28, in <module>
    import torch
  File "C:\Users\swatb\AppData\Roaming\Python\Python310\site-packages\torch\__init__.py", line 141, in <module>
    raise err
OSError: [WinError 126] The specified module could not be found. Error loading "C:\Users\swatb\AppData\Roaming\Python\Python310\site-packages\torch\lib\shm.dll" or one of its dependencies.
There is apparently a problem in there that I can't find. I've tried uninstalling/reinstalling torch and Pytorch, checking PATH, and now trying a different version of Python, and nothing changed
I've checked the shm.dll file, which is apparently the one acting up, but that tells me nothing
I don't know where to look anymore nor what to do
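One thing that can at least narrow this down is checking which interpreter is running and which copy of torch it would pick up, without importing torch (the import is the step that crashes). This is a diagnostic sketch only, not a fix; the helper name `package_dir` is made up, and the `lib/shm.dll` location is taken from the traceback above.

```python
import importlib.util
import sys
from pathlib import Path


def package_dir(name):
    """Return the folder a top-level package would be imported from, or None.

    find_spec only locates the package; it does not run its __init__.py,
    so this works even when importing the package itself fails.
    """
    spec = importlib.util.find_spec(name)
    if spec is None or spec.origin is None:
        return None
    return Path(spec.origin).parent


if __name__ == "__main__":
    print("interpreter:", sys.executable)
    torch_dir = package_dir("torch")
    print("torch found at:", torch_dir)
    if torch_dir is not None:
        dll = torch_dir / "lib" / "shm.dll"
        print(dll, "exists" if dll.exists() else "is missing")
```

If the interpreter or the torch folder printed here is not the one you expected, the reinstalls may have been going to a different Python than the one running the script.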
try asking in #🤝|tech-support (with a full error log not just snippet)
#🌠|show-and-tell Do you know the game Disco Elysium?
thats everything it tells me after ive entered my command tho
we don't see the command, we don't see what it tells before that, etc
Just generate like 3 to 4 images or so, since the first one or two generations can be slower because stuff didn't get cached properly or something
And once you get the same speed on the last two images that's your it/s
Tho on its own it/s is meaningless if you don't also say what size the images were
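If you would rather compute an average than eyeball the terminal, the same idea (drop the warm-up runs, then average) can be sketched like this. The timings are invented, the function name is made up, and wall-clock time per image includes more than the sampling steps, so the result only approximates the it/s that ComfyUI prints.

```python
def steady_state_its(steps, seconds_per_image, warmup=2):
    """Average iterations per second, ignoring the first `warmup` images."""
    timed = seconds_per_image[warmup:]
    if not timed:
        raise ValueError("need at least one image after the warm-up runs")
    return steps * len(timed) / sum(timed)


# Hypothetical timings for four 20-step generations: the first two are slower.
times = [9.0, 5.0, 4.0, 4.0]
print(f"{steady_state_its(20, times):.2f} it/s")
```

Whatever number you post, include the image size and step count next to it so others can compare.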
thank you
also, can i copy the entire comfyui folder to back it up before installing custom plugins that might break things? I'm on linux. I understand it's a 'portable' app, so it should be able to be moved to a new computer if i want, right?
Yeah, but it shouldn't be a problem
ComfyUI stores all plugins in a single folder
If you remove the custom_nodes folder it removes all the extensions fully
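A minimal sketch of the "copy the folder first" approach using only the standard library. The function name and the example paths are placeholders, not anything ComfyUI itself provides.

```python
import shutil
import time
from pathlib import Path


def backup_folder(source, backup_root):
    """Copy `source` into `backup_root` under a timestamped name."""
    source = Path(source)
    stamp = time.strftime("%Y%m%d-%H%M%S")
    target = Path(backup_root) / f"{source.name}-backup-{stamp}"
    # symlinks=True keeps symlinks as links instead of copying their targets.
    shutil.copytree(source, target, symlinks=True)
    return target


# Example (placeholder paths, adjust to your install):
# backup_folder(Path.home() / "ComfyUI", Path.home() / "backups")
```

Model checkpoints can make the full folder very large, so pointing this at just the `custom_nodes` folder may be enough before trying new plugins.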
nice
Also all your generation parameters are saved in the output images
So you can just drag and drop your old generations into Comfy to get the whole pipeline
Anyways, get comfy manager plugin, makes downloading other plugins way easier
that's neat, i wonder if they used some sort of encryption to do that
And updating and such
No, image metadata
metadata, i should've known that lol
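To make the "it's just metadata" point concrete: a PNG file is a sequence of chunks, and plain-text chunks (`tEXt`) hold keyword/text pairs that any program can read. The sketch below reads them with the standard library only. The keyword `workflow` and the JSON string in the demo are assumptions for illustration, and this reader ignores compressed (`zTXt`) and international (`iTXt`) text chunks.

```python
import struct
import zlib

PNG_SIGNATURE = b"\x89PNG\r\n\x1a\n"


def read_png_text(data):
    """Return {keyword: text} for every uncompressed tEXt chunk in a PNG."""
    if data[:8] != PNG_SIGNATURE:
        raise ValueError("not a PNG file")
    found = {}
    pos = 8
    while pos + 8 <= len(data):
        length, kind = struct.unpack(">I4s", data[pos:pos + 8])
        body = data[pos + 8:pos + 8 + length]
        if kind == b"tEXt":
            keyword, _, text = body.partition(b"\x00")
            found[keyword.decode("latin-1")] = text.decode("latin-1")
        if kind == b"IEND":
            break
        pos += 12 + length  # 4 length + 4 type + data + 4 CRC
    return found


def make_chunk(kind, body):
    """Build one PNG chunk (used here only to fake a tiny demo file)."""
    crc = zlib.crc32(kind + body) & 0xFFFFFFFF
    return struct.pack(">I", len(body)) + kind + body + struct.pack(">I", crc)


# A fake, pixel-less "PNG" carrying a made-up workflow string.
demo = (PNG_SIGNATURE
        + make_chunk(b"IHDR", struct.pack(">IIBBBBB", 1, 1, 8, 2, 0, 0, 0))
        + make_chunk(b"tEXt", b"workflow\x00{\"nodes\": []}")
        + make_chunk(b"IEND", b""))
print(read_png_text(demo))
```

On a real output image you would pass the file's bytes instead, for example `read_png_text(open(path, "rb").read())`. Nothing is encrypted, which is why dragging the image back into the UI can restore the pipeline.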
Encryption is when you want a message/file to be only readable if someone has a password
Or similar
yeah i wasn't thinking right
Nothing wrong with confusing the terms
I have the manager installed, would it help me fix missing nodes errors? a few of them are conflicting...
Trying to use some very large pipelines or something?
it's not that big of a workflow
I'm missing these: Text box
SDXLPromptStyler
ACN_AdvancedControlNetApply
ControlNetLoaderAdvanced
ScaledSoftControlNetWeights
i'll post it in #🤝|tech-support I can't post images here
hey, im looking for a good 2d checkpoint, not like anime style but more of a cartoon
any sugestions?
Check Civitai.com
yea ik ik, but there's so many that I don't know which one to pick hahah
Yes ... but who knows better than you which preview pics suit you best?
Almost fully, their python deps will still be installed, but that shouldn't matter most of the time
- Node conflicts and missing nodes are two completely different things
- ComfyUI Manager has a button to install all missing nodes
So that should help in theory
It would be real swell if in the future the open source community will be able to mass train models (like a folding @ home type of distributed setup) that we all benefit from. Instead of watching the big corpos pass us by
There are already a few
Nobody says we can't
Other than several million dollars of compute cost for making a single base model
So go on, make one
I think the idea is that the community could share the compute cost by offering unused time on their own GPUs, I still agree on the "go make it" point though.
@empty star , it seems entirely possible that one might be able to package the "help train community models" as a custom comfy node or something if you're worried about distribution of a new standalone thing. You're free to use this idea (and the next) however you want to 😛
Or, you could start a new blockchain that somehow uses training community models as its mining function 😂
shouldn't
new stable assistant version. still not any weights released
there will either be a release

