#💬|general-chat
1 messages · Page 136 of 1
I'm looking for an assistant to help with data science tasks. Prior knowledge of data science is not essential. Applicants must have the right to work in their country of residence. Preference will be given to candidates from the US, Canada, Australia, Argentina, or Mexico. Please DM for more details.
guys which gpu should i take for stable diffusion Video not images
like converting my videos into anime
rtx 4080 or 7900xtx for more VRAM?4080 less vram but more compute units
also warpfusion as i heard its better than stable diffusion and more heavy
nvida 4080 ?
less VRAM
will it run stable diffusion?
unless ur willing to wait for 5090
I got this
how is the performance
Great
also waiting times
like u convert ur normal video into anime
so how much wait time
coz i heard video is more heavy than images
connverting video into anime
also i tried to see benchmarks for video like video conversion i didnt find any only benchmarks are for images
can any1 plz send me benchmarks for video like converting videos into anime benchmarks thx
3090 same as 4090 price bro
my budget allows me 4080
will 5080 launch this year?
also 5080 same price as 4080?
did u try video conversion?
like ur normal video to animation stuff
'?
how many minutes it took to convert 1 video of 1080p @60fps 30 minutes/
imagine/tennis player hitting forehand
No one besides Nvidia knows the answers, but realistically they probably will be more expensive again and we wont see much improvements 
r0fl it is not
huge leap from 40 series for AI?
dont buy new
lies
blackwell architecture is perfect for AI
5080 coming this year?
50xx series is gonna be a massive gamechanger
or next year?
no
can u plz try whenever ur free and tell me how much time it took u
how many minutes it took u to convert a 1080p@60fps video 1 minute
@rich kestrel how is 4080 for images and thumbnails for youtube?
Straight up calling lies on my humble guesses and not providing any sources to prove me wrong, loving the internet 
I kinda doubt it will reach consumer cards, Nvidia makes their money with big AI clusters now and not us gamers/AI enthusiasts
i want it mainly for AI and productivity not for gaming
4080 comes with 16gb I think
16gb = decent enough for most cases
not very futureproof
There are multiple 4080 versions tho
I think 16gb is max tho?
thts why I fink 3090 is better choice
used
U can prolly get one for like 800 bux
thats what im selling mine after I snatch 5090
ya cuz 24gb = cadillac vram
5080 also 24 /32gb VRAM?
AND 5090 32/48gB VRAM?
@rich kestrel
I HEARD NVDIA is going all AI
5090 is probably gonna be 24-32
5080 doubtful
tbh if u go 4080 or 3090 both are good choices
as long as u dont buy amd
okays
ill try to get 3090
if possible
@rich kestrel have u tried converting videos?
any1 here tried converting videos into animation?
how long it takes to convert 1 video 1080p@60 fps 1 minute long with 4080?
convert how?
encoding?
or animateddiff?
I do a lot of interpolation and video editing
and 3090 is still a beast
he probably wants to know how long it takes to render video on 4080
yes
guys can u frame interpolate a 30fps video into 60 fps in SD?
Yes, image generation
Could anyone kindly give me some pointers?
You dont need SD for that
Hello
5080 and 5090 will be announced in June. Everything else is speculation.
AMD is amazing too. Please remain impartial in your feedback if you don't know what you're talking about.
Anyone knows any good alternatives to suno I can run myself? Like meta/musicgen
so coming in june?
Good, but not for AI
I'm an AMD user myself
and I have to use cloud GPUs
No. They're being officially announced in June. AMD has some big stuff coming soon too. They partnered with Samsung on some chips (again) and a a whole new chip is supposed to be announced sometime this year.
In the past two years I have have been working with AI, 100% of it was AMD.
I don't really think that AMD can beat Nvidia in the near future in the AI field
for gaming, they are alright
I have made millions of AI images. I respectfully disagree with you. I'm not here for a pissing match tho. I have NVIDIA cards too.
can we expect 5080 to also come this year in december?
coz 5090 is guranteed
Any answer would be speculation. So I won't. 🙂
Can Stable Diffusion add text properly on images? MJ and SD have always had troubles with that but DALL-E does it fine most the time.
Im trying to make a cover art of a realistic woman with text on it like an album art for music. Using Auto1111 and realisticvisionV20 model... its been a long while since I used this.
use the SD3 API. It works inside ComfyUI
Hmm.. but Ive already got Auto1111.. is it gonna take me a long time to set up? I mean Im creating images right now, I just need it to put text on it like DALL-E can do.. saves me time to add text separately later 😐
day 493 no sd3
You mean I can do it via the browser?
Man Im too noob for this. I only know how to use Auto1111
You can download Davinci Resolve free. It is both an image and video editor. It will use any font that's on your computer. Great for posters, fliers, album art, etc.
If you need it AI fancy you can make it on a background and import into AI controlnet. Then export the result back into Resolve and apply it to your image. Resolve is a must have. Moderate learning curve.
Alternatively you can go to Kittl. It has both a free and paid thing. Outstanding for quick text. Lots of toys. Cheap and easy text/font. AI tools there.
Yeah I like the AI fonts because its random and different vs using the same normal fonts installed. DALL-E does text pretty well.. its just the people not looking real thats all.
So... can we get the weights now... pretty please?
looks like they might try to ditch clip and train only on t5
for context, they've mentioned limiting t5 to 77 tokens to match clip
if anyone need access to free SD3 dm me, I will send you a discord link where you can generate free SD3 images
lol alls im sayin is for someone looking to invest, you WILL regret investing in AMD
especially when blackwell comes out, the most AI optimized cards ever
But hey have fun going linux and installing a million compatibility patches for AI. its your money not mine
it's that bad?
so a 7900 xtx would be a bad buy?
I'm looking to buy a gpu in the near future - I was looking at used 3090s but thought maybe a current gen might be better but the vram is usually low - the 4070 Ti Super is the cheapest one with 16gb and around the same price as a 7900 xtx
@teal pagoda - would you advise against getting a 7900 xtx then?
theres always gonna be ppl sucking up to the competition
there's always gonna be pepsi fanboys even though coke is the gold standard
just saying theres a reason nvidia is valued at over $1T and keeps growing and amd is just gathering dust in the corner
nvidia has all the good tech and is the one ahead by miles
the way it's meant to be played 🙂
yea im gonna stay with nvidia, dont see any reason to switch to amd
I prefer pepsi
we found the serial killer jk :3
exactly
And I don't support Nvidia or AMD
I just say facts
But for AI things, Nvidia is legit better
for gaming, AMD is good with their prices
You don't have to be a fanboy in this life
just study/research about everything
and you'll make the right decisions
I'm not a fanboy - I have no horse in the race - I even invite any excuse to pick AMD - for the open source aspect in Linux - but, ppl who use these programs - most say to get an nvidia gpu - for Blender, AI/SD, Davinci Resolve - it's actually depressing....
why is it depressing to recommend the superior brand
perhaps theres a reason ppl recommend it?
I don't care about gaming - I do some gaming but I figure either amd or nvidia is fine for that - I had a 3080 10gb - I sold it - I wanted to get some value for it while it still had some and it was only 10gb so I was kinda concerned the vram might be a bit low eventually
Even Topaz Video works better with Nvidia 🙂
depressing because a) it's always good if there's choice; b) cuz nvidia is a crappy company, let's face it c) amd is the open source choice in Linux and nvidia has some issues or problems in Linux, usually - although, I hope that changes soon .... and d) amd gpus are generally cheaper
if you wish to upscale videos/increase the framerate
who tf cares about linux....
except for programmer neckbeards. Im not a tech nub and even I dont like linux
unbloated windows 11 is da bomb
only the higher price should be depressing
:)))
is 16gb enough for SD?
if I narrow it down to a 3090 or 4070 Ti Super - is the 16gb of the 40 card enough?
kinda right
:))
90% of the AI tools are for Windows anyways
10 works too
Well, to start with, I will probably use Windows with these programs most of the time - I will try them in Linux at some point though - but, at first...
I was using 10 - now 11
some video workflows might require you to go to 24gb, but 16gb should be plenty for most things yes
oh... thanks for the reply
isn't SD3 big model 24gb too?
I can't afford a 4090 at this time 🙂 so, my choices are kinda a used 3090 which is cheapest or saving a bit more and then getting a 4070 Ti Super - which is more power efficient anyway and somemore features
im waiting for 5090 to upgrade my pc
well sd3 will come in various sizes, but even the biggest model, im sure you will be able to use it with less than 24gb
jus hope it doesnt sell out instantly g
cuz im predicting its gonna be another shortage

everyone and their grandma is saving up for that mf
Hi, i have a question about adetailer for consistent character.
- I generate photos with juggernautXL and also use adetailer for face, does adetailer use juggernautXL for inpainting or some own model?
- How should i provide prompt for consistent character? Both detalized prompts for juggernautXL and adetailer, or general for juggernautXL and detalized for adetailer?
But why would you buy 2 generations old GPU if the newest Nvidia 5000 series will launch this year?
I mean with all this AI advancement speed, you'll better keep the money to buy something from the 5000 series
16 GB VRAM next year will be nothing in the world of AI
If you only look at how fast is everything evolving into the AI field, you'll understand this
faster advancement, higher requirements
And if you wish to "play" with all the new tools for AI, you really need resources
The GPU is the most important atm, maybe in the future they'll develop some technologies for the processors too.
yep
the world is in a continuous "crisis" right now
4090 was good for its 2-3 years
dam I just bought a 4070 Ti Super 2 weeks ago, should've waited
Tbh this sounds like they are still very very far away from a proper weights release... can't they just give us what they have and release the "proper" version as 3.1 or even 3.5?
Right now perfectionism is not the right approach. Everything is too unstable for that.
big mistake
perhaps I will be able to sell it and upgrade
ain't cheap! I saved up gift cards for 2 years 😭
Cheaper? U know 5000 series will be even more expensive and low availability,. right? 😭 😒
we just boosted to level 3 🥳
cause the 40 series will drop in price (maybe) and maybe there will be a 50 series card that is relatively reasonably priced (around the 4070 Ti Super range (still not reasonable)) with much more power. Wouldn't mind 24gb+ VRAM for party purposes.
Yea, I know, but you won't do much at the end of 2024 - and all 2025 with 24 GB VRAM of RTX 4090.
24 GB VRAM are not enough even now
maybe lol 🤣
I don't know if you noticed, but to "play" with AI, you really have to invest
this is a really expensive "hobby"
not like gaming
even more expensive
I never thought of it as an expensive hobby
its more of a perk of investing in a good pc
why wouldnt anyone who isnt a granny NOT want a good pc?
it can be put like this too
reasonably priced lol - you really haven't been following the gpu world, have you? 🙂
I spend 8 hours a day glued on to one to earn a living
might as well have a good machine
I wouldnt' know - I just see prices so that's all I know
the prob is that a lot of ppl from this discord arent even in the us
that's right
apparently 900 bux for a gpu is insane in many parts of the world
🤷♂️
thts just what they cost..
But for people who don't really plan on doing something useful/earn some money with AI, the 16 GB VRAM GPUs should be fine for now
a used 4090 here costs $2000 , used 3090 - around $900 - but, that has been coming down - although unknown if the sellers selling $700 ones are legit - used 4080s are around $1300, new 4070 Ti Super is around $1300 ish
I mean if you only wish to generate "w@ifus" the entire day, 16 GB VRAM are just fine
and so on and so on if u get the idea?
dunno - obviously, more vram means more options and versatlity - might need it might not
I want to game and AI, so I thought a 4070 Ti S ($1100 CAD) was a good middle ground, I agree that the price is still not reasonable
Ai low requirements can go very low if u want less expense
I use a 2070 works fine for sd 1.5
but, if i need vram - then my options are the 3090 and 7900 xtx
that's b4 tax - I'm talking about CAD too
Exactly, but you can't play with the new toys in town like SDXL or SD3
ah, I thought so :)
but if all u wanna do is generate some waifus it's fine
as I said
is the 4070 Ti S 'enough' for SD3?
maybe, we don't really know
Curious to see sd 3 cn then I might make a change
benchmarks I find almost always incl. the 4070 Ti Super and even 12gb cards! 😮
whats the current best way to do upscaling of small images?
A custom node. That also shows the temperature and memory usage. I need to check which one it is.
Let me repeat the first sentence of that plugin:
With this suit, you can see the resources monitor, progress bar & time elapsed,
And it might be useful to keep an eye on temperature if you mass generate stuff or to watch the memory of the gpu during animate diff etc. to avoid swapping if possible.
can someone who can use stable diffusion help me generate an image?
Swapping happens when you run out of memory and it is forced to use disk which is orders of magnitude slower
If the memory from your gpu is not big enough for the model or the task the normal memory is used. So unused parts are copied from the gpu memory to the system memory (slower). This takes a lot of time.
After installing nodes or plugins from the manger you need to restart the ui
my computer cannot generate the images because of the specs
Yes just to know it. Then you could decide wether you reduce the resolution or reduce the context length of an animation.
there is actually a rule against external linking, but nothing preventing you from mentioning the service. look into vast.ai, runpod, rundiffusion, maybe colab - but not the free one as it's against google's TOS/EULA. I personally have used the first two, and both are good, the first is the cheapest also (that I've found) @fervent thunder
SUPIR for free
what image?
sd3 release when
That like the Rapture for christians
This a joke?
its iminent man imminent
SD3 is a myth
they don't even know if CLIP should be kept or just go directly for T5, they're testing if it can avoid some prompt adherence issues
i mean on HuggingFace
CLIP being removed forever?
It is annoying how some models want clip skip 1 or 2. It would be nice if you didn't have to worry about that.
SD3 is about as ready as HL2 was during that Alcatraz wrap party
but this will be worth the weight too
It seems that the newest update of a1111/controlnet destroyed the 16 GB VRAM GPU. You'll get "CUDA out of memory" error. Guess I'll have to use Forge or older commits of a1111/controlnet
who cares when, watch a cartoon or something
Ren & Stimpy?
Nah I prefer the golden age of cartoons buggs bunny
warner bros and hanna barbera from 50s-70s
We need AI that makes cartoons in that style
not just glossy waifus
there should be a setting to toggle clip and t5 I think
so we get both
hello
whats your gpu?
you mostyle just need to activate Tiled VAE
T4 cloud GPU
nah, everything went good until this last a1111/controlnet update
ah okay, sdxl or 1.5 controlnet?
was using 3 controlnets at the same time + hires. fix with XL models :))
without the "low vram"
of controlnet
and everything worked perfectly
now I get CUDA out of memory even with all the best arguments used
hmm, and you did update controlnet and auto?
guess I'll go back to forge
yea, to the latest update
a1111 to dev branch and controlnet to the latest master branch
ah auto1111 dev branch
wouldnt recommend that if you want to have a stable experience xD
I tried with the master branch too
it's the same thing
the only diff is only 1 commit
go to check the repo :))
yea, maybe you find the culprit or the change
yea or maybe I go back to forge
that's the fastest solution
unlucky that ilyasviel left the project in dust
yea, but maybe he comes back sometime
would be good for fooocus, forge and controlnet
I don't think so
he didn't make a single controlnet model for XL 🙂
he left Fooocus too
Now mashb1t is doing almost all the work
maybe he got a good job or something
You guys remember SD2 and 2.1? Man that was terrible
I still have flashbacks about it
the worst SD
unused by everyone
hey guys one question , how can i lesser the effect of a prompt , for example i want the charachter to smile but the smile is too much
i wrote down smile
i know there was a way with writing 0,5 or something but i forgot how to do it
Hello hello SD afficianados; I have a question: I have a (paid) job posting up on one of the better known freelancer websites looking for an SD expert (ComfyUI especially, and A1111 is a plus) to collaborate on a new show that will lean heavily on Stable Diffusion in order to test and prototype capabilities re: img2vid, vid2vid, existing workflows to integrate into ComfyUI, AnimateDiff, Deforum, Controlnet, and others....does anyone have advice on where to take that information and is there a Discord server dedicated to AI design jobs? thx
I wasn;t aroudn back then but why was SD2 so bad?
hello
in auto1111 its by using (word:0.5) so its half the strenght
there was an sd2?
been out of it for a little bit
They changed the CLIP model, and everyone was like "thanks I hate it" because it was harder to get good results. But I heard it just needed embeddings or loras for nice images
Hello. what requeriments i need for run SD? i only want make small res (768x768) images (my graphic card: Nvidia GeForce GTX 750, 2gb)
for 2gb and 768x768 it's gonna be really on the edge... but still possible. you might have to use comfyui on the lowest vram mode, along with lcm maybe, and tiled vae maybe too.
idk how SD (local) works. can you explain me more?
I'm completely new to this how do I download SD and start generating images
Go download and install automatic1111 from github. There are many youtube tutorials that will talk you through how to use it.
Welcome to the new world
Bet thanks
Why don't they just something to download on their website
I feel like that would make more sense
Because it's open source. I think they have their own program where you can pay to generate images but I've never used it.
But if you want to generate for free using your own hardware you have to do a local install.
It's slightly complicated if you are completely new to it
Oh shit ok
Hi guys
I have a question on how to solve (main reason being I need to do this for over 1000 images and can't manually fix it 1 by 1)
PROBLEM:, i have a input image, that i am trying to convert to stylized 3d look, which is working nicely with the help of controlnet tile and canny edges, but the only issue is i am getting color changes in the output image ( which should not happen given controlnet tile model is present), is there any way to fix this , currently using SDXL and 2 controlnets
-
I have tried the img2img color fix, that brings the colors a bit closer but still there is a difference (without this enabled, the color is wildly different to the input image as shown in pic 2)
-
tried entering the colors in prompt, still doesnt match
B) Output without img2img colorfix: https://media.discordapp.net/attachments/1149510134058471514/1236048262650400829/Without_Color_correction.png
C) Output with color correction: https://media.discordapp.net/attachments/1149510134058471514/1236048342056828948/After.png
its making pink flowers yellow, white flowers pink, orange flowers into dark pink, grey stones into pink
hola
Hello everyone, please suggest is this PC setup is ideal for Stable Diffusion 1.0 SDXL -
Component
Option 1: RTX 3090
Option 2: RTX 4090
Processor
Single Intel Xeon Gold or Platinum processor
Single Intel Xeon Gold or Platinum processor
(e.g., 16 or more cores)
Memory (RAM)
128 GB ECC DDR4 RAM
128 GB ECC DDR4 RAM
Storage
2 TB, SSD-based storage
2 TB, SSD-based storage
GPU
NVIDIA RTX 3090 (24 GB VRAM at least)
NVIDIA RTX 4090 (24 GB VRAM at least)
Que tal
Hello everyone, please suggest is this PC setup is ideal for Stable Diffusion 1.0 SDXL -
Component
Option 1: RTX 3090
Option 2: RTX 4090
Processor
Single Intel Xeon Gold or Platinum processor
Single Intel Xeon Gold or Platinum processor
(e.g., 16 or more cores)
Memory (RAM)
128 GB ECC DDR4 RAM
128 GB ECC DDR4 RAM
Storage
2 TB, SSD-based storage
2 TB, SSD-based storage
GPU
NVIDIA RTX 3090 (24 GB VRAM at least)
NVIDIA RTX 4090 (24 GB VRAM at least)
Why did you double the lines of processor, memory, and storage?
Save money: Get a 3090/24GB for <$1K. Stick with 64 VRAM. You don't need that many CPU cores.
Why save money? Because you want to upgrade to the 5090 as soon as it comes out, for running SD3. (I'm assuming the 5090 will have 32GB VRAM, but I could be wrong.)
Chances are very good you'll still be able to resell your 3090 for good money when the 5090 launches.
damn theres ppl out there who still buys xeons
Old school ... 🙂
hello, I am looking forward to participating in this community.
how to generate image
I suggets Forge not A1111
I dont think nvidia will put that much vram into a consumer card
they wouldnt want to decrease their h100 sales
I bet they will. Leaks already showed the low end cards will be 16GB and 24GB.
The 8GB cards aren't coming back.
AMD competition is forcing them.
Exactly.
Few months ago I bought rtx 3090 just because CUDA
otherwise I'd but 7900xtx
amd needs something like cuda
CUDA is a moat, but there are too many people working to auto-compile CUDA code to other architectures, and plenty of solutions already sort-of working. NVidia is definitely feeling the pressure.
They have to innovate hardware, or they'll lose their lead.
Because you want to upgrade to the 5090 as soon as it comes out, for running SD3
didnt they say the biggest model is 8B
24 VRAM should be enough
SD3 + T5 is supposed to use way more resources than SDXL is what I heard.
But remember SDXL was originally 2x the VRAM, because of the refiner?
No one used the refiner in the end. We'll just have to wait for SD3 to see what it's really like.
In early, unoptimized inference tests on consumer hardware our largest SD3 model with 8B parameters fits into the 24GB VRAM of a RTX 4090
that's from the official research paper
Yeah. I know. You have to see how worrisome that wording is. 😐
The model "fits" into 24GB? You realize I still need VRAM for the image gen batches, right?
They should have said: Can generate */sec on 4090. But no.
... and takes 34 seconds to generate an image of resolution 1024x1024 when using 50 sampling steps
They better get SD3 well under 12GB vram for the genereal peasantry.
I can guarantee the community will get SD3 running on potatoes... at 30 min/img.
Does it say resolution / steps / T5 use?
^^
1024, 50 steps
I couldnt find anything related to T5
it also says it's unoptimized tho
I don't think the bots are coming back. SAI is out of money.
SD3 will never come out 
Stop the propaganda.
Hey all, anybody know how to change the font size for grid legends in automatic-1111?
Anything's possible, but let's stay optimistic here.
Probably have to edit python.
If SD3 never comes out / SAI dies... We'll have to get a DiT architecture working, put together a dataset from Huggingface / Laion / Civitai, and do a kickstarter to rent the AWS H100s ourselves. 😮💨
This is reality, SD is over 
To be clear, AI is here to stay, and the open source community can advance it at way less cost on lower-grade hardware than SAI. The only reason we don't is because SAI exists.
no company no matter how big can compete with millions and millions of determined users
I'm working on ParrotLUX to integrate SD with my art, but I will 100% pivot to full-on AI dev if SD stops advancing.
But that determination is born out of need. As long as companies keep giving us freebies, we'll focus our efforts on other stuff.
Sicne wars inflation and taxes will destory all profit its up to us I am afraid. thsoe who do it for passion and interest not money.
Well, pretty sure AI stands to make tons of profit from the defense industry and the tax prep industry... But I'll switch to AI dev if SD3 doesn't come out.
How many dollars have you donated to SAI? Do you have a professional level subscription or higher? Yeah that's what I thought. And no, the community isn't going to magically pool together 10s of millions of USD to rent data centers for training a whole model
Maybe one in a hundred thousand SD users have even given a dollar to SAI. People use SD because it's there and free.
Good morning everyone. I don't care what you're doing right now, what PC you're building, what worries you have on your heart or what political ideologies fill your thoughts. I just hope you have a wonderful day. No strings attached. (tips hat)
well computing power gets cheaper always so meh... also quantum computing, in 5 years billy joe bob JR will have the computing power of nasa today so
ill train up a new model in 5 hours see you later guys :DDD
thats how fast thigns move
in fact maybe we will all have our own models in less than 10 years
like eveyrone cna make their own LLMs
what you do now on your phone wa sunthinkable 10 years ago
Heck yeah!
Not sure what's next but money as we know it is on its last legs.
But AI will play a role in whatever happens next.
no one will pay for non-local models lol
Not until they package it up like a video game. Simple and easy. I've been saying it for two years.
SD3 the video game.
What I said was: "the open source community can advance it at way less cost on lower-grade hardware than SAI".
You read what you wanted to apparently, and I'm not totally sure what you think I said.
zactly
i wouldnt waste my time on that guy,he seems to be either schizo or just your average terminally online redditor,just check his older messages to see
i mean to be honest, looking how LLaMa 3 was trained, I am really really doubt people will able to train using lower-grade GPU
Even though both image generation and LLM are different thing
for ordinary users it couldn't be easier
I still don't understand why no one made a normal game
there is only AI Roguelite
but now you can make an incredible project
with llama3+lcm sd
( only for those who have 24GB VRAM with quantization on or above )
yes but technology advances
first of 24GB of VRAM will be less expensive 3 years form now and also faster and mor efficient ways to train will come along
At the foot of Tianmen Mountain, a reader dressed in Han Dynasty attire stands beside a clear stream, holding an ancient scroll book and reciting 'Looking at Tianmen Mountain' with a solemn and focused expression. The warm morning light filters through the sparse clouds, casting a mottled pattern of light and shadow on the reader and the surrounding bamboo forest. The bamboo is lush and verdant, and the stream babbles gently, filling the entire scene with the tranquility and harmony of classical China.
I'm referring to the ease of use. It's possible to package up SAI stuff (and controlnet) into one simple and easy to use download (like a video game). Where the user (noob or skilled professional) can go into the options and customize their experience. It would not retract or remove any current capability.
Nobody seems to have a marketing strategy and end game for AI. But the money is in its ease of use to the user. To the children in schools and the mother sitting at home with 4 midgets running around. The more people that use it and the more monthly subscribers they have on one version of the software, means the more stable revenue that comes in. We're talking tens of billions. But they wont reach that until they market it. Like REALLY market it. Video game downloads and website UI's are the example.
The end game should be what money AI is bringing in globally in ten years. It's in the trillions (total) if they do it right. What they're doing now is how NOT to do it. You have to market it.
and then it will be truly impossible to replicate another persons picture
lol
welp
for now it took me a couple months to graps how to use all this SD stuff
good luck marketing that to the ADHD tik tok crowd
I will keep this case open, I mean sure there are tons of possibility in the future, including breakthrough of how we train stuff.
But as AI getting smarter and smarter, either company / developers are needed to get even smarter at working some stuff, and also the dataset.
they aren't going to use SD anyway
what's the point of SD, ask yourself
well
in the right hands SD styuff can produce better results than the midjourney and dalle stuff
but most people are too lazt to elarn it
they prefer to have ervtyhign spoonfed
lazy
one of the reason is just that they don't even need it. It is pretty simple.
Art advances humanity as a whole. We need to empower people. There's big money in doing so. But the software has to be dumbed down and cleaned up or that's not gonna happen.
or they just don't want a super-realistic quality image
just wanted a simple image that describe some stuff ( cat running with two feet from the market kind of stuff )
civitai peeps moment 
lol
well, yeah I mean, Illaysviel's projects ( Fooooocus, Forge ) is one of the attempt
Forge is good
but most people dont even need that much anyway
Foocus is cute but it not only dumbs the process down but also affect quality
but good enough for someone who just wnats to "have fun"
Kids that are in creative writing classes right now are the future artists of the world. Developers have to bridge the gap. Eliminate the complexity of AI art. Incorporate the "fun" but allow professionals to utilize the exact same software for commercial use. This makes everyone on earth a potential artist, inventor and product designer. We have to bridge the gap!
There are millions of garage inventors all over the world. Tens of millions. Imagine if they could invent in AI and take their drawings to investors. Image how far that woudl take humanity.
I am not crazy as that particular person who saying "art advance humanity" "everyone should access the AI art" although I agree several points.
but conventional drawer are probably still going to stay amid huge attacks from superb AI art.
sure the entire theory of "AI can't exceed human" is kinda... uhm. a complicated thing especially right now we are on the start of the sigmoid curve
kinda right
not yet
i prefer opaque waifus tbh
highly censored and worse quality
If i wont wrong it is also caused by the license isn't it?
right, or you could use a1111, but not the latest commit, because it requires so much VRAM somehow. On older commits I was able to generate XL 1.0 images with 3 controlnets activated (without the "lowvram" option activated) + hires. fix at the same time on a T4 GPU (16 GB VRAM).
Or maybe these people start doing something new and better instead of copying CUDA
12 GB are for XL now :)))
i could render sdxl with 8GB it just took like 4 times longer
then you better pay for a cloud GPU instead of wasting 30 mins for one img and killing your GPU at the same time
kinda - that's the plan
like welp you forgot there is 800M 2B 6B peeps
Yep, if stability ai will disappear tomorrow, there will be small groups of trainers who will continue to develop further. The problem is that the best/better models won't be free - you'll have to pay on sites like Patreon...
Exactly, there will be only small communities of supported models creators on like Patreon who will be able to rent let's say an A100 GPU for some decent fine-tunes, but training a model (the base model) from 0/scratch will be almost impossible as the costs are too heavy.
Agreed 100%. For the past two years I have been an official tester for several AI companies. And it's soooo frustrating watching them throw this window of opportunity out of the window. Large corporations like Adobe are going to ruin this for us. It's hard to watch.
Adobe = the enemy
Stability AI = the best friend we've ever had
how long and what does it take now to train a base model lets say. how many images do i need and how many nvidia gpus and for how long?
And how are these requirements likely to change in the next 3-5 years?
H100, not one, probably dozen of them.
single one of them cost 40000 USD
and usually stacked in server rack under SXM port
I think I see ridiculous statement about H100 earlier about its being a "cash-grabs" card
like you know why OpenAI invent a lot of H100 for their training? because damn it ( especially PCIe variant ) save an AWFUL lot of power compared to stack-up 4090
plus that H100 card is solely designed for just transformer training, aka AI models training
and well, NVIDIA just kinda ignore HPC sector on that card
You just forget that the power is only in the richest people's hands and they don't want the better of the humanity :)))))) I mean, wake up!
You're not wrong sir. You're not wrong.
On a more positive note I'm taking the day off. Time to mass murder some beetles, snakes and rats in old school EverQuest 2.
yep, they removed many celebrities and known people
is there some alternative to civitai where you can get celeb models?
maybe tensorart
NVidia plans to scale AI compute 1Mx over the next 10 years, so a model that cost $100 million to train today should cost about $100 to train in 10 years.
I'd have to do the math for SD3's cost from H100 hours (1 hour is about $3), but in 3 years you can expect it will cost a few thousand times less.
yes, you compare the technology from the future with the current one, but that tech will be even more expensive
:))
There are diminishing returns. Imagine version 1 is at 40% quality. Version 2 gets to 85% quality. Version 3 gets to 97%.
Yes, the newest generation will be more expensive and always maxing out the newest hardware, but for vanishingly small improvements.
SDXL is already "good enough" for most art, you just need the right tools built around it. That won't change in 10 years, even though the cost of using SDXL will fall to 1Mx less.
Instead, the benefit of new tech is that it makes things possible that used to be impossible: hd 3d, video, real-time 8k 3d VR, hands, etc.
How can i improve my performance on amd cpus, i use ryzen 5600g cpu and igpu, these are my settings --use-cpu all --precision full --no-half --skip-torch-cuda-test i got 16gb ram 4gb vram in my integrated gpu
lcm/hyper/lightning
Use ComfyUI, find tutorials / workflows / models for SDXL Hyper.
Do you have the error on the latest a1111 update with CUDA out of memory when you try to use even a single controlnet on a 16 GB VRAM GPU?
I would not buy a 4090 right now. The 5090 will probably be out this year or next year.
hey, AMD igpus can work with directml which is faster than using the CPU.
you can follow my install guide frome here to get it working:
#🤝|tech-support message
Has kohya ss randomly stopped working for anyone else?
no, but to be fair I seldom run it locally
if you dont change anything it shouldnt ever suddenly do anything
I run my SD stuff in docker containers
has anyone seen https://highlight.fm before?
What's the current best upscaler for photos?
thank you!
Hello. Are the stable diffusion models not available for fine-tuning through the API?
Hi everyone! Is there anyone that knows, if I upload my own song and then the ai makes a new version of the song, who will own the song? is it me or Stable? 🙂
If I did that with my own song, I would own the remix
is it that way for everyone? or does it matter what subcribe account you use?
Look up StableAudio license
I could not find that part so thats why i asked here 😁
ok, but this could be a legal matter, so even if someone weighs in on it, that's not exactly going to help you legally
can i get help here to train my own images on stable diffusion?
kinda new to the whole thing
unfortunately this can't be done on cloud GPUs as the providers like google colab always change something to destroy your "stable" working tool
sd3... when
maybe never 🙂
then
Is it possible to have conditions with Dynamic Prompts/Wildcards?
I have a Wildcard with different races, but for one of the races, I need to have a lora for that at the end of the prompts. The order matters.
For example:
race, more prompts, lora
Is it possible to do then?
well the lora is invoked via prompt
so just add it to the prompt of the respective wildcard
there ya go so in about a decade everyone will train their own stuff 😉
lol
but if u scan her one for one and she has issues and baggage and goes to therapy then you just AI that crap in too
so whats the difference
thts the point
u get the full realistic experience
not just a yesman/yeswoman
or... train her consciousness but finetune it to be in love with you
but really probably this is when this ai art or whateve ru wanna call it will mature..
is when everyone will train their own personalized libraries
then ai work wotn be reproducable anymore
it will become very personalzied
of cours ein 10 years who knows what crazy even more beter whackier new ai tech will come out
lol
qubit SD3?
quantum sdxl
pull instant random models out of quantum entanglement
Is there a channel to ask for LoRAs or is general the best place?
Trying to find something in Civit but having a hard time.
the local lora dealer, he drives a yellow van
Iam serious
well then im afraid theres none
u just either fidn soemthign u like or you get down ad dirty and train your own
I'm 65% sure this video AI is fake. It's happened three times already, always with a Chinese company.
- Release new paper: One SUPER AMAZING thing and one [totally underwhelming thing].
- The [totally underwhelming thing] is available now!!! Open source! Try it yourself!
- *The SUPER AMAZING thing will be available soon.
- GitHub stars, repo follows, news cycle boom.
...
Never hear about the SUPER AMAZING thing again.
Where's Ella SDXL, hmm?
(If I'm wrong and we actually get open-source video on that quality, I will be jumping out of my seat with hype. It's unbelievably good. Not exactly useful for art... Maybe some moving-character-poses for a game or something? Still, absolutely jawdropping.)
yeah where is ella sdxl? id like to use it please
the DPO lora and perturbed attention guidance so far I find help massively with prompt adherence. Is there any other way to boost it other than this mythical ELLA?
Time, money, investors and ethics. Sdxl is massive in comparison and has two clip encoders. Ella worked well with 1.5 because it likes simple, tags, like, 1this. Sdxl's main clip encoder prefers more natural word flows. Due to that, the amount of training Ella would need would probably be at least 100x as much to map text inputs to data in the model. Basically, it's like going from checkers to chess
Hi, I have a question. Maybe you can guide what are the suited tools for that task. I want to use images from for example myself as basis and then generate pictures of me in different situations. I tried midjourney with character reference option where you give it one or more seed images but its very bad. I then used it with an additional faceswapper but its average I would say.
My question would be what other recommended approaches are there? SDXL + custom Lora with pictures of myself? Or will that also be a hassle to prompt. I heard the SD gets limbs very often not good. Also do you know what is the best faceswapper out there?
Ipadapter plus is pretty handy and can do face swap stuff if you have models for it. Coupled with all the style and composition transfer stuff, mixed with its regional masking, it's ridiculously powerful and gives some great results. You can also pile on controlnets as well like depth or canny
(assuming you're using comfyui)
Anyone know any good AI tools to help with 3D modeling like for texturing/etc? that are free
Z123?
I spent hours trying to generate a logo with Stable Diffusion/Gemini/CoPilot, I tried different LoRas different models.. it doesn't work all results were bad
sd3 will never come out 
But is that actually Open Source? So far it's only on a website, right?
So i am trying to turn an 2k image from render to anime and i am using tile resample and line art controlnet. And its taking too lon on my 4080. It takes around 40 mins to complete. Any1 knows problem ??
You're turning it to anime, just downscale it and upscale afterwards. Anime is easy to upscale
first go to comfyui github, install. then go to the examples section and pick a lora example. this will be a json that you can drag right into the comfyui window, after that just download the lora of your choice and a base model, and put in the checkpoints/lora comfyui folder and refresh.
I like the sneaky comfyui recommendation you've put in there. 
ha, i was just about to say if you don’t like the node type of ui that a1111 is fine too
oh and Forge…
i think forge is faster because of the unet patcher
You mean "Last commit on Mar 8, 2024 - Forge"

in automatic1111 is there any quality difference between cpu and gpu random number generation?
what is the best discord for asking questions about controlnet (in Forge)?
here?
does anyone have any luck with using faceswapping in controlnet in forge?
I watched this video: https://www.youtube.com/watch?v=juP6gpiOY2A and it didnt make any sense to me
nope
only difference is comuting time
computing *
there are nodes like this in comfyui
i wouldnt do this on sd webui at all
you could use facefusion and etc since they use specialized models and its a whole framework itsef for faceswapping unlike a automatic 1111 extension built in
wouldnt be surprised with copilot atp
its an accurate model but still cannot translate special characters
Thanks for the input. So your opinion is that facefusion is the most advanced way to do it with the best results? I would've thought that ipadapter in controlnet was the best.
@silver steeple
yeah facefusion is still the best
for realistic that is
ipadapter face still works but they made a major updated on ip adapters last week-2weeks ago and that video is older than that
so alot of workflows have ipadapters not working and adding the new ipadapter to the sd webui is probs going to be a pain so im assuming they just didnt bother to fix it
works fine on comfyui though
reactor and faceswapper are also good alternatives (for video)
also ipadapters and controlnets are 2 different things to put it straight
ipadapters take in the characteristics using clip vision and controlnets just take in the poses
Exactly. :)) The question is where is this hyped tool too? https://humanaigc.github.io/emote-portrait-alive/
3 months passed
I don't understand. The ipadapter functionality is inside of the control net tab. I think you are thinking that I'm confusing face swapping with open pose?
controlnets basically taking the pose of the image u inputted
and ipadapters just look at the face and swap it via characteristics
controlnet models and ipadapter models are 2 different things
you need 2 models to run ipadapters , 1 clip vision model (for ipadapters) and one ipadapter model
and a model for controlnet
depending on your checkpoint
uh, so i guess your point is that the faceswapping in controlnet uses the current picture to get an open-pose depiction of the face, and then maps the desired image with the open pose that it figured out?
the origional question was if faceswapping using faceswap (and not IPadapter with forge) is superior -- and you answered by saying that these two things are different -- yet it seems like to me that you are just describing what happens under the hood and that you need some additional slight-different inputs to work with IPadapter for faceswapping
that does not seem to me like you are describing some sort of functionality difference (ie I'd want to use the faceswapping inside controlnet via ipadapter VS some other program that does faceswapping)
the reason why I feel like faceswapping inside of SD is that my image is generated via SD, so I imagine that doing inpainting will be maximially coherent with an identical prompt
what issue are you specifically dealing with in the sd webui?
can anyone help me upscale an image and add details?
are you getting relevant results
many upscaling software
ah, it just doesnt do what I want cause the tutorials are not so clear
u can upscale using models too like gpfgan
technical issues?
No like I used to have stable diffusion but I dont anymore
sd webui?
there are a bunch of extensions that do that as far as im concerned
its just that faceswapping with a1111 is buggy,doesnt have all functions,with comfy its better
its the same, comfyui is just more customizable
no it isnt
If we could combine the promt adherence of Ideogram with the aesthetics and customizability of SD the world will explode.
sd webui doesnt get regular updates with it thats literally all
thats cool to know but the webui itself is not where faceswapping is located
pretty sure people use extensions to load these models
not familiar with sd webui but it makes no sense to have these functionalities work differently on comfyui when its the same thing
the only thing that comes to mind is that ipadapter got a major update 1-2 weeks ago and that might be an issue on the sd webui? since it was a big issue on comfyui aswell
also that video has been posted past the updates
solution is to just use comfyui for faceswapping although its never going to be as accurate as facefusion and these other softwares
yes use comfy for face swap,webui controlnet doesnt have all functions and some face swap models are buggy like faceid
roop unleashed is still horrible but its still better than ipadapters with faceid id go as far to say that
So the settings that is necessary to run it is not clear. I would paste in pictures but i dont think I can attach images.
I have an image that I'm inpainting but I'm not sure if I should have 2 or 3 of the control net units open:
one possible configuration:
control net unit 0 - enable, (uncheck independent control images), preprocessor: insight face + ipadapter instant_id_sdxl
control net unit 1 - enable, check independent control image and add 1 photo, preprocessor: instant_id_face_keypoints +control_instant_id
control net unit 2 - enable, check independent control image and add 1 different photo, preprocessor: insightface(instantID) + id_adpater_instant_id_SDXL
you are just inpainting the subjects face right?
yeah
as far as im concerned its using controlnet to generate the poses and faceid to just process it
its pretty simple
so you need all 3 units to run it?
does control net unit 0 need a photo?
alternative configuration (skipping control unit 0 altogether):
control net unit 1 - enable, check independent control image and add 1 photo, preprocessor: instant_id_face_keypoints +control_instant_id
control net unit 2 - enable, check independent control image and add 1 different photo, preprocessor: insightface(instantID) + id_adpater_instant_id_SDXL
This is what tutorials suggest for me to do, but when i do this without having unit0 added, then I get error messages
do it like the tutorial
i tried bro, tutorial doesnt make sense: https://www.youtube.com/watch?v=juP6gpiOY2A
yo i need help at #🤝|tech-support
guys can anyone help me upscale and image and add details?
Just released this light painting LoRA.
https://civitai.com/models/410151/aether-light-lora-for-sdxl
Guys,i cannot about controlnet
I already download from url
Also in available extension(the sd webui controlnet manipulation)
But still,its not showing in my Automatic11111 UI
Can ask for help?I think this is common problem?
nope
could you specify what ur trying to do?
3 controlnets is a huge overkill for face swapping,
this extension appears to heavily rely on an older version of IPadapter which does not work
to faceswap properly try to use comfyui or facefusion and not sd webui, its a really compact ui made for complete beginners and you wont be able to fit in complex mechanisms in it succesfully at all
yeah sure, are you using sd webui?
I trying to appear the control net tab on my Automatic1111
go to images>batch
Hi Guyz i am new here can any body help me to prevent text generation on the image from stable diuffusion
tell me if can we finetune the Sdxl for not producing the text on any image, becasue stable diffusion is very bad in producing text so can we somehow stop sdxl to product any kind of text on the generated image, Currently i am using negative prompt
NEGATIVE PROMT = "text, fonts,words,3d, cartoon, anime, (deformed eyes, nose, ears, nose), bad quality,bad anatomy, ugly"
but it does not listen to the negatrive prompt so well
i want to generate canva template for christmas halloween etc with no text on it but it always put text with wrong spelling
@uncut salmon can you please guide me
use "watermark"
in the negative
okay thanks, i will try but can we achieve it by finetuning as well?
no idea honestly
never seen anyone finetune a model like that
only ever been finetuning a language model and nothing more
okay and one more thing is that, did you ever fientune a model on custome prpoduct image, like a perfume bottle, and then generate Ad for that custome product?
no
only done it with 3d/2d figures
never gotten into the AD industry and i dont think most people are either
atleast in this server
okay got it man thanks for the help, i am just looking for some one who can help me to finetune model on custome product image
i tried lora but results are not good , let me show you
sure
If the model you are using had watermarks, meaning the training dataset had them,the images produced will have them. But if it only produces watermarks occasionally, the negative prompt is the best way "logo, text, watermark"
@trail lion thanks for the help man.
can you guide me the best way to finetune sdxl on custom product image, i want to train on product image and then generate product ad with that
that's a pretty complex topic. start with a tutorial before you ask for help. look for a youtube video titled "LORA training explained" by Not4Talent.
hey where can i find help with nsfw models
you cant ask here, most discords have policy against NSFW, but maybe on reddit there are several nsfw subforums
actually I realized there is a discord tied to the r/stablediffusion subreddit, and it does have a nsfw area, that you can only see after you confirm 18+...maybe try that?
Oh thanks il check that.
Question about bad prompts. what happens if you add a lot of bad prompts?
image will be bad 🤣
ooh
I'd say best image come without negative Prompt
This room is without pictures ... there's another one if you want to send pics
so if i don`t use negative prompts i`ll get painting XD
by Picasso)
If you want you will get a photo ... I always start without negative and only add what I want if needed.
I'm about to say something I shouldn't say
I have a feeling that
tomorrow is the day
I can feeeeeeeel it
hi
the model will avoid all the words it'll come up with when generating an image with the negative prompt you tell it
so nothing really happens, you'll jsut get the desired picture you want
I also use quite many words I caught on the internet
not really
Wait, really?
didn't know, lol, but stable diffusion is suspiciously good at nsfw
Does StableDiffusion 3 exist locally yet?
no
no, will never
heh
Damn the bot will never return it seems?
no cause they are hosting their api for money
they can't just give people a free way to use it when they have that
i want sd3 on huggingface
hi, is there a way to tell what models are used to merge when u download a model from civit?
what a cynic 😄
does someone know a tuto of how to use improved human motion module
like for the ai generated video
or just help me ?
waiting another year to release it to hugginface would be pretty smart, tbh. they gain nothing by releasing it into the wild early
so its not possible for now ?
like getting super realistic human videos generated by AI
crappy flickering videos lasting like 2 to 3 seconds is possble
so unless someone has unlocked some amazing new tech, I'd say we're not quite there to the 'super realistic' part
fyi there's a stable-video-diffusuion forum where folks that actually care about video hang out
I really could give a lick about it
really what's the forum name ?
Hey guyz in training dsxl dreambooth can i use 1360 x 768 aspect ratio images as dataset?
assuming you mean 'sdxl', but yah make sure you turn on ratio bucketing, but you can use non-standard sizing in your training set if you enable bucketing
im trying to generate a image of 1600x400 of a human but it always put two human in image
does anybody have a prompt or something to fix it ?
lower the resolution, if you're using SD1.5 model make sure to have resolution max. 512x512, for SDXL 1024x1024, but you can have different aspect ratios, make sure the numbers can be divided by 8
@trail lion thanks
if i do finetuning sdxl i will recieve GB of safetensor file right?
because in lorea it gives you few MB safetensor file but i don't want to do lora finetuning so this dreambotth finetuning should give me some GB of safetensor file right
hmmm.... how complicated will that get? never used ComfyUI. But saw that there can be workflows shared. so would I need to do all that from scratch?
hi, nice to meet you
helldivers moment
Switching to comfyui would like be a jarring transition for people who aren't used to node based workflows. There might be ipadapter components in the diffusers like forge, a1111, foooocus, etc, but it's likely a plugin you'd need to install them
I don't use any other program other than comfyui anymore, so I'm sure other people would be more up to date on the alternatives
Things move too fast in AI to wait for anything a year... The landscape can change completely in that time.
Suno Udio Haiper Nooise, like every week something
ok, let it change then
who else is competing in this space right now? there's' midjourney, dalle. I dont see an issue with them trying to monetize for a while before opensourcing the model. who else is releasing their code? that's right, nobody, so what's the big deal=
whatever happens, we already have their prior models, no harm no foul
if they never release SD3, I'd be a little bummed that the operating model couldn't persevere, but still pleased with what they've historically accomplished
If Musk is so into using AI for the benefit of all he could throw a couple of billions so SA can go on doing their thing without having to worry about money. 😄
We just have to worry about this nonsense until AI kills money, its already a ridiculous madhouse with nothign to back ti up, more debt than money in the wrold.
but for now we have to think about it
its such a waste of human potential tho dont you all think?
How many greta minds have to do some totally nonsense idiotic task to "earn a living" when they could do so much more and benefit everyone so much more
meanwhile man babies get millions for no reason for oil and build mega projects
The only reason money was kept going, really, is greed because deep down everyone had hoped to "make it" and so it's ok that most don't and are disadvantaged but with enough work you could rise above and display your might, lord it over others, that was the main point if we're honest. However now that the hopes of making it are near 0, the American dream is dead, and unless you inherit it or are already rich from previous times somehow, maybe now bitter humanity will rid itself of this concept. Because now, and AI will play a role in this, money is becoming more of a hindrance than incentive. Birth rates are plummeting because smart people realize they can't afford to procreate anymore.
Ironic how the very thing that supposed to help us earn our living, existence, now it's stopping us from having kids. And this is happening in all developed countries. Interesting times. Costs like 200k to raise a kid, then inflation, taxes, by the time you're done you're finished.
In communism money was removed as an incentive also as the state gave you enough to live on and have basic comforts (in the beginning anyway) but back then more work had to be done so classes had to be kept, working class, bureaucrats... This time around with robots and AI that division won't be such a big issue anymore- in fact there is loads of well educated people who don't have anything to do now anymore, they cant even go back to menial works as those have been industrialized ,there is enough power and wealth for everyone in the world now its just most can't afford it. and why did communism fail anyway? because of money, eventually greed did get the better of the ruling elite and they exploited it. Ok that's enough venting, but what's happening here in the microcosm of SA is happening globally
WHATTTT!! OMGG REALLYY!?!?
wowww
can i use it on huggingface now
with weight
s
what is website name? can you tell me
i'd love to check it out
real
sure sure
you only need to put your cc number and your full national id i swear its real
its real i only lost my kidney
he looks like that mf who sells you sneakers and watches in a dark back alley
@bleak matrix erm moderators please evaporate this individual
bro looking for a new unicode font
hi
crop it to the face, and use that one at the prior resolution, then paste it into the original generation
Hello everyone. Can anyone suggest me a feature in which I can create layers similar like photoshop through AI
I ment like taking merely the face, scale it up to the original generation (in Photoshop), use it as an input image and copy that detailed face over on top of the original image (once again in PS).
Search for acly krita diffusion
It creates a plugin for Krita (which is like photoshop)
gm

stable diffusion is no longer open source, is it true?
Hi guys, does everyone have experience with consistent character by using adetailer?
what's the most diverse / flexible realistic human model?
for SFW (though ability to do NSFW isn't a problem, just not the use case)
The base model
but if you want "pretty" and pretty reliable at typical poses and photo angles, check out models like juggernaut 9 (10 is not really that great, but they changed how they tagged all their data for training)
though keep in mind that a lot of the "realistic" models off civitai are very heavily biased and overtrained, so you'll start to see a lot of sameness between people in generations
yes
sd3 will never come out
if you're going to be genning generics, dont complain they all look the same
1.5 is still the king of photorealism given that it was nurfed on every single release afterwards
until finetuned SD3 comes out ppl will have to live with that fact that if you want a good character you have to lora it
How would someone go to create a unique real life look person? Should I train a Lora on 2 different faces and hope they will blend together into a unique one when i run it after? Or maybe just generate someone random, inpaint, generate on grids and then train the lora on that?
what is the diff between hyper stable diffusion and the stable diffusion 3 ? any one knows ?
Hyper is a finetuned model(or lora that can be used with other SDXL models) that's good at making decent quality images in fewer steps(quick) and it's an SDXL based model. It's not the same as SD3.
Is this the server for Stability AI members or is there another server to get some help regarding issues with their API ?
Hello! Is Stable Diffusion open for new chain partnerships? Thanks!
Hello, I need help modifying an image, which channel should I ask?
kinda true, 90% of the 1.5 and XL models have the same generic faces for woman for example. Only if you add more details like ethnicity, eye color and so on, you "can" get some different results.
And I talk from experience
best way to get new faces is to use IP Adapter Face Plus V2
up
or dreambooth it, but require more resources
That's exactly what I'm wondering too
what if you do a lora/dreambooth training with multiple faces and with the same keyword activation? Will the training combine all of them into some "unique" faces or it will all be a mess?
I think the training will get confused on what you want it to learn most likely
I don't think they have a special server for Stablity AI
that's one of my theories too
if you're trying to generate something unique, it kind of already does that right? just ask for a photo of a man or woman
describe them, etc
yea, but this is not only about the people
and if you need to repeat that look, then you need to create a character sheet
what about combining multiple styled images in a lora/dreambooth training using the same keywords? Everything will get combined or?
or the generation will be worse?
more like let's say "fine-tuning", but at the level of LoRA/Dreambooth
hard to say, it does combine things, so like if you have 10 pictures of a woman with long red hair, and 10 pictures of a woman with short black hair, and you train on just those, what happens when you ask for a woman later? do you get long red hair, long black hair, short red hair? short black hair? I think most likely you will get mainly the varieties you had in the dataset
but you can ask for long black hair, and probably get that, although the model probably knew how to draw that anyway
I guess just try it and see
some people recommend to not caption the images for Dreambooth
is it really better without captions?
you have public models?
so I can see some results?
because I'm really not a master at training, but I wish to learn more
sorry no, I dont publish, it's a hobby for me
Because I tried using the settings/training parameters I saw on this channel https://www.youtube.com/@SECourses/videos and the results weren't spectacular.
he even recommends using regularization images for women/men
like 1000
I'm starting to think that he's more like a fraud or something
But the worst thing is that there aren't good LoRA/Dreambooth/fine-tuning tutorials on YouTube or anywhere
guess all the trainers keep this as a secret
I'm a sales closer and social media agency manager
hello gguys
i have a issue with my stable diffusion installation
RuntimeError: Torch is not able to use GPU;
i get this
what do i do
help me out P:)
You let it use the gpu
how do i do that
depends what gpu u have
my condolences 🙏
Sd3 is never coming out.. 😢
Hello everyone
Do you know any good courses where I could learn how to use Stable Diffusion?
maybe here
Thanks mate
Try the tech-support channel. That card may not have enough vram though.
You should try to at least find a 12g card
hi
Is it possible to find the artist or style of a generated image(s)?
There is a specific style I really like when generating with Pony, but it's very rare to generate with my current prompts and it's very random, even if I use the exact same prompts.
Don't think so. You can look at the metadata of the image with pnginfo to see all the parameters used to generate it.
can you run pixart sigma locally?
Yes
and? what's the deal?
it means he's a heart breaker, dream maker, love taker
and he has cool pens
everytime I read "social media agency manager" I'm just thinking at OF agencies
:)))
as these are 90% of the "agencies" from nowadays
it will but even if ti wasnt we could fall back on Cascade and run with it...
they removed the room so we forget but cascad eis just ... there.... now...
ye with endowed GPUs refine cascade and ye with great midns make tools for it...
beats waiting and cryin
man, I'm so willing to learn how to do LoRA/DreamBooth/fine-tunes, but no one will show how to do that
until all this "secrecy" won't go away, the smaller groups/communites willing to learn and train can't really do much
I searched the entire internet for a good tutorial
found nothing
only outdated jokes
which doesn't explain even all the settings
it's like it's "open-source", but it's not really open-source
I mean atleast on github you can see all the code from a repo, but you don't have a tool or something to see everything used for training a model for example
settings/parameters/datasets
and I really don't have the necessary time and GPUs to run hundreds of trainings until I find the "perfect" settings
if this road won't be unlocked, then it's bye-bye after Stability AI won't release any other model to the "open-source" community
this all "open-source" thing will turn into like monthly subscriptions to a few "good" trainers on patreon or Idk where
to just access decent/good checkpoints
that's the problem
and here was someone earlier that said that even if stability ai will disappear, the community will unite and train its own models :))))))
nope, something like that won't happen. Only the best trainers will unite to make profits
the "open-source" will be done
wel, there is no "ideal settings" for every case, at least with current training tools, you have to tweak them, like every time, so you do have to run a bunch of trainings
yes, but there can be a direction
some examples
yup, and there are tutorials out there that cover all that
I dont know, what's a joke tutorial
read some civitai tutorials
they don't go into details
"Just use the default Kohya SS settings, they're just fine"
lies :))))))
check out this one https://youtu.be/xXNr9mrdV7s?si=PalGotNpAeYl4Zy4 he goes over captioning, kohya setting, etc. and for when you're beyond that, see this https://rentry.co/59xed3#prodigy
but like I said, you will have to tweak stuff...and what worked well in 1.5 doesnt in xl, so there were some challenges
Heya, i was always thinking about starting this AI artwork stuff (classic drawer of nature or when i am nostalgic old animes) and seems like i am stupid. I cant find a proper download link and have also some Other questions. If someone would like to help, please pm 🙂 (sorry for my Bad englisch haha)
it sounds like you're a beginner and dont know where to start generating photos. did I get that right? go to this channel here on the discord https://discord.com/channels/1002292111942635562/1002602742667280404 At the top click on pinned messages and you will find help for installing automatic1111 for your platform, for example "[NVIDIA] Here is a quick Guide to install Automatic1111 Webui for Nvidia GPUs (Stable Diffusion) on Windows"
Yeah you got that right, thank you so much 😄
Shut up you disgraceful unnimportant piece of slave
All you do all day is talking and talking and talking
While no one is listening listening listening
:)))))))
imagine SD3 requiring a MidJourney/Dall-E account
😮
"A golden trophy, cartoon style, green background"
Potential NSFW content was detected in one or more images. A black image will be returned instead. Try again with a different prompt and/or seed.
is it the word golden that is making stable diffusion think bad things?
Thanks, will check them out
sd3 open source is a myth
Hey everyone, nice to meet you.
If you are looking for a skilled senior software engineer, please don't hesitate to dm me.
the wild west free for all of AI is ending slowly it seems, collect all you cna now make your local libraries and setups so when computing power gets cheape rull just train and make your own dream wordls. money will ruin this too
ill just refine the shit out of cascade and sdxl with the next generation of GPUs, i advise you to do the same
its been fun
nothign come close to local
all this online stuff is trash
its a shame friends its always been sad when money mocked things up but this time by the time you make your first million it will be worth 200k so is it worth it, its a dying system
but no one cares
blinded by the buck
Civitai is still around
god forbid they go the way is SA
😦
download everything folks
make your NOAs ARK for AI
What is SA?
stability AI
Oh I understand now
get a 5TB hard drive and save evderything lol
I only have 12 gb vram
so when better GPUs roll around in 3-5 years you can jauts make your own individualized SD3s and Ideograms
I have a 64 gb of ram
I can technically train AI if I wanted to
64gb of ram can be used for training I heard
It would be very slow, but doable
I could just use photoshop
lol
yeah it would be doable will just take a while\
takes me a few hours for loras on 12 gigs :))
Is this SDXL loras?
yeah
sd15 loras take less than 30 mins
i assume proces sis same for actual checkpoints just more dataset and longer wait time
theres nothing to learn really or wait for tutorials
but current GPUs are too slow for traiign for most of us
probably will still be after sd3 drops
