#š¬ļ½general-chat
1 messages Ā· Page 43 of 1
Yeah I was trying to find the .yaml file for the default 2.1 online but it's like finding a yaml in a haystack š
But the 2.1 based custom models don't actually need the default 2.1 model installed right?
https://github.com/Stability-AI/stablediffusion/tree/main/configs/stable-diffusion
For 768x768 its the v2-inference-v and for 512x512 the v2-inference
nope
Ah awesome thanks, got confused becasue the yaml file names are different than the sd2.1 model file names
yea you need to rename then
Probably a dumb question, but how do I use loras on top of a model in a1111?
Hey there, can someone help me out
I downloaded a1111 on a new pc
but now it doesn't open the small web-ui element before going straight into stable
like the place where you can adjust settings for launch/add parameters
right click and edit the webui-user.bat to set launch paramters
Someone care to share an up-to-date step-by-step tutorial on how to install Shivam's Dreambooth locally?
There is a one-click installer for Auto1111 that installs all that stuff for you
I understand
However on my previous pc whenever I would launch the webui
I would get a small menu with options
On my new pc I do not
I'm trying to figure out why/how to get it back
It would have launch options etc
Damn I went from a 2060 super to 4070 ti and I feel like it really didn't make much of a different in speed. Only doing 3.7 it
What is LoRA?
how do you upres images created with controlnet in specific pose?\
Different version / install
I have the no launch window version now too
I am just buying a 4070TI. It's not fast?
I mean I figured the improvement would be huge
The 4070ti is definitely fast
but maybe not specifically for AI things?
I see that its only using 20% of my gpu tho
and its hitting 3.9-4 it
nevermind
Did these changes and now I'm hitting 11.6 š
yeah i was about to tell you about that exact post lol. i got a 4080 and it made a HUGE diff
about double
I loaded a SD 2.1 model into google colab and the image generation time is taking way too long (5 min). Can someone please help explain why this might be happening? I have the 2.1 model in a folder that has 1.5 models in it. Could that be the problem?
I went from 10-12 on stock 4090 to 30 with cudnn fix + xformers when I did it a long time ago
I tried to get pytorch 2.0 running a well with new xformers and ru triton for apparently possible 40, but that didnt work out
without a proper guide, I bricked 3 installs and some hours trieng for 30->40 jump
the ones I got now is Cuda 12, cuDNN 8.8 beta running perfect
lol i was attempting pytorch 2 for a bit too, but the updates that happeend to cuda 11.7 from 11.3 were helpful enough
and the older 0.16 cxformers instead of the whacky 0.17 for triton and torch 2
i'm using a nightly of 0.17 lately. not sure if it makes any difference but no harm is good
does anyone know if controlnet is planning to support 2.1 at all?
there's already support, but people need to spend time to train the models. there is an example training project in the main github for it
i think someone has trained an openpose model for 2.1 already too, but its rough i've heard
hm i guess maybe i am not understanding, when I use controlnet I can't use the base 2.1 model correct?
cuz i always get errors when i have 2.1 selected
sad that most of the community is sleeping on 2.1 and still stuck on 1.5 imo
Bookmarked for when my 4070TI arrives
thats correct. the code for controlnet supports 2.1, but no controlnet model is trained yet. you know how you have to select openpose, hed, scribble, etc.. all of those are different controlnet models. they're all made for 1.5 specific, but controlnet models can be trained for 2.1. they just haven't come out yet. takes time to train. i think each of the originals was trained for a week on a single 3090ti, but i might be off on that info.
oooh ok i see, so if i have 2.1 selected i will get errors when using canny, hed etc..
ty
yeah. until 2.1 versions of those are released, and then you'd load those in tandem with a 2.1 base or refined model
wow i hope people start training on 2.1, i really dont get all the hate for 2.1 tbh, obviously i know the criticisms but i still feel the pros largely outweigh the cons when comparing 2.1 to 1.5
https://huggingface.co/alfredplpl/ControlNetForSD2 it's rough by the looks of it, but these are coming along
oh nice! ty
it's just one openpose version, and its.. its rough
i can wait haha
yeah i just noticed, author even says its a proof of concept
cool well hopefully we'll see something usable soon
yeah i can feel it coming, in the air, tonite, oh noohoh
I am in the process of choosing a computer to buy. There is a set of 3 used Tesla k80s for sale at a low price. Is it better to use them together or buy a 3060ti and use it?
would 3 k80's be more powerful than the 3060ti? Certainly more VRAM
More VRAM doesn't necessarily mean more powerful
you can be slower with more vram. 2 params are useful
Cuda core count/speed of making pictures for a given pixel count
VRAM => higher size pictures or more batch size possible.
The last thing, higher batch size, can result in some speed gain though
1 batch of 2 pictures is faster than 2 batches of 1, if you have enough VRAM to run it, with the same clock speed
correct, like 4070ti is faster, but for SD the 3090 has the strength of more VRAM
how do i see older #1023999442338201721 ? i can only see the winner and not the other submissions ?
hello , im new here , i was watching a video by corridor crew and saw that they were putting their own pictures and turning them into anime using AI and stable difusion ... my question is , does anyyone know what ai they were using ? most AI picture generators use just promts and i barely can input my own picture
link the video
yep, I watched all they did using SD. they doccumented it at the first of the video you just linked, but let's break it down
they use Stable Diffusion as main tech here
but they do a lot more
first they shot on green screen some scenes with good poses and movement
SD with A1111
is SD a blender add on?
then, they have trained a model, taught the AI, on a style they want, here a specific anime style. or they use a pretrained model.
Then they use trick to stabilize the video in SD, and in other movie making software
SD is also a blender add on yes, but at its core, it's an AI that transforms text into pictures
what do you mean by this?
they used deflicker filter in davinici resolve
multiple things. same seed, control net, and they have a deflicker tool in their software, resolve studio
so how did they get their hands on that specific SD program?
it's here and free to get and install and personalise at home š
OK, but they only used SD to turn the images into what they wanted, right?
SD also works in Photoshop
so they got stability AI?
it's the same program as we all use around here. they choose a nice model on something like "civitai" or they trained one, and then tinkered with the generations parameters, using their knowledge and other AI users around that have found tips to help progress in the animation sector
yes, like everyone here
yeah, we call it SD for short, Stable diffusion
it's the name of the AI itself
S.AI is the lab that published it
nice!!!!
So there;'s a video addon or extension for SD?
deforum
or lots of scripts and extensions in automatic too
batch processing all frames in img2img mostly
you input a fixed seed, find the right controlnet setting, and batch the folder where you unpacked all your frames
then just ffmpeg the output into a new movie
then you can go with flowframes for interpolating more frame for example, or deflickering
I saw a video of a woman using SD to do motion capture of sorts. Made video of her, then turned it into whatever moving the same way - in SD
what video card is needed for that?
not that big, it will depend on what size of frame you do for sure, but the controlnet model's aren't that heavy
I hope the 4070TI I bought can do it
6 should be good imo, 8 almost sure (talking for 512x512 pics)
the parts are on the way, once I stopped depending on microcenter

is there a way to increase sample size for the UI in some config for auto1111 ui past 150?
yep
I have to resample so many times in img2img bc it didnt quite finish
esp with many prompts
with just 150
you can change all defaults, min, max, and steps of the UI in config-user.json
sorry
sweet thanks
does the sliders and stuff adjust
or do I have to edit their values seperatly
nvm ill just try later
so the scale just changes?
you need to reload the UI for that though, in the "settings" tab
yep
you can always click in the little space where there is the number itself, top right of a slider, and enter a value
and last thing, if you have 2 widely different uses of SD that would require different sliders, you can... have 2 different config files, and have 2 different bat files to start your UI. Depending on what bat you start, it would take the corresponding config (there is a startup option for that)
is there a doc for the actual max caps to set?
not on such things nope
most will work until your hardware can't handle it, or you go over the INT limit ^^
with the xformers it shouldnt be to hard on the hardware for upscaling
still trying to figure out how to use stable diffusion properly
while without the 2048 is p. much really a lot as max
its like 1/3 or so? vram usage
some specifics that lost you ?
with xformers
it's about 1/3 less used, so 2/3 vram left
i dont know what prompts to put in
i dont know what a proper prompt should look like
positive or negative prompts?
positive
anyone know when mj5 is release?
I think the fastest is to just scribe a landscape
to get a hang of basic things
how to add things and change them
well, prompt is quite a thing to master for sure. You want to try to describe the picture as it would be named if you encountered it in the wild on Internet basically.
do you know these crazy long japanese manga / anime names?
that will include framing keywords if you go for photgraphy, or real life events, ...
oh
with , on tags
thats helpful haha
I like putting them in a txt editing below each other
that normal ui is kinda useless
except for pasting it in
what tags should i mostly use?
also, looking at https://lexica.art/ can give some examples, even if a lot aren't the best prompting there isstill a very nice place to start
where can i download Stable Diffusion to my pc?
thank you
nvm I just saw you can spam enter to add rows
didnt expect that, was searching for normal web things like a drag thing
check more in #š¤ļ½tech-support for that, but here https://github.com/AUTOMATIC1111/stable-diffusion-webui/ or here for example https://invoke-ai.github.io/InvokeAI/
thanks
landscape
, colorful
, highly saturated colors
it also helps a lot to use few
and check what they do
hi , can anyone explain "model hash option" to me
Do anyone have a vps hoster for free(im highly broke)
Mastepiece
a hash is like a file "signature". You can see the hash of a file online, before downloading, and checking it's the same when in your software, to make sure it's not broken or tempered with. it also acts as a "shortname" in automatic in some way
ok thank you
gmmmm
has anyone used stable diffusion as a level designer for games yet?
https://twitter.com/rustyedrusted/status/1630677027522244608?s=46&t=z88V3Tcxq9Gxao3hXGPU3A
does anyone know if stability.ai charges for things that get NSFW filtered? seems to happen a lot for no good reason
"you have not been charged for images blurred".. great.. just needed to know if I had to charge users
Chilloutmix and basil...they seem to wreck hands more than other models I've used. Is there a vae or other way to get better hands with these two models?
Hello, new user here
Are there any educational articles/videos/tutorials for newbies?
One message removed from a suspended account.
One message removed from a suspended account.
ControlNET is freaking awesome
@steep mortar Thats what I hear. I just need to learn to use it, somehow š
Im only use the openpose model
my question is tho, is it also using my model ckpt or is it all going thru openpose model
Woohoo! your model file is controlling the file output
Anyone have insight into seed length as it relates to variability? For example would 100/101 seeds have the same variability as 1000/1001?
I need help finding these LORAs in Chinese!!! <lora:ē»é£ åē„:0.4> <lora:ē»é£ ę°ęµ·čÆ:0.8> <lora:ē»é£ ē¦č¶:0.3>
aanyone here knows about these loras?
There are a few filters on Playground that I'd like to get, I've been searching on Civitai and have not found anything like them: Dream Haven (the text prompt show up as lushill style), this one is sorta painterly/fantasy, I guess it can be replicated with another model/ti/lora, just have not found one like it yet. Geometrieva Style (text prompt show up as geo2009 style) its kind of cyberpunk but with a fixed palette of colors and not dark/night scene oriented. This one is hard to replicate with models/ti/loras I've seen. Perfume Style (kind of like Analog diffusion but with more glamour, I guess I can just use Analog diffusion and add glamor fashion or objects into the prompt to compensate).
Is this the official stable diffusion discord? No suggestions or bug section?
I posted this in the tech-support channel but that seems dead.
I'm getting an error when trying to use openpose and controlnet.
\stable-diffusion-webui\venv\lib\site-packages\ffmpy.py", line 98, in run
raise FFExecutableNotFoundError(
ffmpy.FFExecutableNotFoundError: Executable 'ffmpeg' not found
Anyone know how to fix this? I downloaded an extension and it must have messed with some settings because it used to work perfectly.
Yes, this is the official server
That's why it has "Verified" badge
I must run SD with Mac Mini M1, sadly, so I am not running very fast.
But also, it seems I cannot make images bigger than 512x512 (1024x1024 with hi-res fix)
Is there a way to work around this? Any kind of setting that I could change to allow this?
Thanks. I installed ffmpeg through the extension area but there's no exe there....
what ckpt is recommended for lora training of a real persona? The guides I've been reading are dealing with anime characters.
First try inbound and I'm hesitant to run on a pruned model and doubtful about the quality of base SD-1.5
are you training sd 1.5 or 2.1 lora? There are plenty of photoreal models that you can choose from on civitai. Pick anyone of those for non anime oriented loras that you want to train
you can select "photorealistic" on the bar above the model images to narrow down your searches
if you don't want your resultant lora model to always show NSFW content, select models that skew less towards those, like classic negative if you want cinematic look (2 versions for SD1.5 and 2.1), realistic vision (1.5), there is one called cmodel (forgot exact name) for 2.1, mangled mix for 2.1, but definitely tons more for SD 1.5 based photoreal model for you to choose from. Choose based on their aesthetics and creative capacity (how "wild" can their images get, based on the samples people create), maybe you are going for realism and want the fantasy elements out, or the other way around, you may want more alien/fantasy/wild artistic results.
i dont know if anyone can help me but my automatic1111 has stopped working over my local network, works on my main pc but not any tablets. was working perfect up until today (using the --listen arg. in webui-user.bat)
I've been working on this too but I never got it working in the first place. I use the other situation where you login with Gradio
I'm trying to make a chromebook connect locally with 127.0.0.1 but now my cmd window says 0.0.0.0:7680 and it doesn't work. Tried port forwarding I'm just over it the login works for 76 hours I guess
I guess I'm just bad at port forwarding
so, im completly new to this and yesterday i tried for my first time to upscale pictures with image2image, but always i go above 768x768 i get the error cuda out of memory even tough it says theres like 7 Gigs free (using 4070ti) any ideas how to fix this?
Did you select the sd upscale script ?
hmm im not sure actually, do i have to specifically select it?
if so then i think i didnt
Yes if you talking about upscaling
Thanks for the advice, I've decided for realisticVision now. The goal was for sd 1.5 because I hear there are some incompatabilities with 2.1 still. Good to hear that a mix is fine!
Maybe you tried to just change the output resolution
i just selected img2img selected the resultion i want and clicked on generate
no
You don't use img2img to upscale, what you are trying to do is to diffuse a new image on top of the img you fed the gui, if you select a high resolution, the error makes sense.
You have three options, either generate your desired image again with its seed and check hires fix with .3~.7 denoise, or you choose the Extras tab for a very straight forward upscale method, or as CS1o suggest, you can get the 'ultimate SD upscale' script that's used under the img2img tab.
Edit your webui-user.bat and add --xformers behind Commandline_ARGS= then restart
ok thx, gonna try that once im home
Then it should work and yes for upscaling try the sd upscale script under scripts dropdown
Or create the image with highres fix
Does anyone still use instruct pix2pix? It wasn't effective when it first came out, and everyone is used in controlnet now
Do anyone wana help with training a model using noise offset, I can do it on my own but I do not have a great gpu.
I want to train a lora model on the best images as now we can train images with proper style
Please Feel free to DM if you have high vram gpu and basic training experience
Why should it?
I had no idea who that it before you mentioned him (I still don't really, but I see that's a person lol)
You can train AI on any person's photos if you want..
nah and I don't really care
funny
Man, it's really amazing to see the training process of an embending
Like, in the start it's crap, but gradually looking better and better until it's just like you imagined it
Oh, Andrew Tate? You may want to fine-tune using images floating around google
Anyways, imagine MJ server allowing non-members to chat in other channels, that would be a massive shitshow considering 7m users (or higher) likely contains a bunch of anti-ai dudes
if I have a embed file to insert in the negative prompt. Do I format it like this [filename] ?
how do you upres images made with controlnet in specific poses or with specific facial features? I usually generate images in 512 pixels and then take it to img2img and increase the resolution to 1024 with 0.6 cfg scale. It introduces a lot of new details and corrects a lot of mistakes, but the pose or facial features gets changed. What process do you use for upressing?
hye
Hey ! welcome around
you're teaching a machine with education! wild right?
hey hey how do i use Stable Diffusion Version 2?
anybody have experience with gammac dreambooth?
yoink stable diffusion 2 model in your models folder along with the .yaml file
take care of prompting, and be careful in using negative embeddings too
1.5 might not have required a lot of them but 2.1 does, you're doing more sculpting here
best resolution to work with is 768 x 1024, 1024 x 1024....pretty much high res images
amazing result
with mangled hands
pads
where do i go if im having issues getting dreambooth to work? dont want to bother here
Yeah! Feels like my own child lol
Can somebody help me with canny controlnet? How does that work? My results aren't those black and white images but something else. Model installed, using canny preprocessor
if you're only getting an image and not the second one for the model, then controlnet isn't running
what is your GPU?
Guys how can i use control net?
What exactly do you want to do with it?
Where can I find sample training input images of different man/woman for training Dreambooth? Photos of the same person (man/woman) in different poses, backgrounds, etc. I don't want to use my photos and, since it's for a production website, it should have privacy of those people in mind, so it should be something in public domain or so. Any ideas? Is there a 'sample input images' folder for Dreambooth somewhere that I could use?
Do you want using openpose?
Click on the .bat file in the installation folder
It will open a little black command window and will run a few things. Takes a minute. Then open SD in your browser
Thanks
Click on the .bat file in the installation folder
I think I got kicked out of the midjourney server for asking how to rerun stable diffusion LMAO
No warnings lmao, alright
Not cool. If I were you I would create a shortcut for that .bat file to have on the desktop
Alright
Iām not at my PC right now but, I only had a short time to use stable diffusion yesterday, and it wasnāt generating images
It would say āwaitingā and then never make it
And then when I switched to DDIM, the progress bar went halfway and then didnāt make anything
Using 1.4 btw, when I get back to my PC, Iāll try to fix the issue, or if anyone can help me that would be great
Anyone knows of a way to have weighted prompts in Automatic1111? I am trying to create a mixed race, polynesian and scandinavian portrait and nothing so far works.
@azure cargo highlight the part you want weighted then press ctrl + arrow up/down
That doesn't work with spreading the weight though between the two different races. Best I got so far was Polynesian|Scandinavian but is not quite the result I had in mind.
Hi i'm new to coding and AI, is there any good articles or videos. i can watch/read. To get started
With stable dif
https://civitai.com/models/14605/howls-moving-castle-interior-scenery-lora-ghibli-style new LoRA for scenery style of Howls Moving Castle
Hey ! what are you trying to learn to do ?
you say coding, but there is no coding to use the AI. a little more complex install than usual programs for sure, but you shouldn't need to code anything now.
#1072220168534642768 is a good place to check, as well as #1072229020520947753 , then you can read the faq I'm about to link on how to access the AI.
There are lots of videos on youtube explaining Stable diffusion, if that is what you prefer as format, any search will give a lot right now, but just starting by testing it is usually the best
Welcome ! There is no bot currently to generate your images on discord. You may want to start by taking a look at the #1072220168534642768 channel. You can access Stable diffusion in different ways : 1ļøā£ the official website, https://beta.dreamstudio.ai/. The easiest and fastest way to access Stable diffusion with 200 free credits. For any question on it, you can find help in the #1025467151206854736 channel. 2ļøā£ Installing Stable diffusion on your computer. There are numerous projects that let you do that, and you will find help in the #š¤ļ½tech-support channel. 3ļøā£ Running Stable diffusion in the cloud, through rented GPU services, using notebooks. You can find lots of them shared and discussed over in the #1011228442399883294 channel.
Where can I find the 1.5 download?
but you'll need to accept the terms first
You are awesome thank you much
Anyone know why the xformers arg makes images produce different results even with the exact same prompt settings? For example I can click generate, let it finish, click generate again without touching anything else and the second image will be different from the first. xformers is so much faster than default but the randomness it introduces is not very desirable.
Is the seed the same number or -1 ?
Exact same seed and everything. Same seed, same prompt, same seed delta, same method, steps, scale, etc... literally nothing changed between generations.
Can you post an example?
I can upload some example images
Gimme a few mins to generate some more I deleted them
Posted in #šļ½general-with-images
I want to see this too
Where do I share robotic insect images?
hmmm, Im getting nothing but purple blobs today
hey guys
is there a model that can generate line arts (sketches)
i tried it with midjourney and it could make them but i want to know if SD can its better
everything fine yesterday, today I get purple boxes
Yea , that's issue with xformers, you don't do xformers if you want consistency.
It's knows issue
people there is some command to give averages like height, weight and the classic 90 60 90
Guys
what's happening ^^
Ive never heard of that
that's why I ask if there is any order to do that š
controlnet does a lot, but I haven't heard of being able to input height and weight
good to keep experimenting
I tried height and weight in SD...no effect
anyone get this trying to install extensions? (AssertionError: extension access disabled because of command line flags)
Hola gente acabo de inagurar un server de Shitposting y memes em general aquel el que quiera acceder que me mande un mensaje asi le paso la invitación
Los espero jsjsjsj
Es de memes
Control Net about to get old. MultiDiffusion Region Control is the new toy.
new extension?
there's a beta for 2.1, and in the Reddit thread someone says it's available for 1.5
hopefully auto adds it soon
looks incredible
Region control...is it what I think it is?
yea...kinda, nice, been waiting for something a bit more official for a while
compositional control is a huuuge leap forward. prompt bleed has been an issue since day 1 and even clever use of inline prompt editing only did so much to mitigate it. there was an extension for automatic1111 that allowed you to divide the image into sections and then prompt for each portion called "two-shot". it was a pretty big deal that flew under most people's radar, but it was overly convoluted and very difficult to get desired results on more complex stuff. this is the next step above that and looks to be a LOT better
Adding bones to the flow is a HUGE step forward, but, sadly, not for 2.x yet.
this feels more like adding bones to it. and yeah, i agree lol https://www.youtube.com/watch?v=ptEZQrKgHAg
genuinely crazy how far this stuff has come in just 6 months
When I used to do 3d bones came in like this, looked like this, and was a game changer.
I think this is alright but we need to stay within our own eco system not be required to load in a 3d program.
like Blender
Mind you, this is just an evolutionary step and I suspect we will not need it soon enough.
agreed, that's why i didn't link it when i saw the video premier about 13ish hours ago. it's not far off though, i bet. at least blender is also opensource. either way it's a big deal and it'll be awesome
What is sort of cool is that a day will coome (soon) that working in 3d will be for a niche item and even raytraced, GI, SSS, etc... will all be done in the AI. In my eyes that will be amazing. Think of the render farms that will suddenly no longer be needed.
when all this stuff released to the public last year, i was telling family that i wouldn't be surprised to see ai capable of generating shortform movies/video clips in like 10 years. after watching this for the last 6 months, i now feel like i -severely- underestimated it. honestly wouldn't be that surprised to see coherent output in a year from now
I have been saying 1 generation to a max of two generations of video cards I can sit in my bedroom and make a full on film/movie in under a month. That is for voice work, and the generating. 2 gens is about 4 years, may 5 max.
One thing we are missing that once hits it is all over is temporal cohension.
If months are like years...
I am basing what I say not only on the tech
consistency is another one of those day 1 issues, but the advancements with dreambooth/hypernetworks/embeddings/loras makes me feel like that's not gonna be as big of an issue in the future either
I made the comment on the wrong one. I mean it on CustomCombo's comment of 10 years
temporal cohesion is what we lack.
what that does is look back at the previous frame, or two, and go from there so no glitches. It simply flows.
we are damn close btw
it's amazing
yet when I tell it dressed like a cowboy, it puts the guy in shorts and a t-shirt
try "in a cowboy costume" instead. i've found "costume" works pretty well for getting the desired result if trying to put someone in a specific outfit
I will try that
maybe I'll have more luck with that than with chatGPT right now
we are now trying costume. It takes my feeble GTX 970 some time
my 4070TI was delivered Wednesday, but instead of a GPU it was a lens protector
oof. yeah. i was on a 1060 6gb until just last week. it's rough for low end cards
1060 6 is much better than GTX 970
hey there, anyone else getting this error when trying to use loras with automatic1111?
ValueError: not enough values to unpack (expected 2, got 1)
I wonder if the A770 with 16GB works with SD
under 500
costume gives me the same stuff...shorts and often underwear. I wonder if it has that Underwear Cowboy from NYC too strong?
Im strengthening Cowboy Costime to 1.4
possibly just using a model that is too heavily trained on lewd imagery lol
its an anime model
when I told it "track suit" it did that OK
Boxer...was good
maybe anime doesn't have cowboys
what about cowboy bebop?
ah...1.4 is putting them in cowboy outfits, at least one
yes...it's working. Good suggestion with "outfit"
anyone help tryna install SD locally but getting error
ah. yeah, anime doesn't (typically) have much training on the classic western cowboy
Im getting cowboys now, but with lots of stars
maybe for a sheriff
now "stars" is in the negative. Let's see SD mess this up
still some stars, but now less
think you're on the right track. did a few experiments with the anime models i have and sheriff is WAY more responsive than cowboy with every anime model i used
what anime do you have?
I only have protogen V22 anime
I will try sheriff
are you doing "sheriff costume"?
um. a lot lol. one of each of the available orangemix anime models (so like 10 of those), anything 3.0, a model called "anything and everything" that is mainly anime, ayonimixanime, and a few others that do furry stuff for anthropomorphic characters
i just did sheriff and it worked out pretty well with most of those
I will look those up
guys help #š¤ļ½tech-support
I do not know the answer
SD put the word sheriff on the shirt: SERCFE
if you're not using negative embeddings, you may want to look into those as well. i've used "easynegative", "bad prompt", and "bad artist" as well as about half a dozen others in the past (for 1.5 models). i typically use the following 3 as default now: "dangermouse" "Unspeakable-Horrors-64v" "Unspeakable-Horrors-Composition-4v"
i've had pretty good results starting with those and then adding negatives to remove specific things that the neg embeddings didn't catch
if you haven't gone to
civitai.com
i highly recommend it. great resource for all sorts of ai embeddings, loras, checkpoints, etc
AnythingV3, AbyssOrangeMix, Counterfeit
why isnt there an eraser for masking yet?!
where to goto to make imaged
Where's the channel for tech help?
tech support
Request on github maybe
youll probably have to mess with it a little, look up some low ram installs on yt
I see, thank you :)
Come to #š¤ļ½tech-support we can fix it
wtf happened with civitAI?
o ye its down
they were working not that long ago, may be too much trafic or a bug in one update forced them to go temporarily offline ?
oh my gosh
it just went from working to not working in a constant loop
It would be up, then it would crash
and then its up again
If im using automatic1111 webui and have the controlNet extension installed, how do I enable MultiControlNet?
settings - control net - adjust the multi controlnet slider
then restart ui
Is multi control net included in the same extension?
or is it a seperate extension?
@limpid tusk
Same extension
ye same extension, may have to update ur repo
or click check for updates under extensions
Thank you āŗļø
lora question: do i need classification images when training a lora model "dreambooth method" ? or image captions are enough? or both? or nether? what do you think?
Same thing. In the settings tab, control net category, there is a slider to allow more than 1
Need UI reload to act but it should be there
can I generate images on my computer but launching the webgui site on my phone?
I hope that Stable Diffusion 3 will be trained with offset noise by default
ELI5 offset noise?
it makes images have much better contrast in low light situations
basically makes your images sexier when trying to make low light photos (And sometimes benefits bright images too)
Ah okii
there are some Loras and TI embeddings that you can try right now! (sd 1.5 and sd 2.1-768)
I'm just saying that SD 3 is going to get closer to being a much better base model that people can train from
Better low light, better composition for wide images (SD 2.1) and etc
the other way around will work, but this no, impossible
with gradio you can, by using the correct link
but there might be risks of someone scalping the links and generating stuff on your comptuer (no virus or security risks, just the risk of someone lagging your computer if they find the ID)
Scary
I had a dream where a hacker controlled every single one of my devices
or if you are at home, you can just use the localhost/127.0.0.1:port
very scary dream
everywhere I went
any device
even if I turned it on then turned it back on
That's a good idea
if that don't work (probably cause you use an ethernet cable and your phone uses wifi) you can do an "ipconfig" or whichever command in the CMD and connect to that (starting with 192.168) IP address + the port
Hii how can render more ai images?
Iāve been gone for a little and I just saw all the dream channels r closed
you need to add the following behind Commandline_ARGS=
In the webui-user.bat
On the same network:
--listen
On the go you need --share=true --gradio-auth "username:password"
Assuming I have an amazing GPU, how much effort does it take to traina style?
How do I set weights on prompts
by adding () or :1.3 at a word
((Green house)), Brown door:1.3, blue window
For example
So there is a model saying to use 0.5 weights
what does that mean
I put 0.5 to every prompt?
0.5 weights could mean many things it could be [prompt:0.5] or 0.5 strenght of a lora or 0.5 denoising
i think it means 0.5 strength of lora how to do that
if you have downloaded a lora file with a name then it should be for example lora:epiNoiseoffset_v2:1
I have to put that name in the prompts?
Yes, and replace 1 with 0.5
"epinoiseoffset" should be replaced with file name right
do I include file extention in that
No extension, and yes its the file name
you can even rename your file whatever you want
š thank you
if the engine doesnt find it it will tell you
I mean it works but it doesnt look as advertised
the style was supposed to be BNHA it just looks like something else
trying again once then ill check again
Nope doesnt look like the correct style, weird
a lora is not a magical thing that can solve everything, stable diffusion is mostly gambling
Turns out I put it in the wrong folder
was supposed to put it on the lora folder
That should do the trick
Anyone got and idea why the prompt doesnĀ“t appear anymore in the .jpg/png filename after I activated the live preview? š„ŗ
if im trying to generate 2 character in the scene how do I give them individual descriptions
it keeps mixing up
like I put batman with his cape flying in the air and flash with lightning but it just gives both to a weird brew of the characters
It“s difficult to do that @sand vine once I tried with "character/person in the left with w wearing z" and "person in the right with x wearing y"
I tried that too very hard
F
IĀ“ll eat now but iĀ“ll try to do it later, if it works iĀ“ll tell you š¤ @sand vine
I've recently switched from Mac to Windows, and installed Stable Diffusion locally. I got it running, and it worked for about 20 minutes before I needed to restart the computer. Now I'm getting a message saying "the site can't be reached. Checking the connection. Checking the proxy and firewall." I have an internet connection. Is there something I could've have gotten logged out of? Any assistance would be appreciated. Thx!
You can find help in #š¤ļ½tech-support!
There's currently no bot on the server, check out #1072220168534642768 & #1072229020520947753 to get started
Blender hype
Bro whats the point of the new blender thing Dream Textures already exists
You can even use dreamstudio with it
wooah
why SD always bait with announcements that don't announce anything
for blender
let us pick a role that you ping for ann's that arent actual anns
you dont want more competition?
So it's like Dream Textures but uses the DreamStudio API to do cloud GPU work?
Dream textures can use Dreamstudio API. But yeah basically
oh
more options is better
u guys know if the blender plugins use control net?
or if there is a blender plugin that does
or what's the super new thing
the more teams we have working the easer it will be
all I see are nearly the same features as dream textures
for the whole community
dream have control net?
seems hella useful for making textures that actually proejct correctly
there is a controlnet plugin for blender, because I saw the creator on Twitter saying he added segmentation script to it, but I didn't download it yet
what is the difference between stable diffusion and stable foundation?
stable foundation is the company right?
The get started link on https://platform.stability.ai/docs/integrations/blender/install goes to a missing page.
All in the #1072229020520947753! #1072229020520947753 message
Now that we have textures we need something to generate the 3d models š
wayyyy harder
Will be fixed shortly, thank you!
So stable foundation develop stable diffusion?
Stable Foundation is our hub here connecting communities - and everything stable diffusion!
Stable diffusion is being developed by Stability AI.
I see, thanks a lot
sd in blender ? how did they manage to do this without gpu ad no dreamstudio
I really like stable diffusion, since last week I run it on my computer, I have played it everyday
I created a girl, and I think she is alive. So I generate her photo everyday
You can share your generations with the community here #1073085702927024128 ^^
the infer speed must be very low if using cpu
How will blender extension not require a gpu
I will share my girl, thanks for guiding me. I like this community
Hopefully we'll get a non-gimmicky version that can actually use our hardware... I mean, you made it for Blender... the best supported cards are Nvidia, and considering Cycles popularity, they have the VRAM... It's not 5 years ago. xD All the recommended cards for Blender 3.x are Stable Diffusion compatible.
Because it is DreamStudio API, did you visit the link to the documentation on installation and use?
I'm sorry, I assumed to conclusions
I have ever used CPU model, they are really slow, I think it is really a good idea to open api for users do not have gpu
It wouldn't be CPU if people are using Blender with at least minimum requirements. All the GPUs recommended by Blender Foundation for 3.x are SD compatible. The best cards for recommended system requirements all definitely support SD, like the 4090. Blender isn't something you should really be using with CPU. You won't actually be able to competently render scenes with a CPU and RAM. It'd takes days to render out a photorealistic cycles scene vs 2hr on a GPU.
I haven't ever try blender, I search it on google, and I decided to try it now.
I am an environmental artist so familiar with Blender, Maya, Max, Terragen, etc, etc.
I have already installed blender. I get to try it
It is really hard for me, I give up
It's their own thing. Dream Texture is better though, as you can actually use your GPU and aren't censored for stupid things that are part of an entire artistic fields.
Blah
you can use blender without a gpu. students do it all the time on school provided laptops. final renders are CPU as GPUs only provide cheap approximations for real time purposes.
gpu for blender is good for editing the model and texturing, but the it's not a requirement of blender. the actual 3d magic it does is mostly software running on a cpu
unless blender has actually gotten around to creating a cuda renderer which they probably have.... my point is "you shouldn't be using blender without a gpu" is something that students do all the time
I have never used blender
gotta get into it
Stability for Blender uses some online SD server?
theres a few plugins. make sure to read about the one you're installing so that you don't end up spending 5 hours trying to install unprompted for < 1% of it's features
yes, but does it use my local SD or something online?
it says, "all you need is an Internet connection"
i dont know about the one you're looking at. if you're reading that it requires an internet connection, i would surmise that it requires one
there are a few plugins for using sd in blender. don't go rushing into a 5 hour effort before you understand what you're installing
the one mentioned in announcements
š
The server banner is gorgeous does anyone have the original image? Also if it is an SD generated image they should include the prompt and whoever generated it in the faq or info channel, would be a great demonstration of how powerful SD can be
š
you're talking about the new official blender implementation. yeah, that looks like it's 100% software as a service. kind of lame that they're like "you used to require prohibitively expensive hardware to do this, but now you can just rent from us instead"
I'm sour about software as a service lol but i do understand that for some it holds value.
is emad committed to FOSS or what? this direction seems like a half in half out commitment. should release the code for the integration too
Are their any plans to release "Stability for Blender" as a standalone install for end-users that wish to run it locally on their own hardware?
doesn't look like there's a plan to release it as a product people can use on their own machine
And, if the above is even a possibility, what about FOSS Licensing?
Well, I'm out on that SaaS then.
Yeah its fully SaaS top to bottom it seems. Not very FOSS spirited
I want to support the future of AI as a FOSS platform, not as a closed-source SaaS platform.
Yeah Stablity made strides by releasing all they have under FOSS licenses, but now they're more often releasing stuff locked up behind SaaS practices
We already have plenty of SaaS providers, adding yet another one doesn't really inspire innovation, all it shows is that greed won out, such a shame.
Yeh. Very shameful.
Not shameful, but disappointing
I have this figured out. On SD, I have a prompt to generate an image of an image AI that creates realistic images with full control over all elements including depth, pose, character.
i feel ashamed for them. Champions of AI freedom, and then 6 months later they're chaining up services behind a locked API. This isn't a healthy direction at all. It makes me think like, what if google created Gmail, free for all, then rate limited it hard and charged for use 6 months later. Gmail would've crashed and burned and nobody would ever use them. Right?
I got a gut feeling about Stability now. They may be an Icarus story.
Then I use that image, run it, to generate mages
I want to see stability suceed, but if they're diving into SaaS, i don't think they're going to be competitive there at all
Where do you hear it would never be also offered to run locally? I believe Brian specifically said otherwise just recently #1054807328509145158 message
That's not market disruption, that's blending in and diffusing into anihilation
The website where all the information is. I'm not going to go through all staff's social media
OK I found it, was a contest winner #šļ½winner-gallery message
Fair enough. If there is a spot on the website that says it would never be considered to be open sourced could you DM me the link? I'll take a look into it
It's more of a lack of mentioning it
what you chose not to communicate is very important when creating communications
SaaS is the death of this
Gotcha, In that case I can't really say it's never going to be an option. It's not a project I'm part of so I can't confirm any specifics, but certainly let me know if you find any conflicting information anywhere official š
Since this is the very initial release I know there is still a lot of work being done on it š
you want to use image generation online - be subject to changing prices, content policies, etc
SaaS is midjourney, lexica, openai, all other competitors. Are stability leading or following?
one day you make images, the next day they decide "no, that image isn't allowed"
or the next day, prices increase to $500 per month for 1000 images, or based on number of tokens
really feels like this is falling in line with corporate market forces trying to push SaaS into every aspect of digital life. Rather than kick starting a machine learning revolution. hugely disappointed by today's revelations
you pay for every test run, every experiment...no, that's not exactly what I wanted, but it used up 100 tokens
That“s why local installation > everything
š
i'm also wary of sending any of my assets to a 3rd party cloud service, since then they have all the IP rights to it. Standard TOS really, since they need copy right to copy it around the internet tubes. But those rights tend to extend and give them a lot more freedom to use my assets than i might want them to have
SaaS is just balls
Pricing: Free for 1000 tokens, $100 per month for 2000 tokens, $500 per month for 20,000 tokens
It's understandable that they want to monetize their efforts, but theirs other ways they can go about it whilst still releasing as FOSS.
If they plan to now take a purely SaaS approach moving forward then I won't be supporting any future endeavours of the project.
I will support it's existing FOSS elements, and, any forks of those element's that others may wish to expand upon with further FOSS additions.
Because I believe that they have the most to offer to society and mankind as a whole.
But I cannot personally support any SaaS approaches because I know how those always end up stabbing you in the back, and financially speaking I am not in a position to afford constant payment of services that I have the hardware to run locally but definitely do not have the funds to pay for through an online service, I've been dealing with those kinds of situations since 2010...
I support FOSS projects because although I may not be financially very well off (Officially I am below the poverty line in my nation of residence by some margin) I can still contribute in various ways to these projects with the free time that I have.
It is why I love FOSS, because regardless of my financial position I can still use my time as payment and contribution to these great projects that people make!
It is why I fell in love with Stable Diffusion, because it is one of the few widely adopted FOSS AI solutions available right now, and it is the only one that was heavily focused on usability and community support.
you can't trust SaaS providers to maintain the status quo. if you've got a local tool, and they update it to a version you don't like, you can still use your version that you do like. HUGE for tools.
right
they don't have your content, they don't control your tools, they can't censor you
i'm less concerned with censorship in this issue, than i am with inconveniences
it's part of the same thing
i dont think they're trying to take down the banksies of the world with SaaS
censorship is just hyperbole imo
Let's say you want to make a video or comic that's pro some politician they don't like - sorry, you can't glorify that woman!
it's not
yeah. i don't think that's a SaaS concern.
it absolutely is
is it posisble? sure. is it a concern? nope. i don't think it would fly if it happened, and companies want SaaS for profit motive, not political motive
Censorship is very much a possibility on any platform or software that you don't have full control over.
Whether or not you agree with that censorship will depend on what it is censoring, why it is censoring, and to what extent it is censoring.
people tend to agree with censorship when it's something they don't like
they disagree with censorship if it's something they like
being worried about your freedumb of speeches is just qanon nonsense. dramatic flare meant to attract culty crowds. thoughtspeak and all that actual control and subversion. "censorship" in a digital world doesn't happen. If you get banned on youtube, you'll publish on twitch or discord. It's a faux concern
censorship only actually matters when it's coming from an authority institution
it always matters
i'm censored all the time and it's fine
Well, bon appetite, now I must leave
This is exactly how I feel about it all.
Loads of SaaS options already exist.
It feels like Stability AI have misunderstood why people swarmed to back it in the first place with loads of 3rd-Party support of it with additional GUI's and other plugins, etc.
on the way to MC to pick up some stuff
if i'm in public and someone annoys me, i can't just lash out and thrash on them verbally. I could, but other people would come in and tell me to stop. censorship right? that's fine though
sometimes you need to stop
their whole system is archaic, backwards, hard to work with
yes, but you wouldn't have to stop.
make an SD image about it
Yeah i want to see Stability make moves and drop bombs that cause huge disruptive waves. I guess they saw openAI release chatGPT as a service and are now leaning into it. Business school graduates and their confirmation biases
Just a reminder: I have no intention of stopping any discussion here since it's very important you all can share your thoughts positive or constructive. However, avoid direct insults or ad hominem attacks against other users please.
You are all welcome to your opinion and so are the people you are talking to. It's certainly possible to debate while remaining civil š
I often self censor and stop myself from insulting people because the rules say i cant and also it's just good for healthy conversation. It's another example of where censorship can succeed greatly.
often not always
No worries, I have no intention of causing any conflicts or targeting any people.
Just voicing my opinions/concerns as a user of Stable Diffusion.
That's most certainly fine! I mean more of a general statement to everyone - that post wasn't directed at anyone specifically ā¤ļø
Is there an official API for Stable Diffusion, a REST endpoint? Currently I am using the Dalle-2 API for an app and I'd like to offer my users access to Stable Diffusion if they prefer.
Take a look over at #1042896447311454361! (Also this since it sounds like you're already well versed in the more technical side of things: https://api.stability.ai/docs)
SaaS rearing its head again. I might be fighting an uphill battle. A lot of people find value in this stuff. ugh. market forces grrr
Thanks! Any rough estimate on the per-image generation choice for let's say, 1024x1024 images?
finally getting around to catching up on what Carmack is doing in the AI space. Not much yet but here's a good interview that makes me salivate. https://dallasinnovates.com/exclusive-qa-john-carmacks-different-path-to-artificial-general-intelligence/
It's cool, I know it wasn't, just wanted to make it clear to e everyone here that I'm not here to try a cause any rukus or upset/target any individuals.
All about healthy discussion.
I got one better for you!
Check out the bottom of this page - https://platform.stability.ai/docs/getting-started/credits-and-billing
This documentation is for the old API so I wouldn't suggest using any of the examples there if you'd prefer to use the REST API. However, it has a grid for examples of image costs for various different configurations!
(The cost in the API is currently the same as using the beta.dreamstudio.ai site in case that also helps)
I find using the site super helpful since you can adjust all the sliders without actually generating anything to see what the predicted cost is
Hello Is there a place where i Can find prompts in this discord?
Check out #šļ½prompting-help!
He gets about a bit, I have heard his name in 3 different technology related markets in the past 5 or so years, I wonder if he himself isn't quite sure of where he wants to settle down, or maybe he just enjoys a constant challenge and so is moving about in constant search of the latest and greatest challenges in the technology space.
His name certainly carry's some value with it as a person, It will be interesting too see where his name pops up next!
Thanks. I am asking because i Saw some prizes with great images but was surprised to see no prompts next to the images, Is that normal ?
It definitely depends on the person. Some people like sharing their prompt with their work while others choose not. You're certainly welcome to share yours if you'd like to!
Hopefully I'll see some of your stuff in #šļ½dailies or #1023999442338201721 some time
Thanks. Question intriguing me how do people win " image of the day" if no one can guess how they made it? ( Without a prompt )
It could very well be an image from anywhere ?
Anyway inam gonna check the 2 room you pointed at thanks š
I just checked both rooms and i am amazed at the images they are soo good
No prompt next to them though š¢
he's settled on general AI for the past year. has left meta and started a new company with other people's investments, so that he's more motivated to do. Rather than his own money funding it and not feeling pressure os much
The #šļ½dailies Channel is more for people to submit their work according to the theme for a fun challenge & to share what they've done - not so much a "guess the prompt" kinda situation. Hopefully that makes sense?
Yes oki. Its Amazing I had no Idea people had this much talent
Anyway Is there a Channel with full or images and prompts next to them though ?
You might have some luck looking around the dreamer communities such as everyone's favorite #1070829982252798043
There are no channels where people are required to share their prompts by default, but there's a lot of cool people who are great to have conversations with that you can certainly ask for their prompts & try them out yourself if they're willing to share š
How long is this lawsuit gonna take?
hello everyone,
Is it possible to combine deep fake with stable diffusion for better results
Wait, they locked their API?
no i'm just using hyperbole. technically the api is behind a lock since you need an api key
but i was going for dramatic effect in that post
wait this one cant use your own GPU?
Does anyone know what hardware and how much time was required to train the controlnet models?
Yoouuu don't use Blender much, do you? Cycles is a physically accurate path tracing engine, and works leagues better on GPU. The functionality is the same. CPU is just exponentially slower (same goes for Eevee [Ray Tracing], which is optimized for real-time GPU streaming, not CPU), and not recommended, even for minimum system requirements. What people do on their own is fine, but is not recommended (inherently will probably encounter problems, and they won't provide support).
Blender doesn't recommend a CPU in even their minimum specs for rendering, but at least a 2GB compatible video card. CPU can be used for hybrid modes, to aid in speeding up GPU tasks.
Not from their docu. No mention. Only Dream Studio API
That alone would violate most NDAs for professional work.
Might be temporary. Dream studio has features before public does
But that is a little concerning
Help! Suddenly i cant get past this error when using contolnet, tried all versions of models, it use to go away when I switch to 1.5 models, no longer works... File "C:\AI_Files\stable-diffusion-webui\venv\lib\site-packages\torch\nn\modules\linear.py", line 114, in forward
return F.linear(input, self.weight, self.bias)
RuntimeError: mat1 and mat2 shapes cannot be multiplied (154x1024 and 768x320)
Try using more similar or exacting sized images (resize them)
Woops, wrong reply. Meant to tag you @alpine mica
Thanks, I had somethin in settings that was causing that(Extra Networks), woks now, but can one use controlnet with newer versions of Checkpoints, than 1.5?
can anyone recommend some (free) site's that i can use to do some image stuff (prompt to image)
As cool as stability for blender is, it's not gonna be able to hold up to the competition, sadly š
I didn't exactly work on, but rather alongside with the creator of Barium AI, which has been purchased by unity, and I'm extremely excited to see where our passion project went
It's still very cool to see stable diffusion adding the two the options though
Anyone know how to create an optical illusion using stable diffusion?
Im kinda lost in the channels, but is there one to go to if I want to find pairs of prompts+settings <-> images
what's generally considered the best model for more creative landscape generations?
so far i havent been able to find any model that comes even remotely close to midjourney when it comes to fantasy landscape gen
you may want to try openjourney + perhaps mix in dreamlike weights? Still not the same as midjourney in that regard, but still quite nice results. I tried forests and deserts though.
midjourney probably doesn't use just a single model and likely have hypernetworks, Textual inversions and loras. Perhaps even proprietary versions of them that they developed in house. hard to find a one stop replacement for all those kinds of tools. though as a base, i found that charhelper did nice landscape gens if you use the right token https://civitai.com/models/1480/charhelper-sd-2x-768
https://civitai.com/models/1147/painted-landscape this is a TI you can use for 2.x models, including CharHelper
Let's say I have a photo - a portrait of someone who looks straight at the viewer, the end result that I wanted to get is producing an image that shows up the same face slightly turned to the left or right.
Is it possible at all?
civit is hard to wade through due to all the , well, you know. but filters help to great effect. though filters don't catch everything. not all the embeds or loras on the site are uploaded as that. it doesnt' automatically recognize and relies on an uploader to tag properly
tough and would take some manual work and experimenting. take 20 other photos of that person and train it for their face, and use that lora or embed to affect the new face. maybe skew it in photoshop for the worst 3d edit possible, and inpaint on that? not easy but doable possibly!
as far as prompting "turn their face 20 degrees" with one input image? one day but we're not there yet
though everytime i say that i seem to be behind the curve and somenes like "just get this extension" so .. keep an eye out
no, definitely not something like that, I understand that prompt is not translated to something that NN would understand that way, after all it is not a 3D editor. More wondering if fiddling with inpaint could somehow yield some similarity.
Another thing is some face that SD produces after txt2img - it looks awesome and I want to keep using it in other angles. I tried referring the same seed but it doesn't produce the same face for rotated 3d model in the base image
When is the extension going to be made for Ajtomatic1111 for Multidiffusion?
its been out for a week or more
Where can I find it then?
you mean multi controlnet diffusion right?
then Im not sure
Basically supercharged paint in words.
cause with controlnet you can take a background and a subject and diffuse them together
You have to go lora
That was limited in a sense that you only have a few colors that correspond to objects to choose from
The new tech is literally capable of making any mask and then corresponding it with a custom prompt.
The way stable diffusion will create a new face from a seed or an img2img is a bogosort : a random sorting.
So it destroys some stuff while keeping some others, but getting everything is really hard
And to get everything right is not just keeping everything where it is just like control net would do, you need to keep the idea of the face
check out this video, if you havent already. you can mask a person from an image and generate them onto any background https://www.youtube.com/watch?v=MDHC7E6G1RA
basically unlimited options
ah just reading the multidiffusion github. It has separate prompts for each layer of the multi diffusion. You can achieve this with controlnet but not directly and easily from prompts
š
hey guys do you know the site where I could watch some art made in SD?
I can bet there is such a site, but i forgot the name LOL
and i know that on the dc server is a section with images but it's just too slow to check them out
guys, is this the latest SD model?
"stable-diffusion-v1-4"
Nope, that's 1.4, pick either 1.5, which is better for most applications, or 2.1
if not then which one is the latest useable one
hey thanks for reply,
may i know the difference between 1.5 and 2.1, which one will be better
1.5 has more support and tech, while 2.1 is better at the cost of losing some images due to copyright issues.
I heard 2.1 works with images 728x728 where as 1.5 is 512x512. However 1.5 has a better prompting system I believe and a lot more addons on it by the community. Also allows 1.5 allows nsfw so more accurate humans
thankyou kind person
of course all can be fixed with custom models, in that case 1.5 more models
i'm confused which to get
v2-1_768-ema-pruned.ckpt
v2-1_768-ema-pruned.safetensors
v2-1_768-nonema-pruned.ckpt
v2-1_768-nonema-pruned.safetensors
Ema pruned safetensors
/ativar botão
I am kinda confuse, so there is an AMD for SD released, my question is, will that affect the speed of generating image if I am using an AMD processor?
How is stability.ai making money out of their āfree to useā blender ai
Huh?
Could someone explain to me how a VAE is used?
Not only where to put it (though also including that), but what it does, how it works and whatnot so that I understand it more than just knowing what to do with it. š
Hey all my Dreambooth extension wont show in the UI, but says its there under extensions, any suggestions?
was there an update today? AI Scale up stuck at loading
Hello everyone, in case you don't know if there is a website to get commands like:
from behind
rear view
from_above
(hand_on_hip)
(from_side:1.0)
from below
to control the camera or poses of the generated models
thank you so much
controlnet...look it up
Hi
I'm tired of SD generating EPH images
Can anyone recommend a good Voice AI for parodying celebrities now that ElevenLabs is cucked?
You can't upload celeb voices anymore I think?
Oh, I didn't try
because of the pricing
you pay even for tests, failed runs
ridiculous
good to know, gotta figure out Tortoise then hopefully easy enough for dummies like me
There is an 11labs model or whatever for Tortoise, being worked on now
o nice
I can't get the thing installed...tortoise TTS
midnight - new folder for SD images being generated
there's an explanation on 4chan's /g/ board for Tortoise looks like, gonna play with it
OK
You can try this for a basic idea of how it works: https://replicate.com/afiaka87/tortoise-tts
it only allows one voice sample, or averaging between different voices
there are also tags you can use for different things like anger, sad, etc.
I might try that, hope its not too expensive.
you can make monthly payments: $0 per month for 12 months, or a low $0 per month for 36 monts
right...I have to figure out the install
Is there any way to train a certain face and have it create realistic images based on that face?
Like i can use text to img and results will have the same face everytime?
Which one
You have to train a LORA for a specific face
Can i do that on SD webui?
If you have a good GPU
I'm using colab
there are videos. Do your search "LORA training"
Alright thanks
you're welcome
whats the website where we submit art to get it trained in stable diffusion
hi could someone make me "charming Polish village, complete with colorful traditional houses, cobblestone streets, and a lively town square."
with 16:9 ratio? my gpu sucks
i need better alterantive to premiere pro
Hi
I've ever used MidJourney but now i'm in trip to learn and use (on my PC) Stable Diffusion.
Someone has already used and installed on fedora 36?
Alsoif i'm followinf a lot of tutorials no one work properly.
Thank you
Fedora Linux 36 (Workstation Edition) 64 bit
Intel® Core⢠i7-4790 CPU @ 3.60GHz à 8
NVIDIA Corporation GM107 [GeForce GTX 750 Ti]
11.6 Gb ram
this is my PC
Hello, "Whitelist - #Chapter 3: What name would you give to your NFT Stables horse? Send a message on our Discord." --->
jolly rancher
i downloaded stable diffusion locally and run
what is dreambooth?
dreambooth is one of the existing training methods. It lets you teach new concepts to a given model, and make a new model out of that
does it require more than 4 gbvram?
different training methods exist, not only dreambooth, and some are more accessible. Yes, dreambooth requires more than most, and even more vram can bring even better results. to be honest, I have 24gb and I would benefit from more still. I think 8GB is currently the min for dreambooth, not sure. 12 for sure works
Are there any where 4 gb works?
I'm not sure, but I need to say, since you usually only do "some" trainings here and there, unless you want to go serious big time into it, google colab seems the best for it. on the free plan, you get enough time per day to train multiple models for free
here is one such notebook https://colab.research.google.com/github/TheLastBen/fast-stable-diffusion/blob/main/fast-DreamBooth.ipynb
š
I don't get the question. if you put an artist name and not <artist name>, for sure, it should do something like that artist does
How hard is it to train models? is it as simple as providing many many images
no the <> is for putting like any artist name
like an exampl
so instead of <artistname> put the actual artists name
to train a model is simple. to train well is quite harder. The dataset isn't the easiest thing to do correctly, then there is the captionning, then you go for the training itself and have to manage the step count and compare the resulting models to find the right training amount you need. It's another thing completly than making pictures and requires quite some learning
not sure how this works, I don't use aliases like that sorry
sorry I misunderstood
yeah ok, well then the answer was "yes" it will work
Any good tutorials?
sure, lots. here is one for style training for example https://github.com/nitrosocke/dreambooth-training-guide
its dreambooth tho, itll still apply to the collab link?
How do I make something
the colab I sent was a dreambooth one.
but look on youtube
really lots of them
it's a really popular subject
people want to train their face in the model usually for a simple start
or a pet or something
I start ambitious going to train my own art style
yep, but it's doable. may require some tries to get it right š
anyone know an android alternative to this https://drawthings.ai
even the photoshop one needs an api key but runs locally
they may know more in #1054807328509145158
is there a turtorial specifically for this
will it ork for 4 gb vram?
no, this will require more. 12 i think. use it on colab.
for example https://www.youtube.com/watch?v=fyBqkKCIpYU
Do you guys think that disclosing dataset was a big mistake for SD?
Yeah Im using on collab only
Quick question about Automatic1111 web-ui folder structure: What is "venv" and why is it a bit over 6 Gb for me.
like the collab file you sent right
I just run it from therE?
Also little question.. my references are all different sizes, some are rectangular some are square, I need them all to be 512x512 but how do I extend the ones which are rectangular so that a significant portion of the image is not cropped out?
it's where python and torch and all the other dependancies get downloaded. it's the normal size, yeah, and you can't remove it, it will recreate on next start of the UI
yes
check the guides, lots of them, and like, it's wall of texts of explaination
there is not just croping to do
I was just curious due to the size.
Which ones?
the two I already linked you, this one is exactly what you need, style training https://github.com/nitrosocke/dreambooth-training-guide
ty
I need to get perfect symmetry in my generations, can anyone get this to work?
https://gist.github.com/1ort/2fe6214cf1abe4c07087aac8d91d0d8a
Today is world wildlife day, I used diffusion for the animations in the background
https://youtu.be/NhaJ4fMcztI
any model recommendations for anime or 3d style? (fantasy setting)
Well, that's a report

ā ļø
š
Thanks @vast ingot! Sorry y'all had to deal with that š
Eggs, bacon, no spam! ā¤ļø
Guys i m new here, can i generate images here and how?
Hey and welcome !
Nope sorry, we don't have a bot around, let me link the faq on that
Welcome ! There is no bot currently to generate your images on discord. You may want to start by taking a look at the #1072220168534642768 channel. You can access Stable diffusion in different ways : 1ļøā£ the official website, https://beta.dreamstudio.ai/. The easiest and fastest way to access Stable diffusion with 200 free credits. For any question on it, you can find help in the #1025467151206854736 channel. 2ļøā£ Installing Stable diffusion on your computer. There are numerous projects that let you do that, and you will find help in the #š¤ļ½tech-support channel. 3ļøā£ Running Stable diffusion in the cloud, through rented GPU services, using notebooks. You can find lots of them shared and discussed over in the #1011228442399883294 channel.
guys how can I reference my face for SD?
hey ! without doing some extra steps, you won't be able too. Basicaly, you'll need to first teach SD your face, using a training method like Dreambooth. Then you will have a new SD model that knows about you
(there are lots of ways to do it, dreambooth is only one of them)
thanks
yeah there has been lots already ... quite the day
Anyone have a good resource for learning how to interpret the loss charts that come from Lora creation / training?
not specificaly for LORA, but genericaly for all training methods :
A/ Loss is a numerical value that tries to evaluate how well your current model in training is doing, how close it is to the training dataset.
B/ The graph will go up and down. The lower the loss value, the best the output model usually.
The graph can go lots of ways depending on how all is going :
1/ a local low, meaning the loss goes down, then starts to go up again. This is usually a nice checkpoint to compare.
2/ a rapid descend into very low values. This happens usually quite late in training, and signals overtraining : the model will start to output things that are almost just your dataset, not good.
3/ a rapid growth. This can happen when your training rate is too high, or sometimes very late in a training. This means your model is diverging, not being able to draw anything even close to what you wanted anymore. This signals that you broke the model usually
sometimes, it can be good to let it go longer even when loss is growing higher, because it can go back down after some more steps
Yooo, this is fantastic info. Just what I was looking for
I'm assuming this is overtraining:
I'll add numbers to my points to simplify
no, not yet, not necessarily
this is either just enough training or not enough to me
the curve seems to stabilize
it feels like a local low
rapid descend would be going into the 1e-7 over a few steps
Gotcha, that helps a lot.
and we can move into #š§ļ½finetune if you need more/to share pics
Yeah, I didn't realize that was a channel
where should i put .safetensors files?
Hiya all, question regarding controlnet and batch img2img (video input). Is it possible to configure controlnet to take an individual "detectmap" image from each frame, and not use the same "detect map" I have used on my test frame?
same place as the ckpt files in the models/Stable-diffusion folder
I saw people writing scripts for that but didn't see any published for automatic yet (I may have missed some)
hey, yep, what's up ?
Thank you
if i try to create a model in dreambooth it says "Missing Model directory, removing Model... and a path"
I dont know how to fix that
#š¤ļ½tech-support will be more geared to help you. Some screenshots, and a little more context (lots of dreambooths going around) will sure bring you some answers over there š
Has anyone used the corridor method with something like stop motion? The idea being an artstyle put into a stop motion animation, I think it'd turn it into almost fully fleshed out animation with whatever artstyle you put in
Any thoughts?
thank you very much, ill try š
Hello i start try using Stable Diffusion with my computeur and i would know if their is a wiki somewhere with the explenation of all parameters and how use prompt like how insist on one particular point of the prompt ?
automatic1111 i assume? https://github.com/AUTOMATIC1111/stable-diffusion-webui/wiki/Features
Any good model for training realistic faces?
the only example is very bad...make him a cyborg
hi guys! Can i ask a question about install stable diffusion? (i have a problem)
sure in #š¤ļ½tech-support
thanks
Thereās no explicit way to generate image variations, but you can āDIYā it a bit, by re-running the same seed with either slightly different prompt text or slightly different settings. Try shifting the CFG scale value by a few points, reordering the words in your prompt text, or adding another new keyword or two.
Our vibrant communities consist of experts, leaders and partners across the globe. They are developing cutting-edge open AI models for Image, Language, Audio, Video, 3D and Biology.. AI by the people, for the people. Learn more here 
Is there any script that can mark words, mostly things that can be a bit NSW or off the norm? Words like Dick, ass, pussy and so on but also furry, loli, chibi, gimp and what can be slightly into strange or kink?
Another question: Automatic1111 do fill up "AppData/Local/Temp rather fast, every time you use ControlNet and Scribble there will be a copy of the scribble file, and it can fill rather quick, I did tick in "Cleanup non-default temporary directory when starting webui" but that do not clean out the temp folder.
you guys are funny, u tell me after 2 hours 4000MB downloading of install process Miniconda, Gradio on my Desktop PC that i need a NVIDIA Card...? i mean ..... can i use my AMD Radeon?
hmkay...uff sounds complicated, i just have no idea how to do it on windows with my actual environment
hi guys , i have problem with SD when i want to open
It gives errors and i dont know what to do ,i copied the log ... who can help me its too long ?
maybe ask in that reddit thread
thx but seems not working...cheers*
I edited the "NO PREVTLFH" image in "C:\StableDiffusion\stable-diffusion-webui\html", then I closed down all and restarted Automatic111111, but it still show the old file that it say is placed in "http://127.0.0.1:7862/file=html/card-no-preview.png", so where is the server files located and how can I change them?
Edit: After several restart, F5, and Ctrl-Reload the edited images did show up
How many have peeked inside C:\StableDiffusion\stable-diffusion-webui\repositories\stable-diffusion-stability-ai\assets or where your asset folder it located? I can not be the first to see that??
what about it?
What about it? Nobody is falling for a Rick Roll.
It was just that, it is an attempt to a RRoll, and in all the folders there is even one more slightly bad attempt.
I am so old I still think RRoll is fun.
it's tired
It just for you been rolled to often š
Just kidding. But we need to preserve our (internet) culture and Rick Roll is our national bird.
Checkout
Who else find linear algebra really boring?
Pls what's the point of learning it? I just get started with my ai development roadmap, pls I need some advice.
@hazy solstice boring or not, you know what it is and you read about it, that is impressing. I just know that if I take five and divide it by 7 and then add 3.3 would be doing math.
You can use Shark from Nod.ai to generate images with amd cards on windows
Help is in #š¤ļ½tech-support
Do I need to fully understand it before moving to the next step? Which is data frame and series.
Roadmap is here: https://i.am.ai/roadmap/#fundamentals
So, if I wanted to train a Lora to be a specific art style, do you generate regularization images using the words "in the style of" or what? Anyone been down this road already?
nvm, it looks like the general consensus is to lean into the words "art style" or "illustration style"
Hey Stability AI, I would appreciate it if you brought back thispersondoesnotexsist.com
I know you bought the domain
anyone seen that latent couple helper script? It seems it would help but looks a bit suspect so unsure about running it
Now because of this, I canāt use thispersondoesnotexsist.com and the other AI face generator sites using thispersondoesnotexsist.comās ai
There no way thispersondoesnotexsist should be redirected to stability.ai
Ma'am, this is a Wendy's.
Oh in that case Iāll take your Bacon Double Stack with some fries and a root beer! š
lol in all seriousness though, you can use the beta.dreamstudio.ai site and try some different prompts for generating fake people, or you can do an image2image generation by uploading a photo of someone else and getting a fake person back
Hehe, I understood the joke XD but yeah thanks for the suggestions! Iāll try those methods. I was kinda hurt when thispersondoesnotexsist was gone but according a user on its Reddit, the domain expired.
Thanks for going with it š I didn't mean anything personal by it. Just saw an opportunity to meme and I'm a simple girl.
Whatās the best model to generate attractive photo realistic people?
Bwahaha no problem š I donāt mean to take any offense haha Iām a memer and a simple girl myself! Also, I was hungry when I wrote the reply hahaha so yeah
There are several models available for this, stable-diffusion can do it. The more important part is your prompts, IMO
what model is best to use fore realism, i'm using rpg_v4 right now but sometimes it gives a clay like result, is there an other that would be beter to get realistic pictures?
keyerror
Thx
does anyone happen to know if automatic 11111 has it's own discord?
it was created in Oct, but it moved to the StableDiffusion discord
Hey guys! I am looking for developers who are interested in ai generated art, people who are trying to host their own models. Anyone interested in a chat?
find someone who can eliminate the hands/feet problem. Then you have something.
say i have a photo and i want to remove a large object. like a painting on a wall. what options in inpaint should i use? cant seem to get it to remove the object and just generate wall textures.