#💬|general-chat
1 messages · Page 44 of 1
i assume the best option is inpaint masked, fill and then set a small masked padding pixels?
I have a question about samplers. I usually use DPM++ 2M Karras, but I have done some testing and like the results of DDIM, LMS Karras, and Euler for what I am doing as well. Does anyone have insight on which of those models tend to be good to use?
For example, I don't know how to tell if one is faster than the other. I am using the same number of steps with each, and probably will not adjust the number of steps that much.
So a sampler that steps faster would be preferable because I know I get results I want at the number of steps I am using.
from what i've seen it seems to depend a lot on what the model was trained on originally. they wil often suggest samplers that are 'best'; for them
other than that i have no idea
Most ppl recommend sde karass because it gets the job done
And it requires half the steps
I usually only use 15 steps
DPM++ SDE Karass?
The experiments I am doing now seem to require a high step count. I am running 50 on all the models, which is why I want faster steps over fewer steps. I will consider SDE Karras for regular stuff though.
Yes
and yeah the same 15 steps from that one take a lot longer than the steps from euler+
Basically, I am using prompt editing with A1111 to fine tune the output of a picture. Sometimes I can get a better result that way, either showing variations of a character or a progression over time. The high step count is necessary, though, to have granularity and control, as well as to give the AI time to change the picture.
Maybe I should do some more experimentation on different step counts. Might be time for another x/y/z plot.
yall know how to make models with dreamboothv2
DDIM - I think is fastest , but results are kinda meh, good for testing purposes only
Euler a - good results and good speed, average thingy, good enough for anime gens.
DPM++ seems like being the best option for high quality realistic gens , with high step amounts.
I usually do ++ SDE on realistic models, at least 50 steps.
But again, results might vary on different models...
oh, DDIM is bad? I never knew... I always use DDIM for some reason.
on models I tried it - it doesn't look as clear, maybe cause I was doing low steps on it, idk
I usually use the exact same settings -
DDIM, 25 steps, batch size and count of 2, then set those to one and eneable hires fix with 4x_foolhardy_remacri on a seed that I like
yea I was doing 25 steps too, 20 if I wanted to test something, idk, I think euler looked better
Do x\y\z to see if there's a difference
guess I should switch back to euler
Compare it on few gens, 10 images should be good enough to compare the difference
on x\y\z
Yeah I should do that, but maybe tomorrow. My plans tonight are gaming then training a low poly stylized embedding.
Question is, do I train it for 1.5 or 2.1?
Emad's Twitter account appears to have been hidden or deleted: https://twitter.com/EMostaque
oh ye it says doesnt exist
i wonder if it has anything to do with the opt out since today was the deadline
Hello all. So I have a very very lengthy post regarding some guidance with SD starting from hardware ground zero and a portion centered around questions regarding NSFW content. Can I post that here?
anyone know of a model that has character recognition or whatever its called?
is there a good idiots guide to installing automatic1111?
just got a new pc and realised i forgot what i did the first time around back in august lol
ty
can i just throw in the models directory from my old install on the old pc and which file would tranfer my settings over
Did Emad delete his twitter?
Oh
Wonder if he got mass reported by the mob or if it was a choice
It might be the social brigade that the hashtag and subreddit are facing this week. I think the goon squad have mobilized. They're flooding the hashtags with BDSM porn and flooding the reddit communities with super suggestable teenager photos. There are shenanigans afoot. Goon squad is doing shit i feel it.
can you combine the pix2pix model with another model or will that mess up things? for image2 image editing
pix uses special sauce that can't mix well. you're better off using controlnet for your goals
ah makes sense
pix is pretty neat on its own tho. i turned a pic of a church into an abandoned one. fun times
yeah i think we haven't seen the last of it. It's really cool clip/prompting tech
with sd you dont need to be an abandoned building photographer. just take pics of regular buildings and then turn them abandoned lol
safer too
the smaller controlnet downloads are nice apparenly i got errors with the larger one anyways
smaller ones are useful for loading multiple at once too
people use pose for a body pose, then depth to create hands
all these models and addons are sweet. i basically first used nmkd before but now using automatic with some addons really makes the difference! using img2img on your own photos is pretty crazy tho lol
They are?
i wish another system that was as extensible as A1111 would show up. i want to look into node based generation setups
well im sure more will come in time
what was the twitter opt out?
the goon squad? they're a mob of internet toxicity. anonops, lulsec, and a few others have been birthed out of the deep trenches of the goon squad.
opt out of what?
they took my shps and money in eve online f them
No idea what you're saying
long long ago i was stupid
Out of the dataset, i guess
you have google don't you? follow the breadcrumbs. A pet peeve of mine is when someone asks for more info, and then acts indifferent and uncaring about the answer. Why even? Don't expect me to engage with you again.
Yeah, with his connections I'm sure he can get back on real quick
that mri imaging stuff is amazing
i was following it before the ai was there to help
Although it's supposedly really cherry picked
its a good start tho
i wonder if we can help bring ppl out of comas with this sort of tech one day
Yeah, and I guess it's good for the case of stable diffusion aswell. Shows that it has some other usecases aswell
reading images from their brains ? how would that get them out of a coma?
there's other tech getting towards that. i do'nt think imaging will
could maybe project images into the brain too
that would be some other kind of tech most definately. reading averages of signals going on in clusters of neurons vs directly affecting neurons in an effective manner, very different things
might be able to tell if someone is too far gone to ever come out of a coma
can already
based on the images its seeing
coma guy be like im good
based on brain activity
@wise stratus What happened to your twitter account?
woah this smart guy with the tag. first time i seen that done since the topic was brought up. lol
i has none
too far gone then
look i just want to live in one of those machines thats a lifetime in a second
How do you know you're not
i dont like this version it has bugs
you could've just booted into this game just now, and your lifetime of memories are only simulated
a real life time is a second
remembering the bugs is just part of the initial memories. its the experience you signed up for
all started a minute ago. prove me wrong
Ignore the rude boy. Looks like it's deleted, either got mass reported by the mob or he voluntarily deleted it
I'm betting on the former
man with my aritifical lens implants i see art rediclously good now lol
that came into being the moment your game started a couple minutes ago
remembering that you used that gif before is jsut an implanted memory
how come textural inversoin models are so small in size ? :-p
i r newbieish
the earth was created 5 minutes ago
we were all just given false memories :-p
true story
you cant prove its not true
expactly
so it seems you could use daz 3d to make poses and then use the pose thingy to use em in sd, fun
exactly lol oops
If I see a command that says hijack_prompt on my command line does that mean someone is snooping on my local stable diffusion instance? I dont have gradio enabled
oh noes
well if u have sd on ur local machine u can run it completely offline
if u want to be super duper careful lol no need tho but yeah
i always used it local
yea i haven't enabled gradio or anything, its on a local instance
the command was hijack_get.prompt someting
does SynthwavePunk
work well in converting real photos? might have to try that out sometime
does anyone have a good technique for getting their images to have just a plain solid color background? trying to make portraits of people
(photoshoot studio, x color background)
think this would work for an anime based portrait
Only one way to find out
tru
Code Red why did Emad delete his twitter account
hmmm, thnx. seems to add a camera lens to some of my characters
Can anyone help me enable noise offset while training lora?
nobody care that emad deleted tweeter? lol
god, imagine the freedom one gains when they delete twitter
hey maybe emad deleted it cause of the daily show ep tonite
i'm not one to speculate usually but i just saw this bit
His twitter was a great source of cool resources. I hope someone archived it
Yes I do
Scroll through the chat log and you'll notice it
Owari Da
What happened. What was the announcement?
Let me guess. More bad news? More backlash from anti-ai crybabies?
what happened in that
do you mean john oliver ai thing
yeah but i can't watch it there. only available in america
the host says its not about elon and is clear baout that
i forgot his name already lol
i miss trevor. i miss john! ugh.. change
I really think twitter only sucks if your political
Ive found almost nothing but positive interactions in ai twitter
maybe not emad tho 😄
There was a beautiful photorealistic version of the mona lisa on r/stablediffusion made using controlnet recently, but I can’t find it now. It consisted of two pictures: the mona lisa and the photorealistic version. Can anyone link to it?
anyone able to explain a checkpoint merger? if you use 3 models via add difference. how would you add say, 10%of B and C to A? woul;d you set the weight to 0.2 or 0.1?
well guess i'll experimentt a bit. set it to 0.5 and will see what i i get
If A and B are both 1.5 based models, you do: A + 0.1 ( B - v1.5 )
A is primary, B is secondary, v1.5 is tertiary
https://huggingface.co/Conflictx/CGI_Animation
how do i use this in stable?
it doesn't let me specify all 3. it only has 1 slider
do you knw how to do my question?
If we’re thinking of the same slider, you just set that to 0.1
what makes you think i know the answer? XD
i asked "do you know"
ah ok. that's what i was thinking then. 0,1 = 10%
"no. read the readme"
license: creativeml-openrail-m
tags:
- text-to-image
- v2.0
- Embedding
thats the read me.
i think you need stable installed where do i get it from?
check the 'start here' section of this discord
kind of should at least make an attempt to figure it out and read the info available before expecting people to explain it to you
he is also not on Linkedin
hi guys
where do i go for installation help?
I think the webui installer thinks Im running an AMD card
because I was before but recently upgraded and switched to nvidia
i have the new drivers installed and card running
and deleted / reinstalled stable diffusion
venv "C:\stable-diffusion-webui\venv\Scripts\Python.exe"
Python 3.10.10 (tags/v3.10.10:aad5f6a, Feb 7 2023, 17:20:36) [MSC v.1929 64 bit (AMD64)]
Commit hash: 0cc0ee1bcb4c24a8c9715f66cede06601bfc00c8
Installing torch and torchvision
Looking in indexes: https://pypi.org/simple, https://download.pytorch.org/whl/cu117
Collecting torch==1.13.1+cu117
Using cached https://download.pytorch.org/whl/cu117/torch-1.13.1%2Bcu117-cp310-cp310-win_amd64.whl (2255.4 MB)```
still says AMD everywhere?
He was on LinkedIn before?
maybe not
He was actually
If you googled him in the past, you should see his LinkedIn profile on the top pages
wdf this man up to
put --xformers in your command args
see what happens
if you have nvidia you shouldn't have any issues
been such a long time since i used AI art generators, what are some nice online ones?
Stable Horde
Not long ago Automatic1111 was almost updated daily, but now I have not seen any update in like two weeks, is it stagnated or have the developers jumped to other projects?
what model do i use if i want to generate images of existing characters
This is not the first time, don't fret
I'm sure he's just resting
is automatic1111 the sole maintainer? I would get overworked with that many issues and prs as well
There are a couple of collaborators in his repo
it needs a better name though. people calling it by the creators handle 😄
"Stable-Diffusion-WebUi" is the name of the project?
yes
hello does anyone know if you can use latent couple in easy diffusion? I'm new to this
how can i get more iterations per second ?
Hi guys i am having trouble training a model whenever i try to train it gives so weird resluts for both 1.5 512 and 2.1 768 models
A help would be appericiated thanks in advance
I am trying to train on fast dreambooth notebook version
I have tried all th precautions or mostly precautions to train a model which i've seen many tutorial videos on youtube
does anyone hear know how to colour correct an image one makes in 3D, and feed it into SD so it spits out the correct lighting and colour results (contrast, exposure etc) and add better details? What is the proper way to set this in action? Thanks
I didn't notice at first, but I'm inpainting a 704x1088 image. I didn't believe it was possible to get more than 1000 pixels in one dimension with 4 GB VRAM
It is, but it draaaaaags LOL
Emad has been gracious enough to reply to me almost every time I have pinged him, so I thought I'd give it a try since there was a chance he might shed some light, how does trying hurt? Either I get an answer or I don't. Does putting others down make you feel big? Why not just be respectful instead of nasty?
Thank you, I don't understand why people have to be nasty for no reason & Emad is pretty cool & most of the time he's replied when I ping him, so I though it was worth a shot 🤷♂️ Then I get a nasty sarcastic reply, which I find so unnecessary.
It seems like his profile is back🙂
I know I saw it, but I can't remember where on the net, but where is there a Waifu Diffusion 1.4 Tagger V2 that allows blacklist words so they never appear in the tags?
I think the creator of it says that vit2 is the most accurate one which might include naughty words
Not worried about that just to tag you aren't supposed to use the proper names (at least for styles) so I find a lot of the time it names the character.
I find swin2 to be the one I use though as it hits the best
little ot, but does anyone know if Dalle2 ever did any updates? just wondering if its worth checking out lately lol
how do I get started with SD?
where do i generate pictures?
Check out #1072220168534642768 & #1072229020520947753
Have there been any successful crowdfunded AIs?
what is this ad for Nitro?
No. Not unsuccessful yet either. The UD is the only one I'm aware of and they're making efforts but technically haven't failed yet. I don't think theyve got the needed expertise to accomplish their promises though.
Waifu diffusion apparently got a donation from crowd funded dollars, but that guy was making it with or without donations
I saw UD but they only raised 50k
Looks like VC funding is the only real way huh
Lol... their patreon is still under revision 😄
Yeah the 50k they raised got taken back lol
They got some money later on their own page, but I don't remember how much
hii
can someone help i installed invoke ui and this happens
Could not generate image.
File "C:\AI\InvokeAI-Installer\ai.venv\lib\site-packages\torch\nn\modules\linear.py", line 114, in forward
return F.linear(input, self.weight, self.bias)
torch.cuda.OutOfMemoryError: CUDA out of memory. Tried to allocate 2.00 MiB (GPU 0; 6.00 GiB total capacity; 5.25 GiB already allocated; 0 bytes free; 5.32 GiB reserved in total by PyTorch) If reserved memory is >> allocated memory try setting max_split_size_mb to avoid fragmentation. See documentation for Memory Management and PYTORCH_CUDA_ALLOC_CONFUsage stats:
0 image(s) generated in 4.90s
Max VRAM used for this generation: 5.64G. Current VRAM utilization: 5.64G
Max VRAM used since script start: 5.64G
like when i press generate it doesent even load it up it just breaks while i was using stable difusion it was workin good
What is the best way to create Architectural visuals. And by that I mean I want to either create a Lora, Checkpoint, embedding or hypernet to apply to existing Checkpoints, or have a Architectural CPT? I have the images ready to create this. Any suggestions/ guidance will be appreciated...
Hello. I have some problems with stable diffusion. I need someone to send me the "venv" folder because no matter what I do, I can't install torch. I will really look forward to help!
just starting the webui-user.bat (or equivilant) should do the trick but you gotta delete/move your venv folder so there's nothing there
You don't have enough vram to generate whatever you're trying to generate.
Lower resolution or batch size
i tried to install torch separately but also without success
the most interesting thing is that in the morning everything worked
did you download an extension?
also sometimes its easier to do pip install -r requirements.txt
I did everything clean, re-downloaded webui, and re-downloaded python
Hi
Hi
How are you?
Good mood!
Nice!
hmmm strange
Yap
🍔
Humbuger
All of the advances and new options and features - still bad hands and bad feet ruining images.
When launch?
use control net and photos of your hands or pose 3d hands
that will fix it
I can't run control net right now
will be once the new system is built...the remainder of the parts arrive today
Struggling finding the right way to set a focus. I'm trying to blur the background, is there a common approach to this?
Anyone currently having this problem? Error checking updates for stable-diffusion-webui-inspiration:
Traceback (most recent call last):
hi guys im new to this after being reccomended by someone else
does anyone know what i should type to make the best shadow the hedghog generation?
I have the ultimate upscalers installed on automatic1111 , does anyone know how to activate it? I don't see it anywhere
For some reason certain models right after the image is done processing it has a filter applied to it, has anyone run into this problem?
Its in img2img under scripts
anyone there?
Hmmm ok
depth of field?
or you can try to do "focus on ***"
Can somebody Ai Upscale an Image for me?
I can I guess
on civitai ? tags are just there to classify the different models on the website, in the search engine for example, by common theme.
Trigger words are what you use when prompting with that model
nope, you don't
unless the person that published that model did really just stupid things on his page, no you don't
ok
is there a place with the list of prompts (like tags on sites) where i can view the triggers
not sure I get the question, but if you are looking for lots of example prompts with the pictures that makes, then https://lexica.art/
hey does anyone know if this is SD? https://gyazo.com/dccac4ad93ba88ff8ed473e6e787887b
Yes for sure
do you know what type of prompt would get that result?
Maybe portrait of woman shouting, open mouth, red lips, blonde hair, angry face,
its possible the image was generated in the dimensions it was uploaded, also possible they did post processing on a different program
If you have the hardware I would.
I don't know the programs since I dont dabble in that
You should have no issue with a 3080ti
It's a deep rabbithole, haev fun
thanks
img2img using "black background" with denoise of 1 is like txt2img, but it puts whatever on a black background. Your feeder image is a solid black.
I JUST PERMANTLY DELETED MY WEBUI FOLDER 😭
I have to reinstall all my embeddings and models
Damn
and all my generations are gone... and my extensions...
How?!
I have been moving a bunch of things around to manage storage and all that, and with multiple windows of the file explorer open things can go wrong...

I am very sad, but this should hopefully help me with my collecting issues of too many things.
Cant you restore it out of the bin ?
It had a popup that said "this file is too big to recycle, do you want to permanently delete it?" And I said yes because I thought I was deleting the correct folder.
Dann :/
If you really want it back and you on an hdd download recuva
its a recovery software
not guaranteed but you should atleast be able to salvage a few of your generations, and maybe get the prompt out of those.
It's probably faster just do reinstall everything, unless you want to try to restore your images
I am about to do something with somebody, will it work if I donwload it later?
I suggest you download it now, you have to let it scan your entire HDD and that will takes hours
Oh
if you really have to go, don't download anything, don't even touch the drive you deleted it from
dont write or read anything basically
By do something I mean play a game with someone
I can do it now then let it run while I play
I've downloaded it, says it's gonna take 25 minutes and I checked "deep scan"
Thank you!
ive been out of the loop since the very very beginning of 2.0 and i wanna get back into it
what are people using to do this locally now, still automatics?
So it didn't recover anything I needed recovered :(
I will deal with it tomorrow
yep, but there are some other interesting tools too, like invokeAI, https://github.com/ddPn08/Lsmith, https://www.painthua.com/, https://twitter.com/DiffusionPics/status/1631803340005818368
wel...invokeAi questionable one here, since it's doing weird things lately
ty <3
Did anyone play or trained a prompt for a anime + selfie?
Are there any models bigger than SD that are opensource (or leaked)
I used an anime based model with a selfie LORA and it worked well
define bigger
does anybody know what the little () do for prompt generation?
more parameters
more weight
they increase the importance for the words they surround
so does more parenthesis mean even more?
you can also increase further by adding more () or by adding (word:1.0) where 1.0 is a decimal value defining how much weight you want to add
it's a multiple
hehe caught me while I was typing
so if you wanted to add a lot of importance to say blond, you could do (blond:1.5)
so if i put (((thing))) it means 3x weight?
SD 2.1 is the highest I'm aware of right now
yes
and you can do (word:0.5) etc to bring it down
the opposite would be [] which is an in prompt negative so [deformed] would be the same as adding deformed to your negative prompt
2.1 has more params than 1.5?
I'm not sure about internally, but there is a 768x768 model available
by params, you mean the number of internal neurons?
(usually what that means)
i think params means more images trained on?
but I have only found 2 open source neural networks so far. SD and laion-dalle2. Haven't had a chance to play with the latter yet
more than 600,000,000?
how do you do it? sorry first time here
yes
Oh didnt know about the laoin dalle2
thanks
I stumbled across it. it only has 256x256 output though
Looks like the results suck
rip
It should be noted that "red house" and "(red house:1.0)" will not result in the same images
🤔
same seed?
It has to do with token count and order. it seess it as a different prompt
Yes
Because it counts the parentheses.
there should be a script to fix that
Quick shortcut tip for new users, highlight a portion of the prompt and use ctrl+uparrow or ctrl+downarrow to adjust the weight by increments of 0.1
or you could just edit your prompt
Imo, if "red house" has something you absolutely love and want to keep for another prompt generation, you can use inpainting techniques to conserve it. Otherwise I think it's easier to accept that each generation may not be 100% what you expect from the start
Sway, were you asking about larger models because you want to know if you should learn how to use Stable Diffusion versus another AI generation program?
No, for fine tuning
is there a way to automatically have a negative prompt when you start up SD?
you mean automatic1111's webui? No, but you can set a style that just has a negative prompt
@fast jungleOkay, this took me too long to find because I was looking too deep
but edit "...\stable-diffusion-webui\ui-config.json" and you can set the start-up settings for the webui. Change the file, save, and restart the stable diffusion server to check if it applied
You mean @fast jungle
sorry
although it's a nice thing to know for everyone I suppose lol
you can also have it automatically apply a style if you prefer to not see the negative prompt text
ugh. I love the accuracy of blip2, but it's extreeemly slow
wassup guys, new here, just started playin with stable diffusion, my idea is to put my branded clothes to an ai model
any idea how to do that?
idea would be replace the normal models, just photoshoot my clothes and somehow let a ai model wear it
Search "lora training" on youtube and make a lora of your clothes
thank you! later i can change to model also right? just change the "lora model"?
assuming lora is a girl model
Yes. The key is that make sure that what you want the AI to learn is the only thing that is consistant in your dataset
i see
pose is another thing that you want to keep inconsistant between your pictures
different models, locations, poses, camera distances
anything that is the same, the AI will pick up on
so later when i am more familiar with it, i just need to teach ai what do i want to keep consistent
and what not
right?
right. you train on photo sets, so for an outfit, the only thing that you want consistent is the outfit across your photos
i see thank you
np
¨hi
is there a way to know a model is safe?
download for a trusted source. which model is it?
anything v4
where are you getting it from?
It should be ok. Most models on huggingface are clean
but i was just wondering if anyone has used it and if it was a virus
The models in that repository are safe to use
my internet is super laggy rn
The repository has over 120,000 downloads in the past month
if there was a virus ssomeone would have said something
where
Go to the link the person linked and look to the right hand side
oh ok
i was also wondering about https://github.com/Xerxemi/sdweb-auto-MBW
does anyone know about it
Hello All, I was wondering which is the best channel to ask about setting up a PC Stable Diffusion build?
A PC stable diffusion build is basically a gaming PC
with the focus on a GPU with lots of VRAM, and fast storage. CPU is not a focus
Hello, i have a problem, i'm trying to use kohya_ss gui Utilities>Captioning>BLIP Captioning, but it don't work. the error alert and messege is:
No files with extension .txt were found in E:\BLIP_Captioning_test...
Captioning files in E:\BLIP_Captioning_test...
./venv/Scripts/python.exe "finetune/make_captions.py" --batch_size="1" --num_beams="1" --top_p="0.9" --max_length="75" --min_length="5" --beam_search --caption_extension=".txt" "E:\BLIP_Captioning_test" --caption_weights="https://storage.googleapis.com/sfr-vision-language-research/BLIP/models/model_large_caption.pth"
could any one help me? 😦
https://user-images.githubusercontent.com/127019204/222954290-b10c2555-56b9-42ea-8a84-eedd9fee3a3a.png
https://user-images.githubusercontent.com/127019204/222954352-92276ada-7640-485c-895e-300539a8ed24.png
Thanks @fervent thunder I am wondering if a 4070 Ti is ok? it only has 12 GB... I can't really stretch to a 4090, but possibly a 4080
If you feel 12gb is good for you, then its good
for whatever you are generating
I'd suggest 8gb min
Practically any 4000 series card is good
If your interested in training then more vram the better
yeah I read 8GB is the minimum
I see so many that want "realistic" images, but then they use "hyper realistic" in the prompt and think this is the same as "very realistic", but "Hyper" realism is a more dream like "realism" often with more saturated colors, and like clear blue eyes and perfect skin and so on. If you want realism you shall use words like "natural", "normal" and "life like", just add "realistic" if your model also can do fantasy and Anime.
4 is the current minimum to make it run, but even under that, CPU technaly can make it too at a snail pace
I just put RAW photo and it just werks
@fervent thunder I am interested in the training specifically however, would I be better picking up a refurb 3090 with 24GB of VRAM than a 4080 with 16GB of Ram
purely for training, yes
so a 3090 will be better?
RAW photo is just a unprocessed photo with often flat and contrastles color palette.
I suggest you replace "RAW photo" with a prompt of a photographer or artist that make images you like.
24GB will be better yes. The clock speed is close to the same, but the difference in VRAM opens the door to more batch size => more speed/quality in traininjg
benchmark of VRAM, speeds, ...
(pinned in #1011228477954998273 )
depends on what 3090 and what 4080 a little though, there are different editions of both
ahhh brilliant thank you @vast ingot
RAW photos are often what we see as "blurry", a processed photo often add sharpness that look good on a screen or in print, so I try to use "Pentax photo" in hope to get how Pentax often process images.
@vast ingot So what card would you recommend other than a 4090 (out of budget)
just looking at the spreadsheet...
Another pointless prompt word I see is "Best quality" and "Worst quality", those often does nothing then there is no definition of what "best" is and what quality you want 😄
I just wish AI had understanding of "Add gaussian blur on sky" and so.
I went with 3090TI for the training rig personally.
If you want to make pictures only, I would recommand checking the good price point around you for a 30XX or 40XX series in the 12GB VRAM range
If you want to train just from time to time, I wouldn't oversize the PC for it, I would rely on google colab free tier personally, I still do it those days even with the computer
If you want to train big time, 24GB is the target in my opinion. So that's only 2 cards right now (unless you want to go datacenter-at-home path), and I find the 4090 overpriced around me, but it may be different for you
It really depends on the budget and market, so hard to really give the right tip
@vast ingot Superb man, really appreciate the feedback here... 4090 is too much for me unfortunately
I'll look into the 3090 TI then 👍
quite the beast tbh, I'm loving it
but yeah, I wouldn't recommand if I wasn't gonna train lots of things on big datasets
still quite the pricetag
cool, but you say you still do the training on Google Colab?
yeah, from time to time, depending on what I want to do on the computer
collab works great, you start it, let it run, and harverst your google drive
so if I want to play a game, I love it
yeah I mean, I have been using Scenario GG services, but figured I would like to get under the hood so to speak
Hi! Does anyone know of any models focused on buildings? I'm specifically looking for medieval fantasy buildings in an illustration style. I know I can get some decent results with more general models but I was wondering if anyone knew about one specifically made for buildings/architecture in this style. Thanks!
really fun hood to get under, lots of buttons to push and try their impact, lots of things to learn. I hope you'll love the trip 🙂
Cheers @vast ingot really appreciate yor input
Hi
Am new to stable diffusion , and have very limited knowledge about this area of tech
So far I have experimented and generated some awesome pieces using playgroundai
but from what I see and understand is that there are some limits when using an online service , you either have to pay or there are some limitations as to what you can do
If installed locally , will there be such restrictions ? will i be able to use it at its full potential ?
Pretty much the sky is the limit
Greetings to all of you who want to revolutionise the future with AI!
This is your boy, CyberG. We are "GRINDA", a startup that aims to solve the problem of data copyright for Generative AI. We have a top-notch team based in Korea, and we are looking for global developer team members. We are currently Seed funded, and many VCs are interested in our item.
We need your help. Feel free to DM me anytime.
The positions we are looking for are as follows
- Full-stack development (proficient in using OpenAI's APIs)
- Blockchain development
For more information, please contact us and we'll be happy to hear from you. Your AI, your creation!
We are looking for a senior machine learning engineer with experience in developing and deploying AI models. If you are interested, please DM me and send evidence of exceptional ability.
locally theres no restrictions
hello guys
small question
difference between cfg scale and image cfg scale?
(from x,y,z scripts)
If you select it then you can hover over it to see more Information
has anyone made an easy exe installer for SD yet?
nothing comes up when hovering over image cfg
Yes they exist, google automatic1111 webui easy installer
how can i check if stable diffusion knows of some embeddings i copied into the folder? im using webUI and the embeddings dont seem to have any effect
thankyou
Hey, guys! How can I upscale a seamless pattern tile without losing the feature of being a seamless pattern tile?
I can't create
👋
wen deepfloyd
I heard it's soon
there's a tight nda on it so anyone that has access to the preview for it can't say shit. if anyone is mentioning it at all, you can be sure that they're not part of that preview
Hey so I have an issue when I try to use high res it comes out with a bunch of random like text or other elements but then it sometimes it doesn't do it I have no positive prompt asking for text or anything and I also have a negative for text anybody maybe know why this would happen?
yo
Do high res with high res fix , without it it's kinda risky, often gives you repeating objects or just a mess.
text, signature, watermark, username, artist name, trademark```
- negative prompt related to different kinds of text, but it still could happen, but should be alot more rare
Does anyone know an up to date guide on using google collab with SD? I'm entirely new with the collab service (and huggingface) and lots of them are very out of date.
Also, how good are the free collab gpus? My local one can only upscale to 800x800 so I assume better than that
you can only upscale to 800x800?
on your local machine?
Sorry I meant highres fix ( so I can only do 1.5x on a 512 image). Extra upscalers I can go to like 4k
oh ok. i was going to say you should be able to use extras>resize to get much larger photos
Hey all, question: other than huggingface, where's the best place to find custom models?
Which version of Cuda do I need for new version of a1111?
Got an error trying to update today saying I'm using wrong version...
The question is - which version is correct?
The detected CUDA version (12.0) mismatches the version that was used to compile
PyTorch (11.7). Please make sure to use the same CUDA versions.
11.7 ?
and why is it suddenly a problem after all this time =.=
civitai
Like, is the text a small thing in a corner? If so maybe add "signature" to your negative prompt. Add parentheses to increase the weight. As in "(((signature)))"
Same goes for the words writing, text, words
I have to ask: I download FaeTastic SafeTensor model and it say it need a VAE file, that file is called "kl-f8-anime2.ckpt", how do I know that file is "safe"? and shall it not have same or similar name as the main file to make it easier when we want to remove stuff? https://civitai.com/models/14065
Normally I just download things, but then this is new and include a smiley, and I am old so I often see that as "It is a joke" or "it is a joke".
civitai scan and verify the uploaded files. check the link under the download button
Anyone here that knows how much progress there is on research for thoughts to images/music? generative AI combined with your brain?
Welcome ! There is no bot currently to generate your images on discord. You may want to start by taking a look at the #1072220168534642768 channel. You can access Stable diffusion in different ways : 1️⃣ the official website, https://beta.dreamstudio.ai/. The easiest and fastest way to access Stable diffusion with 200 free credits. For any question on it, you can find help in the #1025467151206854736 channel. 2️⃣ Installing Stable diffusion on your computer. There are numerous projects that let you do that, and you will find help in the #🤝|tech-support channel. 3️⃣ Running Stable diffusion in the cloud, through rented GPU services, using notebooks. You can find lots of them shared and discussed over in the #1011228442399883294 channel.
Anyone have any suggestions for hardware for someone starting out with nothing? I want to local run SD and would like to eventually be able to make videos with it, so I'm wondering what the biggest value is while also being pretty good on performance.
And I'm wondering if there's a prebuilt rig that anyone would recommend or if anything thinks I really should be building it myself. Everything I have now is nearly 10 years old, so I pretty much need new everything.
Whats your budget
I don't have one exactly. I'm just trying to hit the sweet spot of "can do a lot of images fast/good for potentially doing video work in the future" but also isn't "top of the line/latest and greatest." I also don't really need peripherals included (monitor, keyboard, etc), just the base unit.
I'd like to get to the point when once I'm skilled enough, I can prompt what I'm looking for and not be waiting around a ton for a lot of samples and can basically streamline the finetuning of an image I want to make
That's the speed capability I'm shooting for
The vae is from the waifu diffusion model and its safe to use. It goes into the models/vae Folder and has to be manually selected
CPU doesn't matter, GPU with atleast 12gb vram in the upper 3000 or 4000 card range.
Ahhh, I gotta select it, goint to settings, I had forgot that even I did read it :/
https://docs.google.com/spreadsheets/d/1Zlv4UFiciSgmJZncCujuXKHwc4BcxbjbSBg71-SdeNk/edit#gid=0 scroll to the bottom of this google doc, it has a reverse parreto diagram showing the benchmark
Knew I had seen that somewhere before... Is prebuilt anything a good idea at all? Do any places sell computers minus the GPU? Because it sounds like that's all that really important/if I go shopping for the higher end gpu I might get upsold on the rest of the tech.
You will need an 6-8 core cpu best to go with AMD, then a GPU with minimum 12gb vram, 32gb RAM and SSDs
If you have no desire or drive to build a PC yourself, prebuilts are an option. Just be aware of the consequences of buying pre-built hardware from a company that aims to make profit
The problem with prebuilds are that they put cheaper parts in and dont tell it like noname PSUs or crap Mainboards
I don't dabble in that so I don't have any prebuilt's i can suggest
If you want to keep your current build, you can rent a GPU from cloud services for 20 cents to 2 dollars an hour for a stable diffusion intance
Right, that was another question I had: If I run LORAs and models off of a hard disk, if the fetch speed loss be negligible considering the compute time of the GPU doing the task? Or is running off of a hard disk a huge hinderance?
20 cents for like a 3060 and up to 2 dollars for high end ML GPU's with 80gb vram
Running off a hard disk is a hinderance absolutely
You can get 500GB sata SSD's for like 70$ nowadays
It dont change any time in generation speed cause the models gets loaded into the RAM first but it will take more time to loads models
Just make sure that your Windows is installed on an SSD
Preferable an M2 NVME
Will it work regardless of my setup? Because I have really old ram and CPU. Not sure if GPU renting means literally just the gpu or the other peripherals.
And is gpu rental not a similar slowdown/hinderance than local execution?
Renting a GPU means that the company you are renting from has a virtualized instance with the GPU attached. On your end you just ssh /vnc into their instance
sorry for the dumb questions within the smart ones. Computer savy and smart, not computer expert. lol
You could use Google collab for free like many others do for SD
Its cloud Computing so it dont needs your pc hardware
I do, the problem I'm running into is the free space barrier on google drive/see the benefits of local storage
So the rest of my setup is irrelevant? So long as I'm running windows off of a solid state that is. I think it's a 2.0...
Since you're doing video stuff, you want to make sure the motherboard your getting let's you fit in plenty of ram, and that your GPU is beefy in a sense
a seven or nine series intel CPU will also do you very well
Heres an info guide for you
https://www.pugetsystems.com/solutions/video-editing-workstations/davinci-resolve/hardware-recommendations/#gpu
I like to have the option so that if I do get heavy into it, I'm not finding myself hitting tech barriers after just buying something big. lol
That's a very understandable thing, which is why I don't suggest you buy a prebuilt
When you build a PC from scratch and you need to replace or upgrade a part, you will know how to do it very easily.
as you literally built it from scratch
does anybody here know how to use the built in pix2pix? i installed the checkpoint and i dont know what to do from here
Another random question for you all: is this a potential skill set/job opportunity I should be considering/cultivating? My gut feeling is "no" because everyone is/can do it and it seems like the learning curve to get the desired output doesn't lessen the pool of people with the same skillset all that much. But considering the time and financial resources I'm leaning toward putting into this, I'd be foolish to not try and look at it that way or at least exhaust it as an outcome.
Load the checkpoint and switch to img2img tab, then place a picture of somebody in and just type make him/her blonde
Then press generate
For example
where do i put the checkpoint? stable diffusion or pix2pix
In a corporate sense, yes it already exists.
Its something that is an addition to a skillset, but a main one
You need the pix2pix model file into the models/stable-diffusion folder
For personal use I wouldn't rely on it unless you live in a very low COL country
The vast vast majority of models are public for anyone to download and use, plenty of resources. There isn't really a monopoly on this compared to AI chatbots
What a nice way of putting it. lol. Yeah, this does feel very "starting a youtube channel" adjacent in that dept.
The money to be made on AI art generation is hosting websites that people pay to use
i have the model on and i have it a command, do i need to select something\
You need to select the model in the webui then go into the img2img tab and you can use it
Makes sense. Another question, where is this tech at with generating scenes with multiple actors and interactions? And how far is it from getting perfected? Very, hella, or "bruh no one knows?" lol
Good to know. I originally got into this wanting to make some art with several people. SFW, believe it or not. lol
It's doable you just gotta get the prompt down right with the model
once you have something that works you can just generate endless generations , majority of them looking nice
Fair point. So much to remember... lol. Is there a place anywhere that is a good starting point for like various combinations of +/- prompts associated with various genres of outputs people are looking to get? I've been dabbling largely with just kind of comes to mind and I could really probably use something like that. haha.
When you're downloading a model, the creator 99% of the time posts images they have created with the entire prompt
Civitai lets people comment on models and post what they have created, and lots of times they include their prompts
aibooru has lots of stuff with prompts
prompthero also exists
If you ask nicely on 4chan someone might share their prompt
Lexica.art is also good
I'm guessing they also include the trained model they may be running as well as any LORAs they have installed?
Gotcha. That's another thing I'm having a hard time wrapping my head around: Like... How did we get around before without LORAs if we wanted to have multiple trainings take effect? Did we really just have to merge/couple 2gb models over and over? And what if you wanted to take one out? And I know I'm missing something here, but that's where my head is at right now.
And when does it make sense to train a whole model vs a LORA (assuming your hardware can run either)
When merging models, it leaves the base models intact and they create a baby
before lora's it was embeddings and just pure prompt skills with luck
and finding an appropriate model
If you want to train a specific artstyle from an anime artist or anime t v show, it makes more sense to do a lora and use it with a dedicated anime model
Interesting... I think that makes sense? So styles are generally loras and specific people are best for models?
I kind of already knew this, but just checking..
Lora's are basically just fine tuning the model
Can apply for characters, artist(styles), positions, etc
Gotcha.
I've learned how to create a model for a specific face, but how do I fine tune the face to look like 99% the subject? Because right now it's more like 80-90% that I get using the model with vanila
I don't know I dont dabble in training
Hello, I am a student at Tallinns Secondary School of Science. I am doing research on image generating AIs. I would be really thankful if you took some time to fill the questionnaire.(it takes 5-10 mins). https://forms.gle/B4tTjDS2fyrCpzkYA
Sorry if the questionnaire is a bit primitive
oh no. You've invalidated all of the rest of the help you've been so kind to provide. lol
So if building myself, what's good value/more than adequate hardware for non GPU specs? It's all gotta matter something. lol. But it also sounds like that's where I can save money potentially
For price point, 1tb sata ssd's are very accessible
for "good value" i would invest in the most recent i7 , as i7's tend to last longer in regards to relevancy
value... i7... dude
motherboard and powersupply?
check out this website, they have build guides and you can search for the best prices for any component. Just make sure you get an nvidia GPU. most of the build are AMD based but its a decent way to compare https://pcpartpicker.com/guide/
@delicate oxide W

what is the name of that website that uses the automatic1111 api and theres a checkerboard background?
any other good prompt websites like:
https://lexica.art/
https://civitai.com/
where i can view images and prompts used to generate them ?
thankyou these are fantastic
you're welcome
Hola
this was it, thanks!
Why do they call it a prompt when it takes so long to generate an image? It's not prompt/
Outpaint does that...the checkerboard
there is an add on for A1111 11 11
11 for Outpaint
Stable Horde = WHAT!!!???
Hey quick question. Does anyone know what the purpose is of((text)) the surrounding "(( ))""
yea , but this thing is just easier to work with , better ui and stuff
I know when you want to increase or decrease weight, you do something like, (text:0.9).
just not sure what the double does
higher word \ phrase weight
ah okay
in a111111 you can do (thing:xx) instead
is there a list somewhere of other tricks like that, or is that about it
there is this to generate one thing for some step and another for others
[thing 1:thing 2:##]
it will generate thing 1 for the number, and then thing 2 for whatever steps remain
there is a link to these things: prompt engineering or somehting
right
I would have no clue when to have something do something, like not sure which steps do what
i just assume the more steps you have, the greater detail
didn't know there was a greater purpose
50-37=13
if you set 50 steps, the rest go to thing 2
you can do script prompt from file or matrix - then put many prompts in there
each one will generate for the number of times you set in batch count
so you can try different changes, then let it run
yeah I was wondering if there was a way, to quickly paste in a large section of a prompt, that i tend to reuse
or right click generate forever - then change prompts after a while...the next generation will take the change you made
oh okay, so run a bunch at once, to test out how it impacts
instead of doing 1 at time manually
for a prompt, you can save style
or load the image in PNG info, and then send the prompt to img2img or txt2img
👍
But you don't want to do something once
give it many chances to show what it does
okay I am going to try to see if I can figure out how to do the batch tip you gave first. Biggest question is figuring out how to script it, so if changes the number each time.
I wish I could remember that link
So I am doing a 90 step test. I am curious to see if this works, [(detailed pupils:1.1):glasses:60]
or is that redundant using the two strategies together
No Stable Diffusion, you belligerent body of bits and bytes, I didn't want a backwards torso
You can do it. It will give a weight of 1.1 to the detailed pupils, and do it for 60 steps of the 90
but for most samplers, there isn't much change after a certain number of steps
Yes, that's when it does it
I want someone running away, but looking back at the camera
it gives me a backwards torso, sometimes one backwards leg
[tiny prince:tiny wizard:60] so that would make a more wizard-prince favoring the wizard. Def going to have to test that out more. Pretty cool.
oh shit
but again, most times nothing changes much after a certain number of steps
100 steps is not better than 28 if nothing much changes after 28
ah okay
60 you are likely running a lot of steps that don't do much
For eular a, for example, not much changes after 28
I don't know exactly what will happen with 100 mixed, but I think you'd get 28 with prince then not much change, 28 with the wiz then not much changed
then different models do different things
so try it with different models
realistic models still change on high amount of steps with DPM++ samplers
on anime models usually there's no reason to go even above 25
example of a realistic that changes on high, with a grid or steps showing the changes...
I don't have it and not gonna waste time doing that again, but you can try if you want to.
OK. I wasn't disputing, just wanted to see it
but idk if it'll work with mixing prompts , never doing it ¯_(ツ)_/¯
https://huggingface.co/Anashel/rpg/resolve/main/RPG-V4-Model-Download/RPG-Guide-v4.pdf
There's an example in docs for rpgv4, scroll to page 9
it sure does change at later steps
Silly question. I've seen people put "1girl", "2girl" and "3girl" when they want the respective amount of girls. But is that just regular prompting or is it an embedding/hypernetwork?
Thats a prompting method for Anime based models, based on booru image boards
hey
sup
another thing you can do is any number thats between 0 - 1, like 0.5, will be a percentage of the steps. so 60 steps would be 30
Does anybody know what's the best way for fine-tuning on stable diffusion ? Renting a GPU server or buying a desktop pc ?
my current system is iMac with 16GB RAM & AMD Radeon R9 M290x 2 GB. Am I able to work with this system?
These two links may help
If the processor is an M1 or M2 chip then yes but not with the 2gb AMD card
It says : it is incredibly slow and consumes an excessive amount of memory. So it doesn't worth it.
Sorry. I'm on a pc best I can do is provide the docs
What's other alternative? I want to train my model. What should I do?
Google collab is then your only chance
Have you had any experience on colab? I used a free version and everytime I should run & install it again. Also I got error when I wanted to do fine-tuning. If I buy a pro ver do you think I'm getting good speed?
I have pro and get pretty good speed when I use it
Thx, so I'll try. your plan was $10/month ?
Yes and I upgraded my drive for $2/month so I had room for models
I see, and are you able to work with dreamstudio/dreambooth as well ?
Yes.
@lament steeple I just bought colab pro. but unable to find AUTOMATIC1111 link for google colab. Would you please let me know if you have ?
one sec
maintained by TheLastBen >>> This one is ok ?
I see these options:
maintained by TheLastBen
maintained by camenduru
maintained by ddPn08
maintained by Akaibu
TheLastBen is the one that i have heard of other than the one that I sent you
Where the hell has gone the website thispersondoesnotexists .com?????
It was so fcking useful ???
Hello 🙂 Is anyone getting bulk spam invites from Maze Guru? I've got like 15 dms from users in this server. I'm not including the link ):
" I found that you're interested in AI art, I’d like to invite you to the Maze Guru ! Maze Guru is a free AI art generating tool with new styles . You can use it with bluewillow to make your artwork unique! Feel free to join, no pressure. (Just sending this for an invite campaign, I hope this doesn’t annoy you)"
It's a bit annoying...
Hey, report it at #1010934719455707218 and then block the users that spam you or ignore them and close the chats
Okay, thank you kindly 🙂
how to off blure in Dreamstudio?
Its the nfsw filter you cant deactivate
Nope, but if you have a nvidia GPU with 4gb vram local would be the best
Or you can try Stablehorde but idk of they support nfsw still
The last option would be Google Collab
Noooo none of us do nsfw work 🤥
mmm can't wait for Phantom Liberty, get back into the world of Cyberpunk now that it works
also saw Edgerunners yesterday, god damn that was good
I purchased colap pro, but it seems there is no difference with free version for me! I installed SD 1.5 and was able to work but after I wanted to setup dreambooth extension I'm getting this error:
No interface is running right now
yeah colab pro sucks
it's better to rent server time
like say 10 dollars for 5 hours (or more) on an A100 80gb
prepare everything so you can run it immediately and go for it
Have you had any experience with lambdalabs.com/service/gpu-cloud ?
So you are working on your pc ?
yeah, mostly doing merges and loras if that
now Im learning control net and latent couple
So the thing that I don't know yet, if I don't work with installed SD for lets say 10 mins then on colab pro the session expires & I should install it again ?
Hi. Does anyone know of any index or database that shows illustrators we can add to our prompts?
I have a few csv files in my g-drive hold on
make sure you've got it on your own colab drive and not working on someone elses, also make sure its persistent. Not sure how to do that but pro s hould be able to no?
or set up a script to activate everything
Всем привет я новенький
welcome around 🙂
there are some people talking russian around, but the main language you'll see is English
Ohhh nice. Thanks!
np
Hi! I am trying to run a stable diffusion pass on a series of renders I have done in blender. It get an error message saying "PIL.UnidentifiedImageError: cannot identify image file" when I try to do it however. It works fine with an image created in photoshop however. But when I try and edit the rendered image in photoshop and resave as jpg, it still doesnt work
anyone know why that could be_
?
My Stable Diffusion with Web-Ui made by Automatic1111 often give me errors, I have tested many things and installed and removed scripts and files so I think that files may miss and config files been altered. So what is the esiest way to re-install or check that all is good.
I havent changed anything
maybe updating it will do?
hm, tried but still not working
is it safe to generate 512 x 1024 images now? whenever i did it like 6 months ago it made very creepy images
should be fine
is it fine to combine multiple Lora/embeddings together?
ive been getting some strange results while combining them, im not sure why
maybe on 2.1 768x768 model it'll do better, but on 1.5 trained on 512x512 - most of the times it'll be repeating things on the image or just a mess
whats the best source of images with prompts? to try to reverse engineer good stuff and learn how to improve your own generations
It's not as simple, since different models work with prompts differently and some of them understand words other models do not
For specific model - look model generations, for overall - https://lexica.art/ is your friend
I'm trying to get ComfyUI to run and it says this: TypeError: LatentDiffusion.init() missing 3 required positional arguments: 'first_stage_config', 'cond_stage_config', and 'personalization_config'
Any ideas how to solve this?
lexica.art i think only hosts examples of their own very specialized model now at this point in time
yep
We now have a #1080946152318443610 channel! 
If you have any questions for that particular resource - please ask them in the linked channels for each section! 
👋 got a question, i found an AI generated picture, where / whom might I ask what ckpf file they have used for it?
or is there anyone who would be able to tell?
Unless they have the metadata in it or they specified it may be hard to tell
Not all models standout enough to tell without it
I'm new to the scene and I own an amd 6750xt. Is it possible to get SD and all of the models currently available to Nvidia users to work for AMD in windows?
Well it might be something that's very well known between people because i've seen the exact same style on many different pictures
Send it my way I might be able to tell
anybody familiar with running automatic on mac? I have an install running but it won't load any model besides 1.5.
Does Emad have an Instagram?
idk but he has twitter
no emad pics there
bro
Has anyone here had a look at StyleGAN-T? (just watched the two minutes paper video on it)
Is it possible to run SD on client side?
Yes
What file formats can OpenAIs whisper use?
any resource on how to do that? or some more explanation would be great. thanks!
hmm i have used this but didn't think that way.. my bad lol
the issue is i was building a project which used replicate's api but it's limited in free version. so was wondering how else can it be done before buying..
@fervent thunder for your context.
is anyone here good with control net that can help me tune it to do something a bit different
I use automatic1111's webui via api calls all the time. if you start it with the --api switch then go to localhost/7860/docs you will get the api documentation
Hey how many pictures and repeats do I need to make a lora based on an artist's style?
My first instinct was to let this one pass by because I think it's a really bad idea, but as someone who has trained synthetic artist TIs and someone who has read Title 17 U.S.C, I can tell you not only how to do it, but how to do it without it biting you in the legal ass. You need 20 pictures and never put a single copyrighted artist in isolation in your prompts. If you do use copyrighted artists, hide your prompts. The second you do anything that looks like you might be competing with them (showing up next to them in Google) fair use does not protect you.
you mean epochs?
Thx
Did you mean Epochs? Usually about 20? Or Class Images? 10 per training image
is anyone here good at tuning control net?
I am trying to use a collage image and a seperate image to make a collage that looks like a bigger image
Is there a way to have batches of images automatically print a number onto the outputs? I am running large numbers of tests, and I need a better way to keep track of which prompt produced which image.
Do you have the setting enabled that generates a text file beside every generated image? Text file includes all generation parameters, incl. model hash, seed, prompt, upscaler, etc..
Generated images also have metadata that includes prompt info that can be read. If you're using the A1111 webui, there's a PNGInfo tab that let's you see which params were used in a generated image.
Anyone know a good model for pencil sketches?
or a good prompt for pencil sketches on base SD
In it's current form Copyright laws do not cover AI Model training and you can find plenty of articles on the USPTO that shows them debating how to handle this. As of now we are in a legal gray area, until laws are passed and cases have been brought to trial. Dont take my word for it, just search the USPTO website, you'll find plenty of information on the topic.
https://www.uspto.gov/sites/default/files/documents/ITIF_RFC-84-FR-58141.pdf
I've read copyright law (Title 17 U.S.C) have you? I've worked law. Have you? You can either follow my advice which hurts nothing or take unnecessary risks. It's your butt
Don't have it enabled. Might consider that. I already have the prompts in txt file, so I was hoping to just put a number onto the picture itself. Thanks.
hey this may be a dumb question but is there a way to feed my own images into Deforum Stable Diffusion to create a (2d) animation? I'm using the notebook and would love to be able to feed in a few images rather than use the prompts.
Yes I built and ran an Intellectual Property rights management systems (AKA Copyright management) for years and have worked in 3 of the largest law firms. If you've worked in law then you should know that this needs to be established by case law (precedence) or through new legislation. Neither one has happened yet, so all we have now is debate and speculation nothing more.
It still doesn't hurt to take steps to protect yourself
Hello. I've been seeing workflows that end with "Model hash: 9aba26abdf, Model: deliberate_v2". What do they mean and how do I use them?
https://civitai.com/models/4823/deliberate you download the model and put it in your models/StableDiffusion folder. Then it will be available in your models dropdown
Thanks!!!!!!!
Hey all, long time listener, first time caller. Found this very usefull: https://stable-diffusion-art.com/prompt-guide/
Hi I’m new here
I haven’t done any of this before
Is it possible to copy an artists art style lol
Alright I’m an artist myself but it’s important since I want to make sprites
So is it also possible to place different faces on same sprite too
It is possible to replicate an artstyle
How to use the bot
There is no bot
dream
Is it easy?
https://www.youtube.com/watch?v=70H03cv57-o if you have basic pc skills you should be able to do it
Can I get a LORA guide
The 4chan one is pretty good, you can find it on /hdg/ at /h/
Have you followed it?
Yes
I personally prefer to train TIs because they don't rely on tokens already trained into the model, but they do require more VRAM
so if I want to train on a character from an anime that isn't named in CLIP somewhere am I SOL with a lora?
There are 4 different ways to do it. Lora is the way that requires the least system resources
TI requires more resources than a lora?
For training, yes. it's also slower
Are the other ways hypernetworks and full fine tuning?
but I find that I get better results and you don't need to pick a rarely used token that is already in the network
exactly
for one character you don't want to use those two methods. Those are for a more broader scope
another advantage to TI is there is a way to check the TI as it is being trained to make sure you don't burn it.
But back to my question earlier, if I wanted to train on say, Goku, - assuming its not in CLIP already - will it fail to work at all with a lora?
if you have the resources to do it. I think you need at least 12g VRAM. my 8gig wouldn't do it
no. it would work fine. You just need to make sure have enough pictures. The guide covers what you need.
I got 24gb 3090
Oh you''re good to go. you can train anything you want using any method you want
Can I train a lora on just a bunch of animes or what would happen if I did? Would it blend all the anime's styles into everything SD gens?
Why did a bot just message me?
I just deleted the message without really reading but it was very weird. -.
never trained a lora on more than one concept, but it is supposed to be possible
You may be better to finetune on that or do a hypernetwork
any tutorial on training a LoRA either based on style or character?
dragonforge just posted one above
oh thanks. you too DragonForged
OK. I'm going back to the books. See ya all in a bit
ty
Any idea why SD would suddenly stop recognizing all the TI's i have in embedding folder?
i check textural inversion and it says "nothing here add some content to the following directories" then i go and look in it and all of the ones i had are still there... it's just not recognizing them
Did you change models?
if the TI isn't compatible with the model (1.5 ti vs 2.1 model) it won't show in the listing
ooooh. yes i was using a new model as an experiment.
i didn't know they just wouldn't show up at all
says SD 21 on the model so i assume 2.1
guess i have no embeddings for 2.1. all the LoRAs still seemed tobe available though
They may not work right. can try them, but may give unexpected results
hola
Question for you all: Say there is something that I get from an image that either shouldn't be there, or is missing. Like an extra arm that's across another actor/person (so not just over unimportant background) or someone is missing a leg and I want to add it or like the leg is wrongly positioned and I want to remove it from one part of the image and add a more appropriate one in another part. How do I go about fixing this?
So I really like the output of 95% of the rest of the image, but I need to fix the other 5%. Is there a way to "photoshop" in or out certain things that isn't just rolling the dice on prompts? Because I've tried inpainting with positive and negative prompts, but doesn't seem to really work and if you inpaint enough times the surrounding area gets very sketchy and the image gets to be like etchings where the original integrity just deteriorates.
I'm still new to SD, but I'm not sure how I'd go about doing this efficiently, I think I'd only be using half-asses round-about methods that would give frustrating/poor results. Like the inpainting. Because I feel like I can do this very easily somehow and don't need to be opening photoshop.
You would do this with inpainting which is under img2img
Sorry you mentioned that haha, ya it's a gamble
Damn. So photoshop is kinda the most efficient way, rn? Quick edit out or in something and then use touch that up in SD and then the result is the new base?
As far as I know. There is actually a Photoshop SD plugin now but I have no experience with it
youtube vid says "connect to local stable diffusion installation." Whelp, guess I ain't doin that. XD Even more reason to get the hardware
Take it straight to inpaint, mask it off and tell it to regenerate that spot. it's a common problem and part of all of our workflows
Maybe that's something I don't yet understand: What do I change in the prompts? My brain wants to basically "remove artifact and/or ad another that isn't there," but how should I be telling the ai that?
Do I only use prompts specific to what I want to see? Or do I keep my original prompts and basically fill in what it did wrong in my eyes? For example if I have a hand that is missing fingers. Or more likely, a hand that is interacting with something and the hand isn't totally visible, but you can see that something is clearly amiss. What exactly do I need to be masking off so that the ai can take into account the total relevant environment? Is this a situation where I would have it "inpaint" the whole image instead "masked only?"
I guess what I'm asking is how do I best protect the other artifacts around the problem area, be they background or directly interacting?
I vary rarely change my prompt when I'm inpainting. When I do, I tell it what I want in the inpainted area
in inpaint, it will only change the masked area
You could also have it only paint the non-masked area
And you can adjust the mask blur to adjust how it affects surrounding pixels
^^ there is a checkbox for that
a check box for which one? lol
A checkbox for switching between changing the masked and unmasked and a slider for changing the mask blur
I feel like I'd rarely used "unmasked area" for dealing with artifacts though, right? Be they ones I want to keep or get rid of
I have never used it
Like, if someone is missing a leg, what's the best way been in your experience to fix that? In terms of how exactly to mask and what settings changed? Same if there's something there that shouldn't be, especially when that same thing is over something you want to keep, be it the part of someone elses body or something important in the background you'd like to disturb the least.
because just masking the leg and running the same prompts doesn't do the job, right?
*masking where it would be I mean
It will usually work for me. even if I just get part of the leg regenerated. then I just keep going. Sometimes it just takes a few tries
Don't be afraid to spend time on your work 🙂
Ok, I guess I will give that a try. Is the mask just always telling the ai "there's something wrong here, fix it," and it does?
Yes. sometimes it gets confused about what, so don't don batches of 1, I usually do 4x4
don't do*
Haha, I'm certainly not afraid of that my biggest fear is just finding a difficult way to solve something and then repeating that same process in other instances only to learn I've been doing way more than was needed DX
AI art is like any other medium. There are as many different ways to do something as there are artistss
I was just talking to another artist the other day about why I manually regen at a higher resolution instead of using hires fix (the answer is control) but I just prefer to do it that way
It's not wrong. It's just different
GREEN is not a creative color
You can't do summer trees with out green!
but you should see the castle I'm getting ready to turn into a throw blanket
sorry. i'm just spilling weird niche references again. youtube parody of childhood education. very dark stuff. https://www.youtube.com/watch?v=9C_HReR_McQ
what is this place
I gotcha. Can I get quick rundown of exactly what batch size and batch count it? I've only even used I think 'count"
is this a nft discord server
So basically batch size is how many generations you want to run right now
this "gen" stuff seems a bit nft-like
Batch count is how many sets you want to do in total
So batch count 2, batch size 3. You will generate three photos at a time, twice . One set at a time
nfts are non fungible. you can funge these gens all you want
sometimes, people funge entire models
You are limited on your batch size by total VRAM, and batch size by GPU temperate
what is a gen
ive no clue about this stuff i just joined discord and joined a random server lol
To create stuff you have to press the generate button
It's okay
This is not an nft server, but an AI art channel
It's nowhere, you're dreaming. Wake up!
you say "computer. make for me a silly goose with a hat on it's head!" and then the computer generates that to your specifications
its all very technical
okay
Computer, make me a silly goose with a hat on it's head!
lmao
well i meant like, metaphorically. the server doesn't actually have a bot
So what is the benefit of higher batch size exactly if it's more taxing on the vram? What are you getting with a size more than 1 in your 2x3 that you don't get with just a batch count of 6 for the same number of end result images? And the opposite question as well: When and why would it be beneficial to generate 6 images all at once/one time?
There's no bot. Stop yelling.
Do some reading and figure it out
Sorry, father. I will go read some books, and gain intelligence skill points.
Let's say you make an image you really like, you send it to img2mg, and put a high batch size because you want awesome small variations of that seed
Or you just have a really good prompt in txt2img and wanna let it run while you get some water
