#💬|general-chat
1 messages · Page 21 of 1
Sorry can't help u there 😦 I hate using lora for this reason
Their training is way different than what I'm used to, personally I avoid it if possible
Good day,
Quite new to SD, so hopefully my wording is correct here!
I have 2 questions about SD (running SD on automatic1111, if it matters)
I have played around with creating my own model with images of myself, trained in dreambooth. The results came out better than i expected for a first run!
I am just wondering, the model is 4gb, and typing myname in the street, it renders me ... in the street! so it obviously added my trained data into a specific model, as it understood street, etc.
I then went and created a merged model of my trained model and protogen, but AFAIK, it doesnt just dump my trained images in there, it blends them, and means some protogen is now missing, and some of my is now missing too, making the results look less like me in the merged one.
How would one go about, training directly into a specific model (like protogen, or f2222) from the word get go?
When downloading a textual inversion pack from civitai, lets say i DL one of Sarah Michelle Gellar, and i have a prompt for "girl in the apocalypse", if i add the textual inversion phrase, will it copy the style, aka colors, etc, or will it make the model look like the textual inversion?
Just thought that it was good
Yah dont use it honestly
No, just do the classic 1300 steps .000001 training on fp16 and u should be gucci
Oh ok, and would you suggest that I do embedding or dreambooth? Eventually he wants me to generate images with him and his two kids so I would need to train three faces
u can train the protogen model directly
and for 2. it will copy the style
kind of like a trained model but more generalized from what I've seen, I dont use embedding cuz theyre not super good for 1.5
Ah protogen but not stolen dream. You think that I can't train with that?
dreambooth, embeddings are for styles
ok awesome! thats what i was hoping. when using dreambooth on google collab, is there a way to choose which model you want to train? or is that more advance and need to be done locally?
I think u can either train 3 models at one time or one at a time and merge them, this guy on YT called Aitrepreneur has a tutorial on training multiple subjects
thx for the replies!
Idk if it is still relevant tbh, worth checking out
Oh awesome. I'll look at it
depends on which repo ur using, I would just look and see where it loads the model from and just replace the model
thx man!
Ok, I was thinking with merging. I did not know one could train more than one model at once
one last question, i am running a 1070, its a bit slow, but works! i have seen some results people are getting are INSANE good though, mine looks, meh
is this all down to prompt magic, or does a better GPU come up with better images?
wellllcome. To the world of tomorrrrrowwww
its all prompts and models
GPU only makes it faster, and removes memory limitations
thanks, i better go study some prompts! haha
hey guys, anyone had a chance to try the new Ultimate SD scaler in img2img?
Guys i need any kind of info about embedding training that u might know, i've searched most of what i could search on reddit, if there's some more info or guide about proper embedding training, you would be great help to share it with me.
does automatic1111 update via launcher ?
u can add git pull to its command prompt in user-bat
just "git pull" in the command line ?
--git pull
Hi what could be the reason for a extremely dotted/low res result using StabDif 2.0 PLMS 50 Steps 512x512 it looks like a very bad print of max 50x50dpi
it could be prompt, some prompts don't work good with some samplers and go reee from my experience.
SD 2 works best with 768 I think
i tried different prompts also at 768 only plms had this bad result
never got a working result with plms
PLMS generally doesn't like anything over 50 but at 50 it shouldn't be giving such bad results
Is there a way to get stable diffusion to generate lots of images for me over time into a folder? So for example I could generate 800 images overnight
depends on what software you use to generate pictures
Using automatic1111 and anything v3 model
change the batch count number
right click generate, select generate forever in dropdown menu
Hey @radiant linden how can we reset all settings for webui ? Which file to delete?
config.json
I swallowed like 6 kids
Ah thx!
is that a euphemism for something
doesn anyone know some places to find images with the exact prompt and settings used to create it? something like lexica.art
Publicprompts.art
awesome ill have a look
that one is a pretty small amount of prompts from a single person
In automatic1111, what the difference between 10 batches of 1 image and 1 batch of 10 images?
latter is overall somewhat faster but requires a lot more VRAM
Oh, so it does it in parallel. You know, I think I knew that. 😆
if you have live previews on you'll see it made in parallel too
Also look at openart.ai
Yeah, I actually tried that weeks ago and noticed that. But I didn't really know if it was parallel or a little of this one, then a little of that one. Recently I've just been doing a number of batches of 1 image, and then I forgot about trying it the other way weeks ago
you just need to switch rthe filter away from their model and to stable diffusion
alright, its indeed the best one till now. In this discord there is not a section where people also share the prompt right? i could only find channels where they share the end result
sometimes you can find the prompt in the image
It's kind of fun to leave it unattended while using wildcards and then come back to see what you got.
ah as in the name of the image it is saved as?
Load the image in the PNG info tab
no, the image can have entire generation parameters in PNG fields, if the creatore did not disable that
you can drag the image into PNG info section to check
what is the PNG info section?
Its in the exif data of an image. Most Gallery programs can show it
That's one of my favorite features. Drag in an old image and load everything with one click to txt2img it's a great feature
you can also drag the image into the txt2img prompt
Yes but that just stuffs all the info in the prompt field
press rthe blue button
Hmmm.
ah alright, so when i drag the image into SD i get the prompt
I've never noticed this button
very useful
I don't have my PC on now, but I'll take a look tomorrow
you usually copy someone's prompt shared as a text
paste it into the prompt field
but then i have to use the google colab version right? I use playgroundai to create my images
and press the button to put everything where it belongs
Ah, that's super cool. I was wondering if that was possible since some sites post the PNG info in that same format.
can use sites like https://jimpl.com/
they'll show you parameters as text too
So when cfg scale is in the prompt it changes the cfg slider and delete the tag from the prompt ?
in a nutshell, yes
it has to be in exact certain format, not just somehwere in prompt
Very cool feature, gonna try that later
great, ill try it out thanks!
I think still PNG Info ist the easiest way, also to watch the efix data on Windows i use imageglass+efixtool and on Android Aves Gallery
in number of clicks, easiest tis to drag into prompt - 1 drag + 1 click
png info is 1 click, one drag, 1 click
Yea okay then it depends if i want to generate fast or just look at the prompt
I still save almost all of my stuff as jpgs until I find one I like and want to start working on it for print quality. pngs take up a ton of space after a while.
i use the save txt file for paramaters option excessively
jpeg can also have params embedded
ah, cool. Thanks for the heads up!
I want to make an extension some day. I want make a super cool editor for the prompt boxes. Like, you highlight some text and can increment or decrement the weight of that text using buttons.
I also want to break the box into several sections with pre-canned text like the Styles drop down. And then it would concatenate them all together when you hit Generate.
So, if your making, say, a person, have one box for subject descriptors, another for pose, another for location, activity. Whatever. And then you can mix and match things really easy. I'm always going on tangents and trying new things, and the saved Styles are great for holding successful prompts to reference, but having more granularity would be great. I've been putting bite-sized things in the saved Styles with "+" or "-" at the start of the name to indicate whether it's a positive or negative prompt, and then I use the paste button to build up a prompt from the pieces, but it's really not ideal. I want categories and sorting, and such. I sort the CSV file by hand. 😆
I wonder if anything like that already exists
Yea it exists, mark a word, then hold Ctrl and press Up or Down
Strg?
ctrl
Haha yea xD Ctrl sry
I'm learning all kinds of things tonight!
But the second part would be the huge feature. Having saved pieces of text that are easy to access.
I think there is one extension that give little World bubbles suggestions to click
It might be as simple as a category drop down that populates a list box and you multi-select items and hit the paste button to add them to your prompt.
Word bubbles might be good.
Yeah, exactly
You could even publish JSON files that have good, curated prompts for popular models.
And mix them in with your own. Then you could load one of these files to populate the UI which then makes it easy to pick individual poses, camera angles, etc., with just some clicks.
It could be kind of like wild cards, except you would pick and choose from each category instead of it being random.
Yea that sounds great! Also with the modular part of insert custom json
But how to display so much tags?
Dropdown, bubbles, search Function?
I'm sure there's some reasonable way to do it. I was thinking of a master category drop down, and when you choose a category it populates a list box with choices, and you can multi-select items from that list box and add them to your prompt with one click.
It could even integrate with the wild cards extension to paste in your wild card file names.
Imagine an alias function so when you type close to .. It displays close up or macro
Sort of like an auto-complete?
Or are you thinking more like a search?
Closeness can also be things like "headshot" vs "portrait" vs "full body". There really are so many possibilities.
Or lens type like wide angle vs telephoto
Yea thats what i mean. If the user dont know the right words it give him the similar ones
That's why I was thinking categories. They pick a category and see all kinds of things they've never thought of. Hell that happens to me every time I open somebody's wildcard file. It really could be strongly linked to that feature. Just put all these categories in wildcard files, and show the contents when they choose one.
Guys why do you sometimes get black image especially when using img to img ??
Yea categories make the most sense. Maybe with autocorrect and when it detects a word you can press shift and it get added to the prompt
Integrating it with typing in the prompt box would be pretty clever. If you enter a trigger word, it highlights it and you can shift-click it to open a something to make selections from.
What we need is a text AI that understands how to write stable diffusion prompts based on what users need. You type in "close up" and it suggests all the various ways to make a close up.
But based on it's own understanding of each model and not from user trial and error. "My AI needs AI."
I really need to go to bed. Stayed up all night playing with SD, and now I'm staying up even later talking about it. 😅
Its 2:40pm here idk what you mean 😛 but good night!
We are looking for highly intelligent individuals. DM me for more information.
Hi i am looking for documentation about Stable Diffusion 2.0 i dont know if i am blind but i didn find any thing about prompt parameters , file structure, where to put the model file and like 100 another totaly normal questions about setting up and runnung this software 🙂
@waxen igloo i would suggest to watch this easy Tutorial to get startet:
https://m.youtube.com/watch?v=VXEyhM3Djqg
i mean some original info direct from SD developers
i can/like to read
is there a manual or something
There are different ways to use stable Diffusion, it depends on you graphics card mostly
There is no official Programm to use SD but multiple open source programns
this would be a important chapter in that manual for sure
Someone works on a guide to read:
https://docs.google.com/document/d/1aHJ9RBt_vlCwJQBVUUsb7VghKB-wynv7WGTxm9ozL1k/mobilebasic
Thanks i will try that later!
but is there a way to extract known tags from a particular model?
Since there is no official Software or guide i dont recommend written tutorials because most of them are outdated. The Video link is the newest one.
from where got the maker of the video his info?
He uses the Automatic1111 Webui. The info to get it running is here:
https://github.com/AUTOMATIC1111/stable-diffusion-webui
No. Iy's dome for booru because tags are known
for non-booru models, you could try to achieve this by going through the training dataset
but whether you'll succeed is unknown
this looks like a tutorial about a gui not a tutorial about SD, its nice but very limited
there is SD2.0 repo
it has instructions for how to run it, where to put models
thank you! i could not find the syntax and all about the prompt parameters
this is not part of SD but a detail of a whatever software you are using to run SD
look for explanation in the documentation for your software
or do you mean txt2img.py script bundled with SD2 repo?
this fine has explanations for parameters
yes thank you
oh just the source code but ok parameters question solved 🙂
so this is how it works looking through code by myself?
If youre interest in the "how ai image Generation works" you should look at blogs from stability.ai or Laion.ai
hi..i try to install ultimate sd scaler but cant find it under script
@warm junco @radiant linden ok thank you!
hi
hello! 🙂
Hi, I've been trying to solve my inpainting causing desaturation problem, but placing the vae doesn't fix it. Help please! https://www.reddit.com/r/StableDiffusion/comments/zk9zpy/img2img_resulting_in_low_saturation_faded_output/
My files:
Anything-V3.0-pruned-fp16.ckpt
Anything-V3.0.vae.pt
Fixed it. file name must match exactly
How can i add dream bot to my own server
how do i know if preprocessing images does something? it just says its loading
i dont think its doing anything
@radiant linden hiya : you may want to look into this issue with the webui: https://github.com/AUTOMATIC1111/stable-diffusion-webui/issues/6676
commit from 3 days ago
Why does this guy keep trying to list the AI model as the copyright owner of the output? https://www.reuters.com/legal/litigation/computer-scientist-says-ai-artist-deserves-its-own-copyrights-2023-01-11/ As advanced as current AI models are, they are nowhere near sentient and just advanced tools for humans to use
I believe this part of the article is going to be a full and complete answer to your question
his
wait, AUTOMATIC is back now?
Why do people on this server hate midjourney?
anyone willing to help a fellow artist with some setup of custom diffusion model from adobe for testing?
Any1 got a model that does graffitti art?
Hey guys, I am working on an open-source photoshop plugin for Automatic1111 (locally or with Google Colab) with the integration of Lexica.art prompts. Does anyone want to beta-test it? Any feedback is appreciated
https://github.com/isekaidev/stable.art
Hello guys, i have a question, can i install and run InvokeAI with a Nvidia 3050 RTX, 4GB (Vram), AMD Ryzen 7 5800HS and 16GB?
Hey my drive is running out of space. Can I just move the Sdiff folder to my other drive
I run SD from a 2nd hard drive and it saves all images to a 3rd, and it's working for me!
it's not open source
you hate anything that's not open source?
a lot of people do, and at this point, i kinda do too -- not anything but a lot of things
why does it have to be open source?
because freedom
the problem is the models
I think the main reason is to help the tech to advance but since it's not open source it doesn't help at all.
ok but aren't the midjourney devs paying a lot of money to train the models? why give it away for free?
I understand appreciating an app being open source
ok but aren't the stability ai devs paying a lot of money to train the models? why give it away for free?
but hating something because it isn't I don't understand
because they want to?
what i'm saying is why hate MJ for not being opensource for that reason
That's true, you could try using the --ckpt-dir <URI> argument to your launch script and store only those on a separate disk?
For anyone that has used AUTOMATIC1111 and img to img, does anyone know what the denoising value translates to in api terms, is it the scheduler and if so which one
Hmm I\lll give it atry
I'm running Automatic1111 on windows, but this page was a big help for me setting up my webui-users files: https://github.com/AUTOMATIC1111/stable-diffusion-webui/wiki/Command-Line-Arguments-and-Settings
in MidJourney it's possible to input several images for a single generation and it seems to work a bit different than the the img2img in SD, is it possible to replicate the "several input images" from MidJourney in SD?
Anyone know how to keep the same faces throughout the video in duefourm
There was a paper published a while back that seemed to be really similar. I am trying to find it, but none of my searches are turning up anything. Basically in the paper (with supplimentary github code), they trained an image to text model using the model's outputs. It worked well enough that one could insert images into the text prompt like MJ does
Filtered, not like just NSFW, but anything that's legitimately unChristian (I couldn't do horror or anything kinda flirty since stuff like parasitic and sensual was blocked, and they blocked big black if you have it in a sentence, mods have actually fight with users over taste), no privacy (even when you're paying an extra 50 bucks for privacy, it actually still forces you to use their Discord filtered system), a very specific style for images (can't do realism, can't do other specific styles well), closed source (if they banned the color red tomorrow, you would have to suck it up since they're calling the shots, instead of saying that's a bad idea and making them not do that.)
does anyone know if :: or : : can be used in automatic1111's version of stable diffusion to separate prompts with weights?
so basic decency?
Honestly, it's beyond the regular Discord level
I like stuff like horror or other stuff, it just won't let me make it.
Not basic decency, I always maintain that, it's just...distasteful.
I don't have to block someone from eating cake if I don't like it, what goes around comes around.
It's using your opinion and hiding it as "it's for your own good" that ruins it.
well im seeing "big black" "sensual" "unchristian" in your post so... yeah they just dont want grubby edgy stuff crapping up their servers most likely
No, they legitimately said it's for Christian values
I am not making it up
There was a screenshot that popped up
Anyone know how to keep the same faces throughout the video in duefourm @narrow sluice
Daaamn these protogen models just keep getting cranked out dont they
What if you want a big black rock
I fuckin luuuuv how fast people are advancing this technology
Again, lazy programming
There's legitimate reasons to use filters, SD probably does it better
With blocking the outputs instead of stuff
@narrow sluice any help
Did I mention how frustrating it is when I get filtered out puts and wasting my money?
then dont put money into something that doesnt match your purile values?
Yeah I never paid a dime for MJ and their gay ass censorship
I'm not an expert in this stuff, you're better off asking someone else. 🙂
I'm no help.
O ok Would you know who can help that’s active @narrow sluice
Honestly MJ is to computers the same way Macs are
thats true
Yeah they work yeah some people like them but serious users never even consider it as a real platform.
Don't need to be a politician to say that if you don't like something, just avoid it.
That's after spending 15 bucks since I only used MJ at the time since any competitor didn't exist.
It was that or DALL-E 2
https://discord.com/channels/1002292111942635562/1025266140445933648 this is the animations channel. they likely would 🙂
It was for a project.
These days, I have much better options
Also, MJ got boring after awhile.
It just sucked with other genres.
Essentially, it became less valuable since I have much more better choices in terms of AI generation and not worry about hitting the pretty bad word filter.
Like we're on the SD server, there's so much out there to try.
If you like the MJ style, whatever floats your boat
And SD is free, which I can't say with MJ
yes. but its lol to cry censorship
this is the point
SD is already better than MJ
MJ is just more "end user friendly" and faster when it comes to total time required for an artwork.
SD requires more time and experience, but once you have it, it makes truly wonders
Tbf, SD has Dreambot, which does pretty close to MJ these days.
A few more extra options, but can definitely give MJ a run for its money
why? you can complain about censorship and then stop paying for the product. MJ might have a greater dataset that can generate out of a wide range of styles...but it gets annoying when you have no idea if the word you typed in is banned. Then having to pay for privacy
MJ`s biggest issue is how expensive it is
for features I would take for granted
That is the opposite, My assumptions are MJ has a smaller, cleaner dataset
they've talked about the millions upon millions of images theyve parsed
That doesn't negative cleaner
in that voice chat that you can listen to their office hours
You can have a cleaner but still large dataset
I have a mostly well-tagged dataset I'm building, and I've hit more than a hundred million images already
If you know the right places to source images from, it's not too terribly hard
yeh
100 million? Damnnnn
and im thinking the 26K dataset i have is a lot
my biggest gripe with MJ is still that I have to actually PAY for privacy
Is your dataset public or private?
something that imo should be free, since it doesn't cost them a thing
Capitalism
^
It will be public when it's complete (as close to 1 billion as possible!)
its something people want but dont need so $
sure capitalism. Also free with dream, novel ai etc
Had NovelAI ever fully recovered from the leak/hack?
They had the easiest to use anime model back in early October
wasnt sure if this was something you could talk about actually
but im interested too
the dataset or the aftermath?
the whole drama
it's just that... haven't heard anything about NAI imagegen in a while
it's clearly OK™️ now since AUTOMATIC is back here
oh didnt even notice
i never know whos chatting or on servers tbh. just like the pretty pictures and occasionally lolling at people getting angry at AI generated stuff
They're doing fine. They're working on beta furry stuff
The only things that has springed up is people using, cough their datasets/models illegally
But well, cat out of the bag
Are machine learning models actually copyrightable?
It is implementation theft, from what I heard from a programmer friend.
Hard to prove though.
They had the option to send a DMCA to certain hosters, but never did. This is beginning to make me believe that model weights aren't actually copyrightable anyway (at least in the US)
They can DMCA but mirrors exist
Plus 4chan
"Implementation theft" isn't a legal concept I've heard of. Anyway, I have a feeling their lawyers advised them not to or something.
Good luck tracking everyone
They claimed to have sought legal counsel
Well, they did
But again, too late
Once it's on the internet, you beeped
Even the best folks in the world can't stop the spread
It's going to be going after shadow puppets
Anyway, once I train my new large open models
all of proprietary models will be 100% obsolete
Yeah, I mean, I would love for NAI to get new toys
They are looking out there for open source commercial stuff
Not just for images, but text as well
There are new toys (BLOOM), but text is really hard
nothing compares to GPT-3.5/ChatGPT
Ironically, they won't touch most open source stuff if it's bad or non-commercial
well, nobody touches bad stuff
and non-commercial violates the definition of free software and open source
And they wouldn't have gotten big if OpenAI didn't eff up, so they're not ever going to use that.
Yeah, no kidding
Best for everyone 🙂
20B is good, but I am curious what an uncensored ChatGPT would be like. For science ofc.
like twitter
like 4chan
Well, OpenAI admitted to scraping the entirety of Twitter when training ChatGPT.
Even when it's...against ToS
Wouldn't that fall under serious privacy concerns?
no
No, because it's public data
Tweets are public. Courts have decided that public data is obviously public and can be used publicly.
What other stuff is technically public data?
twitter has to be the best source for training language though, questions answers, small to the point, countless variations on replies etc.
Out of curiosity.
I think it's fun to bring up home address databases. "Doxxing", while controversial, isn't actually illegal unless you intend to cause harm (and intent is hard to prove).
LAION consists of public data.
Once you mentioned public data, it's now very interesting frontiers.
but for factual stuff, gpt supposedly weighs sources credibilities, like if they've published papers on something or if theyre an anime avatar oti 😄
You can look at Common Crawl to see what else is considered public data.
How much storage do they take
Thats gotta be petabytes and petabytes I call bs
Whoever first disclosed the secrets would be in trouble, but I think subsequent sharers would be fine.
metadata alone is 120+ GB
However, that's all the dataset is: metadata
Oh
Actually, is there any weird cases with copyright in a tweet, in the cases of people sharing say, a book that has gone to public domain in one country, but not in another?
I don't think that'd work. Twitter would follow US copyright law, and everyone else would be dead out of luck.
Yeah, there's some difference in UK/US law, and the former is earlier, if I remember correctly.
Same with EU, I believe
Artists when another artists learns from their image to make/influence their own:
"Yes! Im so pRoUd"
Artist when AI does the same that the human artists did:
"nO!1!!1! tHeiF, sTeaAliNg WaAaAaA"
Its just coping
Artist when you spend 6 hours on one hand and it still sucks. 
most of them are genuienly scared that all the effort they placed in for years is in risk because technology advances
Again, when collecting images for my datasets, I ask myself:
Did this artist pay/get permission for every image they have seen with their eyes which may have conciously/subconciously affected them?
The answer is no.
Thus im relaxed, ethics wise.
Gives me huge food fusion, cultural appropriation vibes
This topic is something that cannot be met in the midfield logically, you either allow it, or copyright everything, delete memories of every artists, and tell all of them to start from scratch (a.k.a caveman paintings)
It can be met in the midfield in law tho
Does the Irish have to pay Mexico because they used potatoes?
Does the rest of the world have to give credit to places that gave them non-native spices?
Does anime/manga artists need to pay Disney cause the guy used Bambi's eyes for inspiration?
don't even get me started on Italian food and tomatoes!
Does the British Museum have to give back everything they stole, no cap?
also would you like to try my model?
Probably not now, I have to move my hard drive to a bigger one
I'll let you know if it's done, but yeah
Does keeping models in HDD instead of SSD have a big impact on gen speed?
Not sure, I have been using SSDs mostly
cause my models file is 81 GB lol
OH LOL
Definitely not a small download
So yeah, later on
I was on a 1050 Ti, so I didn't do much generating
Locally
ic
https://aqualxx.github.io/stable-ui/ you can try my model here
Hi everyone, I'm fairly new to stable Diffusion, so I want to ask some things - what are generally good negative-promt words to use in order to create good anime-character image ? What are good keywords for this ?
In 1.4 model
( And I know that I should download community model for better res, buuuut I want to get around first and try to get smth from basic model )
honestly I just start by adding Anime:1 to the end, and vary the number up and down to get a feel for how much it affects a given model
on Automatic1111 I would use the X/Y plot script to get side by side comparisons and use S/R to increment from like "Anime:.25" to "Anime:2" by increments of .25
If it's looking too shaded for me I'll often then start adding "cartoon" or "cel-shaded"
to flatten it out
Aside from my standard ones, I rarely use them in 1.4, i might put in like "3D render" if it's always looking overly CG, i usually aim to keep my prompts simple and use weights to make changes
I am running some tests now to see with the standard 1.5 checkpoint, I haven't used that model in so long!
how to generate textures in dream studio website for dream texture to blender?
What’s the general consensus on Waifu diffusion 1.4 vs anything v3? Because for me, anything seems to completely clobber it
WD1.4 ist not fully trained (1 epoch), but it can generate good images if you know how. I think it will be better than AnythingV3 when its fully trained
Tags work different for both models so you cant compare that easily
Well, How to train AI by yourself?
With Dreambooth and enough Vram
Well, with my 6GB VRAM this is not an option xd)
( I have Geforce 940-mx in my notebook , that is my only PC )(
Even with my 8gb its not an Option xD but there is a new thing called Lora training that only require like 7,6gb
Still not an option )
Yea but i want to try it when there is a good Tutorial out. Hope Aitrepreneur will make one. Can recommend his Videos.
From his videos I have heared about community modules )
Ahh nice
Yea he also compares them very detailed with lot of effort (like NovelAi vs WaifuDiffusion vs AnythingV3)
BTW, is protogen actually that good ? And does it need training ?
The models you Download are already trained and wont learn new stuff.
Protogen is pretty good. Not for anime but for photorealistic and fantasy stuff
I kind of like the modern-anime look I get with jamix
anyone know of analog diffusion colab out there?
I feel like I get some differences with "Kawaii" and the -chan suffix
ran a couple test prompts and I don't feel like neko is doing anything.
Ahhh ok I was using the same prompts and comparing and it wasn’t even close but that explains it
What's a good model for doing stuff with pokémon?
Which model? I wanna catch em all!
heheh
Well like anime styled. Basically want to like like Raiachu wearing a tuxedo
um.. how do i download that..?
💵Cha-Ching! Up to $200.00 CASH for you!💰️Click and accept my invitation on Temu! https://temu.com/s/nN34q42ydgLeOpX
Warning: a scam and/or hacked account has been detected by Discord Security Suite 2023
click the ckpt file then "download"
under the files and versions tab
for training can you split the job across multiple GPUs?
how do you report posts?
like that spam above
i cant believe an ai discord doesnt have a better auto mod haha
actually i never see mods here but idk
It has blocked me before 🤷
I guess I am pretty spammy - lol
Is there a deviantart-equivalent for AI art, poetry, etc.?
my favorite negative is "jpeg artifacts" as it completely destroys any effort of SD to create authentic compression artifacts. But i tend to add it to a seed after i see it containing them
thats a goo idea
i got a couple negative embeds i've been playing with, but i don't add them to prompts right away. i tend to not use negatives unless needed
I always add "watermark, signature, logo" to the negatives, since the training set seems to contain a lot of stock photos tahat have those "features"
A fun one i needed recently and it took me abit to figure it out, i was trying to generate doom guy with better ultra doom armor and he kept coming out hooded. Well, what could that be confused for? Doctor Doom has a hood always. So i negative'd "Doctor" and finally got what I was after
Yeah watermark, signature, logo, name are quickly becoming default embeds actually. i'm always just throwing those in after 3 generations
Do embeddings change the image even if you don't use the keyword for the embedding?
Just by having embeddings in the folder, will it change the output of all generations?
sometimes i think a lot of the voodoo like "too many fingers" doesn't work because it's not like the model understands such things. It's hard to convey what i mean. The clip model could probably understand too many fingers means polydactilism through relations, but that's still such an abstract connection. But then the trained data might have any images to know what that would look like
no. They're just small files loaded and ready to go
Ah, so they're only activated and used in the prompt if the keyword is detected in the prompt?
yup. the generation details in a1111 webui tell you which ones were active
Very cool
use more than one even
That's much better than having 1000 models.
Can embeddings be used to add new objects?
embeddings are infinitely better for playing with, but sometimes a refined model is needed for a goal
embeddings sort of pull what the model knows out in a very targeted way
can someone clear my doubts please? can i generate an image based on some public image link? like in midjourney
any idea how many people run Stable Diffusion locally? (ballpark obviously)
@latent flower is there a way to have my own picture throughout a defourm video generation
This guy said he did this on stable dis and defourm using a trained molde of him self any one know how to do this
https://www.instagram.com/reel/Cml9degou6E/?igshid=YmMyMTA2M2Y=
@latent flower
well theres 28k stars on automatics' repo, and as a rule of thumb only about 1 in 10 people thumb up/like/comment/interact on stuff online they use/download/view. ive no idea, havent thought about it before but that's how i'd go about eyeballing for a number
Good day fellow prompters
hello fellow skellington profile pic friend
Perhaps this gives us a reasonable lower limit of 10,000. I hope the actual number is a lot higher. Maybe the number can be discovered from huggingface. What i'd like to know is does the world have enough people that are seriously interested in running generative AI locally to keep it going. (I'm also guessing there are people that would run it but lack a big enough GPU )
10k? nah. easily 100k
also theres going to be an exponential uptick in AI interest this year, this is still at the early adoption stage
I mean, I'm running 3 local instances all by myself!
I'm no dummy, but I just don't get Dreambooth. Anyone have a really detailed tutorial on what class to train against, and what regularization images to use, and why you would chose that class and set of images? If training a face that happens to be a woman, why would I train "person"? Why not "woman"? Why not "beautiful woman"? Why not "face of a beautiful woman"? What's the theory behind choosing just "person"? No tutorial I've found explains any of this. It just feels like people copying and pasting instructions from someone else who did the same from someone else, and no one really knows what they're doing or why they're doing it. In programming we call this "cargo cult programming". If it works, it works, but it's better to understand exactly why something works.
I also don't understand at all when or why you would use filewords in Dreambooth and what effect they have on training. When training an embedding, you use BLIP to caption your images. I imagine it's better that the training mechanism understand that if your subject is wearing "a blue sweater", than it learns that that's not intrinsic to the image you're trying to train it for. It instead already knows what that is, and can focus on the things that make the image unique. I imagine this helps your subject editability. And you can also use filewords which just a list of prompts like "a cool photo of [subject]". How does that help training at all?
In Dreambooth, there doesn't seem to be a preprocessing mechanism. You don't generate text prompts for each image, but you do have a chance to specify filewords in the webui. Why would I do that? What's the value? Also, why are images not preprocessed like in an embedding? And again, what subject should I choose and why?
Any help on truly understanding this would be greatly appreciated! Just pointing me to a really solid tutorial with reasoning behind the choices would be sufficient.
choose any class you want and experiment with it. I made one of myself on "Dude" and whenever i prompted myself i got dude versions
the goal is to contexalize the concept you're training
Like, you're in a coyboy hat and such?
So if you chose a class of "cat" and trained a person, the result would come out as cats when you used your subject in a prompt?
maybe? try it
And "person" keeps it more generic?
I'd rather read the theory behind it than just experiment.
i typically do the gendered generic term, man, woman, but i wonder if it matters much
thats the theory. you're contextualizing the concept into the model based on what it knows
I haven't dug in, but some on Reddit pointed to this video tutorial as a great explainer for Dreambooth https://www.youtube.com/watch?v=mfaqqL5yOO4
Woah, an hour long. Excellent!
the class prompt is what generates the regularization images, iirc if you use ur own reggies then theres no point to that prompt
so yes, in theory you would end up with cat human hybrids but more realistically just really deformed cat-human abominations.
Alright, I see that in the tooltip now, but I guess that still leaves me asking why I would choose "person" vs "man" vs "portrait of a man" (which is what the video linked up above is using.)
He actually used "face photo of a man, 8k, hd, smooth, sharp focus, Cinematography"
So it only generated good images of a man
Correct, and that's what I usually do now. For example, my reggies are "photo of a person in high definition"
Meanwhile, I downloaded a whole folder of "person" regularization images, and some aren't even persons
I have experimented a LOT with reggies. I've used female models, women in bikinis, etc.
That's what you want. Because if the subjects look too much like your subject, you will overtrain your subject
Think of the training as the machine getting to know ur subject. If u have 20 images and 1000 steps, ur bot will study each image 50 times. If all it does is study the same image 50 times, it will learn it "verbatim"
If u throw in some reggies, it will "blend" the images together to a certain degree, learning patterns that you want it to follow. The end result is a more variable and flexible model that works better in different positions and poses, scenarios, etc
I did have the experience once of a model literally outputting one of the input images almost pixel for pixel
that's overtraining
Indeed
The reggies help battle that, in fact if u have a ton of images you will have a more flexible model bc it got to use such a wide variety of reggies
thats why some of my best models are almost 500 sample images long, I have some cool high-res reggiest that I have yet to test in 512, I trained a 1024 with...some...success with these reggies
So why wouldn't you always just use "person"? Why use "female model" or "women in bikini" or whatever?
Because you want the reggies to be very broad and generalized. Thats why the person reggies u saw are so deformed. I trained several models with women in bikinies and supermodels and the result ended up looking like ass
Unless you specifically want to train a woman that's already in a bikini, or a model, etc. otherwise the machine will have a very hard time shoehorning ur person into those bikini models and supermodels
Oh, when you said, "I've used female models, women in bikinis, etc." I assumed you meant you tried it and had success
No they suck although I haven't trained specifically bikini women, I think however, I did train a dude using the buff dudes folder
Let me check actually lol I'm curious how that model looks
those sound the same to me. lol
Nah I just checked and the buff dude ended up also looking like ass with those reggies. Because those reggies focus a lot on nude bodies, they will generally work better for training bodies, not faces. I recommend creating ur own reggies with something that resembles what you want ur model to look like, and set the CFG really low, to like 4
That way u have a very wide variety of images to mix in with ur model, all while keeping faces as the main subject
anyone else have strange dreams when they generate images for like 2 hours before bed?
If I could record my dreams after playing with SD, I wouldn't need SD
good to see you
Hello. Quick question, anyone have a link for pre-made classifier images (512 or above) so I don't have to generate my own?
What are some of the latest notebooks you people use for img2img?
How do people make thos pictures on tiktok like: Economy as a human. Or, countries as a human.
this server is nice
Guys what is the difference with pruned or unpruned
Pruned model which is modified to make it faster, good for inference
Other is slower but good for training/fine-tuning
what is fine/tuning
I generally use for inpainting so i should use pruned version
so it uses less ram and vram
Yes, use pruned
I've got a problem where the photos I save are being put into the RAM instead of the hard drive, pls help
Good morning, everyone!! How are we all this fine day?
A text to video version of SD is supposed to be released this year?
Hi to all (don't know is this is the correct chat)
I'm new and trying to use it to generate art for TTRPG(DND) games. Is there a model or enbedings focus on this topic?
This is because I haven been able to create non-human base portraits.
yep. check out this guy's work. I can't remember what model these embeddings are trained off of, but he did beholders, dragonborn, and a bunch of other stuff. https://huggingface.co/VTTRPGResources
I think he even has his own discord server
Actually, I think they might be all trained models, not embeddings. Either way. good place to start.
yep, train a model on yourself, install the video extension... Play with the parameters... Thats it
hello! Its been nice... I hope to gen lots of cool images!
Hello !
I make op-art / abstract / geometric / paterns art
Good to hear! What kind of stuff are you prompting today?
Hey, I've made too many images and searching between them is a hassle now. How are you guys doing this?
I was think about space pirate ships! Those are cool...
well I just use an extension to view the images and save the best I gen in my pics folder
I want to search and filter by prompts keywords. Any way I can do that?
You mean locally, or on Discord?
locally, on the images I've made
are you using the WebUI? The images are saved using your prompt as file name...
I use ACDSee. Unfortunately their Digital Asset Management is only with their paid versions, But Adobe Bridge is free, and is close in features for just digital asset management.
Might want to give them a shot.
Once you start using software to manage your inages you need to be careful to only move/delete/copy images in that software of the database can get trashed. So once you start using one, you sort of get locked in, but it's invaluable when you start having to manage tens or hundreds of thousands of images
If I want to search two tags or more together. It's not possible with prompt as file name.
saving data in filenames works short term, but it's a horrible way to manage images.
the only thing I care about is to make sure that auto1111 generates completely unique names, so if folders get merged, nothing gets skipped/overwritten. I use [datetime] in filename generation settings.
Are there online versions of these as well. I don't want to redo everything if a hard-drive gets corrupted/ anything else happens. 🙂
I typically use the prompt itself to search. But I also just move the images I really like into a folder.
Then I organize my images.
And sort them by the things I'm working on, etc.
You cannot multi search on them. For example if you want to search tags "beach" and "gun" together for a previous prompt, u can't do that.
all of my files have the name written into the folder/image, so just searching for anything with those tags will come up
But I have all of my folders organized anyway (into people, projects, etc)
Let's say my prompt is "4k, digital art, concept art, anatomical accuracy, hyper-realism, chiaroscuro, ray-tracing, caustics, cinematic, full body, A middle aged mexican construction worker taking a break, [sitting|eating], [respectable|hard working], [friendly|good natured], awe inspiring, absurdres, masterpiece, sublime, character sheet, art by Michelangelo, greg rutkowski". This will also be the filename.
I want to search "awe inspiring" and "greg rutkowski" together. Simple windows search will not pick this file.
Adding some generation data like this: Steps: 30, Sampler: Euler a, CFG scale: 7, Seed: 872579092, Size: 1024x1280, Model hash: 16e33692. And then searching becomes impossible.
maybe theres a program for that, on linux theres the "find" command line program maybe theres something like that on windows
i think git bash has the coreutils programs
"find" is one of them
the one thing you can do is have auto generate text files for generation parameters, and use soething like notepad++ to search through those to find the corresponding images.
I've written a python script for this but I was wondering how non programming folks do this.
(Besides parameters) like I said, organize your files. Be proactive in knowing where your things are. Be aware of what you create. Make a people folder. Make a science folder. Put those images in that folder. Then, you know those images are there for when you want them, you know? Or, when you create them, create folders/use folders for them.
I made folders for my most used Models, then sorted the generated images based on my prompts in this folders with more folders xD
hey guys, an attack is happening rn in the bluewillow server. Bots are spamming @ everyone and using the imagine bot a lot. I think its an anti ai croud, so this server is prob also a target
damn, i wanted to try their service
yeah Im afraid they are gonna attack this server but I also dont wanna @ the mods
Is there any model that makes pixel art for games?
yep, wait a sec gonna send the pink
link
I have the all in one pixel v1 but I think the v2 is out there also
how do i change the image bit depth of the output image?
it seems to be 8 bits/channel
yeah, I just had to leave the server after all the ping flood
we will keep an eye out 
not sure if that was a raid or triggered by someone using the @ in their prompt and then spamming it
i counted 20 different users
the way they did it was to tell the bot to imagine something with @ everyone in it and the bot repeats the prompt back to you in the same way midjourney does it
so the bot essentially spam pinged the whole server hundreds of times for a couple of minutes
I, for one, am ready to move to the future where txt2video doesn't look like a really bad acid trip and looks like real video lmao
yeah, figured it might've been something like this
in that case, not sure why server staff haven't turned off the ability for @/everyone pings?
unfortunate
i think they just overlooked that it would be an issue like that
it's a smart way to ping a whole server but really annoying 😂
wow its fruit! Thx for answering and being alert! I like here and dont wanna see ppl raiding the server

you're welcome!
i was wondering if anyone else here saw that since it happened so quickly haha
ty for caring
👍
i need a little help with something..were do i ask for it?
depends on what kinda help! if you've reviewed #1014939219904450590 , and need more technical help #🤝|tech-support is a good place to start. We have a lot of different channels for different topics/help ^-^ so it depends !
thank you..
We will together
Somewhere I've seen a list of prompt styles for portraits, landscapes, etc... you know the url ? can't remember it
Hi, does anyone have a good colab for text to speech?
is there any work being done to improve long/complex prompt interpretation
Any news on generated video? Are there any models out right now capable of it?
Hi, is there a way to get the bot to generate 2:1 ratio images?
does anyone know what the --nohalf does ? is it for speed? i have a new install/computer and am wondering if its worth adding to my command line
dont use it if you dont get black images or other problems at generating. i think its for half precision generating, dont think it performs better
for better performance and low vram usage you should use --xformers. if you have 3-4gb vram you should also use --medvram
yes i have the xformers, i just remembered i had that nohalf too before but didnt know if it was really worth adding, i guess it isnt. ive got a 4090 so plenty of vram but i guess it doesnt do much in stable. hella fast in games thats for sure
haha its also the fastest gpu for generating i think
it is pretty quick, but someone said something isnt optimized yet for it. torch i think but idk haha
if you want to battle your vram you should use Batch Size and set it to max! xD
let 'er rip!
Hey, can someone explain to me what a model hash is? Can i change it?
the model hash is a unique id for every model, you cant change it, but you can merge models it will generate a new hash.
Hi, does anyone know if there are any legal issues with selling images generated by Stable diffusion or derivative products?
yes^^ grey area
i see some claims that if you do significant work eg img2img you can claim its yours,but also there's hard rulings like "you can't copyright AI generated images"
i get the imperssion we are still in uncharted territory
fair use seems like a valid defense of training on scrapes to me, but when you actually try to sell things.. fair use only goes so far
I'd argue the legal issues would be the same if you were to sell other types of artwork- as in, draw Mickey and Disney will sue your ass regardless of the methodology
US copyright law has notable exceptions for personal, educational, or satire/parody purposes
Thank you for your answer, I will take it into account!
Of course, if the concept already has a copyright, it doesn't matter if it was made through AI. But I'm referring to own ideas.
“This action from the USCO may serve as an early warning that anyone filing works that contain any portions generated by artificial intelligence must disclose such portions and be prepared to support their registration and prove a degree of human authorship.” https://ipwatchdog.com/2022/11/01/us-copyright-office-backtracks-registration-partially-ai-generated-work/id=152451/
Basically, it's legal gray area right now, but it's a safe bet that large corporations will still use the law to protect their property.
my iq test came back negative today.. grateful
Our best hope is that such a demand is shown infeasible, or at least that all works generated with large models be made public domain.
This whole ai controversy is making me scared to share my generated stuff
The biggest problem is all the people who think it's their chance to be holier than everyone else.
Yeah
Strange how ai got attacked month ago ( i think) by artist's cuz its "steals"
Why didn't they do anything in the past?
I bet its all their strategy to ban ai art cuz "new technology scary, it will replace us"
BTW, how to generate , like, 3 images per 1 generation ?
If you're just using stock SD and you're not going out of your way to violate trademarks, you're probably fine.
I think the real problem will be for fine tuning.
Im starting to believe that some artists dont care about the "stealing" stuff. Only to ban the ai
if you're using a local install or DS, adjust your batch size
i can only hope common sense will prevail. corporations could shut stable diffusion down with the arguments about scrapes. I have artists friends and dont want to use AI in a way that shits on them. I'm using 'img2img', personal use. i might be interested in trying to sell things in future
you are correct
It's a huge bowl of tangled emotional noodles
No matter what im still gonna use ai
i see people calling ALL sd users "petty criminals" for example, which i think is rather unfair
The underlying problem is that our society doesn't give a shit about artists or labor.
Im not against artists at all.
But some of them need to chill out and stop crying about ai
Most people don't realize that intellectual property isn't our friend. It doesn't work for us.
Its 2023.. isn't it already a great achievement we created such technology
We also get an inflated sense of self -- I CREATED THAT! Well, you didn't create it in a vacuum.
it's just an argument as opposed to the main issue- which is fear of the loss of work
https://pbs.twimg.com/media/FkCQHBHacAAcn73?format=png&name=small I made this a while ago.
It sums up my feelings on it.
"Twitter artists after completing a tutorial."
I saw some death threats by artist's
Yeah, that's fucked up.
All because someone used ai
We need to vociferously fight that shit.
Yes
At the same time, I think people should be mad about all the closed models.
Yes!
Correct, that's what seems most reasonable to me
Corporations shouldn't be allowed to suck up every available public byte as they please and keep the result private, especially when it's all built on research done at public institutions
opensource seems win win to me. value creation, posative sum. "ok we used your images to make this, in return we give the model back to you"
It's crazy how google also doesn't release their text to images cuz they are "too dangerous for the public"
It's bullshit.
If they are, why announce it?
It's about control.
All the huge corporations are maneuvering in this space, trying to figure out how they're going to try and dominate it.
well, IMAO I just will stay tuned for corp not to take out AI art from commuity
In that sense, I think the achievement SD represents cannot be overstated. It's a model the public has free access to, can tinker with, can extend. It's incredible.
Will It be salleble or not - this is another discussion
I lost faith in openai long time ago
Im glad SD is public
Im not glad that the ai war is going on
Batch Count 3, never use Batch Size with your vram
jobloss
I really hope they do well economically with it -- I hope they choke out the competition and force others to go open.
There's a BOOMING ecosystem going up around it, so it very well may happen
as long as a bunch of fucking politicians don't screw it up
this is obvious , fear of jobloss. they need to focus on 3D IMO
concept art is a role in gamedev.. that is transitory, between ideas and the final assets
yeah people are complaining it's not as 'good' as MJ but they don't seem to see how incredible it is to have models like these open source that can be endlessly tinkered with on consumer hardware
Anyone have any luck using stable diffusion support making simple shapes like logos?
batch size is fine if you have enough vram, idk what you're on about
For average users, I get it. But for power users, Midjourney is a joke.
Well, I would add that AI art is progressing
MidJourney is a joke
I have 6GB VRAM
right a model in the public doimain that can be endlessly finetuned
this is just awesome
Truly. There is no parallel.
It's better at some things, less good at others- I sometimes use it as a base to train embeds
yea ik, but i tought she had like 4gb, so i never recomend batch size for low vram
6 gb is still on the low end- you were right at that
I tend to forget that I'm spoiled with my 12
can confirm, 6gigs makes SD cry
There are some really impressive private models that can do incredibly detailed prompts, but I still think they're junk and I won't use 'em until they're public.
We can build new products with Stable Diffusion.
Fuck all those other toys
I just wish DS supported embeddings 
The tinkering and customisation is what makes SD so much fun- I honestly grew bored of DALL-E and MJ pretty quickly when they were made accessible to the public, yet I'm still on the embedding grind after like 2 months
And knowing some ass hat suit can't snatch it out of my hands because quarterly profits need to go up
but as soon as i get a new PSU i can start on finetunes
We are met with a tremendous opportunity by AI: it shows all the contradictions of our society.
People have to twist themselves up in pretzels to explain why one thing is acceptable and another isn't -- until they're forced to think hard about it and realize that the story isn't all that it seems
Intellectual property and corporate control are a total scam.
The mouse always wins.
Is it fine if i share my patreon here? Sci-Fi Diff already cost me a lot and I dont think 2.0 will cost me any less 😅
Or would that be too much to ask?
No but you see, this time they're teaming up with Disney and the Copyright alliance for the artists' benefit! /s
see you coulda went for The Mouse always wins. and got a good pun in there.
I totally empathize with artists who feel something terrible about all of this, because they should. Society isn't built for us, it's built to coerce labor out of us
But we have to recognize the problem. Taking more tools out of our own hands is the wrong move.
I dont think you guys would mind, so if anyone wants to contribute to the development of v2.0 and future projects, please consider supporting me:
i feel there should be a shameless self promotion community hub somewhere, to post like latest vids, things like this. i dont mind personally no but i like when things have a place on a server
Where do I see my prompts?
I absolutely agree, feel bad posting it in here
nah you're actively contributing and doing something that takes time and is used by others
youre not shilling elsa from frozen with an ol meat an two veg or whatever 😄
😄
heres a thing though, you wouldnt want this server and by extension stable diffusion to give the nod to all fundraiser/shilling because by not deleting someones stuff theres almost a tacit endorsement
And also to not turn into one of those sad messes of spammy Discords where everyone is promoting stuff, and nobody is giving back.
Cough, whenever artists share art Twitter links on Discord.
Spam makes it so everyone's stuff just gets lowered by proximity
its probably almost better not having a fixed rule and just quietly letting through productive things and putting awful shills on blast, otherwise someones entire day would be filled with deciding what is and isnt legitimately worth promotion and all the glorious drama that would come with it. that's why i say a little self promotion thing, but maybe only for channel boosters. that'd at least filter out people who join, spam and leave.
anyway, not my monkeys not my zoo, good luck with your stuff though @bleak wolf
There's folks that don't talk at all and just use servers as an ad zone
Admittedly, I'm not a fan of that approach, since sometimes these folks do put out cool stuff but I don't know who the beep they are and why they're doing this
It's kinda cold, I would love if people would put in a wee bit more effort than a simple link unless that's what people are requesting.
But to one's own.
yeah like i do like seeing showcases and love when people put up videos and tips and tricks.
they really used a ai rendered music video lmao
https://www.youtube.com/watch?v=P10HOvTNPbk
i really like the non professional AI art video stuff. it reminds me of early 90s janky greenscreen vids in a good way
in time it might actually let older musicians remaster or reenvisage music vids from pre 3d stuff
also really cheap to produce, only 1 dancer in front of a greenscreen and ofc ai lmao
saw that and was about to post! lol hivemind
just thumbed it up
some pretty good covers ended up getting done of his song. but yeah this is the stuff i get a nostalgic feel for with AI vids at the minute. much more charm than polished stuff
when i could time travel to that time, all id do is to wait and buy btc
as much as i could
hinsight is 2020
@fervent thunder you're into music and stuff. you should make your own AI The Uncharted Zone 😄
but i dont look nearly as cool as the science teacher
no but you cool be cool as the space happy phil thomas cat
promt: science teacher
negative: not science teacher
i forgot! i have the poolsuite model. its 90s-tastic
i think gormley was made in 2000s but definitely has the same 90s qualities
https://huggingface.co/prompthero/poolsuite-diffusion trash as hell but totally confident and chic expressions
any tips for merging models?
i wonder if outpainting will progress to the point where we can turn movies and television recorded in 4:3 into 16:9 (other than cropping, obviously)
it's really just trial and error until you get the results you want
We released a new model and feedback is welcome -> https://www.fyf.com/
regarding training and batch size... I'm wondering suddenly, how does the batch is treated in the backprop ? is it the mean of the batch or is some linear transformation of the batch is done before handling it to the backprop ?
because if it's the mean, then a batch size of 1 is almost always better
with embeddings, you can mix them with other models to get different results, right?
how can I create my own ckpt model? I don't care if its going to take hundreds of hours
hey guys, I create an add-on for Blender for text/image to 3D. Would love to hear your feedback! https://github.com/Firework-Games-AI-Division/dmt-meshes
where is the link for the SD 2.0 model?
searches seem to just come up with the full install and whatnot, I just want the ckpt file
@keen citrus you made the anything models right?
Alright youtube its Shout out time
albeit, for an extension for a1111's webui, I found an easy to diagnose bug! steps to recreate. select the "create inspiration images" script provided by the inspiration tab extension. Load a txt file that has a double quotation mark in it, ". The script then tries to escape all the slashes in destination folder's path leading to c:\\stable-diffusion-webui\\extensions\\scripts\\ etc.. etc..
Fails on file save. damn it's exciting to discover a bug
ooh i have that installed
i solved it by just deleting all the " in the txt file i was using
apparantly there's another bug where if a file has a slash in it, it messes wiht everything too
just putting a # in the prompt box shits it up a bit when it comes to save the filename. it changes all other characters you'd sanitize to an underscore. nothing nasty about it, just a thing
i do kind of worry about possible injection of stuff in png info, but seems fine
yeah i feel a little woozy running my instance natively. i might move it to a docker
hi
as far as i can tell, nobody has generated a downloadable image set for 2.1 yet. I think that would go a long way to shutting the debate about artists being removed from 2x up. There are a ton of names to prompt that get radically different styles out of the model. watching it generate has been fun times
are there any @stone charmds av?
mods I mean stupid discord
well if there is can a mod msg @frosty turret , he's having some issues that need to be helped with please. i also submitted a ticket
If mean the training dataset, they used LAION Aesthetics. If you have many terabytes of space, then you can download it yourself
Or you can dl smaller subsections
I mean the "Create inspiration images" script i'm running is catching a ton of names that have very unique and fantastic styles inside the model
Ah, I see
it goes through a list of names, or any text file, and plants them into a prompt. Uses those images to drive an inspiration browser
https://twitter.com/DrJimFan/status/1613918800444899328 badasss thread. if i'm understanding this right, you can encode logic and code into models
step towards self improving models?
An tips on producing a group of people posing for a photo who aren't all clones of each other?
generate what the model knows and build on that image. 768x or 512x generally
My god.. why do I get shit on by artists when I show an image generated? Like they foam at the mouth and have a full on tantrum
vocal minority
they represent goon squad more than they rep the craft
Like. I play a web based game. I run a business of sorts. I make profile pictures for people in game for in game currency. (Costs nothing irl).
I generate images based on what they want roughly. Then I edit slightly, change minor details using Photoshop and stuff. I really like it. But I got shit on a lot for not making the art myself
Like I add personal touches. Stuff to make it incorporate there faction, there username, and other stuff they want.
Then part of the issue I don't understand is there are lots of other players that do similar but instead of AI generating the image they just use Google and find an image they like and then edit it.
Good hobby good craft. awesome energy
Photobashing, yeah. Same thing really. These are just new tools for it
Yeah. And like. It's just a fun side hobby to earn me in game money. And it lets me make cool images for people
But everyone keeps getting so mad about it.
Yeah. It's probably 1 or 2 people running 5 accounts each, acting like a bigger group than they really are
and then 10miillion other people who arent
the silent majority are the real everyone
Just sucks
toxicity always does
I quite like being able to make cool characters and stuff. But then anytime I'm proud of what's made and try to share it with people I generally get shit on for it.
Just hope eventually people grow up eventually and stop throwing a tantrum over ai art
Yeah i wouldn't fight any of those manufactured battles. They're just inventing drama for theater purposes. don't participate
Any news on newly trained models for 2.1? Everything still seems to be made on 1.5
any usable prompt that i can use to allow SD to ONLY COLOR my black and white lineart?
1.5 just has tons of momentum and a lot of people have been deceived into not making the upgrade. the tools for 2.x also haven't matured as well either.
Most of what comes out for 1x is just merges. I'm more interested iin refined 1x models that are more unique than the 1000 very similar merges that are coming out now a days
No but you can use the lines as a mask to tell SD where not to work
😅 i try that suggestion
good luck
Can i have your guidance, regarding starting with machine learning and deep learning for GAN ?
I don’t have any experience with ML. But i am a React Native developer so i at least know how things will work, somewhat.
I want to learn more about this and help community build better AI and also implement some IMG-2-3D AI’s
So that building a 3D based on a top view or mix of other views like side/top/isometric view and create a 3D for it.
Please help if possible.
🙏🙏🙏🙏🙏🙏🙏🙏
🥲 AI just ignored my mask and create better and prettier lineart than mine..
SD like "ur drawing is ugly this is how its done" 🥹
have you a good clip description of the piece?
if so, you could add "wet on wet watercolor" to the prompt. as a style/medium wet on wet doesn't really use lines at all and might give you the colouring in effect you want
another thing i guess you could try, assuming its black ink on paper, put white in as a strong negative prompt
oof. butttt.. sometimes it "blurs" the mask by default. i haven't figured it out lately, but when i was first beginning, i was sure i just loaded an alpha mask layer and it took it exactly how it was instead of bluring any of the edges
i'll look into it an @ you if i ever come across something specific
well..mostly it is my fault...i dunno what these settings do and i just moving the sliders left and right without knowing what it does 😅
Hey, does anyone know if Point E can be fine tuned?
Or, are there any 3d model/stl file generators out there, that can be fine tuned?
What are you guys' thoughts on this: https://stablediffusionlitigation.com
"Having copied the five billion images" if they managed to fit 5billion images into 5gb, they deserve a medal not a lawsuit.
I'm happy they finally went and did it
this needs to be settled
in regards to their case, after reading it, best of luck
Bloody hell
why do people doesn't like 2.1 that much?
i personally don't like it for compatibility reasons since large amount of community is creating their embeddings and safetensors on 1.5 not 2.1 and i hate to deal with all that conversions, giving it a year or so to become standard.
I went to 2.0, then 2.1, awaiting 2.2 to never look back, BUT I am in the minority according to the stats.
yeah i did the same, went to 2.1 and when i heard cricket sounds there, rolled back to 1.5 👀
Not me. I mean everyone can stay with 1.5 and I will continue to release my embeddings (etc...) for 2.x.
"a collage tool"? LOL
tbh, I seriously know AI is going to grow in all fields (heck, there is now an AI robot lawyer who was just assigned its first case so no one is safe), but I think the luddites might win this one. Even if they don't they will never cease, and my faith in SAI is waning.
SAI?
StablitiyAI? why
Stability AI
Just not liking the movements I see right now from them (no I don't mean 1.5 vs 2.x either). Not going to get into it but it is a gut feeling I have about them. I see things behind the scenes I just don't care for. They may do whatever they wish I am just saying even if they were victorious over the luddites I need to see some serious shit from them than I have. Anyway, we shall see where all this goes, but for me I lived before all this and will live if it fails as well.
i see, respect your opinion
do not know myself honestly, just having a tool that allowed me to visualize my childhood imagination is fun and dont want to see it go because some coping artists will not be so "special" anymore
We agree as I can't even draw a stick figure but so many visions in my head. This tool was going in the right direction, but I am finding limitations that in time would be removed. The thing is I refuse anything that I can't do locally. If for pay, or only on the cloud then I excuse myself from it.
SAI is going to remove the local/open source aspect?
I think best thing any of us can do in this situation is just keep creating and not get dragged into this shouting match. I almost did a big heresy and expressed my opinion about all this drama, then i thought that someone with a knack for arguement would answer with something bitter, then i would get bitter and answer the same way and so on and it would just take away from the time i have for creating cool stuff.
I didn't say that I just see them chasing stuff, like Lexica now, as they see the money MJ makes. I think they may let it sort of wither on the vine, but never just remove it all.
Oh, yeah, I saw a couple of replies I wanted to jump into but refrained.
Internet and special olympics come to mind with that group.
Yeah, unless someone is literally threatening to take away my PC where i have automatic1111 installed, i really see no point in arguing or "defending" something. Some people will hate it, some will love it, that's basically same with everything else so why would this be any different.
This whole ai situation makes me crazy
In the end the courts will decide and one side is going to get plastered, butt hurt, and wake up to how the world works. No need to add to the noise.
I am so done with arguing too, because all the people i encounter are emotionally driven air raid sirens.
Yes, precisely.
don't fall into it rlly, this situation is very closely like to politics, once u get into it, your mood swings with each news and basically they got their claws in you which leaves u paralyzed and takes oh so much time. we have amazing tool we can use for free, there's huge artist community who creates something amazing everyday so all u do by following all this "non-technical" part of the ai debate is lose time u could've spent to just have fun with it.
You can't win a logical discussion when one side is pure emotion.
I use ai and draw.
I mostly use ai for reference or just for creativity, (or just for fun)
I don't know why can't artist just accept this tool.
It's very useful
is there even point in winning? i asked myself same question when i was arguing with my friend about religion and my conclusion was that they couldn't accept my atheism and i couldn't accept their faith without loosing who we are and why we were friends in the first place so we just kinda vettoed religion in our discussions.
Mad respect
i also come from drawing background, had to do it the hard way for last 10 years, now i do it the easy way. no sweat and i'm loving it.
Winning is that they can concede that they are thinking about this the wrong way. If you even attempt to shake their beliefs (akin to religion and politics) they get emotional and irl violent. You can't change them so let them eventually wither.
best way to fight any kind of "oldschool" ideas is just to leave them be and let them wither indeed
You can't, no matter how many there are, stop progress. You may slow it down but it will eventually roll over all.
Im glad Stable diffusion exists.
So much creativity
I already started working on my game, as a coder with somewhat okay art skills, making card game solo was just out of my reach but now i already have all the artwork done and ready. Sometimes i still think i'm in a very fancy dream and i'll wake up from it.
See, a digital artist could load up SD and train on their style. Pop in some text, or do img2img using their style and save them countless man hours. Go in and correct what you like which would take far less time than the ground up. NOPE, they can't see the forest for the trees.
Sounds like a very good friend of mine, we sometimes have discussions on topics we used to strongly disagree. And for a long time, these topics became more or less taboo. But as time passed, we found ways to discuss about it and still disagreeing but with so much respect that we often accept that there may be other ways to think and view these topics
Really glad to have someone like that around me
I used stablediffusion since the beginning and i JUST can't see myself leaving it behind
No matter what i will use this technology
same, if you have people who think like you around you it just gets... boring.
same. Except I'm not using it to create game, but to illustrate my novel (and I'm thinking about making a comic, maybe)
That's nice!
yep. And tbh, that friend is really the best ally I can get to brainstorm ideas about my novel
I just create characters with stable diffusion then give them lore and descriptions
A backstory sort of
He won't hesitate to tell me "that's stupid" when he think it is, because he knows I can handle that kind of comment. And he knows it will lead to a much better version
there's no possible way it will go away, thinking from a technical point, there's no possibility to ban ai art everywhere because there's no way to differ ai art from standard art that is upscaled via gigapixel ai for example. even if by some magic that happens and good recognition tool will be created though, people will create whole communities around ai art like civitai.com
True
it’s just so good
Yes!
The court cases, if the judge has logic, and is aware of the tech, it will be over before it starts. If the judge doesnt know, its going to be that who can teach the judge better
The liars or us
The issue I have with SD right now, regarding that novel, is that creating the images I really have in mind is sometimes a bit difficult, and also.... Well... The best models I've found to draw high quality characters are usually nsfw models. And I'm a little bit annoyed that the increased resolution on nsfw parts "ate" the resolution and creativity on other parts of the image (like, landscape, sci-fi, fantasy elements...)
sci-fi is my favourite and I really have a hard time finding a model on my linking
I usually like space related sci-fi
uhm i made a model called sci-fi diffusion
LOL
the code and models are on everyone computer. You can't remove them.
I am also working on comics, it's kinda long shot project and slowly figuring out ins and outs. one thing that helped me alot is embeddings, embeddings, embeddings!
that is so cool
where can I find it?
same for me, and same with space related sci-fi
It’s just something special for an ai to try and understand concepts that humans really don’t
not sure if it can also help, but have you tried chatgpt (or better: gpt-neox-20b with nlpcloud for example, or sudowrite) ?
looks great! I am going to try it
i constantly show my lore-book to chat gpt and ask to find plotholes, amazing tool indeed.
I've seen it. But it doesn't work well to create high quality characters
yeah the dataset is lacking on that side, you can try protogen infinity for that
but I think it would be best to use different models for different areas of the image. But when it comes to character costumes, it's really hard
oh and futuristic city landscapes look so good
chatgpt is a tool great for anything
i'll give it a try and leave a review aswell, looks cool.
I do some of my academic research using chatgpt
in fact, for characters currently use a mix of uber realistic dreamy porn merge and anything v3. Until I find something better. Recently I've tried dreamshaper 2.52, and it's decent, but not as HQ as the urdpm+anyv3 merge
so far best thing i did with chatgpt was writing a python tool that generates images which then if used with imgtoimg generate amazing results. 😄 it feels like that movie "inception".
I wrote a live chess game with it
and still adding elements to it
live chess?
do you know you can do a lot more ? for example, you can ask him to rewrite full paragraphs or even fill blanks in a paragraph
chatgpt can generate images ?
the thing is it made the server side
sure but thing is i have very cryptic writing style and i tried hard to teach chat gpt that but it kinda goes reeeeeeeeeeeeeeeeee
I'm mostly using chatgpt and other ai writing assistants to kick start my first drafts
then I rewrite it to my taste
no i wrote code with the help of chat gpt for software that generates images that are good for using as base in stable diffusion. 😄
I am not fond of chat bots
do you usually use any other chat bot?
but, for some tasks, I find gpt-neox-20b much better than chatgpt
for example, if you ask chatgpt to suggest ideas for character names in a sci-fi fantasy novel, it will suggest things like : paul, giselle, beatrice, brian...
nah not rlly
whereas gpt-neox-20b will give you pretty wild ideas and would even invent particles of an imaginary language to explain the meaning of names
anyway... I gtg, I come back later probably. My sister's here today.
see ya ppl gtg aswell, was cool chatting with yall
I still wish they'd make a model trained purely on photos to avoid this accusation of copying artists. (I wonder if its prohibitively expensive to train 2, one with and one without?). I hope the arithmetic .. "<=2bytes per image => obviously it doesn't copy" is persuasive, but i'm reminded of the expression "hope is a shitty hedge"
The price tag to train just one is 6 figures.
emad's answer to this in the AMA was (a) it only breaches copyright if you ask it to, and (b) you can finetune material out of it. Perhaps one workaround is that they could train the main model and release a bunch of obviously safe finetunes?
whats the worst case scenario here - what happens if the luddites win the case
would the courts basically sue Stability.AI out of existence
would it be impossible to continue to distribute the model - or would the finetunes be an acceptable defence (and people would jjust judge them on a case by case basis as with any other digital asset)
for reference , what was the timeline of Youtube like? That had literal copies of copyrighted material for a long time, right..
nothing 
another idea - is there a well presented "case for the defence", explaining (a) the technical 2bytes per image defence, (b) the ways in which artists should feel empowered rather than threatened (addressing the concern "unfairly puts artists out of work")
Hey guys
@wise stratus Guessing you guys are financially equipped to handle the bogus law-suits coming your way. In case you guys need donations, just let us know I'm sure the community would be happy to support.
If they truly are bogus then the courts will dismiss for sure.
From what I read initially re the statements that were made legally regarding the new tech; the 'system' thought it wasn't worth pursuing, some technical term for it. Alas you never know.
I suspect the ludites will get a very old, and sympathetic judge.
If this bullshit lawsuit delays upcoming model releases, then I'm going to be very angry
Get ready to turn green.
Thinking maybe community support of education and outreach efforts may be a nice thing to coordinate. Help folk create and understand
Agreed. The lack of basic education on the tech is astounding and as usual where most of the misconceptions are coming from. (People fighting windmills). With that said, we need a single source (well) to draw from. Does Stability.ai wanna put together a simple white-paper or guide on communications?
I think part of the PR strategy ought to be messaging > Youtubers and podcasts.
It's probably worth sponsoring a few big podcasts ie) Jordan Harbinger, Joe Rogan? ie) Rogan would likely welcome you on the show to speak about the tech, I don't doubt it. Harbinger would too. They love to clear misconceptions (they are both very curious about the tech but have a very novice understanding)
Your team should reach out.
I heard some Luddites are already attacking chatgpt
The way get to people today is through the tubes.
Get on Rogan. YES!
Proves they want ai technology gone
It will never be gone
Joe Rogan, nobody gets fooled by clickbait more than that guy
i always wonder if its ever gone dawn on him that no the libs havent gone crazy, hes just gullible as hell
like anyone on the right i guess
You're joking right?
your master communicator is a dumb ass yeah
but but hes just asking questions
Rogan has indeed been fooled, but he's also owned up to it and corrected it. He deals with a ton of info and guests.
As for libs going crazy, it's been shown through copious studies (left pole principal) that the left and MSM has drifted further and further left.
Oh boy...

